The JI
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     
 


The Journal of Immunology, 2007, 179, 2419-2427
Copyright © 2007 by The American Association of Immunologists, Inc.

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Data Supplement
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Related articles in The JI
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow Request Permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Retter, I.
Right arrow Articles by Riblet, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Retter, I.
Right arrow Articles by Riblet, R.

Sequence and Characterization of the Ig Heavy Chain Constant and Partial Variable Region of the Mouse Strain 129S11

Ida Retter2,*, Christophe Chevillard2,3,{ddagger}, Maren Scharfe{dagger}, Ansgar Conrad{dagger}, Martin Hafner*, Tschong-Hun Im{dagger}, Monika Ludewig{dagger}, Gabriele Nordsiek{dagger}, Simone Severitt{dagger}, Stephanie Thies{dagger}, America Mauhar{ddagger}, Helmut Blöcker4,{dagger}, Werner Müller4,5,* and Roy Riblet4,{ddagger}

* Department of Experimental Immunology and {dagger} Department of Genome Analysis, Helmholtz Centre for Infection Research, Braunschweig, Germany; and {ddagger} Torrey Pines Institute for Molecular Studies, San Diego, CA 92121


    Abstract
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosures
 References
 
Although the entire mouse genome has been sequenced, there remain challenges concerning the elucidation of particular complex and polymorphic genomic loci. In the murine Igh locus, different haplotypes exist in different inbred mouse strains. For example, the Ighb haplotype sequence of the Mouse Genome Project strain C57BL/6 differs considerably from the Igha haplotype of BALB/c, which has been widely used in the analyses of Ab responses. We have sequenced and annotated the 3' half of the Igha locus of 129S1/SvImJ, covering the CH region and approximately half of the VH region. This sequence comprises 128 VH genes, of which 49 are judged to be functional. The comparison of the Igha sequence with the homologous Ighb region from C57BL/6 revealed two major expansions in the germline repertoire of Igha. In addition, we found smaller haplotype-specific differences like the duplication of five VH genes in the Igha locus. We generated a VH allele table by comparing the individual VH genes of both haplotypes. Surprisingly, the number and position of DH genes in the 129S1 strain differs not only from the sequence of C57BL/6 but also from the map published for BALB/c. Taken together, the contiguous genomic sequence of the 3' part of the Igha locus allows a detailed view of the recent evolution of this highly dynamic locus in the mouse.


    Introduction
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosures
 References
 
The Ig loci of humans and mouse have been heavily investigated during the last three decades. One of the first milestones toward a molecular understanding of the mechanisms generating Ab diversity was the work of Hozumi and Tonegawa (1), who could show that the Ab coding sequence is generated in B cells through a DNA recombination event (1). It took only 7 more years to achieve a detailed description of the phenomena of Ig gene rearrangement and somatic hypermutation (2). Both in these early and many later studies, a main focus has been the analysis of the notably complex H chain locus (Igh locus) of mouse. Three different types of gene segments, called VH (variable), DH (diversity) and JH (joining) genes, occur in the Igh locus in multiple copies. In early B cell development, one VH, DH, and JH gene is selected and recombined to form the variable part of the coding sequence for the H peptide chain (IgH) of the Ab. In a similar manner, the L chain VL region is subsequently formed from one VL and one JL gene in at least one of the L chain loci. To target these V(D)J rearrangements, the recombination machinery is guided by conserved recombination signal sequences (RSS)6 to the sites of recombination. The successful rearrangement of one H and one L chain locus results in the expression of a BCR on the cell surface, determining the further fate of the B cell. In the mature B cell state, the Ab isotype can switch by another DNA recombination event in the constant region of the Igh locus (Igh-C region). This class switch recombination combines the VDJ coding sequence with another Igh-C exon group, resulting in the production of Abs with different effector functions. The mouse Igh-V locus is particularly complex concerning both the size of the V region and the number of VH genes. Thus, special effort is required to elucidate the highly repetitive genomic sequence of this locus.

The murine Igh locus is ~3 million bases (Mb) in size and is located close to the telomere on chromosome 12. The locus is known to be highly polymorphic within the genus Mus (3). Based on detailed Southern blot analyses, the Igh loci of inbred strains were assigned to different haplotypes (4, 5). The strain 129/Sv possesses the Igha haplotype, as does BALB/c, the strain that perhaps has been most widely used in studies of Ab responses in mouse. The Ighb haplotype is present in C57BL/6 and in the context of the mouse genome project (6) the Ighb locus was entirely sequenced (7, 8, 9). In this study, we provide a 1.6-Mb sequence of the 129 substrain 129S1/SvImJ, abbreviated 129S1, spanning the CH {alpha} exons up to the beginning of the large VHJ558 family region. This Igha sequence expands the picture of the murine Igh locus: on the one hand, the comparison of genomic sequence from the two haplotypes allows a detailed view of recent evolutionary changes in copy number and sequence of VH genes and other features, e.g., interspersed repeats. In contrast, the sequence is also of general use and will support the elucidation of the special phenomena occurring in the Igh locus: VDJ recombination, class switch recombination, and somatic hypermutation, all of which are guided, regulated, or at least influenced by different sequence motifs (10, 11, 12).


    Materials and Methods
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosures
 References
 
Identification of Igh bacerial chromosome (BAC)

Igha locus BACs were isolated from the CitbCJ7, abbreviated CT7, library (see http://www.tree.caltech.edu/). This library was prepared from the CJ7 ES cell line (13) derived from the 129S1/SvImJ mouse strain (ftp://ftp.informatics.jax.org/pub/reports/ES_CellLine.rpt). Library superpools and high-density membranes for screening and individual BAC clones were purchased from Research Genetics, now Invitrogen Life Technologies, and are also available from OpenBiosystems. Superpools were screened with a series of sequence-tagged site (STS) described in Ref. 14 . These included sites in the 3' end of the CH region (IgA exon 3), JH region, DH region (DFL16.1), and various VH region sites (VHgroupIII-VH7183, VH11, VHS107, VHGAM3-8, and VHJ606), and a yeast artificial chromosome (YAC) end (ADGC9-left arm) located in the VH10 region. Positive mouse BAC clones were obtained from Research Genetics, plated on agar plates containing 12.5 µg/ml chloramphenicol, and confirmed by colony PCR with the identifying primer sets. BAC ends were sequenced either directly from T7 and SP6 primers or following amplification by Vector-Hexamer PCR (15). BAC end sequences were deposited in EMBL/GenBank/DDBJ accession nos. BH021141–BH021349. Contig assembly proceeded by assessing STS content using the screening sites, D12Mit markers and others, and extensive Southern blot analysis using VH probes and BAC and YAC ends, all as described in Ref. 14 . Gaps were closed by developing new screening PCR assays from BAC end sequences.

Sequencing and assembly

Sequencing of 23 BACs of the CT7 library was performed by a shotgun approach as follows: sheared fragments of either 1 or 3 kb in length (GeneMachines) were subcloned separately into a pTZ18R vector. At least 800 clones were selected from each clone library, most of the plasmid DNA was prepared following a protocol supplied by Millipore. One-third of the selected clones were amplified by TempliPhi (384-well format) basically following the instructions of the supplier (Amersham Pharmacia Biotech). Cycle sequencing was routinely performed using a DYEnamic ET terminator cycle sequencing premix kit (Amersham Pharmacia Biotech) and UPO/RPO primer (MWG-BioTech). Most of the separations were run on Applied Biosystems 377 slab gel sequencers and one-third of the samples on MegaBACE capillary sequencers. Data were assembled and edited using the GAP4 program (16).

Sequence analysis

The VH genes were annotated using a procedure developed for the automatic generation of the database VBASE2 (17). In this procedure, VH genes are detected by a BLAST search (18) of the BAC sequences with known germline VH gene sequences. BLAST hits with a minimum identity of 80% and minimum alignment length of 200 bp are analyzed with the DNAPLOT program (W. Müller and H. H. Althaus, unpublished data; http://www.dnaplot.de) and matched to VDJ rearrangements from the EMBL/GenBank/DDBJ The DNAPLOT analysis is limited to exon 2 of the VH genes, but includes detection of RSS elements. Exon 1 of each VH gene, DH gene, JH gene, and CH gene have been annotated manually by sequence comparisons with BLAST, PipMaker (19) and other alignment programs, referring to previously published annotations; the EMBL/GenBank/DDBJ accession no. of the respective reference sequences are given in the feature table of the EMBL/GenBank/DDBJ entry AJ851868.

Dot plots and percent identity plots (PIPs) have been generated with the PipMaker program, parameters: "search both strands," "show all matches" sensitivity mode: "default," "show all matches" for dot plots, and "chaining" for PIPs. Interspersed repeats have been detected by using the Repeatmasker web server (A. F. A. Smit, R. Hubley, and P. Green, unpublished data; http://www.repeatmasker.org). The searches were performed against the Repbase mouse dataset with the "cross_match" search engine and slow sensitivity mode. Phylogenetic trees have been generated with IMGT alignments created with DNAPLOT (20) and visualized with MEGA version 3.1 (21). Further sequence analysis and formatting have been done with the emboss program package (22) and Perl scripts.


    Results
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosures
 References
 
The sequence of the 3' half of the Igha locus

To elucidate the genomic sequence of the murine Igha haplotype, 23 BACs from the Citb, or CT7, library of the 129S1 mouse were selected using a physical map of the locus assembled with YAC and BAC end STS, D12Mit simple sequence length polymorphism sites, and Igh-C, J, D, and V gene segments (Refs. 14 and 15 ; C. Chevillard and R. Riblet, unpublished observations). The BACs were sequenced with a 10-fold coverage on average. The BAC inserts represent an overall length of 2.5 Mb and were assembled to a sequence of 1.6 Mb. This sequence covers approximately one-half of the Igh locus, including the CH region, JH region, DH region, and the JH-proximal part of the VH region (Fig. 1). The assembly is contiguous between the BACs 407I12 and 34H6, except for two simple repeat stretches of unknown length, one in 34H6 and one in the overlap of 459E6 and 436C3. The unfinished sequence of BAC 4K8 overlaps 407I12 and extends the assembly toward the JH distal part of the VH region with 12 single fragments. The order of these fragments is hypothetical, based on sequence comparison with the homologous Ighb region in C57BL/6. The complete assembled sequence and annotation are available from EMBL/GenBank/DDBJ (accession no. AJ851868).


Figure 1
View larger version (49K):
[in this window]
[in a new window]

 
FIGURE 1. Physical map of the 129S1 Igh locus, 3' part. The locus is shown in 5'->3' orientation displayed in segments of 200 kb each. Positions of genes are depicted as vertical lines. Short vertical lines indicate pseudogenes and "possibly functional" genes. VH genes are colored corresponding to their sequence family. Positions of the underlying BACs are indicated as gray horizontal lines below the scaling line. The unfinished sequence of the most 5' BAC, 4K8, is marked with a gray box; preliminary gene nomenclature in this 5' region is indicated by #. The sequence and annotation identifying each gene segment and its sequence coordinates are available from EMBL/GenBank/DDBJ (accession no. AJ851868). VH gene sequences may also be downloaded from VBASE2 (http://www.vbase2.org).

 
Annotation of VH, DH, JH, and CH genes in the assembled sequence

The Igha sequence assembly was screened for VH genes using a BLAST search and a subsequent V gene analysis method based on the program DNAPLOT (17). This method annotates the V gene beginning from the coding sequence of the mature peptide chain up to the noncoding RSS sites at the 3' end of each V gene. The results of this procedure were complemented by the annotation of exon 1. VH genes with obvious defects like stop codons, defective splice sites, and abnormal RSS elements were marked as pseudogenes. To further address the question of V gene functionality in a defined in silico approach, VH genes were matched against 6190 Ig VDJ rearrangements extracted from EMBL/GenBank/DDBJ. Those VH genes with a 100% match to a rearranged sequence are classified as functional. A physical map of the annotated region is shown in Fig. 1.

In the 3' half of the Igha V region, 128 VH genes were annotated, of which 49 have been found in rearrangements. Of 79 VH genes without clear evidence of functionality, 63 are obvious pseudogenes. The remaining 16 VH genes are designated as "possibly functional." To further analyze the potential functionality of the VH genes, we performed a pairwise sequence comparison of all 79 possibly functional and pseudogenes with the 49 functional VH genes. The result (summarized in Table I) shows that the majority of pseudogenes have major changes in their sequence, whereas some of them are very similar to functional genes. Although 12 possibly functional VH genes have at least 90% sequence identity to a functional gene, this is also the case for 6 pseudogenes. This indicates that a high sequence similarity to a functional gene cannot be taken as evidence for functionality, and the functionality of the 16 possibly functional genes cannot be proven based on the available data. However, because somatic mutations in the VDJ rearrangements prevent a 100% match to a germline sequence, it is likely that at least some of the possibly functional sequences are indeed functional.


View this table:
[in this window]
[in a new window]

 
Table I. Sequence comparison of VH pseudogenes and "possibly functional" VH genes vs functional VH genes

 
One functional and four pseudogenes have an exact duplicated counterpart. The VH gene sequence family assignment revealed that the currently elucidated part of the locus comprises members from 12 of 15 known VH gene families, lacking representation from only the most 5' VH3609P, VH10, and VHJ558 families (Table II), with the exception of the pseudogene VHJ558.a1psi.119. Seven VH pseudogenes could not be assigned to a certain VH gene family and were designated as VH relics, of which four are located in the VH7183 region and are closely related. Functional RSS elements were found at the 3' end of 72 VH genes. All RSS elements from VH genes of the families VHGAM-8 and VHSM7, and 2 RSS elements of the VH36-60 family exhibit a 22-bp spacer. Twenty VH pseudogenes show abnormal RSS elements. A table listing all VH genes with their names, properties, and positions within the assembly is available as supplemental data. 7


View this table:
[in this window]
[in a new window]

 
Table II. Summary of the functional VH genes and VH pseudogenes in the JH-proximal part of the mouse Igha locus grouped by familya

 
The VH genes have been named according to previously published nomenclature rules (4, 9). Each name consists of three parts separated by dots: the first part gives the VH gene family. The second part gives one letter for the Igh haplotype, in this case "a," and the number of the VH gene within this particular family followed by the psi tag for pseudogenes. The third part gives a number for the position of the VH genes within the locus, where both the absolute and the family numbering run from the JH proximal to the distal side of the locus. Names for VH genes within the 12 fragments of BAC 4K8 are given provisionally and therefore are marked with # at the end. Because other nomenclature schemata for VH genes have been published (8, 23) and numerous trivial names are in use, we collected all names to our knowledge for each VH gene and make them available in the database VBASE2 (17).

The JH proximal VH region of the Igha and Ighb haplotype: sequence comparison and VH gene allele assignment

The distinct polymorphisms in the murine Ig receptor loci are due to the strong evolutionary dynamics of these loci. The divergent evolution of VH genes has been described as resulting from diversifying selection and evolution by birth and death process (24). The inbred strains provide something like a snapshot of these loci, so that haplotype and even allele assignment is possible. We compared the Igha and Ighb haplotype by a dot plot of the homologous VH region sequences from 129S1 and C57BL/6 (Fig. 2A). This plot indicates that the overall structure, namely, the position of the discrete VH gene family clusters, is very similar in both haplotypes, a finding that confirms previous experimental results based on Southern blot analyses (25). Concerning individual VH gene family clusters, the dot plot shows two regions of expansion in the Igha locus: the VH7183/VHQ52 region (previously described in Ref. 26) and the interspersed VHGAM3-8/VH12 families. The VH7183/VHQ52 cluster is well conserved between both strains at the 3' end, followed by a sequence region which is much larger in Igha than in Ighb. This local expansion results in approximately twice as many functional VH 7183/VH Q52 genes in the Igha repertoire compared with Ighb (Table II). Similarly, the VHGAM3-8/VH12 region is larger and comprises more VH genes in Igha compared with Ighb. A detailed view of the differences between Igha and Ighb is provided by a percent identity plot of the homologous regions (supplemental data). 7 The plot displays annotations and interspersed repeats in the 129S1 sequence along with the percent identity detected in the C57BL/6 sequence.


Figure 2
View larger version (54K):
[in this window]
[in a new window]

 
FIGURE 2. Dot plots of the JH proximal VH and DH region. Positions of VH and DH genes from 129S1 are indicated as vertical lines, pseudogenes are marked as half-size vertical lines. A, 129S1 VH region, masked sequence vs C57BL/6 VH region, unmasked sequence. Two family regions underlying VH expansion in 129S1 are indicated as boxes. B, DH region of 129S1, masked (y-axis) vs unmasked (x-axis) sequence. Boxes indicate two homology blocks.

 
To resolve the V region comparison to the level of single VH genes, we performed a multiple sequence alignment of the combined VH gene repertoire of both strains. Whenever an Igha VH gene had its best match within the Ighb repertoire and inversely the best match of this Ighb VH gene was the referring Igha V gene, we designated this VH gene pair as alleles (Table III). In case of the duplicated Igha V genes, one Ighb allele refers to the pair of Igha alleles. For all alleles a minimum sequence identity of 90% was postulated. With this method, 53 pairs of alleles were assigned. The genes of the VHJ606 family were excluded from the allele assignment, as both the order of these genes is hypothetical and it is unlikely that all VHJ606 genes are included in the current assembly. The allele assignment is in accordance with the dot plot of the genomic Igha and Ighb sequences (Fig. 2A); alleles occur in regions having high overall sequence similarity, whereas regions of expansion have much less similarity and contain no alleles.


View this table:
[in this window]
[in a new window]

 
Table III. VH gene alleles in 129S1 and C57BL/6

 
The Igh-D region of 129S1 consists of two separate homology blocks

In the Igh-D region of 129S1, 15 DH genes are annotated and assigned to one of the sequence families DSP2, DFL16, DST4, or DQ52. In addition, three copies of the truncated DH gene DMB1 (27), DMB1–DMB3, were found. The DH gene DSP2.2 is duplicated and is designated as DSP2.2a and DSP2.2b. Two new DH genes are seen, DFL16.3 and DST4.3, which are probably not functional (see below). The dot plot of the DH region (Fig. 2B) shows that the DH genes are arranged in two separate homology blocks: The main block, including all DSP2 genes, starts before DFL16.1 and ends after DST4. Upstream of this major DH cluster there is an additional small block consisting of the genes DST4.2, DMB1, and DFL16.3. The DH gene DQ52 constitutes the 3' end of the DH region and is separate from the main DH gene block. Both the main and the small blocks are composed of multiple related repeats of a 3-kb sequence containing one DH gene.

The sequences of the DH genes DFL16.3, DST4.2, and DST4.3 (Fig. 3) were checked in silico for potential functionality. A BLAST search of the DH gene sequences was performed against a database comprised of the junctional regions of 6190 Ig VDJ rearrangements extracted from EMBL/GenBank/DDBJ. For DFL16.3 and DST4.3, no specifically matching rearrangements were found, so that both DH genes are classified as nonfunctional. For DST4.2, six rearrangements were found that shared the DST4.2-specific insertion of a cytidine nucleotide in the second position. Interestingly, three rearrangements stem from a mouse line where the main block of DH genes has been deleted (27). Thus, it seems that the lack of DH genes from the main block enforces the usage of the otherwise seldom or nonused DH genes of the small block.


Figure 3
View larger version (18K):
[in this window]
[in a new window]

 
FIGURE 3. Alignment of the DH gene families DST4 and DFL16. RSS elements are marked as gray boxes.

 
Comparison of the DH genes of 129S1, BALB/c, and C57BL/6

Although the Igh-V regions of different inbred strains of mice allow a clear assignment to a relatively small number of haplotypes, the Igh-D regions show even more polymorphism (28). Although DH gene usage and the role of the DH region within the process of VDJ rearrangement have been intensively studied (29, 30, 31, 32, 33), a complete nucleotide sequence of the DH region of BALB/c has never been published. The existing map is based on Southern blot hybridization experiments (34, 35). The Mouse Genome Sequencing Consortium provided a complete sequence of the DH region of C57BL/6, which has recently been annotated (7, 36). Fig. 4 compares the DH regions of 129S1, BALB/c, and C57BL/6. The three maps have a common feature, namely, the main DH gene block is separate from the downstream DH gene DQ52. The small DH gene block is present in 129S1 and C57BL/6, but it is unknown whether it exists in BALB/c. Compared with 129S1 and BALB/c, the main DH gene block is smaller in C57BL/6 and contains fewer genes. The main blocks of 129S1 and BALB/c contain different DSP2 genes. Differences in the size of the main DH blocks may be due to the fact that the BALB/c map is of limited resolution. However, the available data show notable differences in the DH region of both Igha haplotype strains 129S1 and BALB/c.


Figure 4
View larger version (8K):
[in this window]
[in a new window]

 
FIGURE 4. Map of DH genes in 129S1, BALB/c, and C57BL/6. The single maps are scaled equally. The DH gene DQ52 is used as a bench-mark for positioning. A, 129S1; B, BALB/c (adapted from Ref. 35 ); and C, C57BL/6 (adapted from Ref. 36 ).

 
The JH and CH regions of 129S1

Close to the most downstream DH gene DQ52, four JH genes are located within a short stretch of 1.5 kb. The adjacent CH region contains eight genes coding for eight different isotypes of the H chain of the Ab. Each CH gene consists of a group of three to five exons coding for different domains in the Ab molecule. B cells can switch the isotype of the produced Ab by class switch recombination. This is mediated by repetitive sequences upstream of the particular isotype exon group, the switch (S) regions. JH genes, CH region exons, and S regions were annotated referring to previously published annotation of these sequences from BALB/c mice (for references, see EMBL/GenBank/DDBJ AJ851868). In addition to the previously known CH{gamma} pseudogenes (37), one new CH{gamma} pseudogene group was found, consisting of CH2, CH3, and a truncated M1 exon. The new pseudogene group was designated as CHpsi{gamma}0 and is in an inverted orientation located upstream of the CH{gamma}3 group, thereby heading the CH{gamma} cluster.

To depict the genomic structure of the CH region, comprising internal sequence homologies as well as location and size of repetitive sequences, we performed a dot plot of the genomic sequence (Fig. 5). In the dot plot, S regions appear as black boxes adjacent to the I exon. The S region of CH{gamma}1 is noticeably enlarged. Four sequence blocks comprising the CH{gamma} exon groups show strong conservation, starting upstream of the I exons and extending beyond the membrane exons of each group. Parts of this conserved block are inversely inserted upstream of CH{gamma}3, CH{gamma}2b, and CH{gamma}2a. These findings about the overall CH region structure are in accordance with data published on the CH region of BALB/c (37, 38). A refined view of the relationship between the JH and CH loci of both strains is gained by comparison of the coding sequences. The sequences of the four JH genes of 129S1 and BALB/c are identical, except for a silent point mutation in the third valine codon of JH1 (C vs T). Remarkably, the CH region coding sequences from 129S1 and BALB/c show several amino acid exchanges concerning the isotypes IgD, IgG1, IgG2b, IgE, and IgA (Table IV). However, the CDS of IgM, IgG3, and IgG2a are identical.


Figure 5
View larger version (39K):
[in this window]
[in a new window]

 
FIGURE 5. Dot plot of the 129S1 CH region. Masked (y-axis) vs unmasked (x-axis) sequence. To display S regions, simple repeats were not removed from the masked sequence. Positions of CH genes and CH pseudogenes are indicated as vertical lines and half-size lines, respectively.

 

View this table:
[in this window]
[in a new window]

 
Table IV. Differences in CDS of CH region genes between 129S1 and BALB/c

 

    Discussion
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosures
 References
 
Nomenclature of the mouse Igh-V genes

For a sensible gene nomenclature, the gene name should provide a reasonable amount of information about the gene. In particular for the Igh locus of mouse, numerous nomenclature suggestions have been made, each representing different aspects of the genes. The IMGT (http://imgt.cines.fr/) established a systematic nomenclature by assigning a number to each V gene family and numbering V genes within each family (39). This nomenclature regards genes independent of their chromosomal position and haplotype, but it accounts for V gene alleles. de Bono (8) applied another rule set to the Ighb-V genes, introducing position-dependent numbering. The nomenclature rules that Johnston et al. (9) and we applied to the genomic sequence of the Ighb and Igha loci, respectively, include both haplotype and positional information, thereby increasing the information content of the name. Concerning the family nomenclature, we used the well-established family names that trace back to the original myelomas and other cell lines from which the first family members were derived. It is obvious that there is no benefit in maintaining different nomenclatures in parallel. However, until the Mouse Genomic Nomenclature Committee establishes a definite standard, we have to accept diverse opinions and the resulting parallel nomenclatures. To generate transparency, we have made available a list of corresponding names for each V gene in the germline V gene database VBASE2 (http://www.vbase2.org).

Heterogeneity within the Igha haplotype

Serological studies using polyclonal antiallotype sera classified 129/Sv and BALB/c together as Igha (40), and later studies with mAbs to BALB/c (i.e., IgM, IgD, and IgG1) bound 129/Sv Ig as well (41, 42). RFLP analyses of the VH region showed a strong correspondence between the CH region and VH region haplotypes (4, 43), meaning that strains with identical CH region patterns often also exhibit the same VH region patterns. For 129/Sv, comprehensive Southern blot analyses revealed restriction patterns identical to the VH region of BALB/c, with the exception of the pattern for the VH3909P family at the distal part of the locus (5).

Based on this, it was unexpected to find the remarkable differences between the CH coding sequences of 129S1 and BALB/c that are listed in Table IV. The DH region sequence of 129S1 shows a mixed haplotype, represented by a DST4-Igha allele and a DQ52-Ighb allele. A detailed comparison with the DH region of BALB/c is limited by the lack of BALB/c genomic sequence. However, based on the available sequence information, there are obvious differences between the DH regions of BALB/c and 129S1. Taken together, our results point to a distinct heterogeneity within the Igha haplotype that had not been detected in previous experiments and which might affect, to a minor degree, also the VH region of 129S1.

Interspersed repeats in the murine Igh locus

An unusually high content of interspersed repeats has been repeatedly noted for the murine Igh and Igk loci (14, 15, 44). In the elucidated VH region of 129S1, Repeatmasker analysis shows that interspersed repeats occupy 54% of the sequence: the content of LINE-1 (L1) elements is 36%, whereas the SINE content is <2%. This considerably deviates from the average for the mouse genome, where 19% L1 and 8% SINE content have been reported (6). The distribution of repetitive elements in the VH region of 129S1 is very similar to the distribution that was reported for the VH region of C57BL/6 (9). This unusually high density of LINE elements has several interesting parallels; a high density has been noted around other monoallelically expressed genes (45), and the X chromosome has a high LINE density. Lyon (46) has proposed that this L1 density is a factor in the heterochromatization of the inactive X. To visualize interspersed repeats in relation to the V, D, J, and C gene positions, we performed a percent identity plot (PIP) of the Igha vs Ighb sequence with the PipMaker program (supplemental data). 7 The PIP shows not only differences between the haplotypes, but also the position and class of interspersed repeats in the 129S1 sequence and can therefore be taken as a high-resolution map of the region. It graphically displays the relatively uniform distribution of L1 elements throughout the VH region and rarity in the CH region; also apparent are insertions of L1 elements in 129S1 that are absent in C57BL/6, consistent with the continuing evolution of this complex locus.

Expansion of functional and nonfunctional VH genes by block duplications

Duplications of both large and small sequence blocks are a common phenomenon in the Ig loci (9, 47, 48, 49) and are assumed to be an essential force in the generation of multiple gene copies in these loci. Ota and Nei (24) explain the maintenance of diversity within the large VH gene family by selective forces favoring diversification of the duplicated genes. To explain the high number of pseudogenes within the Ig loci, Kawasaki et al. (48) suggested the coamplification and fixation of pseudogenes along with adjacent functional V genes. We tested this hypothesis on the VH7183/VHQ52 region of the murine Igh-V locus, which is expanded in the 129S1 strain compared with C57BL/6 (Fig. 2A). We generated a phylogenetic tree of the VH7183 Igha family (Fig. 6A) and noticed that there is a clear separation of nonfunctional and functional VH7183 sequences, indicating that the nonfunctional sequences have evolved independently from the functional sequences and there are, with the exception of VH7183.a1psi.1 and VH7183.a43psi.70, no pseudogenes related to the functional sequences. Looking at the physical map of the VH7183 region, one can see repeated patterns where a certain functional gene is close to a pseudogene in a conserved distance and order (Fig. 6B). The first, most obvious example of such a block consists of a VH relic sequence next to a functional VHQ52 gene. This block, indicated by a noncolored box in Fig. 6B, appears four times within the VH7183/VHQ52 region. In addition, we find other blocks involving pseudogenes of the VH7183 family next to functional genes of the VHQ52 or the VH7183 family, indicated as colored boxes in Fig. 6B. When we superimpose these blocks on the phylogenetic tree shown in Fig. 6A, we can nicely see clustering of the pseudogenes within the phylogenetic tree. From our analysis, we conclude that functional VH genes are duplicated as large stretches of DNA containing more flanking sequences than necessary to encode for a functional V gene and by this "blockwise" duplication pseudogenes are expanded as well. In Fig. 6A, we further show the relationship to corresponding alleles of the C57BL/6 locus, indicating that the phylogenetic distribution between the functional and nonfunctional VH genes remains valid also in case of the C57BL/6 strain. When we analyzed the order of functional and nonfunctional sequences within the entire Igha region, we could see an underrepresentation of two functional VH genes next to each other (data not shown). In case of a random distribution, we would have expected a higher fraction of neighboring functional VH genes. This finding supports the idea of blockwise expansion of large segments of DNA containing at least one functional VH gene. We cannot rule out that the adjacent pseudogenes comprise functional regulatory elements that are used, for example, for the opening of the locus during B lymphocyte development or later by the functional VH genes and are thereby positively selected. However, the latter possibility seems unlikely since there are examples of pseudogenes located both upstream and downstream of a functional gene within the indicated homology blocks. In addition, the fact that we find only one orientation of VH genes, irrespective if these are functional, nonfunctional, or relics, points to a directed mechanism in the evolution of this locus. Currently, we have no conclusive explanation for the driving forces underlying the evolution of VH genes. Given the complexity of the genealogical trees of mouse inbred strains (50), we cannot state when and how modifications of the VH genes happened. Either they occurred during the generation of inbred strains within the last 100 years or these differences represent allelic variants of wild mice selected over a time span of 100,000 years or more that have been fixed during the generation of inbred lines. As BALB/c and 129S1 mice, although distantly related in the genealogical tree of mouse inbred strains, have similar Igh haplotypes, we favor the latter possibility.


Figure 6
View larger version (24K):
[in this window]
[in a new window]

 
FIGURE 6. Block duplications in the VH7183 region. Additional explanations are given in the text. A, Phylogenetic tree of the VH7183 family of 129S1. The tree shows a functional (right) and a pseudogene (left) branch. Colored large dots correspond to boxes in B. Small red dots indicate the existence of a VH gene allele in C57BL/6, whereas framed dots represent functional, others nonfunctional Igh-Vb genes. B, Patterns of block duplications in the VH7183/VHQ52 region. Boxes of the same color represent identical patterns. The VH7183 pseudogene of each colored box corresponds to a dot-marked pseudogene in A.

 

    Acknowledgments
 
We thank Klaus Rajewsky and Blair Prochnow for critical review of this manuscript.


    Disclosures
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosures
 References
 
The authors have no financial conflict of interest.


    Footnotes
 
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1 This research was supported by the German Bundesministerium für Bildung und Forschung through Grant PT DLR (FKZ 01KW0003) and the Bioinformatics Competence Centre "Intergenomics" Grant 031U110A/031U210A and National Institutes of Health Grant R01 AI23548. Back

2 I.R. and C.C. are co-first authors. Back

3 Current address: Institut National de la Santé et de la Recherche Médicale, Immunology and Genetics of Parasitic Diseases, Marseille, France and Laboratory of Parasitology-Mycology, Faculté de Medecine, Université de la Méditerranée, Marseille, France. Back

4 H.B., W.M., and R.R. are co-last authors. Back

5 Address correspondence and reprint requests to Prof. Werner Muller at his current address: University of Manchester, Bill Ford Chair of Cellular Immunology, Michael Smith Building, Oxford Road, Manchester, U.K. Back

6 Abbreviations used in this paper: RSS, recombination signal sequence; BAC, bacterial artificial chromosome; STS, sequence-tagged site; PIP, percent identity plot; YAC, yeast artificial chromosome. Back

7 The online version of this article contains supplemental material. Back

Received for publication February 12, 2007. Accepted for publication May 29, 2007.


    References
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosures
 References
 

  1. Hozumi, N., S. Tonegawa. 1976. Evidence for somatic rearrangement of immunoglobulin genes coding for variable and constant regions. Proc. Natl. Acad. Sci. USA 73: 3628-3632. [Abstract/Free Full Text]
  2. Tonegawa, S.. 1983. Somatic generation of antibody diversity. Nature 302: 575-581. [Medline]
  3. Tutter, A., R. Riblet. 1989. Evolution of the immunoglobulin heavy chain variable region (Igh-V) locus in the genus Mus. Immunogenetics 30: 315-329. [Medline]
  4. Brodeur, P. H., R. Riblet. 1984. The immunoglobulin heavy chain variable region (Igh-V) locus in the mouse: I. One hundred Igh-V genes comprise seven families of homologous genes. Eur. J. Immunol. 14: 922-930. [Medline]
  5. Tutter, A., R. Riblet. 1988. Duplications and deletions of VH genes in inbred strains of mice. Immunogenetics 28: 125-135. [Medline]
  6. Waterston, R. H., K. Lindblad-Toh, E. Birney, J. Rogers, J. F. Abril, P. Agarwal, R. Agarwala, R. Ainscough, M. Alexandersson, P. An, et al 2002. Initial sequencing and comparative analysis of the mouse genome. Nature 420: 520-562. [Medline]
  7. Riblet, R.. 2004. Immunoglobulin heavy chain genes of mouse. T. Honjo, and F. W. Alt, and M. Neuberger, eds. Molecular Biology of B Cells 19-26. Elsevier Academic, London.
  8. de Bono, B., M. Madera, C. Chothia. 2004. VH gene segments in the mouse and human genomes. J. Mol. Biol. 342: 131-143. [Medline]
  9. Johnston, C. M., A. L. Wood, D. J. Bolland, A. E. Corcoran. 2006. Complete sequence assembly and characterization of the C57BL/6 mouse Ig heavy chain V region. J. Immunol. 176: 4221-4234. [Abstract/Free Full Text]
  10. Cobb, R. M., K. J. Oestreich, O. A. Osipovich, E. M. Oltz. 2006. Accessibility control of V(D)J recombination. Adv. Immunol. 91: 45-109. [Medline]
  11. Yang, S. Y., S. D. Fugmann, D. G. Schatz. 2006. Control of gene conversion and somatic hypermutation by immunoglobulin promoter and enhancer sequences. J. Exp. Med. 203: 2919-2928. [Abstract/Free Full Text]
  12. Zarrin, A. A., M. Tian, J. Wang, T. Borjeson, F. W. Alt. 2005. Influence of switch region length on immunoglobulin class switch recombination. Proc. Natl. Acad. Sci. USA 102: 2466-2470. [Abstract/Free Full Text]
  13. Swiatek, P. J., T. Gridley. 1993. Perinatal lethality and defects in hindbrain development in mice homozygous for a targeted mutation of the zinc finger gene Krox20. Genes Dev. 7: 2071-2084. [Abstract/Free Full Text]
  14. Chevillard, C., J. Ozaki, C. D. Herring, R. Riblet. 2002. A three-megabase yeast artificial chromosome contig spanning the C57BL mouse Igh locus. J. Immunol. 168: 5659-5666. [Abstract/Free Full Text]
  15. Herring, C. D., C. Chevillard, S. L. Johnston, P. J. Wettstein, R. Riblet. 1998. Vector-hexamer PCR isolation of all insert ends from a YAC contig of the mouse Igh locus. Genome Res. 8: 673-681. [Abstract/Free Full Text]
  16. Staden, R.. 1996. The Staden sequence analysis package. Mol. Biotechnol. 5: 233-241. [Medline]
  17. Retter, I., H. H. Althaus, R. Munch, W. Muller. 2005. VBASE2, an integrative V gene database. Nucleic Acids Res. 33: D671-D674. [Abstract/Free Full Text]
  18. Altschul, S. F., T. L. Madden, A. A. Schaffer, J. Zhang, Z. Zhang, W. Miller, D. J. Lipman. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25: 3389-3402. [Abstract/Free Full Text]
  19. Schwartz, S., Z. Zhang, K. A. Frazer, A. Smit, C. Riemer, J. Bouck, R. Gibbs, R. Hardison, W. Miller. 2000. PipMaker: a web server for aligning two genomic DNA sequences. Genome Res. 10: 577-586. [Abstract/Free Full Text]
  20. Lefranc, M. P., V. Giudicelli, C. Ginestoux, J. Bodmer, W. Muller, R. Bontrop, M. Lemaitre, A. Malik, V. Barbie, D. Chaume. 1999. IMGT, the international ImMunoGeneTics database. Nucleic Acids Res. 27: 209-212. [Abstract/Free Full Text]
  21. Kumar, S., K. Tamura, M. Nei. 2004. MEGA3: Integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief Bioinform. 5: 150-163. [Abstract/Free Full Text]
  22. Rice, P., I. Longden, A. Bleasby. 2000. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 16: 276-277. [Medline]
  23. Giudicelli, V., D. Chaume, M. P. Lefranc. 2005. IMGT/GENE-DB: a comprehensive database for human and mouse immunoglobulin and T cell receptor genes. Nucleic Acids Res. 33: D256-D261. [Abstract/Free Full Text]
  24. Ota, T., M. Nei. 1994. Divergent evolution and evolution by the birth-and-death process in the immunoglobulin VH gene family. Mol. Biol. Evol. 11: 469-482. [Abstract]
  25. Mainville, C. A., K. M. Sheehan, L. D. Klaman, C. A. Giorgetti, J. L. Press, P. H. Brodeur. 1996. Deletional mapping of fifteen mouse VH gene families reveals a common organization for three Igh haplotypes. J. Immunol. 156: 1038-1046. [Abstract]
  26. Williams, G. S., A. Martinez, A. Montalbano, A. Tang, A. Mauhar, K. M. Ogwaro, D. Merz, C. Chevillard, R. Riblet, A. J. Feeney. 2001. Unequal VH gene rearrangement frequency within the large VH7183 gene family is not due to recombination signal sequence variation, and mapping of the genes shows a bias of rearrangement based on chromosomal location. J. Immunol. 167: 257-263. [Abstract/Free Full Text]
  27. Koralov, S. B., T. I. Novobrantseva, K. Hochedlinger, R. Jaenisch, K. Rajewsky. 2005. Direct in vivo VH to JH rearrangement violating the 12/23 rule. J. Exp. Med. 201: 341-348. [Abstract/Free Full Text]
  28. Trepicchio, W., Jr, K. J. Barrett. 1985. The Igh-V locus of MRL mice: restriction fragment length polymorphism in eleven strains of mice as determined with VH and D gene probes. J. Immunol. 134: 2734-2739. [Abstract]
  29. Reth, M. G., F. W. Alt. 1984. Novel immunoglobulin heavy chains are produced from DJH gene segment rearrangements in lymphoid cells. Nature 312: 418-423. [Medline]
  30. Alessandrini, A., S. V. Desiderio. 1991. Coordination of immunoglobulin DJH transcription and D-to-JH rearrangement by promoter-enhancer approximation. Mol. Cell. Biol. 11: 2096-2107. [Abstract/Free Full Text]
  31. Chang, Y., C. J. Paige, G. E. Wu. 1992. Enumeration and characterization of DJH structures in mouse fetal liver. EMBO J. 11: 1891-1899. [Medline]
  32. Atkinson, M. J., Y. Chang, J. W. Celler, C. Huang, C. J. Paige, G. E. Wu. 1994. Overusage of mouse DH gene segment, DFL16.1, is strain-dependent and determined by cis-acting elements. Dev. Immunol. 3: 283-295. [Medline]
  33. Nitschke, L., J. Kestler, T. Tallone, S. Pelkonen, J. Pelkonen. 2001. Deletion of the DQ52 element within the Ig heavy chain locus leads to a selective reduction in VDJ recombination and altered D gene usage. J. Immunol. 166: 2540-2552. [Abstract/Free Full Text]
  34. Ichihara, Y., H. Hayashida, S. Miyazawa, Y. Kurosawa. 1989. Only DFL16, DSP2, and DQ52 gene families exist in mouse immunoglobulin heavy chain diversity gene loci, of which DFL16 and DSP2 originate from the same primordial DH gene. Eur. J. Immunol. 19: 1849-1854. [Medline]
  35. Feeney, A. J., R. Riblet. 1993. DST4: a new, and probably the last, functional DH gene in the BALB/c mouse. Immunogenetics 37: 217-221. [Medline]
  36. Ye, J.. 2004. The immunoglobulin IGHD gene locus in C57BL/6 mice. Immunogenetics 56: 399-404. [Medline]
  37. Akahori, Y., Y. Kurosawa. 1997. Nucleotide sequences of all the {gamma} gene loci of murine immunoglobulin heavy chains. Genomics 41: 100-104. [Medline]
  38. Shimizu, A., N. Takahashi, Y. Yaoita, T. Honjo. 1982. Organization of the constant-region gene family of the mouse immunoglobulin heavy chain. Cell 28: 499-506. [Medline]
  39. Martinez-Jean, C., G. Folch, M. P. Lefranc. 2001. Nomenclature and overview of the mouse (Mus musculus and Mus sp.) immunoglobulin {kappa} (IGK) genes. Exp. Clin. Immunogenet. 18: 255-279. [Medline]
  40. Lieberman, R.. 1978. Genetics of IgCH (allotype) locus in the mouse. Springer Semin. Immunopathol. 1: 7-30. [Medline]
  41. Kitamura, D., K. Rajewsky. 1992. Targeted disruption of µ chain membrane exon causes loss of heavy-chain allelic exclusion. Nature 356: 154-156. [Medline]
  42. Jung, S., K. Rajewsky, A. Radbruch. 1993. Shutdown of class switch recombination by deletion of a switch region control element. Science 259: 984-987. [Abstract]
  43. Ben-Neriah, Y., J. B. Cohen, G. Rechavi, R. Zakut, D. Givol. 1981. Polymorphism of germ-line immunoglobulin VH genes correlates with allotype and idiotype markers. Eur. J. Immunol. 11: 1017-1020. [Medline]
  44. Brekke, K. M., W. T. Garrard. 2004. Assembly and analysis of the mouse immunoglobulin {kappa} gene sequence. Immunogenetics 56: 490-505. [Medline]
  45. Mostoslavsky, R., N. Singh, T. Tenzen, M. Goldmit, C. Gabay, S. Elizur, P. Qi, B. E. Reubinoff, A. Chess, H. Cedar, Y. Bergman. 2001. Asynchronous replication and allelic exclusion in the immune system. Nature 414: 221-225. [Medline]
  46. Lyon, M. F.. 2000. LINE-1 elements and X chromosome inactivation: a function for "junk" DNA?. Proc. Natl. Acad. Sci. USA 97: 6248-6249. [Free Full Text]
  47. Matsuda, F., K. Ishii, P. Bourvagnet, K. Kuma, H. Hayashida, T. Miyata, T. Honjo. 1998. The complete nucleotide sequence of the human immunoglobulin heavy chain variable region locus. J. Exp. Med. 188: 2151-2162. [Abstract/Free Full Text]
  48. Kawasaki, K., S. Minoshima, N. Shimizu. 2000. Propagation and maintenance of the 119 human immunoglobulin V{lambda} genes and pseudogenes during evolution. J. Exp. Zool. 288: 120-134. [Medline]
  49. Kawasaki, K., S. Minoshima, E. Nakato, K. Shibuya, A. Shintani, S. Asakawa, T. Sasaki, H. G. Klobeck, G. Combriato, H. G. Zachau, N. Shimizu. 2001. Evolutionary dynamics of the human immunoglobulin {kappa}locus and the germline repertoire of the V{kappa} genes. Eur. J. Immunol. 31: 1017-1028. [Medline]
  50. Beck, J. A., S. Lloyd, M. Hafezparast, M. Lennon-Pierce, J. T. Eppig, M. F. Festing, E. M. Fisher. 2000. Genealogies of mouse inbred strains. Nat. Genet. 24: 23-25. [Medline]

Related articles in The JI:

IN THIS ISSUE

The JI 2007 179: 2033-2034. [Full Text]  



This article has been cited by other articles:


Home page
EndocrinologyHome page
S. M. McLachlan, H. A. Aliesky, P. N. Pichurin, C.-R. Chen, R. W. Williams, and B. Rapoport
Shared and Unique Susceptibility Genes in a Mouse Model of Graves' Disease Determined in BXH and CXB Recombinant Inbred Mice
Endocrinology, April 1, 2008; 149(4): 2001 - 2009.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Data Supplement
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Related articles in The JI
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow Request Permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Retter, I.
Right arrow Articles by Riblet, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Retter, I.
Right arrow Articles by Riblet, R.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS