|
|
||||||||


*
Dipartimento di Genetica, Biologia e Biochimica, Università di Torino, Torino, Italy;
Departments of Medicine and Microbiology and Immunology, and Cancer Center, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642; and
Dipartimento di Scienze Cliniche e Biologiche, Università di Torino, Torino, Italy
| Abstract |
|---|
|
|
|---|
2 kb with almost complete homology between
EP1-A1 and E-A2, flanked by two
minisatellites. These results are consistent with the hypothesis that
segments of complete homology may be required for efficient homologous
recombination in humans. The possible role of minisatellites as
recombination signals is inferred, in agreement with current
knowledge. | Introduction |
|---|
|
|
|---|
45% (22). Seven types of deletions
and 10 types of duplications have been reported so far, spanning from
20 to 150 kb. The most frequent deletion/duplication encompasses the
A1, GP, G2, G4, and
E genes (
150 kb). Analysis of the overall structure of
the deleted/duplicated haplotypes suggests that they probably
originated through unequal crossing-over during meiosis, due to
mispairing between highly homologous regions, especially of the two
large G-G-E-A clusters. Haplotypes with deletions or duplications are present with a variable frequency in all populations analyzed thus far, with a significantly higher frequency of duplications over deletions (Refs. 20 and 21 and the references therein). According to the unequal crossover model, duplications and deletions should originate at the same rate. Therefore, the higher frequency of duplicated vs deleted haplotypes suggests that selective pressures may be operating on the locus to eliminate deletions and/or favor duplications.
Our understanding of the origins of this remarkable genetic instability and our interpretation of the role that selection may play in determining the distribution of the variant haplotypes in the population depend upon establishing the molecular mechanisms that lead to the individual recombination events. So far, the recombination sites of only two deletions have been mapped. The first study, based on a Southern blot approach followed by library construction and sequencing, succeeded in locating the breakpoint of a G1-G4 gene deletion to a highly homologous region located 3' of G3 and G4, outside of the switch sequences (23). More limited analyses of an A1-E deletion mapped the breakpoint to a large region between the E genes and the switch A regions (24, 25).
In this report, we use Southern blotting, long-range PCR, and sequencing to define the deletion breakpoint in three individuals homozygous for a deletion from A1 to E, which is the most common type of deletion in the Italian population (20).
| Materials and Methods |
|---|
|
|
|---|
Peripheral blood samples were obtained from three unrelated
subjects (BON, DEM, and MON) and 10 controls from different peninsular
Italian regions. These three subjects are homozygous for a deletion of
150 kb encompassing the A1, GP, G2,
G4, and E genes. Analysis of IGHC RFLP
markers and Gm allotypes suggests a different origin of the deleted
haplotypes in MON and DEM (10, 13); BON is homozygous for
all tested markers (26). Informed consent was obtained
from all subjects.
Southern blot analysis
A total of 5 µg of genomic DNA, extracted using standard phenol/chloroform methods (27), was digested with SacI, PstI, PvuII, and SacI+PstI restriction endonucleases (Roche Diagnostics, Monza, Italy) and run in 0.8% TBE (0.045 M Tris-borate, 0.002 M EDTA)-agarose. DNA was transferred onto Hybond N+ nylon filters (Amersham Pharmacia, Little Chalfont, U.K.) and hybridized to the following genomic probes using standard techniques:
pHS
2-2.6 is a 2.6-kb SacI/PstI fragment (see
Fig. 1
A) subcloned from the
-TOU
-
phage (kindly provided by M. P. Lefranc,
University of Montpellier II, Montpellier, France).
|
1-0.9 is a 0.9-kb PstI-PvuII fragment (see
Fig. 1
11 (kindly provided by T.
Honjo, Kyoto University, Kyoto, Japan). These probes cross-hybridize with two regions between EP1-A1 and E-A2 genes.
Cloning and sequencing
The subcloning of
-TOU
-
and pH-Ig
11 was performed
using standard procedures (27). The clones obtained were
sequenced using a primer walking strategy and dye-terminator chemistry
on a model 377 ABI Prism automatic sequence analyzer (Applied
Biosystems, Perkin-Elmer, Monza, Italy). Data were elaborated using the
Navigator program (Perkin-Elmer) and compared with GenBank with BLAST.
PCR
To amplify the regions downstream of EP1 and E, specific forward primers 886 (5'-acactgggttcagctgacactacactgggttc-3') and 889 (5'-ggcctcacacagccctccggtgtaccacag-3') were used with the same reverse primer (891, 5'-tgtgtccacagatgtggctggatgctgagg-3') located 9 kb downstream of each gene. The reaction was performed using the "Expand Long Template PCR System" (Roche Diagnostics) according to the manufacturers instructions with the following cycling parameters: 2 min at 93°C; 10 cycles of 93°C for 10 s, 65°C for 30 s, and 68°C for 12 min; 20 cycles of 93°C for 10 s, 65°C for 30 s, and 68°C for 12 min with an increase of 20 s per cycle; and a final extension of 10 min at 68°C.
Hemi-nested PCRs of regions A (Fig. 1
B; primers 886 or 889
and 399: 5'-agtgaccggagacccaggggcctg-3') and B (primers 562
(5'-tgggcaaaaccagaagaggcc-3') and 891) were amplified from one-fiftieth
of the first PCR using the same long-range PCR conditions described
above and the following parameters: 94°C for 2 min; 10 cycles of
94°C for 10 s, 65°C for 30 s, 68°C for 6 min; 20 cycles
of 94°C for 10 s, 65°C for 30 s, 68°C for 6 min with a
20-s increase per cycle; and a final extension of 10 min at 68°C. The
two products were digested using frequent-cutting restriction
endonucleases and separated on a 3% low melting point agarose or 3:1
Nusieve agarose (FMC, Rockland, ME) in 1x TBE buffer.
Restriction sites were localized on the sequence using the mapping facilities at the Biocomputing Service Group (Heidelberg, Germany). The critical region C was amplified in three overlapping fragments, using one-fiftieth of the 9-kb product as a template. Primer pairs were: 1817 (5'-ctcccaggcccctggtctccgggtcatcc-3')/1843 (5'-ggggagtcagctttgtctgtgctcagtggg-3'), 562/2265 (5'-cgatggaattttctgtgcgtgcgtgaggacgcctgcc-3'), and E05 (5'-tttctctgcccgcaccctga-3')/891.
Sequence comparison
Region C was screened for the following sequences involved in
hot spots and chromosomal rearrangements using Lalign 2.0 DNA analysis
software (52): the
consensus sequence GCTGGTGG of Escherichia
coli, which stimulates generalized recombination
(28); the human hypervariable minisatellite consensus
sequences GGGCAGGARG (29) and AGAGGTGGGCAGGTGG
(30); the consensus sequences for Translin (ATGCAG and
GCCCWCCT), which are found at chromosomal translocation breakpoints in
lymphoid malignancies (31); the ATGACGT sequence present
in the recombinogenic M26 allele of
Schizosaccharomyces pombe ade6 (32); the
tetranucleotide repeats TCTG46 and
CAGG79 and the 289-bp consensus sequence
identified in two recombinational hot spots of mouse MHC,
between the Aß 3 and
Aß 2 genes, and in the Eß gene, respectively
(33, 34, 35, 36); and the retrotransposon long terminal repeat
sequence TCATACACCACGCAGGGGTAGAGGACT, located at the recombinational
hot spot of the E ß gene in mouse
MHC (37).
| Results |
|---|
|
|
|---|
The structure of the two homologous EP1-A1 and
E-A2 regions as reported in the literature (2)
and integrated by present data is shown in Fig. 1
A.
Southern blot analyses of BON, DEM and MON had previously demonstrated
a BamHI-RFLP containing EP1 at the 5' end of the
deleted region (i.e., absence of the BamHI site circled in
Fig. 1
) and a PstI-RFLP containing A2 at its 3'
end (Refs. 10, 13, and 26; and data not shown); therefore, we assumed
that the breakpoint was located in the intergenic region between
EP1 and A2 in all six deleted chromosomes.
In normal chromosomes, three restriction sites distinguish the
EP1-A1 from the E-A2 region (Fig. 1
A):
a PvuII site upstream of the A1 switch absent in
the corresponding position upstream of A2, a SacI
site present only in the middle of the switch A2 sequence,
and the above-cited BamHI site present within the
E gene but not within EP1.
Thus, genomic DNA of the three probands was further analyzed with
probes pHS
2-2.6 and pHS
1-0.9 (see Fig. 2
) that, after digestion with the
aforementioned enzymes, hybridize to fragments specific for the
EP1-A1 or the E-A2 regions (Fig. 3
) (38).
|
|
2-2.6 and pHS
1-0.9 showed a
single 4.8-kb band in BON and DEM and a heterozygous 4.8/5.4-kb pattern
in MON, with the latter band arising from a polymorphism (Fig. 2
1-0.9 to double
SacI/PstI digest (Fig. 2
2.0 kb, confirming the presence of the switch
A2 SacI site. Furthermore, the two probes show the specific
2.9- to 3.1-kb fragments on PstI filters, indicating that
the region upstream of A2 is not deleted (Fig. 2
10 kb between these sites and the aforementioned
BamHI site (see Fig. 1Characterization of the EP1-A1 and E-A2 intergenic regions
The genomic sequences available in GenBank only partially cover
the intervals between EP1-A1 and E-A2.
Unsequenced regions of
2.6 kb and
6.5 kb lay between
E-A2 and between EP1-A1, respectively (see Fig. 1
A). We tried to fill these gaps by sequencing the entire
2.6-kb SacI-PstI subclone from
-TOU
-
phage for E-A2 and the corresponding region from plasmid
pH-Ig
11 for EP1-A1, as well as the 5' adjacent
SacI-SacI fragment of 2.3 kb.
The two sequences were almost completely identical (identity equaling 98%). Their analysis in deleted haplotypes was exploited to refine breakpoint mapping by restriction analysis and sequencing (see below).
Variable numbers of tandem repeats (VNTRs) were also detected in
the corresponding positions of the two regions. A 33-mer VNTR sequence
(consensus AAGGATTTCD AGGTGARGGM VCCCTGGATT GTC) spans
approximately one-third of the SacI-PstI fragment
(see Fig. 1
A) in both the EP1-A1 and the
E-A2 regions; because of its large extension (>0.9 kb), we
could not overlap sequences from the two strands. PCR with primers
flanking the minisatellite allowed us to establish a wide length
variability on genomic DNA, with smaller alleles of
2530 repeats
and larger alleles of
4045 repeats. The consensus sequence did not
reveal any homology with known sequences or hypervariable
minisatellites. The VNTR polymorphisms can show identical patterns in
both the EP1-A1 and E-A2 regions; this makes them
less suitable for defining deletion breakpoints (see also Fig. 2
B).
In addition, within the 0.5-kb SacI-PstI fragment
of the SacI-SacI subclone from both the
EP1-A1 and E-A2 regions, sequencing revealed a
VNTR with repeats of 4041 bp (consensus TCCCTTCACC CACCCCTCCT
GCAYCTATC YDCAGCCCCC A), each possessing a canonical minisatellite core
(CNTCCTGCCC) and a sequence (CCCACCCC) perfectly matching Nakamuras
consensus (CCCACNNC) (39). Two repeats also showed a
-like sequence (CCACAGC) with a single nucleotide missing from the
consensus (CCACCAGC) (28).
Restriction analysis of long-range PCR products
Most of the critical interval in which the breakpoints are located
was amplified by long-range PCR. Two specific forward primers (886 for
EP1 or 889 for E) were designed where that the
two sequences diverge (i.e., EP1 becomes a truncated
pseudogene); a common reverse primer (891) was tailored 9 kb
downstream. For each region, two hemi-nested reactions (primers
indicated in Fig. 1
B) produced fragment A (
3.5 kb) and
fragment B (
4 kb).
The two amplification products from four control subjects (eight
haplotypes) were then tested with several frequent-cutting enzymes to
detect patterns specific for the EP1-A1 and the
E-A2 regions. Four enzymes (DdeI,
AluI, MaeIII, and SmaIII) in fragment
A and four (AluI, DdeI, NlaIII, and
MaeIII) in fragment B were found to differentiate the
EP1-A1 region from the E-A2 region (Fig. 4
A).
|
Fine mapping of the breakpoint
The sequencing of region C in four control subjects in
-TOU
-
and in pH-Ig
11 permitted us to draw the map shown in Fig. 5
. Only positions differing between
EP1-A1 and E-A2 in all of the chromosomes studied
are reported.
|
Known signals favoring recombination were searched for in region C (see
Materials and Methods). One
sequence and three binding
sites for Translin showed complete identity with their consensus; in
addition, a number of sites showed a single nucleotide difference from
other putative recombinogenic sequences (Fig. 5
and Table I
).
|
| Discussion |
|---|
|
|
|---|
The most common deletion/duplication, involving the genes from
A1 to E, depends upon mispairing between two
G-G-E-A blocks. Recently, a homozygous A1-E
deletion (T17) was independently investigated by Wiebe et al.
(24) and Chen et al. (25). Using a Southern
blot approach, the deletion breakpoint was located in two different
fragments due to the different interpretations of the RFLP pattern. One
group had assigned the breakpoint to a 5.2-kb
SacI/SacI fragment containing the switch
A2. However, this region is highly polymorphic because of
both the switch and the 33-mer VNTR; therefore, the fragment was likely
a normal polymorphism unrelated to the deletion. Chen et al.
(25) restricted the breakpoint region to a 2.0-kb
BamHI-SacI fragment just downstream of the
EP1 gene. Unfortunately, this conclusion is also incorrect
because Southern blotting can only place the T17 breakpoint between the
discriminatory sites reported in Fig. 1
. It should be noted that the
PstI digestion used by these authors cannot distinguish the
EP1-A1 from the E-A2 derived fragments. Using
long-range PCR, we have evidence that fragments of the same length can
originate from either region, likely due to the 33-mer VNTR (see Fig. 3
and the legend to Fig. 2
).
Our study localizes the recombination for the A1-E deletion
in three unrelated homozygotes within the intergenic region between
EP1/E and A1-A2 genes, which show
98%
sequence identity. More precisely, the breakpoint was pinpointed in all
individuals by sequencing to a
2-kb region of absolute homology
(only two nucleotide differences) between two VNTRs. Furthermore one
haplotype (DEM2) also shows an intermixed restriction site pattern for
EP1-A1 and E-A2, compatible with a gene
conversion event in proximity of the exchange site. Gene conversion
events have been reported by several authors in association with
meiotic crossing-over and other homologous recombination events. These
findings are in agreement with current double-strand break models of
recombination (43).
Our data suggest that in humans, long stretches of identity favor recombination between two sequences. Indeed, sequence homology has been demonstrated in recombinational hot spots such as those of the mouse MHC locus (44), pseudoautosomal region of sex chromosomes (45), hereditary neuropathy with liability to pressure palsies, and Charcot-Marie-Tooth 1A disease (46).
In addition to sequence identity, other specific factors or signals may have targeted the recombination event between the two G-G-E-A blocks. In fact, the same mispairing could theoretically also generate other deleted/duplicated haplotypes involving the GP-A2, EP1-G4, and G1-G4 genes: however, these are significantly rarer. Variable selective pressures are one potential explanation for this difference. However, it seems unlikely for a deletion of the A1-GP-G2-G4-E genes to be significantly favored in terms of fitness over one spanning, for instance, GP-G2-G4-E-A2. Alternatively, specific recombination-prone sequences between EP1-A1 and/or between E-A2 may have favored recombination in the interval. The low linkage disequilibrium level observed in particular within the E-A2 region (47, 48, 49) supports this model.
Repetitive DNA sequences have been proposed as preferred sites of recombination from microorganisms to humans. In the IGHC locus, the repetitive "switch" elements upstream of each functional CH gene are good candidates in light of their programmed involvement in somatic switch recombination. In an insertional mutagenesis assay, the Sµ region was shown to represent a recombinogenic hot spot with a 100-fold higher insertion frequency than the average genomic region (50).
In this respect, the two VNTRs found in the surroundings of the
recombinational region may be particularly important. The consensus
core sequence of human hypervariable minisatellites shows similarity to
the E. coli
sequence (39), a hot spot that
promotes generalized recombination catalyzed by RecA-RecBC
(51). Direct experimental evidence has also been obtained
showing that minisatellite sequences enhance the ability of a DNA
molecule to undergo recombination at a distance, perhaps acting as
recognition sites for a specific enzyme or macromolecular complex
(30).
In our study, the breakpoint critical region in all six investigated
haplotypes lay in the same
2-kb sequence containing two VNTRs
and several consensus sequences that are known to be involved in
recombination in humans, yeast, and bacteria. This strongly suggests,
although it does not directly prove, that these recombinogenic elements
may represent the site of the unequal crossing-over, and may even
coincide with an alleged recombination hot spot of the IGHC
locus.
| Acknowledgments |
|---|
| Footnotes |
|---|
2 Electronic database information: Biocomputing Service Group, http://www.genius.embnet.dkfz-heidelberg.de:8080/menu/; Lalign 2.0 DNA analysis software, http://vega.igh.cnrs.fr/bin/lalign-guess.cgi. EMBL accession numbers are as follows: intergenic EP1-A1 region, AJ238957 and AJ238959; intergenic E-A2 region, AJ238958 and AJ238960; breakpoint region in BON, AJ238961; breakpoint region in DEM, AJ238962; breakpoint region in MON, AJ238963; 3' of the 33-mer VNTR for all of the subjects, AJ238964. ![]()
3 Address correspondence and reprint requests to Dr. Alfredo Brusco, Dipartimento di Genetica, Biologia e Biochimica, via Santena 19, 10126 Torino, Italy. E-mail address: ![]()
4 Abbreviations used in this paper: IGHC, Ig heavy chain constant region locus; RFLP, restriction fragment length polymorphism; VNTR, variable number of tandem repeats. ![]()
Received for publication June 7, 1999. Accepted for publication August 3, 1999.
| References |
|---|
|
|
|---|
,
, and
genes. Nature 300:709.[Medline]
2 gene. Am. J. Hum. Genet. 38:67.[Medline]
hot-spot of recombination in the murine major histocompatibility complex. Science 234:173.
) regions. Hum. Genet. 97:695.[Medline]
genes reveal linkage disequilibrium and genomic organization. Proc. Natl. Acad. Sci. USA 80:6952.
hot-spots of generalised recombination. Cell 34:709.[Medline]
This article has been cited by other articles:
![]() |
M. L. Lundqvist, D. L. Middleton, S. Hazard, and G. W. Warr The Immunoglobulin Heavy Chain Locus of the Duck. GENOMIC ORGANIZATION AND EXPRESSION OF D, J, AND C REGION GENES J. Biol. Chem., December 7, 2001; 276(50): 46729 - 46736. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Vergnaud and F. Denoeud Minisatellites: Mutability and Genome Architecture Genome Res., July 1, 2000; 10(7): 899 - 907. [Abstract] [Full Text] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |