|
|
||||||||

*
University of Aberdeen, Department of Molecular and Cell Biology, Institute of Medical Sciences, Foresterhill, Aberdeen, Scotland; and
Guildhay Ltd., Guildford, Surrey, United Kingdom
| Abstract |
|---|
|
|
|---|
gene families
(V
VV
IX). The heavy and
light chain variable region gene
loci were found to be far more diverse than previously
thought. | Introduction |
|---|
|
|
|---|
The diversity and organization of the variable region VH-D-JH heavy chain and VL-JL light chain genes of humans and mice are well understood, as are the mechanisms involved in generating the enormous primary Ab repertoire necessary to fulfil the immune systems protective role (11, 12). In recent years much research has also been directed toward the study of domesticated animals of economic importance such as chickens and large farm animals, revealing variations in both the site of primary repertoire generation and the mechanisms used.
In humans and mice, B cell lymphopoiesis occurs in the bone marrow
(13) and continues throughout life. In the chicken the
bursa of Fabricius has been recognized as the site of B cell
development for some time (14), and in rabbits the
appendix functions as a bursal equivalent (15). More
recently it has been shown that other gut-associated lymphoid tissues,
i.e., the ileal Peyers patches, serve as sites of B cell
diversification in sheep and cattle, although diversification is also
seen in the bovine spleen (16, 17). Birds, together with a
number of mammal species, have less extensive germline
VH repertoires than mice and humans and/or
exhibit a restricted expression of Ig VH genes
and, perhaps as a consequence, use strategies of primary repertoire
development that overcome this limitation (18). Sheep, cattle, and
swine are all thought to express relatively few
VH genes belonging to a single
VH family, that of the former two being
homologous to human VH4 and the latter to human
VH3 (19, 20, 21, 22, 23). The sheep Ig light
chain primary repertoire is diversified by extensive somatic
hypermutation and is independent of Ag (16, 24), whereas
cattle and possibly swine also use templated gene conversion by
nonreciprocal recombination (17, 25, 26). Chicken and
rabbit have more extensive VH loci in that each
possess
100 genes, related to human VH3.
However, VH gene usage is limited by restricted
functionality or preferential expression, and gene conversion plays a
significant role in Ab diversity (27, 28).
In this study, we demonstrate for the first time that phage display technology can be used to study diversity, that sheep possess and utilize a more diverse Ig germline gene pool than was previously thought, and that sequences derived from pseudogenes may contribute to an ongoing process of IgG diversification.
| Materials and Methods |
|---|
|
|
|---|
Phagemid vector was transformed into Escherichia
coli TG1 (supE thi-1
(lac-proAB)
(mcrB-hsdSM)5(rK-mK-)(F'
traD36 proAB
laqIqZ
M15)).
Soluble expression was conducted in E. coli XL1-Blue
(supE44, hsdR17, recA1,
endA1, gyrA96, thi-1,
relA1, lac(F' proAB,
lacIqZ
M15, tn10
(tetr))).
Isolation of mRNA and production of cDNA
Total mRNA was isolated with the Quick-prep-mRNA purification
kit (Pharmacia, Milton Keynes, U.K.) from a total of 400 mg spleen
removed from a 10-year-old Welsh breed/Suffolk sheep that had been
hyperimmunized against a hapten target (atrazine) conjugated to
bovine-thyroglobulin (Guildhay, Surrey, U.K.). To prepare cDNA, 200 ng
mRNA was made up to 25 µl with RNase-free water, and 25 pmol of FOR
primer specific for sheep heavy,
, or
light chain constant
regions was added. The mixture was heated to 70°C for 10 min and
cooled to 42°C before adding 8 µl of 5x concentration first-strand
buffer (Life Technologies, Paisley, U.K.), 4 µl 0.1 M DTT, and 1 µl
dNTP mix (10 mM each). The mixture was incubated at 42°C for 2 min
before adding 1 µl (200 U) SuperScript II (Life Technologies) reverse
transcriptase, and incubation continued at 42°C for 50 min and then
for 15 min at 70°C.
PCR rescue and linking of variable heavy and light chain genes
PCR reactions comprising 25 pmol each OvVHBACK and OvVHFOR
primers, 1 µl dNTP mix (25 mM each), 5 µl 10x concentration
Bioline reaction buffer (160 mM
NH4SO4, 670 mM Tris-HCl (pH
8.8 at 25°C), 0.1% Tween 20), 2 µl 50 mM
MgCl2, 1 µl heavy chain cDNA, and sterile water
to 50 µl were prepared. The reactions were heated to 94°C for 5 min
and held while 0.5 µl Bioline Taq DNA polymerase (5
U/µl) was added. They were then incubated for 30 temperature cycles
of 94°C for 1 min, 60°C for 1 min, 72°C for 1 min, and then a
final incubation of 72°C for 7 min. Separate sets of 10 PCR reactions
were performed using each combination of OvVHBACK and OvJHFOR primers
(Fig. 1
). Both
and
light chains
were amplified as above using OvV
BACK/OvJ
FOR and
OvV
BACK/OvJ
FOR primer combinations with
and
cDNA
templates, respectively. PCR products were purified by electrophoresis
through a 1% TAE agarose gel, and bands of the correct size were
excised. DNA was recovered using QIAquick columns (Qiagen, Surrey,
U.K.) and eluted with 10 mM Tris-HCl (pH 8.5). Purified
VH and VL (
or
) DNA
was linked together by PCR. Approximately 1020 ng each of heavy and
light chain DNA was used per reaction for seven cycles as described
above. Reactions were then heated to 94°C and held while 25 pmol each
of OvVHBACKSfi and OvJ
FORNot or OvJ
FORNot pull-through primers
were added. PCR was continued for an additional 30 cycles. Separate
reactions were performed for each combination of pull-through primers
with the appropriate VH and
VL template DNA. Multiple reactions were
performed as necessary. Linked heavy and light chains were purified by
electrophoresis using low melting point SeaPlaque GTG agarose (FMC
Bioproducts, Rockland, IL). DNA was recovered from gel slices with
AgarACE (Promega, Southampton, U.K.) and then with phenol chloroform
extraction and ethanol precipitation.
|
Purified VH-VL
DNA was digested with 20 U of SfiI (Roche, Sussex, U.K.) per
µg DNA for 10 h at 65°C, the buffer composition was altered,
and 20 U per µg DNA NotI (Roche) was added. Incubation was
continued at 37°C for 16 h. The phagemid vector (10 µg) pDM-1
(kindly provided by D. McGregor, Rowett Research Institute, Aberdeen,
U.K.) was digested in the same way. The scFv fragments were ligated
with vector (2 µg each) using 10 U T4 DNA ligase (Roche) for 16
h at 16°C. Ligated DNA was extracted with phenol/chloroform, ethanol
precipitated, washed twice with 1.0 ml 70% ethanol, and redissolved in
20 µl sterile water. Transformation of E. coli TG1
(Stratagene, Cambridge, U.K.) was conducted by electroporation
(29) using 2 µl ligation product/40 µl cells, and
transformants were plated onto tryptone yeast extract (TYE) agar
containing 1% glucose and 100 µg/ml ampicillin (TYE-Glu-Amp). PCR
was performed on individual colonies using AH-1 and Fd seq1 primers to
determine the proportion of clones containing a scFv fragment of the
correct size (
850 bp). After incubation overnight at 30°C, the
colonies were scraped off into 2 ml of 2x TY-Amp-15% glycerol per
plate and pooled. Aliquots were prepared and stored at -80°C. To
rescue phage, 100 µl of glycerol stock (
3 x
109 cells) was inoculated into 500 ml of 2x
TY-Glu-Amp and incubated with shaking at 37°C to an
OD600 of 0.6 (12 h). M13KO7 helper phage
(Pharmacia) was added at 20x multiplicity to 50 ml of the culture that
was incubated at 37°C without shaking for 30 min. Infected cells were
pelleted, resuspended in 500 ml 2x TY-Amp-Kan-Glu, and incubated
overnight with shaking at 30°C. Phage particles were concentrated
from the culture supernatant by two successive precipitations with 1/5
volume PEG (20% polyethylene glycol weight to volume ratio, 2.5 M
NaCl) as described by Griffiths et al. (30).
Ab gene sequence analysis
Ab VH and VL genes
from clones selected at random from the original library glycerol
stocks were PCR amplified using the AH-1 and Fd seq1 primers. PCR
products were sequenced using the same primers on an ABI 377 automated
DNA sequencer (Applied Biosystems, Foster City, CA) in both directions.
Sequences were compared and dendograms were constructed using the GAP
and PILEUP programs (Daresbury, U.K.). Comparisons were restricted to
those parts of the rearranged genes encoded by the
VH, V
, or V
gene segments.
| Results |
|---|
|
|
|---|
The library was constructed from a sheep immunized with atrazine
conjugated to BSA for the isolation high-affinity anti-atrazine Abs
(to be described elsewhere). The sheep was sacrificed, and samples of
the spleen were removed. Variable region genes were amplified by PCR,
and the library was constructed as illustrated in Fig. 2
. The linker sequence is a modification
of that described by Chaudhary et al. (31). By using
multiple electroporations, a library containing 1.1 x
109 clones was produced. PCR revealed that 85%
contained inserts of the correct size, and digestion with the enzyme
BstN1 indicated that 89% of these had unique restriction
patterns (not shown). Therefore, the library was estimated to include
8.5 x 108 different clones.
|
Variations in the nomenclature used by researchers when discussing the organization and ancestry of heavy chain variable region genes can lead to confusion. In the context of this article and in reference to others works, the following system will be used: heavy chain gene families, where applicable, will be numbered using Arabic numerals; and the three major homologous groups will be referred to as "clans" and will be identified by Roman numerals using the classification of Kabat et al. (32), such that clan I includes human VH1, VH5, and VH7 families and clan II includes human VH2, VH4, and VH6 families.
Sheep heavy chains
A large number of clones were selected at random and the
VH and VL chain genes were
amplified by PCR with the AH-1 and Fd seq1 primers (Fig. 1
). The
sequences of 45 rearranged heavy chain genes are compared with the V5a
germline sequence (19) and JH1
sequence (Ref. 38 and Fig. 3
). A
dendogram constructed through Daresbury using the PILEUP program
demonstrates significant clustering (Fig. 4
). The region from framework region 1
(FR1)3 to the end of
FR3 was used for this analysis, including complementarity-determining
regions (CDRs) 1 and 2.
|
|
80% homology. Berman (39) and Matthyssens and
Rabbitts (40) have shown that cDNA and genomic DNA from
the same family differ by only a few nucleotides. This criterion has
previously been applied to the analysis of both germline genes and
rearranged IgM swine genes (26), IgM and IgG1 from cattle
(22), and IgG from cattle (21). Therefore, it
is assumed that the accumulation of somatic mutations during affinity
maturation is not sufficient to affect the family classification of
cDNAs. Seven groups can be identified with two or more members, and
other individual clones (H6, H23, and H17) appear not to be closely
related to any other sequences. Consensus sequences for the nine groups
indicated in Fig. 4
A similar analysis was conducted on the members of
VH1 together with the closely positioned
sequences H6 and H23 (Table I
). It is
notable that many of the clones share >80% homology with some,
including the V5a germline gene, and <80% with others. However, the
dendogram suggests a clear phylogenetic relationship with the family
VH1.
|
The lengths of CDR3s we have observed in sheep heavy chains range
from 23 aa in clone H257 down to 3 aa in clones H69 and H261. The
JH locus has been characterized by Dufour and Nau
(38), who identified two functional
JH segments and four pseudogenes in a region 5 kb
upstream of the Cmu gene and spanning 1.85 kb. Both
functional genes have been included in Fig. 3
, and the rearranged
sequences are compared with JH1. As with the 5'
end of FR1, the terminal eight codons of FR4 are encoded by primers
used in PCR, and so sequence variability should be ignored in this
region. Of the sequences belonging to VH1, 3 of
20 have used the JH2 gene segment as determined
by the presence of G in position 2 of codon 105 (Kabat numbering)
encoding an arginine (CGA). The remaining 17 have a proline
here (CCA). This preferential use of JH1 is in
agreement with previous findings (38).
Examination of the JH-encoded regions of the
remaining sequences belonging to families other than
VH1 reveals some interesting features. All such
sequences except H261 have either AG (19 clones) or AA (five clones) in
positions 2 and 3 of codon 105, giving rise to a glutamine residue. The
frequency with which this occurs is too great to be a result of
convergent somatic mutation. Analysis of the published
JH segments reveals that the pseudogenes
JH-ps2, JH-ps3, and
JH-ps4 all encode codons 103105 in this way
(5'-TGGGGCCAG-3') (38). Moreover, clones H242, H225, H168,
H9, and H297 include the motif 5'-TGCTTTTGA-3' (boxed sequence in Fig. 3
). Once again this sequence is found in the correct position in the
pseudogene JH-ps3. A second motif,
5'-ACGG-3' is found spanning the codons 2 and 3 aa upstream
of position 101 (Fig. 3
) in clones H3, H204, H11, H264, H217, H257,
H15, and H23. This motif is found in JH2, and
so its presence in H3 is not unexpected. However, the region
encoding positions 101102 in six of the other seven clones
suggests that these rearrangements involved the use of
JH1 and not JH2.
Chains
Thirteen of the clones sequenced from the unselected
phage library contained
light chains (Fig. 5
). Five new families can be identified
(Table II
), two of which include two
genes and a further three each assigned from a single gene. The
phylogenetic relationship of all of the sheep V
families is
illustrated in the dendogram in Fig. 6
.
In a previous study (41), the amino acid at the 3' end of
CDR3 (Kabat position 97) was found to be either alanine or serine,
corresponding to the use of the J
1 and J
2 gene segments,
respectively. All of the clones we have identified have a threonine
(ACT/ACG) in this position, the ACT codon being found in the pseudogene
J
3. In three clones (K227, K13, and K321) the last two codons of
CDR3 are 5'-TGG ACG-3', which does not correspond with any of the three
known J
gene segments.
|
|
|
Chains
Forty-seven rearranged V
sequences were aligned with the 5.1
germline gene described previously (Ref. 16 and Fig. 7
). All were
found to segregate with members of the V
I family described by
Reynaud et al. (Ref. 16 and not shown). GAP analysis of
light chains revealed a wide range of diversity between these
sequences that belong to the same family (not shown). Twenty-five
percent of the clones had less than 80% homology with the majority of
other clones. Clones L123 and L21 have >80% identity with the 17
germline genes belonging to the closely related but distinct family IV
described by Reynaud et al. (16). However, L123 has 91.6%
identity with 16.1 and L21 has 94.9% identity with 4.1, and so these
clones are clearly members of the V
I family.
|
light chains is a feature common to
many domesticated species including sheep (42, 43). The
fact that only a single V
family was identified despite the extent
of sequence divergence observed validates the sequence analysis methods
applied and supports the existence of the multiple
VH and V
families we have shown. | Discussion |
|---|
|
|
|---|
By producing a phage display library, we potentially have access
to the whole expressed IgG repertoire of the host animal. The primer
sequence used to produce heavy chain cDNA (CH1FOR) is conserved in both
IgG1 and IgG2 isotypes. In total, 45 heavy chains, 47
light chains,
and 13
light chains were sequenced from clones selected at random.
Sequences have been analyzed according to the established method of
Kabat et al. (32 which assumes that sequences that
diverge by greater than 80% are derived from different germline gene
families. In some cases this analysis has provided only a single family
member sequence and may be less decisive. However, in these cases
homology with all other sequences is less than 70% with no clustering
of sequence variation, and therefore it is unlikely due to PCR errors
or artifacts such as template jumping during library construction.
Because we have sequenced only 47 clones, a germline family providing
2% of the repertoire would be represented by a single clone.
Heavy chain genes
All previously reported VH genes from sheep
belong to a single family (19) that shows greatest
homology to the single family expressed in cattle (22).
When compared with human VH genes, both sheep and
cattle are homologous to VH4, a member of clan II
(38). The heavy chain genes of other species expressing a
single family such as swine, rabbit, and chicken are more closely
related to human VH3 (clan III) (23, 44, 45), which has been proposed as the ancestral
VH gene family (46). We have
identified nine heavy chain gene families in sheep, eight of which have
not been previously reported (Fig. 4
). Of the new families,
VH5, VH7,
VH8, and VH9 are homologues
of clan VH II, together with the
VH1 family already described.
VH3, VH4, and
VH6 are homologues of clan
VH I, and VH2 is a
homologue of clan VH III. The isolation of sheep
genes related to clan VH III confirms the
evidence obtained by Tutter and Riblet (46), who observed
hybridization of probes derived from the murine S107 and 7183 gene
families to sheep genomic DNA. The sheep families
VH1, VH7,
VH8, and VH9 are most
similar to human VH4, and the sheep
VH5 is most similar to human
VH6 (not shown). Saini et al. (47)
reported detecting homologues of murine VH11 in
bovine genomic DNA by Southern blot. In view of the close homology
between ovine and bovine Ig genes, the sequences reported in this study
may prove valuable in a more extensive analysis of the bovine
genome.
All of the sequences we have obtained were derived from cDNA and so were being expressed in the host animal. Greater than 55% of the VH genes belong to new sheep families. Previous detailed studies have not revealed the heavy chain diversity that we have seen. Selective breeding has not resulted in variations in the diversity of Ig light chain loci between sheep belonging to different breeds (35). A possible factor is the age of the donor sheep. Heavy chain genes sequenced by Patri and Nau (36), Dufour et al. (19), and Dufour and Nau (38) were derived from spleen obtained from a slaughterhouse or were from animals described as adult. Previous analyses of light chain genes have used material taken either from fetuses or lambs up to 4 mo after birth (16, 24) or from sheep of unspecified age (34, 35). We have used spleen from an animal that was more than 10 years old when sacrificed, leaving the possibility that sheep may utilize a greater diversity of germline heavy chain genes in later life than are expressed when young.
From our results it can be seen that members of the same gene family
frequently share unusually low nucleotide identity, i.e., <80%. This
is particularly noticeable in the sheep heavy chain family
VH1 and the
family V
I. A similar
observation has been reported in the llama for both the conventional
VH and the VHH homodimer
cDNA sequences (37) and may be a result of the necessity
to generate a diverse Ab repertoire from a gene pool of restricted
diversity. Our data demonstrate that in contrast to the llama, the
sheep heavy chain germline is more diverse than had been previously
thought. In addition, from analysis of heavy chain CDR3 there is
evidence of use of JH pseudogene segments, which
has not previously been reported in sheep. If there are in fact only
two functional JH segments in sheep, as described
by Dufour and Nau (38), then the apparent use of
pseudogenes has been restricted to genes belonging to families other
than VH1. It is possible that the sheep genome
families VH29 may include multiple highly
homologous V genes that have originated during evolution by gene
duplication and subsequent mutation. The clones with pseudogene
sequences may represent further somatic mutation of these genes or gene
conversion events.
Light chain genes
The
genes represent six separate families. There is evidence
that sheep preferentially express certain
gene families at
different stages of development (41), with V
IV
dominating during the final stages of gestation. The single published
gene isolated from adult tissue (35) belongs to this
family. That this group is a major contributor to the adult V
repertoire is confirmed by our data in that six of the 13 genes we have
identified are V
IV. However, phylogenetic comparison (Fig. 6
)
suggests that they form a separate subgroup to the V
4 gene. The
J
-encoded region of our clones does not closely match the
distinctive regions of any of the three known J
segments and may
indicate that sheep possess a more extensive genomic J
region than
that sequenced to date.
The sheep V
repertoire is known to be diverse, including at least
six different families (16). Only genes from V
I, II,
and VI have been identified from cDNA and so are known to be expressed
(41). All such rearranged genes were isolated from fetal
material. The 47 unselected sequences described in this study belong to
the V
I family, which leads us to suggest that in contrast to the
VH and V
repertoires, V
gene usage is
restricted in adult sheep. We have found that expressed V
I genes
are frequently highly divergent from known germline sequences. The
extent of the observed divergence is such that two genes, L2 and L222,
have >80% sequence identity only with each other, and a further two,
L21 and L123, have >80% identity not only with V
I germline genes
but also with the "17" germline gene belonging to the related but
separate family V
II.
In conclusion, ruminants are thought to possess a smaller and less diverse gene pool than humans and mice and to utilize different mechanisms for generating their primary immune repertoire. Our studies indicate a greater level of functional diversity than previously described in sheep, though this does not necessarily imply a larger gene pool. Ab diversity and repertoire development are important components in animal health and understanding of disease processes. The studies we describe demonstrate the value of sheep Ab phage display libraries and provide a powerful new tool for such research.
| Acknowledgments |
|---|
| Footnotes |
|---|
2 Address correspondence and reprint requests to Dr. W. J. Harris, Department of Molecular and Cell Biology, University of Aberdeen, Institute of Medical Sciences, Foresterhill, Aberdeen AB25 2ZD, Scotland, U.K. ![]()
3 Abbreviations used in this paper: FR, framework region; CDR, complementarity-determining region. ![]()
Received for publication September 21, 1999. Accepted for publication April 6, 2000.
| References |
|---|
|
|
|---|
. Science 246:1275.
-heavy chains: limited VH gene repertoire, combinatorial diversification by D gene segments and evolution of the heavy chain locus. EMBO J. 7:739.[Medline]
This article has been cited by other articles:
![]() |
W. J. J. Finlay, I. Shaw, J. P. Reilly, and M. Kane Generation of High-Affinity Chicken Single-Chain Fv Antibody Fragments for Measurement of the Pseudonitzschia pungens Toxin Domoic Acid. Appl. Envir. Microbiol., May 1, 2006; 72(5): 3343 - 3349. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Baker, K. Belov, and R. D. Miller Unusually Similar Patterns of Antibody V Segment Diversity in Distantly Related Marsupials J. Immunol., May 1, 2005; 174(9): 5665 - 5671. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |