|
|
||||||||




* State Key Laboratory of Biocontrol, The Open Laboratory for Marine Functional Genomics of State High-Tech Development, Department of Biochemistry, Guangzhou Center for Bioinformatics, College of Life Sciences, Sun Yat-Sen (Zhongshan) University, Guangzhou, Peoples Republic of China;
Department of Marine Biotechnology, Ocean University of China, Qingdao, Peoples Republic of China;
School of Life Sciences, Xiamen University, Xiamen, Peoples Republic of China; and
Center for Advanced Biotechnology and Medicine, University of Medicine and Dentistry of New Jersey-Robert Wood Johnson Medical School, Piscataway, NJ 08854
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
In this study, we report the identification of genes whose homologues involve the immune system especially the AIS. The large scale expressed sequence tag (EST) analyses in Chinese amphioxus presented in this study provide a link between the genomic study of Ciona and the EST analyses in lamprey in further understanding the origin and evolution of AIS. All evidence we demonstrated supports the general statement that the conventional AIS appear after the jawless/jawed vertebrate but not the cephalochordate/vertebrate split. Furthermore, the molecules identified in this study represent the precursors of vertebrate AIS-related genes and the other housekeeping genes that are waiting for recruitment by the emergence of AIS.
| Materials and Methods |
|---|
|
|
|---|
Adults of Chinese amphioxus, Branchiostoma belcheri tsingtauense and B. belcheri (Gray), were collected from Kioachow Bay near Qingdao and Xiaman, China, respectively, and kept alive with seawater and sea alga. Tissues of intestine, notochord, and ovary were harvested from adult animals. In breeding seasons, gastrula, neurula, and larva were bred up from the collected oosperm in a man-made seawater environment (17, 18).
Library construction and cDNA sequencing
Library construction and cDNA sequencing were performed according to the procedures previously described in the analyses of the neurula cDNA library (19). The adult animal library was constructed from B. belcheri (Gray), and the other libraries including ovary, neurula, gastrula, larva, and intestine were all prepared from B. belcheri tsingtauense.
EST analysis
All 5' EST reads were treated with software PHRED (20, 21) to remove vector sequences and low-quality regions, and then assembled into consensus sequences with software STACKPACK (version 2.1 patch 1) (22, 23). The consensus sequences were used as ESTs to search against GenBank with the BLASTX program (24). EST clustering was performed using STACKPACK2.1 with default setting.
Phylogenetics analyses
Consensus cDNA sequences of genes were searched using the BLASTX algorithm against NR Database from the National Center for Biotechnology Information (NCBI). The homologous sequences were selected from the BLASTX search results as the input for phylogenetic analysis. Consensus phylogenetic tree was constructed with the Phylogeny Inference Package (PHYLIP) (25). The input data for phylogenetic methods is the most consistent alignment that was obtained with the CLUSTALW program and reduced by GBLOCKS (26). Two methods were used to construct the phylogenetic tree using the NEIGHBOUR and PROTPARS programs (PHYLIP). The statistical significance of phylogenies was estimated by using the SEQBOOT program (PHYLIP) by bootstrap analysis with 100 pseudoreplicated data sets, grouping the bootstrap values higher than 50 as significant. The consensus trees were generated by CONSENSE program (PHYLIP) and were plotted by TREEVIEW (version 1.3).
Bioinformatics analyses
Some immune-related ESTs were obtained from the BLASTX results of our database (E value <1e5 and the length of amino acid sequence >50). All domains that each molecule contained are identified through RPSBLAST in NCBI first. Then Pfam (27) and Prosite (28) databases are queried to find whether the model organisms have the molecules that contain those domains. For the IgSF members, the whole sequences from amphioxus are used to search against the proteomes of the model organisms. Considering the proteome of ascidian is unavailable, TBLASTX program was used to search against the ascidian cDNA database, which is from
http://ghost.zool.lyoto-u.ac.jp/indexrl.html
, and the ENSEMBL server was used to predict the homologous genes to refine our analyses (29).
In situ hybridization
Adults of the Chinese amphioxus, B. belcheri tsingtauense, were collected and kept in filtered seawater for 2 days. Then the whole body was fixed with 4% paraformaldehyde in PBS and later was made into 14-µm transverse sections in slices by frozen sectioning. The digoxigenin-labeled probes were prepared by using the plasmids that contain the sequences of the interesting ESTs with the SP6 promoter sequence at the 3' end of the sequences as template and got the antisense probes with the SP6 RNA polymerase according to the protocol of the digoxigenin DIG RNA labeling kit (Roche). In situ hybridization was according to Li et al. (30). The negative control probes were prepared in the opposite direction. The ESTs used as probes of in situ hybridization were chosen from the database and cover the IgSF, MHC region genes, and the CD molecules.
| Results |
|---|
|
|
|---|
A total of 23,095 sequences were generated by randomly sequencing the clones of six cDNA libraries of Chinese amphioxus, including ovary, neurula, gastrula, larva, intestine, and the whole body of an adult animal. In total, 9009 derived consensus sequences were fed to BLASTX search against GenBank, and 3795 were identified under the expect value of 0.01.
Previously, Panopoulou et al. (31) released 14,189 ESTs from two normalized cDNA libraries of another amphioxus species, Branchiostoma floridae, and this data set was downloaded and subjected to an overall clustering with our own data collection. Only 2.94% EST clusters were shared by two data sets, these clusters accounted for 9508 ESTs of our own data set and 5547 ESTs of Florida amphioxus, respectively, and these ESTs mainly represented abundant mRNAs. Nonetheless, this implied a considerable amount of novel genes residing in our data set exclusively, although most sequences from both data sets were not supposed to be species-specific.
As major contributors of our whole EST collection (64.7%), libraries of intestine, neurula, and ovary provided 80.1% of immune-related ESTs in this study. The percentages of unique sequences from these three libraries were all above 71%, indicating that these libraries had a good coverage of the low abundant mRNAs. Conversely, as expected from their different sources, the profiles of gene expression, delineated according to BLASTX search results, exhibited apparent discrepancies among the three libraries.
In addition to the global profile, up to 291 EST clusters (accounting for 937 ESTs) were sorted out as immune-related EST clusters from six libraries. These clusters of EST were broken down into 10 catalogs (Table I), among which the most interesting entries in our dataset were listed in supplemental Table 1. 5 In this immune-related data set, 183 ESTs were singletons, and the other 754 ESTs were grouped into 108 clusters, of which 18 clusters contained over 10 copies.
|
The genes encoding IgSF members
Through the recombination of multiple copies of various interspersed V, (D)J, and C segments in somatic cells, Igs and TCRs have developed the diversity to recognize various Ags. These two pivotal effectors of adaptive immunity are generally believed to be restricted in jawed vertebrates (32). Igs and TCRs share an essential "Ig-fold" that comprises a sandwich-like domain of two
-pleated sheets and they belong to the IgSF. According to the length between two cysteines and the number of the
-sheets, the IgSF domain can be classified into four types: V, C1, C2, and I (33). C1 type is restricted in some molecules only in jawed vertebrates such as TCR, Ig, MHC class I and class II, tapasin, signal regulatory proteins, and butyrophilins. However, the other three types are widespread in several kinds of molecules through the metazoa. Using the short-primer PCR, largely based on the conserved short motifs in the Ig-fold along with other methods (34), many genes bearing the typical V-C structures have been isolated (10, 35, 36) and most of them function as the receptors of innate immunity, suggesting the evolutionary relationship between innate immunity and adaptive immunity (37). Recently, some IgSF members bearing the V or V-C structures have been found in ascidian genome (12) and in amphioxus (10, 11). The architecture of these genes showed more or less similarities to the Ag receptors and provided insights to the primitive forms of Ag binding receptors.
In our intestine cDNA library of Chinese amphioxus, two kinds of IgSF members are found: one is the VCBP and the other is a novel member of IgSF. The VCBP molecules in our Chinese amphioxus library have just
70% identities in amino acid sequence with the VCBP4 of the Florida amphioxus. In addition, the different splicing forms of the VCBP have been detected by RT-PCR (data not shown). The alternate splicing that operates in this multigene family suggests its potential diversity and flexibility to different ligands.
The novel member of IgSF encodes 338 amino acid residues and comprises a signal peptide, one V-type domain, one C-type domain, a transmembrane region, and a cytoplasmic region, and named as V and C domain-bearing protein (VCP). In terms of the entire protein organization, this gene is dramatically different from the VCBPs and the CD47-like (V domain-bearing) genes found previously in amphioxus. Despite the variety of the Ag binding receptors, the amino acid sequences of IgSF domains still have some canonical positions at residues of G16, C23, W41, L89, D98, Y102, and C104 (10, 33, 34). Alignments of the first Ig domain of VCP with the other V-type domain of rearranging and nonrearranging IgSF molecules indicate that some canonical residues are highly conserved in VCP, especially the residues that are crucial for the formation of the Ig-fold structure (Fig. 1). The secondary structure prediction programs (Jpred and PSIpred) (38, 39) indicate the presence of eight
-strands in the V domain and seven in the C domain (Fig. 2). Through IgBLAST server at NCBI (
www.ncbi.nlm.nih.gov/igblast/
), two loops corresponding to the CDR1 and CDR2 of Ag receptor molecules can be inferred from the V domain of VCP, whereas a loop equivalent to the CDR3 loop of Ag receptor molecules also can be identified between F and G
-strands of the V domain (Fig. 2). However, the typical diglycine bulge (GxG or FGxG) of J region in the Ag receptor molecules is missing. The consensus pattern, [FY]-x-C-x-[VA]-x-H, is found to exist in the F
-strand of the second IgSF domain of VCP within the C1 type IgSF domain for the formation of conserved disulfide bond according to the analysis using the PROSITE program (29) (Fig. 2). An extra cysteine exists between the C domain and transmembrane region of VCP, suggesting the possibility of the dimerization of two VCP molecules or other partners. Along with CDRs, VCP also contains some predicted phosphorylation sites, such as protein kinase C phosphorylation site and casein kinase II phosphorylation site in the cytoplasmic region (Fig. 2), suggesting the ability of VCP to mediate the signal transduction through these phosphorylation sites. It is intriguing that an ITIM-like motif is found in the cytoplasmic region of VCP, indicating the possibility of the existence of ITIM-dependent signal transduction (Fig. 2).
|
|
In this study, in situ hybridization to transverse section through the upper or lower pharyngeal region of sexual-matured amphioxus indicates a selective expression of VCP in the wall cells of pharynx and hepatic cecum, a diverticulum of the intestine that may be the precursor of the liver (or pancreas) of vertebrates (Fig. 3, AC), similar to the observation on the other IgSF molecules such as VCBPs in previous studies (10).
|
Homologues of Ag peptide processing and presentation genes and MHC anchor genes
The Ag presentation system in AIS involved many proteases and chaperones, and Danchin et al. (44) demonstrated that many of them gained their specific function in Ag processing and presentation through various levels of co-option. Many homologues of proteasomes and chaperones were found in our database. Besides the genes encoding proteasome (proteasome, macropain) subunit
-type (PSMB)7/10 and PSMB5/8 previously reported (45, 46), other homologues of the housekeeping genes PSMB6/9 and PA28
were also found in our library. Like the genes found in lamprey (6) and ascidian (12), amphioxus seems to have no inducible forms that change their expression pattern or even their biochemical behavior for Ag processing after genomic duplication. Homologues of the chaperones involved in the process of MHC I molecules assembly were identified in our library such as calreticulin, chaperone protein Dnak (Hsp70), and the glucose-regulated protein of 78 kDa (GRP78). As for the ATP-binding cassette transporters, just homologues of ATP-binding cassette BI and C sequences were found in our database. A homologue of stress-induced ribosome-associated protein 4, RAMP4, which interacts with nascent MHC class II-associated invariant chains in mammalian lymphocytes (47), was also identified.
An EST similar to IFN-
-induced lysosomal thiol (GILT) reductase was found in our database. In vertebrate this enzyme is expressed constitutively in APCs and induced by IFN-
in other cell types. By reducing disulfide bonds under acid condition, GILT can facilitate the processing and presentation of Ag peptides containing disulfide bonds by MHC II molecule to Ag-specific T cells during the proteolysis within lysosomes. In the absence of GILT, presentation of two major lysozyme antigenic epitopes to T cell is partially or completely abrogated (48). The Amphi-GILT found in our database contains the conserved active site CxxC, and nine cysteines downstream of the active site just like the counterparts in human and mouse (Fig. 4). The phylogenetic tree (Fig. 5) shows the evolutionary relationship between Amphi-GILT and other proteins of the GILT family that contain the conserved active domain. According to its sequence similarity and phylogenetic analysis, Amphi-GILT gene is more similar to vertebrate ones than invertebrate, suggesting a closer function to its counterparts in vertebrates. The similar result has been reported in the phylogenetic analysis of immunoproteasomes (46), in which the rate of amino acid substitution has been substantially accelerated in all the IFN-
-inducible forms. Their accelerated evolving rate beginning with amphioxus might be functionally required for themselves: evolve fast to deal with multifarious foreign antigenic peptides during evolution.
|
|
BL (NF of
light polypeptide gene enhancer in B cell inhibitor-like 1), which are linked anchor genes in vertebrates MHC region. The study on the genomic organization of these new anchor genes will help to ultimately reveal the evolution of MHC genomic organization from nonvertebrates to vertebrates. Homologues of genes involved in immune signaling
In our database, homologues to many kinds of conserved enzymes used not only by signaling of AIS but also by general signal transduction were identified, including protein tyrosine kinase, protein tyrosine phosphatase, MAPK, MEK, and small GTPase (Ras, Rho, and Ran). Some homologues to transcriptional factors, the final effectors of the signal transduction, were found in our libraries, such as Collier/Olf1/early B cell factor (COE/EBF) gene family member, Ets, and lymphoid transcription factor (LyF). EBF belongs to a conserved helix-loop-helix gene family. The EST homologous to EBF in our database was found to have 80% identity over the entire molecule and >90% identity over a stretch of 370 residues, suggesting its essential feature of dimerization and DNA binding. In situ hybridization also indicated the expression of Amphi-EBF in pharynx and hepatic cecum (data not shown). In addition, homologues to B cell adaptor for PI3K and CD3
-associated signal transducer found recently in lymphocyte-like cells library in lamprey (7) were detected in our database. Our database also contains sequences encoding several other mammalian homologues involved in the signaling pathways shared by many different cell types including lymphocytes, such as TNFR and pre-B cell colony-enhancing factor. Though the existence of these signal-related molecules was not regarded as the direct evidence for the existence of AIS signaling in amphioxus, they provided a sufficient preparation for the appearance of the AIS.
In contrast, many homologues of CD molecules were identified in amphioxus, such as CD9, CD20, CD53, CD63, CD81, and CD82. Some of these CD molecules, such as CD9, CD53, CD63, and CD81, belong to the tetraspanins family that feature four transmembrane domains, a small extracellular loop 1 (EC1), and a mushroom-like large EC2 (49). Functionally, tetraspanins act as molecule organizers of membrane microdomains and multimolecular complexes, such as signaling complexes on the lymphocyte surface (50). By their nature, tetraspanins widely participate in cell migration, motility, adhesion, activation, and proliferation. One member of the tetraspanins family, CD53, is specifically expressed in B cells, T cells, dendritic cells, monocytes, NK cells, and granulocytes, and involved in T cell activation, positive selection, and B cell integrin adhesion (51). In our database we found a CD53 homologue with 37% identity and 55% similarity to the mouse CD53 and has a conserved site of potential N-glycosylation in EC2. However, unlike the canonical pattern of mammalian CD53 (CCG, DW, PxSC, GC), this amphioxus homologue has two extra cysteines in the EC2 (Fig. 6). The expression of the Amphi-CD53 gene was detected by in situ hybridization of adult amphioxus and the signal was specifically located in the pharynx and hepatic cecum particularly in the pharynx wall (Fig. 3, E and F). The tetraspanin family has many members involved in signal transduction including immune-related signaling between cells. Although some homologues of the family members were found in our database, whether they have the immune function needs further study. Just like the other families whose members are involved in immune function, a certain member in the tetraspanin family may be waiting for obtaining the new immune-related function in amphioxus.
|
Because many homologues of housekeeping genes involved in not only AIS but also other biologic systems have been detected in our database, the comparative analyses of these genes are performed in this study. The results of domain analyses are indicated in Table II, in which the two members of IgSF show intriguing results. For VCP, when using the whole sequence to search against the database of model organisms, there is no distinct homologue, whereas VCBP just has homologue in ascidian. In contrary, the domain analyses reveal the fact that Ig domains in VCP and VCBP are present in many kinds of molecules in metazoa, but the Ig domains coming from different molecules show lower similarities with each other. For VCBP, the chitin-binding domain is also widespread in metazoa. These comparative analyses indicate some scenarios of the evolution of these two molecules. Although the functions of these two IgSF members are unclear now, but for the molecular characteristics of these genes, they might have their function in the defense system in amphioxus. To deal with the increasing diversity of ligands they might evolve rapidly through changes of sequence or expression pattern to have novel function, followed by the recruitment into new interaction pathways. Another scenario is that with the appearance of V(D)J rearrangement machinery, these two IgSF members become nonessential for the defense system, thus lost after evolutionary split of cephalochordate/vertebrate. However, their domains became the "candidates" of some shuffling events and were involved with other new molecules existing in vertebrates.
|
| Discussion |
|---|
|
|
|---|
Although classical MHC molecules are not yet to be identified in amphioxus, several genes with housekeeping functions in Ag processing and presentation have been found, as similarly found in ascidian (12, 13). Our data in this study are consistent with the previous research: the inducible forms of those housekeeping genes that are directly involved in the Ag processing and presentation are not present in amphioxus. Those housekeeping genes, among which the human counterparts map to the different MHC paralogous regions, tended to be in a contiguous genomic fragment in amphioxus and ascidian genome. The absence of the functional forms and the existence of this proto-MHC region revealed the possibility that those housekeeping genes are ready to be co-opted by the bona fide MHC molecules during evolution.
The origin of lymphocytes is another interesting issue (56). Thus far, no distinct lymphoid tissues have been found in agnathan and protochordate by histologic studies but some lymphocyte-like cells were isolated from the intestine and the associated typhlosole of lamprey (5). In our studies, many genes homologous to the counterparts for the mammalian lymphocytes signaling and development were identified, including some transcriptional factors and many enzymes being used in different cell lines in immune system. In contrast, the gene expression pattern analyzed in this study showed amazing similarity among different genes. These genes were expressed mostly in pharynx and intestine (hepatic cecum), the first defense against the foreign pathogens in amphioxus, providing the possibility that the ancestor of lymphocytes might derive from the cells located in intestine through the evolution.
In summary, we identified extensive ESTs homologous to genes participating in immune response in vertebrate. Although none of the key genes specific to the AIS was found, the existence of members belonging to multigene families (IgSF, proteasomes, enzymes, transcription factors, and other) involved in immune or nonimmune responses suggested that precursors of the genes involved in vertebrate AIS may have existed in protochordate. Many of these precursors gained new roles in adaptive immune response when the key genes evolved into the jawed vertebrates. Amphioxus represents the pivotal position during evolution, and it is generally known that some important steps such as the genomic duplication took place at the cephalochordate/vertebrate split (57, 58). This large-scale genomic duplication provided the opportunities for the emergence of the new forms of those multifamily members that may evolve specific roles in the AIS. In short, the adaptive immune system is in its twilight in amphioxus. The Ag binding molecules, Ag processing and presentation molecules, and signal transduction-related molecules were all in their evolution and waiting for immunologic "Big Bang" to be recruited for AIS. As such, the functional analyses on those genes identified and interactions among them in comparison with other vertebrates like human and mouse will be of particular interest for further studies. However, the sequencing of the whole genome of amphioxus along with the analyses of the other protochordate genome sequence might be pivotal to the study of the original status of the immune-related gene organization, especially the genes related to AIS before vertebrate.
| Disclosures |
|---|
|
|
|---|
| Acknowledgments |
|---|
| Footnotes |
|---|
1 This work was supported by the "Outstanding Young Scientist" Award (No. 39725007), Projects 2004AA621030 and 2003AA626010 of State High-Tech Development Project (863) of Ministry of Science and Technology of China, Key Project (0107) of Ministry of Education, Project 30300264 of National Natural Science Foundation, and Key Projects of Commission of Science and Technology of Guangdong Province and Guangzhou City. ![]()
2 C.Y., M.D., and X.W. contributed equally to this work. ![]()
3 Address correspondence and reprint requests to Dr. Anlong Xu, Department of Biochemistry, College of Life Sciences, Sun Yat-Sen University, 135 Xingangxi Road, 510275 Guangzhou, Peoples Republic of China. E-mail address: ls36{at}zsu.edu.cn ![]()
4 Abbreviations used in this paper: AIS, adaptive immune system; EST, expressed sequence tag; IgSF, Ig superfamily; VCBP, V region-containing chitin binding protein; VCP, V and C domain-bearing protein; GILT, IFN-
-induced lysosomal thiol reductase; EBF, early B cell factor. ![]()
5 The online version of this article contains supplemental material. ![]()
Received for publication July 28, 2004. Accepted for publication December 17, 2004.
| References |
|---|
|
|
|---|
-type proteasome subunits regulated reciprocally by IFN-
: structural comparison, chromosomal localization, and analysis of the promoter. J. Immunol. 159:2760.[Abstract]
This article has been cited by other articles:
![]() |
S. Huang, S. Yuan, L. Guo, Y. Yu, J. Li, T. Wu, T. Liu, M. Yang, K. Wu, H. Liu, et al. Genomic analysis of the immune gene repertoire of amphioxus reveals extraordinary innate complexity and diversity Genome Res., July 1, 2008; 18(7): 1112 - 1126. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. P. Terwilliger, K. M. Buckley, D. Mehta, P. G. Moorjani, and L.C. Smith Unexpected diversity displayed in cDNAs expressed by the immune cells of the purple sea urchin, Strongylocentrotus purpuratus Physiol Genomics, September 14, 2006; 26(2): 134 - 144. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE |