|
|
||||||||


* Sars International Centre for Marine Molecular Biology, High Technology Centre, Bergen, Norway;
Center of Marine Biotechnology, University of Maryland Biotechnology Institute, Baltimore, MD 21202; and
Howard Hughes Medical Institute and Molecular Biology Institute, University of California, Los Angeles, CA 90095
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
The protochordate Oikopleura dioica is a pelagic tunicate (Urochordata) that occupies a phylogenetic niche close to the transition between vertebrates and invertebrates (7). Oikopleura is an appendicularia, a term that refers to their house appendices. The filter-feeding house of appendicularia enables them to use a wide range of food particles and thus become an important component of marine zooplankton. To maintain adequate filtration rates, the house must frequently be replaced. For O. dioica, a new house is synthesized every 34 h over a life cycle of 56 days at 15°C (8). To date, there is no evidence for any molecules that may be ascribed to an immune function in these animals. Because no lymphocyte-like cells have been identified in other deuterosomes, the earliest period for the emergence of lymphocytes is concomitant with the divergence of cartilaginous fish, >420 million years ago. Around this time point, two possible gene duplication or partial duplications are thought to have occurred, creating a large gene pool that provided opportunities to further gene diversification and adopt new roles peculiar to vertebrates (9, 10). Recent evidence of the existence of transcription factors involved in lymphocyte differentiation in skate (cartilaginous fish) and lamprey indicates that they duplicated around the time of the appearance of Ig, TCR, and MHC (11, 12, 13, 14, 15). Therefore, the isolation of regulatory transcription factors can be used to identify homologues and putative hemopoietic sites, which may improve our understanding about the evolution of lymphopoiesis in vertebrates. One important family of transcription factors is the Ikaros family, whose members function as key regulators of hemopoiesis.
The development of the lymphoid system, at least in higher vertebrates, is regulated by a network of transcription factors in hemopoietic stem cells and signaling mediated through cell contact and growth factor receptors. Gene inactivation experiments in mice have identified several transcription factors, including the GATA family, Pax5, PU.1, and Ikaros that are crucial for early lymphocyte lineage development and the development of other hemopoietic cell types (16, 17, 18, 19, 20). The expression of Ikaros is restricted to hemopoietic cells and is essential for hemopoietic stem cell differentiation to the lymphocyte lineages. It is expressed in the earliest hemopoietic progenitors and throughout the life of the lymphocyte as well as in all stages of B, T, and NK cell lineage development (18). Ikaros is a member of a small, closely related gene family, two members of which, Aiolos and Helios, are also implicated in lymphocyte development (21, 22, 23). These Ikaros family proteins can contain up to six C2H2 zinc fingers (ZFs)4 organized into two distinct domains. The four N-terminal ZF motifs participate in DNA binding, whereas the two C-terminal ZFs are involved in homo- and heterodimerization between members of the Ikaros clan (i.e., Ikaros, Helios, and Aiolos). The Ikaros multigene family function as regulators of hemopoiesis with proposed roles in gene activation and silencing during lymphocyte development (24, 25); in particular, its association with heterochromatin containing transcriptionally inert genes suggests that it may act as a negative regulator (26, 27).
The ZF motifs of the Ikaros gene family have been conserved throughout vertebrate evolution, allowing genes from phylogenetically distant vertebrate species to be cloned (11, 18, 28, 29, 30, 31). In this study, we describe an Ikaros family-like (IFL) transcription factor from the most primitive vertebrate, the agnathan Atlantic hagfish Myxine glutinosa, and from two marine urochordates, O. dioica and Ciona intestinalis, which lends insight into the evolution of this essential gene family and that of adaptive immunity of vertebrates.
| Materials and Methods |
|---|
|
|
|---|
Atlantic hagfish were collected in baited traps offshore from the Marine Research Institute field station at Espegrend, Norway, and then transferred to a darkened aquarium containing fresh seawater at 8°C. O. dioica were collected from the fjords around Bergen, Norway, and cultured, as previously described (8).
Isolation and analysis of IFL cDNA and genomic clones
A cDNA clone encoding a pseudo-Ikaros gene (in-frame stop codon) was isolated from a Pacific hagfish (Eptatretus stouti) cDNA library (kind gift from M. Flajnik, University of Maryland School of Medicine) by cross-hybridization using a probe encoding rainbow trout Ikaros ZF14. The pseudo-Ikaros transcript was used as a template to generate a cDNA probe encompassing ZF14. This probe was used to screen a M. glutinosa peripheral blood leukocyte
ZAP Express (Stratagene, La Jolla, CA) cDNA library. Two identical full-length hagfish Ikaros-like (HIL) clones with uninterrupted open reading frames (ORFs) were isolated. HIL cDNA was used as a template to generate a probe by PCR recognizing the HIL ZFs homologous to the IFL ZF5 and 6. This probe was used to screen a
DASH II M. glutinosa genomic DNA library. Four overlapping
clones were identified, and Southern analyses of the clones with probes recognizing HIL 5' and 3' untranslated regions (UTRs) revealed that only a 3' portion of the gene (exons E
and E
in Fig. 2) was present. Subsequent efforts to clone the missing 5' genomic DNA from the library using a 5'UTR cDNA probe were unsuccessful. The intron-exon boundaries of the genomic DNA encoding ZF14 were subsequently determined by PCR. The position of the forward and reverse primers (see Fig. 2) was chosen based on the known position of intron-exon boundries in lamprey, trout, and mouse. PCR experiments were conducted using genomic DNA as template to generate amplicons both across these predicted boundries and within predicted exons. All amplicons were cloned and sequenced to verify that they corresponded to the expected products.
|
DASH II genomic library. Two overlapping clones were obtained that were found to encode an apparently complete IFL gene. Genomic DNA was then used as a template to produce a probe by PCR recognizing the O. dioica ZF homologous by sequence and position to vertebrate IFL ZF5. This probe was used to isolate a full-length O. dioica Ikaros-like (OIL) cDNA from a
ZAP Express day 3 library (8). In situ hybridization using OIL
A probe derived from the full-length OIL clone was labeled with digoxigenin-11-dUTP (Roche MB, Mannheim, Germany) by nick translation. Three-day-old animals were fixed with 4% paraformaldehyde in 0.1 M MOPS, pH 7.5, and 0.5 M NaCl for 1 h at room temperature, washed in 2x SSC containing 0.1% Tween 20 (2x SSCT), and then stored in methanol at -20°C overnight. The probe was resuspended in hybridization buffer (50% formamide, 2x SSCT, 25 mM sodium phosphate, pH 7.2, 10 mM EDTA, and 15% dextran sulfate) at a concentration of 5 ng/µl and applied to the animals. Samples and probe were then denatured at 90°C for 2 min and immediately cooled at 42°C. Probes were detected by overnight incubation with FITC-conjugated sheep anti-digoxigenin Fab (Roche MB). After washing, samples were counterstained with 1 µM TO-PRO-3 (Molecular Probes, Eugene, OR) and mounted in Vectashield mounting medium (Vector Laboratories, Burlingame, CA). Images were collected with a TCS SP laser-scanning confocal microscope (Leica, Deerfield, IL) equipped with Leica Confocal Software. When the probe was omitted from control samples otherwise subjected to the same procedure, no signal was detected over a low intensity diffuse background common to all samples. Also, a number of Fab Abs unrelated to that used in this study showed no nonspecific staining of any O. dioica structures when used under the same conditions.
GST fusion proteins and shift assay
cDNAs encoding the N-terminal ZF of OIL (ZF14) and HIL (ZF13) DNA binding regions were cloned into pGEX-4T (Pharmacia, Peapack, NJ) for the production of Ikaros-GST fusion products for EMSA analysis using a consensus Ikaros target sequence as the probe. Briefly, small scale cultures were induced with isopropyl
-D-thiogalactoside (1.5 h) in the presence of 50 mM ZnCl. Fusion proteins were then purified from the lysates using glutathione-Sepharose spin columns (Amersham, Arlington Heights, IL). All buffers postinduction contained 10 mM ZnCl. A double-stranded probe conforming to Ikaros binding site (IKBS4) (32) was end labeled with [
-32P]ATP. Approximately 25,000 cpm of labeled probe was incubated with the purified fusion proteins (
5 µg) at room temperature. Samples were loaded directly onto 10% PAGE gels containing glycerol, electrophoresed, dried, and exposed to film for 16 h. Specificity was determined by adding a 50 molar excess of cold IKBS4.
Dimerization assay
Flag-tagged and untagged murine, OIL, and HIL1 constructs were generated using the murIK I parental vector in conjunction with PCRSoeing, as described (25). Briefly, the murine dimerization ZF (DZF) domain was replaced by OIL and HIL1 aa 488562 and 384447, respectively. Human embryo kidney 293T cells were transfected and harvested 48 h later. The cytoplasmic fraction was used for all DZF assays. Coimmunoprecipitation and chemical cross-linking assays have been described (33).
Sequence analysis
DNA sequencing was performed using Applied Biosystems (Foster City, CA) cycle-sequencing chemistry. GCG software package (Genetics Computing Group, Madison, WI), BLAST (NCIB), and Pfam (version 7.7b) were used for sequence assembly and analysis. Amino acid sequences were aligned using Clustal X version 1.81 and visual inspection. From this alignment, a phylogenetic tree was constructed by the neighbor-joining method and bootstrapped 1000 times.
| Results and Discussion |
|---|
|
|
|---|
Two identical clones were isolated from an Atlantic hagfish PBL cDNA library that contained an apparently full-length sequence. The 4933-bp sequence included a 628-bp 5'UTR, an ORF of 1341 bp encoding 447 aa, and a 2964-bp 3'UTR. We have named this clone HIL1. BLASTX analysis of the nucleotide sequence showed a significant alignment of HIL1 with Ikaros family members from jawed vertebrates, especially Ikaros and Helios. Highest similarity (60%), however, was with a lamprey (Petromyzon marinus) Ikaros family member (e-102). The inferred protein from HIL1 was 41 and 29% identical with the lamprey and human Ikaros proteins, respectively. Pfam analyses suggested the presence of four C2H2 ZFs (Fig. 1; HIL1 ZF1, 2, 3, and 6) within the inferred HIL1 protein; however, manual inspection suggested a fifth C2H2 ZF to be present (Fig. 1; HIL1 ZF5). The alignment suggests that the five HIL1 ZFs are equivalent to mammalian Ikaros ZFs 1, 2, 3, 5, and 6. For DNA binding, Ikaros family members must possess at least three of the four N-terminal ZFs (18).
|
BLASTX analyses of a library of shotgun clones derived from O. dioica genomic DNA revealed ZF containing clones with high homology to vertebrate IFL proteins. This sequence was used to amplify a probe from an O. dioica cDNA library that showed a relatively high level of identity with vertebrate IFL transcripts, and was used to screen the O. dioica cDNA library. Only one apparently full-length cDNA clone encoding an O. dioica IFL protein (OIL) was obtained (Fig. 1). This 1890-bp transcript contained an 83-bp 5'UTR, an ORF of 1686 bp (562 aa), and a 228-bp 3'UTR. BLASTX scores showed highest homology to mouse and skate Helios (e-40) and lamprey Ikaros (e-38). Pfam analysis suggested OIL to encode six C2H2 ZFs (OILZF16). The OIL protein shared only 19% identity with the hagfish and P. marinus Ikaros. Alignment of the amino acid sequence of Ikaros family members from four diverse vertebrate species as well as the IFL proteins from agnathans and O. dioica showed that the length of each ZF was conserved (Fig. 1).
Finally, a tBLASTn search of the recently updated C. intestinalis (ascidian tunicate) database using OIL as bait revealed the presence of two Ciona IFL genes that showed a high degree of similarity to the OIL sequence (Ciona IFL1) and murine/human Pegasus (Ciona IFL2) (Fig. 1). The Ciona IFL2 gene encoded two N-terminal ZFs corresponding to the first and third ZFs for mammalian Pegasus (38% similarity for the three Pegasus-like sequences), which contains three N-terminal ZFs (34), while the Ciona IFL1 encoded four N-terminal ZFs. Ciona IFL1 is 39% similar to OIL, with almost perfect identity for ZF14. Both Ciona genes encode the two C-terminal ZFs that form the dimerization domain. Finally, it appears that ZF4 differs among the IFL members in that some members possess the standard C2H2 finger (i.e., murine-IK and Pegasus), while others most likely use C3H fingers (i.e., Oiko-IFL, murine-AI) for DNA recognition and binding.
Exon organization of hagfish and O. dioica IFL genes
The exon boundaries of mouse and lamprey Ikaros genes are shown in Fig. 2. Both of these genes contain seven exons (E17). Four identical recombinant
phage clones were isolated from the hagfish genomic library. Southern analysis of one of these clones (gHIL) using HIL1 5' and 3' UTR cDNA probes indicated that part of the 5' end of the gene was missing. Nucleotide sequencing revealed that gHIL began within the equivalent of mouse/lamprey intron 5. Two exons were observed in gHIL (Fig. 2); the first (E
) contains the equivalent of mouse/lamprey exon 6. The second exon (E
) is similar to mouse/lamprey exon 7 and encoded ZF5 and ZF6. The presence of exons E
-E
(mouse/lamprey exons 35) was confirmed by PCR.
Two overlapping clones were isolated from the O. dioica genomic library, and the exon organization of the O. dioica OIL gene is shown in Fig. 2. The gene contained nine exons (Ea-i). The positions of only two intron-exon boundaries were conserved between O. dioica and the mouse/lamprey Ikaros genes. These were the boundaries between Eb/c (mouse/lamprey E3/4) and Ec/d (mouse/lamprey E4/5). One other notable feature of the gene structure was the observation that ZF5 was split between exons Eh and Ei. The amino termini of both HIL1 and OIL are shorter than those of their mouse/lamprey Ikaros family counterparts. It is possible that the sequence equivalent to that contributed by mouse/lamprey exons E1 and E2 has been spliced out of HIL1. The 5' sequence of the OIL gene did not suggest the presence of an equivalent to mouse/lamprey exon E1.
Phylogenetic analyses
The amino termini of the agnathan and urochordate proteins appeared to be of variable length, so phylogenetic analysis was conducted on the amino acid sequences encompassing the first to sixth ZFs of vertebrate IFL members Ikaros, Helios, and Aiolos. The two agnathan IFLs clustered tightly, forming their own separate lineage just before the branch leading to rest of the Ikaros family members from vertebrates that possess an adaptive immune system (i.e., skates-humans). Although the branch length leading to the Pegasus IFLs is deep, it does indicate that two IFL genes are present within nonvertebrate deuterostomes and that the Pegasus clade itself is most likely the result of a separate, older duplication event that occurred before the emergence of protochordates. All of these observations were supported by high bootstrap values (Fig. 3). Finally, it should be noted that a second alignment was generated (data not shown), in which the second ZF of Ciona IFL2 was aligned with the ZF3 region instead of the ZF4 region. The topology of the NJ-tree resulting from the second alignment was nearly identical with that shown in Fig. 3.
|
In higher vertebrates, expression of Ikaros is confined to hemopoietic sites. To determine the tissues that express the gene encoding HIL1, RT-PCR was performed on total RNA extracted from a number of potential hemopoetic tissues. Fig. 4A shows that HIL1 was expressed in all tissues tested, with strongest expression in blood, gills, and intestine. Coincidentally, intestinal cells from lamprey with lymphocyte-like morphology were shown to express both Spi-B, a transcription factor involved in hemopoiesis, and IFL transcripts, lending support that the gut is an important site for lymphocyte-like cell development in agnathan fish (6, 14). RT-PCR also revealed the presence of two other transcripts that were present in all tissues and mirrored the expression of the predominant transcript. These bands were cloned and sequenced and revealed the isoform HIL2 and pseudo-IFL sequences that showed 80 and 90% amino acid identity with each other and the original E. stouti pseudo-transcript, respectively, which may represent additional IFL genes in hagfish, as suggested from the phylogenetic analysis. The psuedo-IFL transcripts possessed either in-frame stop codons or frame shifts (data not shown).
|
Expression of the OIL gene (Fig. 4B) was determined at different developmental stages of O. dioica. Very weak expression was observed in the oocyte. Strongest expression was in early tadpole (2 h posthatching) and at 4/5 days posthatching. As well as the major transcript, there was also a slightly larger transcript amplified, whose strength of expression seemed to mirror that of the OIL transcript. Cloning and sequencing of this fragment revealed that it contained exons Eh and Ei as well as the intervening intron. Whole mount in situ hybridization was then performed on 20 day 3 animals using the full-length OIL cDNA as the probe (Fig. 5) that showed that the anterior Fol cells of the oikoplastic epithelium were strongly positive (yellow). This site is mainly involved in the formation of food concentration filters in the O. dioica house. Interestingly, the Fol cells undergo a gene amplification event (polyploidization) that most likely assists in quick manufacturing of the house (36). In mammals, Ikaros has been shown to interact with nucleosome-remodeling deacetylase complexes that include the SNF2-related (sucrose nonfermenting) helicase-ATPase Mi-2 and histone deacetylases (reviewed in Ref.27). These points raise the possibility that OIL may be involved in chromatin-remodeling events during the development of the O. dioica filter house.
|
To examine whether OIL and HIL1 behave as true members of the Ikaros gene family, we examined the DNA-binding potential of OIL and HIL1-GST fusion proteins. Ikaros and its closely related family members each contain up to four N-terminal C2H2 ZFs. Ikaros proteins containing at least three N-terminal ZFs can recognize and bind the consensus Ikaros target site (GGGA) found in the promoter regions of immunologically relevant genes such as TdT, CD3, CD8, and
5 (37, 38, 39). GST fusion proteins containing the N-terminal ZFs in OIL and HIL1 were tested in gel shift mobility assays using a consensus (IKBS4) Ikaros target (32).
The results (Fig. 6, lane 2) indicate that the N-terminal ZFs from HIL1 were capable of weakly associating with the probe and that the interaction was specific (Fig. 6, lane 3). An interaction was not observed for the OIL GST fusion protein with the consensus target. The murine IK DNA binding region (data not shown) displays a stronger association with the IKBS4 probe, which is most likely due to specific amino acids found in murine Ikaros as compared with HILI. Thus, it appears that although the OIL N-terminal ZFs are similar to that of the true Ikaros clan, they do not possess the same overall binding specificity. This later finding is consistent with the fact that key residues found in the N-terminal ZFs involved in specific base recognition have diverged from the residues found in the vertebrate proteins. Exactly what role OIL plays in the development of O. dioica awaits further investigation.
|
Previous studies showed that the C-terminal ZFs of Ikaros represent a bona fide dimerization domain referred to as a DZF domain (33, 40). This domain (Fig. 7A) supports homodimerization as well as heterodimerization with the corresponding domains of the closely related family members Aiolos and Helios (22). However, the Ikaros DZF could not form heterodimers with the Drosophila Hunchback DZF domain, although this domain supported homodimerization. To determine whether the putative DZF domains from the hagfish HIL1 and O. dioica OIL proteins were capable of supporting homodimerization or heterodimerization with murine Ikaros, chemical cross-linking and coimmunoprecipitation assays were performed. For these experiments, the sequences encoding the C-terminal fingers from the HIL1 and OIL proteins were substituted for the corresponding sequences of murine Ikaros in the context of mammalian expression plasmids. These expression plasmids encoded small untagged or Flag epitope-tagged proteins after transfection into HEK 293T cells. For the cross-linking assay, extracts from the transfected cells were treated with the cross-linker dithiobis[succinimidyl suberate] for a limited time, followed by Western blot analysis using Abs against an N-terminal domain of Ikaros that is retained in all proteins. The results of this analysis revealed efficient homodimerization of a protein containing the murine Ikaros DZF, but no cross-linked homodimers or heterodimers containing the HIL1 or OIL DZF domains were observed (Fig. 7B, and data not shown). These domains were also unable to form detectable heterodimers with the Ikaros DZF (data not shown).
|
Concluding remarks
Gene inactivation experiments have clearly shown the central importance of Ikaros family members in the development of B and T lymphocyte and NK cell and dendritic cell lineages. Specifically, Ikaros has been implicated in the activation of the CD8
gene and potential silencing of the TdT and
5 genes (37, 38, 39) during lymphocyte development. In both fish and higher vertebrates, Ikaros expression coincides with the known sites for hemopoiesis (41, 42). Our present study indicates that Ikaros, which is an essential component for the regulatory network orchestrating the development and maintenance of the adaptive immune system, was in place before the emergence of jawed vertebrates and that this is an ancient gene family. This argument is based upon: 1) the existence of IFL genes in agnathan fish (presented in this study and in Refs. 11 and 29) that have similar gene structures to Ikaros; 2) high sequence conservation of the ZF domains; 3) expression of HIL1 in tissues housing lymphocyte-like cells in hagfish; 4) evidence for alternatively spliced isoforms in both lamprey and hagfish; 5) phylogenetic analysis and the presence of two protochordate IFLs; 6) ability to bind the Ikaros DNA binding sites; and 7) the capacity to homo- and heterodimerize through the DZF domain, all of which are characteristics of true Ikaros family members. Based upon our phylogenetic analysis and biochemical inspection, it appears that the OIL and Ciona IFL genes seem to have preserved some of the properties of the ancestral IFL gene. Thus, based upon their presence and lack of evidence for lymphocyte-like cells in protochordates, we suspect that the OIL and CIL gene products most likely have a role outside of any hemopoietic duty, although this awaits further analysis. Future studies will address the role that Ikaros plays in agnathan fish, which to date have been reported to possess lymphocyte-like cells, but lack true components of adaptive immunity, including major histocompatibility Ags, Igs, TCRs, recombination-activating genes, or immunological memory.
| Footnotes |
|---|
2 P.M.C. and J.D.H. contributed equally to this work and share joint first authorship. ![]()
3 Address correspondence and reprint requests to Dr. John Hansen, Center of Marine Biotechnology, 701 E. Pratt Street, Baltimore, MD 21202. E-mail address:hansenj{at}umbi.umd.edu ![]()
4 Abbreviations used in this paper: ZF, zinc finger; DZF, dimerization ZF; HIL, hagfish Ikaros-like; IFL, Ikaros family-like; OIL, O. dioica Ikaros-like; ORF, open reading frame; UTR, untranslated region. ![]()
Received for publication June 26, 2003. Accepted for publication September 26, 2003.
| References |
|---|
|
|
|---|
gene locus is regulated by the Ikaros family of proteins. Mol. Cell 10:1403.[Medline]
5 promoter silences transcription through a mechanism that does not require heterochromatin formation. EMBO J. 20:2812.[Medline]
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |