Comparative genomic analysis and evolution of the T cell receptor loci in the opossum Monodelphis domestica
© Parra et al; licensee BioMed Central Ltd. 2008
Received: 21 November 2007
Accepted: 29 February 2008
Published: 29 February 2008
All jawed-vertebrates have four T cell receptor (TCR) chains: alpha (TRA), beta (TRB), gamma (TRG) and delta (TRD). Marsupials appear unique by having an additional TCR: mu (TRM). The evolutionary origin of TRM and its relationship to other TCR remain obscure, and is confounded by previous results that support TRM being a hybrid between a TCR and immunoglobulin locus. The availability of the first marsupial genome sequence allows investigation of these evolutionary relationships.
The organization of the conventional TCR loci, encoding the TRA, TRB, TRG and TRD chains, in the opossum Monodelphis domestica are highly conserved with and of similar complexity to that of eutherians (placental mammals). There is a high degree of conserved synteny in the genomic regions encoding the conventional TCR across mammals and birds. In contrast the chromosomal region containing TRM is not well conserved across mammals. None of the conventional TCR loci contain variable region gene segments with homology to those found in TRM; rather TRM variable genes are most similar to that of immunoglobulin heavy chain genes.
Complete genomic analyses of the opossum TCR loci continue to support an origin of TRM as a hybrid between a TCR and immunoglobulin locus. None of the conventional TCR loci contain evidence that such a recombination event occurred, rather they demonstrate a high degree of stability across distantly related mammals. TRM, therefore, appears to be derived from receptor genes no longer extant in placental mammals. These analyses provide the first genomic scale structural detail of marsupial TCR genes, a lineage of mammals used as models of early development and human disease.
The hallmarks of the vertebrate adaptive immune system are antigen specific receptors, the T cell receptors (TCR) and immunoglobulins (Ig) encoded by genes that undergo somatic DNA recombination to generate diverse binding specificities. The TCR are expressed by thymus-derived lymphocytes (T cells) that play a major role in regulation and effector functions of immune responses. Each T cell expresses a unique TCR that binds a specific antigen resulting in the activation of an immune response [1, 2]. TCR are heterodimers comprised of either alpha (TRA) and beta (TRB) or gamma (TRG) and delta (TRD) combinations, respectively. These two combinations define the two major lineages of T cells: αβT cells and γδT cells [3, 2]. αβT cells typically recognize peptide antigens presented on major histocompatibility complex (MHC) encoded molecules. In contrast γδT cells have been shown to be either MHC restricted or in some cases, similar to Ig, able to bind free antigen . Ig are expressed by antibody forming cells (B cells), which produce both a membrane bound form of Ig that comprises the B cell receptor (BCR) and a soluble form that is free antibody. Like TCR, Ig are made up of two different chain types, a heavy (IgH) and light (IgL) chain. TCR and Ig chains both contain variable domains that bind the antigen and membrane-proximal constant (C) domains. It is the variable domains that are encoded by gene segments that undergo somatic recombination to generate diversity in binding specificity. The gene segments encoding the variable domains of TRA, TRG and IgL chains are the variable (V) and joining (J) gene segments, while the variable domains of TRB, TRD and IgH chains are encoded by exons assembled from V, diversity (D) and J gene segments . Recombination of these gene segments takes place in the thymus for developing T cells and adult bone-marrow for developing B cells .
Of the immunoglobulin superfamily (IgSF) members, the TCR and Ig are each other's nearest relatives, however there are dissimilarities to their genetic structure and evolutionary history [6, 7]. For example all jawed-vertebrates appear to contain the same four homologous TCR isotypes: TRA, TRB, TRG, and TRD . In contrast there is variability in the number and class of Ig isotypes in different vertebrate lineages [9, 10]. In addition the organization of TCR loci appears to be more conserved than Ig. For example, in cartilaginous fish most Ig loci are organized as multiple, unlinked clusters of [V-(D)-J-C], limiting the combinatorial usage of their gene segments . Whereas, in bony fish and tetrapods the majority of Ig loci are organized in the translocon style of Vn-(D)n-Jn-Cn . TCR loci tend to be organized in the translocon style in all lineages. These differences between TCR and Ig genes are likely the result of dissimilar selection forces on the two different antigen receptor systems and have made determining the evolutionary relationship of the TCR and Ig chains to each other unclear .
The relationship between Ig and TCR is further muddled by the recent discoveries in marsupials and sharks of TCR loci that appear to be hybrids between ancestral Ig and TCR loci [13, 14]. In marsupials, a mammalian lineage that diverged from eutherians (placental mammals) 186 to 193 million years ago (MYA), a new fifth TCR chain, named TCR mu (TRM) has been identified [14, 15]. Unlike the conventional TCR, TRM has a tandem cluster organization and its origins appear to have involved a recombination between and ancestral TCR locus, most likely a TRD and IgH. TRM appears analogous to an unusual shark TCR called NAR-TCR, which utilizes the C regions of TRD and upstream Ig-like V regions . Both TRM and NAR-TCR are expressed in an atypical TCR isoform that contains double variable domains. Marsupial TRM and shark NAR-TCR, however, are not orthologous but rather the product of convergent evolution generating common features . Nonetheless the presence of TCR with these features in both marsupials and cartilaginous fish, make it likely analogous TCR will be found in other vertebrate lineages and illustrate a level of plasticity in TCR evolution heretofore unrealized.
The availability of the first completely sequenced marsupial genome provides the opportunity to investigate the evolutionary origins of TRM and its relationship to the conventional TCR in mammals . Towards this aim we have determined the complete genomic organization, content and evolution of the loci encoding both the conventional TCRs (TRA, TRB, TRG and TRD) and the recently discovered TRM locus in the opossum Monodelphis domestica. In addition, these analyses provide a level of detail for the TCR genes of a marsupial that has only been available for a limited number of eutherians such as human and mouse.
Results and Discussion
Number of TRV, TRD, TRJ and TRC genes found in human, mouse and opossum. The numbers of functional genes are indicated in parenthesis.
To determine which of the 74 V gene segments are also used as TRA/DV we performed RT-PCR on 23 day old thymus RNA using combinations of primer pairs specific for each of the V segment subgroups (see below) paired with either TRA or TRD C regions (TRAC and TRDC respectively) (not shown). This age was chosen as an early age where the thymus is fully mature . Nineteen of the TRAV segments and all four of the functional TRDV segments were found expressed with both TRAC and TRDC resulting in a total of 23 apparent TRA/DV segments (Figure 1 and Table 1). The number of opossum TRA/DV may be an underestimate since it is possible some TRAV may be rarely expressed with TRDC or may appear at different times during development than was examined. Alternatively this number could be an overestimate as well, since it is possible that some combinations are negatively selected in the thymus and not used in the periphery. Either way this appears to be a comparatively high number of TRA/DV segments in the opossum relative to human and mouse (Table 1).
V gene segments evolve by gene duplication and deletion resulting in degrees of relatedness amongst segments . These are defined as subgroups with the segments belonging to the same subgroup by having 80% or greater nucleotide identity. By this criterion the current 68 TRAV segments can be placed into 41 subgroups, where the nucleotide identity between subgroups ranges from 32.1 to 79.5%. The six TRDV gene segments were sufficiently different, with nucleotide identity ranging from 33.6 to 61.9%, that each belonged to its own distinct subgroup. Phylogenetic analyses using TRAV and TRDV segments were performed to elucidate their evolutionary relatedness. Opossum sequences were compared with sequences from human, mouse, rabbit, cow, sheep, and chicken using the same dataset as used by Su et al.  to define the major phylogenetic groups (Figure 2). Eight groups of V gene segments (designated A through H) with bootstrap values greater than 89% emerged from the inclusion of the marsupial sequences (Figure 2). All eight contain opossum sequences (Figure 2). Four of these, groups A through D, were defined previously and only included TRAV or TRAV/D . However, the addition of opossum sequences revealed four new V groups (E through H) not previously recognized or requiring reevaluation. Group E contains TRAV, TRDV and TRA/DV sequences; groups F and G only TRDV sequences; and group H TRAV and TRA/DV sequences. The addition of the opossum sequences to this analysis substantiate the statement that all these subgroups were present in the common ancestor of amniotes and that some species have lost segments that belong to different subgroups .
These analyses also allow us to evaluate the evolution of mammalian V segments that can be utilized with either TRA or TRD chains over a larger time-span than has been available previously. TRA/DV sequences clearly belong to different groups rather than forming a monophyletic cluster (Figure 2) and, as observed for human and mouse, TRA/DV regions are dispersed throughout the opossum TRA/D locus, although most are located towards the 3' end of the locus (Figure 1). Considering that αβ and γδT cells recognize potentially very different antigens these results continue to support the high level of plasticity in V gene utilization at this locus.
The number and complexity of D and J gene segments in the opossum TRA/D locus is also comparable to that of human and mouse (Figure 1, Table 1). Encoding TRD chains are at least two D and six J gene segments (TRDD and TRDJ respectively), all of which are located upstream of a single TRDC. There are at least 53 TRAJ segments located between the TRDC and TRAC all of which appear to be functional by the criteria defined above for the V gene segments (Figure 1, Table 1).
There are 36 opossum TRBV segments (Figure 4, Table 1) and these can be grouped in 28 subgroups based on nucleotide identity with the value between subgroups ranging from 38.5 to 68.5%. Compared with other TCR, the TRB locus in eutherians appears to contain a higher number of V pseudogenes, where 19% and 34% of TRBV are pseudogenes in human and mouse, respectively. This pattern appears to hold for the opossum since nine of the 36 TRBV segments (25%) appear to be pseudogenes (Table 1).
Phylogenetic analyses of the TRBV regions from different mammals and chicken reveal that the opossum TRBV regions are very diverse with six groups (A to F) identified (Figure 5). Human, mouse and opossum TRBV sequences are found in all six groups, supporting their presence before the divergence of marsupial and eutherian mammals (Figure 5) . Three sequences chickenB1S1, mouseB2 and opossumB20 did not cluster with any other sequence, nor with each other, and therefore not included within any of the groups.
As in human and mouse, the opossum TRB D, J, and C genes are organized in tandem cassettes. The human and mouse TRB locus contains two of such D-J-C cassettes while the opossum has four (Figures 3C–3F and 4). In the opossum, each cassette contains a single TRBD, four or five TRBJ, and a single TRBC. All four cassettes appear to be functional and TRBJ gene segments from each have been found in TRB transcripts (data not shown).
The four opossum TRBC regions are very similar to each other at the nucleotide level and in intron – exon organization. Each TRBC region is encoded by four exons (Figure 3C–F). Exon 1 encodes the immunoglobulin domain which contains two conserved cysteines residues important for the intra chain disulfide bond formation. There are three potential N-glycosylation sites in exon 1. Exon 1 of TRBC1, TRBC2 and TRBC3 have 100% nucleotide identity and TRBC4 differs by only a single non-synonymous nucleotide substitution (K10E) that encodes a lysine (AAA) instead of the glutamate (GAA). Exon 2 encodes the Cp and it contains a conserved cysteine residue used for the inter chain disulfide bond (Figure 3C–F). The Cp from the four opossum TRB share more than 82.6% nucleotide identity. Exon 3 encodes the Tm region and contains a lysine residue involved in the interaction with the CD3 complex. The Tm regions are much conserved among the four opossum TRBC, sharing greater than 85.1% nucleotide identity. Exon 4 encodes the cytoplasmic region and it also includes 3' untranslated region.
Due to the high degree of sequence similarity among the four cassettes described above it is difficult to fully reconstruct the duplication events that led to the current arrangement in the opossum. However, cassettes 2 and 3 share two characteristics indicating they are derived from a relatively recent tandem duplication. First of all, both cassettes are nearly identical in nucleotide sequence over a 12.8 kb region that extends from 4.8 kb upstream of a non-functional copy of cyclin A2 including the gene segments TRBD, TRBJ to 100 bp downstream of TRBC (Additional file 2). Secondly both cassettes 2 and 3 have a cyclin A2 gene 5' of the D-J-C gene segments, which is not present in the other two cassettes (Figure 4). Cyclin A2 is also associated with the cow TRB locus but is not in human, mouse or chicken. This is consistent with the cyclin A2 gene being inserted near the TRB D-J-C cassettes prior to the divergence of marsupials and eutherians, with subsequent loss in some eutherian species such as human and mouse.
Previously, phylogenetic analyses of the mammalian and avian TRGV segments revealed the presence of eight ancient groups . Addition of the opossum TRGV sequences to these analyses revealed two additional groups, group I and J. Group I contains human, mouse and opossum TRGV4. Support for this group is low (Figure 7), however it is likely that these three sequences from opossum, human and mouse are derived from the same ancestral gene since they also have similar RSS sequences (data not shown). The new group J contains only the members of the opossum TRGV1 subgroup (Figure 7), which is related to the previously defined group F that contains sequences from human, rabbit, sheep and cow . This relationship is not well supported however, and two groups may not have arisen from a common ancestral gene segment (Figure 7). Opossum TRGV3 segments group in a previously defined group A. The single member of opossum TRGV2 subgroup does not clearly cluster with any existing group (Figure 7).
There is only a single opossum TRGC region (Figure 3G and 6) in contrast to four in mouse (three of which are functional) and two in human (both functional) (Table 1). The opossum TRGC is encoded by three exons. Exon 1 encodes the immunoglobulin domain and contains a single N-glycosylation site. An unusual characteristic described previously in marsupials was the absence of the second cysteine residue required for the formation of intrachain disulfide bond in the TRGC region . Exon 2 encodes the Cp and exon 3 encodes the Tm, Ct and 3'UTR regions (Figure 3G).
Previously we reported that the TRM V gene segments appeared to be more similar to Ig V gene segments than that of TCR . This conclusion was drawn from an analysis of a limited number of available marsupial TCR V gene segments at that time. Availability of the complete TCR genomic sequences described above allows us to further test this observation. TRM is organized in tandem clusters where complete clusters contain two classes of V segments: a single non-rearranged V gene segment (TRMV) and an unusual V gene segment which is already joined to D and J genes (TRMVj) in the germline DNA . The opossum has six such complete clusters. The remaining two clusters are partial, lacking the TRMV and TRMD gene segments (Figure 8) .
To investigate the evolutionary history of the individual TRM clusters we compared the sequence and organization of the six complete clusters (clusters 1, 2, 3, 4, 5, and 7) and two partial clusters (clusters 6 and 8). These analyses revealed three classes of clusters based on gene content and nucleotide sequence identity. These three classes likely represent the most ancient duplications of TRM clusters. The lineages generated by these older duplications are represented by clusters 1, 2 and 3, respectively. Clusters 5 and 7 have similar gene content (three TRMD segments) and share greater nucleotide sequence identity to cluster 3 and represent more recent whole cluster duplications that would have followed the duplication of an additional D segment in this lineage . Clusters 4, 6, and 8 contain two TRMD segments and share greatest nucleotide sequence identity to cluster 2, representing another round of more recent duplications. Cluster 1 is a third class unto itself based on not sharing significant sequence similarity to the others . These results are consistent with the current complement of TRM clusters in opossum being the result of whole cluster duplications followed by divergence of each cluster. These duplication events have resulted in partial clusters cases and different numbers of clusters in different marsupial species; bandicoots for example appear to have only two TRM clusters [14, 29].
In addition to the six functional TRMV gene segments located within the TRM locus, there is a single TRMV orphon gene (TRMV-OR2) located on opossum chromosome 2 in a region containing a number of flanking sequences resembling long interspersed repeat elements (LINE)(data not shown). TRM is the only of the TCR loci in the opossum for which orphon V gene segments have been found. TRMV-OR2 appears to be non-functional and has no leader peptide, but it does contain a complete V gene segment and the recombination signal sequence (RSS). TRMV-OR2 is most similar to TRMV2 and TRMV4, sharing 96 and 95% nucleotide identity, respectively. The high degree of identity between these functional gene segments and the orphon is consistent with the latter being the result of a relatively recent duplication event that, based on the flanking LINE elements may have been due to transposition . TRMV-OR2 provides additional evidence for the role of retroelements in both gene translocation in marsupials. A similar translocation was reported previously for opossum MHC class I genes, where two class I loci (UB and UC) had been translocated outside the MHC region. UB and UC are similarly tightly flanked by retroelements [40, 41]. Furthermore, TRMV-OR2 may also provide an additional connection between retroelements and the evolution of TCR genes themselves in marsupials. TRMVj is a variable region gene segment that appears to have been generated by retro-transposition since it is lacking an intron that all V gene segments have and is already joined to the D and J gene segments in the germline . Although highly speculative, it is possible that translocated orphons such as TRMV-OR2 may have contributed to the activation of local retroelement genes to allow their co-expression when the TRM locus is actively transcribed. This would have been a necessary step in the retro-translocation events that generated TRMVj.
Is TRM present in lineages other than marsupials?
The availability of a large number of vertebrate genome sequences, with varying degrees of depth of sequence coverage, provided the opportunity to search for TRM or TRM-like genes in species other than marsupials. A search of the available genomes using the BLAST algorithm and opossum TRM sequences was unable to identify a homologue in any of the eutherian species available. This search included human, mouse, rabbit, dog, cat, cow, horse, hedgehog, elephant and armadillo. The rabbit, cat, hedgehog, elephant and armadillo are low coverage genome sequences ranging from 1.86× to 2×. While it is possible that in any given species the TRM locus was missed in the sequencing, it is unlikely that such a random gap would have been consistently present in all the eutherian genomes. Therefore we conclude that TRM is not likely present in any eutherian lineage. We also searched the current chicken and the anole lizard (Anolis carolinensis) genomes in a similar manner and were unable to detect clear TRM homologues. In the case of chicken and all the eutherian genomes, the TRD homologues were identifiable indicating that our search strategies are able to pick up this conventional TCR locus. Furthermore, we were able to identify a clear TRM homologue in the recently completed platypus genome sequence (Ornithorhynchus_anatinus-5.0) available at GenBank. Further characterization of the platypus TRM locus is ongoing and beyond the scope of this paper. Nonetheless identification of these genes in a monotreme, which are separated from marsupials and eutherians by 217 to 231 MY , further supports that our search strategies should be able to identify TRM homologues in other species. These results are also consistent with TRM being present early in the evolution of mammals and therefore likely lost in the eutherian lineage.
First and foremost, we have described in detail the genomic content and complexity of the T cell receptor loci for the opossum Monodelphis domestica, the first such analysis available for a marsupial. The opossum is arguably the most extensively studied marsupial species and is used as a model of human disease and development. The opossum, for example, is one of the few mammalian model organisms that develop melanoma following exposure to ultraviolet radiation providing a cancer model . Additionally, opossums are a natural host and a reservoir of the causative agent of Chagas disease, Trypanosoma cruzi . Like all marsupials, opossums give birth to highly altricial young also providing a model for early immune as well as other anatomical system development. Further characterization of the immune system in the opossum, and T cell immunobiology in particular, is important for better understanding of these disease and developmental models. Complete characterization of the TCR genomics in this species is one step in that direction.
The opossum whole genome assembly MonDom5 was used in this study and it is available at GenBank under the accession number AAFR00000000. The location for all TCR gene segments in MonDom5 is provided in Additional file 3. For comparative purposes the current genome assemblies from human (NCBI 36), mouse (NCBI m37), cow (Btau_3.1) and chicken (WASHUC2), rabbit (RABBIT), dog (CanFam2.0), cat (CAT), horse (preEnsembl EquCab2), hedgehog (eriEur1), elephant (BROAD E1), armadillo (ARMA) and the anole lizard (AnoCar1.0) were searched for any evidence of TRM. Genomes were analyzed using BLAST assembled genomes tools [23, 35].
Opossum lymphoid tissues were collected and immediately processed to extract the RNA or stored in RNAlater® (Ambion, Austin, TX) at 4°C for 24 hours and stored at -80°C for use later. Whole RNA extraction was performed using the Trizol RNA extraction protocol (Invitrogen, Carlsbad, CA). Tissue was homogenized in 1 ml of Trizol® Reagent per 100 mg until the tissue was completely dispersed. Phase separation was done using 200 μl of chloroform per 1 ml of Trizol. RNA was precipitated with 500 μl of isopropanol per 1 ml of trizol, washed with 70% ethanol and resuspended in 50 to 100 μl of DEPC water. DNase treatment to remove contaminating DNA has been performed using Ambion's kit TURBO DNA-free (Ambion, Austin, TX). Each sample was quantified using the NanoDrop ND-1000 Spectrophotometer (NanoDrop Technologies, Wilmington, DE).
Reverse transcription, PCR and sequencing
Reverse transcription-polymerase chain reactions (RT-PCR) were performed using GeneAmp RNA PCR Core Kit (Applied Biosystems, Foster City, CA). Amplifications of cDNAs were performed using AdvantageTM-HF 2 PCR (BD Biosciences, CLONTECH Laboratories, Palo Alto, California) with the conditions: 94°C for 1 minute, denaturation at 94°C for 30 seconds, annealing/extension according to the melting temperature of the primers, and a final extension period of 68°C for 5 minutes.
PCR products were cloned using TOPO TA Cloning® Kit for sequencing (Invitrogen, Carlsbad, CA). Plasmids were sequenced using BigDye Terminator Cycle Sequencing Kit v3 (Applied Biosystems, Foster City, CA) in 10 μl reactions and analyzed on an ABI Prism 3100 DNA automated sequencer (PerkinElmer Life And Analytical Sciences Inc, Wellesley, MA). Analyses of chromatograms were done using the SequencherTM 4.6 program (Gene Codes Corporation, Ann Arbor, MI).
Identification of TRV, TRD, TRJ, TRC genes
To determine the location, content and organization of the TCR genes, the whole opossum genome was searched using the BLAST algorithm. TRA, TRB, TRG, TRD and TRM were located using sequences previously isolated [28, 29, 38, 14]. The V and J segments were located by similarity to corresponding segments from other species and by identifying the flanking conserved RSS.
Rapid amplification of 5' complementary DNA ends (5' RACE) performed on opossum thymus mRNA was used to identify novel, expressed V, D and J segments. In addition to using 5' RACE, PCR using primers that are specific for each V family were also used in RT-PCR to amplify cDNA containing VDJ recombinations that are underrepresented in the RACE PCR. Primers used to amplify the conventional TCR are complementary to the most conserved sequences of the V regions and have been paired with primers located on the C regions (Additional file 4). Sequences obtained by these means were compared with the whole opossum genome using the BLAST algorithm to identify novel TCR gene segments. This approach allowed the identification of gene segments in six possible reading frames, which also help to find gene segments located in the reverse orientation and D segments that may be used in multiple reading frames. The exon-intron organization of V regions was determined using sequences obtained by 5' RACE.
Determination of the exon-intron organization in the C regions was done using BLAST to compare available cDNA sequences that encode complete TCR chains to the opossum genomic sequences. Additionally, transcripts encoding the C terminal end of TCR were obtained by 3' RACE performed on thymus cDNA. Clones obtained by 3' RACE PCR were used to identify the CP, TM and CT and 3' untranslated regions (UTR) for each one of the TCR. cDNA made using the Oligo dT primer was used to perform 3' RACE PCR. The Oligo dT primer supplies a priming site for the GeneRacerTM 3' PCR primers. Sequences obtained by 3' RACE were aligned with the germline sequences to determine the location, intron-exon boundaries and splice sites of these exons.
Opossum gene segments were named following the IMGT nomenclature established for human and mouse . TRV segments were numbered according to their location, from the 5' to 3' end of the locus. TRAJ segments were numbered from 3' to 5' according to a nomenclature proposed by Koop et al.  and followed by IMGT. The TRBD, TRBJ and TRBC found in four cassettes in the opossum are numbered according to the cassette to which they belong and the position of the cassette from 5' to 3' in the locus.
Nucleotide sequences that encode from FR1 through FR3 of the V regions identified from each one of the opossum TCR loci were compared to TCR V sequences from other species retrieved from GenBank. The sequences were aligned using ClustalX  and BioEdit  programs. Phylogenetic analyses were performed using MEGA version 3.0  for the distance methods (neighbor joining and minimum evolution). Confidence values were obtained from bootstrap analyses using 1000 replications.
The accession numbers for sequences used in the phylogenetic analyses are:
Human: A8.4: D13077; A3: X57534; A16: DQ097913; A9.2: X57531; A40: DQ097942; A10: DQ097904; A27: DQ341447; A25: DQ097923; A35: DQ097935; A22: DQ097920; A20: X70305; A34: DQ097934; A30: X58768; A41: DQ097943; A36/DV7: X61070; A17: DQ097914; A6: X58747; A24: M17661; A21: U50404; A23/DV6: D13071; A29/DV5: M17664; A1.1: X04939; A26.1: L06886; A4: M17663; A38-2/DV8: D13074; A14: S51029; D3: M23326; D1: AY357942; D2: S24406; A19: Z46641; A2: DQ097917. Mouse: A9: M33586; A17: X60319; A12.3: M38680; A6: M34200; A11: DQ340292; A13: M38102; A4: L47342; A7: X56719; A14.3: L77149; A5.4: M38681; A10: X57397; A3.1: X02967; A1: M22604; A2: X03760; A21/DV12: M94080; A15-2/DV6-2: M37599; D5: X12729; D4: M23545; TRADV16D: M16118; D2.2: M37280. Rabbit: D4: 38120; 885: M12885; D1: D26555; D5: D38121. Sheep: A622: M55622; D1S1: Z12989; D6: AJ005908; D7: AJ809501; D4: AJ005906; D2: AJ005904; D5: Z12995; D3S2: Z12996; A35: U78035. Cow: 014: D90014; 013: D90013; 015: D90015; 011: D90011; 012: D90012; 017: D90017; D113: D16113; 016: D90016; D116: D16116. Chicken: 611: GDU04611; 612: GDU04612; 613: GDU04613.
Human: B11.3: X58797; B7.2: U07975; B12.1: X07224; B14: X06154; B17: U03115; B2: M64351; B27: U66061; B25.1: L27610; B13: U03115; B4.3: X58812; B21.1: L27608; B23: L27614; B5: X61439; B9: M27380; B3.1: U07977; B15: U03115; B24.1: L27612; B10.3: U17047; B28: U08314; B6.9: X61447; B19: U48260; B30: L06893; B20.1: X72719; B29.1: M13847. Mouse: B14: AE000664; B16: L29434; B3: AE000663; B21: X16691; B26: K02548; B23: X59150; B24: M61184; B12: M30881; B2: AE000663; B4: X56725; B17: AE000664; B29: X00696; B10: X16694; B13.3: M15616; B19: AJ249821; B31: X03277; B30: X16695; B20: M11859; B1: X01642. Rabbit: B8: BAA04245; B11: BAA04248; B9: BAA04246; B7S1: BAA04241; B1: AAA31472; B5: BAA04239; B10: BAA04247; B2: M13895; B6: BAA04240. Sheep: B6S1: AAB88431; B8S1: AAB88433; B10S1: AAB88434; B1S5: AAB88425; B7S1: AAB88432; B22S1: AAB88440; B3S1: AAB88427; B13S1: PQ0068; B15S1: AAB88438; B12S1: AAB88435; B17S1: AAB88439; B2S1: AAB88426; B4S1: AAB88430. Cow: B126: PQ0062; B129: PQ0065; B125: PQ0061; B122: JQ0473; B123: PQ0060; B124: PQ0059. Chicken: B1S1: B36198; B2S2: AAA62753.
Human: G2: M13429; G3: S60779; G4: S60780; G2: M27335. Mouse: G3: AF037352; G6: M13338; G4: M13336; G7: AF037352; G1: Z22847. Rabbit: G1S2: D38137; G1S3: D38138; G1S1: D38135; G1S4: D38139; G2S1: D38142. Sheep: G6S1: Z13007; G2S1: Z12999; G2S2: Z13000; G2S3: Z13001; G2S4: Z13002; G5S1: Z13005; G5S2: Z13006; G1S1: Z12998; G3S1: Z13003; G4S1: Z13004. Cow: G6: AY560834; G1S1: D16119; G1S3: D16131; G5S11: D16126; G5S7: D16130; G5S6: D16129; G5S16: D16133; G3S1: U73186; G3S2: U73187; G4S1: U73188. Platypus: V3.1: AAY82120; V3.2: AAY82094; V1.1: AAY82100; V1.2: AAY82093; V1.3: AAY82109; V2.2: AAY82110; V2.3: AAY82114; V2.1: AAY82119. Chicken: G3S8: U78235; G3S4: U78231; G3S3: U78230; G2S7: U78225; G2S8: U78226; G2S9: U78227; G1S4: U78212; G1S5: U78213; G1S3: U78210; G1S8: U78216.
IgH: Hsheep80: U80145; Hmouse21: M27021; Hhuman47: X64147; Maxolot43: L20243; Hmouse69: K01569; Hhuman14: X05714; Mxenopus84: M20484; Hmouse70: M21470; Hhuman00: DQ454900; HHlama75: AY544575; HHlama33: AY342133; HHlama32: AJ629032; HHlama86: AF441486; Hpig94: U15194; Hhuman62: U80162; Hopossum13: AF012113; Hopossum24: AF012124; Hhuman86: M99686; Hmouse65: M25465; Hmouse85: M31285; Hmouse01: X03301; Mchicken48: M30348; Mzebrafish80: AF281480; Mcatfish60: DQ230560; Mtrout03: DQ831803; Mrockcod55: AF303555; Hwolffish86: AY188786; Hsturgeon61: DQ257661; Wnurseshark50: U51450; Wlungfish27: AF437727; Mnurseshark51: M92851; Hhornshark49: X13449; Hsandbarshark57: AY548357. NAR-TCR: TnarsharK88: DQ022688; Tnarshark10: DQ022710; Tnarshark06: DQ022706. NAR: Nwobbegong92: AF336092; Nwobbegong94: AF336094; Nnurseshar03: AF447103; Nnurseshar81: AY114781. IgL: Khuman92: Y08392; Khuman98: DQ915098; Ksheep10: X54110; Krabbit53: AF211353; Krat09: U39609; Khorse11: X75611; Kopossum08: AY074408; Lnurseshark44: GCU15144; Khamster65: U17165; Khuman98: AY320598; Lmouse88: DQ986488; Lray46: AB062646; Lseabream69: EF555069; Lsturgeon42: AJ387842; Lsalmon17: AF273017; Lseabass79: AJ291779; Ltrout60: X65260; Lhuman77: AF216777; Lopossum81: AF049781; Lchicken67: S65967; Lxenopus78: L76578; Lhuman69: Z73669. TRG: TGhuman29: M13429; TGplatypus21: DQ011321; TAmouse98: M34198; TArabbit85: M12885; TGhuman79: S60779; TGcow86: U73186; TGrabbit42: D38142; TGcow88: U73188. TRA/D: TAsalmon97: EF467297; TAsalmon05: EF466505; TAfugu52: AY198352; TAselfish71: AY198371; TAselfish49: AY198349; TAcod44: AJ133844; TAfugu95: AB222395; TDhalibut71: AB076071; TDfugu79: AB222479; TAzebrafish50: AL592550; TADhuman96: Z14996; TAsheep35: U78035; TDsheep03: AJ005903. TRB: TBtrout23: OSU18123; TBseabream37: AM490437; TBfugu32: AB222432; TBselfish13: AF324813; TBopossum8.1: XM_001363148; TBmouse82: DQ983582; TBsheep11: AF030011; TBrabbit19: D17419; TBhuman04: M27904; TBsheep17: AF030017; TBcow29: D90129; TBmouse16: M15616.
Gray short-tailed opossum: IGHV1-1: AAC48826; IGHV1-6: AAC48820; IGHV1-11: AAC48816; IGHV1-16: AAC48836; IGHV1-23: AAC48846; IGHV2-1: AAC48849. Remaining germline sequences are available at AAFR00000000. Brush-tail possum: 70: AAL87470; 71: AAL87471; 72: AAL87472; 73: AAL87473; 74: AAL87474; 76: AAL87476; 77: AAL87477; 78: AAL87478; 79: AAL87479; 91: AAD41691; 92: AAD41692; 41: AAT40441; 40: AAT40440; 44: AAT40444; 43: AAT40443; 42: AAT40442. Virginia opossum: P83 is an unpublished sequence kindly provided by Dr. R. Riblet. Tammar Wallaby: sequences are unpublished but available upon request. Northern Brown Bandicoot: Bandicoot58: AY586158. Shark NTCR: AAY98815.
Dot plot analyses
Comparisons of the genomic sequence were performed using the program Spin from the Staden Package . Dot matrix plots were generated to determine degrees of similarity among cassettes for the TRB locus. The sequence analyzed includes the four TRB cassettes with coding and non-coding sequence from 3kb upstream of the most 5' TRBD (TRBD1) segment to the most 3' TRBC (TRBC4) segment, which comprises 51 kb. Although not shown, similar comparisons were performed for the other TCR loci.
T cell receptor. TRA: T cell receptor alpha. TRB: T cell receptor beta. TRG: T cell receptor gamma. TRD: T cell receptor delta. TRM: T cell receptor mu. Ig: immunoglobulin. V: variable gene segment. D: Diversity gene segment. J: Joining gene segment. C: constant region. Tm: transmembrane region. Cp: connecting peptide. Ct: cytoplasmic region. RSS: recombination signal sequence. MHC: major histocompatibility complex.
This work was supported by funding from the National Institutes of Health Initiative for Maximizing Student Diversity (ZEP, JT) and Institutional Development Award (IDeA) programs (MLB, JH), the Robert C. McNair program (AML) and the National Science Foundation (RDM).
- Kronenberg M, Siu G, Hood LE, Shastri N: The molecular genetics of the T cell antigen receptor and T cell antigen recognition. Ann Rev Immunol. 1986, 4: 529-41. 10.1146/annurev.iy.04.040186.002525.View Article
- Clevers H, Alarcon B, Wileman T, Terhorst C: The T Cell Receptor/CD3 Complex: A Dynamic Protein Ensemble. Ann Rev Immunol. 1988, 6: 629-662. 10.1146/annurev.iy.06.040188.003213.View Article
- Brenner MB, McLean J, Dialynas DP, Strominger JL, Smith JA, Owen FL, Seidman JG, Rosen SF, Krangel MS: Identification of a putative second T-cell receptor. Nature. 1986, 322: 45-9. 10.1038/322145a0.View Article
- Hayday AC: γδ cells: a right time and a right place for a conserved third way of protection. Ann Rev Immunol. 2000, 18: 975-1026. 10.1146/annurev.immunol.18.1.975.View Article
- Davis MM, Chein YH: Fundamental Immunology. 2003, Paul WE Lippincott Williams and Wilkins. Philadelphia, PA, 227-258. 5
- Marchalonis JJ, Schluter SF: Evolution of variable and constant domains and joining segments of rearranging immunoglobulins. FASEB J. 1989, 3: 2469-2479.PubMed
- Marchalonis JJ, Bernstein RM, Shen SX, Schluter SF: Emergence of the immunoglobulin family: conservation in protein sequence and plasticity in gene organization. Glycobiology. 1996, 6: 657-663. 10.1093/glycob/6.7.657.PubMedView Article
- Rast JP, Anderson MK, Strong SJ, Luer C, Litman RT, Litman GW: α, β, γ and δT cell antigen receptor genes arose early in vertebrate phylogeny. Immunity. 1997, 6: 1-11. 10.1016/S1074-7613(00)80237-X.PubMedView Article
- Flajnik MF: The last flag unfurled? A new immunoglobulin isotype in fish expressed in early development. Nat Immunol. 2005, 6: 229-230. 10.1038/ni0305-229.PubMedView Article
- Hsu E, Pulhman N, Rumfelt LL, Flajnik MF: The plasticity of immunoglobulin gene systems in evolution. Immunol Rev. 2006, 210: 8-26. 10.1111/j.0105-2896.2006.00366.x.PubMedPubMed CentralView Article
- Flajnik MF: Comparative analyses of immunoglobulin genes: surprises and portents. Nat Rev Immunol. 2002, 2: 688-698. 10.1038/nri889.PubMedView Article
- Richards MH, Nelson JL: The evolution of vertebrate antigen receptors: a phylogenetic approach. Mol Biol Evol. 2000, 17: 146-155.PubMedView Article
- Criscitiello MF, Saltis M, Flajnik MF: An evolutionarily mobile antigen receptor variable region gene: Doubly rearranging NAR-TcR genes in sharks. Proc Natl Acad Sci USA. 2006, 103: 5036-5041. 10.1073/pnas.0507074103.PubMedPubMed CentralView Article
- Parra ZE, Baker ML, Schwarz R, Deakin J, Lindblad-Toh K, Miller RD: A unique T cell receptor discovered in marsupials. Proc Natl Acad Sci USA. 2007, 104: 9776-9781. 10.1073/pnas.0609106104.PubMedPubMed CentralView Article
- Rheede TV, Bastiaans T, Boone DN, Hedges SB, De-Jong WW, Madsen O: The Platypus Is in Its Place: Nuclear Genes and Indels Confirm the Sister Group Relation of Monotremes and Therians. Mol Biol Evol. 2006, 23: 587-597. 10.1093/molbev/msj064.PubMedView Article
- Mikkelsen TJ, Wakefield MJ, Aken B, Amemiya CT, Chang JL, Duke S, Garber M, Gentles AJ, Goodstadt L, Heger A, Jurka J, Kamal M, Mauceli E, Searle SMJ, Sharpe T, Baker ML, Batzer MA, Venos PV, Belov K, Clamp M, Cook A, Cuff J, Das R, Davidow L, Deakin JE, Fazzari MJ, Glass JL, Grabherr M, Greally JM, Gu W, Hore TA, Huttley GA, Jirtle RL, Koina E, Lee JT, Mahony S, Marra MA, Miller RD, Nicholls RD, Oda M, Papemfuss AT, Parra ZE, Pollock DD, Ray DA, Schein JE, Speed TP, Thompson K, VandeBerg JL, Wade CM, Walker JA, Waters PD, Webber C, Weidman JR, Xie X, Zody MC, Broad Institute Genome Sequencing Platform, Broad Institute Whole Genome Assembly Team, Marshall-Graves JA, Ponting CP, Breen M, Samollow PB, Lander ES, Lindblad-Toh K: Genome of the marsupial Monodelphis domestica reveals adaptive turnover of coding and non-coding sequences. Nature. 2007, 447: 167-177. 10.1038/nature05805.PubMedView Article
- Deakin JE, Parra ZE, Graves JAM, Miller RD: Physical Mapping of T cell receptor loci (TRA, TRB, TRD and TRG) in the opossum (Monodelphis domestica). Cytogenet Genome Res. 2006, 112: 342K-10.1159/000089901.PubMedView Article
- Satyanarayana K, Hata S, Devlin P, Roncarolo MG, De Vries JE, Spits H, Strominger JL, Krangel MS: Genomic organization of the human T-cell antigen-receptor α/δ locus. Proc Natl Acad Sci USA. 1988, 85: 8166-8170. 10.1073/pnas.85.21.8166.PubMedPubMed CentralView Article
- Chien YH, Iwashima M, Kaplan KB, Elliot JF, Davis MM: A new T-cell receptor gene located within the alpha locus and expressed early in T-cell differentiation. Nature. 1987, 327: 677-682. 10.1038/327677a0.PubMedView Article
- Kubota T, Wang JY, Gobel TWF, Hockett RD, Cooper MD, Chen CH: Characterization of an avian (Gallus gallus domesticus) TCR αδ locus. J Immunol. 1999, 163: 3858-3866.PubMed
- Giudicelli V, Chaume D, Lefranc MP: IMGT/GENE-DB: a comprehensive database for human and mouse immunoglobulin and T cell receptor genes. Nucleic Acids Res. 2005, 33: D256-D261. 10.1093/nar/gki010.PubMedPubMed CentralView Article
- Glusman G, Rowen L, Lee I, Boysen C, Roach JC, Smith AFA, Wang K, Koop BF, Hood L: Comparative genomics of the human and mouse T cell receptor loci. Immunity. 2001, 15: 337-349. 10.1016/S1074-7613(01)00200-X.PubMedView Article
- ENSEMBL. [http://www.ensembl.org]
- Hedges SB, Parker PH, Sibley CG, Kumar S: Continental breakup and the ordinal diversification of birds and mammals. Nature. 1996, 381: 226-229. 10.1038/381226a0.PubMedView Article
- Hubbard GB, Saphire DG, Hackleman SM, Silva MV, Vanderberg JL, Stone WH: Ontongeny of the thymus gland of a marsupial (Monodelphis domestica) Laboratory animal. Science. 1991, 41: 227-232.
- Nei M, Gu X, Sitnikova T: Evolution by the birth and death process in multigene families of the vertebrate immune system. Proc Natl Acad Sci USA. 1997, 94: 7799-7806. 10.1073/pnas.94.15.7799.PubMedPubMed CentralView Article
- Su C, Jakobsen I, Gu X, Nei M: Diversity and evolution of T-cell receptor variable region genes in mammals and birds. Immunogenetics. 1999, 50: 301-308. 10.1007/s002510050606.PubMedView Article
- Baker ML, Rosenberg GH, Zuccolotto P, Harrison GA, Deane EM, Miller RD: Further characterization of T cell receptor chains of marsupials. Dev Comp Immunol. 2001, 25: 495-507. 10.1016/S0145-305X(01)00016-7.PubMedView Article
- Baker ML, Osterman AK, Brumburgh S: Divergent T cell receptor delta chains from marsupials. Immunogenetics. 2005, 57: 665-673. 10.1007/s00251-005-0030-0.PubMedView Article
- Call ME, Wucherpfennig KW: The T cell receptor: critical role of the membrane environment in receptor assembly and function. Ann Rev Immunol. 2005, 23: 101-125. 10.1146/annurev.immunol.23.021704.115625.View Article
- Rowen L, Koop BF, Hood L: The complete 685-kilobase DNA sequence of the Human βT cell receptor locus. Science. 1996, 272: 1755-1762. 10.1126/science.272.5269.1755.PubMedView Article
- Lefranc MP, Lefranc G: The T Cell Receptor FactsBook. 2001, Academic Press. Harcourt Science and technology company, San Diego, CA, 48-70.
- Vernooij BT, Lenstra JA, Wang K, Hood L: Organization of the murine T-cell receptor gamma locus. Genomics. 1993, 17: 566-574. 10.1006/geno.1993.1373.PubMedView Article
- Lefranc MP, Rabbitts TH: Two tandemly organized human genes encoding the T-cell gamma constant region sequences show multiple rearrangement in different T-cell types. Nature. 1985, 316: 464-466. 10.1038/316464a0.PubMedView Article
- NCBI. [http://www.ncbi.nlm.nih.gov/BLAST/]
- Miccoli MC, Antonacci R, Vaccarelli G, Lanave C, Massari S, Cribiu EP, Ciccarese S: Evolution of TRG clusters in cattle and sheep genomes as drawn from the structural analysis of the ovine TRG2@ locus. J Mol Evol. 2003, 57: 52-62. 10.1007/s00239-002-2451-9.PubMedView Article
- Conrad ML, Mawer MA, Lefranc MP, McKinnell L, Whitehead J, Davis SK, Pettman R, Koop BF: The genomic sequence of the bovine T cell receptor gamma TRG loci and localization of the TRGC5 cassette. Vet Immunol Immunopathol. 2007, 115: 346-356. 10.1016/j.vetimm.2006.10.019.PubMedView Article
- Parra ZE, Arnold T, Nowak MA, Hellman L, Miller RD: TCR gamma chain diversity in the spleen of the duckbill platypus (Ornithorhynchus anatinus). Dev Comp Immunol. 2006, 30: 699-710. 10.1016/j.dci.2005.10.002.PubMedView Article
- Childs G, Maxson R, Cohn RH, Kedes L: Orphons: dispersed genetic elements derived from tandem repetitive genes of eukaryotes. Cell. 1981, 23: 651-663. 10.1016/0092-8674(81)90428-1.PubMedView Article
- Miska KB, Wright AM, Lundgren R, Sasaki-McClees R, Osterman AK, Gale JM, Miller RD: Analysis of a marsupial MHC region containing two recently duplicated class I loci. Mamm Genome. 2004, 15: 851-854. 10.1007/s00335-004-2224-4.PubMedView Article
- Belov K, Deakin JE, Papenfuss AT, Baker ML, Melman SD, Siddle HV, Gouin N, Goode DL, Sargeant TJ, Robinson MD, Wakefield MJ, Mahony S, Cross JG, Benos PV, Samollow PB, Spedd TP, Graves JA, Miller RD: Reconstructing an ancestral mammalian immune supercomplex from a marsupial major histocompatibility complex. PLOS Biol. 2006, 4: e46-10.1371/journal.pbio.0040046.PubMedPubMed CentralView Article
- Ley RD, Reeve VE, Kusewitt DF: Photobiology of Monodelphis domestica. Dev Comp Immunol. 2000, 24: 503-516. 10.1016/S0145-305X(00)00013-6.PubMedView Article
- Teixeira AR, Monteiro PS, Rebelo JM, Arganaraz ER, Vieira D, Lauria-Pires L, Nascimento R, Vexenat CA, Silva AR, Ault SK, Costa JM: Emerging Chagas Disease: Trophic Network and Cycle of Transmission of Trypanosoma cruzi from Palm Trees in the Amazon. Emerg Infect Dis. 2001, 7: 100-112.PubMedPubMed CentralView Article
- IMGT. [http://imgt.cines.fr/]
- Koop BF, Rowen L, Wang K, Kuo CL, Seto D, Lenstra JA, Howard S, Shan W, Deahpande P, Hood L: The Human T-cell receptor TCRAC/TCRDC (Cα/Cδ) region: Organization, sequence, and evolution of 976 kb of DNA. Genomics. 1994, 19: 478-493. 10.1006/geno.1994.1097.PubMedView Article
- Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 24: 4876-4882. 10.1093/nar/25.24.4876.View Article
- Hall TA: BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999, 41: 95-98.
- Kumar S, Tamura K, Nei M: MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment. Brief Bioinform. 2004, 5: 150-163. 10.1093/bib/5.2.150.PubMedView Article
- Staden package. [http://staden.sourceforge.net/]
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.