Comparative genomic analysis and evolution of the T cell receptor loci in the opossumMonodelphis domestica
© Parra et al. 2008
Received: 21 November 2007
Accepted: 29 February 2008
Published: 29 February 2008
Skip to main content
© Parra et al. 2008
Received: 21 November 2007
Accepted: 29 February 2008
Published: 29 February 2008
All jawed-vertebrates have four T cell receptor (TCR) chains: alpha (TRA), beta (TRB), gamma (TRG) and delta (TRD). Marsupials appear unique by having an additional TCR: mu (TRM). The evolutionary origin of TRM and its relationship to other TCR remain obscure, and is confounded by previous results that support TRM being a hybrid between a TCR and immunoglobulin locus. The availability of the first marsupial genome sequence allows investigation of these evolutionary relationships.
The organization of the conventional TCR loci, encoding the TRA, TRB, TRG and TRD chains, in the opossumMonodelphis domesticaare highly conserved with and of similar complexity to that of eutherians (placental mammals). There is a high degree of conserved synteny in the genomic regions encoding the conventional TCR across mammals and birds. In contrast the chromosomal region containing TRM is not well conserved across mammals. None of the conventional TCR loci contain variable region gene segments with homology to those found in TRM; rather TRM variable genes are most similar to that of immunoglobulin heavy chain genes.
Complete genomic analyses of the opossum TCR loci continue to support an origin of TRM as a hybrid between a TCR and immunoglobulin locus. None of the conventional TCR loci contain evidence that such a recombination event occurred, rather they demonstrate a high degree of stability across distantly related mammals. TRM, therefore, appears to be derived from receptor genes no longer extant in placental mammals. These analyses provide the first genomic scale structural detail of marsupial TCR genes, a lineage of mammals used as models of early development and human disease.
The hallmarks of the vertebrate adaptive immune system are antigen specific receptors, the T cell receptors (TCR) and immunoglobulins (Ig) encoded by genes that undergo somatic DNA recombination to generate diverse binding specificities. The TCR are expressed by thymus-derived lymphocytes (T cells) that play a major role in regulation and effector functions of immune responses. Each T cell expresses a unique TCR that binds a specific antigen resulting in the activation of an immune response [1,2]. TCR are heterodimers comprised of either alpha (TRA) and beta (TRB) or gamma (TRG) and delta (TRD) combinations, respectively. These two combinations define the two major lineages of T cells: αβT cells and γδT cells [3,2]. αβT cells typically recognize peptide antigens presented on major histocompatibility complex (MHC) encoded molecules. In contrast γδT cells have been shown to be either MHC restricted or in some cases, similar to Ig, able to bind free antigen . Ig are expressed by antibody forming cells (B cells), which produce both a membrane bound form of Ig that comprises the B cell receptor (BCR) and a soluble form that is free antibody. Like TCR, Ig are made up of two different chain types, a heavy (IgH) and light (IgL) chain. TCR and Ig chains both contain variable domains that bind the antigen and membrane-proximal constant (C) domains. It is the variable domains that are encoded by gene segments that undergo somatic recombination to generate diversity in binding specificity. The gene segments encoding the variable domains of TRA, TRG and IgL chains are the variable (V) and joining (J) gene segments, while the variable domains of TRB, TRD and IgH chains are encoded by exons assembled from V, diversity (D) and J gene segments . Recombination of these gene segments takes place in the thymus for developing T cells and adult bone-marrow for developing B cells .
Of the immunoglobulin superfamily (IgSF) members, the TCR and Ig are each other's nearest relatives, however there are dissimilarities to their genetic structure and evolutionary history [6,7]. For example all jawed-vertebrates appear to contain the same four homologous TCR isotypes: TRA, TRB, TRG, and TRD . In contrast there is variability in the number and class of Ig isotypes in different vertebrate lineages [9,10]. In addition the organization of TCR loci appears to be more conserved than Ig. For example, in cartilaginous fish most Ig loci are organized as multiple, unlinked clusters of [V-(D)-J-C], limiting the combinatorial usage of their gene segments . Whereas, in bony fish and tetrapods the majority of Ig loci are organized in the translocon style of Vn-(D)n-Jn-Cn . TCR loci tend to be organized in the translocon style in all lineages. These differences between TCR and Ig genes are likely the result of dissimilar selection forces on the two different antigen receptor systems and have made determining the evolutionary relationship of the TCR and Ig chains to each other unclear .
The relationship between Ig and TCR is further muddled by the recent discoveries in marsupials and sharks of TCR loci that appear to be hybrids between ancestral Ig and TCR loci [13,14]. In marsupials, a mammalian lineage that diverged from eutherians (placental mammals) 186 to 193 million years ago (MYA), a new fifth TCR chain, named TCR mu (TRM) has been identified [14,15]. Unlike the conventional TCR, TRM has a tandem cluster organization and its origins appear to have involved a recombination between and ancestral TCR locus, most likely aTRDandIgH.TRMappears analogous to an unusual shark TCR calledNAR-TCR, which utilizes the C regions of TRD and upstream Ig-like V regions . Both TRM and NAR-TCR are expressed in an atypical TCR isoform that contains double variable domains. Marsupial TRM and shark NAR-TCR, however, are not orthologous but rather the product of convergent evolution generating common features . Nonetheless the presence of TCR with these features in both marsupials and cartilaginous fish, make it likely analogous TCR will be found in other vertebrate lineages and illustrate a level of plasticity in TCR evolution heretofore unrealized.
The availability of the first completely sequenced marsupial genome provides the opportunity to investigate the evolutionary origins of TRM and its relationship to the conventional TCR in mammals . Towards this aim we have determined the complete genomic organization, content and evolution of the loci encoding both the conventional TCRs (TRA, TRB,TRGandTRD) and the recently discoveredTRMlocus in the opossumMonodelphis domestica. In addition, these analyses provide a level of detail for the TCR genes of a marsupial that has only been available for a limited number of eutherians such as human and mouse.
Number of TRV, TRD, TRJ and TRC genes found in human, mouse and opossum. The numbers of functional genes are indicated in parenthesis.
To determine which of the 74 V gene segments are also used asTRA/DVwe performed RT-PCR on 23 day old thymus RNA using combinations of primer pairs specific for each of the V segment subgroups (see below) paired with either TRA or TRD C regions (TRACandTRDCrespectively) (not shown). This age was chosen as an early age where the thymus is fully mature . Nineteen of theTRAVsegments and all four of the functionalTRDVsegments were found expressed with both TRAC and TRDC resulting in a total of 23 apparentTRA/DVsegments (Figure 1and Table 1). The number of opossumTRA/DVmay be an underestimate since it is possible someTRAVmay be rarely expressed with TRDC or may appear at different times during development than was examined. Alternatively this number could be an overestimate as well, since it is possible that some combinations are negatively selected in the thymus and not used in the periphery. Either way this appears to be a comparatively high number ofTRA/DVsegments in the opossum relative to human and mouse (Table 1).
V gene segments evolve by gene duplication and deletion resulting in degrees of relatedness amongst segments . These are defined as subgroups with the segments belonging to the same subgroup by having 80% or greater nucleotide identity. By this criterion the current 68TRAVsegments can be placed into 41 subgroups, where the nucleotide identity between subgroups ranges from 32.1 to 79.5%. The sixTRDVgene segments were sufficiently different, with nucleotide identity ranging from 33.6 to 61.9%, that each belonged to its own distinct subgroup. Phylogenetic analyses usingTRAVandTRDVsegments were performed to elucidate their evolutionary relatedness. Opossum sequences were compared with sequences from human, mouse, rabbit, cow, sheep, and chicken using the same dataset as used by Suet al.  to define the major phylogenetic groups (Figure 2). Eight groups of V gene segments (designated A through H) with bootstrap values greater than 89% emerged from the inclusion of the marsupial sequences (Figure 2). All eight contain opossum sequences (Figure 2). Four of these, groups A through D, were defined previously and only included TRAV or TRAV/D . However, the addition of opossum sequences revealed four new V groups (E through H) not previously recognized or requiring reevaluation. Group E contains TRAV, TRDV and TRA/DV sequences; groups F and G only TRDV sequences; and group H TRAV and TRA/DV sequences. The addition of the opossum sequences to this analysis substantiate the statement that all these subgroups were present in the common ancestor of amniotes and that some species have lost segments that belong to different subgroups .
These analyses also allow us to evaluate the evolution of mammalian V segments that can be utilized with either TRA or TRD chains over a larger time-span than has been available previously. TRA/DV sequences clearly belong to different groups rather than forming a monophyletic cluster (Figure 2) and, as observed for human and mouse, TRA/DV regions are dispersed throughout the opossumTRA/Dlocus, although most are located towards the 3' end of the locus (Figure 1). Considering that αβ and γδT cells recognize potentially very different antigens these results continue to support the high level of plasticity in V gene utilization at this locus.
The number and complexity of D and J gene segments in the opossumTRA/Dlocus is also comparable to that of human and mouse (Figure 1, Table 1). Encoding TRD chains are at least two D and six J gene segments (TRDDandTRDJrespectively), all of which are located upstream of a singleTRDC. There are at least 53 TRAJ segments located between theTRDCandTRACall of which appear to be functional by the criteria defined above for the V gene segments (Figure 1, Table 1).
There are 36 opossumTRBVsegments (Figure 4, Table 1) and these can be grouped in 28 subgroups based on nucleotide identity with the value between subgroups ranging from 38.5 to 68.5%. Compared with other TCR, theTRBlocus in eutherians appears to contain a higher number of V pseudogenes, where 19% and 34% ofTRBVare pseudogenes in human and mouse, respectively. This pattern appears to hold for the opossum since nine of the 36TRBVsegments (25%) appear to be pseudogenes (Table 1).
Phylogenetic analyses of the TRBV regions from different mammals and chicken reveal that the opossum TRBV regions are very diverse with six groups (A to F) identified (Figure 5). Human, mouse and opossum TRBV sequences are found in all six groups, supporting their presence before the divergence of marsupial and eutherian mammals (Figure 5) . Three sequences chickenB1S1, mouseB2 and opossumB20 did not cluster with any other sequence, nor with each other, and therefore not included within any of the groups.
As in human and mouse, the opossum TRB D, J, and C genes are organized in tandem cassettes. The human and mouseTRBlocus contains two of such D-J-C cassettes while the opossum has four (Figures 3C–3Fand 4). In the opossum, each cassette contains a singleTRBD, four or fiveTRBJ, and a singleTRBC. All four cassettes appear to be functional andTRBJgene segments from each have been found in TRB transcripts (data not shown).
The four opossum TRBC regions are very similar to each other at the nucleotide level and in intron - exon organization. Each TRBC region is encoded by four exons (Figure 3C-F). Exon 1 encodes the immunoglobulin domain which contains two conserved cysteines residues important for the intra chain disulfide bond formation. There are three potential N-glycosylation sites in exon 1. Exon 1 of TRBC1, TRBC2 and TRBC3 have 100% nucleotide identity and TRBC4 differs by only a single non-synonymous nucleotide substitution (K10E) that encodes a lysine (AAA) instead of the glutamate (GAA). Exon 2 encodes the Cp and it contains a conserved cysteine residue used for the inter chain disulfide bond (Figure 3C-F). The Cp from the four opossum TRB share more than 82.6% nucleotide identity. Exon 3 encodes the Tm region and contains a lysine residue involved in the interaction with the CD3 complex. The Tm regions are much conserved among the four opossum TRBC, sharing greater than 85.1% nucleotide identity. Exon 4 encodes the cytoplasmic region and it also includes 3' untranslated region.
Due to the high degree of sequence similarity among the four cassettes described above it is difficult to fully reconstruct the duplication events that led to the current arrangement in the opossum. However, cassettes 2 and 3 share two characteristics indicating they are derived from a relatively recent tandem duplication. First of all, both cassettes are nearly identical in nucleotide sequence over a 12.8 kb region that extends from 4.8 kb upstream of a non-functional copy ofcyclin A2including the gene segmentsTRBD, TRBJto 100 bp downstream ofTRBC(Additional file 2). Secondly both cassettes 2 and 3 have acyclin A2gene 5' of the D-J-C gene segments, which is not present in the other two cassettes (Figure 4).Cyclin A2is also associated with the cow TRB locus but is not in human, mouse or chicken. This is consistent with thecyclin A2gene being inserted near the TRB D-J-C cassettes prior to the divergence of marsupials and eutherians, with subsequent loss in some eutherian species such as human and mouse.
Previously, phylogenetic analyses of the mammalian and avianTRGVsegments revealed the presence of eight ancient groups . Addition of the opossum TRGV sequences to these analyses revealed two additional groups, group I and J. Group I contains human, mouse and opossum TRGV4. Support for this group is low (Figure 7), however it is likely that these three sequences from opossum, human and mouse are derived from the same ancestral gene since they also have similar RSS sequences (data not shown). The new group J contains only the members of the opossum TRGV1 subgroup (Figure 7), which is related to the previously defined group F that contains sequences from human, rabbit, sheep and cow . This relationship is not well supported however, and two groups may not have arisen from a common ancestral gene segment (Figure 7). Opossum TRGV3 segments group in a previously defined group A. The single member of opossum TRGV2 subgroup does not clearly cluster with any existing group (Figure 7).
There is only a single opossum TRGC region (Figure 3Gand 6) in contrast to four in mouse (three of which are functional) and two in human (both functional) (Table 1). The opossum TRGC is encoded by three exons. Exon 1 encodes the immunoglobulin domain and contains a single N-glycosylation site. An unusual characteristic described previously in marsupials was the absence of the second cysteine residue required for the formation of intrachain disulfide bond in the TRGC region . Exon 2 encodes the Cp and exon 3 encodes the Tm, Ct and 3'UTR regions (Figure 3G).
Previously we reported that the TRM V gene segments appeared to be more similar to Ig V gene segments than that of TCR . This conclusion was drawn from an analysis of a limited number of available marsupial TCR V gene segments at that time. Availability of the complete TCR genomic sequences described above allows us to further test this observation.TRMis organized in tandem clusters where complete clusters contain two classes of V segments: a single non-rearranged V gene segment (TRMV) and an unusual V gene segment which is already joined to D and J genes (TRMVj) in the germline DNA . The opossum has six such complete clusters. The remaining two clusters are partial, lacking theTRMVandTRMDgene segments (Figure 8) .
To investigate the evolutionary history of the individual TRM clusters we compared the sequence and organization of the six complete clusters (clusters 1, 2, 3, 4, 5, and 7) and two partial clusters (clusters 6 and 8). These analyses revealed three classes of clusters based on gene content and nucleotide sequence identity. These three classes likely represent the most ancient duplications of TRM clusters. The lineages generated by these older duplications are represented by clusters 1, 2 and 3, respectively. Clusters 5 and 7 have similar gene content (threeTRMDsegments) and share greater nucleotide sequence identity to cluster 3 and represent more recent whole cluster duplications that would have followed the duplication of an additional D segment in this lineage . Clusters 4, 6, and 8 contain twoTRMDsegments and share greatest nucleotide sequence identity to cluster 2, representing another round of more recent duplications. Cluster 1 is a third class unto itself based on not sharing significant sequence similarity to the others . These results are consistent with the current complement of TRM clusters in opossum being the result of whole cluster duplications followed by divergence of each cluster. These duplication events have resulted in partial clusters cases and different numbers of clusters in different marsupial species; bandicoots for example appear to have only two TRM clusters [14,29].
In addition to the six functional TRMV gene segments located within theTRMlocus, there is a single TRMV orphon gene (TRMV-OR2) located on opossum chromosome 2 in a region containing a number of flanking sequences resembling long interspersed repeat elements (LINE)(data not shown). TRM is the only of the TCR loci in the opossum for which orphon V gene segments have been found. TRMV-OR2 appears to be non-functional and has no leader peptide, but it does contain a complete V gene segment and the recombination signal sequence (RSS). TRMV-OR2 is most similar to TRMV2 and TRMV4, sharing 96 and 95% nucleotide identity, respectively. The high degree of identity between these functional gene segments and the orphon is consistent with the latter being the result of a relatively recent duplication event that, based on the flanking LINE elements may have been due to transposition . TRMV-OR2 provides additional evidence for the role of retroelements in both gene translocation in marsupials. A similar translocation was reported previously for opossum MHC class I genes, where two class I loci (UB and UC) had been translocated outside the MHC region. UB and UC are similarly tightly flanked by retroelements [40,41]. Furthermore, TRMV-OR2 may also provide an additional connection between retroelements and the evolution of TCR genes themselves in marsupials. TRMVj is a variable region gene segment that appears to have been generated by retro-transposition since it is lacking an intron that all V gene segments have and is already joined to the D and J gene segments in the germline . Although highly speculative, it is possible that translocated orphons such as TRMV-OR2 may have contributed to the activation of local retroelement genes to allow their co-expression when the TRM locus is actively transcribed. This would have been a necessary step in the retro-translocation events that generated TRMVj.
The availability of a large number of vertebrate genome sequences, with varying degrees of depth of sequence coverage, provided the opportunity to search for TRM or TRM-like genes in species other than marsupials. A search of the available genomes using the BLAST algorithm and opossum TRM sequences was unable to identify a homologue in any of the eutherian species available. This search included human, mouse, rabbit, dog, cat, cow, horse, hedgehog, elephant and armadillo. The rabbit, cat, hedgehog, elephant and armadillo are low coverage genome sequences ranging from 1.86× to 2×. While it is possible that in any given species the TRM locus was missed in the sequencing, it is unlikely that such a random gap would have been consistently present in all the eutherian genomes. Therefore we conclude that TRM is not likely present in any eutherian lineage. We also searched the current chicken and the anole lizard (Anolis carolinensis) genomes in a similar manner and were unable to detect clear TRM homologues. In the case of chicken and all the eutherian genomes, the TRD homologues were identifiable indicating that our search strategies are able to pick up this conventional TCR locus. Furthermore, we were able to identify a clear TRM homologue in the recently completed platypus genome sequence (Ornithorhynchus_anatinus-5.0) available at GenBank. Further characterization of the platypus TRM locus is ongoing and beyond the scope of this paper. Nonetheless identification of these genes in a monotreme, which are separated from marsupials and eutherians by 217 to 231 MY , further supports that our search strategies should be able to identify TRM homologues in other species. These results are also consistent with TRM being present early in the evolution of mammals and therefore likely lost in the eutherian lineage.
First and foremost, we have described in detail the genomic content and complexity of the T cell receptor loci for the opossumMonodelphis domestica, the first such analysis available for a marsupial. The opossum is arguably the most extensively studied marsupial species and is used as a model of human disease and development. The opossum, for example, is one of the few mammalian model organisms that develop melanoma following exposure to ultraviolet radiation providing a cancer model . Additionally, opossums are a natural host and a reservoir of the causative agent of Chagas disease,Trypanosoma cruzi. Like all marsupials, opossums give birth to highly altricial young also providing a model for early immune as well as other anatomical system development. Further characterization of the immune system in the opossum, and T cell immunobiology in particular, is important for better understanding of these disease and developmental models. Complete characterization of the TCR genomics in this species is one step in that direction.
The opossum whole genome assembly MonDom5 was used in this study and it is available at GenBank under the accession number AAFR00000000. The location for all TCR gene segments in MonDom5 is provided in Additional file 3. For comparative purposes the current genome assemblies from human (NCBI 36), mouse (NCBI m37), cow (Btau_3.1) and chicken (WASHUC2), rabbit (RABBIT), dog (CanFam2.0), cat (CAT), horse (preEnsemblEquCab2), hedgehog (eriEur1), elephant (BROAD E1), armadillo (ARMA) and the anole lizard (AnoCar1.0) were searched for any evidence of TRM. Genomes were analyzed using BLAST assembled genomes tools [23,35].
Opossum lymphoid tissues were collected and immediately processed to extract the RNA or stored in RNAlater®(Ambion, Austin, TX) at 4°C for 24 hours and stored at -80°C for use later. Whole RNA extraction was performed using the Trizol RNA extraction protocol (Invitrogen, Carlsbad, CA). Tissue was homogenized in 1 ml of Trizol®Reagent per 100 mg until the tissue was completely dispersed. Phase separation was done using 200 μl of chloroform per 1 ml of Trizol. RNA was precipitated with 500 μl of isopropanol per 1 ml of trizol, washed with 70% ethanol and resuspended in 50 to 100 μl of DEPC water. DNase treatment to remove contaminating DNA has been performed using Ambion's kit TURBO DNA-free (Ambion, Austin, TX). Each sample was quantified using the NanoDrop ND-1000 Spectrophotometer (NanoDrop Technologies, Wilmington, DE).
Reverse transcription-polymerase chain reactions (RT-PCR) were performed using GeneAmp RNA PCR Core Kit (Applied Biosystems, Foster City, CA). Amplifications of cDNAs were performed using AdvantageTM-HF 2 PCR (BD Biosciences, CLONTECH Laboratories, Palo Alto, California) with the conditions: 94°C for 1 minute, denaturation at 94°C for 30 seconds, annealing/extension according to the melting temperature of the primers, and a final extension period of 68°C for 5 minutes.
PCR products were cloned using TOPO TA Cloning®Kit for sequencing (Invitrogen, Carlsbad, CA). Plasmids were sequenced using BigDye Terminator Cycle Sequencing Kit v3 (Applied Biosystems, Foster City, CA) in 10 μl reactions and analyzed on an ABI Prism 3100 DNA automated sequencer (PerkinElmer Life And Analytical Sciences Inc, Wellesley, MA). Analyses of chromatograms were done using the SequencherTM 4.6 program (Gene Codes Corporation, Ann Arbor, MI).
To determine the location, content and organization of the TCR genes, the whole opossum genome was searched using the BLAST algorithm. TRA, TRB, TRG, TRD and TRM were located using sequences previously isolated [28,29,38,14]. The V and J segments were located by similarity to corresponding segments from other species and by identifying the flanking conserved RSS.
Rapid amplification of 5' complementary DNA ends (5' RACE) performed on opossum thymus mRNA was used to identify novel, expressed V, D and J segments. In addition to using 5' RACE, PCR using primers that are specific for each V family were also used in RT-PCR to amplify cDNA containing VDJ recombinations that are underrepresented in the RACE PCR. Primers used to amplify the conventional TCR are complementary to the most conserved sequences of the V regions and have been paired with primers located on the C regions (Additional file 4). Sequences obtained by these means were compared with the whole opossum genome using the BLAST algorithm to identify novel TCR gene segments. This approach allowed the identification of gene segments in six possible reading frames, which also help to find gene segments located in the reverse orientation and D segments that may be used in multiple reading frames. The exon-intron organization of V regions was determined using sequences obtained by 5' RACE.
Determination of the exon-intron organization in the C regions was done using BLAST to compare available cDNA sequences that encode complete TCR chains to the opossum genomic sequences. Additionally, transcripts encoding the C terminal end of TCR were obtained by 3' RACE performed on thymus cDNA. Clones obtained by 3' RACE PCR were used to identify the CP, TM and CT and 3' untranslated regions (UTR) for each one of the TCR. cDNA made using the Oligo dT primer was used to perform 3' RACE PCR. The Oligo dT primer supplies a priming site for the GeneRacerTM 3' PCR primers. Sequences obtained by 3' RACE were aligned with the germline sequences to determine the location, intron-exon boundaries and splice sites of these exons.
Opossum gene segments were named following the IMGT nomenclature established for human and mouse . TRV segments were numbered according to their location, from the 5' to 3' end of the locus. TRAJ segments were numbered from 3' to 5' according to a nomenclature proposed by Koopet al.  and followed by IMGT. The TRBD, TRBJ and TRBC found in four cassettes in the opossum are numbered according to the cassette to which they belong and the position of the cassette from 5' to 3' in the locus.
Nucleotide sequences that encode from FR1 through FR3 of the V regions identified from each one of the opossum TCR loci were compared to TCR V sequences from other species retrieved from GenBank. The sequences were aligned using ClustalX  and BioEdit  programs. Phylogenetic analyses were performed using MEGA version 3.0  for the distance methods (neighbor joining and minimum evolution). Confidence values were obtained from bootstrap analyses using 1000 replications.
The accession numbers for sequences used in the phylogenetic analyses are:
Human: A8.4: D13077; A3: X57534; A16: DQ097913; A9.2: X57531; A40: DQ097942; A10: DQ097904; A27: DQ341447; A25: DQ097923; A35: DQ097935; A22: DQ097920; A20: X70305; A34: DQ097934; A30: X58768; A41: DQ097943; A36/DV7: X61070; A17: DQ097914; A6: X58747; A24: M17661; A21: U50404; A23/DV6: D13071; A29/DV5: M17664; A1.1: X04939; A26.1: L06886; A4: M17663; A38-2/DV8: D13074; A14: S51029; D3: M23326; D1: AY357942; D2: S24406; A19: Z46641; A2: DQ097917.Mouse: A9: M33586; A17: X60319; A12.3: M38680; A6: M34200; A11: DQ340292; A13: M38102; A4: L47342; A7: X56719; A14.3: L77149; A5.4: M38681; A10: X57397; A3.1: X02967; A1: M22604; A2: X03760; A21/DV12: M94080; A15-2/DV6-2: M37599; D5: X12729; D4: M23545; TRADV16D: M16118; D2.2: M37280.Rabbit: D4: 38120; 885: M12885; D1: D26555; D5: D38121.Sheep: A622: M55622; D1S1: Z12989; D6: AJ005908; D7: AJ809501; D4: AJ005906; D2: AJ005904; D5: Z12995; D3S2: Z12996; A35: U78035.Cow: 014: D90014; 013: D90013; 015: D90015; 011: D90011; 012: D90012; 017: D90017; D113: D16113; 016: D90016; D116: D16116.Chicken: 611: GDU04611; 612: GDU04612; 613: GDU04613.
Human: B11.3: X58797; B7.2: U07975; B12.1: X07224; B14: X06154; B17: U03115; B2: M64351; B27: U66061; B25.1: L27610; B13: U03115; B4.3: X58812; B21.1: L27608; B23: L27614; B5: X61439; B9: M27380; B3.1: U07977; B15: U03115; B24.1: L27612; B10.3: U17047; B28: U08314; B6.9: X61447; B19: U48260; B30: L06893; B20.1: X72719; B29.1: M13847.Mouse: B14: AE000664; B16: L29434; B3: AE000663; B21: X16691; B26: K02548; B23: X59150; B24: M61184; B12: M30881; B2: AE000663; B4: X56725; B17: AE000664; B29: X00696; B10: X16694; B13.3: M15616; B19: AJ249821; B31: X03277; B30: X16695; B20: M11859; B1: X01642.Rabbit: B8: BAA04245; B11: BAA04248; B9: BAA04246; B7S1: BAA04241; B1: AAA31472; B5: BAA04239; B10: BAA04247; B2: M13895; B6: BAA04240.Sheep: B6S1: AAB88431; B8S1: AAB88433; B10S1: AAB88434; B1S5: AAB88425; B7S1: AAB88432; B22S1: AAB88440; B3S1: AAB88427; B13S1: PQ0068; B15S1: AAB88438; B12S1: AAB88435; B17S1: AAB88439; B2S1: AAB88426; B4S1: AAB88430.Cow: B126: PQ0062; B129: PQ0065; B125: PQ0061; B122: JQ0473; B123: PQ0060; B124: PQ0059.Chicken: B1S1: B36198; B2S2: AAA62753.
Human: G2: M13429; G3: S60779; G4: S60780; G2: M27335.Mouse: G3: AF037352; G6: M13338; G4: M13336; G7: AF037352; G1: Z22847.Rabbit: G1S2: D38137; G1S3: D38138; G1S1: D38135; G1S4: D38139; G2S1: D38142.Sheep: G6S1: Z13007; G2S1: Z12999; G2S2: Z13000; G2S3: Z13001; G2S4: Z13002; G5S1: Z13005; G5S2: Z13006; G1S1: Z12998; G3S1: Z13003; G4S1: Z13004.Cow: G6: AY560834; G1S1: D16119; G1S3: D16131; G5S11: D16126; G5S7: D16130; G5S6: D16129; G5S16: D16133; G3S1: U73186; G3S2: U73187; G4S1: U73188.Platypus: V3.1: AAY82120; V3.2: AAY82094; V1.1: AAY82100; V1.2: AAY82093; V1.3: AAY82109; V2.2: AAY82110; V2.3: AAY82114; V2.1: AAY82119.Chicken: G3S8: U78235; G3S4: U78231; G3S3: U78230; G2S7: U78225; G2S8: U78226; G2S9: U78227; G1S4: U78212; G1S5: U78213; G1S3: U78210; G1S8: U78216.
IgH: Hsheep80: U80145; Hmouse21: M27021; Hhuman47: X64147; Maxolot43: L20243; Hmouse69: K01569; Hhuman14: X05714; Mxenopus84: M20484; Hmouse70: M21470; Hhuman00: DQ454900; HHlama75: AY544575; HHlama33: AY342133; HHlama32: AJ629032; HHlama86: AF441486; Hpig94: U15194; Hhuman62: U80162; Hopossum13: AF012113; Hopossum24: AF012124; Hhuman86: M99686; Hmouse65: M25465; Hmouse85: M31285; Hmouse01: X03301; Mchicken48: M30348; Mzebrafish80: AF281480; Mcatfish60: DQ230560; Mtrout03: DQ831803; Mrockcod55: AF303555; Hwolffish86: AY188786; Hsturgeon61: DQ257661; Wnurseshark50: U51450; Wlungfish27: AF437727; Mnurseshark51: M92851; Hhornshark49: X13449; Hsandbarshark57: AY548357.NAR-TCR: TnarsharK88: DQ022688; Tnarshark10: DQ022710; Tnarshark06: DQ022706.NAR: Nwobbegong92: AF336092; Nwobbegong94: AF336094; Nnurseshar03: AF447103; Nnurseshar81: AY114781.IgL: Khuman92: Y08392; Khuman98: DQ915098; Ksheep10: X54110; Krabbit53: AF211353; Krat09: U39609; Khorse11: X75611; Kopossum08: AY074408; Lnurseshark44: GCU15144; Khamster65: U17165; Khuman98: AY320598; Lmouse88: DQ986488; Lray46: AB062646; Lseabream69: EF555069; Lsturgeon42: AJ387842; Lsalmon17: AF273017; Lseabass79: AJ291779; Ltrout60: X65260; Lhuman77: AF216777; Lopossum81: AF049781; Lchicken67: S65967; Lxenopus78: L76578; Lhuman69: Z73669.TRG: TGhuman29: M13429; TGplatypus21: DQ011321; TAmouse98: M34198; TArabbit85: M12885; TGhuman79: S60779; TGcow86: U73186; TGrabbit42: D38142; TGcow88: U73188.TRA/D: TAsalmon97: EF467297; TAsalmon05: EF466505; TAfugu52: AY198352; TAselfish71: AY198371; TAselfish49: AY198349; TAcod44: AJ133844; TAfugu95: AB222395; TDhalibut71: AB076071; TDfugu79: AB222479; TAzebrafish50: AL592550; TADhuman96: Z14996; TAsheep35: U78035; TDsheep03: AJ005903.TRB: TBtrout23: OSU18123; TBseabream37: AM490437; TBfugu32: AB222432; TBselfish13: AF324813; TBopossum8.1: XM_001363148; TBmouse82: DQ983582; TBsheep11: AF030011; TBrabbit19: D17419; TBhuman04: M27904; TBsheep17: AF030017; TBcow29: D90129; TBmouse16: M15616.
Gray short-tailed opossum: IGHV1-1: AAC48826; IGHV1-6: AAC48820; IGHV1-11: AAC48816; IGHV1-16: AAC48836; IGHV1-23: AAC48846; IGHV2-1: AAC48849. Remaining germline sequences are available at AAFR00000000.Brush-tail possum: 70: AAL87470; 71: AAL87471; 72: AAL87472; 73: AAL87473; 74: AAL87474; 76: AAL87476; 77: AAL87477; 78: AAL87478; 79: AAL87479; 91: AAD41691; 92: AAD41692; 41: AAT40441; 40: AAT40440; 44: AAT40444; 43: AAT40443; 42: AAT40442.Virginia opossum: P83 is an unpublished sequence kindly provided by Dr. R. Riblet.Tammar Wallaby: sequences are unpublished but available upon request.Northern Brown Bandicoot: Bandicoot58: AY586158.SharkNTCR: AAY98815.
Comparisons of the genomic sequence were performed using the program Spin from the Staden Package . Dot matrix plots were generated to determine degrees of similarity among cassettes for the TRB locus. The sequence analyzed includes the four TRB cassettes with coding and non-coding sequence from 3kb upstream of the most 5' TRBD (TRBD1) segment to the most 3' TRBC (TRBC4) segment, which comprises 51 kb. Although not shown, similar comparisons were performed for the other TCR loci.
T cell receptor
T cell receptor alpha
T cell receptor beta
T cell receptor gamma
T cell receptor delta
T cell receptor mu
variable gene segment
Diversity gene segment
Joining gene segment
recombination signal sequence
major histocompatibility complex
This work was supported by funding from the National Institutes of Health Initiative for Maximizing Student Diversity (ZEP, JT) and Institutional Development Award (IDeA) programs (MLB, JH), the Robert C. McNair program (AML) and the National Science Foundation (RDM).
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.