Evolution of the mitochondrial genome in snakes: Gene rearrangements and phylogenetic relationships
© Yan et al; licensee BioMed Central Ltd. 2008
Received: 08 July 2008
Accepted: 28 November 2008
Published: 28 November 2008
Snakes as a major reptile group display a variety of morphological characteristics pertaining to their diverse behaviours. Despite abundant analyses of morphological characters, molecular studies using mitochondrial and nuclear genes are limited. As a result, the phylogeny of snakes remains controversial. Previous studies on mitochondrial genomes of snakes have demonstrated duplication of the control region and translocation of trnL to be two notable features of the alethinophidian (all serpents except blindsnakes and threadsnakes) mtDNAs. Our purpose is to further investigate the gene organizations, evolution of the snake mitochondrial genome, and phylogenetic relationships among several major snake families.
The mitochondrial genomes were sequenced for four taxa representing four different families, and each had a different gene arrangement. Comparative analyses with other snake mitochondrial genomes allowed us to summarize six types of mitochondrial gene arrangement in snakes. Phylogenetic reconstruction with commonly used methods of phylogenetic inference (BI, ML, MP, NJ) arrived at a similar topology, which was used to reconstruct the evolution of mitochondrial gene arrangements in snakes.
The phylogenetic relationships among the major families of snakes are in accordance with the mitochondrial genomes in terms of gene arrangements. The gene arrangement in Ramphotyphlops braminus mtDNA is inferred to be ancestral for snakes. After the divergence of the early Ramphotyphlops lineage, three types of rearrangements occurred. These changes involve translocations within the I Q M tRNA gene cluster and the duplication of the CR. All phylogenetic methods support the placement of Enhydris plumbea outside of the (Colubridae + Elapidae) cluster, providing mitochondrial genomic evidence for the familial rank of Homalopsidae.
Snakes are a large group of reptiles with a broad range of morphological features, of which many are evolutionarily selected by their habitats. Snakes have conventionally been divided into two groups. The fossorial scolecophidians (blindsnakes and threadsnakes) are small snakes with a small gape size that feed on small prey on a frequent basis. The second major group, the alethinophidians (or "true snakes") are more ecologically diverse and most species feed on relatively large prey on an infrequent basis. True snakes are further split into the Henophidia and the Caenophidia. The caenophidians, which are also called advanced snakes, include the aquatic genus Acrochordus and the Colubroidea. The Colubroidea is subdivided into the families Atractaspididae, Elapidae, Viperidae, and Colubridae. A small colubrid subfamily, Homalopsinae, was first attributed familial rank by Günther in 1864, and was recognized as subfamily in the 20th century by most researchers until it was reassigned familial status in recent years [2–4].
Recent phylogenetic analyses, based primarily on molecular analyses of a few mitochondrial or nuclear genes failed to reach a consensus in several aspects [2, 3, 5, 6]. For instance, the composition of the family Colubridae, the putative paraphyly and the hierarchical structuring into subfamilies remain contentious issues. The mitochondrial genome has several advantages for phylogenetic studies [7, 8], and has been widely used in constructing animal phylogeny including snakes .
Previous studies of snake mitochondrial genomes have demonstrated that duplication of the control region and translocation of trnL are two visible features of the alethinophidian mtDNAs [9, 10]. Moreover, translocation and pseudogenization of trnP have been found in some caenophidian snakes . The Texas threadsnake (Leptotyphlops dulcis) possesses a different gene arrangement and loses its origin of light strand replication (OL) . In the present study, we determined complete mitochondrial DNA sequences from four snake families. The sequence information allowed the gene organizations, mitochondrial genome evolution, and phylogenetic relationships among these major snake families to be identified.
Characteristics of the snake mitochondrial genomes
List of taxa used in this study
Dong and Kumazawa, 2005
Dong and Kumazawa, 2005
Dong and Kumazawa, 2005
Dong and Kumazawa, 2005
Kumazawa et al., 1998
Jiang et al., 2007
Dong and Kumazawa, 2005
Jiang et al., 2007
Dong and Kumazawa, 2005
Macey et al., 2004
Kumazawa and Nishida, 1999
Janke et al., 2001
Kumazawa and Endo, 2004
General characteristics of four snake mitochondrial genomes
Genome Length (bp)
G + C nucleotide content (%)
To further substantiate the above premises, we performed the SH statistical test  on both amino acid and nucleotide data sets that allows comparison of alternative phylogenetic hypotheses. Results of the SH test strongly rejected the placement of Enhydris plumbea within the colubrids cluster (P < 0.001). The monophyly of scolecophidian snakes was not rejected using both amino acid and nucleotide acid data sets (P > 0.05).
Evolution of snake mitochondrial genomes
Based on the phylogenetic relationships among the tested snakes and the comparisons of their gene organizations (Fig. 1, 2), we estimated the processes of evolutionary events occurred in snake mitochondrial genomes. In early snake lineages (type I and II), gene arrangements are similar to that of typical vertebrate, but OL was lost within the W ANCY tRNA gene cluster. Incompatible with the commonly accepted view on monophyly of scolecophidian snakes [15–18], our phylogenetic estimates strongly supports Ramphotyphlops braminus being the sister lineage to all remaining species sampled. Given that the monophyly of scolecophidian snakes was not rejected in SH statistical test, loss of OL may occur in two different scenarios, independently (if nonmonophyly) or descend from a common ancestry (if monophyly). After the divergence of the Ramphotyphlops lineage, changes involving the I Q M cluster took place. First in type II, trnQ underwent a long distance translocation (~1.2 kb) from one gene cluster to another (Fig. 1, 2). Subsequently, in the early alethinophidian lineage, the control region was duplicated and trnL relocated to the I Q M cluster, giving rise to type III which is present in most alethinophidian snakes (including henophidians, Acrochordus and Naja). New types emerged during the split in Caenophidia. Type IV is found in two branches, Dinodon semicarinatus, Pantherophis slowinskii (Colubridae), and Enhydris plumbea (Homalopsidae), and characteristic changes (P*) likely appeared ahead of node f, which then disappeared in Elapidae. It is also conceivable that the present of P* was resulted from independent evolution in Colubridae and Homalopsidae. Distinct arrangements (type V and VI) were found in viperids, suggesting that trnP was translocated in early stage of the viperid radiations . Type VI, with no pseudogenes close to CRI, was found in two paraphyletic taxa, suggesting that P* could have been independently eliminated.
Familial rank of Homalopsidae
The Homalopsinae have been generally recognized as a valid monophyletic clade within the Colubridae [19, 20] and assigned a subfamilial rank, despite they being assigned familial  or tribal  status historically. Recent molecular studies placed the Homalopsinae as the sister group to most other members of the Colubroidea [2, 3, 5], and a familial status has been reassigned accordingly [2–4].
In this study, the placement of Enhydris plumbea, a representative of the Homalopsidae, as the sister lineage to the Colubridae + Elapidae clade was strongly supported by all phylogenetic methods. Moreover, SH test strongly rejected the hypothesis that Enhydris plumbea falls within the colubrids cluster (P < 0.001). The familial rank of Homalopsidae is therefore considered well-supported. Our work for the first time establishes the monophyly and distinctiveness of this family with phylogenetic evidence derived from complete mitochondrial genome sequences.
In this study, six types of mitochondrial gene arrangement in snakes are summarized. Two notable features of the alethinophidian mtDNA, duplication of the CR and translocation of trnL, are presented. The gene arrangement in Ramphotyphlops braminus mtDNA is indentical to that found in typical vertebrates, suggesting an ancestral arrangement. The well supported phylogenetic topology helps to reconstruct the evolution of mitochondrial gene arrangements in snakes. We propose that, after the divergence of the early Ramphotyphlops lineage, three types of changes involving the I Q M gene cluster occurred. These include the translocation of trnQ in the early Leptotyphlops lineage, the duplication of CR and translocation of trnL in the early alethinophidian lineage, and the translocation of trnP in the early viperid lineage. All phylogenetic methods support the placement of Enhydris plumbea outside of the (Colubridae + Elapidae) cluster, providing mitochondrial genomic evidence for the familial rank of Homalopsidae. The monophyly of Scolecophidia is not rejected in our study. However, a more comprehensive sampling of snake mitochondrial genomes is necessary to further refine the phylogenetic relationships among major groups of snakes.
Samples, DNA amplification, and sequencing
Snakes from three alethinophidian families and one scoleophidian family were sampled (Table 1). Total DNA was extracted from a small quantity (20 mg) of tissues by DNeasy Tissue Kit (Qiagen). Several short mtDNA fragments were amplified using Ex-Taq DNA polymerase (Takara) and sequenced in order to design taxon-specific primers. PCRs were performed in a MJ PTC-200 thermal cycler under the profile: 5 min at 95°C followed by 35 cycles of 95°C for 30 s, 50–55°C for 30 s, and 72°C for 90 s. PCR products of 1~2.5 kb were purified and then sequenced employing an ABI 310 or 3700 system with bi-directional and several internal primers. Short fragments were assembled into a continuous sequence. In the mtDNA sequences thus obtained, 37 individual genes were identified based on corresponding homologues from other vertebrates. Identification of tRNA genes was based on their secondary structures using software DNASIS 2.5 (Hitachi Engineering, Tokyo, Japan), whereas boundaries of rRNA genes and control regions were tentatively defined by the boundaries of adjacent coding genes. The mtDNA sequences, with annotations, have been deposited at GenBank (DQ343647–DQ343650).
Taxa, alignment and phylogenetic analyses
We assembled 14 serpent ingroups with complete mitochondrial genomes available, and chose 4 taxa from 4 saurian families (Amphisbaena schmidti ; Eumeces egregius ;Iguana iguana ; and Varanus komodoensis ) as outgroups (Table 1). Two data sets were prepared for concatenated amino acid sequences and for concatenated light-strand nucleotide sequences of the 12 protein genes. Nad6, the only protein gene encoded by the light strand, has been excluded for increased proportion of T and G in all codon positions due to the strand-specific base composition bias of mtDNAs. Multiple alignments were analyzed with the Gblocks program  to select conserved amino acid residues, which was later used as a backbone to align the corresponding nucleotide sequences.
The level of saturation in the whole codons, and at the first, second, and the third codon positions was independently analyzed using scatter plot graphics, by comparing the uncorrected p-distance with the distance calculated by applying the best-fit evolutionary model (GTR + I + G) selected by the Modeltest 3.7 . The third positions of the protein genes were removed from the nucleotide data set because of high substitutional rates and consequent saturation as a source of noise in phylogenetic analyses. Thus a final alignment of 6566 bases was obtained.
Phylogenetic analyses were carried out using maximum likelihood (ML), Bayesian (BI), maximum parsimony (MP) and neighbor-joining (NJ) methods. The ML analyses with the nucleotide data set were conducted with PAUP*4.0b10 by a heuristic search with TBR branch swapping with 10 random taxon additions. The general reversible model (GTR + I + G) and parameters optimized by Modeltest 3.7 were used. Bayesian phylogenetic analyses of the nucleotide sequences were performed with MrBayes 3.1  using a GTR + I + G model. The Markov chain Monte Carlo process was set to run four chains simultaneously. Posterior probabilities were calculated from the majority-rule consensus trees constructed after excluding the burn-in.
ML analyses with the amino acid data set were conducted using PUZZLE 5.2  with the mtREV24 substitution matrix and amino acid frequency estimated from the data set. The Bayesian analyses of the amino acid data were conducted with MrBayes 3.0 using the mtREV24 + I + G model and an empirical amino acid frequency. The Bayesian tree and posterior probability values were obtained using the same procedures described above.
We would like to thank Xiang Ji, Hongying Sun and Jianhua Dai for help in sample collection. Support for this research was provided by the National Natural Science Foundation of China (NSFC) grant (No. 30570249) to KYZ.
- Günther ACLG, ed.: Reptiles of British India. 1864, London: The Ray Society
- Lawson R, Slowinski JB, Crother BI, Burbrink FT: Phylogeny of the Colubroidea (Serpentes): new evidence from mitochondrial and nuclear genes. Mol Phylogenet Evol. 2005, 37 (2): 581-601. 10.1016/j.ympev.2005.07.016.PubMedView ArticleGoogle Scholar
- Vidal N, Delmas AS, David P, Cruaud C, Couloux A, Hedges SB: The phylogeny and classification of caenophidian snakes inferred from seven nuclear protein-coding genes. C R Biol. 2007, 330 (2): 182-187. 10.1016/j.crvi.2006.10.001.PubMedView ArticleGoogle Scholar
- Wiens JJ, Kuczynski CA, Smith SA, Mulcahy DG, Sites JW, Townsend TM, Reeder TW: Branch lengths, support, and congruence: testing the phylogenomic approach with 20 nuclear Loci in snakes. Syst Biol. 2008, 57 (3): 420-431. 10.1080/10635150802166053.PubMedView ArticleGoogle Scholar
- Kelly CM, Barker NP, Villet MH: Phylogenetics of advanced snakes (Caenophidia) based on four mitochondrial genes. Syst Biol. 2003, 52 (4): 439-459. 10.1080/10635150390218132.PubMedView ArticleGoogle Scholar
- Vidal N, Hedges SB: Higher-level relationships of snakes inferred from four nuclear and mitochondrial genes. C R Biol. 2002, 325 (9): 977-985. 10.1016/S1631-0691(02)01510-X.PubMedView ArticleGoogle Scholar
- Saccone C, Gissi C, Reyes A, Larizza A, Sbisa E, Pesole G: Mitochondrial DNA in metazoa: degree of freedom in a frozen event. Gene. 2002, 286 (1): 3-12. 10.1016/S0378-1119(01)00807-1.PubMedView ArticleGoogle Scholar
- Boore JL: Animal mitochondrial genomes. Nucleic Acids Res. 1999, 27 (8): 1767-1780. 10.1093/nar/27.8.1767.PubMedPubMed CentralView ArticleGoogle Scholar
- Dong S, Kumazawa Y: Complete mitochondrial DNA sequences of six snakes: phylogenetic relationships and molecular evolution of genomic features. J Mol Evol. 2005, 61 (1): 12-22. 10.1007/s00239-004-0190-9.PubMedView ArticleGoogle Scholar
- Kumazawa Y, Ota H, Nishida M, Ozawa T: Gene rearrangements in snake mitochondrial genomes: highly concerted evolution of control-region-like sequences duplicated and inserted into a tRNA gene cluster. Mol Biol Evol. 1996, 13 (9): 1242-1254.PubMedView ArticleGoogle Scholar
- Kumazawa Y: Mitochondrial DNA sequences of five squamates: phylogenetic affiliation of snakes. DNA Res. 2004, 11 (2): 137-144. 10.1093/dnares/11.2.137.PubMedView ArticleGoogle Scholar
- Asakawa S, Kumazawa Y, Araki T, Himeno H, Miura K, Watanabe K: Strand-specific nucleotide composition bias in echinoderm and vertebrate mitochondrial genomes. J Mol Evol. 1991, 32 (6): 511-520. 10.1007/BF02102653.PubMedView ArticleGoogle Scholar
- Jiang ZJ, Castoe TA, Austin CC, Burbrink FT, Herron MD, McGuire JA, Parkinson CL, Pollock DD: Comparative mitochondrial genomics of snakes: extraordinary substitution rate dynamics and functionality of the duplicate control region. BMC Evol Biol. 2007, 7: 123-10.1186/1471-2148-7-123.PubMedPubMed CentralView ArticleGoogle Scholar
- Shimodaira H, Hasegawa M: Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol Biol Evol. 1999, 16 (8): 1114-1116.View ArticleGoogle Scholar
- Lee MSY, Scanlon JD: Snake phylogeny based on osteology, soft anatomy and ecology. Biol Rev. 2002, 77 (3): 333-401. 10.1017/S1464793102005924.PubMedView ArticleGoogle Scholar
- Heise PJ, Maxson LR, Dowling HG, Hedges SB: Higher-level snake phylogeny inferred from mitochondrial DNA sequences of 12S rRNA and 16S rRNA genes. Mol Biol Evol. 1995, 12 (2): 259-265.PubMedGoogle Scholar
- Vidal N, David P: New insights into the early history of snakes inferred from two nuclear genes. Mol Phylogenet Evol. 2004, 31 (2): 783-787. 10.1016/j.ympev.2004.01.001.PubMedView ArticleGoogle Scholar
- Underwood G, British M: A contribution to the classification of snakes. 1967, British Museum (Natural History)Google Scholar
- Zug GR, Vitt LJ, Caldwell JP, eds.: Herpetology. 2001, San Diego: Academic Press
- Greene HW, ed.: Snakes: the evolution of mystery in nature. 1997, Berkeley: Univ. of California Press
- Dowling HG, Duellman WE, eds.: Systematic herpetology: a synopsis of families and higher categories. 1978, New York: HISS Publications
- Macey JR, Papenfuss TJ, Kuehl JV, Fourcade HM, Boore JL: Phylogenetic relationships among amphisbaenian reptiles based on complete mitochondrial genomic sequences. Mol Phylogenet Evol. 2004, 33 (1): 22-31. 10.1016/j.ympev.2004.05.003.PubMedView ArticleGoogle Scholar
- Kumazawa Y, Nishida M: Complete mitochondrial DNA sequences of the green turtle and blue-tailed mole skink: statistical evidence for archosaurian affinity of turtles. Mol Biol Evol. 1999, 16 (6): 784-792.PubMedView ArticleGoogle Scholar
- Janke A, Erpenbeck D, Nilsson M, Arnason U: The mitochondrial genomes of the iguana (Iguana iguana) and the caiman (Caiman crocodylus): implications for amniote phylogeny. Proc Biol Sci. 2001, 268 (1467): 623-631. 10.1098/rspb.2000.1402.PubMedPubMed CentralView ArticleGoogle Scholar
- Kumazawa Y, Endo H: Mitochondrial genome of the Komodo dragon: efficient sequencing method with reptile-oriented primers and novel gene rearrangements. DNA Res. 2004, 11 (2): 115-125. 10.1093/dnares/11.2.115.PubMedView ArticleGoogle Scholar
- Castresana J: Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000, 17 (4): 540-552.PubMedView ArticleGoogle Scholar
- Posada D, Crandall KA: MODELTEST: testing the model of DNA substitution. Bioinformatics. 1998, 14 (9): 817-818. 10.1093/bioinformatics/14.9.817.PubMedView ArticleGoogle Scholar
- Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19 (12): 1572-1574. 10.1093/bioinformatics/btg180.PubMedView ArticleGoogle Scholar
- Schmidt HA, Strimmer K, Vingron M, von Haeseler A: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics. 2002, 18 (3): 502-504. 10.1093/bioinformatics/18.3.502.PubMedView ArticleGoogle Scholar