- Research article
- Open Access
Venom gland transcriptomes of two elapid snakes (Bungarus multicinctus and Naja atra) and evolution of toxin genes
- Yu Jiang†1, 2,
- Yan Li†3,
- Wenhui Lee†4,
- Xun Xu1,
- Yue Zhang1,
- Ruoping Zhao1,
- Yun Zhang4Email author and
- Wen Wang1Email author
© Jiang et al; licensee BioMed Central Ltd. 2011
- Received: 3 July 2010
- Accepted: 3 January 2011
- Published: 3 January 2011
Kraits (genus Bungarus) and cobras (genus Naja) are two representative toxic genera of elapids in the old world. Although they are closely related genera and both of their venoms are very toxic, the compositions of their venoms are very different. To unveil their detailed venoms and their evolutionary patterns, we constructed venom gland cDNA libraries and genomic bacterial artificial chromosome (BAC) libraries for Bungarus multicinctus and Naja atra, respectively. We sequenced about 1500 cDNA clones for each of the venom cDNA libraries and screened BAC libraries of the two snakes by blot analysis using four kinds of toxin probes; i.e., three-finger toxin (3FTx), phospholipase A2 (PLA2), kunitz-type protease inhibitor (Kunitz), and natriuretic peptide (NP).
In total, 1092 valid expressed sequences tags (ESTs) for B. multicinctus and 1166 ESTs for N. atra were generated. About 70% of these ESTs can be annotated as snake toxin transcripts. 3FTx (64.5%) and β bungarotoxin (25.1%) comprise the main toxin classes in B. multicinctus, while 3FTx (95.8%) is the dominant toxin in N. atra. We also observed several less abundant venom families in B. multicinctus and N. atra, such as PLA2, C-type lectins, and Kunitz. Peculiarly a cluster of NP precursors with tandem NPs was detected in B. multicinctus. A total of 71 positive toxin BAC clones in B. multicinctus and N. atra were identified using four kinds of toxin probes (3FTx, PLA2, Kunitz, and NP), among which 39 3FTx-postive BACs were sequenced to reveal gene structures of 3FTx toxin genes.
Based on the toxin ESTs and 3FTx gene sequences, the major components of B. multicinctus venom transcriptome are neurotoxins, including long chain alpha neurotoxins (α-ntx) and the recently originated β bungarotoxin, whereas the N. atra venom transcriptome mainly contains 3FTxs with cytotoxicity and neurotoxicity (short chain α-ntx). The data also revealed that tandem duplications contributed the most to the expansion of toxin multigene families. Analysis of nonsynonymous to synonymous nucleotide substitution rate ratios (dN/dS) indicates that not only multigene toxin families but also other less abundant toxins might have been under rapid diversifying evolution.
- Bacterial Artificial Chromosome
- Bacterial Artificial Chromosome Clone
- Bacterial Artificial Chromosome Library
- Toxin Gene
- Venom Gland
Snake venoms comprise a diverse array of toxins that have a variety of biochemical and pharmacological functions and can be conveniently classified as hemotoxins and neurotoxins . Recently, it has been documented that most of the snake toxins were recruited or derived from normal body proteins in the common ancestor of venomous squamates (Toxicofera) or advanced snakes (Caenophidia) 100-200 million years ago (mya) [2–5]. Associated with the species radiation of advanced snakes in the Cenozoic era, a predator-prey arms race led to the explosive appearance of toxic arsenals, and typically, several toxin multigene families were generated by gene duplication, followed by fast diversification [4, 6]. The birth-and-death model was proposed to explain the emergence of taxon-specific toxin groups ; i.e., new toxin genes consistently emerged through gene duplication with the divergence of taxa, but some toxin genes got deleted or were degenerated in other lineages. However, due to the low-depth sequencing of toxin transcripts for each species, fast evolution of toxin genes, and lack of genome sequences, the detailed elaboration of snake venom evolution is still unclear.
The Elapidae family (elapids), which has approximately 300 venomous snakes in 61 genera, is a monophyletic group among advanced snakes [8, 9]. Several broad radiated lineages (diverged rapidly between around 31 and 26 mya, based on fossil records and molecular evidence ) have been identified within the Elapidae, including the Afro-Asian cobras, Oriental kraits, Asian-American corals, and Australian and marine snakes, which are well known to be the most toxic snakes in the world. So far, the gene expression profiles of venom glands from four species [10–12] have been reported using EST sequencing. However, the kraits (genus Bungarus) and cobras (genus Naja), as the most diverse and representative toxic elapids in the old world , lack genomic and venom EST data.
In the present study, we prepared cDNA libraries from the venom glands of the two representative old world elapid snakes, Bungarus multicinctus and Naja atra, and sequenced about 1500 clones for each library. We also constructed genomic bacterial artificial chromosome (BAC) libraries for the two snakes and conducted a screen for venom genes. Our results identified many new snake toxins, such as multiple groups of 3FTxs, some novel lectins, and a peculiar natriuretic peptide (NP), and revealed that toxin genes have experienced rapid evolution and gene expansion.
Venom gland cDNA libraries and ESTs
Composition of ESTs obtained from B. multicinctus and N. atra venom glands
Number of ESTs
Number of clusters
Representation over total clones (%)
In snake venoms, 3FTx, PLA2, Kunitz, C-type lectins, and metalloproteinase have been reported to be the major multigene toxin families , and they comprised 97.1% in B. multicinctus and 98.0% in N. atra in our toxin EST data. We also identified a peculiar cluster of NP precursors with tandem NPs. Besides, β bungarotoxin is a Bungarous-specific toxin, which is a heterodimeric protein complex of phospholipase A2 (A chain) and kunitz peptides (B chain) covalently linked by one disulphide bridge [4, 14]. Here, we detected 7 clusters (108 ESTs) of A chain of β bungarotoxin and 4 clusters (83 ESTs) of B chain as a major toxin in B. multicinctus (25.1% of all toxin ESTs).
Analysis of toxin genes
Three-finger toxins (3FTxs)
3FTxs are neurotoxins and can bind to very different neurological targets (neuronal, pre-synaptic, postsynaptic, and synaptic); some of them also have non-neurotoxic activities (cytotoxicity and platelet inhibition) [4, 5]. Its family members have a similar structure, with three loops and four conserved disulfide bridges, and contain about 60-74 amino acid residues. The ancestral 3FTx was thought to have 10 cysteines and acquired some amelioration: deletion of the ancestral C2 and C3 cysteines potentiated alpha-neurotoxic activity and acquired additional newly evolved cysteines . In 2003, Fry grouped 33 types of 3FTxs from 276 protein sequences, with a number of "orphan groups" of 3FTxs whose functional roles are not yet known . Since then, many other kinds of 3FTxs have been successively detected [12, 15, 16]. So far, more than five hundred variant 3FTx sequences across the entire advanced snakes have been observed. A birth-and-death mode  and a mechanism of duplication and divergence , allowing venom snakes to adapt to a variety of prey, were proposed to explain the generation of the 3ftx superfamily. But, the evolutionary relationships of all kinds of 3FTxs are still uncertain, such as the notable alpha neurotoxic clades (short chain, long chain, kappa, and type III), which are not monophyletic, based on the phylogenetic tree of 3FTxs constructed by Fry . It is in contrast to previous hypotheses that alpha neurotoxins (α-ntx) diverged from a common ancestral gene that separated before the divergence of the elapid sub-family .
Likelihood ratio test of positive selection in toxins
ω (p 1 )
ω (p 1 )
We also noticed a special contig, bm047 (a homologous protein previously isolated in Bungarus candidus by Kuhn et al. , named bucandin), which was conspicuously different from other contigs and even had a unique signal peptide (Figure 2). Based on the phylogenetic analysis, contig bm047 might have diverged from other 3FTx genes at an early stage of the origin of snakes (data not shown). It is possible that bm047 has a conserved function and has not diversified into a multi-copy gene family.
Phospholipase A2 and A chain of β bungarotoxin (PLA2)
Elapidae venom PLA2 exerts a multiplicity of novel, nonenzymatic activities, including neurotoxic and antiplatelet activity (group I), whereas viperid venom PLA2 is synovial-type (group II) PLA2 . The elapid PLA2, which has 120 amino acids and 6 or 7 disulfide bonds, was thought to be derived independently from nontoxic pancreatic-type PLA2 .
We identified four clusters of PLA2-like ESTs with intact coding sequence (CDS) in B. multicinctus and one full-length cluster in N. atra, which covered the major PLA2 types detected before in B. multicinctus and N. atra. In B. multicinctus, one cluster (bm008) enclosing 21 ESTs showed 100% identity with the PLA2 sequence of B. multicinctus in GenBank (X53406.1). Another three clusters in B. multicinctus were similar to the A chain of β bungarotoxins, with a notable substitution at residue 99 (from Leu to Cyc) compared with cluster bm008, which may be important for the formation of interchain disulfides in β bungarotoxin. Based on the gene tree, it is clear that A chain of β bungarotoxin in B. multicinctus quite possibly duplicated from the PLA2 gene in B. multicinctus. We estimated an average dS value of 0.27 between PLA2 contig bm008 and the A chain of β bungarotoxins, which is smaller than the dS value (0.29) between bm008 and the N. atra PLA2 na022, indicating that the A chain of β bungarotoxin might have derived from PLA2 after the divergence of Bungarus and Naja.
Kunitz-type protease inhibitor and B chain of β bungarotoxin (Kunitz)
Kunitz belongs to the superfamily of bovine pancreatic trypsin-like inhibitors (BPTIs), with the ancestral function of inhibiting a diverse array of serine proteases and with a peptide chain of around 60 amino acid residues and 3 disulfide bonds [3, 24]. It evolved the toxin ability to inhibit various physiological processes, such as blood coagulation, fibrinolysis, host defense, and action potential transduction. In elapid venoms, besides the common Kunitz with trypsin or chymotrypsin inhibitor activity, the dendrotoxins in Dendroaspis have presynaptic neurotoxicity , and the B chain of β bungarotoxin is thought to act as an affinity probe to guide the A chain to its target .
In B. multicinctus, four clusters (83 ESTs) of the B chain of a β bungarotoxin-like sequence and one partial kunitz-type protease inhibitor-like cluster (2 ESTs) were obtained. In N. atra, only one singlet of a kunitz-type protease inhibitor-like EST was detected. To understand the evolutionary relationship of the kunitz-type protease inhibitor and B chain of β bungarotoxin, we constructed a phylogenetic tree (Additional file 2: Figure S1). We also obtained an average ~0.3 dS value between the kunitz-type protease inhibitor and B chain of β bungarotoxin. Compared with the similar dS value of 0.27 between PLA2 contig bm008 and A chain of β bungarotoxin, we propose that the A chain and B chain of β bungarotoxin evolved separately from PLA2 and Kuntiz around the same time as the common ancestor of Bungarus.
C-type lectins or C-type lectin-like proteins are ubiquitous components of animal venoms and contain distinct diversified subgroups, including true lectins, coagulant proteins, platelet aggregation agonists, and platelet aggregation antagonists . The elapid C-type lectins are arranged inside the true lectins group, except one M. corallinus C-type lectin . The true lectins cause aggregation of erythrocytes in a dose-dependent manner, which are normally composed of two covalently linked identical subunits, each consisting of 135-141 amino acids .
Here, 29 C-type lectins-like ESTs were grouped into 9 clusters (4 contigs and 5 singlets) in B. multicinctus, while only one contig from 5 ESTs was identified in N. atra. Interestingly, all of these clones do not match the previously sequenced C-type lectins in B. multicinctus and N. atra. Phylogenetic analysis revealed that a cluster of three contigs (bm016, bm017, bm018) that we sequenced were grouped separately from all of the known Bungarus lectins, indicating that this is a novel cluster that diverged from the main group (Additional file 2: Figure S2).
Metalloproteinase is the dominant component in viperid venoms, which exerts anticoagulant or coagulant effects and results in a severe bleeding by interfering with blood coagulation and hemostatic plug formation or by degrading the basement membrane or extracellular matrix components of the victims . But, few metalloproteinases are detected in elapids , which produce long cDNA products (about 1.8 kb). Here, three partial singlet ESTs in N. atra represented two kinds of metalloproteinase isoforms, which were respectively similar to EF080840 in N. atra and AY101383 in Naja mossambica as P-III type metalloproteinases.
Natriuretic peptide (NP)
The NP family functions to control natriuresis, diuresis, blood pressure, homeostasis, and inhibition of aldosterone secretion in all vertebrates or is used by snakes to interrupt these physiological processes of preys [29, 30] and thus has attracted attention as ideal candidates of hypotensive and vasodilator agents. It has a conserved ring structure, consisting of 17 amino acids, with extensions of a few amino acids in the two termini. Four major members of the NP family have been indentified in vertebrates--atrial NP (ANP), B-type NP (BNP), C-type NP (CNP), and ventricular NP (VNP)--and structurally, CNP lacks the C-terminal tail, whereas other NPs have both tails [29, 31]. All of the snake venom NPs seem to have originated from a common ancestral venom, CNP, and a complete snake CNP pro-peptide is constituted by a signal peptide, a linker, a flexible N-terminal extension (also named the CNP-53 isoform, with a 17-aa core ring), and a mature peptide (CNP-22). Nevertheless, snake NPs have recruited some new segments, such as several bradykinin-potentiating peptides (BPPs), located in the NP precursor in the Viperidae family, whereas an alterable C-terminal extension was acquired in the Elapidae family, which may result in novel physiological actions [13, 32, 33].
In the cDNA libraries, except for the major multigene toxin families, there were some other toxins that were widely found in snake venoms but that only constituted a small part of the total ESTs in a species. Most of these toxins had been recruited into the venom system before the last common ancestor of elapid and viperid snakes with assistant toxic functions or unknown functions, such as CRISP, NGF, LAO, cystatin, and acetylcholinesterase [4, 5]. Besides, vespryn (also named Ohanin, as first detected in Ophiophagus hannah) and prothrombin activator (similar to the mammalian blood coagulation factor Xa) were only found in that elapid species so far .
In this study, we obtained many full or partial novel sequences of these less abundant toxins: one cluster in B. multicinctus and three clusters in N. atra coding for CRISP, one cluster in B. multicinctus and one cluster in N. atra coding for NGF, one cluster in B. multicinctus and one cluster in N. atra coding for LAO, three clusters in B. multicinctus and one cluster in N. atra coding for vespryn, one cluster coding for cystatin, and one cluster coding for acetylcholinesterase in N. atra.
Comparison of the major venom components among elapid snakes
Previously several other transcriptomes of the elapid venom gland had been reported, such as marine snakes (Lapemis curtus and Acalyptophis peronii), Australian snake (Austrelaps labialis), and coral snake (Micrurus corallines) [10–12]. However, besides 3FTxs, PLA2 is also another major venom component in these species. We also noticed that the major types, 3FTxs, are very different. The major 3FTx types from the marine and Australian snakes are long chain α-ntx and short chain α-ntx, whereas Micrurus corallines mainly contains orphan gourp XII and some unknown groups of 3FTxs. In contrast, the major components of B. multicinctus venom transcriptome are 3FTxs (mainly belong to long chain α-ntx group) and recently originated β bungarotoxin while the N. atra venom transcriptome mainly contains cytotoxin and short chain α-ntx groups of 3FTx. Other less abundant toxins were not observed in the transcriptomes of the marine and Australian snakes, maybe due to the small number of sequenced clones.
Besides transcripts, some studies also described peptide sequences for some elapid snake toxins. A study characterizing venom proteins of N. atra identified 124 protein segments or peptides , 74% of which belonged to 3FTxs and 11% were PLA2. In another study of Naja kaouthia venom proteomics , 61 venom proteins segments or peptides were identified, most of which were covered by our EST sequence, except the cobra venom factors. Two studies on coral snakes identified 26 and 11 toxin proteins or peptides, respectively [39, 40], 3FTx and PLA2 proteins were also the major toxin components. So, the major toxins are basically consistent with our results observed in the N. atra venom gland transcriptome while Bungarus has quite different components. However, it seems the PLA2 transcripts have higher expression efficiency, considering that the PLA2 ESTs only comprise 1.2% of our toxin ESTs in N. atra while PLA2 peptides were observed to compose of a considerable amount in the N. atra venom .
Analyses on the ratios of nonsynonymous to synonymous substitution rates (dN/dS) for the protein-coding ESTs of all toxin genes
It has been known that the 3FTx and PLA2 toxin multigene families have been subjected to positive Darwinian selection [6, 41], but whether this has happened in the less abundant toxins is still unknown. We used the maximum likelihood model to estimate the dN/dS ratios of all kinds of toxins in elapid snake venoms. The representative sequences used here were from our EST data or downloaded from the GenBank database (Additional file 2: Table S1). Except for orphan group II of 3FTxs in B. multicinctus, almost all of the toxins had some sites that showed evidence of positive selection dN/dS ratios, and the dN/dS ratios for these sites ranged from 3.01 to 19.52 (Table 2). These results suggest that most toxin genes in elapid venoms might have been subjected to strong pressure of prey species shifts or arsenal competition, leading to the accelerated innovation of antigenic epitopes in toxins.
BAC Library construction and sequencing of 3FTx genes
In order to understand the gene structure of toxin genes, two independent BAC libraries containing 73,728 (B. multicinctus) and 82,944 (N. atra) clones, with average inserted genomic segments of 120 and 100 kb respectively, were constructed and arrayed into 384-well microtiter plates. Examples of 16 digested clones are presented in Additional file 2: Figure S3. We estimated that the percentage of clones without inserts was about 10%. Based on the snake's haploid genome size of 2.38 pg for Bungarus fasciatus and 2.59 pg for Naja haje , we estimated the genome coverage of the libraries to be around 3.3 × for B. multicinctus and 2.9 × for N. atra. These coverages theoretically provide 97.6% and 96.0% probability of obtaining any unique sequence in the B. multicinctus and N. atra BAC libraries, respectively, assuming random cloning .
Number of positive toxin BAC clones identified by different toxin probes
number of positive toxin BAC clones
probe types used in blot screening
3FTx long chain α-ntx
3FTx short chain α-ntx
3FTx orphan group IV
3FTx orphan group I
3FTx orphan group IXX
3FTx orphan group II
β bungarotoxin A-chain
β bungarotoxin B-chain
Sum (remove overlap clones)
Totaled with 7 probes
Totaled with 5 probes
Pairwise genetic distance among putative tandem duplicates
Average pairwise distance
Because the coding sequences of 3FTx genes are short and too diversified, it is difficult to construct an accurate phylogenetic relationship for all these diverse subgroups of 3FTx. So, we tried to use only the intron sequences for genic phylogenetic analyses, especially using intron II, which has a stable length of about 550 bp . But, the 3FTx gene tree (Figure 5) still confuses us with multiple radiated sub-families, just like the phylogenetic relationship of the 3FTx constructed by Fry [4, 7]. Taking α-ntx for example, we observed two monophyletic lineages of short chain α-ntx from N. atra and two monophyletic lineages of long chain α-ntx, respectively, from B. multicinctus and N. atra, which indicates that multiple groups of α-ntx separated before the divergence of Bungarus and Naja. Considering that the long chain α-ntx originated mainly by one splicing site shifting event from short chain α-ntx  and that both the short and long chain α-ntx coexist in some Australian and marine snakes, such as Austrelaps labialis , Lapemis curtus , and Laticauda semifasciata , the hypotheses that α-ntx diverged from a common ancestor seems reasonable. So, it is very likely that these broad radiated groups of 3FTx (containing the multiple α-ntx, cytotoxin, and other orphan 3FTx groups) appeared right before the explosive speciation of major elapid subfamilies in the mid-Oligocene . In brief, our data suggest that 3FTx genes probably experienced family explosion and thereby generated the major groups before the species radiation of Elapidae subfamilies. After that, by massive duplication, perhaps mainly by tandem duplication, the 3FTx gene family acquired numerous new members and became the most abundant toxin family in the two elapid venoms.
Both B. multicinctus and N. atra venoms are very toxic, respectively, with LD50 of 0.108 mg/kg and 0.29 mg/kg when injected subcutaneously (Venomdoc database from Fry), and they belong to two closely related genera . However, the major components of the B. multicinctus venom transcriptome are neurotoxins, including long chain α-ntx and β bungarotoxin, whereas the N. atra venom transcriptome mainly has cytotoxicity and a little bit of neurotoxicity from short chain α-ntx. In this study, we present the toxin profiles of B. multicinctus and N. atra by sequencing their venom gland transcriptomes. Then, the BAC libraries for these two elapids were constructed, and 25 BACs containing 3FTx genes were partially sequenced. The data revealed that tandem duplications contributed the majority of the expansions of toxin multigene families in the two elapids. We detected positive selection in every toxin subfamily and found that not only the multigene toxin families but also the less abundant toxins were under rapid adaptive evolution.
Fresh venom glands and blood cells were obtained, respectively, from an individual of Bungarus multicinctus and Naja atra, both of which were collected from Zhejiang province, China. The protocol was approved by the ethics committee of Kunming Institute of Zoology, CAS, China.
cDNA library construction and sequencing
Total RNA was extracted from venom glands using the RNeasy Mini Kit (Qiagen, Germany). The mRNA was purified from total RNA, using the Oligotex mRNA kit (Qiagen). The purified mRNA was used to make cDNA library, following the instructions in the SuperScript Plasmid System for cDNA Synthesis and Cloning Kit (plasmid used: pCMV-SPORT6 for B. multicinctus, pSPORT1 for N. atra) (Invitrogen, USA). Plasmids were purified using the QIAprep spin miniprep kit (Qiagen). Purified plasmids were sequenced by cycle sequencing reactions using the BigDye Terminator v3.1 kit (Applied Biosystem, USA) and an automated DNA sequencer (Model 3100A, Applied Biosystem, USA).
cDNA sequence cluster assembly
After removing the vector, adaptors, and low-quality sequences, ESTs were assembled into contiguous clusters using the CAP3 program , with the setting that only joined sequences with at least 95% identity. Each cluster (contigs with more than one EST or singlets with one EST) was then searched against the GenBank databases using BLASTX and BLASTN algorithms to identify similar sequences with an e-value cutoff < 10-5, as described by Ching AT et al. . The signal peptide was predicted using the local SignalP 3.0 server . A final annotation table was generated in Microsoft Excel format, containing all the relevant information about clusters and singlets. EST sequences were deposited in GenBank dbEST under accession numbers HO056271 to HO058536.
Bacterial artificial chromosome (BAC) library construction and characterization
High-molecular-weight DNA was extracted from snake blood cells, partially digested with Eco R I, and cloned into the CopyControl pCC1BAC vector (Epicentre, USA) according to a previously described method . To estimate the recombination ratio of the libraries and the average size of inserted DNA fragments, 100 randomly selected BAC clone DNAs were digested with Not I and separated by PFGE, as described by Wang et al. .
Screening the snake BAC genomic library
All the colonies in 384-well microtiter plates were replica-plated onto nylon membranes for hybridization using standard techniques . Two rounds of screening were performed. In the first one, a mixture of 12 probes was used, which represented four kinds of toxin genes. These probes were respectively prepared for the consensus toxin coding sequences by labeling their PCR product with digoxigenin. They were designed based on the cDNA sequences of 3FTx, β bungarotoxin A-chain (or PLA2), β bungarotoxin B-chain, and NP (Additional file 2: Table S3). Then, the 79 positive clones were respectively hybridized with each kind of probe to exclude false positives and identify the concrete toxin gene type.
Subcloning 3FTx genes and sequencing
All 39 3FTx gene-positive clones, verified by colony hybridization, were digested with Eco R I and analyzed on 0.8% agarose gels prior to blotting on a nylon membrane. Then, 3FTx gene-containing fragments were identified by hybridization with the same probes used in the library screening and subcloned into the CopyControl pCC1BAC vector. These subclones were then tested by PCR using specific primers, designed based on the sequence of the toxin ESTs (Additional file 2: Table S3). PCR amplifications were then ligated into the pMD18-T vector (TaKaRa, Dalian), and the fragments were sequenced using an ABI 3100A Genetic Analyzer. The sequences obtained from the same subclone were assembled into error-free consensus sequences using SeqMan II software (DNAStar, Version 6.0). These sequences were compared with our ESTs and published sequences in the GenBank database using the BLASTn program. Exon-intron structures of genes were double-checked using the Sim4 Program for spliced alignments .
DNA or protein sequences were aligned with the MUSCLE program  with manual adjustments. The number of nucleotide substitutions per site (Kn) in the non-coding regions (introns) was estimated using the Kimura 2-parameter model, and the numbers of nucleotide substitutions per synonymous site (dN) and per nonsynonymous (dS) in the putative protein-coding regions were computed for pairs according to the method of the realistic evolutionary model in PAML . Neighbor-joining (NJ) trees were constructed using the software MEGA version 4 , and the maximum likelihood (ML) trees were constructed using phylip-3.68 , evaluated by 1000 bootstrap replications, considering ML bootstrap proportions > 70% to indicate significant support, with transition-transversion ratios estimated from the data (implemented in tree-puzzle-5.2 ). There are several inframe indels in the coding region of ESTs, and those sites were removed in the sequence analysis.
In order to detect selection in a toxin subfamily, the likelihood ratio test (LRT), implemented in PAML4.2, was employed to detect positive selection using both the tests of M1a versus M2a and M8 versus M7 comparisons with the phylogeny-based approach [19, 56]. The putative coding sequences of ESTs that showed high divergence from the other sequences in a sub-family were removed from the analysis.
We thank Xin Li and Qi Zhou for helpful discussions. This work was supported by the 100 Talents Program of Chinese Academy of Sciences and a CAS-Max Planck Society Fellowship to WW; and grants from the 973 Program (2010CB529800) and the Chinese National Natural Science Foundation (30630014) to YZ.
- Calvete JJ, Sanz L, Angulo Y, Lomonte B, Gutierrez JM: Venoms, venomics, antivenomics. FEBS Lett. 2009, 583 (11): 1736-1743. 10.1016/j.febslet.2009.03.029.PubMedView ArticleGoogle Scholar
- Fry BG, Vidal N, Norman JA, Vonk FJ, Scheib H, Ramjan SF, Kuruppu S, Fung K, Hedges SB, Richardson MK, et al: Early evolution of the venom system in lizards and snakes. Nature. 2006, 439 (7076): 584-588. 10.1038/nature04328.PubMedView ArticleGoogle Scholar
- Fry BG, Scheib H, van der Weerd L, Young B, McNaughtan J, Ramjan SF, Vidal N, Poelmann RE, Norman JA: Evolution of an arsenal: structural and functional diversification of the venom system in the advanced snakes (Caenophidia). Mol Cell Proteomics. 2008, 7 (2): 215-246.PubMedView ArticleGoogle Scholar
- Fry BG, Vidal N, van der Weerd L, Kochva E, Renjifo C: Evolution and diversification of the Toxicofera reptile venom system. J Proteomics. 2009, 72 (2): 127-136. 10.1016/j.jprot.2009.01.009.PubMedView ArticleGoogle Scholar
- Fry BG: From genome to "venome": molecular origin and evolution of the snake venom proteome inferred from phylogenetic analysis of toxin sequences and related body proteins. Genome Res. 2005, 15 (3): 403-420. 10.1101/gr.3228405.PubMedPubMed CentralView ArticleGoogle Scholar
- Kordis D, Gubensek F: Adaptive evolution of animal toxin multigene families. Gene. 2000, 261 (1): 43-52. 10.1016/S0378-1119(00)00490-X.PubMedView ArticleGoogle Scholar
- Fry BG, Wuster W, Kini RM, Brusic V, Khan A, Venkataraman D, Rooney AP: Molecular evolution and phylogeny of elapid snake venom three-finger toxins. J Mol Evol. 2003, 57 (1): 110-129. 10.1007/s00239-003-2461-2.PubMedView ArticleGoogle Scholar
- Vonk FJ, Admiraal JF, Jackson K, Reshef R, de Bakker MA, Vanderschoot K, van den Berge I, van Atten M, Burgerhout E, Beck A, et al: Evolutionary origin and development of snake fangs. Nature. 2008, 454 (7204): 630-633. 10.1038/nature07178.PubMedView ArticleGoogle Scholar
- Kelly C, Barker N, Villet M, Broadley D: Phylogeny, biogeography and classification of the snake superfamily Elapoidea: a rapid radiation in the late Eocene. Cladistics. 2009, 25 (1): 38-63. 10.1111/j.1096-0031.2008.00237.x.View ArticleGoogle Scholar
- Pahari S, Bickford D, Fry BG, Kini RM: Expression pattern of three-finger toxin and phospholipase A2 genes in the venom glands of two sea snakes, Lapemis curtus and Acalyptophis peronii: comparison of evolution of these toxins in land snakes, sea kraits and sea snakes. BMC Evol Biol. 2007, 7: 175-10.1186/1471-2148-7-175.PubMedPubMed CentralView ArticleGoogle Scholar
- Doley R, Tram NN, Reza MA, Kini RM: Unusual accelerated rate of deletions and insertions in toxin genes in the venom glands of the pygmy copperhead (Austrelaps labialis) from Kangaroo island. BMC Evol Biol. 2008, 8: 70-10.1186/1471-2148-8-70.PubMedPubMed CentralView ArticleGoogle Scholar
- Leao LI, Ho PL, Junqueira-de-Azevedo Ide L: Transcriptomic basis for an antiserum against Micrurus corallinus (coral snake) venom. BMC Genomics. 2009, 10: 112-10.1186/1471-2164-10-112.PubMedPubMed CentralView ArticleGoogle Scholar
- Ching AT, Rocha MM, Paes Leme AF, Pimenta DC, de Fatima DFM, Serrano SM, Ho PL, Junqueira-de-Azevedo IL: Some aspects of the venom proteome of the Colubridae snake Philodryas olfersii revealed from a Duvernoy's (venom) gland transcriptome. FEBS Lett. 2006, 580 (18): 4417-4422. 10.1016/j.febslet.2006.07.010.PubMedView ArticleGoogle Scholar
- Rowan E: What does [beta]-bungarotoxin do at the neuromuscular junction?. Toxicon. 2001, 39 (1): 107-118. 10.1016/S0041-0101(00)00159-8.PubMedView ArticleGoogle Scholar
- Pawlak J, Mackessy SP, Sixberry NM, Stura EA, Le Du MH, Menez R, Foo CS, Menez A, Nirthanan S, Kini RM: Irditoxin, a novel covalently linked heterodimeric three-finger toxin with high taxon-specific neurotoxicity. FASEB J. 2009, 23 (2): 534-545. 10.1096/fj.08-113555.PubMedView ArticleGoogle Scholar
- Li J, Zhang H, Liu J, Xu K: Novel genes encoding six kinds of three-finger toxins in Ophiophagus hannah (king cobra) and function characterization of two recombinant long-chain neurotoxins. Biochem J. 2006, 398 (2): 233-242. 10.1042/BJ20060004.PubMedPubMed CentralView ArticleGoogle Scholar
- Jeyaseelan K, Poh SL, Nair R, Armugam A: Structurally conserved alpha-neurotoxin genes encode functionally diverse proteins in the venom of Naja sputatrix. FEBS Lett. 2003, 553 (3): 333-341. 10.1016/S0014-5793(03)01039-1.PubMedView ArticleGoogle Scholar
- Tamiya T, Fujimi TJ: Molecular evolution of toxin genes in Elapidae snakes. Mol Divers. 2006, 10 (4): 529-543. 10.1007/s11030-006-9049-x.PubMedView ArticleGoogle Scholar
- Yang Z: PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007, 24 (8): 1586-1591. 10.1093/molbev/msm088.PubMedView ArticleGoogle Scholar
- Li W-H: Molecular Evolution. Sinauer. 1997, Sunderland, MAGoogle Scholar
- Kuhn P, Deacon AM, Comsa DS, Rajaseger G, Kini RM, Uson II, Kolatkar PR: The atomic resolution structure of bucandin, a novel toxin isolated from the malayan krait, determined by direct methods. erratum. Acta Crystallogr D Biol Crystallogr. 2000, 56 (Pt 12): 1702-10.1107/S090744490001739X.PubMedView ArticleGoogle Scholar
- Heinrikson RL, Krueger ET, Keim PS: Amino acid sequence of phospholipase A2-alpha from the venom of Crotalus adamanteus. A new classification of phospholipases A2 based upon structural determinants. J Biol Chem. 1977, 252 (14): 4913-4921.PubMedGoogle Scholar
- Fujimi TJ, Kariya Y, Tsuchiya T, Tamiya T: Nucleotide sequence of phospholipase A(2) gene expressed in snake pancreas reveals the molecular evolution of toxic phospholipase A(2) genes. Gene. 2002, 292 (1-2): 225-231. 10.1016/S0378-1119(02)00682-0.PubMedView ArticleGoogle Scholar
- Yuan C, He Q, Peng K, Diao J, Jiang L, Tang X, Liang S: Discovery of a distinct superfamily of Kunitz-type toxin (KTT) from tarantulas. PLoS One. 2008, 3: 10-View ArticleGoogle Scholar
- Harvey AL, Karlsson E: Protease inhibitor homologues from mamba venoms: facilitation of acetylcholine release and interactions with prejunctional blocking toxins. Br J Pharmacol. 1982, 77 (1): 153-161.PubMedPubMed CentralView ArticleGoogle Scholar
- Ogawa T, Chijiwa T, Oda-Ueda N, Ohno M: Molecular diversity and accelerated evolution of C-type lectin-like proteins from snake venom. Toxicon. 2005, 45 (1): 1-14. 10.1016/j.toxicon.2004.07.028.PubMedView ArticleGoogle Scholar
- Matsui T, Fujimura Y, Titani K: Snake venom proteases affecting hemostasis and thrombosis. Biochim Biophys Acta. 2000, 1477 (1-2): 146-156. 10.1016/S0167-4838(99)00268-X.PubMedView ArticleGoogle Scholar
- Guo XX, Zeng L, Lee WH, Zhang Y, Jin Y: Isolation and cloning of a metalloproteinase from king cobra snake venom. Toxicon. 2007, 49 (7): 954-965. 10.1016/j.toxicon.2007.01.003.PubMedView ArticleGoogle Scholar
- Inoue K, Takei Y: Molecular evolution of the natriuretic peptide system as revealed by comparative genomics. Comparative Biochemistry and Physiology Part D: Genomics and Proteomics. 2006, 1 (1): 69-76. 10.1016/j.cbd.2005.10.002.Google Scholar
- St Pierre L, Flight S, Masci PP, Hanchard KJ, Lewis RJ, Alewood PF, de Jersey J, Lavin MF: Cloning and characterisation of natriuretic peptides from the venom glands of Australian elapids. Biochimie. 2006, 88 (12): 1923-1931. 10.1016/j.biochi.2006.06.014.PubMedView ArticleGoogle Scholar
- Trajanovska S, Donald J: Molecular cloning of natriuretic peptides from the heart of reptiles: loss of ANP in diapsid reptiles and birds. General and comparative endocrinology. 2008, 156 (2): 339-346. 10.1016/j.ygcen.2008.01.013.PubMedView ArticleGoogle Scholar
- Baxter G: The natriuretic peptides. Basic research in cardiology. 2004, 99 (2): 71-75. 10.1007/s00395-004-0457-8.PubMedView ArticleGoogle Scholar
- Fry BG, Wickramaratana JC, Lemme S, Beuve A, Garbers D, Hodgson WC, Alewood P: Novel natriuretic peptides from the venom of the inland taipan (Oxyuranus microlepidotus): isolation, chemical and biological characterisation. Biochem Biophys Res Commun. 2005, 327 (4): 1011-1015. 10.1016/j.bbrc.2004.11.171.PubMedView ArticleGoogle Scholar
- Ho PL, Soares MB, Maack T, Gimenez I, Puorto G, Furtado MF, Raw I: Cloning of an unusual natriuretic peptide from the South American coral snake Micrurus corallinus. Eur J Biochem. 1997, 250 (1): 144-149. 10.1111/j.1432-1033.1997.00144.x.PubMedView ArticleGoogle Scholar
- Schweitz H, Vigne P, Moinier D, Frelin C, Lazdunski M: A new member of the natriuretic peptide family is present in the venom of the green mamba (Dendroaspis angusticeps). J Biol Chem. 1992, 267 (20): 13928-13932.PubMedGoogle Scholar
- Reza MA, Swarup S, Kini RM: Structure of two genes encoding parallel prothrombin activators in Tropidechis carinatus snake: gene duplication and recruitment of factor X gene to the venom gland. J Thromb Haemost. 2007, 5 (1): 117-126. 10.1111/j.1538-7836.2006.02266.x.PubMedView ArticleGoogle Scholar
- Li S, Wang J, Zhang X, Ren Y, Wang N, Zhao K, Chen X, Zhao C, Li X, Shao J, et al: Proteomic characterization of two snake venoms: Naja naja atra and Agkistrodon halys. Biochem J. 2004, 384 (Pt 1): 119-127.PubMedPubMed CentralView ArticleGoogle Scholar
- Kulkeaw K, Chaicumpa W, Sakolvaree Y, Tongtawe P, Tapchaisri P: Proteome and immunome of the venom of the Thai cobra, Naja kaouthia. Toxicon. 2007, 49 (7): 1026-1041. 10.1016/j.toxicon.2007.01.019.PubMedView ArticleGoogle Scholar
- Olamendi-Portugal T, Batista CV, Restano-Cassulini R, Pando V, Villa-Hernandez O, Zavaleta-Martinez-Vargas A, Salas-Arruz MC, Rodriguez de la Vega RC, Becerril B, Possani LD: Proteomic analysis of the venom from the fish eating coral snake Micrurus surinamensis: novel toxins, their function and phylogeny. Proteomics. 2008, 8 (9): 1919-1932. 10.1002/pmic.200700668.PubMedView ArticleGoogle Scholar
- Dokmetjian JC, Del Canto S, Vinzon S, de Jimenez Bonino MB: Biochemical characterization of the Micrurus pyrrhocryptus venom. Toxicon. 2009, 53 (3): 375-382. 10.1016/j.toxicon.2008.12.015.PubMedView ArticleGoogle Scholar
- Lynch VJ: Inventing an arsenal: adaptive evolution and neofunctionalization of snake venom phospholipase A2 genes. BMC Evol Biol. 2007, 7: 2-10.1186/1471-2148-7-2.PubMedPubMed CentralView ArticleGoogle Scholar
- De Smet W: The nuclear Feulgen-DNA content of the vertebrates (especially reptiles), as measured by fluorescence cytophotometry, with notes on the cell and chromosome size. Acta Zool Pathol Antverp. 1981, 76: 119-167.Google Scholar
- Clarke L, Carbon J: A colony bank containing synthetic Col El hybrid plasmids representative of the entire E. coli genome. Cell. 1976, 9 (1): 91-99. 10.1016/0092-8674(76)90055-6.PubMedView ArticleGoogle Scholar
- Fujimi TJ, Nakajyo T, Nishimura E, Ogura E, Tsuchiya T, Tamiya T: Molecular evolution and diversification of snake toxin genes, revealed by analysis of intron sequences. Gene. 2003, 313: 111-118. 10.1016/S0378-1119(03)00637-1.PubMedView ArticleGoogle Scholar
- Huang X, Madan A: CAP3: A DNA sequence assembly program. Genome Res. 1999, 9 (9): 868-877. 10.1101/gr.9.9.868.PubMedPubMed CentralView ArticleGoogle Scholar
- Bendtsen JD, Nielsen H, von Heijne G, Brunak S: Improved prediction of signal peptides: SignalP 3.0. J Mol Biol. 2004, 340 (4): 783-795. 10.1016/j.jmb.2004.05.028.PubMedView ArticleGoogle Scholar
- Wang Z, Miyake T, Edwards SV, Amemiya CT: Tuatara (Sphenodon) genomics: BAC library construction, sequence survey, and application to the DMRT gene family. J Hered. 2006, 97 (6): 541-548. 10.1093/jhered/esl040.PubMedView ArticleGoogle Scholar
- Wang W, Xu HL, Lin LP, Su B, Wang YQ: Construction of a BAC library for Chinese amphioxus Branchiostoma belcheri and identification of clones containing Amphi-Pax genes. Genes Genet Syst. 2005, 80 (3): 233-236. 10.1266/ggs.80.233.PubMedView ArticleGoogle Scholar
- Chiu CH, Amemiya CT, Carr JL, Bhargava J, Hwang JK, Shashikant CS, Ruddle FH, Wagner GP: A recombinogenic targeting method to modify large-inserts for cis-regulatory analysis in transgenic mice: construction and expression of a 100-kb, zebrafish Hoxa-11b-lacZ reporter gene. Dev Genes Evol. 2000, 210 (2): 105-109. 10.1007/s004270050016.PubMedView ArticleGoogle Scholar
- Florea L, Hartzell G, Zhang Z, Rubin GM, Miller W: A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res. 1998, 8 (9): 967-974.PubMedPubMed CentralGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.PubMedPubMed CentralView ArticleGoogle Scholar
- Yang Z, Nielsen R: Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. Mol Biol Evol. 2000, 17 (1): 32-43.PubMedView ArticleGoogle Scholar
- Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24 (8): 1596-1599. 10.1093/molbev/msm092.PubMedView ArticleGoogle Scholar
- Felsenstein J: PHYLIP (phylogeny inference package) version 3.6. Distributed by the author Department of Genome Sciences, University of Washington, Seattle. 2005Google Scholar
- Schmidt H, Strimmer K, Vingron M, Von Haeseler A: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics. 2002, 18 (3): 502-10.1093/bioinformatics/18.3.502.PubMedView ArticleGoogle Scholar
- Yang Z, Nielsen R, Goldman N, Pedersen AM: Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics. 2000, 155 (1): 431-449.PubMedPubMed CentralGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.