- Research article
- Open Access
Snake venoms are integrated systems, but abundant venom proteins evolve more rapidly
BMC Genomicsvolume 16, Article number: 647 (2015)
While many studies have shown that extracellular proteins evolve rapidly, how selection acts on them remains poorly understood. We used snake venoms to understand the interaction between ecology, expression level, and evolutionary rate in secreted protein systems. Venomous snakes employ well-integrated systems of proteins and organic constituents to immobilize prey. Venoms are generally optimized to subdue preferred prey more effectively than non-prey, and many venom protein families manifest positive selection and rapid gene family diversification. Although previous studies have illuminated how individual venom protein families evolve, how selection acts on venoms as integrated systems, is unknown.
Using next-generation transcriptome sequencing and mass spectrometry, we examined microevolution in two pitvipers, allopatrically separated for at least 1.6 million years, and their hybrids. Transcriptomes of parental species had generally similar compositions in regard to protein families, but for a given protein family, the homologs present and concentrations thereof sometimes differed dramatically. For instance, a phospholipase A2 transcript comprising 73.4 % of the Protobothrops elegans transcriptome, was barely present in the P. flavoviridis transcriptome (<0.05 %). Hybrids produced most proteins found in both parental venoms. Protein evolutionary rates were positively correlated with transcriptomic and proteomic abundances, and the most abundant proteins showed positive selection. This pattern holds with the addition of four other published crotaline transcriptomes, from two more genera, and also for the recently published king cobra genome, suggesting that rapid evolution of abundant proteins may be generally true for snake venoms. Looking more broadly at Protobothrops, we show that rapid evolution of the most abundant components is due to positive selection, suggesting an interplay between abundance and adaptation.
Given log-scale differences in toxin abundance, which are likely correlated with biosynthetic costs, we hypothesize that as a result of natural selection, snakes optimize return on energetic investment by producing more of venom proteins that increase their fitness. Natural selection then acts on the additive genetic variance of these components, in proportion to their contributions to overall fitness. Adaptive evolution of venoms may occur most rapidly through changes in expression levels that alter fitness contributions, and thus the strength of selection acting on specific secretome components.
Selection acts differently on different proteins, depending on their function. Understanding mechanisms underlying these differences has been a major thrust in molecular evolutionary biology [1–6]. A dominant pattern observed in studies of diverse model systems, ranging from yeast to mammals, is that secreted proteins evolve faster than intracellular proteins [7–11]. Reasons for this phenomenon remain poorly understood. Non-adaptive explanations posit that extracellular proteins experience relaxed selection, and are more tolerant of mutations for structural, ecological, or evolutionary reasons [8, 9]. Alternatively, extracellular proteins may play a larger role in evolutionary interactions with the environment and other organisms, which should make them more likely targets of positive selection [10, 11], as predicted by coevolutionary theory [12, 13].
In our view this uncertainty results from methodologies used in previous studies, which focused on large-scale analysis of evolutionary rate correlates. Although some authors could identify major genome-wide patterns, given the large number of genes involved, they could not determine proximal ecological factors that drive selection acting on extracellular gene products. Here we use snake venoms as a model system to illuminate the interplay between ecology and evolutionary rates of secreted proteins. Consisting of analytically tractable numbers of components with distinct pharmacological roles, snake venom proteomes have well-defined ecological roles in the immobilization of prey. Thus, comparing the composition of snake venoms between species as a function of prey choice and venom protein expression level, can provide general insights into how secreted proteins evolve.
Snake venoms function as integrated systems. Roles of individual constituents depend upon their concentrations and their interactions with other venom components . Moreover, venom constituents also interact with compounds in prey tissues, most often in highly specific ways. Snake venom composition must respond to ontogenetic and evolutionary changes in diet, and presumably must also compensate to overcome predator or prey resistance, should that develop [15–17]. As a result, venoms from different species, and even those from different populations of the same species, appear optimally targeted to the chemistries of specific prey species. Furthermore, the structures of snake venom proteins evolve extremely rapidly (with a ratio of non-synonymous to synonymous substitutions greater than one), suggesting intense, positive, Darwinian selection. Consisting of mixtures of specific, evolutionarily conserved compounds with discrete pharmacologies, snake venoms provide an unusual opportunity to quantitatively study genomic consequences of ecological interactions, particularly how natural selection acts on rapidly co-evolving gene complexes.
Numerous and diverse studies attest to the strength of selection operating on venom composition. Various studies document a close match between venom composition and its effectiveness in subduing prey, suggestive of adaptation by the snakes. For example, Mackessy  showed that northern and southern Pacific rattlesnakes (Crotalus viridis oreganus and C. v. helleri) exhibit a pronounced shift in venom chemistry when young adults switch from lizard to rodent prey. The venom becomes less toxic, but much more proteolytic, presumably to enhance digestion of more voluminous prey with much smaller surface-to-volume ratios , although McCue  has disputed this conclusion. Daltry et al.  opined that venom variation in the Malayan pitviper (Calloselasma rhodostoma) is closely associated with diet. Chijiwa and colleagues  documented stark differences in venom phospholipases A2 (PLA2s) in populations of habus (Protobothrops flavoviridis) on different islands in the Ryukyu Archipelago. They inferred that these compositional differences reflect different selective pressures related to preferred prey. Experimental studies have shown that coral snake venoms are most toxic to preferred prey organisms . Gibbs and Mackessy  found that toxicity of four pygmy rattlesnake (Sistrurus) venoms was correlated with the proportion of mice and lizards in the diet, with each taxon being most toxic to its preferred prey species. Conversely, the sea snake, Aipysurus eydouxii, feeds exclusively on fish eggs, obviating the necessity of venom. As a result, selection on its venom PLA2s has relaxed, leading to a range of dysfunctional mutations [25, 26].
In addition, many studies that focused on individual toxins or toxin classes have discovered evidence of accelerated protein sequence evolution, consistent with positive selection. It is well documented that contrary to normal tissue isozymes, venom protein exons diversify much more rapidly than introns [27–31]. In most venomous snakes, toxin gene duplication followed by neofunctionalization, has resulted in diverse pharmacologies [32, 33]. As an example, PLA2s manifest presynaptic neurotoxicity, myotoxicity, cardiotoxicity, anticoagulation, and cause vascular perforation (hemorrhage), hemolysis, and pain [34–36]. Although not as well studied as PLA2s, other snake toxin classes including cysteine-rich secretory proteins (CRISPs), disintegrins, C-type lectins, serine proteases, metalloproteases, and three-finger toxins all show evidence of rapid gene diversification, or of positive Darwinian selection [33, 37–43]. Moreover, the most rapidly evolving residues are those that appear on toxin surfaces, controlling their interactions with molecular targets in prey organisms [32, 44].
Some have suggested that accelerated evolution is predominantly a product of intrinsic mechanisms that enhance genetic diversity, independent of selective pressures. Kini and Chan  reasoned that accelerated evolution results from relaxed rather than directional selection. Mebs  argued that there is no evidence that strong selective forces drive the development of more potent toxins to overcome prey resistance, or to exploit new biochemical pathways for more effective intoxication of prey. More recently Kini and Chinnasamy  proposed that triplet nucleotide sequences determine the spontaneous mutation rate, an idea that requires further consideration.
In addition, phylogeny clearly plays a role in venom composition. More closely related species generally have more similar venoms than more distantly related species (e.g. the four mamba species have venoms that are qualitatively similar, but distinctly different from those of all other elapids). However, geography, and reduced gene flow may also influence venom chemistry. Angulo et al.  found pronounced differences in venom composition (only ~15 % similarity) between Atropoides nummifer and Atropoides picadoi. Since adults of both species are predominantly rodent predators, the differences appear to have a phylogenetic, rather than an ecological or geographic basis. However, the earlier work of Jiménez-Porras makes it clear that geographic separation of conspecific populations also affects venom composition in A. nummifer . Angulo et al. did not indicate the geographic origins of their specimens, except to say that all were collected in Costa Rica.
Although toxin components interact to immobilize prey , historically, most studies of snake venom constituents have focused on specific toxin classes, examining long-term evolutionary trends in a variety of snake taxa. Consequently, although macroevolutionary patterns of individual venom proteins are somewhat understood, their regulation and microevolution remain almost entirely unknown . However, if individual toxins (gene products) fluctuate in abundance and importance over short evolutionary time scales , evolutionary analysis of individual venom constituents may be misleading, as individual genes may experience different selective pressures from lineage to lineage. Furthermore, rates of venom protein evolution may depend on the relative abundance of various venom protein classes, and on their relative importance in immobilizing the prey.
Here we examine the interplay between component abundance and evolutionary rate, using two closely related pitviper species, the habu (Protobothrops flavoviridis) and the Sakishima habu (Protobothrops elegans). The former is native to Okinawa and nearby islands, and the latter to the Yaeyama Islands, located 400 km to the southwest in the Ryukyu Archipelago. The Sakishima habu is most closely related to the Taiwan habu (P. mucrosquamatus) , which is not surprising since it occupies the Ryukyu Islands closest to Taiwan. In 1976, however, human activities released approximately 100 Sakishima habus in the vicinity of Itoman, Okinawa [52–54]. The geological isolation of the Yaeyamas and Okinawa, is estimated at about 1.6 million years, which matches patterns of divergence of several island endemics [51, 55–57], though molecular estimates place divergence between these species at about 10 million years .
Between 1990 and 2000, more than 500 Sakishima habus were captured and brought to the Prefectural Institute of Health and Environment . During this same period, five specimens of intermediate morphology were collected, suggesting possible hybridization. Recent collection of two more intermediate specimens prompted us to investigate whether these specimens are actual hybrids. Using a combination of transcriptomic and quantitative proteomic techniques , we identified venom components and determined their abundances in three specimens each of both parental species, and in the two putative hybrids, all collected in the same local area (Fig. 1). Because the putative hybrids are rare, and could not be sacrificed for venom gland extraction, we used only proteomic analysis of their venoms. We confirmed that they are genuine hybrids; however, serendipitously, these data also revealed some surprising patterns in the tempo and pattern of secreted protein evolution. Thus, an ostensibly simple question of possible pitviper hybridization unexpectedly illuminated fundamental evolutionary principles, which we could then verify in other species. Here then, are both intertwined stories.
Results and discussion
We used a combination of transcriptomic and proteomic techniques to characterize the venoms of two closely related habus. By combining both approaches, we benefitted from the dynamic range and accuracy of transcriptomics, while proteomic data distinguished between proteins secreted into the venom, and non-venom components, allowing us to contrast evolutionary patterns in the two classes of genes. In addition, proteomics permitted us to quantify venom composition in rare and valuable hybrid specimens that could not be sacrificed for gland extraction. Unexpectedly, however, these hybrids provided insights into venom level regulation and evolution, showing heritability and additive regulation of venom component expression levels.
We also found a positive correlation between abundance of venom gene transcripts, and their evolutionary rates. This result is not obvious, given the non-stoichiometric action of enzymatic snake venom components, but it suggests that venomous snakes invest more in the production of proteins that are particularly important to them. Consequently, these proteins experience high rates of evolution, often with ω > 1, and show evidence of positive selection (Fig. 6; Additional file 1: Table S3). We propose that regulation of venom constituent levels may be a major short-term mechanism by which snakes adapt to new prey types, and may be a driver of protein molecular evolution.
The two Protobothops venoms were qualitatively similar at the level of protein families, and had many one-to-one gene homologs (Fig 3). Both venoms were heavily dominated by phospholipases A2, serine proteases, metalloproteases, and BPP-CNP, in that order (Table 1). Protobothrops flavoviridis venom possesses Factor IX/X inactivators that are absent in P. elegans venom. LAO transcripts were 6× more abundant in habu venom and VEGF transcripts were 3× as numerous (Table 1). Toxins having the primary function of hypotension or anti-coagulation, account for at least 21 % more of the P. flavoviridis transcriptome than the P. elegans transcriptome (Table 1). Presently, it is not possible to know whether any of the phospholipases A2 in either venom are anticoagulant; however, Zhao et al.  present strong evidence that catalytic activity is required for anticoagulant activity. This implies that the major P. elegans myotoxic PLA2 is not anticoagulant, while the two major PLA2s of P. flavoviridis venom may be. Taken together, it appears that the envenomation strategy of P. flavoviridis is focused much more heavily on provoking hypotension and distributing venom proteins throughout the prey, strategies that makes sense for mammalian prey, with much smaller surface-to-volume ratios than lizards, on which even adult P. elegans feed.
Herein we address the composition of parental and hybrid venoms, confirmation of the hybrid status of the putative hybrids, and how natural selection acts on venoms as integrated systems.
Composition of parental and hybrid venoms
Both Protobothrops venom gland transcriptomes were dominated by phospholipases A2 (PLA2s). In the Protobothrops elegans transcriptome, two PLA2s comprised 76.9 % of all transcripts whereas in P. flavoviridis four isozymes represented 55.5 % (Table 1; Additional file 2: Table S1 and Additional file 3: Table S2). The PLA2 content in the P. elegans transcriptome is very high, but reasonable in light of pit viper venom chemistry. For example, in some neotropical rattlesnake (Crotalus durissus) venoms, crotoxin, a heterodimeric, PLA2 neurotoxin can comprise as much as 88 % of total venom protein [60–62].
Based on homology to other PLA2s of known pharmacology, P. elegans transcript comp43_c0_seq1 that constituted 73.4 % of all transcripts, encodes a noncatalytic, myotoxic PLA2 myotoxin that shows considerable homology to other Asian crotaline myotoxins (Fig. 2a; Additional file 4: Figure S1A; Additional file 5: Figure S3; Additional file 2: Table S1). In contrast, while P. flavoviridis also has a transcript (Pf_comp552_c0_seq1) for a homologous, non-catalytic, myotoxic PLA2, this transcript represents <0.05 % of the latter transcriptome. Both the P. elegans and P. flavoviridis myotoxins have an arginine residue (position 48) where catalytic PLA2s have aspartic acid in order to bind the Ca2+ ion required for catalysis. Many New World crotalines have lysine in this position. All three P. elegans specimens and the two hybrids express Pe_comp43_c0_seq1 heavily, but no snakes in this study produced the P. flavoviridis homolog, Pf comp552_c0_seq1 (Fig 3; Fig. 2; Additional file 6: Figure S5).
All other PLA2s in both venoms are presumably catalytic. The dominant PLA2s in P. flavoviridis venom (Pf comp41_c0_seq1 and comp40_c0_seq1) are both basic, as is P. elegans comp47_c0_seq1, which is homologous to Pf comp41_c0_seq1. Their pharmacologies are unknown. All three P. elegans venoms contained Pe_comp47_c0_seq1, but none of the P. flavoviridis PLA2s. Likewise, all three P. flavoviridis venoms contained the three Pf PLA2s, but not Pe comp47_c0_seq1. The hybrids, however, produced all five PLA2s from both parental species (Fig 3; Fig. 2; Additional file 6: Figure S5).
Protobothrops elegans venom apparently has no neurotoxic PLA2s, but P. flavoviridis comp48_c0_seq1, which encodes a weak presynaptic neurotoxin similar to trimucrotoxin , comprised 6.3 % of the P. flavoviridis transcriptome. Although some neurotoxic PLA2s are also myotoxic [64, 65], myotoxicity appears to be much more important for P. elegans than for P. flavoviridis. Nonetheless, it is difficult to offer an ecological explanation for this difference. In general, myotoxicity seems to be most often associated with mammal predation by large terrestrial crotalines [66–68], but it has also been reported in small arboreal species . Without appropriate pharmacological studies in native prey species as well as laboratory animals, it is impossible to know whether such differences are adaptive or ecologically irrelevant. It may be that some subsets of myotoxic PLA2s are specifically adapted to reptilian rather than mammalian skeletal muscle, but if so, the structural determinants are currently unknown.
Relative to PLA2s, one other matter deserves mention. The P. elegans cDNA library contains a peculiar transcript, Pe_comp103_c0_seq1, that represented 0.2 % of the transcriptome (Additional file 2: Table S1). Using the Frame 3 reverse translation of comp103_c0_seq1, BLASTP and TBLASTX searches both suggested that the best match is a PLA2 gene cluster from P. flavoviridis (in Frame 1) . However, both sequences are liberally punctuated with stop codons, and the longest, unbroken, translated segment of the P. elegans transcript, using Frame 1, encodes only 45 amino acids, or about 37 % of a typical Type II PLA2. BLASTP and pfam searches for the 45-residue protein yielded no results. However, mass spectrometry of the eight venoms detected three peptides (a total of 46 times) that cover 55.8 % of the putative protein translated in Frame 1 (Fig 3). Further analysis of these peptides was done using de-novo sequencing algorithms and manual annotation, in order to confirm their identity (Additional file 7: Figure S2). This protein was not only detected in all three specimens of P. elegans; it was also detected in both hybrid venoms (Fig 3). Clearly, the snakes are producing this, but it does not appear to be a PLA2 derivative, the TBLASTX searches notwithstanding. Its function, assuming that it has one, is unknown.
In both transcriptomes, serine proteases (SPs) are the second most abundant protein family (Table 1). Six SPs comprise 10.4 % of all P. elegans transcripts, whereas eight represent 11.8 % of all P. flavoviridis transcripts. Based on homology, P. elegans SP 1 (5.5 % of all transcripts) (Additional file 2: Table S1), is a thrombin-like enzyme (TLE) and a possible kallikrein. Protobothrops elegans SPs 2, 3, and 6 are probable TLEs, while based on BLASTP searches, SPs 4 and 5 are probable plasminogen activators. Protobothrops flavoviridis SPs 1 and 4 are probably TLEs, based on sequence homology to DAV-PA from Deinagkistrodon acutus venom (Additional file 3: Table S2). Protobothrops flavoviridis SP2, a complete transcript, is an isoform of a serine protease variously known as habutobin or flavoxobin. It is a strongly anticoagulant thrombin-like enzyme that releases only fibrinopeptide A [71, 72], inhibits collagen-induced platelet aggregation , and releases urokinase-tissue plasminogen activator from bovine arterial endothelial cells . In addition, it functions as a C3-convertase that activates the complement cascade . SP5 most closely resembles KN5 from Viridovipera stejnegeri venom (Tsai and Wang, direct submission: http://www.ncbi.nlm.nih.gov/protein/82239734), for which no pharmacological data are available. SP7 is a probable plasminogen activator, based on similarity to another homolog from V. stejnegeri venom . The majority of the SPs in both venoms appear to function in multiple ways as anticoagulants. As with PLA2s, the hybrids produce SPs from both parental species, but all peptides could be attributed to specific parental transcripts; there were no peptides of ambiguous (potentially common) origin.
At first glance, it might appear contradictory for snakes to employ both procoagulant and anticoagulant enzymes, but in reality, this is a well-coordinated strategy to render the prey’s blood wholly incoagulable with astonishing speed. Venom thrombin-like enzymes (TLEs) consume fibrinogen, yielding small fibrin clots, and hypotensive fibrinopeptides [77, 78]. However, the commencement of inappropriate clotting triggers the prey’s anti-clotting cascade, which rapidly destroys the fibrin clots . Protobothrops flavoviridis venom contains a Factor IX/X inactivator (3.6 % of the transcriptome) that is lacking in P. elegans venom, and kallikrein-like enzymes, which P. elegans does have (Additional file 2: Table S1, Additional file 3: Table S2). But kallikrein converts plasminogen to plasmin, which in turn, digests fibrin. Various venom serine proteases have both thrombin-like and kallikrein activities [80–85]; thus fibrinogen-clotting and plasminogen activation reside in the same protein. Regardless, many crotaline venoms also contain serine and metalloproteases that degrade fibrin directly [86, 87].
Many snake venom TLEs clot fibrinogen less effectively than thrombin  and TLEs are commonly more effective against fibrinogens of some mammal species than others , but the existence of so many weakly clotting TLEs, the capacity of various crotaline TLEs to degrade prothrombin , and the existence of directly fibrinolytic venom enzymes, confirm that the objective is not to clot blood, but to clear the bloodstream of fibrinogen . Even in human envenomations by small crotalines, blood can be rendered incoagulable within minutes . Defibrinogenation would occur in small mammalian prey far more rapidly. Incoagulable blood presumably facilitates the distribution of venom and endogenous prey tissue hydrolases, promoting prey digestion. In addition to fibrin cleavage, plasmin also inactivates many endogenous clotting factors, thereby acting as an anticoagulant ; however, this also suggests that the strategy may be to prevent endogenous coagulation factors from producing properly clotted fibrin. For a more detailed look at the role of serine proteases in envenomation, see the Additional file 9.
In both transcriptomes, P-II metalloproteases are the third most abundant protein family (P. elegans: 4.8 % and P. flavoviridis: 11.3 %), followed by P-III metalloproteases (P. elegans: 3.2 % and P. flavoviridis: 6.0 %) (Table 1). Protobothrops elegans venom contains as many as six P-II MPs, one apparent P-I MP, and as many as nine P-III MPs. However, caution must be exercised in predicting the number of members of these highly diversified families. From previous experience, it is clear that partial transcripts of large proteins tend to overestimate numbers of homologs and paralogs in these families . Our earlier study suggested that there could be as many as 12 P-II MPs and as many as 9 P-III MPs in P. flavoviridis venom. The present study found evidence for no more than 4 P-II MPs, but suggested as many as 11 P-III MPs (Additional file 3: Table S2). The problem seems to be that the Trinity assembler is unable to deal effectively highly diversified families of large proteins, such as MPs and SPs. Reassembly of both transcriptomes using the new assembler, VTBuilder, suggested assembling two pairs of these (Pf_comp65_co_seq1 with Pf_comp75_c0_seq1; Pf_comp69_c0_seq1 with Pf_comp72_c0_seq1), resulting in an estimated 7 P-III MPs. Nonetheless, overall, the two studies present a unified view of the relative importance of the various toxin families.
The putative P-I metalloprotease transcript identified in the P. elegans transcriptome had no apparent homolog in P. flavoviridis. Oddly, while peptides from this protein were not identified in any of the P. elegans venom samples, Hybrid #2 did express it. P-II metalloproteases were expressed in the hybrids at the slightly lower levels than those seen in either of the parental venoms. Four of five P. elegans enzymes were identified in the hybrids and three out of four from P. flavoviridis. In the two parental species, P-II metalloproteases were expressed somewhat more highly than P-III enzymes. MPs were expressed at about 12–15 % in both hybrids (Fig 3; Additional file 6: Figure S5).
It is difficult to predict metalloprotease pharmacology based upon subclass identity alone, owing to a dearth of comparative structure-function studies. The structural complexity of P-III enzymes in particular, gives rise to a great variety of pharmacologies, including hemorrhage, inflammation, apoptosis, fibrinogen and fibrin degradation, prothrombin activation, and platelet aggregation inhibition . The extremely hemorrhagic nature of P. flavoviridis venom is well known [95–98], and that characteristic reflects the well-documented presence of hemorrhagic MPs [99–102]. P-III MPs 2, 4, 7, 9, and 11 are clearly isomers of HR1, a hemorrhagic P-III MP (Additional file 3: Table S2) [103–106]. MP P-IIIs 1, 3, 5, 6, 8, and 10 appear homologous to flavorase (HV1), characterized as an apoptosis-inducing protein (Additional file 3: Table S2) ; however, apoptosis is too slow a process to be relevant to envenomation; its more important contribution to envenomation is probably to degrade fibrinogen .
Despite the presence of an apparent P-I MP in the P. elegans transcriptome, we are unable to say whether an independent P-I MP exists in this venom, or whether this represents an incomplete P-II or P-III transcript. The putative P-I transcript was itself incomplete and represented only 0.1 % of the transcriptome. Moreover, no peptides from this transcript were found in any of the three P. elegans venoms, although they were found in the venom of hybrid #2. Graminelysin I, from the venom of Trimeresurus gramineus, a P-I MP, has been shown to be processed post-translationally from a P-III precursor , but that situation is probably not analogous to this one.
Bradykinin-potentiating peptides and C-type natriuretic peptide
The gene that encodes multiple bradykinin-potentiating peptides and a single C-type natriuretic peptide (BPP-CNP) was found in both transcriptomes (P. elegans: 1.4 % and P. flavoviridis: 2.6 %) (Table 1). All of these gene products are hypotensive, at least in mammals, and cause prey to go into circulatory shock, limiting prey flight . These peptides were more highly expressed in P. flavoviridis venoms and peptides from hybrid venoms corresponded only to the P. flavoviridis transcript (Fig 3; Additional file 6: Figure S5).
Other venom constituents
C-type lectins that act upon blood platelets were found in the transcriptomes and proteomes of both species and hybrid venoms showed peptides from both parental venoms (Fig 3; Additional file 6: Figure S5). Protobothrops flavoviridis produces an anticoagulant C-type lectin that binds and inactivates coagulation Factors IX/IXa and X/Xa [110, 111], but P. elegans does not. Both hybrids manifested this protein, but at levels substantially lower than seen in P. flavoviridis (Fig 3; Additional file 6: Figure S5).
L-amino acid oxidase represented 3.1 % of the P. flavoviridis transcriptome, but only 0.5 % in P. elegans (Tables 1, Additional file 2: Table S1 and Additional file 3: Table S2). However, LAO peptides accounted for 14.9 % of the P. elegans venom proteome and 13.4 % in P. flavoviridis (Fig 3). Vascular endothelial growth factor, was an even more minor element of the two transcriptomes (1.7 % of P. flavoviridis, but only 0.5 % of P. elegans). All other components represented less than 1 % of one or both transcriptomes (Tables 1, Additional file 2: Table S1 and Additional file 3: Table S2). These included phospholipase B, cysteine-rich secretory proteins, 5′-nucleotidase, nerve growth factor, phosphodiesterase, galactose-binding lectin, glutaminyl cyclase, hyaluronidase, dipeptidyl peptidase IV, and aminopeptidase A. Pharmacologies of these proteins relative to their roles in envenomation have been reviewed in Aird  and Aird et al. .
“Common” peptides in the hybrid proteomes
For most protein families, it was possible to detect in the hybrid venoms, a large number of unique peptides that could be assigned to specific transcripts from each of the parental species (Fig 3; Additional file 6: Figure S5). However, for some families there were a large number of peptides that could be matched to one or more transcripts in both parental venoms. That is, they were potentially common to both. Not surprisingly, the percentage of common peptides in the hybrid proteomes was greatest in the least diversified protein families. These families generally have only one transcript per species. Examples include APA (69.4 % common), LAO (55.6 %), PDE (67.2 %), and CRISP (51.8 %) (Fig 3). Two stark exceptions were 5′-nucleotidase and PLB (0 %). Highly diversified protein families showed much lower percentages of common peptides, the extreme case being serine proteases, for which no common peptides were found.
Quantitative proteomics of the hybrids
Proteomic data can be quantified by standardizing absolute peptide abundance by the predicted protein length . This approach produced significant correlations between protein and transcriptomic abundance estimates, with close agreement between protein sample replicates from different individuals (Fig. 4). However, reference transcripts were only available for the parent species, and not for the putative hybrids. In order to make comparisons of protein abundance, a common reference was needed. We identified peptides using each transcriptome as a reference, in order to determine which performed better. The P. flavoviridis reference produced the most significant correlation between transcriptomic and proteomic data for both species (Spearman correlation with P. elegans: r = 0.29, p = 0.011, P. flavoviridis r = 0.50, p = 5.3 × 10−9), so it was used to call peptides for the hybrids (Fig. 4).
Non-metric, multidimensional scaling analysis of the putative hybrid proteomes showed that they were largely intermediate between those of the parental species (Fig. 5a). The expression level of homologous venom components was similarly intermediate (Fig. 5b).
Confirmation of the putative hybrids
The two putative hybrids expressed all venom proteins from both parental species, except for those corresponding to the most minor transcripts (Fig 3; Additional file 6: Figure S5), and had distinct venom profiles (Fig. 5). Venom proteins produced by only one of the parental species also appeared in hybrid venoms (e.g., P. elegans PLA2 Pe_comp43_c0_seq1 and P. flavoviridis Factor IX/X inactivators, Pf_comp55_c0_seq1 and Pf_comp76_c0_seq1) (Fig 3; Additional file 5: Figure S3). They also had roughly intermediate levels of protein expression for most venom proteins, compared to the parental species (Fig 3, Fig. 5; Additional file 8: Figure S4; Additional file 6: Figure S5). The strong correlation between protein levels in the two parental species and in the hybrids indicates that venom levels are genetically controlled (Fig. 5). Hybrid venom protein levels are largely intermediate between those of the parental species, suggesting that each component may be largely additively regulated, though there is also evidence that hybrids show higher levels of proteins, relative to the two parental species, perhaps a manifestation of “hybrid vigor”. Additive effects, as opposed to dominance and epistatic effects, are particularly good targets for natural selection , though, more generally, genetic control of venom constituent levels makes them a potential target for natural selection. Consequently, rapid population-level changes in venom chemistry are expected in the face of sufficient selective pressure. This mode of regulation is consistent with the rapid evolution of snake venoms seen in Protobothrops and other taxa.
Given their morphological intermediacy, and the additive compositions of their venoms, it is apparent that the putative hybrids are, in fact, bona fide hybrids. We know of no other way to explain the “hybrid” nature of these venoms except for hybridization between the native P. flavoviridis and the invasive P. elegans.
While the morphologically intermediate animals (Fig. 1) are legitimate hybrids, without tissue samples, it is impossible to determine which parent pertained to which species, and we also cannot be certain whether the hybrids are F1 or F2 hybrids. Given the substantial intermediacy of hybrid venom composition relative to the two parental species, it is virtually certain that they do not represent back-crosses with either parental species, which would be expected to skew venom composition toward that parental species. Nonetheless, it is clear that P. elegans and P. flavoviridis are not reproductively isolated by pre-mating isolating mechanisms. Hybridization with invasive species can threaten conservation of P. flavoviridis, and future studies should address its extent and impact.
How natural selection operates on venoms
Evolutionary rate analysis
Venom gland transcripts for which venom peptides were sequenced by mass spectrometry, had much higher rates of non-synonymous to synonymous (dN/dS) substitutions relative to tissue components of the venom gland transcriptomes (Kruskal-Wallis p-value = 1.8 × 10−10) (Fig. 5a). Among the proteins detected in either P. elegans or P. flavoviridis venom by mass spectrometry, there was a positive correlation between the average abundance of a toxin in the transcriptome, and its evolutionary rate (r = 0.53, d.f. = 24, p = 0.0055; Fig. 5b). Including only proteins detected in both venoms, the correlation remained, at lower significance, due to the decreased sample size (r = 0.48, d.f. = 17, p = 0.037). Protein abundance levels were likewise correlated with evolutionary rate (r = 0.62, d.f. = 17, p = 0.0041). By contrast, the transcriptional abundance of non-venom (tissue protein) transcripts was negatively correlated with evolutionary rate (rs = -0.087, n = 1012, p = 0.0057). There was evidence of positive, significant selection acting on most venom proteins with dn/ds >1. PLA2s, metalloproteases and C-type lectins all showed evidence of positive selection in Protobothrops as a whole (p < 0.05). PARRIS failed to detect positive selection acting on serine proteases (p = 0.34, despite inferring dn/ds = 1.27; Table S3), but MEME found significant evidence of episodic diversifying selection at four sites, a result consistent with the absence of common peptides (Fig 3).
Control of expression levels
Differences in abundance of homologous proteins between P. flavoviridis and P. elegans are dramatic. It is possible that the expression level of each venom component is simply a function of its promoter strength. In such a case, we would predict that individual toxin levels should be weakly correlated with levels of other transcripts, and that promoter regions should show classical signs of positive selection, such as reduced haplotypic diversity, and strong sequence divergence between species. Both of these predictions can be tested by conducting a gene co-expression level analysis study, and an investigation of genetic diversity upstream of venom toxin genes, respectively.
Energetics and selection
Although enzymatic venom constituents do not act stoichiometrically as non-catalytic toxins do, nonetheless, greater abundance necessarily signifies a greater role for the constituent in question in prey immobilization and/or digestion, regardless of the presence or absence of catalytic activity. Snake venoms are the most concentrated glandular secretions in the animal kingdom, with solutes, 90 % of which are proteins, comprising 25–35 % by mass [113, 114]. Given the high cost of venom production as a function of overall metabolism , more abundant components must necessarily carry a higher biosynthetic cost to the snake. Therefore, snakes should optimize the abundance of venom constituents to maximize immobilizing power, while minimizing cost. Both venom gene expression levels and gene product sequences should be under strong selective pressure. This seems to be the case generally in Protobothrops, at least for sequences above dN/dS = 1 (Fig. 6), which all showed some evidence of positive selection, in our study and in earlier studies of accelerated evolution, which covered C-type lectins, serine proteases and PLA2s [28, 30, 37, 115], and also in our re-analysis (Table S3). This suggests that the higher rates of evolution in more abundant components are due to adaptation, rather than to relaxed selection.
Venom proteins and tissue proteins
Comparison of two closely related species that became geographically isolated several million years ago [51, 57, 58], allowed us to examine the interplay between venom composition and evolutionary rate of venom proteins. Their venoms show much greater sequence divergence than venom gland cellular (tissue) components. Interestingly, while secreted venom components have evolved much faster than the cellular (non-secreted) portion of the venom gland transcriptome, more abundant venom proteins evolved most rapidly (Fig. 6). This pattern contrasted with the cellular portion of the venom gland transcriptome, where there was a negative relationship between protein abundance and evolutionary rate, similar to that found in most organisms [116, 117]. Together with data including other Protobothrops species, which show evidence for positive selection on the most abundant venom components (Table S3), this strongly argues against the suggestion that snakes inject so much venom into their prey that the resulting ‘overkill’ greatly relaxes selection on venom components [45, 118].
Extending the present analysis to other datasets
Patterns observed in the two Protobothrops species appear to hold generally true for crotaline snakes, based on data from three more species in two additional genera (Fig. 7a). Furthermore, the recently published genome of the king cobra (Family Elapidae) , shows the same pattern (Fig. 7b). These data from four additional studies, employing a range of methodologies, also show that more abundant snake venom proteins evolve faster. It is also possible that other organisms that depend on proteinaceous secretions for parasitism, competition, and defense may exhibit similar patterns, though further work will be necessary to test the generality of this conclusion, evident in snakes.
The findings that venom protein concentrations can change dramatically over relatively short periods of time, as in the case of myotoxic PLA2s (Fig 3), and that relative concentration affects evolutionary rates of venom constituents, helps us to understand the trade-offs involved in venom formulation. Venoms are costly for snakes to produce . Given that venom components can range in abundance by 3–5 orders of magnitude  (Fig. 5b), their cost to the snake likely also varies greatly, with more abundant components being more costly. Optimal foraging considerations suggest that snakes should invest in venom constituents in proportion to the benefit each of them provides. Therefore, variability in efficacy of the more abundant components will have the greatest fitness consequences for the snake, and those components should experience higher levels of selection. The higher evolutionary rates we observe in more abundant components are the evolutionary signature of this process. This also suggests that changes in venom component levels, rather than protein sequence-level changes, may be the most rapid and effective way for snakes to exploit new prey species.
Deposition of data
Sequences of genes confirmed by mass spectrometry were curated manually by comparison with data publicly available at NCBI GenBank, and deposited in the DNA Data Bank of Japan under accession numbers AB984571-AB984614 for P. elegans and AB985221-AB985266 for P. flavoviridis.
Collection of snakes and extraction of venoms
Three Protobothrops flavoviridis, three P. elegans, and two putative hybrids were employed in this study. These specimens were captured in Nanjo City, Itoman City, and Yaese Town in southern Okinawa as part of a prefectural program to remove habus from densely inhabited areas. All animals, six males and two females, were adults, ranging from 82 to 150 cm total length (Table 1). Snakes were maintained at the Okinawa Prefectural Institute of Health and the Environment (OPIHE) after capture. Venoms were manually extracted into plastic beakers, transferred to cryovials, fast frozen on dry ice, and then stored at -30 °C until use.
Removal of venom glands
Four days after venom extraction, one animal of each species was euthanized under pentobarbital anesthesia (>100 mg/kg), in accordance with OIST animal care protocols. Right and left venom glands and underlying skeletal muscle were excised, placed individually into 2 mL cryovials, flash frozen in liquid nitrogen, and stored at -80 °C until use.
Isolation of total mRNA from venom glands and first strand cDNA synthesis
RNA was isolated using Qiagen RNeasy Plus Mini Kits. One μg of total RNA was used for first strand cDNA synthesis. ERCC ExFold RNA Spike-In Mix (Life Technologies) was included in the cDNA synthesis reaction according to the manufacturer’s protocol. To the RNA and spike-in mix, 2 μL of 10 μM poly T_START oligo 5′-AATTGCAGTGGTATCAACGCAGAGCGGCCGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTVN were added. The 9-μL mixture was incubated at 65 °C for 3 min, and then chilled on ice. The 11-μL reaction, containing 4 μL of 5× first strand synthesis buffer (Invitrogen), 1 μL of 10 mM dNTP (Promega), 2 μL of 0.1 M DTT (Invitrogen), 2 μL of 12 μM template-switching RNA oligo (5′-AAGCAGUGGUAUCAACGCAGAGUACAUGGG), 1 μL RNase inhibitor (Qiagen), and 1 μL Superscript II Reverse Transcriptase (Invitrogen) were added to each sample. Reactions were incubated at 42 °C for 60 min and the enzyme was heat inactivated at 65 °C for 15 min. 80 μL MilliQ water were added to each cDNA reaction.
Second strand cDNA synthesis
Second strand cDNA was synthesized with a limited number of PCR cycles. The 50-μL PCR reaction consisted of 1× Phusion HF buffer (Thermo Scientific), 200 μM dNTP (Promega), 0.5 μM START primer (5′-CGCCAGGGTTTTCCCAGTCACGACAATTGCAGTGGTATCAACGCAGA), 0.5 μM TS_long primer (5′-CTTGTAGGTTAAGTGGAGAGCTAACAATTTCACACAGGAAAGCAGTGGTATCAACGC), 0.5 μL of 2 U/μL Phusion DNA polymerase (Thermo Scientific), and 10 μL diluted cDNA. Twelve 50-μL PCR reactions were set up for each cDNA sample. PCR was carried out with the following conditions: initial denaturation at 98 °C for 30 s, with 8 cycles of denaturation at 98 °C for 10 s, 68 °C for 6 min, followed by final extension at 72 °C for 10 min. PCR reactions were concentrated with an Amicon Ultra-0.5 column (Millipore).
Concentrated PCR products were purified by solid phase reversible immobilization  using Dynabeads MyOne Carboxylic Acid (Invitrogen). 17 % PEG was used for purification. 1 ng of ds-cDNA was used library preparation and sequencing, using a Nextera XT DNA sample kit (illumina) according to manufacturer’s directions. Ten cycles of PCR were used for library amplification. Pooled libraries were sequenced paired-end for 250 cycles on an illumina Miseq system.
Reduction and alkylation of snake venoms prior to enzymatic digestion
Crude venoms were gently vortexed for several seconds to suspend microsomes. Reduction was accomplished by adding 36 μL 8 M urea, 2 μL venom, 2 μL 500 mM DTT in ultrapure water, and 10 μL 500 mM of ammonium bicarbonate to 200 μL PCR tubes. Tubes were incubated at 60 °C for 45 min in an MJ PTC 200 Peltier thermocycler. Then, 4 μL 500 mM sodium iodoacetate were added to each tube and mixed gently. Tubes were incubated at 37 °C for 30 min in the dark. Afterward, an additional 2 μL 500 mM DTT were added to quench the alkylation reaction. Sample desalting was done by pipetting the reaction from the PCR tubes into 10K Nanosep centrifugal filters (Pall Corp.) and centrifuging them at 18,407 × g for 25 min. Samples were rehydrated with 50 μL of 50 mM ammonium bicarbonate and were again centrifuged to further de-salt the samples.
After the second centrifugation, 10 μL 8 M urea were added to the membrane and the membrane surface was wetted with the urea by rotating the tube manually. Then 90 μL 50 mM ammonium bicarbonate were added to the membrane and the filter was vortexed gently to re-suspend de-salted proteins. Solutions were pipetted from the filter and placed in clean 200 μL PCR tubes. Finally 10 μg of trypsin (Pierce 90055) dissolved in 10 μL 1 mM HCl were added to each tube. Tubes were incubated 24 h at 37 °C. Then, samples were pipetted back into their original 10K Nanosep filters and centrifuged again at 18,407 × g for 30 min to collect the de-proteinated filtrate which was frozen at -30 °C until analysis by mass spectrometry.
The procedure for chymotrypsin (Pierce 90056) was nearly identical to that for trypsin, except that after wetting the Nanosep filter with 10 μL 8 M urea, 10 μL 50 mM ammonium bicarbonate, 5 μL 200 mM CaCl2, and 65 μL ultrapure water were added to the membrane. All subsequent procedures were identical to those for trypsin.
Formic acid hydrolyses
Crude venoms were gently vortexed for few seconds. Digestion of proteins was done by mixing 2 μL venom, 4 μL 500 mM DTT in ultrapure water and 94 μL 2 % formic acid (Wako 063-04192) in 200 μL PCR tubes. Tubes were incubated for 2 h at 99 °C and then frozen at -30 °C until analysis by mass spectrometry.
For each sample, 3 technical repeats were run for each of the three hydrolyses; thus each sample was analyzed nine times. For each mass spectrogram, 1 μL was injected into a Waters nanoACQUITY UPLC Class-M with a Waters SYNAPT G2-S High Definition Mass Spectrometer equipped with Ion Mobility in tandem. Peptides were trapped in a nanoACQUITY 2G-V/MTrap 5 μm Symmetry C18 trap column at 5 μL/min, and separated in a nanoACQUITY UPLC 1.7 μm BEH130 C18 75 μm × 200 mm analytical column at 300 nL/min, 35 °C, with a gradient of 3–60 % B in 90 min. The column was equilibrated in 0.1 % formic acid in water and the elution buffer (B) was 0.1 % formic acid in acetonitrile. Mass spectrometer settings were as follows: 1.8 kV spray voltage, cone voltage 30 V, acquisition mode fast-DDA with ion mobility separation, resolution 25,000 FWHM, continuum mode, mass range 100–2,000 m/z, acquisition rate 1 Hz/function, quad profile manual using first mass 400, 25 % dwell time, and a 25 % scan time ramp, second mass 600 with 25 % dwell time and third mass 800, collision energy ramped from 25 to 45 eV, and ion mobility wave velocity 1,200 m/s. The mass spectrometer was calibrated with Glu-fibrinopeptide B 100 fmol/uL in 50 % methanol in water. The same solution was used as a lock mass at 1 μL/min. The top eight most intense ions from each full MS scan were fragmented in a data-dependent manner with CID, using an exclusion of 60 s.
Analysis of gene expression, quantitative proteomics, and evolutionary rate correlations were performed using the R statistical package. Scripts performing the analysis, as well as the accompanying SQL tables are provided on DataDryad http://dx.doi.org/10.5061/dryad.3948r.
Transcriptome assembly, quantitation and validation
8.0 and 8.5 million read pairs comprising 3.6 and 3.7 gigabases were sequenced from the P. elegans and P. flavoviridis libraries, respectively. Sequences were quality and adaptor trimmed using Trimmomatic, with the SLIDINGWINDOW:4:30 MINLEN:65 parameters and Nextera adaptor sequences . Reads were then assembled using the Trinity suite (r20131110) , and then re-mapped to the assembly using the RSEM (v1.2.13) pipeline, with bowtie2 as the mapper [122, 123]. Ninety nine and ninety eight percent of the reads could be mapped to the reference in this manner for P. elegans and P. flavoviridis, respectively. Venom transcripts in Fig 3 were manually curated individually, and compared to published sequences.
Fragments per kilobase mapped (FPKM) output from RSEM were used to determine the coverage cutoff for proteomic analysis and for analysis of relative transcript expression levels. To determine the lowest FPKM cutoff for the transcriptomic analysis, we used the relationship between expected and observed values for ERCC spike-in controls. As the cutoff increased, so did the strength of the correlation, reaching a maximum of 0.96 at FPKM 32. Assembled sequences more abundant than this cutoff were used as reference sequences for downstream proteomic analysis.
Well after the completion of this study, a new assembler, VTBuilder , was published, specifically designed to remedy the deficiencies of Trinity, which typically produces large numbers of incomplete transcripts with highly diversified protein families. We validated our unfiltered assembly and compared it against the new method by evaluating the assembly of spike-in transcripts. We found that VTBuilder detected only 18 and 34 of the 92 spike-in controls, for P. elegans and P. flavoviridis, respectively, vs. 73 and 82, assembled by Trinity. Furthermore, all but a few of the spike-ins were represented by multiple sequences, as many as 22, whereas virtually all Trinity assemblies matched a unique spike-in control, or produced at most two matches. This evaluation suggests that the Trinity assembly performed well for the known sequences in our data, which spanned a wide range of concentrations. All but one of the venom transcripts from Fig 3 could be found in the VTBuilder assembly (Pe_comp1091_c0_seq1, a low-abundance aminopeptidase was missing). VTBuilder suggested that five pairs of low-abundance Trinity transcripts assemblies were incompletely assembled by the latter (three in P. elegans and two in P. flavoviridis). In most cases, but not always, VTBuilder transcripts had better matches to NCBI protein data. In any case, even assuming the VTBuilder analysis was correct in its reconstruction, no more than one of these pairs had a homolog in the other transcriptome, so differences in assembly should not bias our estimates of evolutionary rates (see below).
Evolutionary rate analysis
For purposes of evolutionary rate analysis, proteins that had more than one high quality matching peptide from the venom proteome were classified as venom components; everything else was considered a tissue component. Likely coding sequences for transcripts passing the FPKM filter were determined using FrameDP . Putative homologs were then selected among the predicted protein sequences using reciprocal best BLAST. Predicted protein sequence pairs were aligned using MUSCLE  and codon alignments were created using PAL2NAL . For each pair of codon aligned sequences, pairwise evolutionary rates (the ratio of synonymous to non-synonymous substitutions) were computed using a maximum likelihood model implemented in PAML (parameters: runmode = -2, cleandata = 0, model = 1, NSsites = ) . The R script containing the evolutionary rate analysis can be found in the Additional file 9 and an SQL database with the raw data has been deposited in DataDryad (see Data Accessibility section). The combination of the R script and database is sufficient to reproduce the entire analysis, starting after PAML computations.
We tested the generality of relationships between toxin abundance and evolution by combining results from the present study with those of three recent investigations that reported relative toxin abundances [42, 129, 130]. For each protein class, sequences greater than 150 base pairs were codon-aligned using Geneious (v. 8.0.5), with each alignment inspected manually for quality. Signal peptides were detected by comparison with published and annotated homologs and were removed. A tree for each protein alignment was then computed using RAXML with the GTRCAT model . The free-ratio model implemented in codeml was then used to estimate evolutionary rates along all branches of each tree. As we were interested in only the most recent evolutionary events, for each protein we considered only the branch leading up to each leaf in the tree. We then removed any extreme values from the data set (dN/dS > 3), which typically resulted from a near total lack of synonymous substitutions occurring along some branches, and averaged both the evolutionary rates and expression levels in each class. We also computed a correlation between relative transcript abundance within a toxin class and its evolutionary rate using values previously reported for the king cobra genome project .
Detecting positive selection in Protobothrops
In order to determine whether the increased evolutionary rates were due to relaxed or to positive selection, we added publicly available Protobothrops sequences for genes showing evidence of accelerated evolution (dn/ds = 1). We found additional data for C-type lectins, PLA2s, metalloproteases, and serine proteases. Sequence alignments were tested for the presence of recombination breakpoints and partitioned using SBP, and then tested for positive or diversifying selection using PARRIS and MEME algorithms implemented on the DataMonkey server [132–136].
Acquisition of mass spectrometric data employed a Waters MassLynx v4.0. Chromatographic data was processed using Waters Progenesis QI for Proteomics for mass correction (using acquired lock mass data), alignment, normalization, isotope grouping, peak picking, and ion drift separation, using default values. For peptide identification, files of fragmentation spectra (~330,000 spectra), generated only from monoisotopic precursors, were exported in mgf format for analysis using Mascot (version 2.4), and Proteome Discoverer (version 1.2, using Sequest). Search parameters for both algorithms were: a maximum of two missed cleavages, with precursor and fragment mass tolerance set to 20 ppm and 0.3 Da, respectively. Carboxyamidomethylation of cysteine was set as a fixed modification, while methionine oxidation, asparagine and glutamine deamidation, and N-terminal acetylation were set as variable modifications for tryptic and chymotryptic digests. Asparagine and glutamine deamidation and formylation of peptide N-termini were set as variable modifications for formic acid digests. Reagents used for sequencing (trypsin, R and K; chymotrypsin, F, L, W, and Y; formic acid, D and M) were specified in each case.
A database from our transcript data was constructed using TransDecoder , http://transdecoder.github.io, version r20131110), using a minimum length open reading frame of 5 amino acids or more than 10 nucleotides, as parameters. These relaxed settings were chosen to avoid the loss of small natural peptides present in the venoms. The common Repository of Adventitious Proteins – cRAP (http://www.thegpm.org/crap/) was merged with this constructed database for searching (Protobothrops elegans plus cRAP = 116,501 entries; Protobothrops flavoviridis plus cRAP = 430,984 entries; Protobothrops hybrids plus cRAP = 237,041). Protein and peptide identifications from Mascot and Sequest were combined, setting the false discovery rate to 0.1 % at the peptide level. Proteins with one hit were manually verified in all replicates (chromatographic peak), filtering out those that did not appear in at least 80 % of the runs.
Availability of supporting data
The R script containing the evolutionary rate analysis and an SQL database with the raw data used in the bioinformatic analysis, which contains raw mass spectrometry data, transcript counts, and other types of data, can be found on DataDryad: doi:10.5061/dryad.3948r The combination of the R script and database is sufficient to reproduce the entire analysis, starting after PAML computations.
Fraser HB, Hirsh AE, Steinmetz LM, Scharfe C, Feldman MW. Evolutionary rate in the protein interaction network. Science. 2002;296(5568):750–2.
Subramanian S, Kumar S. Gene expression intensity shapes evolutionary rates of the proteins encoded by the vertebrate genome. Genetics. 2004;168(1):373–81.
Drummond DA, Bloom JD, Adami C, Wilke CO, Arnold FH. Why highly expressed proteins evolve slowly. Proc Natl Acad Sci U S A. 2005;102(40):14338–43.
Zhang J, He X. Significant impact of protein dispensability on the instantaneous rate of protein evolution. Mol Biol Evol. 2005;22(4):1147–55.
Makino T, Gojobori T. The evolutionary rate of a protein is influenced by features of the interacting partners. Mol Biol Evol. 2006;23(4):784–9.
Liao BY, Zhang J. Null mutations in human and mouse orthologs frequently result in different phenotypes. Proc Natl Acad Sci U S A. 2008;105(19):6987–92.
Luz H, Vingron M. Family specific rates of protein evolution. Bioinformatics. 2006;22(10):1166–71.
Liao BY, Weng MP, Zhang J. Impact of extracellularity on the evolutionary rate of mammalian proteins. Genome Biol Evol. 2010;2:39–43.
Julenius K, Pedersen AG. Protein evolution is faster outside the cell. Mol Biol Evol. 2006;23(11):2039–48.
Dean MD, Good JM, Nachman MW. Adaptive evolution of proteins secreted during sperm maturation: an analysis of the mouse epididymal transcriptome. Mol Biol Evol. 2008;25(2):383–92.
Winter EE, Goodstadt L, Ponting CP. Elevated rates of protein secretion, evolution, and disease among tissue-specific genes. Genome Res. 2004;14(1):54–61.
Van Valen L. Molecular evolution as predicted by natural selection. J Mol Evol. 1974;3(2):89–101.
Dawkins R, Krebs JR. Arms races between and within species. Proc R Soc Lond B Biol Sci. 1979;205(1161):489–511.
Aird SD. Ophidian envenomation strategies and the role of purines. Toxicon. 2002;40(4):335–93.
Barchan D, Ovadia M, Kochva E, Fuchs S. The binding site of the nicotinic acetylcholine receptor in animal species resistant to alpha-bungarotoxin. Biochemistry. 1995;34(28):9172–6.
Takacs Z, Wilhelmsen KC, Sorota S. Cobra (Naja spp.) nicotinic acetylcholine receptor exhibits resistance to Erabu sea snake (Laticauda semifasciata) short-chain alpha-neurotoxin. J Mol Evol. 2004;58(5):516–26.
Jansa SA, Voss RS. Adaptive evolution of the venom-targeted vWF protein in opossums that eat pitvipers. PLoS ONE. 2011;6(6):e20997.
Mackessy SP. Venom ontogeny in the Pacific rattlesnakes Crotalus viridis helleri and C. v. oreganus. Copeia. 1988;1988:92–101.
Mackessy SP. Characterization of the major metalloprotease isolated from the venom of the northern pacific rattlesnake, Crotalus viridis oreganus. Toxicon. 1996;34(11-12):1277–85.
McCue MD. Prey envenomation does not improve digestive performance in western diamondback rattlesnakes (Crotalus atrox). J Exp Zool A Ecol Genet Physiol. 2007;307(10):568–77.
Daltry JC, Wuster W, Thorpe RS. Diet and snake venom evolution. Nature. 1996;379(6565):537–40.
Chijiwa T, Deshimaru M, Nobuhisa I, Nakai M, Ogawa T, Oda N, et al. Regional evolution of venom-gland phospholipase A2 isoenzymes of Trimeresurus flavoviridis snakes in the southwestern islands of Japan. Biochem J. 2000;347(Pt 2):491–9.
Silva Junior NJ, Aird SD. Prey specificity, comparative lethality and compositional differences of coral snake venoms. Comp Biochem Physiol C Toxicol Pharmacol. 2001;128(3):425–56.
Gibbs HL, Mackessy SP. Functional basis of a molecular adaptation: prey-specific toxic effects of venom from Sistrurus rattlesnakes. Toxicon. 2009;53(6):672–9.
Li M, Fry BG, Kini RM. Eggs-only diet: its implications for the toxin profile changes and ecology of the marbled sea snake (Aipysurus eydouxii). J Mol Evol. 2005;60(1):81–9.
Li M, Fry BG, Kini RM. Putting the brakes on snake venom evolution: the unique molecular evolutionary patterns of Aipysurus eydouxii (Marbled sea snake) phospholipase A2 toxins. Mol Biol Evol. 2005;22(4):934–41.
Ogawa T, Oda N, Nakashima K, Sasaki H, Hattori M, Sakaki Y, et al. Unusually high conservation of untranslated sequences in cDNAs for Trimeresurus flavoviridis phospholipase A2 isozymes. Proc Natl Acad Sci U S A. 1992;89(18):8557–61.
Nakashima K, Nobuhisa I, Deshimaru M, Nakai M, Ogawa T, Shimohigashi Y, et al. Accelerated evolution in the protein-coding regions is universal in crotalinae snake venom gland phospholipase A2 isozyme genes. Proc Natl Acad Sci U S A. 1995;92(12):5605–9.
Nobuhisa I, Nakashima K, Deshimaru M, Ogawa T, Shimohigashi Y, Fukumaki Y, et al. Accelerated evolution of Trimeresurus okinavensis venom gland phospholipase A2 isozyme-encoding genes. Gene. 1996;172(2):267–72.
Ogawa T, Nakashima K, Nobuhisa I, Deshimaru M, Shimohigashi Y, Fukumaki Y, et al. Accelerated evolution of snake venom phospholipase A2 isozymes for acquisition of diverse physiological functions. Toxicon. 1996;34(11-12):1229–36.
Chuman Y, Nobuhisa I, Ogawa T, Deshimaru M, Chijiwa T, Tan NH, et al. Regional and accelerated molecular evolution in group I snake venom gland phospholipase A2 isozymes. Toxicon. 2000;38(3):449–62.
Alape-Giron A, Persson B, Cederlund E, Flores-Diaz M, Gutierrez JM, Thelestam M, et al. Elapid venom toxins: multiple recruitments of ancient scaffolds. Eur J Biochem. 1999;259(1-2):225–34.
Fry BG, Wuster W, Kini RM, Brusic V, Khan A, Venkataraman D, et al. Molecular evolution and phylogeny of elapid snake venom three-finger toxins. J Mol Evol. 2003;57(1):110–29.
Kini RM. Phospholipase A2 – a complex multifunctional protein puzzle. In: Kini RM, editor. Venom phospholipase A2 enzymes: structure, function and mechanism. Hoboken: Wiley; 1997. p. 1–28.
Lynch VJ. Inventing an arsenal: adaptive evolution and neofunctionalization of snake venom phospholipase A2 genes. BMC Evol Biol. 2007;7:2.
Bohlen CJ, Chesler AT, Sharif-Naeini R, Medzihradszky KF, Zhou S, King D, et al. A heteromeric Texas coral snake toxin targets acid-sensing ion channels to produce pain. Nature. 2011;479(7373):410–4.
Deshimaru M, Ogawa T, Nakashima K, Nobuhisa I, Chijiwa T, Shimohigashi Y, et al. Accelerated evolution of crotalinae snake venom gland serine proteases. FEBS Lett. 1996;397(1):83–8.
Ogawa T, Chijiwa T, Oda-Ueda N, Ohno M. Molecular diversity and accelerated evolution of C-type lectin-like proteins from snake venom. Toxicon. 2005;45(1):1–14.
Juarez P, Comas I, Gonzalez-Candelas F, Calvete JJ. Evolution of snake venom disintegrins by positive Darwinian selection. Mol Biol Evol. 2008;25(11):2391–407.
Casewell NR, Wagstaff SC, Harrison RA, Renjifo C, Wuster W. Domain loss facilitates accelerated evolution and neofunctionalization of duplicate snake venom metalloproteinase toxin genes. Mol Biol Evol. 2011;28(9):2637–49.
Sunagar K, Johnson WE, O’Brien SJ, Vasconcelos V, Antunes A. Evolution of CRISPs associated with toxicoferan-reptilian venom and mammalian reproduction. Mol Biol Evol. 2012;29(7):1807–22.
Aird SD, Watanabe Y, Villar-Briones A, Roy MC, Terada K, Mikheyev AS. Quantitative high-throughput profiling of snake venom gland transcriptomes and proteomes (Ovophis okinavensis and Protobothrops flavoviridis). BMC Genomics. 2013;14(1):790–816.
Vonk FJ, Casewell NR, Henkel CV, Heimberg AM, Jansen HJ, McCleary RJ, et al. The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system. Proc Natl Acad Sci U S A. 2013;110(51):20651–6.
Kini RM, Chan YM. Accelerated evolution and molecular surface of venom phospholipase A2 enzymes. J Mol Evol. 1999;48(2):125–32.
Mebs D. Toxicity in animals. Trends in evolution? Toxicon. 2001;39(1):87–96.
Kini RM, Chinnasamy A. Nucleotide sequence determines the accelerated rate of point mutations. Toxicon. 2010;56(3):295–304.
Angulo Y, Escolano J, Lomonte B, Gutierrez JM, Sanz L, Calvete JJ. Snake venomics of Central American pitvipers: clues for rationalizing the distinct envenomation profiles of Atropoides nummifer and Atropoides picadoi. J Proteome Res. 2008;7(2):708–19.
Jiménez-Porras JM. Venom proteins of the Fer-de-lance, Bothrops atrox, from Costa Rica. Toxicon. 1964;2:155–66.
Ohno M, Menez R, Ogawa T, Danse JM, Shimohigashi Y, Fromen C, et al. Molecular evolution of snake toxins: is the functional diversity of snake toxins associated with a mechanism of accelerated evolution? Prog Nucleic Acid Res Mol Biol. 1998;59:307–64.
Dias GS, Kitano ES, Pagotto AH, Sant’anna SS, Rocha MM, Zelanis A, et al. Individual variability in the venom proteome of juvenile Bothrops jararaca specimens. J Proteome Res. 2013;12(10):4585–98.
Guo P, Malhotra A, Li PP, Pook CE, Creer S. New evidence on the phylogenetic position of the poorly known Asian pitviper Protobothrops kaulbacki (Serpentes : Viperidae : Crotalinae) with a redescription of the species and a revision of the genus Protobothrops. Herpetol J. 2007;17(4):237–46.
Katsuren S, Nishimura M, Nagata E, Ohama M. Numbers of Snakes Arriving Alive in Mass into Okinawa by Traders. Report by Okinawa Prefectural Institute of Health and Environment 1996, 30:133-136.
Nishimura M, Akamine H. Dispersal Range of an Alien Viperid Snake, Trimeresurus elegans Established in Southern Okinawa Island after the Escape in 1976 - Results of Preliminary Studies in 2002. Annual Report of Okinawa Prefectural Institute of Health and Environment 2002, 36:89-92.
Terada K. The distribution, population density and controls of Protobothrops mucrosquamatus, Protobothrops elegans, Elaphe taeniura friesei, three snake species established on Okinawa Island. Bull Herpetol Soc Jpn. 2011;2011(2):161–8.
Ota H. Geographic patterns of endemism and speciation in amphibians and reptiles of the Ryukyu archipelago, Japan, with special reference to their paleogeographical implications. Res Popul Ecol. 1998;40(2):189–204.
Ota H. The current geographic faunal pattern of reptiles and amphibians of the Ryukyu archipelago and adjacent regions. Tropics. 2000;10(1):51–62.
Osozawa S, Su Z-H, Oba Y, Yagi T, Watanabe Y, Wakabayashi J. Vicariant speciation due to 1.55 Ma isolation of the Ryukyu islands, Japan, based on geological and GenBank data. Entomol Sci. 2013;16:267–77.
Wuster W, Peppin L, Pook CE, Walker DE. A nesting of vipers: Phylogeny and historical biogeography of the Viperidae (Squamata: Serpentes). Mol Phylogenet Evol. 2008;49(2):445–59.
Zhao K, Zhou Y, Lin Z. Structure of basic phospholipase A2 from Agkistrodon halys Pallas: implications for its association, hemolytic and anticoagulant activities. Toxicon. 2000;38(7):901–16.
Fraenkel-Conrat H. Snake venom neurotoxins related to phospholipase A2. J Toxicol-Toxin Rev. 1982;1(2):205–21.
Aird SD, Kaiser II. Comparative studies on three rattlesnake toxins. Toxicon. 1985;23(3):361–74.
Boldrini-França J, Rodrigues RS, Fonseca FP, Menaldo DL, Ferreira FB, Henrique-Silva F, et al. Crotalus durissus collilineatus venom gland transcriptome: analysis of gene expression profile. Biochimie. 2009;91(5):586–95.
Tsai IH, Lu PJ, Wang YM, Ho CL, Liaw LL. Molecular cloning and characterization of a neurotoxic phospholipase A2 from the venom of Taiwan habu (Trimeresurus mucrosquamatus). Biochem J. 1995;311(Pt 3):895–900.
Kini RM, Evans HJ. A model to explain the pharmacological effects of snake venom phospholipases A2. Toxicon. 1989;27(6):613–35.
Kini RM. Excitement ahead: structure, function and mechanism of snake venom phospholipase A2 enzymes. Toxicon. 2003;42(8):827–40.
Cameron DL, Tu AT. Characterization of myotoxin a from the venom of prairie rattlesnake (Crotalus viridis viridis). Biochemistry. 1977;16(11):2546–53.
Francis B, Gutierrez JM, Lomonte B, Kaiser II. Myotoxin II from Bothrops asper (Terciopelo) venom is a lysine-49 phospholipase A2. Arch Biochem Biophys. 1991;284(2):352–9.
Cintra AC, Marangoni S, Oliveira B, Giglio JR. Bothropstoxin-I: amino acid sequence and function. J Protein Chem. 1993;12(1):57–64.
Angulo Y, Chaves E, Alape A, Rucavado A, Gutierrez JM, Lomonte B. Isolation and characterization of a myotoxic phospholipase A2 from the venom of the arboreal snake Bothriechis (Bothrops) schlegelii from Costa Rica. Arch Biochem Biophys. 1997;339(2):260–6.
Ikeda N, Chijiwa T, Matsubara K, Oda-Ueda N, Hattori S, Matsuda Y, et al. Unique structural characteristics and evolution of a cluster of venom phospholipase A(2) isozyme genes of Protobothrops flavoviridis snake. Gene. 2010.
Kinjoh K, Kosugi T, Nakamura M, Hanashiro K, Sunagawa M, Tokeshi Y, et al. Habutobin splits the Arg16-Gly17 bond in the A alpha chain of rabbit fibrinogen. Thromb Haemost. 1997;77(6):1127–8.
Nejime T, Kinjoh K, Nakamura M, Hanashiro K, Sunagawa M, Eguchi Y, et al. Habutobin recognizes Thr(7) in the sequence of fibrinopeptide A of rabbit fibrinogen. Toxicon. 2000;38(8):1029–41.
Nakamura M, Kinjoh K, Miyagi C, Oka U, Sunagawa M, Yamashita S, et al. Pharmacokinetics of habutobin in rabbits. Toxicon. 1995;33(9):1201–6.
Sunagawa M, Hanashiro K, Nakamura M, Kosugi T. Habutobin releases plasminogen activator (U-PA) from bovine pulmonary artery endothelial cells. Toxicon. 1996;34(6):691–9.
Yamamoto C, Tsuru D, Oda-Ueda N, Ohno M, Hattori S, Kim ST. Flavoxobin, a serine protease from Trimeresurus flavoviridis (habu snake) venom, independently cleaves Arg726-Ser727 of human C3 and acts as a novel, heterologous C3 convertase. Immunology. 2002;107(1):111–7.
Zhang Y, Wisner A, Xiong Y, Bon C. A novel plasminogen activator from snake venom. Purification, characterization, and molecular cloning. J Biol Chem. 1995;270(17):10246–55.
Gladner JA, Murtaugh PA, Folk JE, Laki K. Nature of peptides released by thrombin. Ann N Y Acad Sci. 1963;104:47–52.
Osbahr Jr AJ, Colman RW, Gladner JA, Laki K. The nature of the peptides released from canine fibrinogen. Biochem Biophys Res Commun. 1964;14:555–8.
Rojnuckarin P, Intragumtornchai T, Sattapiboon R, Muanpasitporn C, Pakmanee N, Khow O, et al. The effects of green pit viper (Trimeresurus albolabris and Trimeresurus macrops) venom on the fibrinolytic system in human. Toxicon. 1999;37(5):743–55.
Markland FS, Kettner C, Schiffman S, Shaw E, Bajwa SS, Reddy KN, et al. Kallikrein-like activity of crotalase, a snake venom enzyme that clots fibrinogen. Proc Natl Acad Sci U S A. 1982;79(6):1688–92.
Zaganelli GL, Zaganelli MG, Magalhaes A, Diniz CR, de Lima ME. Purification and characterization of a fibrinogen-clotting enzyme from the venom of jararacucu (Bothrops jararacussu). Toxicon. 1996;34(7):807–19.
Serrano SM, Hagiwara Y, Murayama N, Higuchi S, Mentele R, Sampaio CA, et al. Purification and characterization of a kinin-releasing and fibrinogen-clotting serine proteinase (KN-BJ) from the venom of Bothrops jararaca, and molecular cloning and sequence analysis of its cDNA. Eur J Biochem. 1998;251(3):845–53.
Komori Y, Tatematsu R, Tanida S, Nikai T. Thrombin-like enzyme, flavovilase, with kinin-releasing activity from Trimeresurus flavoviridis (habu) venom. J Nat Toxins. 2001;10(3):239–48.
Jia YH, Jin Y, Lu QM, Li DS, Wang WY, Xiong YL. Jerdonase, a novel serine protease with kinin-releasing and fibrinogenolytic activity from Trimeresurus jerdonii venom. Sheng Wu Hua Xue Yu Sheng Wu Wu Li Xue Bao (Shanghai). 2003;35(8):689–94.
Oyama E, Takahashi H. Substrate specificity of two thrombin like enzymes (elegaxobin, elegaxobin II) from the venom of Trimeresurus elegans (Sakishima-habu), using neutralizing antibody. Toxicon. 2006;48(6):601–10.
Bajwa SS, Markland FS, Russell FE. Fibrinolytic enzyme(s) in western diamondback rattlesnake (Crotalus atrox) venom. Toxicon. 1980;18(3):285–90.
Retzios AD, Markland Jr FS. A direct-acting fibrinolytic enzyme from the venom of Agkistrodon contortrix contortrix: effects on various components of the human blood coagulation and fibrinolysis systems. Thromb Res. 1988;52(6):541–52.
Wei WL, Sun JJ, Chen JS. Synergism of procoagulation effect of thrombin-like enzymes from Deinagkistrodon acutus and Agkistrodon halys snake venoms. Zhongguo Yao Li Xue Bao. 1996;17(6):527–31.
Santoro ML, Sano-Martins IS. Different clotting mechanisms of Bothrops jararaca snake venom on human and rabbit plasmas. Toxicon. 1993;31(6):733–42.
Pirkle H, Markland FS, Theodor I. Thrombin-like enzymes of snake venoms: actions on prothrombin. Thromb Res. 1976;8(5):619–27.
Swenson S, Markland Jr FS. Snake venom fibrin(ogen)olytic enzymes. Toxicon. 2005;45(8):1021–39.
Kamiguti AS, Cardoso JL. Haemostatic changes caused by the venoms of South American snakes. Toxicon. 1989;27(9):955–63.
Hoover-Plow J. Does plasmin have anticoagulant activity? Vasc Health Risk Manag. 2010;6:199–205.
Fox JW, Serrano SM. Structural considerations of the snake venom metalloproteinases, key members of the M12 reprolysin family of metalloproteinases. Toxicon. 2005;45(8):969–85.
Maeno H. Biochemical analysis of pathological lesions caused by habu snake venom with special reference to hemorrhage. J Biochem. 1962;52(5):343–50.
Yoshikura H, Ogawa H, Osaka A, Omori-Sato T. Action of Trimeresurus flavoviridis venom and the partially purified hemorrhagic principles on animal cells cultivated in vitro. Toxicon. 1966;4(3):183–90.
Homma M, Kosuge T, Okonogi T, Hattori Z, Sawai Y. A histopathological study on arterial lesions caused by Habu (Trimeresurus flavoviridis) venom. Jpn J Exp Med. 1967;37(4):323–36.
Tsuchiya M, Oshio C, Ohashi M, Fujishiro Y. Cinematographic analysis of the hemorrhage induced by the venom of Trimeresurus flavoviridis. Bibl Anat. 1975;13:192–3.
Omori-Satoh T, Ohsaka A. Purification and some properties of hemorrhagic principle I in the venom of Trimeresurus flavoviridis. Biochim Biophys Acta. 1970;207(3):432–44.
Takahashi T, Osaka A. Purification and some properties of two hemorrhagic principles (HR2a and HR2b) in the venom of Trimeresurus flavoviridis; complete separation of the principles from proteolytic activity. Biochim Biophys Acta. 1970;207(1):65–75.
Omori-Satoh T, Sadahiro S. Resolution of the main hemorrhagic principle of Trimeresurus flavoviridis venom into two parts, HRIA and HRIB [proceedings]. Jpn J Med Sci Biol. 1978;31(2):205.
Nikai T, Niikawa M, Komori Y, Sekoguchi S, Sugihara H. Proof of proteolytic activity of hemorrhagic toxins, HR-2a and HR-2b, from Trimeresurus flavoviridis venom. Int J Biochem. 1987;19(3):221–6.
Omori-Satoh T, Sadahiro S. Resolution of the major hemorrhagic component of Trimeresurus flavoviridis venom into two parts. Biochim Biophys Acta. 1979;580(2):392–404.
Takeya H, Oda K, Miyata T, Omori-Satoh T, Iwanaga S. The complete amino acid sequence of the high molecular mass hemorrhagic protein HR1B isolated from the venom of Trimeresurus flavoviridis. J Biol Chem. 1990;265(27):16068–73.
Yonaha K, Iha M, Tomihara Y, Nozaki M, Yamakawa M. Characterization of three hemorrhagic factors from the venom of Okinawa habu (Trimeresurus flavoviridis). Toxicon. 1991;29(6):703–11.
Kishimoto M, Takahashi T. Molecular cloning of HR1a and HR1b, high molecular hemorrhagic factors, from Trimeresurus flavoviridis venom. Toxicon. 2002;40(9):1369–75.
Masuda S, Hayashi H, Atoda H, Morita T, Araki S. Purification, cDNA cloning and characterization of the vascular apoptosis-inducing protein, HV1, from Trimeresurus flavoviridis. Eur J Biochem. 2001;268(11):3339–45.
Masuda S, Ohta T, Kaji K, Fox JW, Hayashi H, Araki S. cDNA cloning and characterization of vascular apoptosis-inducing protein 1. Biochem Biophys Res Commun. 2000;278(1):197–204.
Wu WB, Chang SC, Liau MY, Huang TF. Purification, molecular cloning and mechanism of action of graminelysin I, a snake-venom-derived metalloproteinase that induces apoptosis of human endothelial cells. Biochem J. 2001;357(Pt 3):719–28.
Atoda H, Morita T. A novel blood coagulation factor IX/factor X-binding protein with anticoagulant activity from the venom of Trimeresurus flavoviridis (Habu snake): isolation and characterization. J Biochem. 1989;106(5):808–13.
Matsuzaki R, Yoshiara E, Yamada M, Shima K, Atoda H, Morita T. cDNA cloning of IX/X-BP, a heterogeneous two-chain anticoagulant protein from snake venom. Biochem Biophys Res Commun. 1996;220(2):382–7.
Fisher RA. The genetical theory of natural selection. Oxford: Claredon Press; 1930.
Bieber AL. Metal and nonprotein constituents in snake venoms. In: Lee C-Y, editor. Snake venoms, vol. 52. Berlin: Springer; 1979. p. 295–306.
McCue MD, Mason R. Cost of producing venom in three North American pitviper species. Copeia. 2006;2006(4):818–25.
Nakashima K, Ogawa T, Oda N, Hattori M, Sakaki Y, Kihara H, et al. Accelerated evolution of Trimeresurus flavoviridis venom gland phospholipase A2 isozymes. Proc Natl Acad Sci U S A. 1993;90(13):5964–8.
Krylov DM, Wolf YI, Rogozin IB, Koonin EV. Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution. Genome Res. 2003;13(10):2229–35.
Hahn MW, Kern AD. Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. Mol Biol Evol. 2005;22(4):803–6.
Sasa M. Diet and snake venom evolution: can local selection alone explain intraspecific venom variation? Toxicon. 1999;37(2):249–52. author reply 253-260.
Tin MM, Economo EP, Mikheyev AS. Sequencing degraded DNA from non-destructively sampled museum specimens for RAD-tagging and low-coverage shotgun phylogenetics. PLoS ONE. 2014;9(5):e96793.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.
Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011;29:644–52.
Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011;27(21):2987–93.
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357–60.
Archer JP, Whiteley G, Casewell NR, Harrison RA, Wagstaff SC. VTBuilder: a tool for the assembly of multi isoform transcriptomes. BMC Bioinformatics. 2014;15(1):389.
Gouzy J, Carrere S, Schiex T. FrameDP: sensitive peptide detection on noisy matured sequences. Bioinformatics. 2009;25(5):670–1.
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–7.
Suyama M, Torrents D, Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34(Web Server issue):W609–612.
Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24(8):1586–91.
Rokyta DR, Lemmon AR, Margres MJ, Aronow K. The venom-gland transcriptome of the eastern diamondback rattlesnake (Crotalus adamanteus). BMC Genomics. 2012;13:312.
Rokyta DR, Wray KP, McGivern JJ, Margres MJ. The transcriptomic and proteomic basis for the evolution of a novel venom phenotype within the Timber Rattlesnake (Crotalus horridus). Toxicon. 2015;98C:34–48.
Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22(21):2688–90.
Pond SL, Frost SD. Datamonkey: rapid detection of selective pressure on individual sites of codon alignments. Bioinformatics. 2005;21(10):2531–3.
Kosakovsky Pond SL, Posada D, Gravenor MB, Woelk CH, Frost SD. Automated phylogenetic detection of recombination using a genetic algorithm. Mol Biol Evol. 2006;23(10):1891–901.
Scheffler K, Martin DP, Seoighe C. Robust inference of positive selection from recombining coding sequences. Bioinformatics. 2006;22(20):2493–9.
Delport W, Poon AF, Frost SD, Kosakovsky Pond SL. Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics. 2010;26(19):2455–7.
Murrell B, Wertheim JO, Moola S, Weighill T, Scheffler K, Kosakovsky Pond SL. Detecting individual sites subject to episodic diversifying selection. PLoS Genet. 2012;8(7):e1002764.
Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat Protoc. 2013;8(8):1494–512.
This work was made possible by generous support of the Okinawa Institute of Science and Technology Graduate University (OIST) to ASM, and by the OIST internship program for supporting the research visit of SA. We are very grateful to Risa Sato for professional translation and interpretation that enabled numerous aspects of this collaboration despite linguistic barriers. We also express sincere thanks to Miguel Grau Lopez for uploading sequences to the DNA Data Bank of Japan and for running the VTBuilder assemblies.
The authors declare that they have no competing interests.
KT collected and maintained the snakes and assisted with venom extractions. SDA milked the snakes and extracted the glands. SDA and SA developed techniques and performed protein cleavages. MMT created and sequenced the cDNA libraries. AVB performed the mass spectrometry and analyzed the mass spectrometric data. ASM performed the bioinformatics data analyses. SDA, ASM, MMT, and AVB wrote the manuscript. All authors reviewed and edited it. All authors read and approved the final manuscript.
Supplementary Table 3.
Transcript details for the Protobothrops elegans transcriptome. Complete transcripts are highlighted in blue; incomplete transcripts are in yellow. Relative transcript abundance is provided in the column “FPKM %.” These data provide the most quantitative estimates of venom composition; however, some of the incomplete transcripts pertaining to the same protein families (e.g., metalloproteases) may, in fact, be the same protein, if the transcripts were completely nonoverlapping. (PDF 132 kb)
Transcript details for the Protobothrops flavoviridis transcriptome. Details as for Additional file 2: Table S1. (PDF 130 kb)
Phospholipases A2 in venoms of P. elegans, P. flavoviridis, and hybrids between the two. A. Myotoxic PLA2s. Catalytic PLA2s have a critical Asp residue in position 48 (See panels B-D). All of these enzymes manifest a basic residue in this position, preventing the binding of a critical Ca2+ ion involved in catalysis. However, while most New World crotaline myotoxins have Lys at this position, P. elegans and some other Asian crotalines have substituted Arg. Partial transcript comp552_c0_seq1 from P. flavoviridis is also a myotoxin, but it is a minor venom component (Additional file 3: Table S2). Signal peptides are shown to the left of the vertical black line. B. Acidic PLA2s. These enzymes are strongly lipolytic and myotoxic, though much less myotoxic than the non-catalytic myotoxins shown in panel D. C. Moderately basic PLA2s. P. flavoviridis PLA2 1 is most similar to PL-Y, PL-X, and PLA-B. Like those in panel B, these enzymes are strongly lipolytic and moderately myotoxic. PLA-B induces edema . D. Weakly neurotoxic PLA2s. The partial transcript, P. flavoviridis comp48_c0_seq1 is identical, as far as can be seen to PLA-N(O) . (PDF 140 kb)
Mean frequency per specimen with which individual amino acid residues in the noncatalytic, myotoxic P. elegans PLA2, comp43_c0_seq1, were sequenced in P. elegans and hybrid venoms. Not only were most of the same peptides sequenced in these venoms, but they were sequenced with the same relative frequencies. (PDF 234 kb)
Hybrids express most proteins found in the two parental species. Every colored node corresponds to a transcript and is color-coded by toxin family. The toxins are connected to black nodes labeled with species identities, by edges whose thickness corresponds to the protein expression level in that species. Expression values are given as a percentage of the total, averaged across all samples, and range from less than 0.001 to 19 %. More detail on expression levels of individual toxins is presented in Fig 3. Virtually all toxins are expressed in the hybrids, except for a few venom components, which are poorly expressed in the parental species, and may have been missed in the hybrids due stochasticity involved in mass spectrometric detection. (PNG 1018 kb)
Manual identification of CID fragment ions from a composite spectrum using >40 MS2 spectra from the same chromatographic peak associated with Protobothrops elegans transcript comp103_c0_seq1. Annotation was performed using the PEAKS studio 7.0 de novo sequencing module, followed by a search with the same database used for Mascot searches, and manual annotation to ascertain correctness. This process confirmed the sequence of the peptide and that Frame 1 is correct frame for translation. (PDF 320 kb)
Venom peptides sequenced from both parental taxa and hybrids pertaining to transcripts for the homologous, basic, catalytic PLA2s, P. elegans comp47_c0_seq1 and P. flavoviridis comp41_c0_seq1. Hybrids inherited both homologs. (PDF 12 kb)
Supplementary Material. (DOCX 70 kb)