- Research article
- Open Access
Transcriptome analysis of the venom gland of the scorpion Scorpiops jendeki: implication for the evolution of the scorpion venom arsenal
BMC Genomicsvolume 10, Article number: 290 (2009)
The family Euscorpiidae, which covers Europe, Asia, Africa, and America, is one of the most widely distributed scorpion groups. However, no studies have been conducted on the venom of a Euscorpiidae species yet. In this work, we performed a transcriptomic approach for characterizing the venom components from a Euscorpiidae scorpion, Scorpiops jendeki.
There are ten known types of venom peptides and proteins obtained from Scorpiops jendeki. Great diversity is observed in primary sequences of most highly expressed types. The most highly expressed types are cytolytic peptides and serine proteases. Neurotoxins specific for sodium channels, which are major groups of venom components from Buthidae scorpions, are not detected in this study. In addition to those known types of venom peptides and proteins, we also obtain nine atypical types of venom molecules which haven't been observed in any other scorpion species studied to date.
This work provides the first set of cDNAs from Scorpiops jendeki, and one of the few transcriptomic analyses from a scorpion. This allows the characterization of a large number of venom molecules, belonging to either known or atypical types of scorpion venom peptides and proteins. Besides, our work could provide some clues to the evolution of the scorpion venom arsenal by comparison with venom data from other scorpion lineages.
Based on cladistic morphological analysis, the extant scorpions can be phylogenetically divided into 14 families. All scorpions possess a homologous venom apparatus which consists of the vesicle holding a pair of venom glands and the hypodermic aculeus used to inject the venom. Scorpion venom is a combinatorial library of peptides and proteins which could cause toxicological responses and can be candidates for drug design and development. The general compositions of scorpion venoms vary among different families. For instance, in a comparative LC/MS analysis of two scorpion species from the families Buthidae and Ischnuridae, vast abundance difference was observed in venom components with molecular size from 5000 to 10,000 Da. Furthermore, such differences in venom compositions could also be observed from genus to genus, and even between different species within a genus[5, 6].
Hundreds of venom peptides and proteins have been characterized from various scorpion species. It is noteworthy that most of these venom molecules are obtained by either bioassay-guided fractionation or PCR-based methods conducted with cDNA libraries. Due to their medical importance, most research performed to date has focused on Buthidae scorpions. Buthid venoms mainly consist of four different families of neurotoxins which specifically target ion channels, including sodium channels, potassium channels, chloride channels, and calcium channels [8–10]. However, in contrary to buthids, little attention has been paid to the other thirteen non-Buthidae families. As several classes of venom peptides and proteins from non-Buthidae scorpions were shown to possess unique primary sequences and biological activity, it is worth exploring the venom compositions of non-Buthidae scorpions.
The scorpion Scorpiops jendeki is distributed in Yunnan province, Southwest China. It was once considered to be a member of the family Scorpiopidae, but now it is classified into the family Euscorpiidae after a very thorough phylogenetic analysis. The Euscorpiidae family is among the most widely distributed groups of extant scorpions, and it covers Europe, Asia, Africa, and America. Euscorpiids are considered to be harmless scorpions which possess no threat to human health. So far, euscorpiid venoms haven't been studied yet.
Different from bioassay-guided isolation, an "-ome" approach such as transcriptomic or proteomic analysis could help uncover the real diversity of scorpion venom components. Not only known types of venom peptides and proteins but also atypical venom molecules could be obtained by such an approach. Until now, proteomic studies have been employed in assessing the diversity of venom compositions from several scorpion species. Only one transcriptomic analysis has been conducted on the venom gland of a scorpion. An extensive knowledge of venom compositions from different scorpion species is helpful in understanding the envenomation and providing candidate molecules for drug development. Furthermore, comparative analysis of venom constituents from different scorpion lineages could also provide a clue to the evolutionary track of scorpion venom arsenal, as illustrated in the snake venom systems [14–16].
In this work, we carried an EST approach to overview the transcriptome of the Scorpiops jendeki venom gland. A great number of venom peptides and proteins, belonging to known and atypical toxin types, were identified through the first transcriptome study on the venom gland of a Euscorpiidae scorpion. Besides, venom data comparison among different scorpion lineages provides some clues to the evolutionary track of the scorpion venom arsenal.
EST sequencing and clustering
The titer of the non-amplified cDNA library is 3.5 × 106 cfu/ml with more than 98% recombination efficiency. The random sequencing of this library gave readable sequences for a total of 871 clones. After being processed as described in the "Materials and Methods" section, the high quality EST sequences were submitted into the dbEST (accession numbers: GH547439–GH548309). The average length of these processed sequences was 546 bp. After being grouped with stringent parameters, the ESTs formed 293 clusters of unique sequences, including 199 singletons and 94 contigs consisting of two or more ESTs (Figure 1). In this study, the terms "contig" and "singleton" were used as the same meaning as described in Egassembler.
To attempt a functional classification of these unique sequences, we compared the consensual cluster sequences against SWISS-PROT and GenBank NCBI databases by BLAST algorithms. 208 clusters (644 ESTs) provided significant hits (Expect value < e-4), whereas the other 85 clusters (227 ESTs) hadn't good matches (Table 1). Among the matched clones, 59 clusters (445 ESTs) are deduced to be secretory peptides and proteins. For the non-match set, the longest ORFs from each cluster were predicted and screened for possible signal peptides. Among the non-matched clones, 39 clusters (175 ESTs) are supposed to possess a signal peptide, and 26 clusters (27 ESTs) haven't ORF found. Since the cDNA library was not amplified, the clone number was expected to reflect the actual prevalence of a given transcript in the original biological sample. So transcripts related to secretory proteins, including venom peptides and other physiological proteins, make up more than 70% of total ESTs of the Scorpiops jendeki venom gland.
Known toxin types
known toxin types have been characterized from the scorpion Scorpiops jendeki. They are encoded by 359 ESTs (33 clusters), accounting for approximately 40% of the total venom gland transcripts (Table 1).
α-KTxs have a wide phylogenetic distribution, and have been obtained from almost all scorpion species studied so far. The newly identified α-KTxs are encoded by seven clusters (six contigs and one singleton, 45 ESTs). Among them, three clusters (SJE076C, SJE093C and SJE094C) code for α-KTxs contrained by 3 disulfide bridges, whereas the other clusters for α-KTxs with four disulfide bridges (Figure 2). For some toxins, the fourth disulfide bridge is of great importance in reaching the correct bioactive conformation. They share the Toxin_2 domain (Pfam: PF00451) with other previously characterized scorpion short-chain toxins which could act on shaker-related channels or Ca2+-activated K+-channels.
It is noteworthy that SJE042C consists of two almost identical ESTs differing by only a few nucleotides. The translated sequences are named SJE042C.1 and SJE042C.2, respectively. Similar phenomenon has also been extensively observed in other types of venom peptides and proteins discussed followingly. The possibility that these minor differences are derived in the course of cDNA library construction and sequencing could be excluded, as the phenomenon can hardly be observed in the clusters encoding common cellular proteins. Such subtle differences in EST sequences reflect the polymorphism of scorpion venom peptide genes.
Interestingly, although SJE009C have four disulfide bridges, it shows closer relationship with SJE093C and SJE094C, the α-KTxs with three disulfide bridges. This highlights the evolutionary relationship between α-KTxs with 3 disulfide bridges and those with 4 disulfide bridges.
Due to poor knowledge of their functions, scorpine-like peptides were once classified into "orphan" venom components. Several recent studies have demonstrated that scorpines possess anti-malaria and antimicrobial activities[22, 23]. Besides, they can also function as K+ channel blockers. Two clusters of scorpine-like peptides, SJE005C and SJE056C, were identified in this work (Figure 3A). They show great similarity with scorpine-like peptides obtained from other scorpion species, such as Heterometrus laoticus and Hadrurus gertschi[13, 25].
Scorpine-like peptides show obvious sequence similarity to β-family of KTxs. But distinct to β-KTxs, they don't possess a putative short pro-sequence following the signal peptide. Until now, all scorpine-like peptides are exclusively obtained from non-Buthidae scorpions, whereas all β-KTxs are from Buthidae scorpions (Figure 3B). The Scorpine-like peptide Tco 41.46-2, which is isolated from Tityus costatus (Buthidae), should be classified into β-KTxs, based on sequence similarity and the presence of a pro-peptide. As scorpion neurotoxins are paralogous genes of defensins, scorpion defensins were used to root the phylogeny tree[26, 27]. The reconstructed phylogeny relationship strongly suggests that β-KTxs and scorpine-like peptides share a common ancestor before the lineage split between Buthidae and the non-Buthidae families. After the lineage split, β-KTxs and scorpine-like peptides evolve independently in different scorpion families.
Calcines can act against ryanodine receptors, a type of intracellular endoplasmic/SR (sarcoplasmic reticulum) calcium release channels distributed in cardiac and skeletal muscle[28, 29]. They penetrate into the cell via interaction with membrane lipids. Structurally, they are characterized to harbor an inhibitor cystine knot fold, which is shared by a large number of polypeptides from diverse animal species[31, 32]. Based on their cell-penetrating ability, calcines have been used as a non-toxic drug carrier to overcomes drug resistance in cancer therapy. In this study, one cluster (SJE010C, 51 ESTs) were identified to encode calcines (Figure 4). There are five variants (SJE010C.1-SJE010C.5) with subtle differences. Interestingly, the cysteine pattern has been changed in SJE010C.1. These newly identified calcines also harbor the Toxin_27 domain (Pfam: PF08099).
The first cytolytic linear peptide, named IsCT, was got from the scorpion Opisthacanthus madagascariensis, a member of the family Scorpionidae[34, 35]. Then this type of venom peptides were later found in the scorpion Mesobuthus martensii (Buthidae). Their precursors consist of a signal peptide, a mature peptide and a C-terminal propeptide rich in acidic amino acids. Cytolytic peptides possess broad activity spectra against microbes and hemolytic activity. They are suggested to lyse cell membranes via pore formation or destabilization of membrane phospholipid packing, based on their amphiphilic α-helical structures.
In the transcriptome of the Scorpiops jendeki venom gland, cytolytic peptide precursors are the most highly expressed venom peptide transcripts. There are nine clusters (eight contigs and one singleton, 88 ESTs), representing approximately 10% of venom gland mRNAs. In contrast, in our previous investigation of Mesobuthus martensii venom, cytolytic peptides were observed at a rather low expression level (data not shown). Based on sequence similarity, the cytolytic peptides obtained in this study are divided into two clades: SJE020C, SJE063C and SJE122S form one clade; while the other clade consists of SJE007C, SJE026C, SJE046C, SJE048C, SJE072C and SJE086C(Figure 5). Translated sequences from each clade are almost identical in the signal peptide region, but rather variable in mature peptide and propeptide regions.
Trypsin inhibitor like (TIL) peptide
A trypsin inhibitor like venom peptide, BmKAPi, has previously been characterized from the scorpion Mesobuthus martensii. The trypsin inhibitor like domain (Pfam: PF01826) contains ten cysteine residues that form five disulphide bonds. However, the exact function of trypsin inhibitor like peptides from scorpion venoms hasn't been clarified. Four clusters (three contigs and one singleton, 15 ESTs) were identified to encode trypsin inhibitor like peptides (Figure 6).
Of note, SJE017C is almost identical to SJE037C, except for a 72 bp insertion into the former. Which molecular mechanism causes this phenomenon would depend on uncovering their genomic organizations and structures. Interestingly, a nonsense mutation in the 72 bp insertion of SJE017C results in a premature stop codon. Three ESTs in SJE017C represent different transcripts of the same gene, as they are not completely identical. So the possibility of an error in the sequencing is excluded. Resequencing these three clones further supports the nonsense mutation. So the cluster SJE017C may represent a pseudogene.
Secretory peptides with trypsin inhibitor like domain can also be found in the venom glands of mosquito[41–43]. They function as serine protease inhibitors or antimicrobial peptides[44, 45]. So convergent evolution has repeatedly selected genes coding for proteins containing the trypsin inhibitor like cysteine rich domain as templates for venom molecules.
The known lysozymes within the animal phyla are classified into 3 different types: chicken type (c-type), invertebrate type (i-type), goose-type (g-type). A c-type lysozyme has previously been partially sequenced in a proteomic analysis of the venom from the scorpion Tityus stigmurus. In this work, one cluster (SJE022C, 9 ESTs) was identified to code c-type lysozymes (Figure 7). They are greatly homologous to c-type lysozymes from other sources. Generally, lysozymes play an important defense role in the innate immunity. The exact biological role of lysozymes from scorpion venoms remains to be explored, as they have a relatively high expression level. As demonstrated in a previous report, lysozyme can also function as the termite egg recognition pheromone.
La1 is the most abundant venom peptide obtained from the scorpion Liocheles australasiae, which was once considered to be a member of the family Hemiscorpiidae, but now has been classified into the family Ischnuridae. Acturally, this type of venom peptides was firstly characterized from the scorpion Mesobuthus martensii at the transcript level. Until now, there have been no clues to their biological function. This work revealed six clusters of La1-like peptides, including four contigs and two singletons (Figure 8). In terms of primary sequence similarity and the position of eight cysteines, they are homologous to several known peptides, including secretory peptides from the salivary gland of Ixodes scapularis ticks. This demonstrates that La1-like peptides have an ancient origin.
Opistoporin like peptide
The cluster SJE051C is identified to encode an antimicrobial peptide which shares the Antimicrobial_7 domain (Pfam: PF08102) with opistoporins and pandinin (Figure 9). Opistoporins are antimicrobial peptides isolated from the venom of the South-African scorpion Opistophtalmus carinatus, whereas pandinin is from the scorpion Pandinus imperator[51, 52]. These peptides form essentially amphipathic helical structures and demonstrate high antimicrobial efficiency against Gram-negative and Gram-positive bacteria. Besides, it is also homologous to BmKbpp, which is a bradykinin-potentiating peptide obtained from the Chinese scorpion Mesobuthus martensii.
Anionic peptides have previously been characterized from Mesobuthus martensii and Tityus costatus, two scorpion species from the family Buthidae[36, 54]. As the name suggests, this type of venom peptides are rich in acidic amino acid residues (aspartic acid and glutamic acid). A cluster (SJE089C, 2 ESTs) was identified to encode anionic peptides (Figure 10). It is not clear what their biological role is. As the vast majority of scorpion venom peptides are basic, anionic peptides are suggested to play a part in balancing the pH value of scorpion venom liquid.
SPSVs (serine proteases from scorpion venoms)
To date, most studies performed on scorpion venoms have focused on isolation and characterization of neurotoxins and antimicrobial peptides. Although proteolytic enzyme activities have been detected in the venom of several scorpion species for a long time[55, 56], the first serine proteinase-like protein has recently been purified and partially sequenced in a screen for drug candidates targeting cancer cells. Two clusters (SJE003C and SJE030C, 78 ESTs) were identified to encode serine proteases from scorpion venoms, here named SPSVs (Figure 11). As their precursors are composed of more than 200 amino acid residues, they represent important parts of the venom proteins with high molecular weight (>20 KDa). SPSVs may be involved in post-translational processing of other venom peptides, and can also function as "spreading factors" in order to facilitate the spread of other venom peptides.
The atypical possible toxin types
In addition to those known types of venom peptides and proteins as described above, there are also several clusters supposed to encode novel venom peptide types, base on their high expression level and the presence of the signal peptide.
A highly expressed type of venom peptides was identified to be encoded by clusters SJE002C and SJE021C containing 37 and 22 ESTs each (Figure 12). Here we named them jendins. They have no hit found against any public database, indicating that jendins are an atypical peptide types from scorpion venoms. Jendin precursors consist of a signal sequence of 23 residues and a premature peptide of 37 residues. The premature peptide has a typical processing signal (Gly-Lys-Arg) at positions 14–16. It remains to be explored whether jendins have a similar post-translational processing as cytolytic peptides[34, 58]. Furthermore, their biological function remains to be investigated.
Besides, there are several medium-abundant clusters which are deduced to encode eight novel types of scorpion venom peptides [see Additional file 1]. They are either cysteine-free or cysteine-rich. Similar to jendins, they have not homologs found from public database. The presence of atypical venom peptides and proteins indicates that scorpion venoms are a rather complex pool, and multiple currently unkown types of venom peptides and proteins remain to be characterized from different scorpion lineages.
Common cellular protein ESTs
The scorpion venom gland is a specialized organ for synthesizing and secreting venom components. As demonstrated in Scorpiops jendeki, transcripts for different types of venom peptides and proteins account for more than 50% of the full transcriptome. So it is interesting to overview the physiological state of the venom gland when it highly expresses venom peptides and proteins.
Among the matched non-toxin transcripts, 153 clusters (260 ESTs) have their physiological function found (Figure 13). Most of these clusters consist of only one or a few ESTs. Although the limited sequencing data of this study is far from the complete description of Scorpiops jendeki venom gland, it could be used to roughly estimate the ralative redundance of each category. Genes, which are involved in RNA transcription and especially protein metabolism, are highly expressed in the Scorpiops jendeki venom gland. The molecules related to protein metabolism are mainly diverse kinds of ribosomal proteins responsible for protein synthesis. Besides, protein synthesis and other metabolic process are highly energy-consuming, and protein processing and transporting is also intense for the newly-synthesized venom peptides. Accordingly, high expression levels are also observed in the gene sets within the transport category which are mainly responsible for the energy generation and protein sorting.
During more than 400 million years of evolution, scorpions have developed an efficient venom arsenal, composed of extremely diverse active components, to prey captures and deter competitors. The venom molecules are able to induce both toxicological and immunological responses, and also offer a tremendous resource for use in drug development. Usually transcriptome or proteome approach is employed to explore the complexity of venom components. Several recent studies performed on many venomous species demonstrate that venom proteome and transcriptome depart in their relative abundances of different toxin families[59, 60]. However, the ESTs-based transcriptome strategy has been shown to be effective in uncovering the real diversity of venom compositions[13, 61]. Not only sequences of known toxin types but also atypical venom molecules could be characterized by such a transcriptomic approach.
In this work, we have employed a transcriptomic approach to investigate possible venom components from the scorpion Scorpiops jendeki. Before RNA extraction, the scorpion specimens are milked by electrical stimulation. So the gene expression profiling obtained in this work represents the activated-state transcription of the venom glands. The transcripts for possible venom constitutes make up approximately 50% of the Scorpiops jendeki transcirptome. It is much higher than that observed for the scorpion Hadrurus gertschi (approximately 30%). Such difference may be attributed to genetic variations. This work could be used in comparative studies of gene expression profiling among different scorpion species.
Among different scorpion venoms, there are great variability in proportion of different types of venom peptides and proteins. A previous study conducted a comparative proteomic analysis of scorpion venom components with the method of mass finger print comparison among three different Tityus venoms. It shows that the proportion of molecular weight distribution is rather variable among Tityus cambridgei, Tityus costatus and Tityus discrepans. Until now, there is only one transcriptome study of scorpion venom glands. In the transcriptome of the Hadrurus gertschi venom gland, α-KTxs and scorpine-like peptides are most highly expressed, accounting 17.7% of the total ESTs. However, the most prevalent types of venom peptides and proteins are cytolytic peptides and SPSVs in Scorpiops jendeki. Approximately 19% of the total ESTs encode for the precursors of these two types of molecules. It is noteworthy that the four types (SPSVs, La1-like peptides, calcines, and jendins), with a high expression level in Scorpiops jendeki, were not detected in Hadrurus gertschi at all. Although different types of venom molecules couldn't arise in proteins at the same level of their mRNAs, we could definitely conclude that there is great difference in venom compositions between Scorpiops jendeki and Hadrurus gertschi. Furthermore, the venom compositions of Scorpiops jendeki must be different from that of Buthidae scorpions, whose major groups of venom constitutes are neurotoxins affecting Na+ channels (NaScTxs) and K+ channels (KTxs).
Great diversity has also been observed in primary sequences of most highly expressed venom peptides and proteins. We can exclude the possibility that such diversity is caused by the artifact in cDNA library construction or DNA sequencing. A negative control is that 31 ESTs from SJE009C encode one identical translated sequence. Such diversity may mainly be attributed to variations in scorpion population, as the cDNA library was constructed with the RNA extracted from about 50 specimens. However, a previous study demonstrates that such polymorphism could also arise at the level of individual scorpion. Whatever, such diversity extensively observed in different types of venom peptides and proteins reflects the dynamic process of diversification. It is beneficial for the survival of scorpions, as the more and more complex venom arsenal could meet their demands for interaction with their prey, predators, and competitors.
The most striking observation of this study is the absence of NaScTxs in Scorpiops jendeki. This phenomenon has also been observed in the non-Buthidae scorpion Hadrurus gertschi (Caraboctonidae), on which a transcriptomic analysis has been conducted. NaScTxs are peptides of 58–76 residues in length and characterized to possess a structure core, named Cysteine-Stabilized α/β motif (CS-αβ), tightly packed by three conserved disulfide bridges. They are a major group of venom components from Buthidae scorpions. NaScTxs and KTxs are suggested to evolve from a common progenitor, based their similarities in gene organizations, intron features and structure cores. But their evolutionary history is difficult to reconstruct, due to high diversity of each toxin types[63, 64]. Similar to NaScTxs, KTxs are also defined by the presence of the conserved CS-αβ motif. Distinct to NaScTxs, KTxs have been obtained from most scorpion species, both Buthidae and non-Buthidae, currently under investigated. The difference between the phylogeny distribution of NaScTxs and KTxs could provide some clues to their evolutionary relationship.
Until now, many types of venom peptides and proteins have been obtained from diverse scorpion species. Some types are found to be widely distributed among scorpion species from different families, in case of α-KTxs. However, some other types appear to be restricted to particular scorpion lineages. For instance, jendins haven't been detected in other scorpion species. Scorpine-like peptides have not been obtained from Buthidae scorpions, although some Buthidae scorpion species have been extensively studied. So far transcriptome studies are lacking even for the medically imprtant Buthidae scorpions. However, this work implies that the presence of additional, atypical toxin types in many scorpion lineages is most likely. The presence of these common and uncommon venom molecules among different lineages reflects the dynamic evolutionary process of the scorpion venom arsenal. In order to depict such a process, extensive studies should be conducted on diverse scorpion species, especially from the non-Buthidae families.
In conclusion, we conducted a transcriptomic analysis of Scorpiops jendeki venom gland. Scorpiops jendeki belong to the family Euscorpiidae whose venoms have never been investigated. So our work greatly expanded the current knowledge of scorpion venoms. We obtained ten known types and nine atypical types of venom peptides and proteins. These molecules provide a rich hitherto unexplored resource for drugdevelopment. Besides, some clues can be provided into the evolution of scorpion venom arsenal by comparing the presence of common and umcomon types of venom peptides and proteins among different scorpion lineages.
cDNA library construction
50 specimens of Scorpiops jendeki were collected in Yunnan province, Southwest China. They were milked 2 days before RNA isolation as described previously. Total RNA was extracted with TRIZOL Reagent (Invitrogen, Carlsbad, CA, USA), and then mRNA was purified with FastTrack 2.0 mRNA Isolation Kit(Invitrogen). The cDNA library was constructed from 5 μg of mRNA using the Creator SMART cDNA Library Construction Kit (Clontech Laboratories, Palo Alto, CA). cDNA inserts were directionally cloned into the plasmids pDNR-LIB digested by restriction enzymes Sfi IA and Sfi IB. The recombinant plasmids were transformed into electrocompenent Escherichia coli DH10B (Invitrogen).
To obtain an unbiased overview of the venom gland transcriptome, random colonies were selected and cultured in appropriate Luria Broth culture medium containing 30 μg/ml of chloramphenicol. After overnight culture, plasmid DNA was isolated using alkaline lysis method. Purified plasmids were single-pass sequenced on an ABI 3730xl sequencer using the standard M13 forward primer and BigDye terminator v3.1 cycle sequencing kit (Applied Biosystems, Foster City, CA, USA).
The trace files of sequenced clones were subjected to Phred program, the cutoff Phred score was set to 40. After these sequences were strictly trimmed, the got high-quality sequences were processed on the website EGassembler http://egassembler.hgc.jp/ with the default parameter. Vector and adaptor sequences were removed using the program Cross_Match. After removing the PolyA tail, we discarded those sequences shorter than 100 bp. The resulted sequences were deposited into the dbEST, and then assembled into clusters with the program CAP3.
Each cluster was annotated by being searched against SWISS-PROT http://www.expasy.org/tools/blast/ and GenBank NCBI database http://www.ncbi.nlm.nih.gov/blast with BLAST algorithms. After BLAST search, the unmatched clusters were further identified for open reading frames using the ORFfinder http://www.ncbi.nlm.nih.gov/projects/gorf/. Considering the extreme diversity of scorpion toxins, those clusters putative to encode venom peptides was reexamined manually to pick out individual different isoforms.
All clusters were checked for the existence of signal peptides using the SignalP 3.0 program http://www.cbs.dtu.dk/services/SignalP/. All types of venom peptides and proteins are annotated by searching against Pfam protein families database http://pfam.sanger.ac.uk/.
Alignment and phylogeny analysis
The sequences used for alignment and phylogeny analysis were retrieved from SWISS-PROT databsae http://www.expasy.org/tools/blast/. The alignment was performed by Clustal_X 1.83 software followed by manual adjustment, and viewed by the software Jalview. Phylogeny analysis was carried out with Neighbor joining method implemented in MEGA3.1.
Michael ES, Victor F: High-level systematics and phylogeny of the extant scorpions (Scorpiones: Orthosterni). Euscorpius. 2003, 11: 1-175.
David WS: The Biology of Scorpions. Edited by: Gary A Polis. 1990, Stanford: Stanford University Press
Menez A: Functional architectures of animal toxins: a clue to drug design?. Toxicon. 1998, 36 (11): 1557-1572. 10.1016/S0041-0101(98)00148-2.
Miyashita M, Otsuki J, Hanai Y, Nakagawa Y, Miyagawa H: Characterization of peptide components in the venom of thescorpion Liocheles australasiae (Hemiscorpiidae). Toxicon. 2007, 50 (3): 428-437. 10.1016/j.toxicon.2007.04.012.
Batista CV, Roman-Gonzalez SA, Salas-Castillo SP, Zamudio FZ, Gomez-Lagunas F, Possani LD: Proteomic analysis of the venom from the scorpion Tityus stigmurus: biochemical and physiological comparison with other Tityus species. Comp Biochem Physiol C Toxicol Pharmacol. 2007, 146 (1–2): 147-157. 10.1016/j.cbpc.2006.12.004.
Dyason K, Brandt W, Prendini L, Verdonck F, Tytgat J, du Plessis J, Muller G, Walt van der J: Determination of species-specific components in the venom of Parabuthus scorpions from southern Africa using matrix-assisted laser desorption time-of-flight mass spectrometry. Rapid Commun Mass Spectrom. 2002, 16 (8): 768-773. 10.1002/rcm.637.
He QY, He QZ, Deng XC, Yao L, Meng E, Liu ZH, Liang SP: ATDB: a uni-database platform for animal toxins. Nucleic acids research. 2008, D293-297. 36 Database
Rodriguez de la Vega RC, Possani LD: Current views on scorpion toxins specific for K+-channels. Toxicon. 2004, 43 (8): 865-875. 10.1016/j.toxicon.2004.03.022.
Rodriguez de la Vega RC, Possani LD: Overview of scorpion toxins specific for Na+ channels and related peptides: biodiversity, structure-function relationships and evolution. Toxicon. 2005, 46 (8): 831-844. 10.1016/j.toxicon.2005.09.006.
Kozminsky-Atias A, Bar-Shalom A, Mishmar D, Zilberberg N: Assembling an arsenal, the scorpion way. BMC evolutionary biology. 2008, 8 (1): 333-10.1186/1471-2148-8-333.
Kovarik F: Revision of family Scorpiopidae(Scorpiones), with descriptions of six new species. Acta Societats Zoologicae Bohemoslovenicae. 2000, 64: 153-201.
Batista CV, D'Suze G, Gomez-Lagunas F, Zamudio FZ, Encarnacion S, Sevcik C, Possani LD: Proteomic analysis of Tityus discrepans scorpion venom and amino acid sequence of novel toxins. Proteomics. 2006, 6 (12): 3718-3727. 10.1002/pmic.200500525.
Schwartz EF, Diego-Garcia E, Rodriguez de la Vega RC, Possani LD: Transcriptome analysis of the venom gland of the Mexicanscorpion Hadrurus gertschi (Arachnida: Scorpiones). BMC genomics. 2007, 8: 119-10.1186/1471-2164-8-119.
Fry BG, Scheib H, Weerd van der L, Young B, McNaughtan J, Ramjan SF, Vidal N, Poelmann RE, Norman JA: Evolution of an arsenal: structural and functional diversification of the venom system in the advanced snakes (Caenophidia). Mol Cell Proteomics. 2008, 7 (2): 215-246.
Fry BG, Wuster W: Assembling an arsenal: origin and evolution of the snake venom proteome inferred from phylogenetic analysis of toxin sequences. Molecular biology and evolution. 2004, 21 (5): 870-883. 10.1093/molbev/msh091.
Fry BG, Vidal N, Norman JA, Vonk FJ, Scheib H, Ramjan SF, Kuruppu S, Fung K, Hedges SB, Richardson MK, et al: Early evolution of the venom system in lizards and snakes. Nature. 2006, 439 (7076): 584-588. 10.1038/nature04328.
Masoudi-Nejad A, Tonomura K, Kawashima S, Moriya Y, Suzuki M, Itoh M, Kanehisa M, Endo T, Goto S: EGassembler: online bioinformatics service for large-scale processing, clustering and assembling ESTs and genomic DNA fragments. Nucleic acids research. 2006, W459-462. 10.1093/nar/gkl066. 34 Web Server
Carrega L, Mosbah A, Ferrat G, Beeton C, Andreotti N, Mansuelle P, Darbon H, De Waard M, Sabatier JM: The impact of the fourth disulfide bridge in scorpion toxins of the alpha-KTx6 subfamily. Proteins. 2005, 61 (4): 1010-1023. 10.1002/prot.20681.
Pi C, Liu J, Peng C, Liu Y, Jiang X, Zhao Y, Tang S, Wang L, Dong M, Chen S, et al: Diversity and evolution of conotoxins based on gene expression profiling of Conus litteratus. Genomics. 2006, 88 (6): 809-819. 10.1016/j.ygeno.2006.06.014.
Froy O, Sagiv T, Poreh M, Urbach D, Zilberberg N, Gurevitz M: Dynamic diversification from a putative common ancestor of scorpion toxins affecting sodium, potassium, and chloride channels. Journal of molecular evolution. 1999, 48 (2): 187-196. 10.1007/PL00006457.
Diego-Garcia E, Schwartz EF, D'Suze G, Gonzalez SA, Batista CV, Garcia BI, de la Vega RC, Possani LD: Wide phylogenetic distribution of Scorpine and long-chain beta-KTx-like peptides in scorpion venoms: identification of "orphan" components. Peptides. 2007, 28 (1): 31-37. 10.1016/j.peptides.2006.06.012.
Conde R, Zamudio FZ, Rodriguez MH, Possani LD: Scorpine, an anti-malaria and anti-bacterial agent purified from scorpion venom. FEBS letters. 2000, 471 (2–3): 165-168. 10.1016/S0014-5793(00)01384-3.
Carballar-Lejarazu R, Rodriguez MH, de la Cruz Hernandez-Hernandez F, Ramos-Castaneda J, Possani LD, Zurita-Ortega M, Reynaud-Garza E, Hernandez-Rivas R, Loukeris T, Lycett G, et al: Recombinant scorpine: a multifunctional antimicrobial peptide with activity against different pathogens. Cell Mol Life Sci. 2008, 65 (19): 3081-3092. 10.1007/s00018-008-8250-8.
Diego-Garcia E, Abdel-Mottaleb Y, Schwartz EF, de la Vega RC, Tytgat J, Possani LD: Cytolytic and K+ channel blocking activities of beta-KTx and scorpine-like peptides purified from scorpion venoms. Cell Mol Life Sci. 2008, 65 (1): 187-200. 10.1007/s00018-007-7370-x.
Uawonggul N, Thammasirirak S, Chaveerach A, Arkaravichien T, Bunyatratchata W, Ruangjirachuporn W, Jearranaiprepame P, Nakamura T, Matsuda M, Kobayashi M, et al: Purification and characterization of Heteroscorpine-1 (HS-1) toxin from Heterometrus laoticus scorpion venom. Toxicon. 2007, 49 (1): 19-29. 10.1016/j.toxicon.2006.09.003.
Rodriguez de la Vega RC, Possani LD: On the evolution of invertebrate defensins. Trends Genet. 2005, 21 (6): 330-332. 10.1016/j.tig.2005.03.009.
Froy O: Convergent evolution of invertebrate defensins and nematode antibacterial factors. Trends in microbiology. 2005, 13 (7): 314-319. 10.1016/j.tim.2005.05.001.
Valdivia HH, Kirby MS, Lederer WJ, Coronado R: Scorpion toxins targeted against the sarcoplasmic reticulum Ca2+-release channel of skeletal and cardiac muscle. Proceedings of the National Academy of Sciences of the United States of America. 1992, 89 (24): 12185-12189. 10.1073/pnas.89.24.12185.
Tripathy A, Resch W, Xu L, Valdivia HH, Meissner G: Imperatoxin A induces subconductance states in Ca2+ release channels (ryanodine receptors) of cardiac and skeletal muscle. The Journal of general physiology. 1998, 111 (5): 679-690. 10.1085/jgp.111.5.679.
Boisseau S, Mabrouk K, Ram N, Garmy N, Collin V, Tadmouri A, Mikati M, Sabatier JM, Ronjat M, Fantini J, et al: Cell penetration properties of maurocalcine, a natural venom peptide active on the intracellular ryanodine receptor. Biochimica et biophysica acta. 2006, 1758 (3): 308-319. 10.1016/j.bbamem.2006.02.007.
Mosbah A, Kharrat R, Fajloun Z, Renisio JG, Blanc E, Sabatier JM, El Ayeb M, Darbon H: A new fold in the scorpion toxin family, associated with an activity on a ryanodine-sensitive calcium channel. Proteins. 2000, 40 (3): 436-442. 10.1002/1097-0134(20000815)40:3<436::AID-PROT90>3.0.CO;2-9.
Zhu S, Darbon H, Dyason K, Verdonck F, Tytgat J: Evolutionary origin of inhibitor cystine knot peptides. Faseb J. 2003, 17 (12): 1765-1767.
Aroui S, Ram N, Appaix F, Ronjat M, Kenani A, Pirollet F, De Waard M: Maurocalcine as a Non Toxic Drug Carrier Overcomes Doxorubicin Resistance in the Cancer Cell Line MDA-MB 231. Pharmaceutical research. 2009, 26 (4): 836-845. 10.1007/s11095-008-9782-1.
Dai L, Corzo G, Naoki H, Andriantsiferana M, Nakajima T: Purification, structure-function analysis, and molecular characterization of novel linear peptides from scorpion Opisthacanthus madagascariensis. Biochemical and biophysical research communications. 2002, 293 (5): 1514-1522. 10.1016/S0006-291X(02)00423-0.
Dai L, Yasuda A, Naoki H, Corzo G, Andriantsiferana M, Nakajima T: IsCT, a novel cytotoxic linear peptide from scorpion Opisthacanthus madagascariensis. Biochemical and biophysical research communications. 2001, 286 (4): 820-825. 10.1006/bbrc.2001.5472.
Zeng XC, Wang SX, Zhu Y, Zhu SY, Li WX: Identification and functional characterization of novel scorpion venom peptides with no disulfide bridge from Buthus martensii Karsch. Peptides. 2004, 25 (2): 143-150. 10.1016/j.peptides.2003.12.003.
Lee K, Shin SY, Kim K, Lim SS, Hahm KS, Kim Y: Antibiotic activity and structural analysis of the scorpion-derived antimicrobial peptide IsCT and its analogs. Biochemical and biophysical research communications. 2004, 323 (2): 712-719. 10.1016/j.bbrc.2004.08.144.
Zeng XC, Wang SX, Li WX: Identification of BmKAPi, a novel type of scorpion venom peptide with peculiar disulfide bridge pattern from Buthus martensii Karsch. Toxicon. 2002, 40 (12): 1719-1722. 10.1016/S0041-0101(02)00134-4.
Rawlings ND, Tolle DP, Barrett AJ: Evolutionary families of peptidase inhibitors. The Biochemical journal. 2004, 378 (Pt 3): 705-716. 10.1042/BJ20031825.
Chhatwal GS, Habermann E: Neurotoxins, protease inhibitors and histamine releasers in the venom of the Indian red scorpion (Buthus tamulus): isolation and partial characterization. Toxicon. 1981, 19 (6): 807-823. 10.1016/0041-0101(81)90077-5.
Ribeiro JM, Arca B, Lombardo F, Calvo E, Phan VM, Chandra PK, Wikel SK: An annotated catalogue of salivary gland transcripts in the adult female mosquito, Aedes aegypti. BMC genomics. 2007, 8: 6-10.1186/1471-2164-8-6.
Nene V, Wortman JR, Lawson D, Haas B, Kodira C, Tu ZJ, Loftus B, Xi Z, Megy K, Grabherr M, et al: Genome sequence of Aedes aegypti, a major arbovirus vector. Science (New York, NY). 2007, 316 (5832): 1718-1723.
Parkinson NM, Conyers C, Keen J, MacNicoll A, Smith I, Audsley N, Weaver R: Towards a comprehensive view of the primary structure of venom proteins from the parasitoid wasp Pimpla hypochondriaca. Insect biochemistry and molecular biology. 2004, 34 (6): 565-571. 10.1016/j.ibmb.2004.03.003.
Valenzuela JG, Francischetti IM, Pham VM, Garfield MK, Ribeiro JM: Exploring the salivary gland transcriptome and proteome ofthe Anopheles stephensi mosquito. Insect biochemistry and molecular biology. 2003, 33 (7): 717-732. 10.1016/S0965-1748(03)00067-5.
Fogaca AC, Almeida IC, Eberlin MN, Tanaka AS, Bulet P, Daffre S: Ixodidin, a novel antimicrobial peptide from the hemocytes of the cattle tick Boophilus microplus with inhibitory activity against serine proteinases. Peptides. 2006, 27 (4): 667-674. 10.1016/j.peptides.2005.07.013.
Whittington CM, Papenfuss AT, Bansal P, Torres AM, Wong ES, Deakin JE, Graves T, Alsop A, Schatzkamer K, Kremitzki C, et al: Defensins and the convergent evolution of platypus and reptile venom genes. Genome research. 2008, 18 (6): 986-994. 10.1101/gr.7149808.
Bachali S, Jager M, Hassanin A, Schoentgen F, Jolles P, Fiala-Medioni A, Deutsch JS: Phylogenetic analysis of invertebrate lysozymes and the evolution of lysozyme function. Journal of molecular evolution. 2002, 54 (5): 652-664. 10.1007/s00239-001-0061-6.
Matsuura K, Tamura T, Kobayashi N, Yashiro T, Tatsumi S: The antibacterial protein lysozyme identified as the termite egg recognition pheromone. PLoS ONE. 2007, 2 (8): e813-10.1371/journal.pone.0000813.
Ribeiro JM, Alarcon-Chaidez F, Francischetti IM, Mans BJ, Mather TN, Valenzuela JG, Wikel SK: An annotated catalog of salivary gland transcripts from Ixodes scapularis ticks. Insect biochemistry and molecular biology. 2006, 36 (2): 111-129. 10.1016/j.ibmb.2005.11.005.
Zhijian C, Yingliang W, Jiqun S, Wanhong L, Fan X, Xin M, Hui L, Dahe J, Wenxin L: Evidence for the existence of a common ancestor of scorpion toxins affecting ion channels. Journal of biochemical and molecular toxicology. 2003, 17 (4): 235-238. 10.1002/jbt.10083.
Corzo G, Escoubas P, Villegas E, Barnham KJ, He W, Norton RS, Nakajima T: Characterization of unique amphipathic antimicrobial peptides from venom of the scorpion Pandinus imperator. The Biochemical journal. 2001, 359 (Pt 1): 35-45. 10.1042/0264-6021:3590035.
Moerman L, Bosteels S, Noppe W, Willems J, Clynen E, Schoofs L, Thevissen K, Tytgat J, Van Eldere J, Walt Van Der J, et al: Antibacterial and antifungal properties of alpha-helical, cationic peptides in the venom of scorpions from southern Africa. European journal of biochemistry/FEBS. 2002, 269 (19): 4799-4810. 10.1046/j.1432-1033.2002.03177.x.
Zeng XC, Li WX, Peng F, Zhu ZH: Cloning and characterization of a novel cDNA sequence encoding the precursor of a novel venom peptide (BmKbpp) related to a bradykinin-potentiating peptide from Chinese scorpion Buthus martensii Karsch. IUBMB life. 2000, 49 (3): 207-210.
Diego-Garcia E, Batista CV, Garcia-Gomez BI, Lucas S, Candido DM, Gomez-Lagunas F, Possani LD: The Brazilian scorpion Tityus costatus Karsch: genes, peptides and function. Toxicon. 2005, 45 (3): 273-283. 10.1016/j.toxicon.2004.10.014.
Tan NH, Ponnudurai G: Comparative study of the enzymatic, hemorrhagic, procoagulant and anticoagulant activities of some animal venoms. Comparative biochemistry and physiology. 1992, 103 (2): 299-302.
Almeida FM, Pimenta AM, De Figueiredo SG, Santoro MM, Martin-Eauclaire MF, Diniz CR, De Lima ME: Enzymes with gelatinolytic activity can be found in Tityus bahiensis and Tityus serrulatus venoms. Toxicon. 2002, 40 (7): 1041-1045. 10.1016/S0041-0101(02)00084-3.
Gao R, Zhang Y, Gopalakrishnakone P: Purification and N-terminal sequence of a serine proteinase-like protein (BMK-CBP) from the venom of the Chinese scorpion (Buthus martensii Karsch). Toxicon. 2008, 52 (2): 348-353. 10.1016/j.toxicon.2008.06.003.
Gao B, Sherman P, Luo L, Bowie J, Zhu S: Structural and functional characterization of two genetically related meucin peptides highlights evolutionary divergence and convergence in antimicrobial peptides. Faseb J. 2009, 23 (4): 1230-1245. 10.1096/fj.08-122317.
Calvete JJ, Marcinkiewicz C, Sanz L: Snake venomics of Bitis gabonica gabonica. Protein family composition, subunit organization of venom toxins, and characterization of dimeric disintegrins bitisgabonin-1 and bitisgabonin-2. Journal of proteome research. 2007, 6 (1): 326-336. 10.1021/pr060494k.
Sanz L, Escolano J, Ferretti M, Biscoglio MJ, Rivera E, Crescenti EJ, Angulo Y, Lomonte B, Gutierrez JM, Calvete JJ: Snake venomics of the South and Central American Bushmasters. Comparison of the toxin composition of Lachesis muta gathered from proteomic versus transcriptomic analysis. Journal of proteomics. 2008, 71 (1): 46-60. 10.1016/j.jprot.2007.10.004.
Junqueira-de-Azevedo IL, Ching AT, Carvalho E, Faria F, Nishiyama MY, Ho PL, Diniz MR: Lachesis muta (Viperidae) cDNAs reveal diverging pit viper molecules and scaffolds typical of cobra (Elapidae) venoms: implications for snake toxin repertoire evolution. Genetics. 2006, 173 (2): 877-889. 10.1534/genetics.106.056515.
Ma Y, Zhao R, Li S, Fan S, Wu Y, Liu H, Cao Z, Li W: Characterization of LmTxLP11 and LmVP1.1 transcripts and genomic organizations: Alternative splicing contributing to the diversity of scorpion venom peptides. Toxicon. 2009, 53 (1): 129-134. 10.1016/j.toxicon.2008.10.025.
Froy O, Gurevitz M: Arthropod defensins illuminate the divergence of scorpion neurotoxins. J Pept Sci. 2004, 10 (12): 714-718. 10.1002/psc.578.
Zhu S, Gao B, Tytgat J: Phylogenetic distribution, functional epitopes and evolution of the CSalphabeta superfamily. Cell Mol Life Sci. 2005, 62 (19–20): 19-20.
Gopalakrishnakone P, Cheah J, Gwee MC: Black scorpion (Heterometrus longimanus) as a laboratory animal: maintenance of a colony of scorpion for milking of venom for research, using a restraining device. Laboratory animals. 1995, 29 (4): 456-458. 10.1258/002367795780740050.
Ewing B, Hillier L, Wendl MC, Green P: Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome research. 1998, 8 (3): 175-185.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic acids research. 1997, 25 (24): 4876-4882. 10.1093/nar/25.24.4876.
Waterhouse AM, Procter JB, Martin DM, Clamp M, Barton GJ: Jalview Version 2 – a multiple sequence alignment editor and analysis workbench. Bioinformatics (Oxford, England). 2009, 25 (9): 1189-1191. 10.1093/bioinformatics/btp033.
Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Molecular biology and evolution. 2007, 24 (8): 1596-1599. 10.1093/molbev/msm092.
This work was supported by grants from the National Natural Sciences Foundation of China to Li WX, Cao ZJ and Wu YL (Nos. 30530140, 30570045 and 30770519), the Basic Project of Ministry of Science and Technology of China to Li WX (No. 2007FY210800) and the Youth Chenguang Project of Science and Technology of Wuhan City to Cao ZJ (No. 20065004116-06).
YM carried out cDNA library construction, paticipated in the bioinformatics and phylogeny analysis, and drafted the manuscript. RZ participated in the alignment and phylogeny analysis, and drafted the manuscript. YH participated in the sequencing. SL participated in the sequencing. JL participated in the alignment and phylogeny analysis. YW participated in the design and coordination of the study. ZC participated in the design and coordination of the study, and drafted the manuscript. WL conceived of the study, and participated in its coordination. All authors have read and approved the final manuscript.
Yibao Ma, Ruiming Zhao contributed equally to this work.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.