ArachnoServer: a database of protein toxins from spiders

Wood, David LA; Miljenović, Tomas; Cai, Shuzhi; Raven, Robert J; Kaas, Quentin; Escoubas, Pierre; Herzig, Volker; Wilson, David; King, Glenn F

doi:10.1186/1471-2164-10-375

Database
Open access
Published: 13 August 2009

ArachnoServer: a database of protein toxins from spiders

David LA Wood^1,2,
Tomas Miljenović³,
Shuzhi Cai^1,2,
Robert J Raven⁴,
Quentin Kaas³,
Pierre Escoubas⁵,
Volker Herzig³,
David Wilson³ &
…
Glenn F King³

BMC Genomics volume 10, Article number: 375 (2009) Cite this article

8122 Accesses
55 Citations
Metrics details

Abstract

Background

Venomous animals incapacitate their prey using complex venoms that can contain hundreds of unique protein toxins. The realisation that many of these toxins may have pharmaceutical and insecticidal potential due to their remarkable potency and selectivity against target receptors has led to an explosion in the number of new toxins being discovered and characterised. From an evolutionary perspective, spiders are the most successful venomous animals and they maintain by far the largest pool of toxic peptides. However, at present, there are no databases dedicated to spider toxins and hence it is difficult to realise their full potential as drugs, insecticides, and pharmacological probes.

Description

We have developed ArachnoServer, a manually curated database that provides detailed information about proteinaceous toxins from spiders. Key features of ArachnoServer include a new molecular target ontology designed especially for venom toxins, the most up-to-date taxonomic information available, and a powerful advanced search interface. Toxin information can be browsed through dynamic trees, and each toxin has a dedicated page summarising all available information about its sequence, structure, and biological activity. ArachnoServer currently manages 567 protein sequences, 334 nucleic acid sequences, and 51 protein structures.

Conclusion

ArachnoServer provides a single source of high-quality information about proteinaceous spider toxins that will be an invaluable resource for pharmacologists, neuroscientists, toxinologists, medicinal chemists, ion channel scientists, clinicians, and structural biologists. ArachnoServer is available online at http://www.arachnoserver.org.

Background

The realisation that many venomous creatures possess a complex repertoire of protein toxins with pharmacological, pharmaceutical, and agrochemical applications has led to an exponential increase in the number of new toxins being discovered [1]. Several electronic databases have been established to capture information about subsets of these toxins, such as ConoServer [2] and SCORPION2 [3], which provide information about toxins from marine cone snails and scorpions, respectively. Other databases provide information about all groups of animal toxins, such as Tox-Prot [4] and the Animal Toxin Database (ATDB) [5]. These databases allow cross-comparison of toxins from different animal groups but the individual toxin records typically lack the rich information content of manually curated databases.

Spiders are by far the largest group of venomous animals, with more than 40,000 extant species [6], and they maintain the largest repertoire of pharmacologically active peptides [7]. Some spider toxins have been used for almost two decades as pharmacological tools to probe the structure and function of ion channels and cell-surface receptors [8, 9]. Other spider toxins are being developed as bioinsecticides [10–12] or providing leads for the development of drugs to treat a diverse range of pathophysiological states such as inflammatory and neuropathic pain [13], cardiac arrhythmia [14], and erectile dysfunction [15]. Remarkably, despite the incredible chemical diversity of spider venoms, there are no curated databases that deal specifically with proteinaceous toxins from these animals. Here we introduce ArachnoServer, a database designed to provide detailed and easily accessible information on the sequence, three-dimensional structure, and biological activity of spider toxins.

Construction and content

Data sources and integration

A non-redundant set of publicly available spider toxin sequences was sourced from Swiss-Prot [16] and GenBank [17], while experimentally determined protein structures were obtained from the Protein Data Bank [18]. Taxonomic information on spiders (Araneae) was retrieved from the World Spider Catalog [6], which has the advantage of providing more up-to-date taxonomy than NCBI, as well as historic taxonomic information that can be used in database searches. As the World Spider Catalog is under constant revision, care was taken to ensure future compatibility of the ArachnoServer data model with this catalog. Each species in ArachnoServer is assigned a unique life science identifier (LSID) which is linked to the equivalent LSID in an electronic version of the World Spider Catalog. In this way, the ArachnoServer database is automatically linked to taxonomic updates within the World Spider Catalog. High-resolution images of spiders, which are provided without restriction for academic use, were obtained from private collectors and have been used only in cases where both the species and sex could be verified.

A key feature of ArachnoServer is the molecular target ontology. The majority of spider toxins act on ion channels and cell-surface receptors [19], and consequently we developed a new molecular target ontology based on the channel and receptor subtype definitions and nomenclature recommended by the International Union of Basic & Clinical Pharmacology [20]. Additional descriptors were added for toxins that target invertebrate ion channels and receptors, cytolytic toxins, lectins, toxins that target enzymes, and toxins with enzymatic activity. This new ontology allows for the first time the retrieval of spider toxin information based on a precise definition of toxin specificity. For example, this ontology makes it facile to search the database for toxins that target a specific subtype of vertebrate ion channel or receptor. Finally, spider toxins often present diverse posttranslational modifications, and these are described in ArachnoServer using an ontology derived from ConoServer [2].

Data curation

Where appropriate, extra information from the literature was added to each imported toxin record and additional citations were incorporated using the PubMed EFetch service. Year of discovery was carefully curated according to the date on which a manuscript describing the sequence was first submitted for publication, or the date when the toxin sequence was first submitted to GenBank or SWISS-PROT or appeared in the patent literature; this information is often incorrect in the source databases. Toxins were assigned recommended names according to the recently described rational nomenclature for spider toxins [1] and all available literature synonyms were also attached to each toxin record.

ArachnoServer provides the ability to curate information specific to a particular toxin as well as data that are generically applicable. For example, curators can create, edit, or delete symbols and names for posttranslational modifications. Data sets available for curation include lists of posttranslational modifications, biological activities, sequence features (e.g., signal and propeptide sequences), the molecular target ontology, common names and images of spiders, and even the units used to describe LD₅₀, PD₅₀, and IC₅₀ values. The types of experimental evidence for disulfide bonds and toxin pharmacophores (e.g., experimentally determined, by homology, or predicted) can also be managed by the curation team. This inherent flexibility should allow the database to grow in both size and functionality as the knowledge base about spider toxins continues to expand.

Application architecture

ArachnoServer is a Java Spring Model View Controller (MVC) application that uses a Hibernate Object Relational Mapping (ORM) layer to a MySQL database. All database tables are mapped to Plain Old Java Object (POJO) serializable beans. Hibernate manages the writing, updating, and deleting of data from the database tables according to changes in the POJO data, instigated by events triggered from the view and propagated through to the Spring service layer. Using the Spring architecture with ORM mapping, the application and data model can be easily extended or modified, as changes to the data model do not require SQL changes (Hibernate transparently creates all SQL statements using mapped POJOs).

Utility and discussion

Interface and visualisation

ArachnoServer is presented using a graphically designed CSS skin that can be easily swapped for other skins in future releases if required. The front page provides statistics about the total number of toxins available, their taxonomic distribution, and year of discovery (Fig. 1). Individual toxin records can be accessed through basic or advanced searches or via the browse function, all of which are described below.

The spider toxin record (STR) card summarises all information about a toxin in a clean and intuitive interface (Fig. 2). A description of the toxin's biological activity is provided at the top of the page followed by a dynamically generated image that displays the mature toxin sequence along with information about disulfide bond connectivities, pharmacophore residues, and post-translational modifications. Mousing over the image displays additional information, including the position in the sequence and full residue name. The expandable sections below the image were carefully designed to allow biologists working in different disciplines to find the desired information without being distracted by extraneous data. For example, pharmacologists interested in the molecular target of the toxin but not other details can click on the Biological Activity tab in order to find this specific information, whereas structural biologists and medicinal chemists may be more interested in accessing details of the structure via the Toxin Structure section, which allows the structure to be viewed in interactive 3D mode via a Jmol applet http://www.jmol.org. For all toxins, a colour-coded image is displayed in the Protein Information section that provides a quick overview of the signal sequence, propeptide, and mature toxin regions. Other tabs provide information on the spider taxonomy (Taxonomy), links to external databases (Accessions), links to relevant literature (Literature References), sequences in FASTA format (Sequences), and alternative toxin names (Toxin Synonyms).

Basic and advanced searches

ArachnoServer supports basic searches and advanced searches. Basic searches query toxin names (including synonyms), taxonomic information (including historic taxonomy), common names of spiders, and toxin families. Basic searches can be performed from every page using the search bar in the ArachnoServer banner.

A key feature that distinguishes ArachnoServer from other toxin databases is the advanced search feature (Fig. 3). Context-specific fields are dynamically arranged on the page, and search keys populated in drop-down lists of relevant data from the database. Search clauses can be grouped, then joined using boolean operators. Most database fields can be searched, including all basic search fields, as well as additional features such as biological activity, posttranslational modifications, literature references, and counts of various data fields (e.g., the number of disulfide bonds in a toxin). Results from both basic and advanced searches are displayed as paginated tables that can be exported in XML or PDF format. FASTA sequences for toxins from any set of search results can also be exported in plain text format.

The advanced search feature provides a convenient way for scientists to locate records of immediate interest. For example, a medicinal chemist or microbiologist focused on infectious diseases might be particularly interested in toxins that have antimicrobial activity. Using the advanced search feature to search for toxins with antimicrobial activity reveals that the database currently contains 39 such toxins (Fig. 3A). However, this search could be restricted according to many different criteria, such as the year of discovery, activity against specific bacteria or parasites, or the number of disulfide bonds and posttranslational modifications in the toxin. For example, adding a requirement for activity against Escherichia coli limits the search result to 11 toxins (Fig. 3B). The medicinal chemist might be particularly interested in the subset of these toxins for which a 3D structure has been determined. Adding this criterion limits the search result to two toxins (Fig. 3C), for which the 3D structure can then be viewed from the individual spider toxin records.

Browsing

Browsing of the toxin records is also possible through the browse page. Dynamic trees are populated with folders of toxins that match the grouping criteria. Users can browse the spider taxonomic tree and view all toxins for a given taxon. Similarly, the molecular target ontology and posttranslational modifications can be browsed. Fig. 4 shows an example where the molecular ontology browse function has been used to locate all toxins in the database that are known to target invertebrate voltage-gated calcium channels. Note that a complete list of FASTA sequences can be obtained from browse results.

Sequence similarity searches

Similarity searches using BLAST [21] are available through the BLAST page and also from each of the sequence records in the STR page. Both protein and nucleic databases can be searched using the programs blastn, blastp, blastx, tblastn, and tblastx. Hits from toxins in the database are linked directly from the BLAST result page to the STR cards in ArachnoServer, and, where available, back to the original sources in GenBank or Swiss-Prot.

Conclusion

ArachnoServer is a web-based resource that provides comprehensive information about protein toxins from spider venoms. The combination of specific classification schemes and a rich user interface allows researchers to easily locate and view information on the sequence, structure, and biological activity of these toxins. This manually curated database will be a valuable resource for both basic researchers as well as those interested in potential pharmaceutical and agricultural applications of spider toxins.

Availability and requirements

ArachnoServer is freely available at http://www.arachnoserver.org. ArachnoServer is platform independent and supports most browsers including Firefox (Version 3 and higher), Safari (Ver. 3.2 and higher), Google Chrome (Ver. 1 and higher), Internet Explorer (Ver. 7 and higher), and Opera (Ver. 9.6 and higher). A java plug-in is required in order to view 3D protein structures.

References

King GF, Gentz MC, Escoubas P, Nicholson GM: A rational nomenclature for naming peptide toxins from spiders and other venomous animals. Toxicon. 2008, 52: 264-276. 10.1016/j.toxicon.2008.05.020.
Article CAS PubMed Google Scholar
Kaas Q, Westermann JC, Halai R, Wang CK, Craik DJ: ConoServer, a database for conopeptide sequences and structures. Bioinformatics. 2008, 24: 445-446. 10.1093/bioinformatics/btm596.
Article CAS PubMed Google Scholar
Tan PT, Veeramani A, Srinivasan KN, Ranganathan S, Brusic V: SCORPION2: a database for structure-function analysis of scorpion toxins. Toxicon. 2006, 47: 356-363. 10.1016/j.toxicon.2005.12.001.
Article CAS PubMed Google Scholar
Jungo F, Bairoch A: Tox-Prot, the toxin protein annotation program of the Swiss-Prot protein knowledgebase. Toxicon. 2005, 45: 293-301. 10.1016/j.toxicon.2004.10.018.
Article CAS PubMed Google Scholar
He QY, He QZ, Deng XC, Yao L, Meng E, Liu ZH, Liang SP: ATDB: a uni-database platform for animal toxins. Nucleic Acids Res. 2008, 36: D293-D297. 10.1093/nar/gkm832.
Article PubMed Central CAS PubMed Google Scholar
Platnick NI: Advances in spider taxonomy, 1992–1995: with redescriptions 1940–1980. 1997, New York: New York Entomological Society & The American Museum of Natural History, [http://research.amnh.org/entomology/spiders/catalog/]
Google Scholar
Escoubas P, Sollod BL, King GF: Venom landscapes: mining the complexity of spider venoms via a combined cDNA and mass spectrometric approach. Toxicon. 2006, 47: 650-663. 10.1016/j.toxicon.2006.01.018.
Article CAS PubMed Google Scholar
Adams ME: Agatoxins: ion channel specific toxins from the American funnel web spider, Agelenopsis aperta. Toxicon. 2004, 43: 509-525. 10.1016/j.toxicon.2004.02.004.
Article CAS PubMed Google Scholar
Ushkaryov YA, Volynski KE, Ashton AC: The multiple actions of black widow spider toxins and their selective use in neurosecretion studies. Toxicon. 2004, 43: 527-542. 10.1016/j.toxicon.2004.02.008.
Article CAS PubMed Google Scholar
Tedford HW, Sollod BL, Maggio F, King GF: Australian funnel-web spiders: master insecticide chemists. Toxicon. 2004, 43: 601-618. 10.1016/j.toxicon.2004.02.010.
Article CAS PubMed Google Scholar
Fitches E, Edwards MG, Mee C, Grishin E, Gatehouse AM, Edwards JP, Gatehouse JA: Fusion proteins containing insect-specific toxins as pest control agents: snowdrop lectin delivers fused insecticidal spider venom toxin to insect haemolymph following oral ingestion. J Insect Physiol. 2004, 50: 61-71. 10.1016/j.jinsphys.2003.09.010.
Article CAS PubMed Google Scholar
Khan SA, Zafar Y, Briddon RW, Malik KA, Mukhtar Z: Spider venom toxin protects plants from insect attack. Transgenic Res. 2006, 15: 349-357. 10.1007/s11248-006-0007-2.
Article CAS PubMed Google Scholar
Mazzuca M, Heurteaux C, Alloui A, Diochot S, Baron A, Voilley N, Blondeau N, Escoubas P, Gelot A, Cupo A, et al: A tarantula peptide against pain via ASIC1a channels and opioid mechanisms. Nat Neurosci. 2007, 10: 943-945. 10.1038/nn1940.
Article CAS PubMed Google Scholar
Bode F, Sachs F, Franz MR: Tarantula peptide inhibits atrial fibrillation. Nature. 2001, 409: 35-36. 10.1038/35051165.
Article CAS PubMed Google Scholar
Nunes KP, Costa-Gonçalves A, Lanza LF, Cortes SF, Cordeiro MN, Richardson M, Pimenta AM, Webb RC, Leite R, De Lima ME: Tx2-6 toxin of the Phoneutria nigriventer spider potentiates rat erectile function. Toxicon. 2008, 51: 1197-1206. 10.1016/j.toxicon.2008.02.010.
Article PubMed Central CAS PubMed Google Scholar
Boeckmann B, Bairoch A, Apweiler R, Blatter MC, Estreicher A, Gasteiger E, Martin MJ, Michoud K, O'Donovan C, Phan I, et al: The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003, 31: 365-370. 10.1093/nar/gkg095.
Article PubMed Central CAS PubMed Google Scholar
Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL: GenBank. Nucleic Acids Res. 2008, 36: D25-D30. 10.1093/nar/gkm929.
Article PubMed Central CAS PubMed Google Scholar
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res. 2000, 28: 235-242. 10.1093/nar/28.1.235.
Article PubMed Central CAS PubMed Google Scholar
Sollod BL, Wilson D, Zhaxybayeva O, Gogarten JP, Drinkwater R, King GF: Were arachnids the first to use combinatorial peptide libraries?. Peptides. 2005, 26: 131-139. 10.1016/j.peptides.2004.07.016.
Article CAS PubMed Google Scholar
Alexander SPH, Mathie A, Peters JA: Guide to receptors and channels (GRAC), (2008). Br J Pharmacol. 2008, 153 (Suppl 2): S1-S209. 10.1038/sj.bjp.0707746. 3
Article PubMed Central PubMed Google Scholar
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.
Article PubMed Central CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by Discovery Grant DP0774245 from the Australian Research Council to G.F.K. and a Queensland State Government National and International Alliances Program Grant to QFAB. We thank Nick Peall for design of the ArachnoServer skin, Dominique Gorse for project assistance, and Bastian Rast for high-resolution spider images.

Author information

Authors and Affiliations

Queensland Facility for Advanced Bioinformatics, The University of Queensland, St Lucia, QLD, 4072, Australia
David LA Wood & Shuzhi Cai
ARC Centre of Excellence in Bioinformatics, The University of Queensland, St Lucia, QLD, 4072, Australia
David LA Wood & Shuzhi Cai
Institute for Molecular Bioscience, The University of Queensland, St Lucia, QLD, 4072, Australia
Tomas Miljenović, Quentin Kaas, Volker Herzig, David Wilson & Glenn F King
Queensland Museum, Brisbane, QLD, 4101, Australia
Robert J Raven
Institut de Pharmacologie Moléculaire et Cellulaire, CNRS, 06560, Valbonne, France
Pierre Escoubas

Authors

David LA Wood
View author publications
You can also search for this author in PubMed Google Scholar
Tomas Miljenović
View author publications
You can also search for this author in PubMed Google Scholar
Shuzhi Cai
View author publications
You can also search for this author in PubMed Google Scholar
Robert J Raven
View author publications
You can also search for this author in PubMed Google Scholar
Quentin Kaas
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Escoubas
View author publications
You can also search for this author in PubMed Google Scholar
Volker Herzig
View author publications
You can also search for this author in PubMed Google Scholar
David Wilson
View author publications
You can also search for this author in PubMed Google Scholar
Glenn F King
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to David LA Wood or Glenn F King.

Additional information

Authors' contributions

GFK and DLAW devised the ArachnoServer concept and wrote the manuscript. All authors were involved in formulating specifications, application design, and testing. DLAW was responsible for data integration, database design, and application implementation with assistance from TM and SC. SC also worked on application architecture. GFK and QK developed the molecular target and posttranslational modification ontologies, respectively. RR was responsible for integration of taxonomy from the World Spider Catalog. DW, VH, and GFK were responsible for database curation. PE was primary beta tester. All authors read the final manuscript

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Wood, D.L., Miljenović, T., Cai, S. et al. ArachnoServer: a database of protein toxins from spiders. BMC Genomics 10, 375 (2009). https://doi.org/10.1186/1471-2164-10-375

Download citation

Received: 13 March 2009
Accepted: 13 August 2009
Published: 13 August 2009
DOI: https://doi.org/10.1186/1471-2164-10-375

ArachnoServer: a database of protein toxins from spiders