NemaPath: online exploration of KEGG-based metabolic pathways for nematodes
© Wylie et al; licensee BioMed Central Ltd. 2008
Received: 30 June 2008
Accepted: 04 November 2008
Published: 04 November 2008
Nematode.net http://www.nematode.net is a web-accessible resource for investigating gene sequences from parasitic and free-living nematode genomes. Beyond the well-characterized model nematode C. elegans, over 500,000 expressed sequence tags (ESTs) and nearly 600,000 genome survey sequences (GSSs) have been generated from 36 nematode species as part of the Parasitic Nematode Genomics Program undertaken by the Genome Center at Washington University School of Medicine. However, these sequencing data are not present in most publicly available protein databases, which only include sequences in Swiss-Prot. Swiss-Prot, in turn, relies on GenBank/Embl/DDJP for predicted proteins from complete genomes or full-length proteins.
Here we present the NemaPath pathway server, a web-based pathway-level visualization tool for navigating putative metabolic pathways for over 30 nematode species, including 27 parasites. The NemaPath approach consists of two parts: 1) a backend tool to align and evaluate nematode genomic sequences (curated EST contigs) against the annotated Kyoto Encyclopedia of Genes and Genomes (KEGG) protein database; 2) a web viewing application that displays annotated KEGG pathway maps based on desired confidence levels of primary sequence similarity as defined by a user. NemaPath also provides cross-referenced access to nematode genome information provided by other tools available on Nematode.net, including: detailed NemaGene EST cluster information; putative translations; GBrowse EST cluster views; links from nematode data to external databases for corresponding synonymous C. elegans counterparts, subject matches in KEGG's gene database, and also KEGG Ontology (KO) identification.
The NemaPath server hosts metabolic pathway mappings for 30 nematode species and is available on the World Wide Web at http://nematode.net/cgi-bin/keggview.cgi. The nematode source sequences used for the metabolic pathway mappings are available via FTP http://www.nematode.net/FTP/index.php, as provided by the Genome Center at Washington University School of Medicine.
Phylum Nematoda and Nematode Genomics
Nematodes or roundworms are one of the most common phyla of animals, with over 20,000 different described species , ubiquitous in freshwater, marine, and terrestrial environments. They have remarkable life-styles both in free-living and parasitic variants, having the ability to adapt to challenging environments or to invade multiple hosts, respectively. Parasitic nematodes of humans cause sub-clinical and clinical diseases of major socio-economic importance globally as ~3 billion people are infected . Financial losses caused by parasites to agriculture (domesticated animals and crops) have a major impact on farm profitability, exacerbating global food shortage situations (e.g., plant parasitic nematodes are responsible for $80 billion in annual crop damage ). Nematodes have been studied extensively due to their agricultural and medical importance; nematode sequencing data have increased at a rapid rate over the past decade. As of the beginning of 2008, there are over 500,000 Expressed Sequence Tags (ESTs) in the dbEST division of GenBank originating from over 40 non-Caenorhabditis nematode species , and 32 genomic projects are completed or underway .
We expect that complete annotated genomes are approximately 3–4 years away. Therefore, to empower the broader scientific community the available EST and GSS sequences from parasitic nematodes require organization in a functional context, a need underscored by their absent from the majority of publicly available protein databases such as Pfam  and Kyoto Encyclopedia of Genes and Genomes (KEGG) . These protein databases incorporate only sequences found in Swiss-Prot , and Swiss-Prot in turn relies on GenBank/EMBL/DDJP for predicted proteins from full genomes or full-length proteins. Hence, nematode-originated EST and GSS sequence data can only be informative when organized and presented in an easily accessible, systematic way that can be explored by the scientific community.
Currently, there are four comprehensive web-based databases available providing tools for exploring nematode sequences: two model-species-specific databases and two others encompassing parasitic and free-living nematode sequences. The model-specific databases include WormBase , a model organism database for Caenorhabditis elegans and other related Caenorhabditis species, and Pristionchus.org , a resource dedicated to the major satellite organism Pristionchus pacificus used in studying evolutionary developmental biology. NEMBASE3  and Nematode.net  are databases concerned with the aggregation and navigation of sequencing data derived from multiple free-living and parasitic nematode species.
Nematode Pathway Visualization and Comparison
Here we present NemaPath (made available as a component of Nematode.net) that allows for systematic study of nematode transcriptomes via enzyme pathway associations. NemaPath compares relatively refined nucleotide sequences (e.g., full-length cDNAs, clustered ESTs, RefSeq) to the KEGG genes database of curated protein sequences. Sequences in the KEGG database have known, annotated Enzyme Commission (EC)  system number associations. By aligning query sequence against annotated sequence we may assign putative function by EC number association. The software discards KEGG database entries that do not have associated EC numbers, and therefore only displays metabolic pathways. As our software makes no assumptions based on prior biological or biochemical knowledge, the end-user is able to navigate the full set of returned alignment summaries (strength of hit as E-value and bit score, subject accession numbers, etc.) from initial exploratory query sequences.
To eliminate data redundancy, ESTs have been assembled by identity into NemaGene contigs and further organized into clusters. ESTs within a contig derive from nearly identical transcripts, whereas contigs within a cluster may represent splice isoforms of a gene or transcripts from multi-gene families with extremely high sequence identity . NemaPath associations are done on the contig level using NemaGene contig builds as querying sequences or full-length genes resulting from genome projects (e.g., C. elegans, Brugia malayi, Ancylostoma caninum and Ascaris suum). Cluster sequence reports may be viewed online by reverse lookup using constituent contig IDs from NemaPath's pathway map hit table or by viewing the cluster in Nematode.net's implementation of the GBrowse  genome viewer.
Finally, summaries of mappings are provided as extendable tree-views, which organize mappings by identified EC numbers. The summary of pathway mappings when coupled with statistical tools  can provide the scientific community with a solid platform for comparative metabolomics in the Phylum Nematoda.
Although other excellent KEGG pathway mapping software platforms exist for EST-related data – such as ESTExplorer  and PathwayExplorer  – the main aim of the NemaPath database is to provide pre-compiled pan-phylum comparative metabolomics for an important and oft-studied group of parasites, as well as cross-integration with other nematode information (e.g., WormBase). As such, new sequence data are added as they become publicly available without effort at the end-user level. Many of the nematode-centric KEGG views we provide are unique unto NemaPath.
Construction and Content
Alignment Association to KEGG
The initial step in the NemaPath pipeline involves analyzing clustered ESTs in context of the latest version of KEGG's high-quality genomes including manual assignment of orthologies. NemaGene EST contigs [, Materials and Methods] are a value-added effort, refinements include: 1) grouping of the ESTs into contigs based on sequence similarity; 2) elimination of chimeric ESTs; 3) accommodation of multiple splice-isoforms; 4) persistence of cluster names when new ESTs are added. For each contig with multiple EST members, the consensus sequence is longer and of higher quality than each stand-alone EST read. The KEGG genes file (used as a database against which we BLAST our queries) is comprised of concatenated FASTA protein sequences from all of the species in the KEGG genes database. As of this writing, the KEGG genes file can be retrieved for academic use by download via FTP from KEGG ftp://ftp.genome.jp/pub/kegg/genes/fasta. Each FASTA entry contains metadata in the header line: species gene name, protein matched, EC id, and KEGG Orthology (KO) information. As the metadata represents information arrived at by KEGG's annotation process, we can make inferred relationships by identity using our nematode sequences.
WU-BLAST  alignments are performed in an automated fashion by a perl application called KEGGscan. KEGGscan reports compile BLAST results in a tab-delimited format, including E-value and bit score for the alignment, as well as the metadata information pulled from the KEGG subject's header line. KEGGscan reports are loaded into the NemaPath database during every NemaPath build; plaintext report file for each species/build can be accessed through our web site http://www.nematode.net/FTP/nemapath_ftp/index.php.
KEGGscan report information is cross-referenced with corresponding KEGG pathway image map information in the NemaPath database. EC ids have representative nodes within KEGG-supplied pathway bitmap images. Associations between NemaGene contig sequence and KEGG genes are pre-compiled on a per build basis for each NemaPath release. However, a user has the ability to narrow or broaden the pool of viewable matches by supplying a threshold for E-value, which is a statistical value of the quality of the BLAST alignment. To aid the user in this endeavor, NemaPath supplies an E-value hit distribution graph for each nematode species.
Other multi-species views include clade-specific and host-specific aggregations that incorporate only the best matches (by lowest E-value better than 1e-05) to each gene belonging to a clade or host category. Sequence data are currently available for four (I, III, IV and V) of the five nematode phylogenetic clades. All five clades include parasitic species and parasitism is hypothesized to have arisen independently multiple times . Comparative analyses between these clades may provide valuable information; hence, we provide an interface for viewing clade-specific NemaPath annotations. In addition to clade-based comparisons, we provide a host-specific comparative view that differentiates animal parasites, plant parasites and free-living nematodes.
Utility and Discussion
NemaPath Research Application for Nematology
With algorithm improvements and ever-faster expansion of biological sequence databases, sequence comparison has become a basic but critical tool in the post-genomic era. Putative functions of newly obtained sequencing data can be easily inferred by similarity search to the currently characterized proteins when those sequences contain similarities significant enough to be detected on the primary sequence level. However, while individual mappings are frequently used for detailed studies, organized hierarchical annotations of genes provide an understanding of biological systems as a whole – namely how individual genes interact as parts of complexes, pathways, and networks. As a result, nematologists have used the KEGG interaction and reaction network associations for intra- and inter-specific comparative studies and pan-phylum comparative studies. Such in silico comparative metabolomics have identified metabolic features that are taxonomically restricted and/or enriched in specific stages or tissues. For example, a pan-phylum analysis based on 93,000 genes of partial genomes in 32 nematode species when compared to the KEGG database identified taxonomically restricted biochemical pathways that may serve to direct drug target definition . Comparative metabolic pathway analysis in the human intestinal nematode parasite Strongyloides stercoralis when compared to the metabolic pathways of the free-living nematode C. elegans, revealed down regulation of nucleotide sugar metabolism in infective L3 and dauer stage which is consistent with the lack of new cell division and DNA replication in these developmentally arrested stages . Dissimilar expression profiles of genes for metabolic enzymes of Heterodera glycines infective J2 and C. elegans have been reported  based on in-depth analysis of ESTs and microarray expression data. Finally, comparison of relative coverage of metabolic enzymes of the adult heartworm Dirofilaria immitis compared to C. elegans, have supported a hypothesis that the adult heartworm D. immitis takes advantages of a anaerobic electron transfer-based energy generation system distinct from the aerobic pathway utilized by its mammalian hosts , an observation leading to a promising candidate pathway for development of new macrofilaricides. A hypergeometric statistic test on the extent of KEGG Orthology (KO) groups in the first tissue-level comparative study of nematode intestines has revealed that the major pathways of carbohydrate metabolism and energy metabolism are two commonly over-represented metabolic features in intestines of gastrointestinal parasitic nematodes Ascaris suum, Haemonchus contortus, and the free-living C. elegans .
As a step towards a full-scale genome project of Ancylostoma caninum – a hookworm of canids used as a model for studying human infections – 104,000 GSSs were generated and subsequently assembled into 57.6 Mb of unique sequence, resulting in gene identification of 9113 non-redundant genes (5538 based on GSSs and 3575 based on ESTs); functional classifications of many of the 70% of genes with homology to genes in other species were possible based on gene ontology and KEGG placement .
Validations and Limitations
Every effort has been made to improve the quality and fidelity of EST sequences by NemaGene clustering. However, final cluster products are putative, partial gene representations built on the most current information available. Enzyme Commission numbers are assigned to NemaGene contigs in the NemaPath database by primary sequence similarity. As such, associations include the caveats pertaining to any automated, high throughput alignment analysis.
To assess the accuracy of our methods, the NemaPath associations of the complete full-length gene set for C. elegans was compared to the metabolic pathways in C. elegans in the KEGG database. This validation screen was performed using two cutoffs (1e-30 and 1e-100) and identified a number of unique EC ids by the original KEGG associations and also our NemaPath associations. Using 1e-30 and 1e-100 as a cutoff, we identified that 1e-100 gave a number of unique EC mappings closer to the original KEGG associations. Only in 4 out of 94 pathways were the number of EC ids identified by NemaPath were lower than in KEGG's C. elegans reference pathway. The revealed differences, consistently higher EC ids identified by NemaPath, are mainly based on the manual curation of the KEGG mappings compared to our automated associations guided by cutoff value; our associations do not account for the ancillary information included in KEGG Orthology numbers (KO) generation.
KO numbers incorporate ancillary information (i.e., manual curation) that is not represented in the EC number annotation. KO is a further extension of this scheme (similarity-based automatic EC id association) based on computational analysis – as well as manual curation – of SSDB ortholog clusters in order to classify all gene functions and explore unknown pathways . NemaPath does provide direct representation of KO assignment, but does not exclude EC associations in specific pathways based on a priori KO knowledge. By design, an association of a user's query sequence to a particular EC identifier will be highlighted in every pathway where the EC id exists, rather than exclude information from the end-user or enforce presumptive omissions. Partial EC numbers (e.g., 1.1.-.-) are not excluded; care must be taken in their interpretation. The ambiguous nature of the partial EC number allows different enzymes that catalyze different reactions to share the same identifier within the same class, even though this does not necessarily mean they have the same activities .
Because we do not provide exhaustive curation, proper interpretation requires the user's cognizance of the metabolic pathways, as well as an understanding of KEGG annotation and vocabulary.
All species-to-KEGG associations in NemaPath are re-compiled each time KEGG releases a new version of their genes database (as of this writing the associations are made using KEGG release 46). New nematode species are added post NemaGene clustering. Genome sequencing projects of several free-living and parasitic nematode species are underway or planned , and the associations to KEGG metabolic pathways will be included as the data become available. Furthermore, future builds of NemaPath will not be limited to metabolic pathways, nematode sequence associations will include other manually drawn pathway maps representing molecular interactions and reaction networks. Also, further development will allow multi-species comparison beyond the current two-species similarity view supported by NemaPath.
NemaPath, the database described herein, provides the research community a unique resource for pathway visualization of multi-species nematode genomic data in terms of KEGG database vocabulary. NemaPath is part of the larger Nematode.net web resource and integrates well with the site's previous functionality, streamlining access to significant internal nematode sequence data, as well as information provided by off-site resources at NEMBASE3  and WormBase  repositories.
Availability and Requirements
For accessing the Nematode.net web site, direct your web browser to http://www.nematode.net on the World Wide Web. Direct access to the NemaPath pathway browser – as well as species-specific tree-views – can be found at http://nematode.net/cgi-bin/keggview.cgi URL. Plaintext file versions of alignment information (as compiled by KEGGscan) are also available by species http://www.nematode.net/FTP/nemapath_ftp/index.php. The most current NemaGene EST cluster builds are available via FTP at http://nematode.net/FTP/cluster_ftp/index.php after completing a short access form. Please contact the authors concerning access to ancillary information associated with this resource.
We would like to thank all of the individuals at Kanehisa Laboratory of Kyoto University Bioinformatics Center for their work on the KEGG project, as well as Pathway Solutions Inc. for granting academic access to the software and data associated with the KEGG database. Jarret Glasscock, Mike Dante, Billy Li, and Rick Meyer of the Genome Center at Washington University School of Medicine provided additional project feedback. The NemaPath project is supported by US National Institute for Allergy and Infectious Disease grant AI46593.
- Lorenzen S: The Phylogenetic Systematics of Free-Living Nematodes. The Ray Society, London. 1994Google Scholar
- Bethony J, Brooker S, Albonico M, Geiger SM, Loukas A, Diemert D, Hotez PJ: Soil-transmitted helminth infections: ascariasis, trichuriasis, and hookworm. The Lancet. 2006, 367: 1521-1532. 10.1016/S0140-6736(06)68653-4.View ArticleGoogle Scholar
- Barker KR, Hussey RS, Krusberg LR, Bird GW, Dunn RA, Ferris VR, Freckman DW, Gabriel CJ, Grewal PS, Macguidwin AE, Riddle DL, Roberts PA, Schmitt DP: Plant and soil nematodes-societal impact and focus for the future. Journal of Nematology. 1994, 26: 127-137.PubMedGoogle Scholar
- NCBI: Complete Eukaryotic Taxonomy. [http://www.ncbi.nlm.nih.gov/genomes/genlist.cgi?taxid=2759&type=0&name=Complete%20Eukaryota]
- Mitreva , Jasmer : WormBook. 2006, 23: 1-21.Google Scholar
- Bateman Alex, Coin Lachlan, Durbin Richard, Finn Robert, Hollich Volker, Griffiths-Jones Sam, Khanna Ajay, Marshall Mhairi, Moxon Simon, Sonnhammer Erik, Studholme David, Yeats Corin, Eddy Sean: The Pfam protein families database. Nucleic Acids Res. 2004, D138-41. 10.1093/nar/gkh121. 32 DatabaseGoogle Scholar
- Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2006, 34: D354-357. 10.1093/nar/gkj102.PubMedView ArticleGoogle Scholar
- ExPASy – UniProt Knowledgebase: Swiss-Prot and TrEMBL. [http://www.expasy.org/sprot/]
- Harris TW, Lee R, Schwarz E, Bradnam K, Lawson D, Chen W, Blasier D, Kenny E, Cunningham F, Kishore R, Chan J, Muller HM, Petcherski A, Thorisson G, Day A, Bieri T, Rogers A, Chen CK, Spieth J, Sternberg P, Durbin R, Stein LD: WormBase: a cross-species database for comparative genomics. Nucleic Acids Res. 2003, 31 (1): 133-7. 10.1093/nar/gkg053.PubMedView ArticleGoogle Scholar
- Dieterich C, Roeseler W, Sobetzko P, Sommer RJ: Pristionchus.org: a genome-centric database of the nematode satellite species Pristionchus pacificus. Nucleic Acids Res. 2007, D498-502. 10.1093/nar/gkl804. Epub 2006 Oct 24, 35 DatabaseGoogle Scholar
- Parkinson J, Whitton C, Schmid R, Thomson M, Blaxter M: NEMBASE: a resource for parasitic nematode ESTs. Nucleic Acids Res. 2004, D427-30. 10.1093/nar/gkh018. 32 DatabaseGoogle Scholar
- Wylie T, Martin JC, Dante M, Mitreva MD, Clifton SW, Chinwalla A, Waterston RH, Wilson RK, McCarter JP: Nematode.net: a tool for navigating sequences from parasitic and free-living nematodes. Nucleic Acids Res. 2004, D423-6. 10.1093/nar/gkh010. 32 DatabaseGoogle Scholar
- Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB). [http://www.chem.qmul.ac.uk/iubmb/enzyme/]
- Mitreva M, McCarter JP, Martin J, Dante M, Wylie T, Chiapelli B, Pape D, Clifton SW, Nutman TB, Waterston RH: Comparative genomics of gene expression in the parasitic and free-living nematodes Strongyloides stercoralis and Caenorhabditis elegans. Genome Res. 2004, 14 (2): 209-20. 10.1101/gr.1524804.PubMedView ArticleGoogle Scholar
- Stein LD, Mungall C, Shu S, Caudy M, Mangone M, Day A, Nickerson E, Stajich JE, Harris TW, Arva A, Lewis S: The generic genome browser: a building block for a model organism system database. Genome Res. 2002, 12 (10): 1599-610. 10.1101/gr.403602.PubMedView ArticleGoogle Scholar
- Wu J, Mao X, Cai T, Luo J, Wei L: KOBAS server: a web-based platform for automated annotation and pathway identification. Nucleic Acids Res. 2006, W720-4. 10.1093/nar/gkl167. 34 Web ServerGoogle Scholar
- Nagaraj SH, Deshpande N, Gasser RB, Ranganathan S: ESTExplorer: an expressed sequence tag (EST) assembly and annotation platform. Nucleic Acids Res. 2007, W143-7. 10.1093/nar/gkm378. Epub 2007 Jun 1, 35 Web ServerGoogle Scholar
- Mlecnik B, Scheideler M, Hackl H, Hartler J, Sanchez-Cabo F, Trajanoski Z: PathwayExplorer: web service for visualizing high-throughput expression data on biological pathways. Nucleic Acids Res. W633-7. 2005 Jul 1, 33 Web ServerGoogle Scholar
- Perl Programming Language. [http://www.perl.org]
- MySQL Database Official Site. [http://www.mysql.com]
- McCarter JP, Mitreva MD, Martin J, Dante M, Wylie T, Rao U, Pape D, Bowers Y, Theising B, Murphy CV, Kloek AP, Chiapelli BJ, Clifton SW, Bird DM, Waterston RH: Analysis and functional classification of transcripts from the nematode Meloidogyne incognita. Genome Biol. 2003, 4 (4): R26-10.1186/gb-2003-4-4-r26. Epub 2003 Mar 31PubMedView ArticleGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-10.PubMedView ArticleGoogle Scholar
- Hannon GJ, Rossi JJ: Unlocking the potential of the human genome with RNA interference. Nature. 2004, 431: 371-378. 10.1038/nature02870.PubMedView ArticleGoogle Scholar
- Fire A, Xu S, Montgomery MK, Kostas SA, Driver SE, Mello CC: Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature. 1998, 391 (6669): 806-811. 10.1038/35888.PubMedView ArticleGoogle Scholar
- Kamath RS, Fraser AG, Dong Y, Poulin G, Durbin R, Gotta M, Kanapin A, Le Bot N, Moreno S, Sohrmann M: Systematic functional analysis of the Caenorhabditis elegans genome using RNAi. Nature. 2003, 421 (6920): 231-237. 10.1038/nature01278.PubMedView ArticleGoogle Scholar
- Rual JF, Ceron J, Koreth J, Hao T, Nicot AS, Hirozane-Kishikawa T, Vandenhaute J, Orkin SH, Hill DE, Heuvel van den S: Toward improving Caenorhabditis elegans phenome mapping with an ORFeome-based RNAi library. Genome Res. 2004, 14 (10B): 2162-2168. 10.1101/gr.2505604.PubMedView ArticleGoogle Scholar
- Sonnichsen B, Koski LB, Walsh A, Marschall P, Neumann B, Brehm M, Alleaume AM, Artelt J, Bettencourt P, Cassin E: Full-genome RNAi profiling of early embryogenesis in Caenorhabditis elegans. Nature. 2005, 434 (7032): 462-469. 10.1038/nature03353.PubMedView ArticleGoogle Scholar
- Mitreva M, Blaxter ML, Bird DM, McCarter JP: Comparative genomics of nematodes. Trends Genet. 2005, 21 (10): 573-581. 10.1016/j.tig.2005.08.003.PubMedView ArticleGoogle Scholar
- Fatumo S, Plaimas K, Mallm JP, Schramm G, Adebiyi E, Oswald M, Eils R, Konig R: Estimating novel potential drug targets of Plasmodium falciparum by analysing the metabolic network of knock-out strains in silico. Infect Genet Evol. 2008Google Scholar
- Yeh I, Hanekamp T, Tsoka S, Karp PD, Altman RB: Computational analysis of Plasmodium falciparum metabolism: organizing genomic information to facilitate drug discovery. Genome Res. 2004, 14 (5): 917-924. 10.1101/gr.2050304.PubMedView ArticleGoogle Scholar
- Blaxter ML, De Ley P, Garey JR, Liu LX, Scheldeman P, Vierstraete A, Vanfleteren JR, Mackey LY, Dorris M, Frisse LM, Vida JT, Thomas WK: A molecular evolutionary framework for the phylum Nematoda. Nature. 1998, 71-75. 10.1038/32160.Google Scholar
- Parkinson J, Mitreva M, Whitton C, Thomson M, Daub J, Martin J, Schmid R, Hall N, Barrell B, Waterston RH, McCarter JP, Blaxter ML: A transcriptomic analysis of the phylum Nematoda. Nat Genet. 2004, 36 (12): 1259-67. 10.1038/ng1472. Epub 2004 Nov 14PubMedView ArticleGoogle Scholar
- Elling A, Mitreva M, Recknor J, Gai X, Martin J, Maier T, McDermott J, Hewezi T, McK Bird D, Davis E, Hussey R, Nettleton D, McCarter J, Baum T: Divergent evolution of arrested development in the dauer stage of Caenorhabditis elegans and the infective stage of Heterodera glycines. Genome Biology. 8: R211-10.1186/gb-2007-8-10-r211.Google Scholar
- Yin Y, Martin J, McCarter JP, Clifton SW, Wilson RK, Mitreva M: Identification and analysis of genes expressed in the adult filarial parasitic nematode Dirofilaria immitis. Int J Parasitol. 2006, 36 (7): 829-39. 10.1016/j.ijpara.2006.03.002. Epub 2006 Mar 31PubMedView ArticleGoogle Scholar
- Yin Y, Martin J, Abubucker S, Scott AL, McCarter JP, Wilson RK, Jasmer DP, Mitreva M: Intestinal Transcriptomes of Nematodes: Comparison of the Parasites Ascaris suum and Haemonchus contortus with the Free-living Caenorhabditis elegans. PLoS Negl Trop Dis. 2008,Google Scholar
- Sahar Abubucker, John Martin, Yong Yin, Lucinda Fulton, Shiaw-Pyng Yang, Kym Hallsworth-Pepin, Spencer Johnston J, John Hawdon, McCarter James, Wilson Richard, Makedonka Mitreva: The canine hookworm genome: Analysis and classification of Ancylostoma caninum survey sequences. 2007, doi:10.1016/j.molbiopara.2007.11.001Google Scholar
- Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M: The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004, D277-80. 10.1093/nar/gkh063. 32 DatabaseGoogle Scholar
- Green ML, Karp PD: Genome annotation errors in pathway databases due to semantic ambiguity in partial EC numbers. Nucleic Acids Res. 2005, 33 (13): 4035-9. 10.1093/nar/gki711.PubMedView ArticleGoogle Scholar
- Nematode taxa for which a genome sequencing project is underway. [http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=wormbook.table.13479]
- NEMBASE3. [http://www.nematodes.org/nembase3/]
- WormBase. [http://www.wormbase.org]
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.