DroID: the Drosophila Interactions Database, a comprehensive resource for annotated gene and protein interactions
© Yu et al; licensee BioMed Central Ltd. 2008
Received: 13 May 2008
Accepted: 07 October 2008
Published: 07 October 2008
Charting the interactions among genes and among their protein products is essential for understanding biological systems. A flood of interaction data is emerging from high throughput technologies, computational approaches, and literature mining methods. Quick and efficient access to this data has become a critical issue for biologists. Several excellent multi-organism databases for gene and protein interactions are available, yet most of these have understandable difficulty maintaining comprehensive information for any one organism. No single database, for example, includes all available interactions, integrated gene expression data, and comprehensive and searchable gene information for the important model organism, Drosophila melanogaster.
DroID, the Drosophila Interactions Database, is a comprehensive interactions database designed specifically for Drosophila. DroID houses published physical protein interactions, genetic interactions, and computationally predicted interactions, including interologs based on data for other model organisms and humans. All interactions are annotated with original experimental data and source information. DroID can be searched and filtered based on interaction information or a comprehensive set of gene attributes from Flybase. DroID also contains gene expression and expression correlation data that can be searched and used to filter datasets, for example, to focus a study on sub-networks of co-expressed genes. To address the inherent noise in interaction data, DroID employs an updatable confidence scoring system that assigns a score to each physical interaction based on the likelihood that it represents a biologically significant link.
DroID is the most comprehensive interactions database available for Drosophila. To facilitate downstream analyses, interactions are annotated with original experimental information, gene expression data, and confidence scores. All data in DroID are freely available and can be searched, explored, and downloaded through three different interfaces, including a text based web site, a Java applet with dynamic graphing capabilities (IM Browser), and a Cytoscape plug-in. DroID is available at http://www.droidb.org.
Many of the important properties of biological systems emerge as a result of the interactions among genes and among their protein products. Genes and the proteins they encode participate in gene-gene, gene-protein, and protein-protein interactions to mediate a wide variety of biological processes. An increasing appreciation for the importance of charting these interactions has lead to many large-scale efforts to identify gene and protein interactions for a number of systems . As this data continues to accumulate from a variety of sources there is an increasing need for comprehensive databases and analysis tools that allow biologists to make use of it. Genes and proteins that function in the same pathway, for example, interact directly or indirectly, and their functions can only be fully understood in the context of the interaction networks to which they belong.
Gene and protein interaction data have come from a variety of sources. To detect protein-protein interactions, for example, high throughput yeast two-hybrid [2–5] and co-affinity purification [6, 7] screens have been developed and applied to proteins from humans and several model organisms. To generate large networks of gene-protein interactions, high throughput techniques are being developed for detecting transcription factors and other proteins bound to DNA [8–11]. Finally, gene-gene interactions that suggest functional relationships between pairs of genes are being revealed by large-scale assays for genetic interactions [12, 13]. While each type of interaction data has proven useful for understanding how genes and their products work together in biological systems, the large amount of disparate data can be difficult to access and interpret. Combining data from different sources has become important because no single screen or technique is free from false positives and false negatives. Many studies have shown, for example, that interactions detected in multiple screens or by multiple techniques are less likely to be false positives (e.g., ), so that combining datasets can provide a simple way to gain confidence in any particular set of interactions. Likewise, the inability of any one technique or particular screen to detect all biologically relevant interactions suggests that combining datasets increases coverage.
A number of centralized databases have been implemented to store gene and protein interaction data and to make it publicly available [15–21]. While most of the data are from large-scale screens, several of these databases have also begun to include data from small-scale 'low throughput' experiments collected by manual curation of the literature. Despite the ideal of central databases to be comprehensive, a surprising number of interactions can be found in one database but not another [22, 23]. Thus, biologists have been well advised to consult multiple databases to get a complete picture of the available data. Most of the large interaction databases include data for many different species. Such multi-species databases, however, are rarely fully comprehensive for any one organism; for example, organism-specific gene information, such as gene expression and phenotype data is not available for searching and filtering the interaction data. Multi-organism databases also have difficulty representing potentially conserved interactions for any given species. Finding conserved interactions requires looking up the orthologous proteins and conducting searches for interaction data in each of several different organisms. Recently, a few public databases have addressed these issues in efforts to generate comprehensive resources for a particular species; e.g., HomoMINT  and UniHI for humans . DroID is designed to be a comprehensive interactions database dedicated to the important model organism, Drosophila melanogaster.
Construction and content
DroID is an extensive update of an earlier database . New features that are described in more detail below include a web interface, gene expression data, calculated gene correlation values, confidence scores, and substantially more interaction data. In addition, DroID is updated quarterly and each version is available for download. The current version of DroID (v4.0) is described here.
Interaction data and generation of interologs
DroID interaction datasets
Number of Interactions
Number of Genes
Curagen yeast two-hybrid
Finley yeast two-hybrid
Hybrigenics yeast two-hybrid
Other physical interactions*
DroID contains the yeast two-hybrid interactions published in three major studies [29–31] in addition to unpublished interactions from an ongoing large-scale two-hybrid screening project [29, 32]. Full experimental details as reported by the original publications are included. For Drosophila physical interactions not covered by the three large-scale yeast two-hybrid screens and for interactions of human, worm (C. elegans), and yeast proteins, raw data are downloaded from respective online databases. These databases include BioGRID , IntAct , and MINT , in addition to MIPS  for yeast and HPRD , PDZbase , and Reactome  for human. To enable periodic updates we established a pipeline for entering data into DroID as follows. First, raw interaction data is parsed to ensure that it includes only physical protein-protein interactions. DroID obtains interactions annotated with at least one detection method that detects physical interactions (e.g., yeast two-hybrid, mass spectrometry, pull down, etc.). Second, we map genes to uniform identifiers for the four organisms utilized by DroID; that is, Flybase gene number (FBgn) for fly, Ensembl gene identifier (ENSG) for human, Wormbase gene identifier (WBGene) for worm, and ORF identifiers for yeast. For each interaction, DroID stores the original PubMed identifier (PMID), methods used in detecting it, and the databases reporting it. Finally, we map interactions collected from human, worm, and yeast to Drosophila interologs by orthology mapping using Inparanoid (currently at version 6) . DroID also stores genetic interactions obtained from Flybase, each annotated by reference numbers that trace to original data sources. Aside from interologs, DroID currently does not include interactions based solely on computational predictions, which may be found in other databases [36, 37]. For example, the Fly-DPI database has Drosophila protein interactions predicted on the basis of domain pairs found in experimental PPI .
Gene attributes and gene expression data
DroID includes a searchable gene attributes table populated from periodically updated gene annotations available in Flybase . Users can search for interactions involving specific genes by searching for gene names, symbols, synonyms, or gene identifiers. The gene attributes table also allows searches based on gene class, gene function annotations based on gene ontology (GO) , and protein domains. The IM Browser interface  further extends this search ability by enabling a live search of Flybase for genes based on additional attributes, including reference and phenotype.
DroID also stores searchable gene expression data, which allows interaction data to be viewed and filtered in the context of gene expression patterns. DroID currently has two microarray-based gene expression datasets that can be used to constrain a search for interactions. One dataset includes genome-wide expression profiles over the course of embryogenesis in half-hour increments , and the other includes expression profiles for a developmental time course from early embryos through adults . DroID can accommodate additional gene and protein expression data as they become available.
Gene expression correlation
Genes that are frequently co-expressed often function together in common processes (e.g., [41, 42]). Thus, there is substantial value in knowing the level of co-expression for pairs of genes that interact. To facilitate co-expression analyses for Drosophila, we computed correlation values between pair-wise expression profiles derived from the Gene Expression Omnibus (GEO) database . We downloaded all D. melanogaster gene expression datasets from GEO and computed linear Pearson correlations between pair-wise expression profiles within each dataset. We first removed datasets that have less than 5 samples (e.g., tissues, conditions, or time points) to avoid possible spurious strong correlations. This resulted in 49 genome-wide expression datasets with 844 combined samples. Multiple correlations for a pair of genes from different datasets were then combined to produce a final correlation value for a specific gene pair. The combination is done based on how many samples each dataset has. Intuitively, a correlation based on a dataset having many samples may be more significant than the same value derived from another dataset with only a few samples. If there are n datasets, and each reports a correlation of xi for a gene pair, the final correlation value is computed by
corr = Σi(xi * si)/Σi(si)
where i ∈ [1, 2, ..., n], and si represents the number of samples in data set i. Every interaction in DroID is annotated with the current gene expression correlation value for that pair. Correlation values are updateable as new gene expression datasets are added to GEO.
Cross data set confidence scores
Protein-protein interaction data tend to be noisy, with variable rates of false positives from one dataset to another. A novel feature of DroID is the annotation of each physical protein-protein interaction with an updateable confidence score that reflects the probability that it is a biologically relevant true positive. Most methods for generating confidence scores work within a single type of data, such as yeast two-hybrid or protein complex data, by searching for features of the data that correlate with biological significance [30, 44–47]. As a consequence, the scores derived for one data set bear little relation to those for another data set. In contrast, DroID assigns confidence scores to all physical interactions, including data from different techniques and interologs derived from worm, yeast, and human. The method used to assign confidence scores is based on the logistic regression approach described by Giot et al. [30, 44]. In this approach we first identify training data, including a set of interactions that are likely to be true positives and another set that are likely to be false positives. We then search for specific attributes of the interactions that correlate with the two training sets. The attributes include gene expression correlation, number of associated literature citations (PubMed identifiers or PMIDs), local and global network topology, and domain-domain interactions. For example, the number of PMIDs for an interaction correlates with its likelihood of being in the true positive training set. Conversely, the number of interactions for a protein is inversely correlated with presence in the true positive training set. The correlations are then used to train a logistic regression model that can assign scores to all interactions based on their attributes. For the interactions in DroID, we used a variation of this scoring system in which we combine multiple training datasets to reduce the potential bias of any single training set (Yu, submitted).
Every physical interaction in DroID has a confidence score between 0 and 1 to represent the probability that it is a biological true positive. Validation of the scoring system shows that interactions with higher scores are more likely to be biologically relevant than interactions with lower scores (Yu et al., submitted; ). The set of interactions with scores greater than 0.5, for example, have significantly more pairs of genes that share GO biological process or cellular component annotations compared to interactions scoring < 0.5, or to random pairs. Interactions scoring less than 0.5 also share significantly more GO annotations than random pairs of genes, which indicates that overall the interactions collected in DroID are enriched for biologically relevant true positives. As additional interaction data and other new information become available, the scoring models can be periodically retrained to improve the overall accuracy. Thus, the confidence scores are updateable and receive a version number at each revision.
Utility and discussion
Summary of the database
Overlap between interaction datasets
Gene expression correlation
Viewing interaction data in the context of gene expression data
Gene expression data can also be used to view interaction data in a dynamic context. Most gene and protein interaction data that are currently available come from studies that are independent of gene expression. Examples include yeast two-hybrid data in which pairs of proteins are expressed together in yeast, whether or not they are co-expressed in vivo, and co-AP experiments in which often at least one of the proteins is artificially expressed with an affinity tag in tissue culture cells. Thus, the protein interactions in DroID and most other databases represent pairs of proteins that may interact in vivo, but only if they are expressed together. A powerful way to view this interaction data, therefore, is in the context of gene expression patterns for a particular tissue or developmental time point. DroID includes gene expression data from genome-wide developmental studies. This data can be used to constrain a set of interactions to include only genes expressed at a user-defined level and time point or developmental stage.
DroID access interfaces
All data in DroID can be accessed and downloaded in part or whole via three different interfaces (Figure 1). A user-friendly web interface is provided for simple searching, browsing, and downloading of DroID data. Going to the DroID web page opens a search box, which asks users for a term describing a gene or protein. The term can be a gene symbol, name, synonym, or a term describing a gene or protein (Figure 1A). Clicking 'Search Genes' produces a page listing genes that fit the search criteria. On this page, users select one or any number of the genes, and then have the option to select specific interaction datasets or to search all of them simultaneously. The search produces a results page listing the found interactions and their current confidence scores (Figure 1B). Each interaction is represented by the symbols of the two genes and a list of the datasets in which they were found. Additional information about each gene, including GO annotations and links to Flybase can be obtained by clicking on the gene symbol. Similarly, clicking on the dataset name for each interaction reveals its details, including original experimental data when available, references, and relevant links. The results page also includes several additional options for further analysis. These include an option to show the gene expression correlation values for each interaction and an option to filter the results by gene expression patterns or confidence scores. Utilizing these filters helps researchers to focus on interactions that are more likely to be true positives or that involve co-expressed genes. The results page also includes a link for downloading the interactions in formats that can then be uploaded into network analysis programs. Finally, a link is included that will generate a summary table showing the number of interactions for the selected genes in each of the interaction datasets, including those not originally searched. The summary table also includes a button that automatically opens the IM Browser applet to generate a graphical map of the interactions (see below).
DroID can be accessed via two different dynamic interfaces that allow an interaction network to be explored as a graph where nodes represent genes or proteins and edges connecting the nodes represent interactions. Viewing an interaction map in this way places each gene and interaction into the context of other interactions and facilitates biological insights that are not possible from simple lists of interactions. The first interface is a plug-in (Figure 1C) that allows DroID to be accessed through the powerful network visualization and analysis program, Cytoscape . The second interface is IM Browser (Figure 1D), a program originally designed to access an earlier version of DroID and other interaction databases . IM Browser runs as a java applet and allows advanced queries and dynamic graphing of search results. While a complete description of IM Browser capabilities is beyond the scope of this paper, a few features are worth noting here. First, the program easily accommodates new types of interaction data and dynamically enables all node and edge information to be used in searches and filtering. This feature is important as new techniques for detecting interactions are needed and continue to emerge, and each new technique has its own type of data. Second, interaction maps can be edited and saved to the user's local computer, and local datasets can be loaded into the program to allow the user to view and analyze their own interactions in the context of DroID data. Finally, a new feature of IM Browser allows maps to be filtered based on gene expression data or confidence scores. The constraint is implemented as a dynamic filter that can be applied to an existing interaction map. As new gene expression data becomes available, and eventually protein expression data is collected from proteomics studies, an increasingly fruitful way to view interaction maps will be in the context of specific temporal and spatial expression patterns.
DroID is a comprehensive interactions database designed specifically for Drosophila melanogaster. The database currently covers more Drosophila genes and interactions than any other single database and is periodically updated. Because it is an organism-specific database, it readily includes potentially conserved interactions found in other organisms by mapping them to Drosophila genes. The database also includes comprehensive gene information, including Drosophila-specific information, which can be used to search for and filter interactions and to analyze gene networks. DroID includes gene expression data, both as expression profiles and as correlation values, to help researchers link together genes that may function together in specific biological processes. Finally, DroID assigns updateable confidence scores to every physical interaction to help focus studies on biologically relevant links. Combined with three user interfaces, DroID should provide a valuable resource for studying Drosophila systems.
Availability and requirements
DroID is freely available for non-commercial use. Any modern web browser can access the DroID home page at http://www.droidb.org/. A web browser with an installed Java Virtual Machine can access IM Browser from the DroID home page or from a list of found interactions. Cytoscape  enables installation and usage of the DroID plugin for Cytoscape.
We thank Jodi R. Parrish, George G. Roberts III, and Stephen Guest for helpful comments on the manuscript. This work was supported in part through grants from the National Institutes of Health (HG001536) and the National Center for Research Resources (RR18327).
- Shoemaker BA, Panchenko AR: Deciphering protein-protein interactions. Part I. Experimental techniques and databases. PLoS Comput Biol. 2007, 3 (3): e42-10.1371/journal.pcbi.0030042.PubMedView ArticleGoogle Scholar
- Fields S, Song O: A Novel Genetic System to Detect Protein-Protein Interactions. Nature. 1989, 340 (6230): 245-246. 10.1038/340245a0.PubMedView ArticleGoogle Scholar
- Parrish JR, Gulyas KD, Finley RL: Yeast two-hybrid contributions to interactome mapping. Curr Opin Biotechnol. 2006, 17 (4): 387-393. 10.1016/j.copbio.2006.06.006.PubMedView ArticleGoogle Scholar
- Fields S: High-throughput two-hybrid analysis. The promise and the peril. Febs J. 2005, 272 (21): 5391-5399. 10.1111/j.1742-4658.2005.04973.x.PubMedView ArticleGoogle Scholar
- Vidal M: Interactome modeling. FEBS Lett. 2005, 579 (8): 1834-1838. 10.1016/j.febslet.2005.02.030.PubMedView ArticleGoogle Scholar
- Gavin AC, Superti-Furga G: Protein complexes and proteome organization from yeast to man. Curr Opin Chem Biol. 2003, 7 (1): 21-27. 10.1016/S1367-5931(02)00007-8.PubMedView ArticleGoogle Scholar
- Gingras AC, Aebersold R, Raught B: Advances in protein complex analysis using mass spectrometry. J Physiol. 2005, 563 (Pt 1): 11-21.PubMedView ArticleGoogle Scholar
- Deplancke B, Mukhopadhyay A, Ao W, Elewa AM, Grove CA, Martinez NJ, Sequerra R, Doucette-Stamm L, Reece-Hoyes JS, Hope IA: A gene-centered C. elegans protein-DNA interaction network. Cell. 2006, 125 (6): 1193-1205. 10.1016/j.cell.2006.04.038.PubMedView ArticleGoogle Scholar
- Lee TI, Rinaldi NJ, Robert F, Odom DT, Bar-Joseph Z, Gerber GK, Hannett NM, Harbison CT, Thompson CM, Simon I: Transcriptional regulatory networks in Saccharomyces cerevisiae. Science. 2002, 298 (5594): 799-804. 10.1126/science.1075090.PubMedView ArticleGoogle Scholar
- Mukherjee S, Berger MF, Jona G, Wang XS, Muzzey D, Snyder M, Young RA, Bulyk ML: Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays. Nat Genet. 2004, 36 (12): 1331-1339. 10.1038/ng1473.PubMedView ArticleGoogle Scholar
- Sandmann T, Jakobsen JS, Furlong EE: ChIP-on-chip protocol for genome-wide analysis of transcription factor binding in Drosophila melanogaster embryos. Nat Protoc. 2006, 1 (6): 2839-2855. 10.1038/nprot.2006.383.PubMedView ArticleGoogle Scholar
- Boone C, Bussey H, Andrews BJ: Exploring genetic interactions and networks with yeast. Nat Rev Genet. 2007, 8 (6): 437-449. 10.1038/nrg2085.PubMedView ArticleGoogle Scholar
- Lehner B, Crombie C, Tischler J, Fortunato A, Fraser AG: Systematic mapping of genetic interactions in Caenorhabditis elegans identifies common modifiers of diverse signaling pathways. Nat Genet. 2006, 38 (8): 896-903. 10.1038/ng1844.PubMedView ArticleGoogle Scholar
- von Mering C, Krause R, Snel B, Cornell M, Oliver SG, Fields S, Bork P: Comparative Assessment of Large-Scale Data Sets of Protein-Protein Interactions. Nature. 2002, 417 (6887): 399-403. 10.1038/nature750.PubMedView ArticleGoogle Scholar
- Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, Eisenberg D: The Database of Interacting Proteins: 2004 update. Nucleic Acids Res. 2004, D449-451. 10.1093/nar/gkh086. 32 DatabaseGoogle Scholar
- Kerrien S, Alam-Faruque Y, Aranda B, Bancarz I, Bridge A, Derow C, Dimmer E, Feuermann M, Friedrichsen A, Huntley R: IntAct – open source resource for molecular interaction data. Nucleic Acids Res. 2007, D561-565. 10.1093/nar/gkl958. 35 DatabaseGoogle Scholar
- Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006, D535-539. 10.1093/nar/gkj109. 34 DatabaseGoogle Scholar
- Chatr-aryamontri A, Ceol A, Palazzi LM, Nardelli G, Schneider MV, Castagnoli L, Cesareni G: MINT: the Molecular INTeraction database. Nucleic Acids Res. 2007, D572-574. 10.1093/nar/gkl950. 35 DatabaseGoogle Scholar
- Guldener U, Munsterkotter M, Oesterheld M, Pagel P, Ruepp A, Mewes HW, Stumpflen V: MPact: the MIPS protein interaction resource on yeast. Nucleic Acids Res. 2006, D436-441. 10.1093/nar/gkj003. 34 DatabaseGoogle Scholar
- Mishra GR, Suresh M, Kumaran K, Kannabiran N, Suresh S, Bala P, Shivakumar K, Anuradha N, Reddy R, Raghavan TM: Human protein reference database – 2006 update. Nucleic Acids Res. 2006, D411-414. 10.1093/nar/gkj141. 34 DatabaseGoogle Scholar
- Vastrik I, D'Eustachio P, Schmidt E, Joshi-Tope G, Gopinath G, Croft D, de Bono B, Gillespie M, Jassal B, Lewis S: Reactome: a knowledge base of biologic pathways and processes. Genome Biol. 2007, 8 (3): R39-10.1186/gb-2007-8-3-r39.PubMedView ArticleGoogle Scholar
- Reguly T, Breitkreutz A, Boucher L, Breitkreutz BJ, Hon GC, Myers CL, Parsons A, Friesen H, Oughtred R, Tong A: Comprehensive curation and analysis of global interaction networks in Saccharomyces cerevisiae. J Biol. 2006, 5 (4): 11-10.1186/jbiol36.PubMedView ArticleGoogle Scholar
- Mathivanan S, Periaswamy B, Gandhi TK, Kandasamy K, Suresh S, Mohmood R, Ramachandra YL, Pandey A: An evaluation of human protein-protein interaction data in the public domain. BMC Bioinformatics. 2006, 7 (Suppl 5): S19-10.1186/1471-2105-7-S5-S19.PubMedView ArticleGoogle Scholar
- Persico M, Ceol A, Gavrila C, Hoffmann R, Florio A, Cesareni G: HomoMINT: an inferred human network based on orthology mapping of protein interactions discovered in model organisms. BMC Bioinformatics. 2005, 6 (Suppl 4): S21-10.1186/1471-2105-6-S4-S21.PubMedView ArticleGoogle Scholar
- Chaurasia G, Iqbal Y, Hanig C, Herzel H, Wanker EE, Futschik ME: UniHI: an entry gate to the human protein interactome. Nucleic Acids Res. 2007, D590-594. 10.1093/nar/gkl817. 35 DatabaseGoogle Scholar
- Crosby MA, Goodman JL, Strelets VB, Zhang P, Gelbart WM: FlyBase: genomes by the dozen. Nucleic Acids Res. 2007, D486-491. 10.1093/nar/gkl827. 35 DatabaseGoogle Scholar
- Yu H, Luscombe NM, Lu HX, Zhu X, Xia Y, Han JD, Bertin N, Chung S, Vidal M, Gerstein M: Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogs. Genome Res. 2004, 14 (6): 1107-1118. 10.1101/gr.1774904.PubMedView ArticleGoogle Scholar
- Pacifico S, Liu G, Guest S, Parrish JR, Fotouhi F, Finley RL: A database and tool, IM Browser, for exploring and integrating emerging gene and protein interaction data for Drosophila. BMC Bioinformatics. 2006, 7: 195-10.1186/1471-2105-7-195.PubMedView ArticleGoogle Scholar
- Stanyon CA, Liu G, Mangiola BA, Patel N, Giot L, Kuang B, Zhang H, Zhong J, Finley RL: A Drosophila Protein-Interaction Map Centered on Cell-Cycle Regulators. Genome Biology. 2004, 5 (12): R96-10.1186/gb-2004-5-12-r96.PubMedView ArticleGoogle Scholar
- Giot L, Bader JS, Brouwer C, Chaudhuri A, Kuang B, Li Y, Hao YL, Ooi CE, Godwin B, Vitols E: A protein interaction map of Drosophila melanogaster. Science. 2003, 302 (5651): 1727-1736. 10.1126/science.1090289.PubMedView ArticleGoogle Scholar
- Formstecher E, Aresta S, Collura V, Hamburger A, Meil A, Trehin A, Reverdy C, Betin V, Maire S, Brun C: Protein interaction mapping: a Drosophila case study. Genome Res. 2005, 15 (3): 376-384. 10.1101/gr.2659105.PubMedView ArticleGoogle Scholar
- Stanyon CA, Finley RL: Progress and potential of Drosophila protein interaction maps. Pharmacogenomics. 2000, 1 (4): 417-431. 10.1517/146224188.8.131.527.PubMedView ArticleGoogle Scholar
- Mewes HW, Amid C, Arnold R, Frishman D, Guldener U, Mannhaupt G, Munsterkotter M, Pagel P, Strack N, Stumpflen V: MIPS: analysis and annotation of proteins from whole genomes. Nucleic Acids Res. 2004, D41-44. 10.1093/nar/gkh092. 32 DatabaseGoogle Scholar
- Beuming T, Skrabanek L, Niv MY, Mukherjee P, Weinstein H: PDZBase: a protein-protein interaction database for PDZ-domains. Bioinformatics. 2005, 21 (6): 827-828. 10.1093/bioinformatics/bti098.PubMedView ArticleGoogle Scholar
- O'Brien KP, Remm M, Sonnhammer EL: Inparanoid: a comprehensive database of eukaryotic orthologs. Nucleic Acids Res. 2005, D476-480. 33 DatabaseGoogle Scholar
- von Mering C, Jensen LJ, Snel B, Hooper SD, Krupp M, Foglierini M, Jouffre N, Huynen MA, Bork P: STRING: known and predicted protein-protein associations, integrated and transferred across organisms. Nucleic acids research. 2005, D433-437. 33 DatabaseGoogle Scholar
- Lin CY, Chen SH, Cho CS, Chen CL, Lin FK, Lin CH, Chen PY, Lo CZ, Hsiung CA: Fly-DPI: database of protein interactomes for D. melanogaster in the approach of systems biology. BMC bioinformatics. 2006, 7 (Suppl 5): S18-10.1186/1471-2105-7-S5-S18.PubMedView ArticleGoogle Scholar
- The Gene Ontology Consortium: Gene Ontology: Tool for the Unification of Biology. Nature Genetics. 2000, 25 (25–29):Google Scholar
- Tomancak P, Beaton A, Weiszmann R, Kwan E, Shu S, Lewis SE, Richards S, Ashburner M, Hartenstein V, Celniker SE: Systematic determination of patterns of gene expression during Drosophila embryogenesis. Genome Biol. 2002, 3 (12): RESEARCH0088-10.1186/gb-2002-3-12-research0088.PubMedView ArticleGoogle Scholar
- Arbeitman MN, Furlong EEM, Imam F, Johnson E, Null BH, Baker BS, Krasnow MA, Scott MP, Davis RW, White KP: Gene Expression During the Life Cycle of Drosophila melanogaster. Science. 2002, 297: 2270-2275. 10.1126/science.1072152.PubMedView ArticleGoogle Scholar
- Hooper SD, Boue S, Krause R, Jensen LJ, Mason CE, Ghanim M, White KP, Furlong EE, Bork P: Identification of tightly regulated groups of genes during Drosophila melanogaster embryogenesis. Mol Syst Biol. 2007, 3: 72-10.1038/msb4100112.PubMedView ArticleGoogle Scholar
- Lee I, Date SV, Adai AT, Marcotte EM: A probabilistic functional network of yeast genes. Science. 2004, 306 (5701): 1555-1558. 10.1126/science.1099511.PubMedView ArticleGoogle Scholar
- Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, Kim IF, Soboleva A, Tomashevsky M, Edgar R: NCBI GEO: mining tens of millions of expression profiles – database and tools update. Nucleic Acids Res. 2007, D760-765. 10.1093/nar/gkl887. 35 DatabaseGoogle Scholar
- Parrish JR, Yu J, Liu G, Hines JA, Chan JE, Mangiola BA, Zhang H, Pacifico S, Fotouhi F, Dirita VJ: A proteome-wide protein interaction map for Campylobacter jejuni. Genome Biol. 2007, 8 (7): R130-10.1186/gb-2007-8-7-r130.PubMedView ArticleGoogle Scholar
- Deng M, Sun F, Chen T: Assessment of the Reliability of Protein-Protein Interactions and Protein Function Prediction. Pacific Symposium on Biocomputing. 2003, 8: 140-151.Google Scholar
- Bader JS, Chaudhuri A, Rothberg JM, Chant J: Gaining Confidence in High-Throughput Protein Interaction Networks. Nature Biotechnology. 2004, 22 (1): 78-85. 10.1038/nbt924.PubMedView ArticleGoogle Scholar
- Suthram S, Shlomi T, Ruppin E, Sharan R, Ideker T: A direct comparison of protein interaction confidence assignment schemes. BMC Bioinformatics. 2006, 7: 360-10.1186/1471-2105-7-360.PubMedView ArticleGoogle Scholar
- Ge H, Liu Z, Church GM, Vidal M: Correlation between Transcriptome and Interactome Mapping Data from Saccharomyces cerevisiae. Nature Genetics. 2001, 29 (482–486):Google Scholar
- Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13 (11): 2498-2504. 10.1101/gr.1239303.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.