Stratification of candidate genes for Parkinson’s disease using weighted protein-protein interaction network analysis
© The Author(s). 2018
Received: 15 January 2018
Accepted: 18 May 2018
Published: 13 June 2018
Genome wide association studies (GWAS) have helped identify large numbers of genetic loci that significantly associate with increased risk of developing diseases. However, translating genetic knowledge into understanding of the molecular mechanisms underpinning disease (i.e. disease-specific impacted biological processes) has to date proved to be a major challenge. This is primarily due to difficulties in confidently defining candidate genes at GWAS-risk loci. The goal of this study was to better characterize candidate genes within GWAS loci using a protein interactome based approach and with Parkinson’s disease (PD) data as a test case.
We applied a recently developed Weighted Protein-Protein Interaction Network Analysis (WPPINA) pipeline as a means to define impacted biological processes, risk pathways and therein key functional players. We used previously established Mendelian forms of PD to identify seed proteins, and to construct a protein network for genetic Parkinson’s and carried out functional enrichment analyses. We isolated PD-specific processes indicating ‘mitochondria stressors mediated cell death’, ‘immune response and signaling’, and ‘waste disposal’ mediated through ‘autophagy’. Merging the resulting protein network with data from Parkinson’s GWAS we confirmed 10 candidate genes previously selected by pure proximity and were able to nominate 17 novel candidate genes for sporadic PD.
With this study, we were able to better characterize the underlying genetic and functional architecture of idiopathic PD, thus validating WPPINA as a robust pipeline for the in silico genetic and functional dissection of complex disorders.
In the past decade, high throughput technologies, such as DNA microarray, exome and genome sequencing, have provided investigators with an extensive catalogue of genes and genetic variations associated with complex disorders .
Interpreting GWAS results is a challenging task [6, 7] as not only is it difficult to confidently define the authentic associated marker (the top identified SNP might be in linkage disequilibrium [LD] with the real associated SNP), but also it is now recognized that the practice of choosing candidate genes by proximity to a strongly associated SNP is unlikely accurate [8–10]. Furthermore, there are significant challenges in then characterising functional/biological effects to the top SNPs. This clearly indicates that making the step from genomic insight(s) to functional translation is currently arduous due to the limited success in progressing from genomic loci to candidate genes, coupled with the prevalent functional practice of studying one gene at a time.
In this study, we applied a systematic computational approach to investigate the interactome of the PD-Mendelian genes through weighted protein-protein interaction network analysis (WPPINA)  as a means to define PD-specific impacted pathways and stratify candidate genes within PD-GWAS loci. Specifically, we selected proteins encoded by PD-Mendelian genes as seeds, and harvested data of wet-lab proven protein-protein interactors (PPI) to build the PD-interactome for which we then performed functional annotation analysis to highlight impacted biological processes and to list all the proteins contributing to these systems. Finally, we mapped the genes encoding all such proteins to the loci highlighted in the PD meta-analysis  to assist identification of candidate genes driving risk in sporadic PD on the basis of available functional knowledge.
Using PD as a case study, we demonstrate that WPPINA, in combination with genomic approaches, is a powerful and adaptable tool to assess and prioritize candidate genes within risk-loci of complex disorders.
We sought to consider the interactomes associated with the parkinsonian syndromes, which manifest in classical PD - the focus of the current study – and a subset of cases diagnosed with frontotemporal dementia (FTD) with parkinsonism and in a group of conditions classified under the umbrella term ‘parkinsonian spectrum’ (PS, see also Additional files 1 and 2: Table S1). We used all known Mendelian genes associated with these conditions as seeds (Table 1) to build syndrome-specific networks to: i) highlight syndrome-specific impacted biological processes and relevant nodes therein, and; ii) aid prioritization of candidate genes within the PD-GWAS loci. In this fashion, we were able to test for specificity of our method prior to carrying forward the PD-GWAS candidate genes prioritization on the basis of the PD-specific impacted biological processes and the key proteins therein.
Network construction and topological analysis
For each Mendelian gene used as a seed the WPPINA builds a protein network composed of the seeds, their direct interactors (first layer nodes) and the direct interactors of each first layer node (second layer nodes).
The construction of the second layer resulted in a comprehensive network made of 5414 nodes and 18,492 undirected edges (Fig. 1e); the seed-centrality issue was diluted as confirmed by an increase in average number of neighbors and connection density, and by a decrease of the average shortest path (Additional file 3). The analysis of the first + second layer networks led to the identification of inter-interactome hubs (IIHs) that are nodes with a degree of interactome connection (IDC) ≥ 0.6 (Additional file 4). These nodes represent the shortest path to connect more than 60% of the seed genes, therefore (based on the network parsimony hypothesis ) it is reasonable to expect these to reside in disease relevant pathways, congruent with the majority of the seeds, and therefore at the core of the syndrome-specific network.
Network functional analysis
We first analyzed the entire network (input of 5414 nodes) for enrichment of Gene Ontology (GO) biological processes (BPs) terms (Additional file 5); we then applied the same procedure to the IIHs for PD, FTD and PS separately (Additional files 6, 7 and 8)  highlighting a subset of semantic classes indicative of syndrome-specific biological processes (Additional file 9).
PD-GWAS candidate genes prioritization
Ten/27 candidate genes (5 of which are also Mendelian) – GBA, SYT11, SNCA, RAB7L1(RAB29), INPP5F, DLG2, LRRK2, GCH1, MAPT, and RAI1 – previously selected by proximity were confirmed by our functional approach. The remaining 17/27 candidate genes – ZBTB7B, MUC1, CLK2, DAP3, SCAMP3, ADAM15, RUSC1, SHC1, ARHGEF2, KAT8, STX4, IGF2BP3, PPM1L, DCUN1D1, NSF, BAG3 and TOM1L2 – are to be regarded as novel PD candidate genes. Cell type specific expression for each prioritized candidate gene was calculated from the dataset generated by Zhang et al.  showing that neurons and mature astrocytes almost equally share expression of the prioritized candidate genes (Additional files 15 and 16).
To validate this analysis, we ran 100,000 random simulations. The p-values associated with the experimental analysis in comparison with the random distribution showed strong statistical significance (Additional file 17, p = 0.004 for LD r2 ≥ 0.5 and p = 0.005 for LD r2 ≥ 0.8). Analytic p-values values generated using the hypergeometric distribution also led to statistically significant results (p = 0.009 for LD r2 ≥ 0.5 and p = 0.012 for LD r2 ≥ 0.8). We undertook an additional validation step by assessing the total number of annotations reported in GO for each ORF within the analyzed haplotypes to verify potential annotation bias possibly impacting the specificity of the GWAS loci prioritization. As shown in Additional file 18, the number of annotations per genes in GO does not affect prioritization specificity.
Familial and sporadic PD cases associate with highly/moderately penetrant variants in a number of Mendelian genes and multiple risk loci with small to moderate effect size, respectively. Despite an abundance of genetic data, translating PD genetics into understanding of the functional and molecular mechanisms underlying PD pathogenesis is a challenging task. For Mendelian genes, this is partially due to differences in the pace of genetic data generation (fast) versus functional and molecular assessments (slow) that still evaluate one gene at a time. For sporadic cases, their association with multiple risk markers (that indicate loci rather than genes) results in additional theoretical and practical issues affecting the design of functional experiments.
A number of methods to prioritize genes within GWAS loci have recently emerged: burden scoring at gene or pathway level [15, 16], GWAS data integration with cis-eQTL signals or epigenetic markers [17–20], and PPIs-based methods [9, 21]. However, while these hold much promise, none has yet solved the issue of confidently prioritizing genes within GWAS loci and it is still the case that in published GWAS nominated genes are assigned to the top SNPs predominantly by proximity, an approach that not only is arbitrary but also inaccurate and ineffective to gather insight on molecular mechanisms impacted in disease pathogenesis . Therefore, there is an urgent requirement for additional pipelines to further and better characterize GWAS loci as well as the impacted biological processes, risk pathways and therein key functional players for potential future targeting is real [1, 22].
In this view, our recently developed pipeline, WPPINA , is complementary to existing models given that it builds protein networks as a direct function of the disease under investigation (i.e. on the basis of their Mendelian genetics). By applying this pipeline to PD, FTD and PS we were able to highlight syndrome-specific PPI networks, impacted biological processes and associated functional players at the core of each disease (Fig. 2). We found that specific processes were differentially relevant across the 3 syndromes: ‘immune response’, particularly ‘signaling in immune response’, was relevant for PD and PS, ‘DNA damage response’ was relevant for FTD only, and ‘waste disposal’ was differentially relevant for PD (i.e. ‘autophagy’) and FTD (i.e. ‘unfolded protein response’ and ‘ubiquitin proteasome system’). Particularly, we isolated ‘mitochondria stressors’ and ‘mitochondria mediated cell death’, ‘immune response and signaling’ (i.e. ‘immune system receptors’ and ‘cytokine signaling’), and ‘waste disposal’ mediated through ‘autophagy’ as PD-specific processes.
After verifying the suitability of our approach in highlighting syndrome-specific impacted biological processes, we then focused on the PD-interactome to prioritize candidate genes within PD-GWAS loci (Fig. 3)  on the basis of functional relevance rather than proximity. We used the totality of the proteins indicated by the PD-specific processes and mapped them onto the PD-GWAS loci, confirming 10 candidate genes (previously nominated by proximity) and identifying 17 novel candidate genes (Fig. 3 and Additional file 15) for sporadic PD. Of note, for some of the loci we identified more than one candidate gene. This could be due to a number of reasons: i) the top SNP at the risk locus may mask a lower (but still biologically significant) signal of a second, independent SNP , and/or; ii) the same SNP(s) might influence more than one ORF  as, for example, in the case of rs34311866 for which classical wet-lab approaches prioritized different genes (i.e. GAK and TMEM175) [25, 26]. It is also noteworthy that we were not able to assign a gene to every locus. This might reflect that: i) the PPI network is incomplete (i.e. partly because not all experimental data has been captured by manual curation, and partly because the network can only be as complete as the experiments that have been performed and published), and/or; ii) we may have lowered the number of potentially positive nodes by using a stringent filtering to only retain robustly validated interactors in the network. Finally, the risk loci may target functional elements that reside in -trans rather than in -cis (e.g. distal enhancers or silencers) or non-coding RNAs, rather than protein coding genes .
Through this in silico study of a multifactorial, complex neurodegenerative disorder, we show that our pipeline can be applied to: i) define disease-specific biological processes on the basis of known Mendelian genes, and; ii) provide a list of proteins involved in disease-specific processes that can be used to prioritize candidate genes in GWAS loci.
This pipeline aids in expanding on the genetic and functional architecture underlying idiopathic forms of disease. In doing so, it provides the basis for further genetic (e.g. such list of proteins/genes might be screened for rare variants within whole exome sequencing data sets) and hypothesis-driven functional studies to validate-risk pathways as well as identify targets for the development of therapies in the future.
Methods are described in detail in the Additional file 1.
Construction of the PPI network
PPIs of Mendelian-PD gene products were downloaded for each seed protein as MITAB 2.5 files (January-2016) from the IntAct , BioGRID , InnateDB, InnateDB-all, InnateDB-IMEx  and MINT  databases by means of the PSICQUIC platform (http://www.ebi.ac.uk/Tools/webservices/psicquic/view/main.xhtml) developed by the IMEx consortium . All PPIs underwent quality control (QC) and filtering. Only human interaction and experimentally proven physical interaction were retained (predicted interactions and expanded complexes were removed). The interactions were then scored and thresholded taking into consideration the number of different publications and methods reporting the interaction (see  and Additional file 1 for details). Polyubiquitin-C, B and D (UBC, UBB and UBD) were excluded from the network as they may indicate unspecific binding of ubiquitin to proteins tagged for degradation.
The inter-interactome degree (IID) for each single node was calculated based on the number of different interactomes that node belonged to. The interactome connection degree (ICD) of a node equates to the IID divided by the total number of seeds in the network and ranges between 1 (nodes able to bridge all the interactomes in the network) and 1/number of seeds (nodes unable to bridge any interactomes in the network). Nodes with IDC ≥ 0.6 are inter-interactomes hubs (IIHs).
Functional enrichment analysis and replication
We performed Gene Ontology (GO)  terms enrichment analyses in g:Profiler (g:GOSt, http://biit.cs.ut.ee/gprofiler/)  (October–November 2016) and replicated in PANTHER (June 2017). Enriched GO-BP terms were grouped into custom-made ‘semantic classes’; similar semantic classes were grouped into hierarchical groups called ‘functional blocks’ (Additional file 1).
Gene prioritization – GWAS
We used thirty-two relevant SNPs as per the PD meta-analysis (form discovery phase and/or joint analysis) . The genes mapping to the GWAS-loci (SNPSNAP, reference EU 1000G; locus definition by linkage disequilibrium (LD) r2 > 0.5) were matched with those encoding proteins contributing to the PD-specific risk-processes highlighted by WPPINA to aid prioritization of genes within the PD-GWAS loci. Results were statistically validated by generating 100,000 random gene-sets of sizes similar to the lists of open reading frames (ORFs) in LD with the PD-GWAS SNPs. P-values were calculated considering the total number of random matches falling above the actual number of experimental matches divided by the total number of trials. Analytic p-values were also calculated by using the hypergeometric distribution. An additional assessment was performed by evaluating the number of annotations for each gene reported in GO (GO annotation service http://amigo.geneontology.org/grebe) to verify whether differences in the annotation sizes might have influenced and driven gene prioritization.
Cell type expression
We evaluated cell specific expression profile for genes prioritized in each PD locus. The individual expression FPKM data were downloaded for human temporal lobe cortex mature astrocytes, neurons, microglia, oligodendrocytes and endothelial cells from Additional file 1 of Zhang et al. .
Data was handled, filtered and scored through in-house R scripts (https://www.r-project.org/) as described before . The final network was visualized through the freely available Cytoscape 2.8.2 software and analyzed through the network analysis plug-in .
The authors would like to acknowledge generous research support from the Michael J. Fox Foundation, Parkinson’s UK, Alzheimer’s Society and the Rosetrees Trust.
This work was supported in part by the Wellcome Trust/MRC Joint Call in Neurodegeneration award (WT089698) to the UK Parkinson’s Disease Consortium (UKPDC) whose members are from the UCL Institute of Neurology, the University of Sheffield and the MRC Protein Phosphorylation Unit at the University of Dundee, MRC Programme grant (to JH and PAL) MR/N026004/1 and by a MRC New Investigator Research Grant (MR/L010933/1) to PAL. RCL is funded by Parkinson’s UK, grant G-1307 and supported by the National Institute for Health Research University College London Hospitals Biomedical Research. RF is supported by funding from Alzheimer’s Society (grant number 284). DAK is supported by an MB PhD Award from the International Journal of Experimental Pathology. JET is supported by a BBSRC CASE studentship with BC Platforms, BB/M017222/1. University College London Hospitals and University College London receive support from the Department of Health's National Institute for Health Research (NIHR) Biomedical Research Centres (BRC). This work was partially funded by the JPND project RiMod-FTD.The funding agencies had no role in study design and data analysis and interpretation.
Availability of data and materials
The datasets generated during and/or analysed during the current study are available in the Additional files.
CM and RF designed, performed the analysis and wrote the paper; PAL designed the analysis and wrote the paper; DAK and JET helped with some of the analysis; JH, NW and RCL supported the design of the work. All authors read and approved the final manuscript.
Ethics approval and consent to participate
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Manzoni C, Kia DA, Vandrovcova J, Hardy J, Wood NW, Lewis PA, Ferrari R. Genome, transcriptome and proteome: the rise of omics data and their integration in biomedical sciences. Brief Bioinform. 2018;19(2):286-302. https://doi.org/10.1093/bib/bbw114.
- Hernandez DG, Reed X, Singleton AB. Genetics in Parkinson disease: Mendelian versus non-Mendelian inheritance. J Neurochem. 2016;139 Suppl 1:59–74.View ArticlePubMedGoogle Scholar
- Bean LJH Stephens K, Amemiya Anne. Adam MP, Molecular Genetics. Editor-in-Chief; Senior Editors: Ardinger HH, Pagon RA, Wallace SE. Genetic Counseling. Seattle (WA): University of Washington, Seattle; 1993-2018. ISSN: 2372-0697.Google Scholar
- Nalls MA, Pankratz N, Lill CM, Do CB, Hernandez DG, Saad M, DeStefano AL, Kara E, Bras J, Sharma M, et al. Large-scale meta-analysis of genome-wide association data identifies six new risk loci for Parkinson's disease. Nat Genet. 2014;46(9):989–93.View ArticlePubMedPubMed CentralGoogle Scholar
- van der Brug MP, Singleton A, Gasser T, Lewis PA. Parkinson’s disease: from human genetics to clinical trials. Sci Transl Med. 2015;7(305):205ps220.View ArticleGoogle Scholar
- Ward LD, Kellis M. Interpreting noncoding genetic variation in complex traits and human disease. Nat Biotechnol. 2012;30(11):1095–106.View ArticlePubMedPubMed CentralGoogle Scholar
- Furlong LI. Human diseases through the lens of network biology. Trends Genet. 2013;29(3):150–9.View ArticlePubMedGoogle Scholar
- Edwards SL, Beesley J, French JD, Dunning AM. Beyond GWASs: illuminating the dark road from association to function. Am J Hum Genet. 2013;93(5):779–97.View ArticlePubMedPubMed CentralGoogle Scholar
- Tasan M, Musso G, Hao T, Vidal M, MacRae CA, Roth FP. Selecting causal genes from genome-wide association studies via functionally coherent subnetworks. Nat Methods. 2015;12(2):154–9.View ArticlePubMedGoogle Scholar
- Visscher PM, Wray NR, Zhang Q, Sklar P, McCarthy MI, Brown MA, Yang J. 10 years of GWAS discovery: biology, function and Translation. Am J Hum Genet. 2017;101(1):5–22.View ArticlePubMedPubMed CentralGoogle Scholar
- Ferrari R, Lovering RC, Hardy J, Lewis PA, Manzoni C. Weighted protein interaction network analysis of Frontotemporal dementia. J Proteome Res. 2017;16(2):999–1013.View ArticlePubMedGoogle Scholar
- Simoes SN, Martins DC Jr, Pereira CA, Hashimoto RF, Brentani H. NERI: network-medicine based integrative approach for disease gene prioritization by relative importance. BMC Bioinformatics. 2015;16(Suppl 19):S9.View ArticlePubMedPubMed CentralGoogle Scholar
- Barabasi AL, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011;12(1):56–68.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhang Y, Sloan SA, Clarke LE, Caneda C, Plaza CA, Blumenthal PD, Vogel H, Steinberg GK, Edwards MS, Li G, et al. Purification and characterization of progenitor and mature human astrocytes reveals transcriptional and functional differences with mouse. Neuron. 2016;89(1):37–53.View ArticlePubMedGoogle Scholar
- de Leeuw CA, Mooij JM, Heskes T, Posthuma D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput Biol. 2015;11(4):e1004219.View ArticlePubMedPubMed CentralGoogle Scholar
- Lamparter D, Marbach D, Rueedi R, Kutalik Z, Bergmann S. Fast and rigorous computation of gene and Pathway scores from SNP-based summary statistics. PLoS Comput Biol. 2016;12(1):e1004714.View ArticlePubMedPubMed CentralGoogle Scholar
- Gamazon ER, Wheeler HE, Shah KP, Mozaffari SV, Aquino-Michaels K, Carroll RJ, Eyler AE, Denny JC, Consortium GT, Nicolae DL, et al. A gene-based association method for mapping traits using reference transcriptome data. Nat Genet. 2015;47(9):1091–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BW, Jansen R, de Geus EJ, Boomsma DI, Wright FA, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet. 2016;48(3):245–52.View ArticlePubMedPubMed CentralGoogle Scholar
- Marigorta UM, Denson LA, Hyams JS, Mondal K, Prince J, Walters TD, Griffiths A, Noe JD, Crandall WV, Rosh JR, et al. Transcriptional risk scores link GWAS to eQTLs and predict complications in Crohn's disease. Nat Genet. 2017;49(10):1517–21.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhu Z, Zhang F, Hu H, Bakshi A, Robinson MR, Powell JE, Montgomery GW, Goddard ME, Wray NR, Visscher PM, et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet. 2016;48(5):481–7.View ArticlePubMedGoogle Scholar
- Wang L, Matsushita T, Madireddy L, Mousavi P, Baranzini SE. PINBPA: cytoscape app for network analysis of GWAS data. Bioinformatics. 2015;31(2):262–4.View ArticlePubMedGoogle Scholar
- Hasin Y, Seldin M, Lusis A. Multi-omics approaches to disease. Genome Biol. 2017;18(1):83.View ArticlePubMedPubMed CentralGoogle Scholar
- Hormozdiari F, Kostem E, Kang EY, Pasaniuc B, Eskin E. Identifying causal variants at loci with multiple signals of association. Genetics. 2014;198(2):497–508.View ArticlePubMedPubMed CentralGoogle Scholar
- Hrdlickova B, de Almeida RC, Borek Z, Withoff S. Genetic variation in the non-coding genome: involvement of micro-RNAs and long non-coding RNAs in disease. Biochim Biophys Acta. 2014;1842(10):1910–22.View ArticlePubMedGoogle Scholar
- Beilina A, Rudenko IN, Kaganovich A, Civiero L, Chau H, Kalia SK, Kalia LV, Lobbestael E, Chia R, Ndukwe K, et al. Unbiased screen for interactors of leucine-rich repeat kinase 2 supports a common pathway for sporadic and familial Parkinson disease. Proc Natl Acad Sci U S A. 2014;111(7):2626–31.View ArticlePubMedPubMed CentralGoogle Scholar
- Jinn S, Drolet RE, Cramer PE, Wong AH, Toolan DM, Gretzula CA, Voleti B, Vassileva G, Disa J, Tadin-Strapps M, et al. TMEM175 deficiency impairs lysosomal and mitochondrial function and increases alpha-synuclein aggregation. Proc Natl Acad Sci U S A. 2017;114(9):2389–94.View ArticlePubMedPubMed CentralGoogle Scholar
- Orchard S, Ammari M, Aranda B, Breuza L, Briganti L, Broackes-Carter F, Campbell NH, Chavali G, Chen C, del-Toro N, et al. The MIntAct project--IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res. 2014;42(Database issue):D358–63.View ArticlePubMedGoogle Scholar
- Chatr-Aryamontri A, Oughtred R, Boucher L, Rust J, Chang C, Kolas NK, O'Donnell L, Oster S, Theesfeld C, Sellam A, et al. The BioGRID interaction database: 2017 update. Nucleic Acids Res. 2017;45(D1):D369–79.View ArticlePubMedGoogle Scholar
- Breuer K, Foroushani AK, Laird MR, Chen C, Sribnaia A, Lo R, Winsor GL, Hancock RE, Brinkman FS, Lynn DJ. InnateDB: systems biology of innate immunity and beyond--recent updates and continuing curation. Nucleic Acids Res. 2013;41(Database issue):D1228–33.View ArticlePubMedGoogle Scholar
- Chatr-aryamontri A, Ceol A, Palazzi LM, Nardelli G, Schneider MV, Castagnoli L, Cesareni G. MINT: the molecular INTeraction database. Nucleic Acids Res. 2007;35(Database):D572–4.View ArticlePubMedGoogle Scholar
- Orchard S, Kerrien S, Abbani S, Aranda B, Bhate J, Bidwell S, Bridge A, Briganti L, Brinkman FS, Cesareni G, et al. Protein interaction data curation: the international molecular exchange (IMEx) consortium. Nat Methods. 2012;9(4):345–50.View ArticlePubMedPubMed CentralGoogle Scholar
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25(1):25–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Reimand J, Arak T, Adler P, Kolberg L, Reisberg S, Peterson H, Vilo J. g:Profiler-a web server for functional interpretation of gene lists (2016 update). Nucleic Acids Res. 2016;44(W1):W83–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504.View ArticlePubMedPubMed CentralGoogle Scholar