Volume 14 Supplement 3
Multiscale modeling of the causal functional roles of nsSNPs in a genome-wide association study: application to hypoxia
© Xie et al.; licensee BioMed Central Ltd. 2013
Published: 28 May 2013
It is a great challenge of modern biology to determine the functional roles of non-synonymous Single Nucleotide Polymorphisms (nsSNPs) on complex phenotypes. Statistical and machine learning techniques establish correlations between genotype and phenotype, but may fail to infer the biologically relevant mechanisms. The emerging paradigm of Network-based Association Studies aims to address this limitation of statistical analysis. However, a mechanistic understanding of how individual molecular components work together in a system requires knowledge of molecular structures and their interactions.
To address the challenge of understanding the genetic, molecular, and cellular basis of complex phenotypes, we have, for the first time, developed a structural systems biology approach for genome-wide multiscale modeling of nsSNPs, from the atomic details of molecular interactions to the emergent properties of biological networks. We apply our approach to determine the functional roles of nsSNPs associated with hypoxia tolerance in Drosophila melanogaster. The integrated view of the functional roles of nsSNPs at both molecular and network levels allows us to identify driver mutations and their interactions (epistasis) in the H, Rad51D, Ulp1, Wnt5, HDAC4, Sol, Dys, GalNAc-T2, and CG33714 genes, all of which are involved in the up-regulation of Notch and Gurken/EGFR signaling pathways. Moreover, we find that a large fraction of the driver mutations are neither located in conserved functional sites nor responsible for structural stability, but rather regulate protein activity through allosteric transitions, protein-protein interactions, or protein-nucleic acid interactions. This finding should impact future Genome-Wide Association Studies.
Our studies demonstrate that the consolidation of statistical, structural, and network views of biomolecules and their interactions can provide new insight into the functional role of nsSNPs in Genome-Wide Association Studies, in a way that neither the knowledge of molecular structures nor biological networks alone could achieve. Thus, multiscale modeling of nsSNPs may prove to be a powerful tool for establishing the functional roles of sequence variants in a wide array of applications.
Recent advances in next-generation sequencing have generated abundant genetic variant and "omics" data. Together, these extremely large, multidimensional datasets present an exciting opportunity to identify genes and to predict pathways likely to be involved in diseases and traits. However, these complex data sources, together with the broad spectrum of phenotypes, complicate the quest to uncover the genetic, molecular, and cellular mechanisms that underlie phenotypes [1–3]. A major challenge in deciphering the genetic basis of multigenic diseases or traits is to distinguish driver mutations that impact the survival or reproduction of a particular phenotype (e.g., cancer) from passengers that do not confer a selective advantage. Standard genome sequence analysis cannot detect all driver mutations because of difficulties in estimating the background mutation rate and the underlying genetic heterogeneity of adaptive phenotypes [4, 5]. Statistical machine learning techniques (e.g., SNAP) provide an alternative approach by learning from annotated mutation data. However, the "black-box" nature of machine learning makes it difficult to interpret the novel functional roles of mutations. In parallel with the development of new genotyping and phenotyping techniques, a number of novel computational tools have been developed to integrate and analyze genetic and omics data with the aim of establishing statistical causal relationships between genetic markers, genome-wide molecular signatures, and organismal phenotypes [7–13]. For example, co-expression and Bayesian network models derived from DNA variation and genome-wide transcriptional profiles have been applied to identify causal disease genes, cancer drivers [10, 15], and master regulators of cancer [16–18].
Although great efforts have been made to address the n << p problem, where the number of observations n (e.g., gene expression measurements in different conditions) is much smaller than the number of variables or parameters p (e.g., all measured genes), the power of these statistics-based techniques is still limited when sample sizes are small. Moreover, a complex phenotype is often associated with interactions among multiple causal genes (epistasis), none of which alone is sufficient to drive phenotypic change. It is challenging for statistical methods to identify epistasis given the large number of possible interactions. Fundamentally, the "causal" relationships inferred from these methods are mathematical correlations. They may not provide biological insight into the underlying molecular and cellular mechanisms that associate genotypes with phenotypes.
To demonstrate the feasibility of our approach, we apply multiscale modeling to reveal the genetic, molecular, and cellular basis of hypoxia, a physiological condition in which the cell is deprived of an adequate oxygen supply. The hypoxia-induced phenotype has been related to multiple pathological conditions including cancer. Cells, tissues, and organisms have developed different strategies to survive low oxygen levels; however, the underlying molecular mechanisms contributing to hypoxia tolerance remain unclear. With the long-term goal of rendering mammalian cells and tissues resistant to a low-O2 environment, Drosophila melanogaster (D. melanogaster) has been used as a model system to investigate the mechanisms underlying hypoxia tolerance. Through long-term laboratory selection, Zhou et al. have generated D. melanogaster populations that tolerate severe, normally lethal, levels of hypoxia. Microarray analysis identified several adaptive changes in the hypoxia-selected flies. Comparison between the genome sequences of hypoxia-selected flies and those of controls identified 107 amino acid mutations in 52 genes. These data provide us with an unparalleled opportunity to understand the genetic, molecular, and cellular basis of the hypoxia tolerance phenotype and to develop new computational tools to establish causal genotype-phenotype associations, which can be validated through controlled experiments. Note that gene expression profiles were measured for only one condition in the hypoxia tolerance phenotype; hence conventional co-expression approaches are not applicable to this study. Although the hypotheses generated from this study have been experimentally validated by us and are consistent with experimental results from others, the sensitivity and specificity of the method have not been fully evaluated.
In the future, we will extensively test our method using large case-control datasets from public databases such as the NCBI database of Genotypes and Phenotypes (dbGaP) and the Wellcome Trust Case Control Consortium (WTCCC).
Knowledge-driven network inference of driver mutations responsible for hypoxia tolerance
Table 1. Predicted driver mutations and core pathways for hypoxia tolerance in Drosophila melanogaster from multiple evidences.
Columns: Mutated Gene (Annotation Symbol); FDR Corrected p-value for the overrepresentation of signaling pathways; Shortest-path Distance (z-score) up/down; Functional role of nsSNP inferred from structural modeling; Expected accuracy (%) of non-neutral mutation from SNAP; Human ortholog and hypoxia association.
Surviving cell entries (row assignment not recoverable from this text): Possible DNA binding; histone deacetylase 4; AR of catalytic activity; calcium-dependent cysteine-type endopeptidase; AR of substrate binding; AR of substrate binding.
Structural analysis of functional roles of nsSNPs
Structural modeling of nsSNPs
Structural roles of putative driver mutations
Machine learning based prediction of non-neutral nsSNPs
The functional importance of the nsSNPs is further supported by SNAP, software that predicts whether a given nsSNP is neutral or non-neutral and reports an expected accuracy for the prediction. In a benchmark study, SNAP outperformed most similar methods. Of the 107 nsSNPs, 23, located in 18 genes, are predicted to be non-neutral with an expected accuracy higher than 58% (SNAP reliability index 0) (Additional File 1 Table S4). Five predicted non-neutral mutations are hypothesized to be putative drivers. Two of them (in H and CG33714) have an expected accuracy of over 80%. The remaining predictions have lower expected accuracies. This could imply that while the functional impact of each individual mutation is limited, collectively they may mediate the signaling pathway activity through epistasis.
Several mutations in CG31220 (Additional File 1 Table S4), a serine-type peptidase, are predicted to be non-neutral by SNAP. These mutations map to the substrate binding sites or other functionally important regions of the structure (Additional File 1 Figure S1). However, no enriched biological pathways associated with this gene were detected. More studies are required to understand how these non-neutral mutations impact the biological network.
Experimental and literature support
As discussed above, a complex phenotype arises from re-regulated biological pathways that themselves result from the collective effects of multiple genetic mutations (epistasis). Since the down- or up-regulation of core pathways directly impacts the organismal phenotype, experimental validation of a core pathway provides strong evidence for the predicted driver mutations that are responsible for its re-regulation. Indeed, we have experimentally validated that Notch signaling is the core pathway of hypoxia tolerance in D. melanogaster. Reducing the activation of Notch signaling with a specific γ-secretase inhibitor significantly reduces the survival and life-span of hypoxia-tolerant D. melanogaster strains. The critical role of Notch signaling in hypoxia tolerance is further supported by UAS-Gal4 over-expression and RNAi knockdown of genes involved in Notch signaling. Other experimental evidence from the literature, as detailed below, also supports our predictions. The top-ranked H gene (also called hairless) is a well-known regulator of Notch signaling in D. melanogaster. Dys encodes the protein dystrophin. Genetic interaction screens in D. melanogaster have shown that Dys is involved in interactions with components of the Notch signaling pathway. Furthermore, mutation of the Dys homolog in the mouse model is related to the up-regulation of the Notch-beta pathway. For other genes, although little direct experimental evidence supports an association with hypoxia in D. melanogaster, their functional roles in hypoxia have been demonstrated in cancer and other human diseases. HDAC4 regulates hypoxia-inducible factor 1α (HIF1α) and the cancer cell response to hypoxia. GalNAc-T2 is an N-acetylgalactosaminyl transferase that catalyzes the synthesis of glycosphingolipid (GSL). A recent study has shown that GSL may directly regulate the activity of Notch signaling.
Wnt5 is a ligand for a family of frizzled receptors, acting as a regulator of Wnt signaling. An increasing body of evidence suggests that Wnt and Notch signaling cooperatively determine cell fate during development in humans [36–42]. The association between Rad51D and hypoxia has been demonstrated in cancer. Ulp1 is a SUMO-specific protease that is essential for the stabilization of HIF1α during hypoxia by removing SUMO, and it participates in the regulation of hypoxia-responsive genes.
The important functional role of allosteric regulation, protein-protein interactions, and protein-nucleic acid interactions in sequence variants
In this study, none of the driver mutations associated with hypoxia are conserved functional site residues, nor are they responsible for structural stability. The driver mutations are hypothesized to be involved in protein-protein interactions (in the case of Rad51D), protein-nucleic acid interactions (e.g., in CG33714), or allosteric regulation (e.g., in HDAC4). A recent survey of the structural basis of in-frame mutations in protein-protein interactions has suggested that changes in specific interactions play a critical role in pathogenesis. From a network point of view, the modification of protein-protein interactions, rather than of the proteins themselves, may have a significant impact on network properties. Recent progress in the ENCODE and modENCODE projects highlights the critical functional roles of non-coding DNAs in the regulation of biological processes [46, 47]. As a large number of non-coding DNAs perform their functions through specific protein-nucleic acid interactions, mutations that impact protein-nucleic acid binding could be directly associated with phenotype changes. The dysregulation of allosteric interactions is considered to be another major determinant of disease. During evolution, organisms need to survive and reproduce in a changed environment. As such, certain genes need to gain functions and activate critical pathways. Allosteric regulation is an efficient way for driver mutations to act, since the change of activity is not constrained to a single molecule but can be propagated to a whole network. New computational methods that are able to identify "hot spots" in protein-protein interactions, protein-nucleic acid recognition, and allosteric regulation, in which a mutation may cause the dysregulation of biological pathways, may have a significant impact on the interpretation of Genome-Wide Association Studies.
The relevance of D. melanogaster driver mutations to human hypoxia adaptation
Recently, several studies of hypoxia adaptation in humans have been performed on Tibetans [49, 50], Andeans, and Ethiopians. However, all human studies to date have adopted limited, sampling-based approaches, such as genotyping or exome sequencing. The relatively sparse sampling of the genome makes it harder to identify large-scale shifts in the allele frequency spectrum associated with natural selection. Consequently, these studies restricted subsequent analysis to variants in candidate genes that are mainly involved in the canonical hypoxia response (HIF pathway) and related pathways. Identifying the functional roles of sequence variants in human orthologs of Drosophila genes may provide critical insight for prioritizing candidate genes in humans, a task at which conventional statistical techniques may fail. Indeed, the majority of the genes carrying driver mutations identified in this study have human orthologs that are associated with the hypoxia cellular phenotype, as shown in Table 1.
Based upon multiscale modeling, we propose that the up-regulation of Notch and Gurken/EGFR and the down-regulation of Toll and Torso/RTK pathways are responsible for hypoxia tolerance. Using integrated structural and network analysis, we hypothesize that nsSNPs in H, Rad51D, Ulp1, Sol, Wnt5, CG33714, GalNAc-T2, Dys, and HDAC4, may all lead to the functional modification of these genes via allosteric regulation and protein-protein/DNA/RNA interactions and hence are driver mutations defining the hypoxia tolerance phenotype. Our predictions are supported by experimental evidence [23, 26]. Moreover, multiscale modeling may identify potential epistasis using a very small sample size. This reduces the burden imposed during statistical multiple testing of large epistasis models. It is anticipated that the further extension of this multiscale modeling approach to genome-wide protein-protein interactions, protein-nucleic acid interactions, and microRNA data will provide a powerful tool for uncovering the functional roles of both coding and non-coding sequence variations in GWAS; a role which neither the knowledge of molecular structures nor of biological networks alone can achieve. However, challenges remain in extending multiscale modeling approaches. New algorithms are required to predict emergent properties, at both molecular and network levels, as well as to seamlessly model information flow across scales.
Prediction of non-neutral mutations on nsSNPs from sequence
A sequence-information-based method, SNAP, is used to classify nsSNPs as non-neutral (functional effect) or neutral (no functional effect).
Knowledge-driven network inference of core pathways and driver mutations
The network-based analysis of driver mutations is shown in Figure 2. The mutated genes and differentially regulated genes are mapped to a protein-protein interaction (PPI) network extracted from the STRING database for D. melanogaster. A subnetwork that connects a mutated gene with up- and down-regulated genes is identified using a shortest path search of the PPI network. The genes identified in each subnetwork are subject to Gene Set Enrichment Analysis (GSEA). If the genes in the subnetwork are enriched in essential biological processes or pathways, the mutated gene is a potential driver.
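The subnetwork search described above can be illustrated with a minimal sketch. It assumes an unweighted, undirected PPI graph stored as an adjacency dictionary and uses breadth-first search; the function names (`shortest_path`, `mutation_seeded_subnetwork`) and the toy gene labels are hypothetical and are not taken from the study, which does not specify its implementation.

```python
from collections import deque


def shortest_path(graph, source, target):
    """Return one shortest path (list of nodes) found by breadth-first search, or None."""
    if source not in graph or target not in graph:
        return None
    parents = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            # Walk back through the parent links to rebuild the path.
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for neighbor in graph.get(node, ()):
            if neighbor not in parents:
                parents[neighbor] = node
                queue.append(neighbor)
    return None


def mutation_seeded_subnetwork(graph, mutated_gene, regulated_genes):
    """Union of shortest paths from one mutated gene to each reachable regulated gene."""
    nodes = set()
    lengths = {}
    for gene in regulated_genes:
        path = shortest_path(graph, mutated_gene, gene)
        if path is not None:
            nodes.update(path)
            lengths[gene] = len(path) - 1  # number of edges on the path
    return nodes, lengths


# Toy network with made-up labels: "M" is the mutated gene,
# "U1" and "D1" stand for up- and down-regulated genes, "X" is disconnected.
toy_ppi = {
    "M": ["A", "B"],
    "A": ["M", "U1"],
    "B": ["M", "C"],
    "C": ["B", "D1"],
    "U1": ["A"],
    "D1": ["C"],
    "X": [],
}
subnet_nodes, path_lengths = mutation_seeded_subnetwork(toy_ppi, "M", ["U1", "D1", "X"])
```

The returned node set would then be passed to the enrichment step, and the path lengths to the comparison against random background networks.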
Analysis of differentially expressed genes
A cDNA microarray analysis of 13,061 known or predicted genes from the D. melanogaster genome is performed in R. K-nearest neighbors in the space of genes is used to impute missing expression values. The LOWESS normalization method is used to normalize the raw density data. P-values and fold changes are calculated using the two-sided, two-class t-test. A Bonferroni-Holm false discovery rate (FDR) controlling procedure [58, 59] is used to adjust the P-values. Genes are considered to be differentially expressed between the two samples when the FDR is smaller than 0.05. Among these, genes with a fold change larger than 1.5 (up-regulated) or smaller than 0.67 (down-regulated) are considered significantly differentially expressed.
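As an illustration of the thresholds above, the following sketch applies Holm's step-down adjustment to a list of P-values and then the 0.05 and 1.5/0.67 fold-change cutoffs. It is a simplified stand-in, not the R workflow used in the study: the t-test itself is omitted, the function names are hypothetical, and the input values are invented. Holm's procedure is, strictly speaking, a family-wise error rate adjustment, although the text reports the threshold as an FDR.

```python
def holm_adjust(p_values):
    """Holm step-down adjusted P-values, returned in the original input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        # The k-th smallest P-value is multiplied by (m - k + 1), capped at 1.
        candidate = min(1.0, (m - rank) * p_values[idx])
        # Enforce monotonicity of the adjusted values.
        running_max = max(running_max, candidate)
        adjusted[idx] = running_max
    return adjusted


def classify_genes(genes, p_values, fold_changes, alpha=0.05, up=1.5, down=0.67):
    """Label each gene 'up', 'down', or 'unchanged' using the thresholds in the text."""
    adjusted = holm_adjust(p_values)
    calls = {}
    for gene, p_adj, fc in zip(genes, adjusted, fold_changes):
        if p_adj >= alpha:
            calls[gene] = "unchanged"
        elif fc > up:
            calls[gene] = "up"
        elif fc < down:
            calls[gene] = "down"
        else:
            calls[gene] = "unchanged"
    return calls


# Invented example values, for illustration only.
example_genes = ["g1", "g2", "g3", "g4"]
example_p = [0.001, 0.01, 0.04, 0.5]
example_fc = [2.0, 0.5, 3.0, 1.0]
example_adjusted = holm_adjust(example_p)
example_calls = classify_genes(example_genes, example_p, example_fc)
```

In this toy input, the third gene has a large fold change but does not pass the adjusted significance cutoff, so it is not called differentially expressed.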
Subnetwork construction by shortest path search
Where s2 is the unbiased estimator of the variance of the sample and n is the number of participants.
Here the t-value is used to measure the difference between the identified subnetwork (x2) and a background random network (x1). Background random networks are built by randomly selecting one gene as a source node and a set of other genes as destination nodes. A positive t-value means a shorter than average path. The mutations on the genes with statistically significant high t-values are prioritized as driver mutations.
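The formula for the t-value is not reproduced in this text. A form consistent with the surrounding definitions (sample means x1 and x2, unbiased variance estimators s2, and sample sizes n) is the unequal-variance two-sample statistic, t = (x1 - x2) / sqrt(s1^2/n1 + s2^2/n2). The sketch below assumes that form; the function name and the path-length values are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def path_length_t_value(background_lengths, subnetwork_lengths):
    """Unequal-variance two-sample t statistic, background (x1) minus subnetwork (x2).

    A positive value means the subnetwork paths are shorter than the background
    average. Each sample needs at least two values, and the two sample variances
    must not both be zero.
    """
    n1, n2 = len(background_lengths), len(subnetwork_lengths)
    x1, x2 = mean(background_lengths), mean(subnetwork_lengths)
    # statistics.variance is the unbiased (n - 1) estimator of the variance.
    s1_sq, s2_sq = variance(background_lengths), variance(subnetwork_lengths)
    return (x1 - x2) / sqrt(s1_sq / n1 + s2_sq / n2)


# Invented shortest-path lengths, for illustration only.
background = [4, 5, 6, 5]
subnetwork = [2, 3, 2, 3]
t_value = path_length_t_value(background, subnetwork)
```

With this sign convention, the example subnetwork, whose paths are shorter than the background paths, yields a positive t-value, matching the interpretation given in the text.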
Gene set overrepresentation analysis to identify driver biological pathways and mutations
The Biological Networks Gene Ontology Tool (BiNGO) is applied in Cytoscape's versatile visualization environment to determine which biological processes and molecular functions are significantly overrepresented in the set of genes involved in each subnetwork. Gene Ontology terms are ranked according to the False Discovery Rate (FDR) corrected p-values for each subnetwork. The statistically significantly enriched biological pathways (p-value < 0.05) are considered potential core pathways that contribute to the survival or reproduction of a phenotype. Each such pathway is subject to further validation by experiments and literature searches. If a subnetwork contains the validated core pathway, the mutated gene in this subnetwork is hypothesized to be a causal gene. Correspondingly, the mutations in this gene are candidate driver mutations.
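Overrepresentation of a Gene Ontology term in a gene set is commonly scored with a one-sided hypergeometric test, which BiNGO offers among its options; the text does not state which BiNGO settings were used, so the sketch below is only an illustration of the general idea. The function name and the counts in the example are hypothetical, and the multiple-testing correction is left out.

```python
from math import comb


def hypergeom_enrichment_p(population_size, annotated_in_population,
                           sample_size, annotated_in_sample):
    """Upper-tail probability P(X >= annotated_in_sample) under the hypergeometric model.

    population_size: number of genes in the reference set
    annotated_in_population: reference genes carrying the term
    sample_size: number of genes in the subnetwork
    annotated_in_sample: subnetwork genes carrying the term
    """
    upper = min(sample_size, annotated_in_population)
    total = comb(population_size, sample_size)
    tail = sum(
        comb(annotated_in_population, k)
        * comb(population_size - annotated_in_population, sample_size - k)
        for k in range(annotated_in_sample, upper + 1)
    )
    return tail / total


# Invented counts: 20 reference genes, 5 annotated with the term,
# a subnetwork of 4 genes of which 2 carry the term.
p_enrichment = hypergeom_enrichment_p(20, 5, 4, 2)
```

In practice the raw p-values for all tested terms would be corrected for multiple testing before applying the 0.05 cutoff mentioned above.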
Structure-based analysis of driver mutations
Homology modeling and nsSNP mapping
Homology models of proteins are built using Modeller . Sequence alignments between these proteins and templates of known structures are obtained from a PSI-BLAST sequence search . The functional sites are predicted using SMAP [67–69]. Mutated residues are mapped onto the model structures and the functional roles of these residues are predicted according to their locations on the model structures.
Covariance analysis based on multiple sequence alignments of proteins in the same Pfam family as the mutated protein can help identify remote relationships between mutated residues and other residues within the protein sequence. The Pfam family is identified by a whole-sequence search. Redundancy among sequences in the Pfam family is removed using CD-hit with a sequence identity threshold of 90%. Multiple sequence alignments of these sequences are built using the MUSCLE software with default parameters. Covariance of the mutated positions with other residues is calculated using five different methods: Statistical Coupling Analysis (SCA); Explicit Likelihood of Subset Co-variation (ELSC); Observed Minus Expected Squared covariance algorithm (OMES); Mutual Information Covariance Algorithm (MI); and Conservation Algorithm (ConservationSum). Residues that are predicted to be coupled with a mutated residue by at least two methods are considered to have co-evolved with it.
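To make the covariance step concrete, the sketch below computes plain mutual information between two alignment columns (one of the five measures named above, in its simplest form) and applies the "at least two methods" consensus rule. It is an illustrative simplification: gap handling, sequence weighting, and background corrections used by published implementations are omitted, and the function names, toy alignment, and residue numbers are hypothetical.

```python
from collections import Counter
from math import log2


def column_mutual_information(alignment, i, j):
    """Mutual information (in bits) between columns i and j of an aligned sequence list."""
    pairs = [(seq[i], seq[j]) for seq in alignment]
    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)
    right = Counter(b for _, b in pairs)
    mi = 0.0
    for (a, b), count in joint.items():
        p_ab = count / n
        # p_ab / (p_a * p_b), written with raw counts to limit rounding error.
        mi += p_ab * log2(p_ab * n * n / (left[a] * right[b]))
    return mi


def consensus_coupled(method_hits, min_methods=2):
    """Residues reported as coupled by at least `min_methods` of the methods."""
    votes = Counter()
    for residues in method_hits.values():
        votes.update(set(residues))
    return {res for res, n_votes in votes.items() if n_votes >= min_methods}


# Toy two-column alignments: perfectly coupled and fully independent columns.
coupled_alignment = ["AK", "AK", "GE", "GE"]
independent_alignment = ["AK", "AE", "GK", "GE"]
mi_coupled = column_mutual_information(coupled_alignment, 0, 1)
mi_independent = column_mutual_information(independent_alignment, 0, 1)

# Invented per-method predictions of positions coupled to one mutated residue.
hits = {"SCA": [10, 25], "ELSC": [25, 40], "OMES": [40], "MI": [25], "ConservationSum": []}
co_evolved = consensus_coupled(hits)
```

Requiring agreement between at least two scoring schemes is a simple way to reduce the false positives that any single covariance measure produces on small or biased alignments.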
Availability of supporting data
The data sets supporting the results of this article are included within the article.
List of abbreviations
EGFR: Epidermal Growth Factor Receptor
RTK: Receptor Tyrosine Kinase
Mutation Seeded Subnetwork
This work was supported by National Institutes of Health Grants GM63208, CUNY High Performance Computing Center, CUNY Research Foundation, and Hunter President Fund. We thank reviewers for their constructive comments.
The publication costs for this article were funded by the CUNY Research Foundation.
This article has been published as part of BMC Genomics Volume 14 Supplement 3, 2013: SNP-SIG 2012: Identification and annotation of SNPs in the context of structure, function, and disease. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcgenomics/supplements/14/S3
- Kraft P, Hunter DJ: Genetic risk prediction--are we there yet? N Engl J Med. 2009, 360: 1701-1703. 10.1056/NEJMp0810107.
- An G: Closing the scientific loop: bridging correlation and causality in the petaflop age. Sci Transl Med. 2010, 2: 41ps34. 10.1126/scitranslmed.3000390.
- Cherkassky V, Ma Y: Another look at statistical learning theory and regularization. Neural Netw. 2009, 22: 958-969. 10.1016/j.neunet.2009.04.005.
- Eifert C, Powers RS: From cancer genomes to oncogenic drivers, tumour dependencies and therapeutic targets. Nat Rev Cancer. 2012, 12: 572-578. 10.1038/nrc3299.
- Vandin F, Upfal E, Raphael BJ: Finding Driver Pathways in Cancer: Models and Algorithms. Algorithms Mol Biol. 2012, 7: 23. 10.1186/1748-7188-7-23.
- Bromberg Y, Rost B: SNAP: predict effect of non-synonymous polymorphisms on function. Nucleic Acids Res. 2007, 35: 3823-3835. 10.1093/nar/gkm238.
- Califano A, Butte AJ, Friend S, Ideker T, Schadt E: Leveraging models of cell regulation and GWAS data in integrative network-based association studies. Nat Genet. 2012, 44: 841-847. 10.1038/ng.2355.
- Li Y, Tesson BM, Churchill GA, Jansen RC: Critical reasoning on causal inference in genome-wide linkage and association studies. Trends Genet. 2010, 26: 493-498. 10.1016/j.tig.2010.09.002.
- Chen BJ, Causton HC, Mancenido D, Goddard NL, Perlstein EO, Pe'er D: Harnessing gene expression to identify the genetic basis of drug resistance. Mol Syst Biol. 2009, 5: 310.
- Akavia UD, Litvin O, Kim J, Sanchez-Garcia F, Kotliar D, Causton HC, Pochanard P, Mozes E, Garraway LA, Pe'er D: An integrated approach to uncover drivers of cancer. Cell. 2010, 143: 1005-1017. 10.1016/j.cell.2010.11.013.
- Chuang HY, Lee E, Liu YT, Lee D, Ideker T: Network-based classification of breast cancer metastasis. Mol Syst Biol. 2007, 3: 140.
- Burkard TR, Rix U, Breitwieser FP, Superti-Furga G, Colinge J: A computational approach to analyze the mechanism of action of the kinase inhibitor bafetinib. PLoS Comput Biol. 2010, 6: e1001001. 10.1371/journal.pcbi.1001001.
- Fliri AF, Loging WT, Volkmann RA: Drug effects viewed from a signal transduction network perspective. J Med Chem. 2009, 52: 8038-8046. 10.1021/jm901001p.
- Schadt EE, Lamb J, Yang X, Zhu J, Edwards S, Guhathakurta D, Sieberts SK, Monks S, Reitman M, Zhang C et al: An integrative genomics approach to infer causal associations between gene expression and disease. Nat Genet. 2005, 37: 710-717. 10.1038/ng1589.
- Torkamani A, Schork NJ: Identification of rare cancer driver mutations by network reconstruction. Genome Res. 2009, 19: 1570-1578. 10.1101/gr.092833.109.
- Bansal M, Califano A: Genome-wide dissection of posttranscriptional and posttranslational interactions. Methods Mol Biol. 2012, 786: 131-149. 10.1007/978-1-61779-292-2_8.
- Sumazin P, Yang X, Chiu HS, Chung WJ, Iyer A, Llobet-Navas D, Rajbhandari P, Bansal M, Guarnieri P, Silva J, Califano A: An extensive microRNA-mediated network of RNA-RNA interactions regulates established oncogenic pathways in glioblastoma. Cell. 2011, 147: 370-381. 10.1016/j.cell.2011.09.041.
- Carro MS, Lim WK, Alvarez MJ, Bollo RJ, Zhao X, Snyder EY, Sulman EP, Anne SL, Doetsch F, Colman H et al: The transcriptional network for mesenchymal transformation of brain tumours. Nature. 2010, 463: 318-325. 10.1038/nature08712.
- Nussinov R, Tsai CJ, Csermely P: Allo-network drugs: harnessing allostery in cellular networks. Trends Pharmacol Sci. 2011, 32: 686-693. 10.1016/j.tips.2011.08.004.
- Blois MS: Information and Medicine: The Nature of Medical Descriptions. 1984, University of California Press.
- Hsu PP, Sabatini DM: Cancer cell metabolism: Warburg and beyond. Cell. 2008, 134: 703-707. 10.1016/j.cell.2008.08.021.
- Zhou D, Xue J, Lai JC, Schork NJ, White KP, Haddad GG: Mechanisms underlying hypoxia tolerance in Drosophila melanogaster: hairy as a metabolic switch. PLoS Genet. 2008, 4: e1000221. 10.1371/journal.pgen.1000221.
- Zhou D, Udpa N, Gersten M, Visk DW, Bashir A, Xue J, Frazer KA, Posakony JW, Subramaniam S, Bafna V, Haddad GG: Experimental selection of hypoxia-tolerant Drosophila melanogaster. Proc Natl Acad Sci USA. 2011, 108: 2349-2354. 10.1073/pnas.1010643108.
- Mailman MD, Feolo M, Jin Y, Kimura M, Tryka K, Bagoutdinov R, Hao L, Kiang A, Paschall J, Phan L et al: The NCBI dbGaP database of genotypes and phenotypes. Nat Genet. 2007, 39: 1181-1186.
- Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007, 447: 661-678. 10.1038/nature05911.
- Azad P, Zhou D, Zarndt R, Haddad GG: Identification of genes underlying hypoxia tolerance in Drosophila by a P-element screen. G3 (Bethesda). 2012, 2: 1169-1178.
- Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res. 2000, 28: 235-242. 10.1093/nar/28.1.235.
- Bottomley MJ, Lo Surdo P, Di Giovine P, Cirillo A, Scarpelli R, Ferrigno F, Jones P, Neddermann P, De Francesco R, Steinkuhler C et al: Structural and functional analysis of the human HDAC4 catalytic domain reveals a regulatory structural zinc-binding domain. J Biol Chem. 2008, 283: 26694-26704. 10.1074/jbc.M803514200.
- Fischle W, Dequiedt F, Fillion M, Hendzel MJ, Voelter W, Verdin E: Human HDAC7 histone deacetylase activity is associated with HDAC3 in vivo. J Biol Chem. 2001, 276: 35826-35835. 10.1074/jbc.M104935200.
- Bristow RG, Hill RP: Hypoxia and metabolism. Hypoxia, DNA repair and genetic instability. Nat Rev Cancer. 2008, 8: 180-192. 10.1038/nrc2344.
- Maier D: Hairless: the ignored antagonist of the Notch signalling pathway. Hereditas. 2006, 143: 212-221. 10.1111/j.2007.0018-0661.01971.x.
- Kucherenko MM, Pantoja M, Yatsenko AS, Shcherbata HR, Fischer KA, Maksymiv DV, Chernyk YI, Ruohola-Baker H: Genetic modifier screens reveal new components that interact with the Drosophila dystroglycan-dystrophin complex. PLoS One. 2008, 3: e2418. 10.1371/journal.pone.0002418.
- Turk R, Sterrenburg E, de Meijer EJ, van Ommen GJ, den Dunnen JT, t Hoen PA: Muscle regeneration in dystrophin-deficient mdx mice studied by gene expression profiling. BMC Genomics. 2005, 6: 98. 10.1186/1471-2164-6-98.
- Geng H, Harvey CT, Pittsenbarger J, Liu Q, Beer TM, Xue C, Qian DZ: HDAC4 protein regulates HIF1alpha protein lysine acetylation and cancer cell response to hypoxia. J Biol Chem. 2011, 286: 38095-38102. 10.1074/jbc.M111.257055.
- Hamel S, Fantini J, Schweisguth F: Notch ligand activity is modulated by glycosphingolipid membrane composition in Drosophila melanogaster. J Cell Biol. 2010, 188: 581-594. 10.1083/jcb.200907116.
- Hayward P, Kalmar T, Arias AM: Wnt/Notch signalling and information processing during development. Development. 2008, 135: 411-424. 10.1242/dev.000505.
- Fre S, Pallavi SK, Huyghe M, Lae M, Janssen KP, Robine S, Artavanis-Tsakonas S, Louvard D: Notch and Wnt signals cooperatively control cell proliferation and tumorigenesis in the intestine. Proc Natl Acad Sci USA. 2009, 106: 6309-6314. 10.1073/pnas.0900427106.
- Boulter L, Govaere O, Bird TG, Radulescu S, Ramachandran P, Pellicoro A, Ridgway RA, Seo SS, Spee B, Van Rooijen N et al: Macrophage-derived Wnt opposes Notch signaling to specify hepatic progenitor cell fate in chronic liver disease. Nat Med. 2012, 18: 572-579. 10.1038/nm.2667.
- Duncan AW, Rattis FM, DiMascio LN, Congdon KL, Pazianos G, Zhao C, Yoon K, Cook JM, Willert K, Gaiano N, Reya T: Integration of Notch and Wnt signaling in hematopoietic stem cell maintenance. Nat Immunol. 2005, 6: 314-322. 10.1038/ni1164.
- Pannequin J, Bonnans C, Delaunay N, Ryan J, Bourgaux JF, Joubert D, Hollande F: The wnt target jagged-1 mediates the activation of notch signaling by progastrin in human colorectal cancer cells. Cancer Res. 2009, 69: 6065-6073.
- Ungerback J, Elander N, Grunberg J, Sigvardsson M, Soderkvist P: The Notch-2 gene is regulated by Wnt signaling in cultured colorectal cancer cells. PLoS One. 2011, 6: e17957. 10.1371/journal.pone.0017957.
- Roma J, Almazan-Moga A, Sanchez de Toledo J, Gallego S: Notch, wnt, and hedgehog pathways in rhabdomyosarcoma: from single pathways to an integrated network. Sarcoma. 2012, 2012: 695603.
- Cheng J, Kang X, Zhang S, Yeh ET: SUMO-specific protease 1 is essential for stabilization of HIF1alpha during hypoxia. Cell. 2007, 131: 584-595. 10.1016/j.cell.2007.08.045.
- Wang X, Wei X, Thijssen B, Das J, Lipkin SM, Yu H: Three-dimensional reconstruction of protein networks provides insight into human genetic disease. Nat Biotechnol. 2012, 30: 159-164. 10.1038/nbt.2106.
- Zhong Q, Simonis N, Li QR, Charloteaux B, Heuze F, Klitgord N, Tam S, Yu H, Venkatesan K, Mou D et al: Edgetic perturbation models of human inherited disorders. Mol Syst Biol. 2009, 5: 321.
- Rosenbloom KR, Sloan CA, Malladi VS, Dreszer TR, Learned K, Kirkup VM, Wong MC, Maddren M, Fang R, Heitner SG et al: ENCODE Data in the UCSC Genome Browser: year 5 update. Nucleic Acids Res. 2013, 41: D56-63. 10.1093/nar/gks1172.
- Roy S, Ernst J, Kharchenko PV, Kheradpour P, Negre N, Eaton ML, Landolin JM, Bristow CA, Ma L, Lin MF et al: Identification of functional elements and regulatory circuits by Drosophila modENCODE. Science. 2010, 330: 1787-1797.
- Kowarsch A, Fuchs A, Frishman D, Pagel P: Correlated mutations: a hallmark of phenotypic amino acid substitutions. PLoS Comput Biol. 2010, 6:
- Simonson TS, Yang Y, Huff CD, Yun H, Qin G, Witherspoon DJ, Bai Z, Lorenzo FR, Xing J, Jorde LB et al: Genetic evidence for high-altitude adaptation in Tibet. Science. 2010, 329: 72-75. 10.1126/science.1189406.
- Bigham A, Bauchet M, Pinto D, Mao X, Akey JM, Mei R, Scherer SW, Julian CG, Wilson MJ, Lopez Herraez D et al: Identifying signatures of natural selection in Tibetan and Andean populations using dense genome scan data. PLoS Genet. 2010, 6:
- Scheinfeldt LB, Soi S, Thompson S, Ranciaro A, Woldemeskel D, Beggs W, Lambert C, Jarvis JP, Abate D, Belay G, Tishkoff SA: Genetic adaptation to high altitude in the Ethiopian highlands. Genome Biol. 2012, 13: R1. 10.1186/gb-2012-13-1-r1.
- Jensen LJ, Kuhn M, Stark M, Chaffron S, Creevey C, Muller J, Doerks T, Julien P, Roth A, Simonovic M et al: STRING 8--a global view on proteins and their functional interactions in 630 organisms. Nucleic Acids Res. 2009, 37: D412-416. 10.1093/nar/gkn760.
- R Development Core Team: R: A language and environment for statistical computing. 2010, R Foundation for Statistical Computing.
- Li B, Chen YW, Chen YQ: The nearest neighbor algorithm of local probability centers. IEEE Trans Syst Man Cybern B Cybern. 2008, 38: 141-154.View ArticlePubMedGoogle Scholar
- Berger JA, Hautaniemi S, Jarvinen AK, Edgren H, Mitra SK, Astola J: Optimized LOWESS normalization parameter selection for DNA microarray data. BMC Bioinformatics. 2004, 5: 194-10.1186/1471-2105-5-194.PubMed CentralView ArticlePubMedGoogle Scholar
- Rice JA: Mathematical Statistics and Data Analysis. 2006, Belmont, CA: Duxbury PressGoogle Scholar
- Sonnenberg A: Bonferroni-Holm sequential test procedure. Z Gastroenterol. 1985, 23: 703-704.PubMedGoogle Scholar
- Lin WY, Lee WC: Improving power of genome-wide association studies with weighted false discovery rate control and prioritized subset analysis. PLoS One. 7: e33716-
- Hu JX, Zhao H, Zhou HH: False Discovery Rate Control With Groups. J Am Stat Assoc. 105: 1215-1227.
- Schulz F, Wagner D, Weihe K: Dijkstra's algorithm on-line: An empirical case study from public railroad transport. Algorithm Engineering. 1999, 1668: 110-123. 10.1007/3-540-48318-7_11.View ArticleGoogle Scholar
- Welch BL: The Generalization of Students Problem When Several Different Population Variances Are Involved. Biometrika. 1947, 34: 28-35.PubMedGoogle Scholar
- Maere S, Heymans K, Kuiper M: BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics. 2005, 21: 3448-3449. 10.1093/bioinformatics/bti551.View ArticlePubMedGoogle Scholar
- Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13: 2498-2504. 10.1101/gr.1239303.PubMed CentralView ArticlePubMedGoogle Scholar
- Consortium TGO: Gene Ontology: tool for the unification of biology. Nature Genet. 2000, 25: 25-29. 10.1038/75556.View ArticleGoogle Scholar
- Sali A, Blundell TL: Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993, 234: 779-815. 10.1006/jmbi.1993.1626.View ArticlePubMedGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralView ArticlePubMedGoogle Scholar
- Xie L, Bourne PE: A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery. Bioinformatics. 2009, 25: i305-312. 10.1093/bioinformatics/btp220.PubMed CentralView ArticlePubMedGoogle Scholar
- Xie L, Bourne PE: Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments. Proc Natl Acad Sci USA. 2008, 105: 5441-5446. 10.1073/pnas.0704422105.PubMed CentralView ArticlePubMedGoogle Scholar
- Xie L, Bourne PE: A robust and efficient algorithm for the shape description of protein structures and its application in predicting ligand binding sites. BMC Bioinformatics. 2007, 8 (Suppl 4): S9-10.1186/1471-2105-8-S4-S9.PubMed CentralView ArticlePubMedGoogle Scholar
- Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer ELL et al: The Pfam Protein Families Database. Nucleic Acids Res. 2004, 32: D138-D141. 10.1093/nar/gkh121.PubMed CentralView ArticlePubMedGoogle Scholar
- Huang Y, Niu B, Gao Y, Fu L, Li W: CD-HIT Suite: a web server for clustering and comparing biological sequences. Bioinformatics. 2010, 26: 680-682. 10.1093/bioinformatics/btq003.PubMed CentralView ArticlePubMedGoogle Scholar
- Fodor AA, Aldrich RW: Influence of conservation on calculations of amino acid covariance in multiple sequence alignments. Proteins. 2004, 56: 211-221. 10.1002/prot.20098.View ArticlePubMedGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.PubMed CentralView ArticlePubMedGoogle Scholar
- Lockless SW, Ranganathan R: Evolutionarily conserved pathways of energetic connectivity in protein families. Science. 1999, 286: 295-299. 10.1126/science.286.5438.295.View ArticlePubMedGoogle Scholar
- Dekker JP, Fodor A, Aldrich RW, Yellen G: A perturbation-based method for calculating explicit likelihood of evolutionary co-variance in multiple sequence alignments. Bioinformatics. 2004, 20: 1565-1572. 10.1093/bioinformatics/bth128.View ArticlePubMedGoogle Scholar
- Kass I, Horovitz A: Mapping pathways of allosteric communication in GroEL by analysis of correlated mutations. Proteins. 2002, 48: 611-617. 10.1002/prot.10180.View ArticlePubMedGoogle Scholar
- Atchley WR, Wollenberg KR, Fitch WM, Terhalle W, Dress AW: Correlations among amino acid sites in bHLH protein domains: an information theoretic analysis. Mol Biol Evol. 2000, 17: 164-178. 10.1093/oxfordjournals.molbev.a026229.View ArticlePubMedGoogle Scholar
- Shenkin PS, Erman B, Mastrandrea LD: Information-Theoretical Entropy as a Measure of Sequence Variability. Proteins-Structure Function and Genetics. 1991, 11: 297-313. 10.1002/prot.340110408.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.