- Research article
- Open Access
Large-scale integrative network-based analysis identifies common pathways disrupted by copy number alterations across cancers
© Hwang et al.; licensee BioMed Central Ltd. 2013
- Received: 4 July 2012
- Accepted: 26 June 2013
- Published: 3 July 2013
Many large-scale studies analyzed high-throughput genomic data to identify altered pathways essential to the development and progression of specific types of cancer. However, no previous study has been extended to provide a comprehensive analysis of pathways disrupted by copy number alterations across different human cancers. Towards this goal, we propose a network-based method to integrate copy number alteration data with human protein-protein interaction networks and pathway databases to identify pathways that are commonly disrupted in many different types of cancer.
We applied our approach to a data set of 2,172 cancer patients across 16 different types of cancers, and discovered a set of commonly disrupted pathways, which are likely essential for tumor formation in majority of the cancers. We also identified pathways that are only disrupted in specific cancer types, providing molecular markers for different human cancers. Analysis with independent microarray gene expression datasets confirms that the commonly disrupted pathways can be used to identify patient subgroups with significantly different survival outcomes. We also provide a network view of disrupted pathways to explain how copy number alterations affect pathways that regulate cell growth, cycle, and differentiation for tumorigenesis.
In this work, we demonstrated that the network-based integrative analysis can help to identify pathways disrupted by copy number alterations across 16 types of human cancers, which are not readily identifiable by conventional overrepresentation-based and other pathway-based methods. All the results and source code are available at http://compbio.cs.umn.edu/NetPathID/.
- Copy Number Change
- Copy Number Alteration
- Copy Number Data
- Network View
- Label Propagation Algorithm
Recent high-throughput technologies have enabled researchers to identify genomic alterations that could result in activation of oncogenes or inactivation of tumor suppressor genes, and thus disrupt pathways and biological processes known to contribute to tumor formation [1, 2]. Many anticancer drugs have been developed to target proteins that act in these cancer-related pathways. Therefore, the precise identification and systemic characterization of altered activities in cancer-related pathways could accelerate the development of more effective targeted therapies, and aid in tailoring treatment to the genetic causes of an individual patient’s cancer .
Many large-scale genomic studies have been performed to define the cancer genome [3–11]. This effort is epitomized by The Cancer Genome Atlas [12–14] and its umbrella group, the International Cancer Genome Consortium . Typically, in these studies, enrichment analysis was performed to identify statistically significant overlap between the list of altered genes and pathways or predefined gene sets [16–19]. For example, publications based on TCGA data have identified disrupted pathways in many cancer types, and these studies attempt to integrate sequence data, expression data, epigenetic data and copy-number data to get a wholistic view of the cancer genome [12–14].
In more advanced network analysis, altered genes (e.g. differentially expressed genes or mutated genes) are first projected onto an interaction network, and then clusters are found in this network. Ideker and colleagues pioneered this approach  and later extended the approach to identify network signatures (e.g. pathways, subnetworks, or functional modules) [20–27]. Similarly, pathway-based methods have been developed to incorporate interactions of member genes in known biological pathways to measure activities of pathways. These pathway-based methods were shown to be more accurate at identifying cancer-related pathways compared to overrepresentation-based enrichment analysis [22, 28].
In the experiments, we first show the limitation of the enrichment analysis and current network-based analysis on DNA copy number, and then demonstrate that, although there are distinct patterns of copy number alteration in specific types of cancer, our method can identify common pathways disrupted in more than 10 different types of cancers. Our analysis of common and cancer-type specific disrupted pathways will lead to a better understanding of cancer network modules, and suggest potential therapeutic targets for cancer treatment. We also provide a network view of disrupted pathways to show how copy number alteration can disrupt core pathways that are essential for cancer development and progression.
Limitation of enrichment analysis and current network-based analysis on DNA copy number alterations
Although many efforts have been made to build gene set databases (e.g. KEGG, Biocarta, Reactome, or the Gene Ontology database), and significant work has been done to expand the current knowledge of gene functions and roles in cellular systems, many human genes are not yet annotated in existing gene set databases. Most notably, we observed that more than 70% of the genes that are identified as significantly altered based on copy number alterations (see Methods) are not annotated in current pathway databases (Additional file 1: Figure S1). Due to this low coverage of gene annotation in significantly altered copy number regions, overrepresentation-based enrichment analyses, and standard pathway-based analyses are omitting some of the most significantly altered genes in their analyses. Hence, they provide a limited analysis of pathway activity that is based on the small non-representative fraction of altered genes that are currently annotated in pathway databases.
Previous network-based methods suffer less from incomplete gene annotations for inferring pathway activity; they also have difficulty in analyzing pathway activity across cancers. This is primarily due to the diverse copy number alteration patterns that exist in different human cancers. By performing copy number alteration analysis using Genomic Identification of Significant Targets in Cancer (GISTIC)  in 16 types of cancers, we have found a diverse spectra of copy number alteration patterns, and show that only a few significantly altered genomic regions are present across multiple cancer types (Additional file 1: Figure S2). This lack of coherence in copy number alterations across cancers could lead to the failure of some network-based methods to identify common biological pathways affected by copy number alterations. Our method introduced below overcomes this limitation by using a label propagation technique along with a protein-protein interaction network that includes these unannotated gene products.
A network-based approach for discovering disrupted pathways based on copy number alterations across multiple cancer types
We performed extensive evaluation of NetPathID by comparing it with overrepresentation-based gene set enrichment analysis using hypergeometric testing, the method of Lee et al.[27, 28] (Additional file 2). We found that NetPathID can accurately identify cancer-related pathways from negative controls (i.e., randomly generated decoy pathways). We also confirmed that commonly disrupted pathways identified by NetPathID are related to cancer biology (see Additional file 2). Finally, we found that NetPathID is robust with regards to the bias in the protein-protein interaction networks (see Additional file 2).
Patterns of disrupted pathways based on copy number alterations across cancers
We applied our approach to copy number data from 16 types of human cancers, using information from protein-protein interaction network and predefined annotated pathway databases (see Methods). Statistical significance of the pathway score was assessed through permutation (Additional file 1: Figure S7). Pathways with significant Benjamini- Hochberg adjusted p-values were selected by using a false discovery rate cutoff of 0.005 for Biocarta, and 0.10 for both the KEGG and Reactome databases. In addition, to determine if a pathway was significantly disrupted in the given cancer type, we further filtered the selected pathways that were ranked in the top 20% based on pathway activity scores in each cancer type. Finally, 488, 456, and 855 (14%, 15%, and 12% of the total) pathways from Biocarta, KEGG, and Reactome pathway databases, respectively, were found to be significant across 16 types of cancers, and used for further analysis.
Co-disruption of pathways by copy number alterations in human cancers
Biological pathways often function cooperatively to contribute to phenotypes such as cancer. Thus, advances in understanding these pathways and their interconnectivity will accelerate the development of molecular targeted therapies that promise to change the practice of oncology . We first explored patterns of disrupted pathways by using two-way hierarchical clustering to identify clusters of pathways that are statistically co-disrupted (Figure 3A). Our analyses identified clear relationships among disrupted pathways such as telomerase (TEL), TGF-beta, RB, and P53 pathways. For example, the telomerase pathway is co-disrupted with TGF-beta, ATM signaling, and CTCF: First Multivalent Nuclear Factor (CTCF) pathways (Figure 3B (1)). A recent study experimentally validated that the TGF-beta signaling pathway negatively regulates the telomerase pathway, and other studies also reported that MDM2, which is a gene in the ATM pathway, and CTCF both inhibit the expression of telomerase [38–40]. Another interesting example of co-disrupted pathways includes ARF, p53, and RB pathways (Figure 3B (2)). Disruption of the RB pathways could activate the ARF pathway, and the activation of ARF could trigger the p53 pathway, which induces growth arrest and/or apoptosis . These consistent observations with previous studies demonstrate that NetPathID is capable of identifying valid patterns of co-disrupted pathways. Thus, further analysis of co-disrupted pathways may help to provide novel insights into the nature of pathway associations.
Common and cancer-type specific disrupted pathways based on copy number changes across cancers
We further attempted to identify common pathways disrupted by copy number alterations across cancers. We used the following strict criteria to define commonly disrupted pathways: 1) Pathways must have significant BH-adjusted p-values; 2) Pathways must be ranked within the top 10% compared to other pathways in each cancer study based on their activity scores, and; 3) Top ranked pathways must be present in at least 10 different types of cancer. Based on these criteria we found an average of 14 commonly disrupted pathways from KEGG, Biocarta, and Reactome (Figure 3, and Additional file 3: Tables S1 and S2). Examples of commonly disrupted pathways from the Biocarta pathway database include telomerase, transforming growth factor beta (TGF-Beta) signaling pathway, NTRK1 (TrkA) signaling and Cell Cycle pathways. Some of these pathways were already known to be altered broadly across many cancer types, such as TGF-Beta , Cell Cycle  and telomerase . Other pathways, though, have only been implicated in a few cancers. For example, TrkA signaling is known to play a role in neuronal cancers as well as a few non-neuronal cancers such as medullary thyroid carcinoma, lung, pancreatic, ovarian and breast cancers . However, TrkA signaling has not been functionally associated with cancers such as ALL, GIST, mesothelioma and renal cancer, which were identified in our analysis. This could have important implications because therapeutics have been developed that target TrkA signaling .
As we expected, NetPathID identified cancer related pathways such as the KEGG annotated Pancreatic, Colorectal, Glioma, Lung, Prostate and Bladder cancer pathways. Likewise, using the Reactome annotated pathways our method identified well known cancer-related pathways that regulate cell growth, death and proliferation including the EGFR signaling pathway, NF-kb activation, and Ras signaling pathway as commonly disrupted across many types of cancers (Additional file 3: Table S1). Importantly, our method also identified pathways that were not previously considered to be universally disrupted in cancers, such as the Adherens Junction and PECAM1 pathways . Our analysis found these pathways to be significant in all 16 cancer types in the KEGG annotated pathways and Reactome annotated pathways (Additional file 3: Table S1). These results support the idea of using novel targeted therapies, such as the monoclonal antibody targeting PECAM1 , in a wider array of cancer types.
In addition to pathways disrupted in the majority of cancers, we also found that there are sets of pathways that are only disrupted in specific types of human cancers (Figure 3C). Some examples include Cytokine Network (cytokines), Cytokines and Inflammatory Response (INFLAM), Dendritic cells in regulating TH1 and TH2 Development (DC) pathways, which are all only disrupted in acute lymphoblastic leukemia and myelodysplasia. These disrupted pathways are widely involved in T cell and B cell activities that are associated with immune responses, and activation and proliferation of specific differentiated immune cells. Other cancer-type specific disrupted pathways include the Sonic Hedgehog/Patched1 (SHH) Receptor Ptc1 Regulates cell cycle (PTC) pathway disrupted in renal and glioma cancers, and Role of Ran in mitotic spindle regulation (RANMS) pathway disrupted in colorectal cancer (Additional file 3: Table S13). These findings are supported by functional studies which have demonstrated that inhibition of the Sonic Hedgehog signaling pathway is known to induce renal cancer, and enhances the efficacy of targeted therapy in glioma [49, 50]. Ran, which controls the cell cycle through the regulation of mitotic spindle organization, was shown to be highly expressed in many cancer types including gastric and colorectal, and is known for its involvement in malignant transformation and/or the enhanced proliferation of cancer cells .
These observations suggest that commonly disrupted pathways across many types of cancers could play a major role in the development of cancers, while the set of disrupted pathways that are specific to certain types of cancers could help to characterize these types of cancers and provide options for different targeted therapies.
Current pathway databases cover only a small fraction of human genes. Therefore, although the use of these pathway databases as prior knowledge helps to define and identify disrupted pathways, it is possible that there are many more gene modules as yet undescribed that contribute to cancer. To tackle this challenge, we obtained 4,620 protein-protein interaction subnetwork modules that are conserved across different species, and use them as additional pathway data . The conserved subnetworks were identified by PathBLAST  among two (or more) species, and cover more than 8,558 proteins (genes). We used NetPathID with these 4,620 subnetwork modules and identified 41 commonly disrupted subnetworks that are present in at least 10 types of cancer and are ranked within top 5% in each cancer study (see Additional file 3: Table S1). This nicely illustrates how NetPathID is not simply biased to genes in existing databases, and is able to highlight modules of uncharacterized genes that are worthy of further study.
Commonly disrupted pathways across cancers correlate with clinical outcomes
We investigated whether we could use disrupted pathways discovered by NetPathID to identify subgroups of patients that correlate with different clinical outcomes, such as survival. Specifically, we hypothesized that commonly disrupted pathways reflect molecular mechanisms contributing to the biological/clinical behavior of cancers. Thus, we could use member genes in disrupted pathways as gene signatures to identify patient subgroups having different clinical outcomes.
Cancer-related genes are enriched in commonly disrupted pathways
We also investigated the commonly disrupted pathways in terms of their enrichment for known cancer-related genes. We hypothesized that commonly disrupted pathways across many types of cancers have many cancer “driver” genes as key components. More specifically, the presence of known cancer-causing genes with well-defined biological properties in a functional module can be used to make reasonable guesses about other candidate “driver” genes in the same functional module. To validate our hypothesis, we defined a functional module as the union of our commonly disrupted pathways (i.e., ranked within top k% in each cancer study, and present in more than 10 cancer types), and computed the fraction of known cancer-causing genes based on the Cancer Gene Census database from the Sanger Institute in the functional module. A higher cancer gene fraction for the functional module indicates more enrichment for cancer-causing genes. For comparison, we use the Lee et al. method [22, 28] that overlays gene scores obtained from aggregated and pooled analyses to pathways, and then ranked them to identify the top-ranked disrupted pathways.
The higher fraction of known cancer-causing genes in functional modules from our commonly disrupted pathways is consistent with our hypothesis that commonly disrupted pathways across many types of cancers would have many cancer “driver” genes. Thus, we predict that the other genes in these functional modules could also be cancer “drivers” and warrant further study.
An example of a disrupted signaling pathway: TGF-beta signaling pathway
It is evident from the network visualization that many of the annotated genes in TGF-beta signaling pathway have a relatively low recurrent frequency, which would imply that the pathway is only disrupted in a small cohort of cancer patients. However, by including interacting genes it becomes apparent that many genes that directly interact with members of the TGF-beta signaling pathway are altered in a significant percentage of cancer types. We also noted that the interacting partners are different for different cancer types. For example, in colorectal cancer, MAPRE1 binds to the tumor suppressor protein APC which is often mutated in familial and sporadic forms of colorectal cancer. MAPRE1 is also involved in processes including cell migration and adhesion, transcriptional activation, and apoptosis. In our data, the copy number of APC is not significantly altered by copy number, but our network view indicates that amplification of MAPRE1 could affect pathways and processes involved in APC regulation. In a similar manner, other highly amplified genes that are not annotated with the TGF-beta signaling pathway are interacting with genes in the TGF-beta signaling pathway, and thus, those highly amplified genes could affect activity of the TGF-beta signaling pathway. Another example is the relationship between HNF4a and colorectal cancer (Additional file 1: Figure S13a). HNF4A, a transcription factor regulated by TGF-beta signaling , is associated with diabetes and HCC, but has only recently been linked to intestinal tract pathology including ulcerative colitis  and Crohn’s disease . Our results indicate it may also be playing a role in colorectal cancer,
In ovarian cancer, the annotated genes in the TGF-beta pathway also have a relatively low recurrent frequency, while genes that directly interact with these annotated genes are altered in a high percentage of cancers. Interestingly, the interacting genes are not the same interacting genes found in the colorectal cancer network. In ovarian cancer, a few of the interesting interacting genes include MYC, a well-known oncogene and TRIB1, a novel regulator of the MAP kinase pathway recently linked to leukemogenesis .
These network views provide biologically meaningful insights into how copy number alterations in different genes, among different types of cancers could affect common pathways. The illustrations also demonstrate the usefulness of an integrative analysis to discover disrupted pathways, which contain many member genes each having low significance with respect to copy number changes.
Despite the success of our approach, there are limitations to the method. First, we use both amplified and deleted genes without distinguishing the two types. Thus, it would not be straightforward to interpret the effect on the activity of the pathways across cancers, because the disruption may be caused by either amplified or deleted genes. It would be possible to extend the method by separating amplification and deletion, but this could limit the ability to identify pathways with both amplified and deleted genes. In fact, we found many pathways disrupted by both amplified and deleted genes. Thus, one promising direction for further improvement of the method would be to incorporate other complementary genomic datasets to determine the role of disrupted pathways. For example, we could include datasets measuring expression of genes downstream of the pathways to determine the effect of copy number alterations on the pathways.
Another limitation in our analysis is the relatively sparse coverage of current protein-protein interaction databases. Instead of using the protein-protein interaction databases we could, instead, use functional linkage networks, which have more comprehensive coverage of a broad variety of gene relationships, and could allow for more sensitive discovery of network signatures under various conditions of interest . While existing network-based methods cannot handle the high density of these large-scale functional networks, the label propagation-based methods were successfully applied to functional linkage networks in a recent study . Thus, it would be interesting to use functional linkage networks to discover robust and reliable disrupted pathways across cancers.
As with all studies that use annotated pathways, there is the problem of overlap between pathways and the decision to include/exclude proteins when the pathway is annotated. For example, ERK1/2 and AKT are members of many of the annotated pathways such as TGF-beta, TRKA, and Telomerase that were identified in our study, yet the pathways specifically named “MAPK” or “AKT” by the annotating organization did not register as significantly altered. Our findings suggest that these pathways as a whole are not commonly dysregulated, but only specific aspects of these pathways are co-opted by cancer.
In this work, we have presented analysis using input from GISTIC with default settings. To test the robustness of NetPathID we tested different cutoff levels using GISTIC to see if the list of disrupted pathways would change. We found that NetPathID was remarkably robust even when the cutoff values were raised from the default setting of 0.1 to 0.3 or 0.5. Most of the rankings of commonly disrupted pathways and the rankings of pathways that were specific to one type of cancer remained constant (see Additional file 2 and Additional file 3: Tables S15–S21).
Another possible bias could arise from the inclusion of whole chromosome arm deletions or amplifications (e.g., 11q is clearly lost for neuroblastoma in Additional file 1: Figure S3), because it is likely that the majority of the genes on the chromosome arm are not driving tumor growth. We adapted NetPathID so that we could run the analysis with or without including whole chromosome arm gains and losses. Again, surprisingly, most of the rankings did not change significantly (see Additional file 2 and Additional file 3: Tables S15–S21).
One possible explanation for the robustness of NetPathID is that NetPathID is not limited to the set of genes in the genomic regions detected by GISTIC. The gene set is expanded using our label propagation method which results in pathway activity scores based on a larger gene set. In addition, NetPathID is initiated using the average log2 ratio of the amplified or deleted gene detected by GISTIC. Therefore, genes with low log2 ratios will have less of an effect than genes with high log2 ratios.
Finally, although our study focused on the discovery of disrupted pathways from datasets of copy number alterations, the algorithm is general and can be readily applied to other types of genomic data, including, gene expression, mutation, and methylation.
We have described a network-based integrative method for discovering disrupted pathways based on copy number alterations in human cancers. NetPathID integrates copy number data and the protein-protein interaction networks to quantify activity scores of pathways. Specifically, NetPathID effectively utilizes information in the protein-protein interaction network and copy number changes with label propagation to quantify altered activities of pathways. This approach has the potential to uncover disrupted pathways that cannot be discovered by using overrepresentation and pathways-based methods, which rely on a limited number of annotated genes. Thus, NetPathID is uniquely suitable for providing a global analysis of disrupted pathways across cancers.
We applied our approach to copy number data from 16 types of cancers, and discovered commonly disrupted pathways and pathways that are only disrupted in specific types of cancer. Functional enrichment analysis of commonly disrupted pathways demonstrated that many cancer types share common biological processes that define the malignant state, including self-sufficiency in growth signaling, insensitivity to antigrowth signals, inactivation of apoptosis, and genomic instability. Of particular significance, we identified a patient subpopulation with poor survival using member genes in disrupted pathways, implying the potential of these disrupted pathways to serve as a guide to therapy in a subgroup of patients.
Copy number data preparation
The copy number data from 16 human cancer types were collected from a recent study after removing cell lines and datasets with fewer than 15 patients samples (Sept. 2010)  (http://www.broadinstitute.org/tumorscape/pages/portalHome.jsf). Copy-number measurements were obtained using the Affymetrix 250 K SNP arrays. For the details of preprocessing and segmentation of copy number dataset, please refer to . To detect significantly altered copy number regions, we use GISTIC with default settings, with exceptions indicated in the text .
Human protein-protein interaction and pathway data
We obtained the protein-protein interaction network from the Human Protein Reference Database (May 2010) . This network contained 9,667 proteins and 76,132 binary edges. We obtained KEGG, Biocarta, and Reactome gene sets from MsigDB (Sept. 2010)  and 4,620 conserved subnetworks in the human protein-protein interaction network from . To reduce bias to disease proteins in the protein-protein interaction network, we use the extended protein-protein interaction network suggested by . The extended protein-protein interaction network is generated by combining the HPRD, OPHID, BIND, and MINT database, and has a similar degree of interactions for both disease and non-disease proteins. For details of network preparation and statistics, please refer to .
Collecting altered genes based on copy number alterations
Computing gene score
Computing the pathway activity score
where M jk is activity score of the jth pathway in kth type of cancers, and N Mj is the member genes in pathway M j .
Pooled and aggregated analysis
To perform the pooled analysis, we first incorporate all of the copy number data from 2,172 patients into one dataset. Then, we run GISTIC to identify the set of genes with significant copy number alterations. To discover disrupted pathways enriched with this set of genes, we rank genes based on –log10(qval) from GISTIC, and then select the top k% genes to perform overrepresentation-based analysis using hypergeometric testing. To perform the aggregated analysis, we run GISTIC to calculate the significance of the altered genes in each cancer type. After running GISTIC on all datasets, we summarize –log10(qval) for each gene across all datasets. Then we rank genes based on aggregated –log10(qval), and select the top k% genes to perform overrepresentation-based analysis using hypergeometric testing. Note that if one gene has a qval from GISTIC for both amplification and deletion, we select the more significant qval for the gene.
Significance of pathway scores
To assess significance of the pathway score, we performed the analysis on random datasets. To construct these control datasets we randomly shuffled initial gene activity scores and pathway member assignments 10,000 times to generate a background distribution of pathway scores. From this control dataset we were able to derive the empirical p-value of the actual scores.
Analysis of patterns of disrupted pathways
In each cancer type, pathway activity scores were Z-score transformed. Then, we perform two-way hierarchical clustering using Cluster 3 with complete linkage to analyze patterns of pathway co-disruption based on the inferred pathway activity for each cancer type in the dataset. To validate the correlation between clustered pathways, we use the Matlab corrcoeff function. Then, two-way hierarchical clustering was performed to plot the heat map describing the correlation coefficient of pathway co-disruption.
Network view of disrupted pathways
To generate the network view of the pathway, we collected annotated genes in the pathway, and genes that directly interact with one of the annotated genes based on the protein-protein interaction database. Interacting genes were only included in the network if their GISTIC-q-value was among the top 500 values out of 18,932 ranked genes.
Microarray gene expression data preparation, and clustering
Four microarray gene expression datasets were used for the identification of patient subgroups. The lung cancer dataset was downloaded from . We downloaded two ovarian cancer datasets (GSE9891, GSE3149) and one breast cancer dataset (GSE2034) from Gene Expression Omnibus (GEO). All datasets were RMA normalized, log transformed, and expression values were median centered. To perform unsupervised hierarchical clustering, we use Matlab clustergram function with average linkage.
We would like to thank to Minnesota Supercomputing Institute for allowing us to use their computational resources TKS is supported by grants from the Randy Shaver Community Cancer Research and Community Fund, the Masonic Cancer Center, and the NIH-NCI Grant R00CA151672. GW, and VK are supported by a grant from the National Science Foundation (NSF IIS-0916439). RK is supported by grants from the National Science Foundation (NSF III-1117153, IIS 1149697).
- Albertson DG, Collins C, et al: Chromosome aberrations in solid tumors. Nat Genet. 2003, 34 (4): 369-376. 10.1038/ng1215.View ArticlePubMedGoogle Scholar
- Shlien A, Malkin D: Copy number variations and cancer. Genome. 2009, 1 (6): 62-10.1186/gm62.View ArticleGoogle Scholar
- Wood LD, Parsons DW, et al: The genomic landscapes of human breast and colorectal cancers. Science. 2007, 318 (5853): 1108-10.1126/science.1145720.View ArticlePubMedGoogle Scholar
- Jones S, Zhang X, et al: Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science. 2008, 321 (5897): 1801-10.1126/science.1164368.PubMed CentralView ArticlePubMedGoogle Scholar
- McLendon R, Friedman A, et al: Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature. 2008, 455 (7216): 1061-1068. 10.1038/nature07385.View ArticleGoogle Scholar
- Parsons DW, Jones S, et al: An integrated genomic analysis of human glioblastoma multiforme. Science. 2008, 321 (5897): 1807-10.1126/science.1164382.PubMed CentralView ArticlePubMedGoogle Scholar
- van’t Veer LJ, Bernards R: Enabling personalized cancer medicine through analysis of gene-expression patterns. Nature. 2008, 452 (7187): 564-10.1038/nature06915.View ArticlePubMedGoogle Scholar
- Lee W, Jiang Z, et al: The mutation spectrum revealed by paired genome sequences from a lung cancer patient. Nature. 2010, 465 (7297): 473-477. 10.1038/nature09004.View ArticlePubMedGoogle Scholar
- Network, T. C. G. A. R: Integrated genomic analyses of ovarian carcinoma. Nature. 2011, 474: 609-615. 10.1038/nature10166.View ArticleGoogle Scholar
- Parsons DW, Li M, et al: The genetic landscape of the childhood cancer medulloblastoma. Science. 2011, 331 (6016): 435-10.1126/science.1198056.PubMed CentralView ArticlePubMedGoogle Scholar
- Stransky N, Egloff AM, et al: The mutational landscape of head and neck squamous cell carcinoma. Science. 2011, 333 (6046): 1157-1160. 10.1126/science.1208130.PubMed CentralView ArticlePubMedGoogle Scholar
- Hammerman PS, Lawrence MS, et al: Comprehensive genomic characterization of squamous cell lung cancers. Nature. 2012, 489: 519-525. 10.1038/nature11404.View ArticleGoogle Scholar
- Koboldt DC, Fulton RS, et al: “Comprehensive molecular portraits of human breast tumours.”. Nature. 2012Google Scholar
- Kandoth C, Schultz N, et al: Integrated genomic characterization of endometrial carcinoma. Nature. 2013, 497 (7447): 67-73. 10.1038/nature12113.View ArticlePubMedGoogle Scholar
- Hudson TJ, Anderson W, et al: International network of cancer genome projects. Nature. 2010, 464 (7291): 993-998. 10.1038/nature08987.View ArticlePubMedGoogle Scholar
- Ideker T, Ozier O, et al: Discovering regulatory and signalling circuits in molecular interaction networks. Bioinformatics. 2002, 18 (suppl 1): S233-10.1093/bioinformatics/18.suppl_1.S233.View ArticlePubMedGoogle Scholar
- Martin D, Brun C, et al: GOToolBox: functional analysis of gene datasets based on Gene Ontology. Genome Biol. 2004, 5 (12): R101-10.1186/gb-2004-5-12-r101.PubMed CentralView ArticlePubMedGoogle Scholar
- Subramanian A, Tamayo P, et al: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005, 102 (43): 15545-10.1073/pnas.0506580102.PubMed CentralView ArticlePubMedGoogle Scholar
- Da Wei Huang BTS, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2008, 4 (1): 44-57. 10.1038/nprot.2008.211.View ArticleGoogle Scholar
- Cabusora L, Sutton E, et al: Differential network expression during drug and stress response. Bioinformatics. 2005, 21 (12): 2898-10.1093/bioinformatics/bti440.View ArticlePubMedGoogle Scholar
- Rajagopalan D, Agarwal P: Inferring pathways from gene lists using a literature-derived network of biological relationships. Bioinformatics. 2005, 21 (6): 788-10.1093/bioinformatics/bti069.View ArticlePubMedGoogle Scholar
- Chuang HY, Lee E, et al: Network-based classification of breast cancer metastasis. Mol Syst Biol. 2007, 3 (1): 140-PubMed CentralPubMedGoogle Scholar
- Guo Z, Li Y, et al: Edge-based scoring and searching method for identifying condition-responsive protein–protein interaction sub-network. Bioinformatics. 2007, 23 (16): 2121-10.1093/bioinformatics/btm294.View ArticlePubMedGoogle Scholar
- Dittrich MT, Klau GW, et al: Identifying functional modules in protein–protein interaction networks: an integrated exact approach. Bioinformatics. 2008, 24 (13): i223-10.1093/bioinformatics/btn161.PubMed CentralView ArticlePubMedGoogle Scholar
- Ulitsky I, Shamir R: Identifying functional modules using expression profiles and confidence-scored protein interactions. Bioinformatics. 2009, 25 (9): 1158-10.1093/bioinformatics/btp118.View ArticlePubMedGoogle Scholar
- Cerami E, Demir E, et al: Automated network analysis identifies core pathways in glioblastoma. PLoS One. 2010, 5 (2): e8918-10.1371/journal.pone.0008918.PubMed CentralView ArticlePubMedGoogle Scholar
- Vandin F, Upfal E, et al: Algorithms for detecting significantly mutated pathways in cancer, Springer. Journal of Computational Biology. 2010, 18 (3): 507-522.View ArticleGoogle Scholar
- Lee E, Chuang HY, et al: Inferring pathway activity toward precise disease classification. PLoS Comput Biol. 2008, 4 (11): e1000217-10.1371/journal.pcbi.1000217.PubMed CentralView ArticlePubMedGoogle Scholar
- Vaske CJ, Benz SC, et al: Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM. Bioinformatics. 2010, 26 (12): i237-10.1093/bioinformatics/btq182.PubMed CentralView ArticlePubMedGoogle Scholar
- Beroukhim R, Getz G, et al: Assessing the significance of chromosomal aberrations in cancer: methodology and application to glioma. Proc Natl Acad Sci. 2007, 104 (50): 20007-10.1073/pnas.0710052104.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhou D, Bousquet O, et al: Learning with local and global consistency. Advances in neural information processing systems. 2004, 16.16: 321-328.Google Scholar
- Hwang TH, Sicotte H, et al: Robust and efficient identification of biomarkers by classifying features on graphs. Bioinformatics. 2008, 24 (18): 2023-10.1093/bioinformatics/btn383.View ArticlePubMedGoogle Scholar
- Mostafavi S, Ray D, et al: GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function. Genome Biol. 2008, 9 (Suppl 1): S4-10.1186/gb-2008-9-s1-s4.PubMed CentralView ArticlePubMedGoogle Scholar
- Tian Z, Hwang TH, et al: A hypergraph-based learning algorithm for classifying gene expression and arrayCGH data with prior knowledge. Bioinformatics. 2009, 25 (21): 2831-10.1093/bioinformatics/btp467.View ArticlePubMedGoogle Scholar
- Hwang T, Kuang R: A heterogeneous label propagation algorithm for disease gene discovery. Proceedings of SIAM international conference on data mining. 2010, 583-594.Google Scholar
- Vanunu O, Magger O, et al: Associating genes and protein complexes with disease via network propagation. PLoS Comput Biol. 2010, 6 (1): e1000641-10.1371/journal.pcbi.1000641.PubMed CentralView ArticlePubMedGoogle Scholar
- Hanahan D: The hallmarks of cancer. Cell. 2000, 100 (1): 57-70. 10.1016/S0092-8674(00)81683-9.View ArticlePubMedGoogle Scholar
- Lin SY, Elledge SJ: Multiple tumor suppressor pathways negatively regulate telomerase. Cell. 2003, 113 (7): 881-889. 10.1016/S0092-8674(03)00430-6.View ArticlePubMedGoogle Scholar
- Renaud S, Loukinov D, et al: CTCF binds the proximal exonic region of hTERT and inhibits its transcription. Nucleic Acids Res. 2005, 33 (21): 6850-10.1093/nar/gki989.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhao J, Bilsland A, et al: MDM2 negatively regulates the human telomerase RNA gene promoter. BMC cancer. 2005, 5 (1): 6-10.1186/1471-2407-5-6.PubMed CentralView ArticlePubMedGoogle Scholar
- Sherr CJ: Tumor surveillance via the ARF–p53 pathway. Genes Dev. 1998, 12 (19): 2984-10.1101/gad.12.19.2984.View ArticlePubMedGoogle Scholar
- Massagué J: TGFβ in cancer. Cell. 2008, 134 (2): 215-230. 10.1016/j.cell.2008.07.001.PubMed CentralView ArticlePubMedGoogle Scholar
- Williams GH, Stoeber K: "The cell cycle and cancer." The Journal of pathology. Proceedings of the National Academy of Sciences. 2012, 94.7 (1997): 2776-2778.Google Scholar
- Shay JW, Wright WE: Role of telomeres and telomerase in cancer. Seminars in cancer biology, Elsevier. In Seminars in cancer biology. 2011, 21 (No. 6): 349-353. 10.1016/j.semcancer.2011.10.001. Academic PressView ArticleGoogle Scholar
- Lagadec C, Meignan S, et al: TrkA overexpression enhances growth and metastasis of breast cancer cells. Oncogene. 2009, 28 (18): 1960-1970. 10.1038/onc.2009.61.View ArticlePubMedGoogle Scholar
- Huang H, Bhat A, et al: Targeting the ANGPT–TIE2 pathway in malignancy. Nat Rev Cancer. 2010, 10 (8): 575-585. 10.1038/nrc2894.View ArticlePubMedGoogle Scholar
- Bergom C, Gao C, et al: Mechanisms of PECAM-1-mediated cytoprotection and implications for cancer cell survival. Leuk Lymphoma. 2005, 46 (10): 1409-1421. 10.1080/10428190500126091.View ArticlePubMedGoogle Scholar
- DeLisser H, Liu Y, et al: Vascular endothelial platelet endothelial cell adhesion molecule 1 (PECAM-1) regulates advanced metastatic progression. Proc Natl Acad Sci. 2010, 107 (43): 18616-18621. 10.1073/pnas.1004654107.PubMed CentralView ArticlePubMedGoogle Scholar
- Dormoy V, Danilin S, et al: The sonic hedgehog signaling pathway is reactivated in human renal cell carcinoma and plays orchestral role in tumor growth. Mol Cancer. 2009, 8 (1): 123-10.1186/1476-4598-8-123.PubMed CentralView ArticlePubMedGoogle Scholar
- Ulasov IV, Nandi S, et al: Inhibition of Sonic Hedgehog and Notch Pathways Enhances Sensitivity of CD133+ Glioma Stem Cells to Temozolomide Therapy. Mol Med. 2011, 17 (1–2): 103-PubMed CentralPubMedGoogle Scholar
- Azuma K, Sasada T, et al: Ran, a small GTPase gene, encodes cytotoxic T lymphocyte (CTL) epitopes capable of inducing HLA-A33–restricted and tumor-reactive CTLs in cancer patients. Clin Cancer Res. 2004, 10 (19): 6695-10.1158/1078-0432.CCR-04-0818.View ArticlePubMedGoogle Scholar
- Suthram S, Dudley JT, et al: Network-based elucidation of human disease similarities reveals common functional modules enriched for pluripotent drug targets. PLoS Comput Biol. 2010, 6 (2): e1000662-10.1371/journal.pcbi.1000662.PubMed CentralView ArticlePubMedGoogle Scholar
- Kelley BP, Yuan B, et al: PathBLAST: a tool for alignment of protein interaction networks. Nucleic Acids Res. 2004, 32 (suppl 2): W83-PubMed CentralView ArticlePubMedGoogle Scholar
- Bild AH, Yao G, et al: Oncogenic pathway signatures in human cancers as a guide to targeted therapies. Nature. 2005, 439 (7074): 353-357.View ArticlePubMedGoogle Scholar
- Wang Y, Klijn JGM, et al: Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet. 2005, 365 (9460): 671-679.View ArticlePubMedGoogle Scholar
- Shedden K, Taylor JMG, et al: Gene expression–based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study. Nat Med. 2008, 14 (8): 822-827. 10.1038/nm.1790.PubMed CentralView ArticlePubMedGoogle Scholar
- Tothill RW, Tinker AV, et al: Novel molecular subtypes of serous and endometrioid ovarian cancer linked to clinical outcome. Clin Cancer Res. 2008, 14 (16): 5198-10.1158/1078-0432.CCR-08-0196.View ArticlePubMedGoogle Scholar
- Gu Z, Reynolds EM, et al: The type I serine/threonine kinase receptor ActRIA (ALK2) is required for gastrulation of the mouse embryo. Development. 1999, 126 (11): 2551-2561.PubMedGoogle Scholar
- Barrett JC, Lee JC, et al: Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region. Nat Genet. 2009, 41 (12): 1330-1334. 10.1038/ng.483.View ArticlePubMedGoogle Scholar
- Marcil V, Sinnett D, et al: Association between genetic variants in the HNF4A gene and childhood-onset Crohn’s disease. Genes Immun. 2012, 13.7: 565-556.Google Scholar
- Yokoyama T, Kanno Y, et al: Trib1 links the MEK1/ERK pathway in myeloid leukemogenesis. Blood. 2010, 116 (15): 2768-2775. 10.1182/blood-2009-10-246264.View ArticlePubMedGoogle Scholar
- Hwang T, Zhang W, et al: “Inferring disease and gene Set associations with rank coherence in networks.”. Bioinformatics. 2011, 27.19: 2692-2699.View ArticleGoogle Scholar
- Beroukhim R, Mermel CH, et al: The landscape of somatic copy-number alteration across human cancers. Nature. 2010, 463 (7283): 899-10.1038/nature08822.PubMed CentralView ArticlePubMedGoogle Scholar
- Keshava Prasad T, Goel R, et al: Human protein reference database—2009 update. Nucleic Acids Res. 2009, 37 (suppl 1): D767-D772.PubMed CentralView ArticlePubMedGoogle Scholar
- Wu X, Jiang R, et al: Network-based global inference of human disease genes. Mol Syst Biol. 2008, 4 (1): 189-PubMed CentralPubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.