Integrated genomics of susceptibility to alkylator-induced leukemia in mice
© Cahan and Graubert; licensee BioMed Central Ltd. 2010
Received: 29 April 2010
Accepted: 17 November 2010
Published: 17 November 2010
Therapy-related acute myeloid leukemia (t-AML) is a secondary, generally incurable, malignancy attributable to chemotherapy exposure. Although there is a genetic component to t-AML susceptibility in mice, the relevant loci and the mechanism(s) by which they contribute to t-AML are largely unknown. An improved understanding of susceptibility factors and the biological processes in which they act may lead to the development of t-AML prevention strategies.
In this work we applied an integrated genomics strategy in inbred strains of mice to find novel factors that might contribute to susceptibility. We found that the pre-exposure transcriptional state of hematopoietic stem/progenitor cells predicts susceptibility status. More than 900 genes were differentially expressed between susceptible and resistant strains and were highly enriched in the apoptotic program, but it remained unclear which genes, if any, contribute directly to t-AML susceptibility. To address this issue, we integrated gene expression data with genetic information, including single nucleotide polymorphisms (SNPs) and DNA copy number variants (CNVs), to identify genetic networks underlying t-AML susceptibility. The 30 t-AML susceptibility networks we found are robust: they were validated in independent, previously published expression data, and different analytical methods converge on them. Further, the networks are enriched in genes involved in cell cycle and DNA repair (pathways not discovered in traditional differential expression analysis), suggesting that these processes contribute to t-AML susceptibility. Within these networks, the putative regulators (e.g., Parp2, Casp9, Polr1b) are the most likely to have a non-redundant role in the pathogenesis of t-AML. While identifying these networks, we found that current CNVR and SNP-based haplotype maps in mice represented distinct sources of genetic variation contributing to expression variation, implying that mapping studies utilizing either source alone will have reduced sensitivity.
The identification and prioritization of genes and networks not previously implicated in t-AML generates novel hypotheses on the biology and treatment of this disease that will be the focus of future research.
Therapy-related acute myeloid leukemia (t-AML) is a secondary malignancy attributable to chemotherapy and/or radiation exposure. t-AML comprises 5-20% of adult AML cases and its prevalence is increasing along with the size of the population undergoing chemotherapy [1, 2]. While chemotherapy regimen  and genetic background  contribute to t-AML, the risk factors are not well understood. Strong evidence for genetic predisposition to t-AML is provided by inherited cancer syndromes such as neurofibromatosis, where germline mutations of NF1 are associated with increased risk of t-AML in humans and mice [5, 6]. Gaining a better understanding of t-AML susceptibility factors is a pressing concern as it may lead to prevention strategies and provide insight into the genesis of de novo AML.
One class of chemotherapeutics associated with t-AML is the alkylators (e.g., melphalan, busulfan, thiotepa). The therapeutic effect of alkylating agents is believed to result from the formation of DNA adducts and single- and double-strand breaks, which trigger apoptosis or growth arrest . Based on this presumed mechanism of alkylator action, genes involved in DNA repair , response to oxidative stress , and drug metabolism  have been investigated as mediators of t-AML susceptibility in candidate gene studies, with largely inconclusive results. A recent study in our lab investigated the genetic basis of t-AML susceptibility using inbred mice . In this study, eight to twelve individual mice from each of 20 inbred strains were treated with the alkylating agent N-nitroso-N-ethylurea (ENU), a potent mutagen with a propensity to cause AT:TA transversions and AT:GC transitions . Mice were monitored for the development of AML for up to 16 months post ENU exposure. The incidence of AML varied by strain from 0 to 80% (H2 = 0.10, P-value < 0.001), supporting the hypothesis that there is a strong genetic component in t-AML susceptibility.
We hypothesized that the pre-exposure transcriptional state of hematopoietic stem and progenitor cells, the putative target of leukemogenesis , underlies variation in susceptibility to t-AML. A pre-exposure transcriptional basis of susceptibility would be expected if a rapid response is critical in determining a cell's ultimate fate upon mutagen exposure. This hypothesis is consistent with the observation that expression of genes critical to surviving genotoxic stress in yeast does not change after exposure to DNA-damaging agents , implying that the necessary factors are already expressed at baseline. A similar situation has been reported in human lymphoblastoid cell lines, in which the pre-exposure transcriptional state of the cell more accurately predicts survival after alkylator treatment than the post-exposure state .
An integrated genetic map of inbred mouse strains
Global pre-exposure transcriptional state of hematopoietic stem and progenitor cells is associated with t-AML susceptibility
Functional enrichment of differentially expressed genes
protein modification process
hexose metabolic process
monosaccharide metabolic process
negative regulation of cellular metabolic process
nucleotide metabolic process
nucleoside triphosphate biosynthetic process
negative regulation of nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
apoptotic mitochondrial changes
glutamine family amino acid metabolic process
pyridine nucleotide metabolic process
negative regulation of transcription
Integrated cis-eQTL mapping identifies candidate drivers of t-AML susceptibility
Our previous eQTL analysis identified 408 expression traits (391 genes) in KL cells that were associated with 214 CNVRs . We repeated this analysis using the 48-strain haplotype resource to map KL expression traits to SNP-based haplotypes. We considered only cis-eQTL-associated genes, as it has been shown that trans-eQTLs contain a large proportion of false positives . We found 127 associations between expression traits and haplotypes, after selecting the most significant association per trait. In the current study, we used the combined set of SNP- and CNVR-based eQTLs to discover and explore genetically driven modules of co-expressed genes associated with t-AML susceptibility.
There are 45 genes (45 probes) that are both differentially expressed and linked to at least one eQTL. We refer to these genes as anchors throughout the text. Thirty-seven are linked to CNVR-eQTLs; the remaining 8 are linked to haplotype-eQTLs. To validate the cis-eQTL associations, we mined publicly available expression data representing hematopoietic stem, progenitor, erythroid and myeloid populations from the BXD recombinant inbred panel . Because these data were generated using the same GEP platform that we used, we were able to ask how our KL population is related to these more purified populations (Additional File 5). As expected, our KL expression profiles cluster most closely with stem and progenitor profiles and are distinct from both erythroid and myeloid lineages. For each anchor gene, we tested the association between BXD genotypes of SNPs within 2 Mb and anchor expression and corrected for multiple testing. We found that 30 of the 45 anchors were significantly associated with at least one SNP within 2 Mb in at least one of the hematopoietic compartments (26 in either Stem or Progenitor), supporting the hypothesis that expression differences of the anchor are caused by locally encoded genetic variation. Of 480 testable eQTL-transcript associations, 300 (62.5%) were replicated in at least one of the hematopoietic data sets. KL eQTLs may have failed to validate in the other tissues because they are false positives, because the causative genetic variant does not exist in the BXD strains, or because of tissue-specific regulation of expression.
Anchored network analysis identifies t-AML susceptibility expression modules
Anchored susceptibility modules
There is accumulating evidence that many genetic contributors to complex traits are not protein-coding changes [32, 33]. If true, then other classes of genetic events that can affect phenotype must, at some level, impact gene expression (i.e., eQTLs). Hypothesizing that such events contribute to t-AML susceptibility, we took an integrated genomics approach to identify and prioritize candidate transcriptional networks. The first step in this approach was to identify eQTLs in hematopoietic stem and progenitor cells, the likely target of leukemic transformation. Previously, we described a CNVR eQTL map in classical inbred mice . In the current work, we expanded this map to include SNP-based haplotype eQTLs. In deriving the mouse haplotype map, we found surprisingly little correlation between haplotypes and neighboring CNVRs. This is in contrast to human studies, where nearly 75% of common CNVRs are estimated to be in linkage disequilibrium with neighboring SNPs . This suggests that at the currently available resolution and coverage (and genotyping accuracy), mouse haplotypes and CNVRs represent distinct sources of genetic information. We found two-fold more CNVR eQTLs than haplotype-based eQTLs (401 vs. 167). It is tempting to speculate that this difference in eQTL types is because CNVRs have a stronger impact on expression in cis and therefore are more likely to be detected as eQTLs. However, the difference could largely be due to the reduced power to detect haplotype eQTLs because of the exacerbated multiple testing problem that comes with performing approximately 20 times more statistical tests. In total, greater than 60% of the eQTLs were reproducible in an independent dataset.
The second step in the integrated approach was to find genes differentially expressed between t-AML susceptible and resistant strains. Because unsupervised clustering using all expressed transcripts grouped strains by susceptibility status, we expected to find a large number of genes associated with susceptibility. Greater than 7% of the expressed transcripts are differentially expressed (976/13,496). These genes are enriched in several independent biological processes, most notably apoptosis. Among the differentially expressed intrinsic apoptosis genes are Caspase 9 (Casp9), B-cell leukemia/lymphoma 2 (Bcl-2), BCL2-associated agonist of cell death (Bad), BCL2-associated X protein (Bax), and mutS homolog 6 (Msh6). Msh6 is a member of the mutSα DNA mismatch recognition complex that has been shown to mediate apoptosis in certain contexts [35, 36]. Notably, the absence of mutSα activity in myeloid progenitors results in the complete loss of O6-methylguanine (O6MeG)-mediated cytotoxicity . That resistant strains have higher expression of Msh6 suggests that upon alkylator exposure, resistant strains may recognize DNA damage and respond appropriately (i.e., undergo apoptosis) whereas the KL cells of susceptible strains may tend to survive, accumulate mutations, and become transformed. In KL cells, almost all susceptible strains have no detectable expression of Casp9, an initiator of programmed cell death, suggesting that these cells (low-to-no Casp9 expression) are less primed for Casp9-dependent apoptosis. Although knockout of Casp9 in mice is embryonic lethal, presumably due to severe brain development defects [38, 39], Casp9 is not required for apoptosis during normal hematopoiesis . Therefore, inbred strains may exhibit genetically-driven yet tissue-specific differences in apoptosis such that hematopoietic cells of susceptible strains are relatively protected from cell death after exposure to genotoxic stress.
Differential expression and gene enrichment analysis highlighted several biologically plausible pathways that may underlie t-AML susceptibility. However, it remained unclear which pathway members, if any, are causal contributors to the phenotype, as illustrated by the complex expression patterns of the intrinsic apoptosis genes. More broadly, the role and relative importance of each of the 917 differentially expressed genes in susceptibility remained undetermined. We hypothesized that among the 917 differentially expressed genes would be a subset in which expression variation is caused by cis-encoded genetic variation. Further, we posited that these 'anchor' genes cause expression variation of multiple downstream genes, which collectively are associated with t-AML susceptibility (or resistance). The mechanisms by which anchors might act in trans are varied. They include altered transcription factor abundance and homeostatic or compensatory forces within and between biological pathways. Regardless of the mechanisms of action, the identification of putative events that influence susceptibility and their linkage to gene networks forms a powerful and practical strategy to both find biological pathways underlying cancer susceptibility and to prioritize candidate mediators. Therefore, as the third step in the integrated genomics approach, we identified networks of genes that are significantly correlated with candidate susceptibility anchors. To validate the networks, we used independent gene expression data from multiple hematopoietic populations, trimming the networks of response genes whose expression was not reproduced.
One of the benefits of the integrated genomics approach is that it can implicate biological processes that would not have been detected using differential expression alone. The susceptibility networks that we identified are enriched in genes involved in DNA repair, base excision repair, apoptosis, and cell cycle, among other annotations. A second potential benefit of the integrated approach is that it differentiates between upstream (anchors) and response genes, an advantage over existing approaches that derive gene regulatory networks from expression data alone. While this is a hypothesis that remains to be tested, the identification of candidate upstream factors will be useful in prioritizing among apoptosis-related genes for experimental validation. For example, although Casp9 and Bcl2 are both differentially expressed, Casp9 is also the candidate anchor of module A_33, the module most strongly associated with susceptibility status. We speculate that perturbation of candidate anchors, such as Casp9, is more likely to be informative in elucidating susceptibility than perturbation of response genes (e.g., Bcl2).
Network analysis allowed us to predict the function of uncharacterized genes. For example, A630001G21Rik is expressed primarily in primitive hematopoietic and B-cells , yet its function is undetermined. Our analysis places it as the anchor of module A_12, which is enriched in apoptosis-related genes, including Bcl2. Therefore, A630001G21Rik may play a previously unknown role in the regulation of Bcl2 expression and apoptosis activity. Similarly, Cytoskeleton-associated protein-like 2 (Ckap2l) is the anchor of the largest module, A_16, which is enriched in both cell cycle and DNA repair genes. Although Ckap2l is highly expressed in hematopoietic progenitors , its functions are unknown. Its closest paralog, Ckap2, is highly expressed in mouse stem cell lines and has detectable expression in hematopoietic progenitors, bone marrow, osteoclasts, osteoblasts, and macrophages . There is a growing body of literature suggesting that Ckap2 (also known as Tumor-associated microtubule-associated protein) is involved in cell cycle progression [42–44]. It is possible that Ckap2l contributes to cell cycle regulation in HSCs and progenitors, and that genetic disturbances of its expression impact t-AML susceptibility. Experiments that perturb expression of anchor genes such as Casp9 and Ckap2l to assess their impact on module expression and activity are the next logical steps in determining the role of candidate networks in susceptibility. If such experiments demonstrate a causal link between anchor genes and module expression, then moving forward to formally define their role in leukemia will be warranted.
A drawback to the anchored network approach, as currently implemented, is that it assumes there is only a single anchor per module. In cases where CNVRs disrupt regulatory elements, it is possible that a single genetic event impacts the expression of multiple neighboring genes. For example, in module A_37 (Figure 4C) we found 10 response genes within 7 Mb of a CNVR. This module warrants special attention because it includes poly (ADP-ribose) polymerase family member 2 (Parp2, the anchor) and apurinic/apyrimidinic endonuclease 1 (Apex1), both members of the base excision repair pathway [45, 46]. Both genes have lower expression in susceptible strains, again suggesting that lowered overall DNA damage response promotes susceptibility.
A caveat to the current work is that maps of genetic variation in the mouse genome are incomplete, a knowledge gap that promises to be filled by more informative SNP arrays  and next-generation sequencing [48, 49]. It is possible that un-captured genetic variants are the ultimate cause of the observed co-expression networks. These variants may mediate their impact through mechanisms other than altering the expression of anchors. In the extreme case, none of the modules may be controlled by anchor expression; all may instead be driven by undetected causes. Nevertheless, the modules themselves are still informative in that they describe sets of coordinately regulated genes that, collectively, are associated with both t-AML susceptibility and biologically plausible processes and pathways.
To our knowledge, this is the first report of an integrated genomics approach to dissect the role of the pre-exposure transcriptional state in t-AML susceptibility. From a clinical perspective, t-AML is important because the response to treatment is poor and survival is short . But because t-AML is a clinically-induced malignancy, it is by definition preventable. Therefore, a long-term goal in this field is to gain sufficient understanding of susceptibility factors in order to make worthwhile the personalization of chemotherapeutic regimens based on t-AML risk. The transcriptional networks and their candidate anchors described here are an important early step towards gaining such an understanding.
Construction of SNP-based haplotype map
(1) Begin with the first informative SNP on a chromosome.
(2) If the number of SNPs in the current block is 1 then go to (3). Otherwise, go to (4).
(3) Group strains by genotype and add the next consecutive SNP to the current block.
(4) Cluster strains by SNP-based distance using Partitioning Around Medoids (number of clusters = 2 to 6).
(5) Assign haplotype labels to strains based on the clustering with the maximum average silhouette .
(6) Derive consensus haplotypes. For each haplotype cluster, a consensus haplotype is defined as the string composed of the most frequent genotype at each SNP position.
(7) Compare the consensus haplotypes to the actual SNP genotypes.
(8) If the number of errors is greater than 1 then go to (9), otherwise go to (10).
(9) Remove the most recently added SNP from the current block. Store the haplotyping results from the previous iteration. Start a new block with the current SNP. Go to (3).
(10) Add the next consecutive SNP to the current block. Go to (4). If there are no more SNPs on the current chromosome, select a new chromosome and go to (2). The computation is complete when all chromosomes have been analyzed.
SNP-based distances between strains are computed as the sum of SNP differences between strains. The range of number of allowable haplotypes per block was selected based on the estimated number of ancestral haplotypes . Pooled multi-allelic R2 was computed based on haplotype frequencies .
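The distance, consensus, and error-count steps of the procedure above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the encoding of each strain's block as a genotype string, and the reading of "errors" as the number of genotype calls that disagree with their cluster's consensus are all assumptions made for illustration. Cluster labels are taken as given here; in the procedure they come from Partitioning Around Medoids.

```python
from collections import Counter

def snp_distance(a, b):
    """Sum of SNP differences between two equal-length genotype strings."""
    return sum(1 for x, y in zip(a, b) if x != y)

def consensus_haplotype(genotypes):
    """Most frequent genotype at each SNP position across one cluster.
    Ties go to the genotype seen first (an arbitrary choice made here)."""
    return "".join(Counter(column).most_common(1)[0][0]
                   for column in zip(*genotypes))

def block_errors(block, labels):
    """Count genotype calls that disagree with their cluster's consensus
    (assumed definition of 'errors'; the text does not spell it out)."""
    clusters = {}
    for strain, label in labels.items():
        clusters.setdefault(label, []).append(block[strain])
    errors = 0
    for members in clusters.values():
        consensus = consensus_haplotype(members)
        errors += sum(snp_distance(g, consensus) for g in members)
    return errors

# Hypothetical four-SNP block typed in five strains, with two haplotype clusters.
block = {"s1": "AAGT", "s2": "AAGT", "s3": "AAGC", "s4": "GGCC", "s5": "GGCC"}
labels = {"s1": 0, "s2": 0, "s3": 0, "s4": 1, "s5": 1}

errors = block_errors(block, labels)
print(errors)        # 1: only strain s3 deviates, at the last SNP
print(errors > 1)    # False, so the block would be extended rather than closed
```

In this toy block a single mismatch is tolerated, so the next SNP would be added and the strains re-clustered; a second mismatch would close the block at the previous SNP.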
Integrated expression QTL mapping
Gene expression profiling (GEP) was previously described  and the data are available at GEO under accession GSE10656. These data are referred to as kit+/lineage- (KL) throughout the text. Hypothalamus and adipose tissue expression data were obtained from GEO (accessions GSE5961 and GSE8028, respectively). For clustering and network analysis, probes were first filtered based on detection. In the KL data, a probe was considered detected in a sample if its signal was greater than that of a set of negative controls on the Illumina array. A total of 13,496 probes were detected in all biological replicates of at least three strains (excluding C3H, for which only one array was analyzed). Only the 14,871 and 10,040 probes that were detected as present in at least 25% of the strains in the hypothalamus and adipose data sets, respectively, were kept for clustering analysis.
Unsupervised hierarchical clustering was performed with R's hclust function, using 1 - Pearson correlation as the distance metric and the complete linkage method for node merging. To assess the non-randomness of the strains clustering according to susceptibility status, we computed the ratio of the mean of the distances among susceptible strains to the mean of the distances between all susceptible and resistant strains. Then, we permuted the strain labels 10,000 times and recomputed the ratio of distances each time. The P-value of the observed clustering is the number of random permutations in which the distance test statistic is ≥ the observed distance test statistic, divided by 10,000. This analysis was performed on the median expression profiles of strain replicates, only in those strains in which the susceptibility status is known. SNP clustering was based on strain-strain pair-wise distances, computed by counting the number of SNPs that differ between two strains and dividing by the total number of SNPs that are typed in both strains.
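As an illustration, a label-permutation test of this kind might look like the following Python sketch. It assumes a precomputed, symmetric strain-by-strain distance table (for example 1 - Pearson correlation); the function names and the toy data are hypothetical. One choice deserves care: with the ratio defined as within-susceptible distance over susceptible-to-resistant distance, tighter clustering gives a smaller ratio, so this sketch counts permutations whose ratio is at most the observed one. Counting the opposite tail would be appropriate for the reciprocal statistic.

```python
import random

def distance_ratio(dist, status):
    """Mean distance among susceptible strains divided by the mean
    distance between susceptible and resistant strains."""
    sus = [s for s, v in status.items() if v == "S"]
    res = [s for s, v in status.items() if v == "R"]
    within = [dist[a][b] for i, a in enumerate(sus) for b in sus[i + 1:]]
    between = [dist[a][b] for a in sus for b in res]
    return (sum(within) / len(within)) / (sum(between) / len(between))

def permutation_p(dist, status, n_perm=10000, seed=0):
    """Observed ratio and the fraction of label permutations whose ratio
    is at least as small as the observed one (tail choice is an assumption)."""
    rng = random.Random(seed)
    observed = distance_ratio(dist, status)
    strains = list(status)
    labels = [status[s] for s in strains]
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)  # reassign susceptibility labels at random
        if distance_ratio(dist, dict(zip(strains, labels))) <= observed:
            hits += 1
    return observed, hits / n_perm

# Toy data: six hypothetical strains placed on a line, two tight groups.
position = {"a": 0, "b": 1, "c": 2, "d": 10, "e": 11, "f": 12}
dist = {s: {t: abs(position[s] - position[t]) for t in position} for s in position}
status = {"a": "S", "b": "S", "c": "S", "d": "R", "e": "R", "f": "R"}

observed, p_value = permutation_p(dist, status, n_perm=2000, seed=1)
print(round(observed, 4), p_value)
```

With only six strains there are just 20 distinct labelings, so the smallest attainable P-value in this toy case is about 0.1; the strain panel in the study allows much finer resolution.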
Differential expression analysis
Strains with unknown susceptibility status were not included in the differential expression analysis. We used the limma package in R to model the expression of each gene with coefficients representing strain replicates and susceptibility status [53, 54]. The false discovery rate (FDR) was estimated using q-value . All of the 976 significant probes were detected as present in at least 50% of either the susceptible or resistant strains. When a gene was targeted by more than one probe, only the most significantly differentially expressed probe was used for visualization. Association of module eigengenes with susceptibility was tested in the same way as differential expression. Enrichment analysis was performed using DAVID . Only GO Biological Process level 5 annotations and KEGG pathways were assessed. We report only annotations that pass an FDR threshold of < 25%. Expression data from all 20 strains previously profiled were used in expression network analysis. Anchored expression networks were identified by searching for probes whose expression profiles were significantly correlated with anchor gene expression at an FDR threshold of < 1%.
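The anchored search in the last sentence can be sketched roughly as follows in Python. Several pieces are stand-ins rather than the authors' method: the text does not state which correlation test was used, so the P-value here comes from Fisher's z approximation, and the Benjamini-Hochberg adjustment is used in place of whatever FDR procedure was actually applied. All names and data are hypothetical.

```python
import math
from statistics import NormalDist

def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def fisher_z_pvalue(r, n):
    """Approximate two-sided P-value for r via Fisher's z (needs n > 3)."""
    r = max(min(r, 0.999999), -0.999999)  # keep atanh finite
    z = math.atanh(r) * math.sqrt(n - 3)
    return 2.0 * (1.0 - NormalDist().cdf(abs(z)))

def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted P-values, returned in input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):  # walk from the largest P-value down
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

def anchored_module(anchor, probes, fdr=0.01):
    """Probes whose profile correlates with the anchor below the FDR cutoff."""
    names = list(probes)
    n = len(anchor)
    pvals = [fisher_z_pvalue(pearson(anchor, probes[p]), n) for p in names]
    return [p for p, q in zip(names, bh_adjust(pvals)) if q < fdr]

# Hypothetical anchor profile across 20 strains and two candidate probes.
anchor = [float(i) for i in range(20)]
wobble = [0.5 if i % 2 == 0 else -0.5 for i in range(20)]
probes = {
    "linked": [2.0 * a + w for a, w in zip(anchor, wobble)],
    "unlinked": [2.0 * w for w in wobble],
}
print(anchored_module(anchor, probes))  # ['linked']
```

With only 20 strains the normal approximation is rough, so an exact t-based test would be the more conservative choice in practice.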
Expression quantitative trait locus mapping
CNVR eQTLs previously identified were used in this analysis , and eQTLs based on SNP haplotypes were identified using the haplotype association method with weighted strain permutation to account for strain relatedness [57–59].
Analysis of coexpression networks
Normalized gene expression data used for validation of eQTLs and anchored modules were downloaded from GEO (GSE18067). This data set includes profiling of sorted (purified) hematopoietic stem, progenitor, myeloid and erythroid populations from female BXD recombinant inbred mice . Only detection calls, coded as 0 for absent or 1 for present, were used to globally compare our KL data to the BXD data. Clustering was performed using the same parameters as described above for the KL data. KL eQTLs were validated by testing the association between the genotypes of SNPs within 2 Mb of each anchor (driver) gene and the expression of that gene in each compartment separately. Genotypes were treated as factors in a linear model of driver gene expression. P-values of the resulting F-statistics were adjusted for multiple testing using Holm's method . Drivers that had corrected P-values < 0.05 in at least one compartment were considered validated. The reproducibility of the association between driver and response gene expression was assessed in a similar manner. A linear model of response gene expression was fit with driver gene expression as the independent variable (one model per driver-response gene pair per compartment). In this case, Benjamini and Hochberg's method to control the false discovery rate was applied to the resulting P-values . WGCNA analysis was performed as previously described using the R package WGCNA . Briefly, the β value used to calculate the weighted network adjacency was chosen as the power at which the scale-free topology fit (R2) exceeded 0.9. Weighted adjacency matrices were computed, modules were defined using the cutreeDynamic function (which selects good dendrogram cutoffs) and similar modules were merged using mergeCloseModules (which compensates for the high sensitivity of WGCNA). Eigengenes were computed as the first principal component of a module's expression matrix.
Eigengenes were tested for differential expression between susceptible and resistant strains, as described above for individual genes.
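For readers unfamiliar with the Holm correction used in the eQTL validation step, the adjustment can be sketched in a few lines of Python. This is an illustrative implementation of the standard step-down procedure, not the authors' code; the function name and the example P-values are hypothetical, and how tests were grouped before adjustment (per driver, per compartment, or overall) is not specified in the text.

```python
def holm_adjust(pvalues):
    """Holm step-down adjusted P-values, returned in input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, i in enumerate(order):  # rank 0 is the smallest P-value
        candidate = min(1.0, (m - rank) * pvalues[i])
        running_max = max(running_max, candidate)  # enforce monotonicity
        adjusted[i] = running_max
    return adjusted

# Hypothetical F-test P-values for four SNPs near one driver gene.
raw = [0.01, 0.04, 0.03, 0.005]
adjusted = holm_adjust(raw)
print([round(p, 4) for p in adjusted])   # [0.03, 0.06, 0.06, 0.02]
print(any(p < 0.05 for p in adjusted))   # True: at least one SNP survives
```

Holm controls the family-wise error rate and is therefore stricter than the Benjamini-Hochberg procedure applied to the driver-response associations, which controls the expected proportion of false discoveries instead.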
List of Abbreviations
t-AML: therapy-related acute myeloid leukemia
CNV: DNA copy number variation
eQTL: expression quantitative trait loci
SNPs: single nucleotide polymorphisms
CNVR: copy number variant regions
GEP: gene expression profiling
FDR: false discovery rate
WGCNA: Weighted Gene Co-expression Network Analysis
Bcl-2: B-cell leukemia/lymphoma 2
Bad: BCL2-associated agonist of cell death
Bax: BCL2-associated X protein
Msh6: mutS homolog 6
O6MeG: O6-methylguanine
Ckap2l: Cytoskeleton-associated protein-like 2
Parp2: poly (ADP-ribose) polymerase family member 2
Apex1: apurinic/apyrimidinic endonuclease 1.
P.C. was supported in part by the National Human Genome Research Institute (T32 HG000045) and a Kauffman Fellowship. This work was supported by the NIH (P01CA101937). We thank Nancy L. Saccone for helpful discussions.
- Leone G, Voso MT, Sica S, Morosetti R, Pagano L: Therapy related leukemias: susceptibility, prevention and treatment. Leuk Lymphoma. 2001, 41 (3-4): 255-276. 10.3109/10428190109057981.
- Leone G, Pagano L, Ben-Yehuda D, Voso MT: Therapy-related leukemia and myelodysplasia: susceptibility and incidence. Haematologica. 2007, 92 (10): 1389-1398. 10.3324/haematol.11034.
- Larson RA, Le Beau MM: Therapy-related myeloid leukaemia: a model for leukemogenesis in humans. Chem Biol Interact. 2005, 153-154: 187-195. 10.1016/j.cbi.2005.03.023.
- Knoche E, McLeod HL, Graubert TA: Pharmacogenetics of alkylator-associated acute myeloid leukemia. Pharmacogenomics. 2006, 7 (5): 719-729. 10.2217/14622416.7.5.719.
- Side L, Taylor B, Cayouette M, Conner E, Thompson P, Luce M, Shannon K: Homozygous inactivation of the NF1 gene in bone marrow cells from children with neurofibromatosis type 1 and malignant myeloid disorders. N Engl J Med. 1997, 336 (24): 1713-1720. 10.1056/NEJM199706123362404.
- Jacks T, Shih TS, Schmitt EM, Bronson RT, Bernards A, Weinberg RA: Tumour predisposition in mice heterozygous for a targeted mutation in Nf1. Nat Genet. 1994, 7 (3): 353-361. 10.1038/ng0794-353.
- Meikrantz W, Bergom MA, Memisoglu A, Samson L: O6-alkylguanine DNA lesions trigger apoptosis. Carcinogenesis. 1998, 19 (2): 369-372. 10.1093/carcin/19.2.369.
- Seedhouse C, Bainton R, Lewis M, Harding A, Russell N, Das-Gupta E: The genotype distribution of the XRCC1 gene indicates a role for base excision repair in the development of therapy-related acute myeloblastic leukemia. Blood. 2002, 100 (10): 3761-3766. 10.1182/blood-2002-04-1152.
- Allan JM, Wild CP, Rollinson S, Willett EV, Moorman AV, Dovey GJ, Roddam PL, Roman E, Cartwright RA, Morgan GJ: Polymorphism in glutathione S-transferase P1 is associated with susceptibility to chemotherapy-induced leukemia. Proc Natl Acad Sci USA. 2001, 98 (20): 11592-11597. 10.1073/pnas.191211198.
- Larson RA, Wang Y, Banerjee M, Wiemels J, Hartford C, Le Beau MM, Smith MT: Prevalence of the inactivating 609C-->T polymorphism in the NAD(P)H:quinone oxidoreductase (NQO1) gene in patients with primary and therapy-related myeloid leukemia. Blood. 1999, 94 (2): 803-807.
- Fenske TS, McMahon C, Edwin D, Jarvis JC, Cheverud JM, Minn M, Mathews V, Bogue MA, Province MA, McLeod HL, et al: Identification of candidate alkylator-induced cancer susceptibility genes by whole genome scanning in mice. Cancer Res. 2006, 66 (10): 5029-5038. 10.1158/0008-5472.CAN-05-3404.
- Noveroske JK, Weber JS, Justice MJ: The mutagenic action of N-ethyl-N-nitrosourea in the mouse. Mamm Genome. 2000, 11 (7): 478-483. 10.1007/s003350010093.
- Thirman MJ, Larson RA: Therapy-related myeloid leukemia. Hematol Oncol Clin North Am. 1996, 10 (2): 293-320. 10.1016/S0889-8588(05)70340-3.
- Birrell GW, Brown JA, Wu HI, Giaever G, Chu AM, Davis RW, Brown JM: Transcriptional response of Saccharomyces cerevisiae to DNA-damaging agents does not identify the genes that protect against these agents. Proc Natl Acad Sci USA. 2002, 99 (13): 8778-8783. 10.1073/pnas.132275199.
- Fry RC, Svensson JP, Valiathan C, Wang E, Hogan BJ, Bhattacharya S, Bugni JM, Whittaker CA, Samson LD: Genomic predictors of interindividual differences in response to DNA damaging agents. Genes Dev. 2008, 22 (19): 2621-2626. 10.1101/gad.1688508.
- Schadt EE: Molecular networks as sensors and drivers of common human diseases. Nature. 2009, 461 (7261): 218-223. 10.1038/nature08454.
- Meng H, Vera I, Che N, Wang X, Wang SS, Ingram-Drake L, Schadt EE, Drake TA, Lusis AJ: Identification of Abcc6 as the major causal gene for dystrophic cardiac calcification in mice through integrative genomics. Proc Natl Acad Sci USA. 2007, 104 (11): 4530-4535. 10.1073/pnas.0607620104.
- Wang SS, Shi W, Wang X, Velky L, Greenlee S, Wang MT, Drake TA, Lusis AJ: Mapping, genetic isolation, and characterization of genetic loci that determine resistance to atherosclerosis in C3H mice. Arterioscler Thromb Vasc Biol. 2007, 27 (12): 2671-2676. 10.1161/ATVBAHA.107.148106.
- Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, Colinayo V, Ruff TG, Milligan SB, Lamb JR, Cavet G, et al: Genetics of gene expression surveyed in maize, mouse and man. Nature. 2003, 422 (6929): 297-302. 10.1038/nature01434.
- Schadt EE, Lamb J, Yang X, Zhu J, Edwards S, Guhathakurta D, Sieberts SK, Monks S, Reitman M, Zhang C, et al: An integrative genomics approach to infer causal associations between gene expression and disease. Nat Genet. 2005, 37 (7): 710-717. 10.1038/ng1589.
- Yang X, Deignan JL, Qi H, Zhu J, Qian S, Zhong J, Torosyan G, Majid S, Falkard B, Kleinhanz RR, et al: Validation of candidate causal genes for obesity that affect shared metabolic pathways and networks. Nat Genet. 2009, 41 (4): 415-423. 10.1038/ng.325.
- Ghazalpour A, Doss S, Zhang B, Wang S, Plaisier C, Castellanos R, Brozell A, Schadt EE, Drake TA, Lusis AJ: Integrating genetic and network analysis to characterize genes related to mouse weight. PLoS Genet. 2006, 2 (8): e130. 10.1371/journal.pgen.0020130.
- Plaisier CL, Horvath S, Huertas-Vazquez A, Cruz-Bautista I, Herrera MF, Tusie-Luna T, Aguilar-Salinas C, Pajukanta P: A systems genetics approach implicates USF1, FADS3, and other causal candidate genes for familial combined hyperlipidemia. PLoS Genet. 2009, 5 (9): e1000642. 10.1371/journal.pgen.1000642.
- Cahan P, Li Y, Izumi M, Graubert TA: The impact of copy number variation on local gene expression in mouse hematopoietic stem and progenitor cells. Nat Genet. 2009, 41 (4): 430-437. 10.1038/ng.350.
- Bogue MA, Grubb SC, Maddatu TP, Bult CJ: Mouse Phenome Database (MPD). Nucleic Acids Res. 2007, 35 (Database issue): D643-649. 10.1093/nar/gkl1049.
- Cahan P, Godfrey LE, Eis PS, Richmond TA, Selzer RR, Brent M, McLeod HL, Ley TJ, Graubert TA: wuHMM: a robust algorithm to detect DNA copy number variation using long oligonucleotide microarray data. Nucleic Acids Res. 2008, 36 (7): e41. 10.1093/nar/gkn110.
- Nadler JJ, Zou F, Huang H, Moy SS, Lauder J, Crawley JN, Threadgill DW, Wright FA, Magnuson TR: Large-scale gene expression differences across brain regions and inbred strains correlate with a behavioral phenotype. Genetics. 2006, 174 (3): 1229-1236. 10.1534/genetics.106.061481.
- Breitling R, Li Y, Tesson BM, Fu J, Wu C, Wiltshire T, Gerrits A, Bystrykh LV, de Haan G, Su AI: Genetical genomics: spotlight on QTL hotspots. PLoS Genet. 2008, 4 (10): e1000232. 10.1371/journal.pgen.1000232.
- Gerrits A, Li Y, Tesson BM, Bystrykh LV, Weersing E, Ausema A, Dontje B, Wang X, Breitling R, Jansen RC: Expression quantitative trait loci are highly sensitive to cellular differentiation state. PLoS Genet. 2009, 5 (10): e1000692. 10.1371/journal.pgen.1000692.
- Zhang B, Horvath S: A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol. 2005, 4: Article17.
- Langfelder P, Horvath S: WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008, 9: 559. 10.1186/1471-2105-9-559.
- Stranger BE, Nica AC, Forrest MS, Dimas A, Bird CP, Beazley C, Ingle CE, Dunning M, Flicek P, Koller D, et al: Population genomics of human gene expression. Nat Genet. 2007, 39 (10): 1217-1224. 10.1038/ng2142.
- Visel A, Zhu Y, May D, Afzal V, Gong E, Attanasio C, Blow MJ, Cohen JC, Rubin EM, Pennacchio LA: Targeted deletion of the 9p21 non-coding coronary artery disease risk interval in mice. Nature. 464 (7287): 409-412. 10.1038/nature08801.
- McCarroll SA, Kuruvilla FG, Korn JM, Cawley S, Nemesh J, Wysoker A, Shapero MH, de Bakker PI, Maller JB, Kirby A, et al: Integrated detection and population-genetic analysis of SNPs and copy number variation. Nat Genet. 2008, 40 (10): 1166-1174. 10.1038/ng.238.
- Young LC, Peters AC, Maeda T, Edelmann W, Kucherlapati R, Andrew SE, Tron VA: DNA mismatch repair protein Msh6 is required for optimal levels of ultraviolet-B-induced apoptosis in primary mouse fibroblasts. J Invest Dermatol. 2003, 121 (4): 876-880. 10.1046/j.1523-1747.2003.12486.x.
- Roos WP, Christmann M, Fraser ST, Kaina B: Mouse embryonic stem cells are hypersensitive to apoptosis triggered by the DNA damage O(6)-methylguanine due to high E2F1 regulated mismatch repair. Cell Death Differ. 2007, 14 (8): 1422-1432. 10.1038/sj.cdd.4402136.
- Klapacz J, Meira LB, Luchetti DG, Calvo JA, Bronson RT, Edelmann W, Samson LD: O6-methylguanine-induced cell death involves exonuclease 1 as well as DNA mismatch recognition in vivo. Proc Natl Acad Sci USA. 2009, 106 (2): 576-581. 10.1073/pnas.0811991106.
- Hakem R, Hakem A, Duncan GS, Henderson JT, Woo M, Soengas MS, Elia A, de la Pompa JL, Kagi D, Khoo W, et al: Differential requirement for caspase 9 in apoptotic pathways in vivo. Cell. 1998, 94 (3): 339-352. 10.1016/S0092-8674(00)81477-4.
- Kuida K, Haydar TF, Kuan CY, Gu Y, Taya C, Karasuyama H, Su MS, Rakic P, Flavell RA: Reduced apoptosis and cytochrome c-mediated caspase activation in mice lacking caspase 9. Cell. 1998, 94 (3): 325-337. 10.1016/S0092-8674(00)81476-2.
- Marsden VS, O'Connor L, O'Reilly LA, Silke J, Metcalf D, Ekert PG, Huang DC, Cecconi F, Kuida K, Tomaselli KJ, et al: Apoptosis initiated by Bcl-2-regulated caspase activation independently of the cytochrome c/Apaf-1/caspase-9 apoptosome. Nature. 2002, 419 (6907): 634-637. 10.1038/nature01101.
- Wu C, Orozco C, Boyer J, Leglise M, Goodale J, Batalov S, Hodge CL, Haase J, Janes J, Huss JW, et al: BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources. Genome Biol. 2009, 10 (11): R130. 10.1186/gb-2009-10-11-r130.
- Seki A, Fang G: CKAP2 is a spindle-associated protein degraded by APC/C-Cdh1 during mitotic exit. J Biol Chem. 2007, 282 (20): 15103-15113. 10.1074/jbc.M701688200.
- Hong KU, Kim HJ, Kim HS, Seong YS, Hong KM, Bae CD, Park J: Cdk1-cyclin B1-mediated phosphorylation of tumor-associated microtubule-associated protein/cytoskeleton-associated protein 2 in mitosis. J Biol Chem. 2009, 284 (24): 16501-16512. 10.1074/jbc.M900257200.
- Jeon SM, Choi B, Hong KU, Kim E, Seong YS, Bae CD, Park J: A cytoskeleton-associated protein, TMAP/CKAP2, is involved in the proliferation of human foreskin fibroblasts. Biochem Biophys Res Commun. 2006, 348 (1): 222-228. 10.1016/j.bbrc.2006.07.046.
- Schreiber V, Ame JC, Dolle P, Schultz I, Rinaldi B, Fraulob V, Menissier-de Murcia J, de Murcia G: Poly(ADP-ribose) polymerase-2 (PARP-2) is required for efficient base excision DNA repair in association with PARP-1 and XRCC1. J Biol Chem. 2002, 277 (25): 23028-23036. 10.1074/jbc.M202390200.
- Raffoul JJ, Cabelof DC, Nakamura J, Meira LB, Friedberg EC, Heydari AR: Apurinic/apyrimidinic endonuclease (APE/REF-1) haploinsufficient mice display tissue-specific differences in DNA polymerase beta-dependent base excision repair. J Biol Chem. 2004, 279 (18): 18425-18433. 10.1074/jbc.M313983200.
- Yang H, Ding Y, Hutchins LN, Szatkiewicz J, Bell TA, Paigen BJ, Graber JH, de Villena FP, Churchill GA: A customized and versatile high-density genotyping array for the mouse. Nat Methods. 2009, 6 (9): 663-666. 10.1038/nmeth.1359.
- Sudbery I, Stalker J, Simpson JT, Keane T, Rust AG, Hurles ME, Walter K, Lynch D, Teboul L, Brown SD, et al: Deep short-read sequencing of chromosome 17 from the mouse strains A/J and CAST/Ei identifies significant germline variation and candidate genes that regulate liver triglyceride levels. Genome Biol. 2009, 10 (10): R112. 10.1186/gb-2009-10-10-r112.
- Quinlan AR, Clark RA, Sokolova S, Leibowitz ML, Zhang Y, Hurles ME, Mell JC, Hall IM: Genome-wide mapping and assembly of structural variant breakpoints in the mouse genome. Genome Res.
- Rousseeuw PJ, Kaufman L: Finding Groups in Data: An Introduction to Cluster Analysis. 1990, New York: Wiley.
- Szatkiewicz JP, Beane GL, Ding Y, Hutchins L, Pardo-Manuel de Villena F, Churchill GA: An imputed genotype resource for the laboratory mouse. Mamm Genome. 2008, 19 (3): 199-208. 10.1007/s00335-008-9098-9.
- Zhao H, Nettleton D, Dekkers JC: Evaluation of linkage disequilibrium measures between multi-allelic markers as predictors of linkage disequilibrium between single nucleotide polymorphisms. Genet Res. 2007, 89 (1): 1-6. 10.1017/S0016672307008634.
- Smyth GK: Limma: linear models for microarray data. Bioinformatics and Computational Biology Solutions using R and Bioconductor. Edited by: Gentleman R, Carey V, Dudoit S, Irizarry R, Huber W. 2005, New York: Springer, 397-420.
- Smyth GK: Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004, 3: Article3.
- Storey JD, Tibshirani R: Statistical significance for genomewide studies. Proc Natl Acad Sci USA. 2003, 100 (16): 9440-9445. 10.1073/pnas.1530509100.
- Huang da W, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009, 4 (1): 44-57. 10.1038/nprot.2008.211.
- McClurg P, Janes J, Wu C, Delano DL, Walker JR, Batalov S, Takahashi JS, Shimomura K, Kohsaka A, Bass J, et al: Genomewide association analysis in diverse inbred mice: power and population structure. Genetics. 2007, 176 (1): 675-683. 10.1534/genetics.106.066241.
- Pletcher MT, McClurg P, Batalov S, Su AI, Barnes SW, Lagler E, Korstanje R, Wang X, Nusskern D, Bogue MA, et al: Use of a dense single nucleotide polymorphism map for in silico mapping in the mouse. PLoS Biol. 2004, 2 (12): e393. 10.1371/journal.pbio.0020393.
- McClurg P, Pletcher MT, Wiltshire T, Su AI: Comparative analysis of haplotype association mapping algorithms. BMC Bioinformatics. 2006, 7: 61. 10.1186/1471-2105-7-61.
- Holm S: A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics. 1979, 6: 65-70.
- Benjamini Y, Hochberg Y: Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Stat Soc B. 1995, 57: 289-300.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.