Integrative analysis of RUNX1 downstream pathways and target genes
- Joëlle Michaud1, 2, 9,
- Ken M Simpson3,
- Robert Escher1, 10,
- Karine Buchet-Poyau4, 11, 12, 16,
- Tim Beissbarth3, 13,
- Catherine Carmichael1, 2,
- Matthew E Ritchie3,
- Frédéric Schütz2, 3,
- Ping Cannon1,
- Marjorie Liu5,
- Xiaofeng Shen6,
- Yoshiaki Ito7,
- Wendy H Raskind8,
- Marshall S Horwitz8,
- Motomi Osato7,
- David R Turner6,
- Terence P Speed3,
- Maria Kavallaris5,
- Gordon K Smyth3 and
- Hamish S Scott1, 14, 15Email author
© Michaud et al; licensee BioMed Central Ltd. 2008
Received: 24 September 2007
Accepted: 31 July 2008
Published: 31 July 2008
The RUNX1 transcription factor gene is frequently mutated in sporadic myeloid and lymphoid leukemia through translocation, point mutation or amplification. It is also responsible for a familial platelet disorder with predisposition to acute myeloid leukemia (FPD-AML). The disruption of the largely unknown biological pathways controlled by RUNX1 is likely to be responsible for the development of leukemia. We have used multiple microarray platforms and bioinformatic techniques to help identify these biological pathways to aid in the understanding of why RUNX1 mutations lead to leukemia.
Here we report genes regulated either directly or indirectly by RUNX1 based on the study of gene expression profiles generated from 3 different human and mouse platforms. The platforms used were global gene expression profiling of: 1) cell lines with RUNX1 mutations from FPD-AML patients, 2) over-expression of RUNX1 and CBFβ, and 3) Runx1 knockout mouse embryos using either cDNA or Affymetrix microarrays. We observe that our datasets (lists of differentially expressed genes) significantly correlate with published microarray data from sporadic AML patients with mutations in either RUNX1 or its cofactor, CBFβ. A number of biological processes were identified among the differentially expressed genes and functional assays suggest that heterozygous RUNX1 point mutations in patients with FPD-AML impair cell proliferation, microtubule dynamics and possibly genetic stability. In addition, analysis of the regulatory regions of the differentially expressed genes has for the first time systematically identified numerous potential novel RUNX1 target genes.
This work is the first large-scale study attempting to identify the genetic networks regulated by RUNX1, a master regulator in the development of the hematopoietic system and leukemia. The biological pathways and target genes controlled by RUNX1 will have considerable importance in disease progression in both familial and sporadic leukemia as well as therapeutic implications.
The Core Binding Factor (CBF) is a transcriptional regulator complex, which is composed of two sub-units . Mammals have three genes coding for the α-subunits, RUNX1, RUNX2 and RUNX3 , and one coding for the β-subunit, CBFβ . The α-subunits recognize a specific sequence (TGT/cGGT) in the regulatory regions of their target genes in order to bind DNA directly, while the β-subunit heterodimerizes with the α-subunits but does not interact directly with the DNA. The interaction with CBFβ stabilizes the RUNX-DNA complex [3, 4] and protects the RUNX proteins from degradation .
In humans, the CBF complex containing RUNX1 as the α-subunit is one of the most frequent targets of chromosomal and genetic alterations in leukemia. Chromosomal rearrangements involving RUNX1 or CBFβ , somatic point mutations in RUNX1  and amplification of RUNX1  have all been described in acute leukemia. In addition to somatic alterations, germ-line point mutations in RUNX1 are responsible for an autosomal dominant platelet disorder with a propensity to develop leukemia (FPD-AML, OMIM 601399) [9, 10]. Interestingly, the dosage of RUNX1 protein seems to play a role in the determination of the leukemic phenotype. Indeed, low dosage of RUNX1, resulting from haploinsufficient or dominant negative mutations, lead to the development of myeloid leukemia [9–11], whereas amplification of RUNX1 gene is more often observed in lymphoid leukemia, particularly pediatric ALL . A number of observations also suggest that although RUNX1 is involved in the first steps of leukemia development, additional somatic mutations are necessary and probably determinant for the leukemic phenotype: 1) The predisposition to develop leukemia in FPD-AML patients shows that germline RUNX1 mutations are not sufficient for the development of the disease . 2) Somatic translocations are not able to induce leukemia in mouse cells on their own . 3) The translocation t(12;21), which fuses ETV6 (TEL) to RUNX1, can arise in utero but does not trigger leukemia until later in childhood, with as much as nine years latency . These additional mutations are likely to occur in molecules involved in the same biological pathways as RUNX1, as hemizygous loss of several molecules in the same biological pathway (e.g. RUNX1 and SPI1) is thought to be almost as tumorigenic as homozygous loss of one molecule (e.g. homozygous RUNX1 mutation in AML-M0) . Therefore the identification of downstream targets of RUNX1, with care to the model systems including species and cell type of origin, is of great interest in order to identify novel candidate molecules involved in leukemogenesis.
The identification of the biological pathways regulated by RUNX1 is also of importance to shed light on its in vivo function and role in leukemia development. The observation that Runx1 knockout mice show a lack of definitive hematopoietic maturation and die at embryonic stage 12 from hemorrhages in the central nervous system demonstrates that RUNX1 plays a critical role during development of the hematopoietic system [16, 17]. In addition, RUNX1 might also play a role in other systems as it is expressed in many other embryonic tissues [18–20] and in epithelial cells [19, 20]. It is furthermore overexpressed in endometrioid carcinoma  and down-regulated in gastric cancer . The in vivo function of RUNX1 is therefore yet to be fully understood.
Here we describe the combination of a number of genomic and bioinformatic approaches to identify biological pathways downstream of RUNX1, and report on a number of processes in which RUNX1 is likely to be involved. We also took advantage of the integration of these approaches in order to identify novel RUNX1 target genes.
Gene expression profiling of cells harboring different levels of RUNX1
Lymphoblastic cells derived from FPD patients heterozygous for a RUNX1 frameshift mutation (R135fs) were first analyzed. This mutation results in haploinsufficiency of RUNX1, as the mutant protein has lost its capacity to bind DNA and to transactivate the expression of the target genes . Quantitative RT-PCR on these non-leukemic lymphoblastic cells showed that affected individuals express approximately 55% of the transcript level observed in unaffected individuals (see Additional File 1 :Figure S1). The genes differentially expressed between two affected and two non-affected cell lines are therefore largely the result of a low dosage of RUNX1 protein. Using human cDNA microarrays with the Hs8k cDNA clone library from Research Genetics and a selection of control spots, 366 genes were identified as differentially expressed, of which 52% (192/366) were down-regulated in affected individuals (Figure 1 and see Additional File 2).
For overexpression studies, HeLa epithelial cells were transduced using adenoviral vectors. FACS analysis showed that over 90% of HeLa cells were transduced by a EGFP-expressing adenovirus (data not shown). This system results in a highly homogenous cell population in which small changes of expression can be identified. The wild type CBF complex α-subunit, RUNX1, was overexpressed together with the β-subunit, CBFβ (see Additional File 1: Figure S2) and seven hybridizations were performed. Following overexpression of the CBF complex, 721 genes were differentially expressed including the up-regulation of 42% of the genes (300/721; Figure 1 and see Additional File 2).
Finally, we compared the expression profiles of two wild type and two Runx1 knockout mouse embryo propers at each embryonic stages E8.5 and E12 using Affymetrix chips. Despite the heterogeneity of the samples, 931 and 297 genes were differentially expressed at embryonic stages E8.5 and E12, respectively. Of these genes, 57% (533/931) and 72% (214/297) were down-regulated in the knockout embryos (Figure 1 and see Additional File 3). These differences in expression are likely to reflect the lack of hematopoiesis and the premature death, respectively, observed in the Runx1 embryos.
We then compared the different datasets using a mean-rank gene set enrichment test (MR-GSE) in order to determine the level of connection between the 3 approaches (FPD cell lines, CBF overexpression and Runx1 knockout mouse embryos), disregarding the cell type and the organism. High correspondence was observed between the two human datasets. The correspondence between the human and the mouse datasets was not as good, although still significant. This might partially be explained by the difficulties of matching human and mouse platforms (see Additional File 1: Figure S3).
Correlation with clinical AML samples
MR-GSE tests also showed that genes differentially expressed in the B cell lines derived from FPD-AML patients tended to be differentially expressed in the blasts and mononuclear cells of 22 clinical patients with a t(8;21) translocation (p = 10-10) and of 18 patients with the inv(16) abnormality (p = 3.5 × 10-9). For example, the top 14 differentially expressed genes in the FPD-AML dataset that are also differentially expressed in the clinical samples are shown in Additional File 1 (Table S3). As a whole, these results demonstrate that the genes identified in our study are likely to play an important role in the development of the disease.
Biological processes regulated by RUNX1: bioinformatic approaches
Gene ontology enrichment
GO: Biological processes
Immune response p = 6.5 × 10 -5 36 genes
Macromolecular complex assembly p = 0.02 47 genes
Blood vessel development p = 0.06 15 genes
Response to external stimulus* p = 0.0003 18 genes
Negative regulation of apoptosis p = 0.002 16 genes
Cell growth p = 0.02 21 genes
Behavior p = 0.0003 14 genes
Response to biotic stimulus p = 0.002 19 genes
Immune system process p = 0.0006 18 genes
Cell proliferation p = 0.01 36 genes
GO: Molecular functions
Cadmium ion binding p = 0.002 4 genes
RNA binding p = 0.03 50 genes
IgG binding p = 0.006 3 genes
Cadmium ion binding p = 0.03 4 genes
Ferric-chelate reductase activity p = 0.03 2 genes
Polysaccharide binding p = 0.03 6 genes
GO: cellular component
Spindle p = 0.06 11 genes
Cell junction p = 0.06 14 genes
Cell surface p = 0.05 9 genes
Extracellular space p = 0.06 37 genes
InterPro motifs (FatiGo)
Vertebrate metallothionein p = 0.0001
Vertebrate metallothionein p = 0.02
Tubulin p = 0.04
Biological processes regulated by RUNX1: in vivo confirmations
We designed a series of assays that were performed on either cell lines, or directly on samples from FPD-AML patients with RUNX1 mutations, to confirm the disturbance of several interesting biological processes identified by the above approaches.
Heterozygous RUNX1 point mutations affect proliferation
RUNX1 modulates microtubule stability
A significant enrichment of molecules containing a common tubulin motif was observed following overexpression of the CBF complex (Table 1). Five tubulin isoforms were down-regulated following overexpression of the CBF complex. These data led to the observation that CBF overexpression affected the expression of 57 genes associated with cytoskeletal structures according to GO annotation (see Additional File 1: Table S6). This class of genes was not significantly represented in the dataset from the FPD-AML cell lines, however this may be the result of the not complete knock-down of RUNX1 in the affected individuals leading to small changes that are not detected by microarray analysis. Therefore we also tested whether microtubule stability was affected in these cell lines. Significantly higher microtubule polymer levels were observed in the affected patients compared to the unaffected individuals (Figure 5B and 5C; p < 0.002). Furthermore, the microtubules in affected cells could not be stabilized using the drug Taxol to the same extent as the unaffected cells (Figure 5D; p < 0.0003). This might result from the inability of the drug to bind to the microtubule molecule because of the unusual presence of other microtubule stabilizing proteins or from a lack of soluble tubulin molecules in the cellular environment. In any case, these results suggest that RUNX1 is involved in microtubule dynamics.
Neither the proliferation nor the tubulin defects are due to the EBV transformation of the cell lines as many independent proliferation and tubulin polymerization assays performed on lymphoblastic cell lines derived from families with predispositions to various haematological malignancies do not show similar familial clustering (data not shown).
Highly significant correspondence was observed between the FPD, CBF and mouse datasets and the genes switched on after irradiation of lymphoblasts (Figure 4). We used a glycophorin A assay to test whether the FPD-AML patients are more prone to somatic genetic mutations than unaffected individuals. This test assesses the frequency of mutation events occurring at the glycophorin A locus in erythroid progenitors in blood of heterozygous individuals (MN phenotype) . Although more samples would be necessary for corroboration, a significant trend was present between the blood of two affected patients and five unaffected individuals, suggesting that a subtle increase of mutation rate may occur when RUNX1 activity is impaired (Figure 5E; p < 0.01). This increased mutation rate appears to be higher in the assay that would detect deletions (NO), that are the predominate mutations arising due to ionizing irradiation .
Identification of potential novel RUNX1 target genes – co-expression in human tissues and hematopoietic cell lines
We reasoned that direct RUNX1 target genes must be expressed in the same tissues or cells as RUNX1. Thus, the expression patterns of a number of differentially expressed genes, chosen due to potential functions in leukemia development, were compared to that of RUNX1 (see Additional File 5). The expression of 22 genes in 20 human tissues, 19 hematopoietic cell lines and normal human bone cells was assessed using cDNA panels . 9 of these genes show a high expression in a number of hematopoietic cell lines and all the others show common expression with RUNX1 in various tissues such as liver and peripheral blood leukocytes (PBLs).
Identification of potential novel RUNX1 target genes – data overlaps
Genes differentially expressed in FPD, CBF and in E8.5/E12
Identification of potential novel RUNX1 target genes – regulatory region analysis
In order to accumulate evidence that some of the genes present in these overlaps are direct target genes, we searched for human RUNX1 binding sites, which were conserved in mouse using the oPOSSUM software (http://www.cisreg.ca/cgi-bin/oPOSSUM/opossum see Additional File 1) . Many differentially expressed genes contained at least one conserved RUNX1 binding site in their regulatory regions and the overlaps between the datasets show a higher enrichment for such genes as hypothesized above (Figure 6A).
The regions flanking five putative conserved binding sites identified in three differentially expressed genes, and one negative control region, were cloned upstream of a luciferase reporter gene and co-transfected together with plasmids expressing RUNX1 and CBFβ. These genes were selected because of their presence in the overlap between the human datasets and/or their interesting functions; ANXA1 (Annexin 1) is involved in cell proliferation and cytoskeleton regulation; ARMET (Arginine-rich, mutated in early stage tumors) is mutated in cancer; CYR61 (Cysteine-rich, angiogenic inducer, 61) promotes proliferation and angiogenesis. An increase in luciferase activity was observed for ANXA1 binding sites and for one of the ARMET binding sites and a diminution of the luciferase activity was observed for the CYR61 binding site (Figure 6B). No modification of the luciferase activity was observed for a sequence derived from the negative control CASP3 regulatory region (where no conserved binding site was identified by the oPOSSUM program). It is likely that a combination of a number of binding sites and the presence of additional co-factors are necessary for a correct and synergistic in vivo regulation of these genes and it might explain the small activity observed for the ARMET binding sites. It might also explain the activation of the ANXA1 site while this gene was repressed by the overexpression of the CBF complex.
The highly significant correlation observed between the genes identified in the FPD-AML cells and the overexpression system and clinical data on AML samples supports the hypothesis that large number of genes would be broadly regulated by RUNX1 in our various approaches disregarding of the cell type. Genes identified as differentially expressed following disregulation of RUNX1 expression level and/or in these AML samples are good candidates for targets of secondary hits during leukemogenesis downstream of RUNX1 mutation. The various approaches described in this study, including conserved binding sites and co-expression studies, will also help to further prioritize genes that might sustain secondary hits. For example, the gene encoding the Cyclin D3 (CCND3) was differentially expressed following overexpression of the CBF complex and mutations in this gene have been described in acute myeloid leukemia patients .
In order to generate insights into the in vivo role of RUNX1, we employed bioinformatics tools to identify processes that were changed following alteration of RUNX1 expression level. We have shown that genes involved in megakaryopoiesis tend to be differentially expressed in the FPD and CBF datasets, demonstrating that a large number of the differentially expressed genes may play a role in platelet formation. Enrichment for genes involved in cell proliferation was also observed in both the FPD and CBF datasets, and functional assays on the FPD-AML cell lines showed that heterozygous mutation of RUNX1 reduced proliferation of lymphoblasts. These data validate our integrative approach as they confirm studies in transgenic mice expressing the fusion proteins CBFβ-MYH11  and RUNX1-ETO , which both act in a dominant negative fashion over the wild-type protein. These mice show a decrease in both lymphoid and myeloid cell proliferation. This observation also correlates with mouse data showing that Runx1 promotes cell cycle progression from G1 to S phase . An anti-proliferative effect of a RUNX1 mutant protein may have an oncogenic effect due to an improper balance between proliferation and differentiation. For example, overexpression of RUNX1 usually results in ALL while complete or partial loss of RUNX1 results in AML development.
Our integrative approach unraveled a novel process that may play an important role in RUNX1 function, involving the cytoskeletal dynamics. Indeed following the finding that an enrichment of microtubule and cytoskeleton related molecules was observed when the CBF complex was overexpressed, functional assay using the FPD-AML cells demonstrated an increase of polymerized microtubules in FPD-AML affected cells compared to cells from unaffected individuals. Microtubules are important in many processes such as cell migration, cell division, cellular transport and signal transduction  and microtubule remodeling is essential during the cell cycle, especially during mitosis when a correct microtubule network is essential for proper chromosomal segregation . Interestingly, the fusion protein, CBFβ-MYH11 that results from inv(16), co-localizes with the actin cytoskeleton and disorganizes stress fibers and F-actin structures . A mild microtubule defect might partially explain the platelet defect observed in FPD-AML patients, as microtubules are necessary at several different stages of megakaryopoiesis including endomitosis, production of platelets from mature polyploid megakaryocytes, and release of the content of platelet granules . Moreover, mutations in the actin-binding protein WASP and the myosin heavy chain MYH9 cause the Wiskott-Aldrich  and May-Hegglin  syndromes of thrombocytopenia, respectively. However, RUNX1 is likely to regulate only specific tubulin isoforms or tissue-specific cytoskeleton-associated proteins as a strong cytoskeleton defect would be more detrimental to the whole organism. In addition, the dosage of normal RUNX1 activity necessary for normal function might differ according to cell type, and some cell types may be more susceptible than others to perturbation in RUNX1 levels. Interestingly, Taxol resistant leukemic cells have been shown to have a reduced total level of tubulin and an increased level of polymerized tubulin , similar to the results seen in the FPD-AML cells. Furthermore, a high level of survivin (BIRC5), which was down-regulated following overexpression of the CBF complex, is associated with resistance to Taxol . This is the first evidence demonstrating a relationship between RUNX1 and microtubule dynamics.
Finally, we showed that the predisposition of FPD-AML to develop leukemia may be due to an increased rate of mutation in RUNX1 heterozygous cells. Every dataset showed significant correspondence with genes involved in DNA damage response. Although not conclusive, the glycophorin A assay, which measures the frequency of the progeny of mutated erythrocyte precursors in blood, showed a mild increase in mutation frequency in FPD-AML patients compared to unaffected individuals. Recently, it was shown that the RUNX1-ETO fusion protein induces mutations in transfected U937 myeloid cells . This study demonstrated that the fusion protein regulates many genes involved in the base excision repair pathway, which mainly corrects for point mutations. Furthermore, a higher incidence of leukemia in CBFβ-MYH11 chimeras compared to normal chimeras when exposed to ENU mutagenesis has also been observed [41, 42]. This demonstrates that alteration of RUNX1 function may increase the rate of mutation and lead to an accumulation of mutated cells.
The three processes described here (proliferation, cytoskeleton stability and genomic instability) are tightly interconnected and may explain the phenotype observed in FDP-AML patients. Indeed, a proliferation defect would have an impact on megakaryopoiesis and cytoskeleton remodeling. In turn, a cytoskeleton defect could also affect proliferation and trigger chromosomal aberrations. The necessary threshold level of RUNX1 expression is likely to be cell-specific, explaining why RUNX1 heterozygous mutation affects only hematopoietic cells; nevertheless, our observations could conceivably suggest possible involvement of RUNX1 in solid-tissue tumor.
We also identified new potential RUNX1 target genes by analyzing the regulatory regions and the expression pattern of the differentially expressed genes present in the overlaps between the different platforms. Many RUNX1 target genes have already been described in the literature, mainly from in vitro studies and in mouse cells [43, 44]. Four of the published target genes, CSF1R, MYB, MPO and TIMP1, were differentially expressed in the Runx1 knockout embryos. In addition, target genes that were described more recently, including CCND3  and IGFBP3 , were identified following overexpression of the CBF complex. That there was not more correlation may be due to incomplete microarray platforms, but more importantly is likely to reflect the bias present in the published RUNX1 target genes that were identified because of their primary role in hematopoiesis and these may not represent the most common RUNX1 target genes. Interesing candidates were among the 16 genes differentially expressed in every dataset, such as Annexin I (ANXA1), which was shown to reduce inflammation, by inhibiting neutrophil recruitment  and has an anti-proliferative effect by inducing aberrant cytoskeleton formation . This gene is likely to play an important role downstream of RUNX1.
In summary, this combination of gene expression profiling platforms allowed prioritization of novel candidate genes for leukemogenesis according to distinct parameters and has shed light on RUNX1 functions by identifying biological pathways downstream of RUNX1 such as microtubule stability and genomic instability and identified a large number of potential novel RUNX1 target genes. Whether or not these are direct RUNX1 targets remains to be demonstrated by further research.
Recombinant adenoviruses expressing RUNX1 p49 isoform  or CBFβ were generated as described , except that VmRL-CMV1 and pSCOT were used as the adenovirus backbone and transfer vector respectively. For details, see Additional File 1.
Cell lines and RNA extraction
EBV-transformed lymphoblasts generating B cell lines from FPD-AML patients (Pedigree 2, individuals V:1 and V:2;)  and related unaffected individuals (Pedigree 2, individuals IV:1 and V:3) were used for the FPD microarray dataset. HeLa cells (4 × 107) were infected with a multiplicity of infection (MOI) of 100 for each adenovirus and incubated for 48 hours. The Qiagen RNeasy maxikit was used for the extraction of total RNA in each case. Runx1 knockout and wild-type embryo propers at embryonic stages E8.5 and E12 were homogenized in Trizol (Invitrogen) and total RNA extracted following the manufacturer's protocol.
Runx1 knockout mice have been previously described . They are maintain on a BalbC genetic background at the Biological Resource Center, (Biopolis, Singapore) and all animal experiments followed the guidelines set by the National Advisory Committee for Laboratory Animal Research. Wild-type and Runx1 knockout mouse embryo propers were harvested at embryonic stages E8.5 and E12.
cDNA Microarray hybridization
cDNA microarrays were printed by the Australian Genome Research Facility (AGRF) with the Hs8k cDNA clone library from Research Genetics and a selection of control spots. In total there were 8132 EST probes printed in duplicate. The array also contained 12 copies of the Lucidea Universal ScoreCard controls (Amersham). Labeling, hybridization, and washing were performed as described . In the case of the FPD dataset, four hybridizations were performed comparing two affected individuals against two unaffected individuals of pedigree 2. For the overexpression system, 2 different RNA samples from HeLa cells overexpressing EGFP were used as reference and 2 different RNA samples from HeLa cells overexpressing RUNX1 and CBFβ were used as experimental RNAs. Seven hybridizations (including 3 dyeswaps) were performed. The data were filtered for genes whose difference in expression was due to EGFP, using four hybridizations between EGFP expressing cells and normal HeLa cells.
Affymetrix genechip hybridization
Labelling, hybridization and washing were performed by the AGRF following the Affymetrix protocol (701725 rev5). Briefly, total RNA (100 ng) was amplified using T7-oligo dT and the Megascript T7 kit (Ambion). A second round of cDNA synthesis was performed using the total amount of the amplified RNA. Biotin-labeled RNA was subsequently synthesized using the GeneChip IVT Labeling Kit. Labelled RNA (15 μg) was fragmented and the mouse genome 430 2.0 arrays were hybridized overnight and washed as described before being scanned using a GeneChip scanner 3000 (Affymetrix). Two biological replicates were used for each condition.
The cDNA microarray images were analyzed using SPOT software . Spots were assigned quality weights based on their segmented pixel areas and the log-ratios were print-tip loess normalized . Duplicate printings of each probe on each array were combined using the common correlation method of . For the mouse Affymetrix GeneChips, the intensities for each probe set were normalized and summarized using the Robust Multi-array Analysis algorithm . Differential expression was assessed using empirical Bayes moderated t- and F-statistics from the LIMMA package . Recognizing that p-value calculations make normality and other distributional assumptions, which are hard to verify for microarray data, we decided to use control probes and appropriate plots to guide our criteria for differential expression as far as possible. For the cDNA data, conservative threshold values for differential expression were chosen to minimize the false-positive and false-negative rates estimated from Scorecard control probes printed on the arrays. This resulted in a threshold value of |t|>4 for the FPD data. Of 204 calibration control probes printed on the arrays, none reached this cutoff for statistical significance, suggesting a false discovery rate less than 1/204, without relying on any distributional assumptions. For the mouse Affymetrix data, a threshold of |t|>3 was chosen from a q-q plot of the moderated t-statistics.
For the overexpression system arrays, a combination of criteria was used to assess differential expression. These arrays were analyzed as part of a larger microarray study using the same overexpression system to study a range of AML related genes. Genes with |t|>4 were initially assigned as differentially expression, with only one calibration control probe reaching this threshold. A series of nested F-tests (with p-value cutoff 1e-5) was also performed using the larger dataset in order to get an improved estimate of the number of genes significantly differentially expressed in more than one condition simultaneously. This increased the number of differentially expressed genes by a third. Finally, genes were removed from the differentially expressed list if their response to RUNX1/CBFβ transduction was not significantly greater than their response to the adenovirus alone.
All the analyzed datasets have been deposited at the NCBI Gene Expression Omnibus http://www.ncbi.nlm.nih.gov/geo/ under accession numbers GSE2592 (mouse Affymetrix data), GSE2593 (overexpression experiment) and GSE2594 (FPD-AML arrays).
Mean-rank gene set enrichment tests (MR-GSE)
A version of statistical gene set testing was used to investigate associations between the expression profiles obtained from different experiments. Each test uses a set of genes selected as differentially expressed in one data set (the reference dataset) and determines whether the gene set tends to be highly ranked in another dataset (the test dataset). The test statistic is the mean rank of the gene set in the test dataset. This approach, which we call mean-rank gene set enrichment (MR-GSE), is very similar to Tian et al's Tk test  and Kim and Volsky's PAGE test . The main difference is that MR-GSE averages the ranks of t-statistics instead of t-statistics themselves, which makes it less influenced by individual genes in the gene set. This has the advantage of giving more weight to gene sets with a larger number of active genes, and it also allows us to use the same testing procedure with a range of ranking procedures other than t-statistics. Where possible, MR-GSE is used with moderated t-statistics rather than ordinary t-statistics, as these are preferable for microarray analysis including gene set testing [56, 58]. Unlike earlier Gene Set Enrichment Analysis methods , MR-GSE can be used to test individual gene sets in isolation and has good power even for microarray experiments with small to moderate sample sizes.
The null hypothesis tested by MR-GSE is that the gene set is randomly chosen. When the reference and test datasets share the same microarray platform, p-values can be computed using Wilcoxon two-sample rank tests . When the reference and test datasets are based on different microarray platforms (cDNA vs Affymetrix), the p-values were instead computed using random permutations of probes on the reference arrays. This was done to avoid any bias arising from probe selection on the cDNA platform or from multiple probe-sets for individual genes on the Affymetrix platform.
For the integration of gene expression profiling data and biological processes regulated by RUNX1, genes were ranked in the test datasets by absolute moderated t-statistic. For the correlation with clinical AML samples, the test dataset was the previously published expression profiling data on 285 AML patients and 8 healthy individuals . In this case, the Affymetrix probe-sets were ranked according to their correlation with the 11 RUNX1 probe-sets across the 293 RNA samples. Correlations were computed using Gene Recommender , which provides a very robust correlation measure suitable for this purpose. Probe-sets were also ranked by moderated t-statistic on their ability to distinguish the healthy patients from the 22 patients with t(8;21) or from the 18 patients with inv(16).
The MR-GSE p-values are computed by permuting genes rather than permuting arrays. This is necessary because the tests are designed for use with small numbers of arrays. The computation necessarily assumes that different genes have statistically independent expression values within experimental groups. When the gene set contains genes which are highly interdependent, and which vary substantially between biological replicates, the test may be anti-conservative. We checked the independence assumption for our data by computing average inter-gene correlations using REML. The inter-gene correlations were found to be generally very small at the expression level (data not shown), suggesting that the MS-GSE results are meaningful on our data.
Bioinformatic identification of biological processes and cross-platform comparison
Enrichment of a gene ontology annotation in a dataset of differentially expressed genes compared to the genes present on the array was determined using the GOStat program http://gostat.wehi.edu.au/. For the MR-GSE test, relevant gene sets were taken from published reviews or independent microarray data (see Additional File 1: Table S4)
BrdU proliferation assay
The Cell Proliferation ELISA, BrdU kit (Roche) was used to measure proliferation of cell lines derived from two independent families, including the family used for the microarray experiment (Pedigree 2)  and an additional family harboring a nonsense mutation Y260X present outside of the Runt domain (Pedigree 3, affected individuals III:7 and IV:4 and one unaffected individual III:8 ). Briefly, the cells were split into 96-well plates at an equal density. BrdU was added to the cells for 4 hours and the cells were then treated according to the manufacturer's protocol. The optical density (OD450) was measured on an ELISA plate reader. Technical triplicates and two independent experiments were performed. A two-way ANOVA (analysis of variance) test was performed.
Tubulin polymerization assay
Soluble (cytosolic) and polymerized (cytoskeletal) fractions of tubulin were separated from the cell lines treated with or without 4 μg/ml of Taxol as described . The same cell lines used for the proliferation assay were assessed. Results were expressed as a percentage of polymerized tubulin by dividing the densitometric value of polymerized tubulin (insoluble) by the total tubulin content (sum of densitometric value of soluble and polymerized tubulin). Three independent experiments were performed and a two-way ANOVA was done.
Glycophorin A assay
Blood samples were collected in EDTA-tubes, with informed consent, from seven individuals heterozygous (MN phenotype) at the glycophorin A locus. These include: a FPD-AML patient harboring a frameshift mutation (N69fsX94) and her unaffected sister, a second FPD-AML patient harboring a nonsense mutation (Pedigree 3 (Y260X), individual IV:4)  and 4 independent unaffected individuals. The assay is described in detail in Additional File 1. A two-way ANOVA test was performed to compare the 5 controls to the 2 affected individuals.
Luciferase reporter assay
Genomic regions overlapping the conserved binding sites (300–400 bps) were amplified from BACs and cloned into pGL3-Basic vector (Promega #E1751). Each construct was co-transfected into HeLa cells using lipofectamine 2000 (Invitrogen) along with pSCOT plasmids expressing RUNX1 and CBFβ or empty vector to keep the amount of plasmid constant. For normalization, 20 ng of pRL-TK vector (Renilla luciferase Promega #E2241) was also co-transfected. The luciferase activities were measured using the Dual-Luciferase Reporter Assay System (Promega #E1910). The increase or decrease in luciferase activity was determined as a function of the endogenous activity of each construct.
cDNA panel production
We thank Dr. S. Brenz Verca and Prof S. Rusconi for providing the adenoviral backbone, transfer vector (pScot) and the HER911 packaging cell line. We thank Prof. S.E. Antonarakis for his support during the adenovirus production. This project was supported by grants from the Ligue Genevoise Contre le Cancer, the Fondation Pour la Lutte Contre le Cancer, the Fondation Dr Henri Dubois-Ferrière Dinu Lipatti, the Nossal Leadership Fellowship from the Walter and Eliza Hall Institute of Medical Research, NHMRC Grants (257501, 257529) and NHMRC fellowship 171601 to HSS; International Postgraduate Research (Australian government) and Melbourne International Research scholarships to JAM and FS, an Australian postgraduate award to MER, an NHMRC Dora Lush Postgraduate Award (305552) to CC, a Swiss National Science Foundation and Bernische Krebsliga fellowships to RE, NIH (DK58161 and HL079507), to MH and NHMRC Career Development Award 300580 to MK.
- Kamachi Y, Ogawa E, Asano M, Ishida S, Murakami Y, Satake M, Ito Y, Shigesada K: Purification of a mouse nuclear factor that binds to both the A and B cores of the polyomavirus enhancer. J Virol. 1990, 64 (10): 4808-4819.PubMedPubMed Central
- Levanon D, Groner Y: Structure and regulated expression of mammalian RUNX genes. Oncogene. 2004, 23 (24): 4211-4219. 10.1038/sj.onc.1207670.PubMedView Article
- Ogawa E, Maruyama M, Kagoshima H, Inuzuka M, Lu J, Satake M, Shigesada K, Ito Y: PEBP2/PEA2 represents a family of transcription factors homologous to the products of the Drosophila runt gene and the human AML1 gene. . 1993, 90 (14): 6859-6863.
- Wang S, Wang Q, Crute BE, Melnikova IN, Keller SR, Speck NA: Cloning and characterization of subunits of the T-cell receptor and murine leukemia virus enhancer core-binding factor. Mol Cell Biol. 1993, 13 (6): 3324-3339.PubMedPubMed CentralView Article
- Huang G, Shigesada K, Ito K, Wee HJ, Yokomizo T, Ito Y: Dimerization with PEBP2beta protects RUNX1/AML1 from ubiquitin-proteasome-mediated degradation. Embo J. 2001, 20 (4): 723-733. 10.1093/emboj/20.4.723.PubMedPubMed CentralView Article
- Lutterbach B, Hiebert SW: Role of the transcription factor AML-1 in acute leukemia and hematopoietic differentiation. Gene. 2000, 245 (2): 223-235. 10.1016/S0378-1119(00)00014-7.PubMedView Article
- Osato M: Point mutations in the RUNX1/AML1 gene: another actor in RUNX leukemia. Oncogene. 2004, 23 (24): 4284-4296. 10.1038/sj.onc.1207779.PubMedView Article
- Roumier C, Fenaux P, Lafage M, Imbert M, Eclache V, Preudhomme C: New mechanisms of AML1 gene alteration in hematological malignancies. Leukemia. 2003, 17 (1): 9-16. 10.1038/sj.leu.2402766.PubMedView Article
- Michaud J, Wu F, Osato M, Cottles GM, Yanagida M, Asou N, Shigesada K, Ito Y, Benson KF, Raskind WH, Rossier C, Antonarakis SE, Israels S, McNicol A, Weiss H, Horwitz M, Scott HS: In vitro analyses of known and novel RUNX1/AML1 mutations in dominant familial platelet disorder with predisposition to acute myelogenous leukemia: implications for mechanisms of pathogenesis. Blood. 2002, 99 (4): 1364-1372. 10.1182/blood.V99.4.1364.PubMedView Article
- Song WJ, Sullivan MG, Legare RD, Hutchings S, Tan X, Kufrin D, Ratajczak J, Resende IC, Haworth C, Hock R, Loh M, Felix C, Roy DC, Busque L, Kurnit D, Willman C, Gewirtz AM, Speck NA, Bushweller JH, Li FP, Gardiner K, Poncz M, Maris JM, Gilliland DG: Haploinsufficiency of CBFA2 causes familial thrombocytopenia with propensity to develop acute myelogenous leukaemia. Nat Genet. 1999, 23 (2): 166-175. 10.1038/13793.PubMedView Article
- Osato M, Asou N, Abdalla E, Hoshino K, Yamasaki H, Okubo T, Suzushima H, Takatsuki K, Kanno T, Shigesada K, Ito Y: Biallelic and heterozygous point mutations in the runt domain of the AML1/PEBP2alphaB gene associated with myeloblastic leukemias. Blood. 1999, 93 (6): 1817-1824.PubMed
- Mikhail FM, Serry KA, Hatem N, Mourad ZI, Farawela HM, El Kaffash DM, Coignet L, Nucifora G: AML1 gene over-expression in childhood acute lymphoblastic leukemia. Leukemia. 2002, 16 (4): 658-668. 10.1038/sj.leu.2402399.PubMedView Article
- Rhoades KL, Hetherington CJ, Harakawa N, Yergeau DA, Zhou L, Liu LQ, Little MT, Tenen DG, Zhang DE: Analysis of the role of AML1-ETO in leukemogenesis, using an inducible transgenic mouse model. Blood. 2000, 96 (6): 2108-2115.PubMed
- Ford AM, Bennett CA, Price CM, Bruin MC, Van Wering ER, Greaves M: Fetal origins of the TEL-AML1 fusion gene in identical twins with leukemia. Proc Natl Acad Sci U S A. 1998, 95 (8): 4584-4588. 10.1073/pnas.95.8.4584.PubMedPubMed CentralView Article
- Cook WD, McCaw BJ: Accommodating haploinsufficient tumor suppressor genes in Knudson's model. Oncogene. 2000, 19 (30): 3434-3438. 10.1038/sj.onc.1203653.PubMedView Article
- Okuda T, van Deursen J, Hiebert SW, Grosveld G, Downing JR: AML1, the target of multiple chromosomal translocations in human leukemia, is essential for normal fetal liver hematopoiesis. Cell. 1996, 84 (2): 321-330. 10.1016/S0092-8674(00)80986-1.PubMedView Article
- Wang Q, Stacy T, Binder M, Marin-Padilla M, Sharpe AH, Speck NA: Disruption of the Cbfa2 gene causes necrosis and hemorrhaging in the central nervous system and blocks definitive hematopoiesis. Proc Natl Acad Sci U S A. 1996, 93 (8): 3444-3449. 10.1073/pnas.93.8.3444.PubMedPubMed CentralView Article
- North T, Gu TL, Stacy T, Wang Q, Howard L, Binder M, Marin-Padilla M, Speck NA: Cbfa2 is required for the formation of intra-aortic hematopoietic clusters. Development. 1999, 126 (11): 2563-2575.PubMed
- Levanon D, Brenner O, Negreanu V, Bettoun D, Woolf E, Eilam R, Lotem J, Gat U, Otto F, Speck N, Groner Y: Spatial and temporal expression pattern of Runx3 (Aml2) and Runx1 (Aml1) indicates non-redundant functions during mouse embryogenesis. Mech Dev. 2001, 109 (2): 413-417. 10.1016/S0925-4773(01)00537-8.PubMedView Article
- Lian JB, Balint E, Javed A, Drissi H, Vitti R, Quinlan EJ, Zhang L, Van Wijnen AJ, Stein JL, Speck N, Stein GS: Runx1/AML1 hematopoietic transcription factor contributes to skeletal development in vivo. J Cell Physiol. 2003, 196 (2): 301-311. 10.1002/jcp.10316.PubMedView Article
- Planaguma J, Diaz-Fuertes M, Gil-Moreno A, Abal M, Monge M, Garcia A, Baro T, Thomson TM, Xercavins J, Alameda F, Reventos J: A differential gene expression profile reveals overexpression of RUNX1/AML1 in invasive endometrioid carcinoma. Cancer Res. 2004, 64 (24): 8846-8853. 10.1158/0008-5472.CAN-04-2066.PubMedView Article
- Sakakura C, Hagiwara A, Miyagawa K, Nakashima S, Yoshikawa T, Kin S, Nakase Y, Ito K, Yamagishi H, Yazumi S, Chiba T, Ito Y: Frequent downregulation of the runt domain transcription factors RUNX1, RUNX3 and their cofactor CBFB in gastric cancer. Int J Cancer. 2005, 113 (2): 221-228. 10.1002/ijc.20551.PubMedView Article
- Valk PJ, Verhaak RG, Beijen MA, Erpelinck CA, Barjesteh van Waalwijk van Doorn-Khosrovani S, Boer JM, Beverloo HB, Moorhouse MJ, van der Spek PJ, Lowenberg B, Delwel R: Prognostically useful gene-expression profiles in acute myeloid leukemia. N Engl J Med. 2004, 350 (16): 1617-1628. 10.1056/NEJMoa040465.PubMedView Article
- Albertini RJ, Hayes RB: Somatic cell mutations in cancer epidemiology. IARC Sci Publ. 1997, 159-184.
- Sankaranarayanan K, Wassom JS: Ionizing radiation and genetic risks XIV. Potential research directions in the post-genome era based on knowledge of repair of radiation-induced DNA double-strand breaks in mammalian somatic cells and the origin of deletions associated with human genomic disorders. Mutat Res. 2005, 578 (1-2): 333-370.PubMedView Article
- Michaud J, Kudoh J, Berry A, Bonne-Tamir B, Lalioti MD, Rossier C, Shibuya K, Kawasaki K, Asakawa S, Minoshima S, Shimizu N, Antonarakis SE, Scott HS: Isolation and characterization of a human chromosome 21q22.3 gene (WDR4) and its mouse homologue that code for a WD-repeat protein. Genomics. 2000, 68 (1): 71-79. 10.1006/geno.2000.6258.PubMedView Article
- Ho Sui SJ, Mortimer JR, Arenillas DJ, Brumm J, Walsh CJ, Kennedy BP, Wasserman WW: oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes. Nucleic Acids Res. 2005, 33 (10): 3154-3164. 10.1093/nar/gki624.PubMedPubMed CentralView Article
- Scherr M, Eder M: Modulation of gene expression by siRNA in hematopoietic cells. Curr Opin Drug Discov Devel. 2005, 8 (2): 262-269.PubMed
- Smith ML, Arch R, Smith LL, Bainton N, Neat M, Taylor C, Bonnet D, Cavenagh JD, Andrew Lister T, Fitzgibbon J: Development of a human acute myeloid leukaemia screening panel and consequent identification of novel gene mutation in FLT3 and CCND3. Br J Haematol. 2005, 128 (3): 318-323. 10.1111/j.1365-2141.2004.05324.x.PubMedView Article
- Cao W, Britos-Bray M, Claxton DF, Kelley CA, Speck NA, Liu PP, Friedman AD: CBF beta-SMMHC, expressed in M4Eo AML, reduced CBF DNA-binding and inhibited the G1 to S cell cycle transition at the restriction point in myeloid and lymphoid cells. Oncogene. 1997, 15 (11): 1315-1327. 10.1038/sj.onc.1201305.PubMedView Article
- Strom DK, Nip J, Westendorf JJ, Linggi B, Lutterbach B, Downing JR, Lenny N, Hiebert SW: Expression of the AML-1 oncogene shortens the G(1) phase of the cell cycle. J Biol Chem. 2000, 275 (5): 3438-3445. 10.1074/jbc.275.5.3438.PubMedView Article
- Jordan MA, Wilson L: Microtubules as a target for anticancer drugs. Nat Rev Cancer. 2004, 4 (4): 253-265. 10.1038/nrc1317.PubMedView Article
- Pearson CG, Bloom K: Dynamic microtubules lead the way for spindle positioning. Nat Rev Mol Cell Biol. 2004, 5 (6): 481-492. 10.1038/nrm1402.PubMedView Article
- Tanaka Y, Watanabe T, Chiba N, Niki M, Kuroiwa Y, Nishihira T, Satomi S, Ito Y, Satake M: The protooncogene product, PEBP2beta/CBFbeta, is mainly located in the cytoplasm and has an affinity with cytoskeletal structures. Oncogene. 1997, 15 (6): 677-683. 10.1038/sj.onc.1201235.PubMedView Article
- Lecine P, Italiano JE, Kim SW, Villeval JL, Shivdasani RA: Hematopoietic-specific beta 1 tubulin participates in a pathway of platelet biogenesis dependent on the transcription factor NF-E2. Blood. 2000, 96 (4): 1366-1373.PubMed
- Derry JM, Ochs HD, Francke U: Isolation of a novel gene mutated in Wiskott-Aldrich syndrome. Cell. 1994, 78 (4): 635-644. 10.1016/0092-8674(94)90528-2.PubMedView Article
- Seri M, Cusano R, Gangarossa S, Caridi G, Bordo D, Lo Nigro C, Ghiggeri GM, Ravazzolo R, Savino M, Del Vecchio M, d'Apolito M, Iolascon A, Zelante LL, Savoia A, Balduini CL, Noris P, Magrini U, Belletti S, Heath KE, Babcock M, Glucksman MJ, Aliprandis E, Bizzaro N, Desnick RJ, Martignetti JA: Mutations in MYH9 result in the May-Hegglin anomaly, and Fechtner and Sebastian syndromes. The May-Heggllin/Fechtner Syndrome Consortium. Nat Genet. 2000, 26 (1): 103-105. 10.1038/79063.PubMedView Article
- Dumontet C, Jaffrezou JP, Tsuchiya E, Duran GE, Chen G, Derry WB, Wilson L, Jordan MA, Sikic BI: Resistance to microtubule-targeted cytotoxins in a K562 leukemia cell variant associated with altered tubulin expression and polymerization. Bull Cancer. 2004, 91 (5): E81-112.PubMed
- Zaffaroni N, Pennati M, Colella G, Perego P, Supino R, Gatti L, Pilotti S, Zunino F, Daidone MG: Expression of the anti-apoptotic gene survivin correlates with taxol resistance in human ovarian cancer. Cell Mol Life Sci. 2002, 59 (8): 1406-1412. 10.1007/s00018-002-8518-3.PubMedView Article
- Alcalay M, Meani N, Gelmetti V, Fantozzi A, Fagioli M, Orleth A, Riganelli D, Sebastiani C, Cappelli E, Casciari C, Sciurpi MT, Mariano AR, Minardi SP, Luzi L, Muller H, Di Fiore PP, Frosina G, Pelicci PG: Acute myeloid leukemia fusion proteins deregulate genes involved in stem cell maintenance and DNA repair. J Clin Invest. 2003, 112 (11): 1751-1761.PubMedPubMed CentralView Article
- Castilla LH, Garrett L, Adya N, Orlic D, Dutra A, Anderson S, Owens J, Eckhaus M, Bodine D, Liu PP: The fusion gene Cbfb-MYH11 blocks myeloid differentiation and predisposes mice to acute myelomonocytic leukaemia. Nat Genet. 1999, 23 (2): 144-146. 10.1038/13776.PubMedView Article
- Yuan Y, Zhou L, Miyamoto T, Iwasaki H, Harakawa N, Hetherington CJ, Burel SA, Lagasse E, Weissman IL, Akashi K, Zhang DE: AML1-ETO expression is directly involved in the development of acute myeloid leukemia in the presence of additional mutations. Proc Natl Acad Sci U S A. 2001, 98 (18): 10398-10403. 10.1073/pnas.171321298.PubMedPubMed CentralView Article
- Michaud J, Scott HS, Escher R: AML1 interconnected pathways of leukemogenesis. Cancer Invest. 2003, 21 (1): 105-136. 10.1081/CNV-120018821.PubMedView Article
- Okada H, Watanabe T, Niki M, Takano H, Chiba N, Yanai N, Tani K, Hibino H, Asano S, Mucenski ML, Ito Y, Noda T, Satake M: AML1(-/-) embryos do not express certain hematopoiesis-related gene transcripts including those of the PU.1 gene. Oncogene. 1998, 17 (18): 2287-2293. 10.1038/sj.onc.1202151.PubMedView Article
- Bernardin-Fried F, Kummalue T, Leijen S, Collector MI, Ravid K, Friedman AD: AML1/RUNX1 increases during G1 to S cell cycle progression independent of cytokine-dependent phosphorylation and induces cyclin D3 gene expression. J Biol Chem. 2004, 279 (15): 15678-15687. 10.1074/jbc.M310023200.PubMedView Article
- Iwatsuki K, Tanaka K, Kaneko T, Kazama R, Okamoto S, Nakayama Y, Ito Y, Satake M, Takahashi S, Miyajima A, Watanabe T, Hara T: Runx1 promotes angiogenesis by downregulation of insulin-like growth factor-binding protein-3. Oncogene. 2005, 24 (7): 1129-1137. 10.1038/sj.onc.1208287.PubMedView Article
- Perretti M, Ahluwalia A, Harris JG, Goulding NJ, Flower RJ: Lipocortin-1 fragments inhibit neutrophil accumulation and neutrophil-dependent edema in the mouse. A qualitative comparison with an anti-CD11b monoclonal antibody. J Immunol. 1993, 151 (8): 4306-4314.PubMed
- Alldridge LC, Bryant CE: Annexin 1 regulates cell proliferation by disruption of cell morphology and inhibition of cyclin D1 expression through sustained activation of the ERK1/2 MAPK signal. Exp Cell Res. 2003, 290 (1): 93-107. 10.1016/S0014-4827(03)00310-0.PubMedView Article
- Levanon D, Glusman G, Bangsow T, Ben-Asher E, Male DA, Avidan N, Bangsow C, Hattori M, Taylor TD, Taudien S, Blechschmidt K, Shimizu N, Rosenthal A, Sakaki Y, Lancet D, Groner Y: Architecture and anatomy of the genomic locus encoding the human leukemia-associated transcription factor RUNX1/AML1. Gene. 2001, 262 (1-2): 23-33. 10.1016/S0378-1119(00)00532-1.PubMedView Article
- Chaussade C, Pirola L, Bonnafous S, Blondeau F, Brenz-Verca S, Tronchere H, Portis F, Rusconi S, Payrastre B, Laporte J, Van Obberghen E: Expression of myotubularin by an adenoviral vector demonstrates its function as a phosphatidylinositol 3-phosphate [PtdIns(3)P] phosphatase in muscle cell lines: involvement of PtdIns(3)P in insulin-stimulated glucose transport. Mol Endocrinol. 2003, 17 (12): 2448-2460. 10.1210/me.2003-0261.PubMedView Article
- Smyth GK, Michaud J, Scott HS: Use of within-array replicate spots for assessing differential expression in microarray experiments. Bioinformatics. 2005, 21 (9): 2067-2075. 10.1093/bioinformatics/bti270.PubMedView Article
- Buckley MJ: Spot User's Guide. CSIRO Mathematical and Information Sciences, Sydney, Australia. 2000
- Smyth GK, Speed T: Normalization of cDNA microarray data. Methods. 2003, 31 (4): 265-273. 10.1016/S1046-2023(03)00155-5.PubMedView Article
- Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP: Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 2003, 31 (4): e15-10.1093/nar/gng015.PubMedPubMed CentralView Article
- Smyth GK: Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Statistical Applications in Genetics and Molecular Biology. 2004, 3 (1): article 3-10.2202/1544-6115.1027.View Article
- Tian L, Greenberg SA, Kong SW, Altschuler J, Kohane IS, Park PJ: Discovering statistically significant pathways in expression profiling studies. Proc Natl Acad Sci U S A. 2005, 102 (38): 13544-13549. 10.1073/pnas.0506577102.PubMedPubMed CentralView Article
- Kim SY, Volsky DJ: PAGE: parametric analysis of gene set enrichment. BMC Bioinformatics. 2005, 6: 144-10.1186/1471-2105-6-144.PubMedPubMed CentralView Article
- Dinu I, Potter JD, Mueller T, Liu Q, Adewale AJ, Jhangri GS, Einecke G, Famulski KS, Halloran P, Yasui Y: Improving gene set analysis of microarray data by SAM-GS. BMC Bioinformatics. 2007, 8: 242-10.1186/1471-2105-8-242.PubMedPubMed CentralView Article
- Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005, 102 (43): 15545-15550. 10.1073/pnas.0506580102.PubMedPubMed CentralView Article
- Hollander M, Wolfe DA: Nonparametric stitistical inference. 1973, New York , John Wiley & Sons, 27-33.
- Owen AB, Stuart J, Mach K, Villeneuve AM, Kim S: A gene recommender algorithm to identify coexpressed genes in C. elegans. Genome Res. 2003, 13 (8): 1828-1837.PubMedPubMed Central
- Beissbarth T, Speed TP: GOstat: find statistically overrepresented Gene Ontologies within a group of genes. Bioinformatics. 2004, 20 (9): 1464-1465. 10.1093/bioinformatics/bth088.PubMedView Article
- Verrills NM, Flemming CL, Liu M, Ivery MT, Cobon GS, Norris MD, Haber M, Kavallaris M: Microtubule alterations and mutations induced by desoxyepothilone B: implications for drug-target interactions. Chem Biol. 2003, 10 (7): 597-607. 10.1016/S1074-5521(03)00141-8.PubMedView Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.