Long-range transcriptional regulation by the p110 CUX1 homeodomain protein on the ENCODE array

  • Charles Vadnais1, 2,

    Affiliated with

    • Arif A Awan1,

      Affiliated with

      • Ryoko Harada1,

        Affiliated with

        • Pier-Luc Clermont2,

          Affiliated with

          • Lam Leduy1,

            Affiliated with

            • Ginette Bérubé1 and

              Affiliated with

              • Alain Nepveu1, 2, 3, 4Email author

                Affiliated with

                BMC Genomics201314:258

                DOI: 10.1186/1471-2164-14-258

                Received: 15 October 2012

                Accepted: 26 March 2013

                Published: 16 April 2013

                Abstract

                Background

                Overexpression of the Cut homeobox 1 gene, CUX1, inversely correlates with patient survival in breast cancers. Cell-based assays and molecular studies have revealed that transcriptional regulation by CUX1 involves mostly the proteolytically processed p110 isoform. As there is no antibody specific to p110 CUX1 only, an alternate strategy must be employed to identify its targets.

                Results

                We expressed physiological levels of a tagged-p110 CUX1 protein and performed chromatin affinity purification followed by hybridization on ENCODE and promoter arrays. Targets were validated by chromatin immunoprecipitation and transcriptional regulation by CUX1 was analyzed in expression profiling and RT-qPCR assays following CUX1 knockdown or p110 CUX1 overexpression. Approximately 47% and 14% of CUX1 binding sites were respectively mapped less than 4 Kbp, or more than 40 Kbp, away from a transcription start site. More genes exhibited changes in expression following CUX1 knockdown than p110 CUX1 overexpression. CUX1 directly activated or repressed 7.4% and 8.4% of putative targets identified on the ENCODE and promoter arrays respectively. This proportion increased to 11.2% for targets with 2 binding sites or more. Transcriptional repression was observed in a slightly higher proportion of target genes. The CUX1 consensus binding motif, ATCRAT, was found at 47.2% of the CUX1 binding sites, yet only 8.3% of the CUX1 consensus motifs present on the array were bound in vivo. The presence of a consensus binding motif did not have an impact on whether a target gene was repressed or activated. Interestingly, the distance between a binding site and a transcription start site did not significantly reduced the ability of CUX1 to regulate a target gene. Moreover, CUX1 not only was able to regulate the next adjacent gene, but also regulated the gene located beyond this one as well as the gene located further away in the opposite direction.

                Conclusion

                Our results demonstrate that p110 CUX1 can activate or repress transcription when bound at a distance and can regulate more than one gene on certain genomic loci.

                Keywords

                ChAP-chip Chromatin immunoprecipitation ENCODE and promoter microarrays Expression profiling shRNA Lentivirus overexpression Transcriptional activation and repression Regulation at a distance Cut homeobox 1 (CUX1)

                Background

                The last decade has seen significant advances in the field of transcription. The discovery of nuclear histone acetyltransferases (HATs) in the mid nineties has literally opened a new field of investigation into post-translational modifications that target histones and modulate the chromatin state either locally or over large genomic loci. The multiple types of modifications that take place on specific histone residues and the regulatory cascades that can be triggered in this manner led investigators to propose that a "histone code" regulates gene expression in a manner reminiscent of the genetic code translating nucleic acid coding sequences into protein sequences [1, 2]. In parallel, a number of novel experimental approaches have contributed to move the transcription field from a gene-by-gene approach focused on core promoters to a genome-wide non-biased approach that enables us to study large numbers of transcriptional targets as well as the mechanisms by which these targets are regulated [3]. Recent tools in our arsenal include the increasing availability of genomic microarrays [4], siRNA-mediated gene knockdown [5], more efficient virus-based gene delivery systems [6, 7], and high-throughput sequencing [8]. Importantly, the "rediscovery" of chromatin immunoprecipitation combined with the development of microchip arrays containing large numbers of genomic sequences has opened new horizons. Indeed, chromatin immunoprecipitation was first described in the mid eighties by the group of John T. Lis who used this assay to show that RNA polymerase II molecules were already present at the 5' end of the hsp70 gene in uninduced cells and that heat shock somehow enabled transcription elongation to take place [9]. Curiously, the method was not applied to specific transcription factors before another decade [10]. Interestingly, the genomic microarray that was designed as part of the ENCODE project provided a sampling of the human genome that can be interrogated to define the distribution types of transcriptional regulation of specific transcription factors [11]. The information thus gathered has forced us to reconsider our original understanding of basic mechanisms of transcriptional regulation [12]. For example, a common belief was that a specific transcription factor could bind to a few dozen genes whose core promoters contain its consensus binding site as defined in vitro, and once recruited to a promoter could almost single-handedly regulate transcription [13]. We now know that c-MYC binds to approximately 20% of gene promoters and is also capable of regulating genes at a distance [1416]. Another major conceptual advance concerns the criteria to define a transcriptional target. Experimental evidence typically included the presence of a consensus binding motif within a core promoter, in vitro binding assays and luciferase reporter assays. While these assays are still employed, it is clear that they cannot provide definitive evidence that a transcription factor regulates a specific gene. Additional evidence must also include chromatin immunoprecipitation assays to demonstrate "in vivo" DNA binding, and change in expression of the endogenous gene target in response to the knockdown and/or overexpression of the transcription factor.

                Cut homeobox 1 (CUX1) has previously been called CCAAT-displacement protein (CDP), CDP/Cut and Cut-like 1 (CUTL1). CUX1 encodes two main isoforms that exhibit different DNA binding and transcriptional properties (reviewed in [17]). The full-length protein, p200 CUX1, is a very abundant protein that binds DNA with extremely fast kinetics [18]. In mid-G1 phase, 1% to 10% of p200 CUX1 is proteolytically processed by a nuclear isoform of cathepsin L to produce the p110 CUX1 isoform [19, 20]. This shorter isoform can stably interact with DNA and, depending on promoter-context, can function as transcriptional repressor or activator [21, 22]. The expression and activity of p110 CUX1 are tightly regulated in a cell cycle-dependent manner, mostly through phosphorylation-dephosphorylation by cyclin A/Cdk2, cyclin A/Cdk1 cyclin B/Cdk1, and Cdc25A, as well as proteolytic processing by nuclear cathepsin L and a caspase-like protease [19, 20, 2327]. These post-translational modifications circumscribe the transcriptional activity of p110 CUX1 to the period between mid-G1 to sometimes in G2. In contrast to p110 CUX1, the DNA binding activity of p200 CUX1 is constant throughout the cell cycle [19]. Its transcriptional activity, if any, would be limited to the "CAATT-displacement activity", a mechanism of passive repression involving competition for binding site occupancy [18].

                Homozygous inactivation of Cux1 in mice causes perinatal lethality in a large proportion of animals due to delayed lung development and associated respiratory failure [28]. Surviving mice are usually male and exhibit growth retardation, disrupted hair follicle morphogenesis, purulent rhinitis, infertility, cachexia, and reduction of B and T cell content in bone marrow and thymus, respectively [2830]. In transgenic mouse models, overexpression of CUX1 generated various cancer-associated disorders depending on the specific isoform and tissue type expression. These include multi-organ organomegaly, glomerulosclerosis and polycystic kidneys, pre-cancerous lesions in the liver, myeloproliferative-disease-like myeloid leukemias and mammary tumors sometimes associated with lung metastasis [3136]. Cell-based assays demonstrated a role for CUX1 in cell cycle progression and cell proliferation [27, 37], strengthening of the spindle assembly checkpoint [38], cell migration and invasion [22, 3941], resistance to apoptotic signals [42], and dendrite branching and spine development in cortical neurons [43]. Which CUX1 isoform(s) is active in these processes cannot be determined from siRNA or shRNA-mediated knockdown approaches, however, in overexpression studies the p110 CUX1 isoform was shown to regulate transcription of genes involved in cell cycle progression, DNA damage response, spindle assembly checkpoint and cell motility.

                Many specific transcription factors are able bind to genomic sites that are far away from TSS. These studies also revealed that only about up to 10% of putative transcriptional targets showed evidence of regulation in response to changes in transcription factor concentrations [4446]. Whether CUX1 binds preferentially to core promoter sequences, like E2F1, or whether it can also bind at a distance from TSS, like c-Myc, has not been determined [14, 15]. Also, what proportion of all CUX1 targets is regulated in response to overexpression or silencing of CUX1 is not known. To begin to address these questions, we have performed ChAP-chip using ENCODE and promoter microarrays. Putative targets were validated in independent ChIP followed by q-PCR, while regulatory effects were measured in expression profiling experiments and confirmed by RT-qPCR. The results show that CUX1 binds to a large number of genomic sites that are located far away from a TSS and can regulate genes at a distance even when another gene is located in the intervening region.

                Results

                Strategy to identify p110 CUX1 binding sites

                The overall goal of the present study was to define the modes of transcriptional regulation by CUX1 and, in particular, determine whether CUX1 can regulate genes at a distance. As detailed in the introduction, previous transcriptional studies and cell-based assays have implicated the p110 CUX1 isoform in transcriptional activation and repression of target genes. Since p110 CUX1 is generated by proteolytic processing, its primary sequence is included in the full-length CUX1 protein sequence. Consequently, all available antibodies that bind to p110 CUX1 also recognize p200 CUX1. Our strategy to identify in vivo binding sites for p110 CUX1 was to isolate chromatin by two different methods. First, we purified chromatin by tandem affinity purification (TAP) using a population of Hs578t cells stably expressing moderate levels of a p110 CUX1 protein with two epitope tags at its C-terminus, p110-Tag2 (Figure 1A and B). Chromatin isolated in this manner as well as total chromatin (input) were used in hybridizations on the NimbleGen HG17 ENCODE high density oligonucleotide tiling array. Secondly, binding sites identified in the microarray were then validated by performing independent ChIP in the parental Hs578t cells using CUX1 antibodies, 861 and 1300 (Figure 1A). Importantly, these cells express endogenous CUX1 proteins only. The strategy of chromatin affinity purification (ChAP) followed by microarray analysis (ChAP-chip) has previously been validated [47], and described in detail [48].
                http://static-content.springer.com/image/art%3A10.1186%2F1471-2164-14-258/MediaObjects/12864_2012_4981_Fig1_HTML.jpg
                Figure 1

                Expression of CUX1 Recombinant Proteins. (A) Schematic representation of CUX1 proteins with some of the functional domains: ID, inhibitory domain; CC, coiled-coil; CR1, CR2 and CR3, Cut repeat 1, 2 and 3; HD homeodomain; CBD, calmodulin binding domain; Prot A, protein A. The regions recognized by the 861 and 1300 antibodies are shown. (B) Hs578t cells were infected with a retroviral vector to establish a population of cells stably expressing a recombinant p110 CUX1 protein with two tags at its C-terminus, p110 CUX1-Tag2. A population stably carrying the empty vector was used as a control. Nuclear extracts were prepared from each population of cells and analyzed by Western blot using the 861 and 1300 CUX1 antibodies.

                Distribution of CUX1 binding sites on the ENCODE array

                Using a stringent false discovery rate (FDR = 0.05), 513 CUX1 binding sites were identified on the ENCODE array (Table 1). The recruitment of CUX1 to 23 out of 25 genomic sites (92%) was validated in quantitative-PCR assays using chromatin that was independently obtained from Hs578t cells by immunoprecipitation with CUX1 antibodies (Table 1). 79.6% of probes on the ENCODE array derive from transcribed genomic regions. 70.9% of CUX1 binding sites were located within transcribed regions, indicating a 1.6-fold enrichment in non-transcribed regions. In comparison, data obtained from ChIP on the ENCODE platform [14] for c-MYC reveals a 1.56 fold enrichment in non-transcribed regions while E2F1 showed a strong enrichment for transcribed regions (Table 2).
                Table 1

                CUX1 binding sites on the ENCODE array

                # of binding sites

                513

                Average site width (bp)

                503

                Sites tested in qPCR

                25

                Validation rate

                92%

                Validation rate (with consensus)

                100%

                Validation rate (no consensus)

                90%

                Number and average width of CUX1 binding sites identified on the NimbleGen HG17 ENCODE array using chromatin purified from Hs578t cells. Also shown are the number and percentage of binding sites that were validated in an independent ChIP experiment. The validation rate is also shown independently for sites that contained the ATCRAT consensus sequence as well as for sites that did not.

                Table 2

                Distribution of CUX1, Myc and E2F1 binding sites in transcribed and non-transcribed regions

                 

                Encode platform

                CUX1

                c-MYC

                E2F1

                Number of binding sites

                 

                513

                172

                204

                Non-transcribed regions

                20.4%

                28.1%

                28.5%

                5.9%

                Transcribed regions

                79.6%

                70.9%

                71.5%

                94.1%

                Enrichment in un-transcribed regions

                 

                1.61

                1.56

                0.24

                P Value

                 

                0.0018

                0.1333

                <0.0001

                Number of CUX1, C-Myc and E2F1binding sites in transcribed and un-transcribed regions. Also indicated are the fold enrichment in transcribed regions. P Values are calculated using a Fisher's exact test.

                Mapping of CUX1 binding sites relative to transcription start sites (TSS) generated a bell-shaped curve of low height around TSS (Figure 2A). 14.2% of all binding sites overlapped a TSS, and an additional 17% and 16% of binding sites were respectively located in the 4 Kbp region upstream and downstream of a TSS. The number of binding sites gradually declined with increasing distance. Yet, over 6% and 8% of binding sites were situated at more than 40,000 bp upstream or downstream, respectively, from the closest TSS. 53% of CUX1 binding sites are located more than 4,000 bp away from a TSS and approximately 14% of all CUX1 binding sites are situated at more than 40,000 bp from a TSS.
                http://static-content.springer.com/image/art%3A10.1186%2F1471-2164-14-258/MediaObjects/12864_2012_4981_Fig2_HTML.jpg
                Figure 2

                Distribution of CUX1, C-Myc and E2F1 Binding Sites Relative to Transcription Start Sites. (A) Percentage of CUX1 binding sites located at various distances from the closest transcription start site. The "0" column indicates genes where the CUX1 binding site overlaps the start site. (B) Location of C-Myc binding sites as per A. (C) Location of E2F1 binding sites as per A.

                We compared the distribution of CUX1 binding sites with those of 3 randomly generated sets of binding sites, as well as those of c-Myc and E2F1 using the data of Bieda et al., 2006 [14] (Figure 2B and C). We note that the distributions of randomly generated sets of binding sites exhibited flatter bell-shaped curves around TSS (Additional file 1: Figure S1). We conclude that the higher frequency of CUX1 binding sites close to TSS reflects the preferential recruitment of CUX1 to promoter regions. The same cannot be said regarding the binding sites that are located at more than 40 Kbp from TSS, since the same proportions of randomly generated binding sites were located in these regions.

                In contrast to CUX1 and c-Myc, the E2F1 transcription factor was found to bind almost exclusively to the region immediately adjacent to TSS. The preference of E2F1 to core promoter regions led the authors to posit that E2F1 is recruited via protein interactions with components of the general transcription machinery [14]. The wider distribution of binding sites observed for CUX1 and c-Myc is also observed for other transcription factors [15, 49, 50] (Additional file 2: Figure S2A-C), while other factors show a preference for TSS similarly to E2F1 (Additional file 2: Figure S2D-E). Yet other factors show different patterns of binding, such as Pax8, which exhibits preference for non-promoter CpG islands and a tendency to bind in the 10–100 Kbp range rather than close to the TSS of genes [51].

                Binding of CUX1 to distant regulatory elements

                We compared the location of CUX1 binding sites that are more than 4 Kb from the nearest TSS to DNAse hypersensitivity mappings and ChromHMM data in human mammary epithelial cells from published datasets. DNAse hypersensitivity sites have been used as markers of regulatory DNA elements such as enhancers, silencers, insulators and locus control regions [5255]. ChromHMM is a computational method that compiles data from histone modification mappings and integrates them to predict genomic elements such as enhancers [56]. This analysis revealed that respectively 19.2% and 22.1% of distantly located CUX1 binding sites are present within 1 kb of a DNAse hypersensitivity site and of an enhancer predicted (Table 3). Both of these proportions are greater than what is seen for randomly distributed binding sites. However, there was no enrichment of CUX1 binding sites in proximity of insulator elements (Table 3). These results are in agreement with the notion that CUX1 can perform some regulatory functions when binding at a distance from transcription start sites.
                Table 3

                A fraction of CUX1 binding sites locate close to enhancer elements and DHS sites

                Type

                CUX1

                Random

                Fold difference

                P Value

                DHS

                19.2%

                12.9%

                1.49

                0.0109

                Enhancers

                22.1%

                15.2%

                1.45

                0.0100

                Insulators

                4.43%

                4.40%

                1.01

                1.0000

                Percentages of CUX1 binding sites located more than 4 Kbps away from a TSS but within 1 Kbp of the indicated type of genomic element. Percentages are shown for a set of randomly generated binding sites of the same size distribution as CUX1. P Value is calculated using a Fisher's exact test. See Methods for information on the datasets used.

                Detection of CUX1 binding sites and consensus binding motif on promoter arrays

                Promoter microarrays are useful because they enable one to interrogate easily over 30,000 gene promoters. A limitation is that only a limited amount of promoter sequences can be included for each gene, precluding the detection of far away binding sites that could play a role in transcriptional regulation. Based on the localization of CUX1 binding sites on the ENCODE array, we calculated that between 17.2% to 26.6% of CUX1 binding sites would be identified on commercially available promoter arrays (Table 4). However, since for many distant CUX1 binding sites another binding site is also present close to the transcription start site, we estimated that between 44.6% to 58.5% of gene targets would be identified on distinct promoter arrays (Table 5). In contrast, as E2F1 is targeted to transcription start sites, between 80.4% to 85.8% of E2F1 binding sites would be expected to be identified on a promoter array.
                Table 4

                Binding sites and target genes predicted to be identified in promoter arrays

                Platform

                Promoter array boundaries

                % of Binding sites predicted in promoter array

                CUX1

                C-Myc

                E2F1

                Nimblegen

                −3.5 kb to + 0.75 kb

                17.2%

                26.8%

                80.4%

                Agilent

                −5.5 kb to + 2.5 kb

                23.4%

                34.3%

                84.3%

                Affymetrix

                −7.5 kb to + 2.45 kb

                26.6%

                34.9%

                85.8%

                Percentages of CUX1, C-Myc and E2F1 binding sites that were identified on the ENCODE array and that are located within the boundaries of promoters on various promoter array platforms.

                Table 5

                Binding sites and target genes predicted to be identified in promoter arrays

                Platform

                Promoter array boundaries

                % of Target genes predicted in promoter array

                CUX1

                C-Myc

                E2F1

                Nimblegen

                −3.5 kb to + 0.75 kb

                44.6%

                36.0%

                90.2%

                Agilent

                −5.5 kb to + 2.5 kb

                57.1%

                45.9%

                92.2%

                Affymetrix

                −7.5 kb to + 2.45 kb

                58.5%

                48.3%

                92.2%

                Percentages of CUX1, C-Myc and E2F1 target genes that are identified on the ENCODE array and whose binding site are located within the boundaries of promoters on various promoter array platforms.

                We verified these predictions by performing a ChAP-chip experiment using the Nimblegen promoter microarray. Total chromatin (input) as well as purified chromatin from Hs578t cells expressing p110 CUX1-Tag2 were used in hybridization on the promoter array of NimbleGen. Using a stringent false discovery rate (FDR = 0.05), 5828 CUX1 binding sites were identified on 4706 gene promoters (Table 6). The recruitment of CUX1 to 25 out of 25 genomic sites (100%) was validated in quantitative-PCR assays using chromatin that was independently obtained from Hs578t cells by immunoprecipitation with CUX1 antibodies (Table 6). The vast majority of target genes (83.7%) contained only one CUX1 binding site, yet a sizable fraction contained 2 or more binding sites (Table 6).
                Table 6

                CUX1 binding sites on the promoter array

                Genes on array

                20593

                Number of sites/gene

                Number of genes

                CUX1 Binding sites

                5828

                1

                3942

                Genes bound by CUX1

                4706

                2

                643

                Average Site Width (bp)

                503

                3

                90

                Sites tested in qPCR

                25

                4

                23

                Validation rate

                100%

                5+

                8

                Columns 1 and 2: Number and average width of CUX1 binding sites identified on the NimbleGen HG18 Human Promoter Array using chromatin purified from Hs578t cells. Also shown are the number and percentage of binding sites that were validated in an independent ChIP experiment.

                Columns 3 and 4: Number of genes with the indicated number of CUX1 binding site. Note that validation was performed on 14 and 11 sites that contained or not an ATCRAT motif. All of them were validated.

                According to the predictions shown in Table 5, 44.6% of CUX1 target genes should be identified on the promoter array from Nimblegen. We calculated the proportion of ENCODE genes with a CUX1 binding site that were also identified as putative targets of CUX1 in the promoter array. When we considered all 513 CUX1 binding sites and 445 adjacent ENCODE genes, we found that 92 genes (21%) were identified in the promoter array (Table 7, third column). When we considered only the 85 ENCODE genes that were regulated in response to changes in CUX1 levels (see below), we found that 27 genes (32%) were identified as putative target of CUX1 in the promoter array (Table 7, third column).
                Table 7

                Binding sites and target genes predicted to be identified in promoter arrays

                 

                Total

                Identified on Nimblegen promoter array

                All Genes on the ENCODE Array

                445

                92 (21%)

                Regulated Genes on the ENCODE Array (1.25)

                85

                27 (32%)

                Regulated Genes on the ENCODE Array (1.5)

                26

                8 (31%)

                The second column shows the number of genes on the ENCODE. The third column shown the number and percentage of these genes that were also identified on the Nimblegen promoter array.

                The CUX1 consensus binding site, ATCRAT (where R = C or A), was found to be present at 47.2% of the 5828 bound genomic sites (Table 8). This frequency was judged to be significant as the CUX1 consensus binding site was found to be present in only 17.5% of 5828 randomly chosen regions of equal size. Notably, the GC content between bound and unbound regions is practically identical, and thus cannot account for the difference in binding site occurrence (Table 8). Yet, only 8.3% (3633/43778) of the CUX1 consensus sites present on the array were bound in vivo. We conclude that the CUX1 consensus binding site plays a role in the recruitment of CUX1 at specific genomic locations, but the presence of a consensus site is not sufficient.
                Table 8

                CUX1 consensus binding sites and bound genomic regions

                 

                Regions

                Regions with consensus

                % with consensus

                GC Content

                Bound Regions

                5828

                2749

                47.2%***

                47.3%

                Unbound Regions

                5828

                1020

                17.5%

                47.0%

                Columns 2–4, occurrence of the CUX1 consensus binding site, ATCRAT (where R = C or A), within the 5828 genomic regions bound by CUX1 on the promoter array. To calculate the p value, an equal number of randomly chosen regions of equal width was searched for the presence of the CUX1 consensus binding site: ***: p < 0.001. Column 5 shows the GC content of bound and unbound regions.

                Identification of binding motifs in genomic regions bound by CUX1

                We envisioned that interactions with other transcription factors play an important role in recruiting CUX1 to specific locations. In agreement with this notion, functional analysis revealed distinct sets of cellular functions among gene targets that contain an ATCRAT consensus and those that do not (Tables 9 and 10). To further test the possibility that CUX1 may interact with other factors, we investigated the presence of binding motifs other than that of CUX1 using the MEME suite of analysis tools (meme.nbcr.net/). We first tested the reliability of the tool by using it to find motifs in the sequences of CUX1 BS in which we had independently determined that they contained the established ATCRAT consensus. As expected, it identified the ATCRAT consensus as the most enriched motif in the set of sequences, by a vast margin (Table 11, entry 1). We then analyzed binding motifs in the two sets of CUX1 binding sites: those that contained the ATCRAT motif and those that did not. While the size of bound regions varied from 149 to 1107 bp, the average size was 532 and 477 bp, respectively. Interestingly, only one common binding motif was found in the two sets, while the rest of the binding motifs were unique to each set (Tables 11 and 12). Extending the search to the 500 bp regions on either side of bound regions did not highlight other differences between the two sets or reveal additional contributing factors (Tables 13 and 14). These findings support the notion that targeting of CUX1 to specific genomic sites is influenced by protein-protein interactions with other DNA binding proteins.
                Table 9

                Functions of CUX1 target genes that contain a consensus CUX1 binding site

                Functional term

                Fold enrichment

                P Value

                Macromolecular complex assembly

                1.39

                3.1E-04

                Microtubule cytoskeleton organization

                1.88

                6.1E-04

                Cytoskeleton organization

                1.47

                6.6E-04

                Response to DNA damage stimulus

                1.51

                7.2E-04

                Negative regulation of programmed cell death

                1.49

                1.2E-03

                Anti-apoptosis

                1.68

                1.2E-03

                Cellular response to stress

                1.38

                1.4E-03

                Cellular macromolecule catabolic process

                1.32

                1.5E-03

                Protein localization

                1.28

                1.7E-03

                Translational elongation

                1.97

                2.3E-03

                Ten most over-represented biological functions of CUX1 targets gene from the promoter array which contain a consensus CUX1 binding sequence (ATCRAT). Overrepresentation is determined using the online DAVID tool.

                Table 10

                Functions of CUX1 target genes that do not contain a consensus CUX1 Binding site

                Functional term

                Fold enrichment

                P Value

                Ribonucleoprotein complex biogenesis

                2.11

                3.3E-06

                Translation

                1.73

                1.1E-05

                RNA processing

                1.55

                1.1E-05

                Cell cycle

                1.41

                5.4E-05

                Mitotic cell cycle

                1.62

                6.1E-05

                Ribosome biogenesis

                2.17

                7.4E-05

                Nuclear mRNA splicing, via spliceosome

                2.02

                8.0E-05

                Cell cycle process

                1.45

                1.9E-04

                Establishment of protein localization

                1.38

                1.9E-04

                Translational elongation

                2.19

                3.0E-04

                Ten most over-represented biological functions of CUX1 target genes from the promoter array which do not contain a consensus CUX1 binding sequence (ATCRAT). Overrepresentation is determined using the online DAVID tool.

                Table 11

                Identification of DNA motifs in CUX1 binding sites with the ATCRAT consensus

                Motif

                Reverse complement

                E-Value

                Transcription factors

                ATCRAT

                ATYGAT

                3.5E-735

                Cux1, Pbx1

                GGGYGGGR

                YCCCRCCC

                4.8E-35

                Klf4, Klf7, Sp1, Sp4, Zfp281, Zfp740, Egr1

                AAATAHW

                WDTATTT

                1.9E-27

                -

                CTBCCTS

                SAGGVAG

                6.30E-26

                Spi1, Stat3, Fev, Sfpi1

                CWCCDCC

                GGHGGWG

                6.60E-23

                -

                DRGGAAA

                TTTCCYH

                6.20E-21

                -

                BSTGTGTG

                CACACASV

                1.20E-20

                -

                RGAGAAR

                YTTCTCY

                2.60E-14

                -

                ACRCWG

                CWGYGT

                3.70E-14

                -

                RAAACAAA

                TTTGTTTY

                1.90E-11

                Sox11, Sox4, Foxd3, Foxi1

                10 Most enriched DNA motifs found in CUX1 binding sites that contain the ATCRAT CUX1 consensus. The DNA sequences considered in this analysis correspond to the entire regions bound by CUX1 as defined in the microarray. The size of bound regions varies from 149 to 1107 bp (95th percentile) and is 532 bp on average. Proteins with DNA binding motifs highly similar to the motifs are listed in the rightmost column. K = G/T, M = A/C, R = A/G, Y = C/T, S = C/G, W = A/T, B = C/G/T, V = A/C/G, H = A/C/T, D = A/G/T.

                Table 12

                Identification of DNA motifs in CUX1 binding sites without the ATCRAT consensus

                Motif

                Reverse complement

                E-Value

                Transcription factors

                DTATTTW

                WAAATAH

                3.80E-35

                -

                CYCCRCCC

                GGGYGGRG

                4.60E-34

                Klf4, Klf7, Sp1, Sp4, Zfp281, Zfp740, Egr1

                CAYTTCY

                RGAARTG

                1.50E-26

                Gabpa, Stat1

                CACASAS

                STSTGTG

                3.20E-23

                Runx1

                DGGAAA

                TTTCCH

                5.00E-22

                Stat1, Nfatc2, Rela, Rel, Fev

                CCRCCDCC

                GGHGGYGG

                6.40E-19

                -

                GSAGAGR

                YCTCTSC

                3.90E-17

                -

                CHGCAGC

                GCTGCDG

                1.30E-16

                Myf, Ascl2

                CATTTWM

                KWAAATG

                2.90E-26

                -

                DTTTCTS

                SAGAAAH

                1.70E-13

                -

                10 Most enriched DNA motifs found in CUX1 binding sites that do not contain the ATCRAT CUX1 consensus. The DNA sequences considered in this analysis correspond to the entire regions bound by CUX1 as defined in the microarray. The size of bound regions varies from 149 to 949 bp (95th percentile) and is 477 bp on average. Proteins with DNA binding motifs highly similar to the motifs are listed in the rightmost column.

                Table 13

                Identification of DNA motifs close to CUX1 binding sites with the ATCRAT consensus

                Entry

                Motif

                Reverse complement

                E-value

                Match in Table14

                Factors

                1

                CNGCCTCC

                GGAGGCNG

                2.9E-168

                Entry 3

                -

                2

                CTGTARTC

                GAYTACAG

                2.5E-161

                Entry 1

                -

                3

                CAGGCTGG

                CCAGCCTG

                3.7E-145

                -

                -

                4

                AAAWAMAA

                TTKTWTTT

                6.1E-135

                Entry 2

                Srf, Elf3, Tcfap2e

                5

                TGCAGTGR

                YCACTGCA

                4.6E-115

                Entry 6

                Zbtb3

                6

                CCAGCTAC

                GTAGCTGG

                8.9E-109

                Entry 4

                -

                7

                GAGACRGR

                YCYGTCTC

                1.6E-108

                -

                -

                8

                BGYGGTGG

                CCACCRCV

                2.6E-92

                -

                -

                9

                CTCCYGMC

                GKCRGGAG

                1.7E-86

                Entry 5

                -

                10

                CAAAGTGC

                GCACTTTG

                2.1E-71

                Entry 10

                -

                10 Most enriched DNA motifs found within the 500 bp regions on either side of CUX1 binding sites that contain the ATCRAT CUX1 consensus. Column 5 lists Table 14 entries whose motifs are very similar (max. 1 mismatch). Proteins with DNA binding motifs highly similar to the motifs are listed in the rightmost column. K = G/T, M = A/C, R = A/G, Y = C/T, S = C/G, W = A/T, B = C/G/T, V = A/C/G, H = A/C/T, D = A/G/T.

                Table 14

                Identification of DNA motifs close to CUX1 binding sites without the ATCRAT consensus

                Entry

                Motif

                Reverse complement

                E-Value

                Match in Table13

                Factors

                1

                CTGTARTC

                GAYTACAG

                3.9E-170

                Entry 2

                -

                2

                AAAAWAMA

                TKTWTTTT

                6.6E-170

                Entry 4

                Srf, Elf3, Tcfap2e

                3

                CMGCCTCC

                GGAGGCKG

                1.7E-160

                Entry 1

                -

                4

                AGTAGCTG

                CAGCTACT

                6.2E-120

                Entry 6

                -

                5

                CTCCWSCC

                GGSWGGAG

                2.3E-116

                Entry 9

                -

                6

                GCRGTGR

                YCACYGC

                2.6E-109

                Entry 5

                Zbtb3

                7

                CCMCRCCC

                GGGYGKGG

                1.4E-104

                -

                Klf4, Klf7, Sp1, Sp4

                8

                AAATTAGC

                GCTAATTT

                2.4E-93

                -

                Pdx1

                9

                GTAGAGAY

                RTCTCTAC

                2.0E-90

                -

                -

                10

                AAAGTGCT

                AGCACTTT

                2.0E-75

                Entry 10

                -

                10 Most enriched DNA motifs found within the 500 bp regions on either side of CUX1 binding sites that do not contain the ATCRAT CUX1 Consensus. Column 5 lists Table 13 entries whose motifs are very similar (max. 1 mismatch). Proteins with DNA binding motifs highly similar to the motifs are listed in the rightmost column.

                Regulatory effects of CUX1 on putative targets

                To verify the effect of CUX1 on putative targets, we performed expression profiling on three Hs578t cell populations: cells that had been infected with a retrovirus expressing an shRNA against CUX1, cells infected with a retrovirus expressing p110 CUX1, or cells infected with an empty retrovirus. In each case, replicate microarray hybridizations were carried out such that a p value could be calculated for each difference in gene expression. Results from expression profiling were validated by repeating the infections and performing RT-qPCR analysis on 20 genes whose expression went up or down in response to one treatment or the other (Figure 3). All genes tested in this manner displayed changes in gene expression in the same direction as that observed in the microarray hybridization: genes that were repressed in expression profiling were also repressed when mRNA levels were measured by RT-qPCR. Similar observations were made for genes that were activated. We note, however, that the fold activation or repression calculated by RT-qPCR were not necessarily proportional to the changes observed in microarray hybridization. For example, EEF1A1 and C20ORF44 mRNA were increased respectively 7.3 and 1.8 fold when measured by RT-qPCR, but were increased 1.7 and 1.4 fold in microarray analyses. Some of these differences could be due to the fact that measurements by the two methods were made with RNA prepared from independent experiments. Notwithstanding the differences in magnitude, the effects of CUX1 on gene expression was confirmed for all tested genes.
                http://static-content.springer.com/image/art%3A10.1186%2F1471-2164-14-258/MediaObjects/12864_2012_4981_Fig3_HTML.jpg
                Figure 3

                Overexpression and Knockdown of CUX1 and Expression Profiling Validation. (A) Hs578t cells were infected with a lentiviral vector expressing p110 CUX1 or nothing (vector). RNA and proteins were purified 48 hours post-infection. CUX1 expression was analyzed by RT-qPCR and immunoblotting. (B) Hs578t cells were infected with a lentiviral vector expressing CUX1 shRNA or a scrambled RNA. RNA and proteins were purified 5 days after infection. CUX1 expression was analyzed by and RT-qPCR and immunoblotting. (C) RNA levels of the indicated genes were measured by RT-qPCR in cells treated as in A. Expected up or down indicates regulation that was observed by expression profiling. (D) RNA levels of the indicated genes were measured by RT-qPCR in cells treated as in B. Expected up or down indicates regulation that was observed by expression profiling. * p<0.05, *** p<0.001 on a Student's T test.

                A total of 445 genes are present on the ENCODE array, and all have a CUX1 binding site located within 213 Kbp of their TSS. Expression profiling results could be matched for 327 of these genes. Using a cut-off of 50% either up or down-regulated and a p value below 0.05, we observed differences in the expression of 26 target genes (7.4%), following changes in CUX1 levels (Table 15). 20 genes responded to CUX1 knockdown, and 6 genes, to p110 CUX1 overexpression (Table 15). Among the 26 regulated target genes, 10 genes (38%) were activated and 16 genes (62%) were repressed by CUX1 (Table 15). Similar proportions of activated and repressed genes were found when a cut-off of 25% change in gene expression was employed (Table 16). These findings confirm that p110 CUX1 can participate in transcriptional activation or repression depending on promoter context.
                Table 15

                Genes on the ENCODE array regulated in response to CUX1 overexpression or CUX1 knockdown (1.5 fold)

                Effect of CUX1 on 327 putative target genes

                CUX1 Overexpression or Knockdown

                CUX1 Overexpression

                CUX1 Knockdown

                Up- or Downregulated

                26 (7.4%)

                6 (1.7%)

                20 (5.7%)

                Upregulated

                10 (2.9%)

                3 (0.9%)

                7 (2.0%)

                Downregulated

                16 (4.6%)

                3 (0.9%)

                13 (3.7%)

                Number and percentage of genes on the ENCODE platform that exhibit a 1.5 fold change in expression following p110 CUX1 overexpression or CUX1 knockdown. "Upregulated by CUX1" are genes whose expression is increased following p110 CUX1 and/or decreased following CUX1 knockdown. Conversely, "Downregulated by CUX1" are genes whose expression is decreased following p110 CUX1 and/or increased following CUX1 knockdown.

                Table 16

                Genes on the ENCODE array regulated in response to CUX1 overexpression or CUX1 knockdown (1.25 fold)

                Effect of CUX1 on 327 putative target genes

                CUX1 Overexpression or Knockdown

                CUX1 Overexpression

                CUX1 Knockdown

                Up- or Downregulated

                85 (24.4%)

                36 (10.3%)

                62 (17.8%)

                Upregulated

                35 (10.0%)

                18 (5.2%)

                24 (10.9%)

                Downregulated

                50 (14.3%)

                18 (5.2%)

                38 (6.9%)

                Number and percentage of target genes on the ENCODE platform that exhibit a 1.25 fold change in expression following p110 CUX1 overexpression or CUX1 knockdown. Genes were analyzed as in Table 15.

                Similar results were obtained when we analyzed the expression of putative targets identified on the promoter array. A total of 347 genes, 8.4% of all putative targets for which expression profiling results could be matched, were regulated by CUX1. 287 and 85 genes exhibited regulation in response to CUX1 knockdown or p110 CUX1 overexpression, respectively. 181 (52%) were up-regulated by CUX1 while 167 (48%) were down-regulated by CUX1.

                Effect of distance on transcriptional regulation by CUX1

                We noted that CUX1 regulated 7.4% and 8.4% of putative targets from the ENCODE and the promoter arrays, respectively. We next investigated the relationship between the position of a CUX1 binding site relative to a transcription start site and the probability of a gene to be regulated in response to changes in CUX1 levels. When genes were classified according to the distance between the CUX1 binding site and the transcription start site, we did not observe significant difference in the fraction of targets that were regulated by CUX1 (Figure 4A and B). However, we observed much variability in the fraction of regulated genes because the number of genes within some distance intervals were very small. Therefore, to increase the sample size, we repeated the analysis this time using a cut-off of 25% either up or down and a p value below 0.05 (Figure 4C and D). We observed differences in the expression of 62 and 36 genes in response to CUX1 shRNA and CUX1 overexpression, respectively (Table 16). Again, more genes were found to be regulated by CUX1 using the shRNA approach. Among genes that exhibited regulation by CUX1, 35 genes (41%) were activated by CUX1, and 50 genes (59%) were repressed by CUX1 (Table 16). The histogram presenting the percentage of regulated genes versus the distance of CUX1 binding sites to TSS shows that essentially the same proportion of genes are regulated whether CUX1 binds close or far away from the TSS (Figure 4C). Indeed, no statistical difference was observed between genes bound at the TSS and those bound more than 40 Kbp away. We conclude that CUX1 can activate or repress transcription when bound at a distance from a transcription start site.
                http://static-content.springer.com/image/art%3A10.1186%2F1471-2164-14-258/MediaObjects/12864_2012_4981_Fig4_HTML.jpg
                Figure 4

                Effect of Distance on Regulation by CUX1. (A) Genes from the ENCODE array have been organized according to the distance between their transcription start site (TSS) and the closest CUX1 binding site. The "0" column indicates genes where the CUX1 binding site overlaps the start site. The histogram shows, for each interval of distance, the percentage of genes that exhibit a 1.5 fold change in expression following p110 CUX1 overexpression or CUX1 knockdown. The total number of genes within each interval is indicated within each column. (B) As in A, except that the regulation by CUX1 is expressed as either activation by CUX1 or repression by CUX1. (C) As in A, but with a threshold of 1.25 fold change in expression. (D) As in B, but with a threshold of 1.25 fold change in expression.

                Effect of multiple CUX1 sites

                The presence of multiple CUX1 binding sites has a modest, yet significant, impact on the probability that a gene is regulated by CUX1. CUX1 regulated 7.9%, 11.2% of genes that contain respectively one or two CUX1 binding sites, respectively (Table 17).
                Table 17

                Number of genes on the promoter array that are regulated in response to CUX1 overexpression or CUX1 knockdown

                Number of sites/target

                Number of targets

                Targets with profiling data

                1.5 fold change

                1.25 fold change

                Any #

                4706

                4140

                347

                8.4%

                1437

                34.7%

                1

                3942

                3527

                278

                7.9%

                1182

                33.5%

                2+

                643

                613

                **69

                11.2%

                ***255

                41.6%

                Effect of the number of CUX1 binding sites on the probability that target genes exhibit a change in expression following p110 CUX1 overexpression or CUX1 knockdown, depending on the number of CUX1 binding site present in their promoter region. **: P < 0.01, ***:P < 0.001 on a Fisher's exact test vs. genes whose promoter have only 1 CUX1 binding site.

                Effect of gene position on transcriptional regulation by CUX1

                Intuitively, one would assume that a transcription factor is more likely to regulate the closest promoter. Yet, some enhancers will exhibit an effect on a promoter situated on one side, but no effect on the promoter that is on the other side on the map. This sort of selectivity between an enhancer and a promoter has been explained by the presence of boundary or insulator elements or by specific interactions between proteins bound at the enhancer and the regulated promoter. Previous studies on CUX1 have all focused on genes that contain a CUX1 binding site within the immediate promoter. To begin to investigate the rules that govern the action of CUX1, we calculated the fraction of different types of CUX1 targets that were regulated in response to changes in CUX1 levels. Three types of genes were analyzed: 1, genes that are the closest to the CUX1 binding site; 2, genes that are further away and in the other direction from the CUX1 binding site; 3, genes that are located further away and are separated by another gene from the CUX1 binding site. For each category, we calculated the percentage of genes that exhibit a 1.25 or 1.5-fold change in expression following p110 CUX1 overexpression or CUX1 knockdown. Strikingly, essentially similar fractions of genes were regulated whether they were closest to the CUX1 binding site or were located further away in the other direction (Figure 5, compare 1 and 2). Moreover, the proportion of regulated genes was not significantly lower among genes that belong to the third category (Figure 5, type 3 genes). We conclude that CUX1 is capable of regulating genes at a distance. Moreover, CUX1 can regulate more than one gene on certain genomic loci.
                http://static-content.springer.com/image/art%3A10.1186%2F1471-2164-14-258/MediaObjects/12864_2012_4981_Fig5_HTML.jpg
                Figure 5

                Relationship Between Gene Position and Regulation by CUX1. Three types of situations are depicted in the diagram. 1, genes that are the closest to the CUX1 binding site; 2, genes that are further away and in the other direction from the CUX1 binding site; 3, genes that are located further away and are separated by another gene from the CUX1 binding site. For each category, the table shows the percentage of genes that exhibit a 1.25 or 1.5 change in expression following p110 CUX1 overexpression or CUX1 knockdown.

                Discussion

                Genome-wide location analysis on the ENCODE array revealed that ~47% of CUX1 binding sites are located in the 4-Kbp region upstream and downstream of a TSS, while more than 14% of CUX1 binding sites are situated at more than 40 Kbp from a TSS (Figure 2). Overall, 7.4% and 8.4% of putative targets on the ENCODE and promoter arrays respectively, exhibited a 1.5-fold change in expression following CUX1 knockdown or p110 CUX1 overexpression (Tables 15, 16, 17, 18 and 19). This proportion is within the 1-10% range of potential targets that have been reported to be regulated by other transcription factors [4446].
                Table 18

                Number of genes on the promoter array that are regulated in response to CUX1 overexpression or CUX1 knockdown (1.25 fold)

                Effect of CUX1 on all genes and 4140 putative targets

                CUX1 Overexpression or Knockdown

                CUX1 Overexpression

                CUX1 Knockdown

                Gene list

                All genes

                Target genes

                Target genes

                Target genes

                Up- or Downregulated

                4880

                27.7%

                1437

                34.7%

                568

                13.7%

                1083

                26.1%

                Upregulated

                2290

                13.0%

                696

                16.8%

                261

                6.3%

                546

                13.2%

                Downregulated

                2590

                14.7%

                744

                17.9%

                307

                7.4%

                537

                13.0%

                Number and percentage of all genes and CUX1 target genes on the promoter array that exhibit a 1.25 fold change in expression following p110 CUX1 overexpression or CUX1 knockdown. Genes were analyzed as in Table 15.

                Table 19

                Number of genes on the promoter array that are regulated in response to CUX1 overexpression or CUX1 knockdown (1.5 fold)

                Effect of CUX1 on all genes and 4140 putative targets

                CUX1 Overexpression or Knockdown

                CUX1 Overexpression

                CUX1 Knockdown

                Gene list

                All genes

                Target genes

                Target genes

                Target genes

                Up- or Downregulated

                1231

                7.0%

                347

                8.4%

                85

                2.1%

                287

                6.9%

                Upregulated

                591

                3.4%

                181

                4.4%

                28

                0.7%

                169

                4.1%

                Downregulated

                640

                3.6%

                167

                4.0%

                57

                1.4%

                118

                2.8%

                Number and percentage of all genes and CUX1 target genes on the promoter array that exhibit a 1.5 fold change in expression following p110 CUX1 overexpression or CUX1 knockdown. "Upregulated by CUX1" are genes whose expression is increased following p110 CUX1 and/or decreased following CUX1 knockdown. Conversely, "Downregulated by CUX1" are genes whose expression is decreased following p110 CUX1 and/or increased following CUX1 knockdown. The total number of genes and target genes were 17586 and 4140, respectively.

                Importantly, analysis of the percentage of regulated genes versus the distance of CUX1 binding sites to TSS showed that essentially the same proportion of genes are regulated whether CUX1 binds close or far away from the TSS (Figure 4A and B). In other words, the probability that a gene is regulated by CUX1 is not affected by the distance between the CUX1 binding site and the TSS. In addition, our results indicate that the position of genes relative to a CUX1 binding site do not determine whether these genes are regulated by CUX1. CUX1 regulated similar percentages of genes whether they were closest to the CUX1 binding site or were located further away in the other direction (Figure 5, compare 1 and 2). Moreover, CUX1 regulated a surprisingly high proportion (5.4%) of genes that were separated from their binding site by another gene (Figure 5). Altogether these results demonstrate that CUX1 can regulate genes at a distance and can regulate more than one gene on certain genomic loci.

                The proportion of target genes that were found to be activated or repressed by CUX1, respectively 52% and 48% (Tables 18), is significantly different from what we reported in previous studies on target genes involved in cell cycle progression, cell motility, or the DNA damage response [21, 22, 57]. In each case, a vast majority of genes were found to be activated by p110 CUX1, whether we performed siRNA-mediated knockdown or overexpression of p110 CUX1. One factor that may explain this could be the functional classes of genes that were studied previously. The functional class of “cell cycle” genes includes mostly genes that stimulate cell cycle progression. Out of 25 cell cycle gene targets identified by ChIP-chip, 22 were activated and 2 were repressed by CUX1 (while only one was not affected) [21]. One of the two repressed genes, p21WAF1/CKI1, code for a CDK-inhibitor that blocks cell cycle progression, while the other, CCNH, is involved in transcription and DNA repair. All target genes that were activated play a positive role in cell cycle progression. Similarly, among 19 targets that play a role in DNA damage response, 18 were activated and one was repressed [57]. The repressed gene again was p21WAF1/CKI1. Overall, these results are consistent with the notion that CUX1 establishes a transcriptional program that promotes cell cycle progression and at the same time ensures the maintenance of genetic integrity.

                We employed two experimental approaches to examine the transcriptional regulation of genes by CUX1. Expression profiling was performed following shRNA-mediated knockdown of CUX1 or p110 CUX1 overexpression. Among targets identified on the promoter array, 287 genes exhibited a 1.5-fold change in expression following CUX1 knockdown, while 85 genes were regulated in response to p110 CUX1 overexpression. Therefore, more genes were found to be regulated by CUX1 using the shRNA approach. This result can be interpreted to mean that CUX1 is required for optimal expression of many target genes, however, increasing CUX1 expression is not sufficient to modulate the expression of some target genes.

                The CUX1 consensus binding site, ATCRAT (where R = C or A), was found to be present at 47.2% of the 5828 bound genomic sites (Table 8). We conclude that the presence of a CUX1 consensus binding site contributes to, but is not sufficient for, the recruitment of CUX1 to specific genomic locations. We envision that interactions with other transcription factors play an important role in recruiting CUX1 to specific locations. In agreement with this notion, functional analysis revealed distinct sets of cellular functions among gene targets that contain an ATCRAT consensus and those that do not (Tables 9 and 10). We note that functional classes involved in cell cycle were over-represented among target genes that do not contain a consensus CUX1 binding site (Table 10). In previous studies, CUX1 was shown to interact with E2F factors and cooperate with these factors in the regulation of several cell cycle genes [58, 59]. It is likely that protein-protein interaction with E2F factors reduces the requirement for the presence of a high-affinity binding site for the recruitment of CUX1 on this class of genes.

                CUX1 can be purified efficiently by immunoprecipitation or affinity chromatography. Following cross-linking, however, the yield of purification is drastically reduced such that we need 500 million cells to perform chromatin immunoprecipitation or affinity purification (ChIP or ChAP) for CUX1. This caveat has limited our ability to perform ChIP-sequencing and therefore our study relied on microarray hybridizations. While sequence coverage is admittedly smaller on microarrays, data collected from both ENCODE and promoter arrays have enabled us to define the importance of the CUX1 consensus binding site in the recruitment of CUX1 to genomic locations and determine whether CUX1 can regulate genes at a distance.

                Conclusions

                Our results demonstrate that p110 CUX1 can mediate transcriptional repression or activation of specific genes when bound at variable distances from the transcription start site. Although the CUX1 consensus binding motif, ATCRAT, plays a role in the recruitment of CUX1 to specific genomic sites, protein-protein interactions must contribute to its transcriptional activity.

                Methods

                Cell culture

                Hs578T is a human mammary carcinoma cell line [60]. Previous studies have documented changes in gene expression in response both to CUX1 knockdown and overexpression [21, 33]. Hs578T cells were maintained in Dulbecco’s modified minimum essential medium (DMEM)(Wisent) supplemented with penicillin-streptomycin, and 5% fetal bovine serum (FBS) (Gibco).

                Retroviral infection and stable cell lines

                Retroviruses were produced by transfecting 293VSV cells with the pREV/TRE vector either empty or encoding p110 CUX1-Tag2 (CUX1 a.a. 612–1336 with protein A and CBP tags inserted at the C-terminus) (Clontech). Preparation of the retroviruses and stable cell lines was done as previously described [37].

                Chromatin Affinity Purification (ChAP)

                The method of chromatin affinity purification (ChAP) has previously been validated [47], and described in detail [48]. To ensure that the recombinant p110-Tag2 protein would be expressed at moderate level, we employed the pRevTRE retroviral vector (Clontech), which contains the minimal CMV promoter with a tetracycline responsive element. Importantly, no tetracycline was added to the medium. Moreover, the Hs578T breast tumor cells do not express a tetracycline-responsive transactivator. Basal expression from the pRevTRE vector was previously shown to be very low [6163]. ChAP was performed on 5 x108 Hs578T. The cell nuclei were purified as described in [64], then lysed in RIPA-M buffer (10 mM Tris–HCl pH8, 1 mM EDTA, 0.5 mM EGTA, 150 mM NaCl, 1% Triton X-100, 0.5% DOC, 0.1% SDS, 1 mM PMSF, protease inhibitors) and sonicated on ice to obtain 250- to 800-bp-long DNA fragments. Stably expressed recombinant p110-Tag2 protein was purified by the Taptag purification method with some modifications [65]. The IgG matrix bound p110-Tag2/DNA were washed in wash buffer I (20 mM Tris–HCl pH8, 2 mM EDTA, 2 mM EGTA, 150 mM NaCl, 1% NP-40, 0.5% DOC, 0.2% SDS), wash buffer II (20 mM Tris–HCl pH9, 2 mM EDTA, 2 mM EGTA, 500 mM NaCl, 1% NP-40, 0.5% DOC, 0.1% SDS), wash buffer III (50 mM Tris–HCl pH7.5, 2 mM EDTA, 1 mM EGTA, 0.5 M LiCl, 1% NP-40, 0.7% DOC,) and then TEV buffer (10 mM Tris–HCl (pH8.0), 100mMNaCl, 0.1% TX-100, 0.5 mM EDTA, 10% glycerol, 1 mM DTT). After TEV protease digestion, the released protein/DNA complexes were purified by affinity chromatography on calmodulin beads in the presence of calcium and then eluted with EGTA. After de-crosslinking, samples were treated with RNase A and Proteinase K. Un-enriched input chromatin was put aside as a control.

                Preparation of ChAP purified DNA for hybridization

                ChAP purified chromatin was amplified by the method of Ligation-Mediated PCR as detailed previously [66]. Briefly, ChAPed DNAs and input DNA were blunted, ligated to a unidirectional linker and amplified by PCR for 24 cycles to generate a sufficient amount of DNA. Amplified DNA samples were Cy5 labeled and amplified input controls were Cy3 labeled using Nimblegen's Dual-Color DNA Labeling Kit according to the manufacturer's instructions.

                DNA microarray hybridization

                Labeled samples were hybridized to either NimbleGen's HG17 ENCODE or their HG18 Human Promoter Array Set high density oligonucleotide tiling array (385 k probe format) and then washed according to the manufacturer's instructions. Arrays were scanned on an Agilent 5 μm scanner model G2505B using customized scan area settings (X: 28, Y:6, Width: 20, Height: 14, values in mm).

                ChAP-microarray result analysis

                For both array platforms (Encode and promoter array), grid alignment, raw signal extraction, peak identification and peak mapping were carried out using the Nimblescan v8.0 software according to the company's instructions. Identified peaks were considered significant with a false discovery rate (FDR) below 0.05, which is considered highly confident. Further analysis of identified binding sites was carried our using either the R platform for statistical computing (http://​www.​R-project.​org) or scripts written in PERL (Practical Extraction and Report Language, http://​www.​perl.​org). All peaks identified in ChAP-Chip experiments on the Encode Array and the Human Promoter Array Set are provided (Additional files 3 and 4, respectively).

                ChAP-microarray result validation

                Independent ChIP experiments using antibodies specific for endogenous CUX1 were carried out in Hs578t, as previously described [21]. Real-time PCR was used to measure the level of enrichment of genomic target regions in ChIP DNA vs. the un-enriched input DNA. We selected 25 genes from both the targets identified on the ENCODE array and on the Promoter array set and designed primers specific for the corresponding regions where CUX1 was putatively identified as binding.

                ENCODE binding sites for c-MYC and E2F1

                We used ChIP-chip binding sites for E2F1 and c-Myc downloaded from the website of Dr. Peggy Farnham laboratory at <http://​genomics.​ucdavis.​edu/​farnham/​suppdata.​html>. This dataset contains the binding sites predicted for E2F1, c-MYC and POLR2A (RNA polymerase II) in the ENCODE regions classified by 4 criteria: L1 (P < 0.0001 and 98th percentile), L2 (P < 0.0001 and 95th percentile), L3 (P < 0.05 and 98th percentile) and L4 (P < 0.05 and 95th percentile) [14]. Based on the validation of 29 binding sites, Bieda et al. conclude that L1 binding sites are highly reliable, L2 and L3 binding sites are also reliable however based on sparser testing and L4 binding sites are usually artifacts. Binding sites identified with the L1 criteria were used for our analyses. The chromosomal intervals for binding sites predicted for E2F1 and POLR2A belonged to genomic coordinates using hg16, whereas c-MYC binding sites were in hg17. Therefore, the lift-over program found on the online GALAXY platform [6769] was used and random results were verified using UCSC genome browser to convert hg16 coordinates to those of hg17. There were 1 and 2 binding sites for E2F1 at L1 and L3 respectively (hg16), which could not be mapped to hg17.

                DHS and ChromHMM data analysis

                Data tracks were downloaded from the UCSC's Encode data portal (http://​genome.​ucsc.​edu/​ENCODE/​). Genomic locations were compared to those of the CUX1 binding sites using scripts written in R. UCSC Accession numbers of the tracks used are: wgEncodeEH000503 (GEO accessions GSM736552 and GSM736634) for the DHS data and wgEncodeEH000786 for the ChromHMM data.

                Consensus sequence analysis

                Genomic sequences corresponding to regions of interest (binding sites or other) were obtained using the online GALAXY platform. Scripts written in R were used to identify the ATCRAT consensus motif within regions of interest.

                De novo binding motif identification

                De Novo motif discovery was performed using the DREME (Discriminative DNA Motif Discovery) motif discovery tools form the MEME suite of tools. Comparison with known DNA binding motifs was performed using the TOMTOM algorithm using the JASPAR CORE database as a reference for comparison. (meme.nbcr.net/) [7072].

                Functional overrepresentation analysis

                Identification of overrepresented gene functions was carried out using the online annotation tool DAVID. Genes that were bound by CUX1 (Targets) were compared with all genes present on the microarray (Background). Overrepresentation of a function depends on the increase in the proportion of genes involved in a given function between CUX1 targets and the background. The P-value is determined using an improved Fisher’s exact test from the DAVID software [73, 74].

                p110 CUX1 overexpression and CUX1 shRNA

                For overexpression, Hs578t cells were infected with a lentiviral vector expressing p110 CUX1. Duplicate infections were carried out in parallel and cells were harvested after 24 hours. For CUX1 knockdown, a stable Hs578t cell line containing a doxycycline inducible shRNA was established by retroviral infection. Doxycycline was applied to the cells for 6 days before harvest with control cells left untreated. Knockdown experiments were carried out in biological duplicates in parallel.

                Expression profiling sample preparation and hybridization

                Total RNA was isolated from cells using the Arcturus Picopure RNA isolation kit. 2 μg of RNA was then amplified using the Arcturus RiboAmp PLUS RNA amplification kit according to the manufacturer's instructions for a single round of amplification. Amplified mRNA (aRNA) was labeled using the Arcturus Turbo Labelling Cy5 and Cy3 kits using the manufacturer's instructions with a modification: The labeling reaction was carried out using 5 μg of aRNA in a 20 μl volume instead of 50 μl to increase the dye incorporation rate. Labelled aRNA was hybridized to Agilent's Whole Human Genome Microarry (G4112F) according to the manufacturer's instructions, washed and scanned on a 5 μm Agilent scanner. Hybridizations of the biological duplicates of each experiment were carried out in technical duplicates using dye swaps (Cy3 and Cy5), for a total of 4 replicates for each of the overexpression and downregulation experiment.

                Expression profiling data analysis

                Raw signal and background intensities were extracted from the scanned images of expression arrays using the Feature Extraction software from Agilent. Raw data was processed and normalized using the R platform and the LIMMA package [75]. Processed expression profiling results are provided (Additional file 5).

                Expression profiling result validation

                Independent p110 CUX1 overexpression and CUX1 shRNA knockdown experiments were carried out in Hs578t cells using retroviral vectors. 10 genes were selected from each experiment and real-time PCR was used to confirm the changes in expression seen in expression profiling.

                Abbreviations

                ChAP: 

                Chromatin Affinity Purification

                ChIP: 

                Chromatin Immuno Precipitation

                DHS: 

                DNAse hypersensitivity site

                ENCODE: 

                ENCyclopedia of DNA Elements

                FDR: 

                False discovery rate

                TAP: 

                Tandem affinity purification

                TSS: 

                Transcription start site.

                Declarations

                Acknowledgements

                C.V. is the recipient of a scholarship from the Fonds de la Recherche en Santé du Québec. This research was supported by grant # MOP-98010 from the Canadian Institute of Health Research of Canada and grant #019389 from the Canadian Cancer Society to AN. We acknowledge infrastructure support and technical assistance from the Breast Cancer Functional Genomics Group, which is supported by funds from the Terry Fox Foundation and CIHR.

                Authors’ Affiliations

                (1)
                Goodman Cancer Centre, McGill University
                (2)
                Department of Biochemistry, McGill University
                (3)
                Department of Medicine, McGill University
                (4)
                Department of Oncology, McGill University

                References

                1. Jenuwein T, Allis CD: Translating the histone code. Science 2001,293(5532):1074–1080.PubMedView Article
                2. Turner BM: Cellular memory and the histone code. Cell 2002,111(3):285–291.PubMedView Article
                3. MacQuarrie KL, Fong AP, Morse RH, Tapscott SJ: Genome-wide transcription factor binding: beyond direct target regulation. Trends Genet 2011,27(4):141–148.PubMedView Article
                4. Mantripragada KK, Buckley PG, Diaz de Stahl T, Dumanski JP: Genomic microarrays in the spotlight. Trends Genet 2004,20(2):87–94.PubMedView Article
                5. Bosher JM, Labouesse M: RNA interference: genetic wand and genetic watchdog. Nat Cell Biol 2000,2(2):E31-E36.PubMedView Article
                6. Hofmann A, Nolan GP, Blau HM: Rapid retroviral delivery of tetracycline-inducible genes in a single autoregulatory cassette [see comments]. Proc Natl Acad Sci U S A 1996,93(11):5185–5190.PubMedView Article
                7. Paulus W, Baur I, Boyce FM, Breakefield XO, Reeves SA: Self-contained, tetracycline-regulated retroviral vector system for gene delivery to mammalian cells. J Virol 1996,70(1):62–67.PubMed
                8. Meyerson M, Gabriel S, Getz G: Advances in understanding cancer genomes through second-generation sequencing. Nat Rev Genet 2010,11(10):685–696.PubMedView Article
                9. Gilmour DS, Lis JT: In vivo interactions of RNA polymerase II with genes of Drosophila melanogaster. Mol Cell Biol 1985,5(8):2009–2018.PubMed
                10. Marzio G, Tyagi M, Gutierrez MI, Giacca M: HIV-1 tat transactivator recruits p300 and CREB-binding protein histone acetyltransferases to the viral promoter. Proc Natl Acad Sci U S A 1998,95(23):13519–13524.PubMedView Article
                11. Consortium EP: The ENCODE (ENCyclopedia Of DNA Elements) Project. Science 2004,306(5696):636–640.View Article
                12. Farnham PJ: Insights from genomic profiling of transcription factors. Nat Rev Genet 2009,10(9):605–616.PubMedView Article
                13. Amati B, Land H: Myc-Max-Mad: a transcription factor network controlling cell cycle progression, differentiation and death. Curr Op Genet Dev 1994,4(1):102–108.PubMedView Article
                14. Bieda M, Xu X, Singer MA, Green R, Farnham PJ: Unbiased location analysis of E2F1-binding sites suggests a widespread role for E2F1 in the human genome. Genome Res 2006,16(5):595–605.PubMedView Article
                15. Cawley S, Bekiranov S, Ng HH, Kapranov P, Sekinger EA, Kampa D, Piccolboni A, Sementchenko V, Cheng J, Williams AJ: Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs. Cell 2004,116(4):499–509.PubMedView Article
                16. Lin CY, Loven J, Rahl PB, Paranal RM, Burge CB, Bradner JE, Lee TI, Young RA: Transcriptional Amplification in Tumor Cells with Elevated c-Myc. Cell 2012,151(1):56–67.PubMedView Article
                17. Sansregret L, Nepveu A: The multiple roles of CUX1: Insights from mouse models and cell-based assays. Gene 2008,412(1–2):84–94.PubMedView Article
                18. Moon NS, Berube G, Nepveu A: CCAAT displacement activity involves Cut repeats 1 and 2, not the Cut homeodomain. J Biol Chem 2000,275(40):31325–31334.PubMedView Article
                19. Moon NS, Premdas P, Truscott M, Leduy L, Berube G, Nepveu A: S Phase-Specific Proteolytic Cleavage Is Required to Activate Stable DNA Binding by the CDP/Cut Homeodomain Protein. Mol Cell Biol 2001, 21:6332–6345.PubMedView Article
                20. Goulet B, Baruch A, Moon NS, Poirier M, Sansregret LL, Erickson A, Bogyo M, Nepveu A: A Cathepsin L Isoform that Is Devoid of a Signal Peptide Localizes to the Nucleus in S Phase and Processes the CDP/Cux Transcription Factor. Mol Cell 2004,14(2):207–219.PubMedView Article
                21. Harada R, Vadnais C, Sansregret L, Leduy L, Berube G, Robert F, Nepveu A: Genome-wide location analysis and expression studies reveal a role for p110 CUX1 in the activation of DNA replication genes. Nucleic Acids Res 2008,36(1):189–202.PubMedView Article
                22. Kedinger V, Sansregret L, Harada R, Vadnais C, Cadieux C, Fathers K, Park M, Nepveu A: p110 CUX1 homeodomain protein stimulates cell migration and invasion in part through a regulatory cascade culminating in the repression of E-cadherin and occludin. J Biol Chem 2009,284(40):27701–27711.PubMedView Article
                23. Coqueret O, Berube G, Nepveu A: The mammalian Cut homeodomain protein functions as a cell-cycle-dependent transcriptional repressor which downmodulates p21WAF1/CIP1/SDI1 in S phase. EMBO J 1998,17(16):4680–4694.PubMedView Article
                24. Santaguida M, Ding Q, Berube G, Truscott M, Whyte P, Nepveu A: Phosphorylation of the CCAAT displacement protein (CDP)/Cux transcription factor by cyclin A-Cdk1 modulates its DNA binding activity in G(2). J Biol Chem 2001,276(49):45780–45790.PubMedView Article
                25. Santaguida M, Nepveu A: Differential regulation of CDP/Cux p110 by cyclin A/Cdk2 and cyclin A/Cdk1. J Biol Chem 2005,280(38):32712–32721.PubMedView Article
                26. Sansregret L, Gallo D, Santaguida M, Leduy L, Harada R, Nepveu A: Hyperphosphorylation by cyclin B/CDK1 in mitosis resets CUX1 DNA binding clock at each cell cycle. J Biol Chem 2010,285(43):32834–32843.PubMedView Article
                27. Truscott M, Denault JB, Goulet B, Leduy L, Salvesen GS, Nepveu A: Carboxyl-terminal proteolytic processing of CUX1 by a caspase enables transcriptional activation in proliferating cells. J Biol Chem 2007,282(41):30216–30226.PubMedView Article
                28. Ellis T, Gambardella L, Horcher M, Tschanz S, Capol J, Bertram P, Jochum W, Barrandon Y, Busslinger M: The transcriptional repressor CDP (Cutl1) is essential for epithelial cell differentiation of the lung and the hair follicle. Genes Dev 2001,15(17):2307–2319.PubMedView Article
                29. Sinclair AM, Lee JA, Goldstein A, Xing D, Liu S, Ju R, Tucker PW, Neufeld EJ, Scheuermann RH: Lymphoid apoptosis and myeloid hyperplasia in CCAAT displacement protein mutant mice. Blood 2001,98(13):3658–3667.PubMedView Article
                30. Luong MX, van der Meijden CM, Xing D, Hesselton R, Monuki ES, Jones SN, Lian JB, Stein JL, Stein GS, Neufeld EJ: Genetic ablation of the CDP/Cux protein C terminus results in hair cycle defects and reduced male fertility. Mol Cell Biol 2002,22(5):1424–1437.PubMedView Article
                31. Ledford AW, Brantley JG, Kemeny G, Foreman TL, Quaggin SE, Igarashi P, Oberhaus SM, Rodova M, Calvet JP, Vanden Heuvel GB: Deregulated Expression of the Homeobox Gene Cux-1 in Transgenic Mice Results in Downregulation of p27(kip1) Expression during Nephrogenesis, Glomerular Abnormalities, and Multiorgan Hyperplasia. Dev Biol 2002,245(1):157–171.PubMedView Article
                32. Brantley JG, Sharma M, Alcalay NI, Heuvel GBV: Cux-1 transgenic mice develop glomerulosclerosis and interstitial fibrosis. Kidney Int 2003,63(4):1240–1248.PubMedView Article
                33. Cadieux C, Kedinger V, Yao L, Vadnais C, Drossos M, Paquet M, Nepveu A: Mouse mammary tumor virus p75 and p110 CUX1 transgenic mice develop mammary tumors of various histologic types. Cancer Res 2009,69(18):7188–7197.PubMedView Article
                34. Cadieux C, Harada R, Paquet M, Cote O, Trudel M, Nepveu A, Bouchard M: Polycystic kidneys caused by sustained expression of Cux1 isoform p75. J Biol Chem 2008,283(20):13817–13824.PubMedView Article
                35. Cadieux C, Fournier S, Peterson AC, Bedard C, Bedell BJ, Nepveu A: Transgenic mice expressing the p75 CCAAT-displacement protein/Cut homeobox isoform develop a myeloproliferative disease-like myeloid leukemia. Cancer Res 2006,66(19):9492–9501.PubMedView Article
                36. Heuvel GBV, Bodmer R, McConnell KR, Nagami GT, Igarashi P: Expression Of a Cut-Related Homeobox Gene In Developing and Polycystic Mouse Kidney. Kidney Int 1996,50(2):453–461.View Article
                37. Sansregret L, Goulet B, Harada R, Wilson B, Leduy L, Bertoglio J, Nepveu A: The p110 isoform of the CDP/Cux transcription factor accelerates entry into S phase. Mol Cell Biol 2006,26(6):2441–2455.PubMedView Article
                38. Sansregret L, Vadnais C, Livingstone J, Kwiatkowski N, Awan A, Cadieux C, Leduy L, Hallett MT, Nepveu A: Cut homeobox 1 causes chromosomal instability by promoting bipolar division after cytokinesis failure. Proc Natl Acad Sci U S A 2011,108(5):1949–1954.PubMedView Article
                39. Michl P, Ramjaun AR, Pardo OE, Warne PH, Wagner M, Poulsom R, D'Arrigo C, Ryder K, Menke A, Gress T: CUTL1 is a target of TGF(beta) signaling that enhances cancer cell motility and invasiveness. Cancer Cell 2005,7(6):521–532.PubMedView Article
                40. Michl P, Downward J: CUTL1: a key mediator of TGFbeta-induced tumor invasion. Cell Cycle 2006,5(2):132–134.PubMedView Article
                41. Aleksic T, Bechtel M, Krndija D, von Wichert G, Knobel B, Giehl K, Gress TM, Michl P: CUTL1 promotes tumor cell migration by decreasing proteasome-mediated Src degradation. Oncogene 2007,26(40):5939–5949.PubMedView Article
                42. Ripka S, Neesse A, Riedel J, Bug E, Aigner A, Poulsom R, Fulda S, Neoptolemos J, Greenhalf W, Barth P: CUX1: target of Akt signalling and mediator of resistance to apoptosis in pancreatic cancer. Gut 2010,59(8):1101–1110.PubMedView Article
                43. Cubelos B, Sebastian-Serrano A, Beccari L, Calcagnotto ME, Cisneros E, Kim S, Dopazo A, Alvarez-Dolado M, Redondo JM, Bovolenta P: Cux1 and Cux2 regulate dendritic branching, spine morphology, and synapses of the upper layer neurons of the cortex. Neuron 2010,66(4):523–535.PubMedView Article
                44. Scacheri PC, Davis S, Odom DT, Crawford GE, Perkins S, Halawi MJ, Agarwal SK, Marx SJ, Spiegel AM, Meltzer PS: Genome-wide analysis of menin binding provides insights into MEN1 tumorigenesis. PLoS Genet 2006,2(4):e51.PubMedView Article
                45. Yang A, Zhu Z, Kapranov P, McKeon F, Church GM, Gingeras TR, Struhl K: Relationships between p63 binding, DNA sequence, transcription activity, and biological function in human cells. Mol Cell 2006,24(4):593–602.PubMedView Article
                46. Krig SR, Jin VX, Bieda MC, O'Geen H, Yaswen P, Green R, Farnham PJ: Identification of genes directly regulated by the oncogene ZNF217 using chromatin immunoprecipitation (ChIP)-chip assays. J Biol Chem 2007,282(13):9703–9712.PubMedView Article
                47. Harada R, Berube G, Tamplin OJ, Denis-Larose C, Nepveu A: DNA-binding specificity of the cut repeats from the human cut-like protein. Mol Cell Biol 1995,15(1):129–140.PubMed
                48. Harada R, Nepveu A: Chromatin affinity purification. Meth Mol Biol (Clifton, NJ) 2012,809(22113280):237–253.View Article
                49. Kim TH, Barrera LO, Zheng M, Qu C, Singer MA, Richmond TA, Wu Y, Green RD, Ren B: A high-resolution map of active promoters in the human genome. Nature 2005,436(7052):876–880.PubMedView Article
                50. Carroll JS, Liu XS, Brodsky AS, Li W, Meyer CA, Szary AJ, Eeckhoute J, Shao W, Hestermann EV, Geistlinger TR: Chromosome-wide mapping of estrogen receptor binding reveals long-range regulation requiring the forkhead protein FoxA1. Cell 2005,122(1):33–43.PubMedView Article
                51. Ruiz-Llorente S, de Pau EC, Sastre-Perona A, Montero-Conde C, Gomez-Lopez G, Fagin JA, Valencia A, Pisano DG, Santisteban P: Genome-wide analysis of Pax8 binding provides new insights into thyroid functions. BMC Genomics 2012, 13:147.PubMedView Article
                52. Li Q, Harju S, Peterson KR: Locus control regions: coming of age at a decade plus. Trends Genet 1999,15(10):403–408.PubMedView Article
                53. Gaszner M, Felsenfeld G: Insulators: exploiting transcriptional and epigenetic mechanisms. Nat Rev Genet 2006,7(9):703–713.PubMedView Article
                54. Gross DS, Garrard WT: Nuclease hypersensitive sites in chromatin. Annu Rev Biochem 1988, 57:159–197.PubMedView Article
                55. Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, Sheffield NC, Stergachis AB, Wang H, Vernot B: The accessible chromatin landscape of the human genome. Nature 2012,489(7414):75–82.PubMedView Article
                56. Ernst J, Kellis M: ChromHMM: automating chromatin-state discovery and characterization. Nat Methods 2012,9(3):215–216.PubMedView Article
                57. Vadnais C, Davoudi S, Afshin M, Harada R, Dudley R, Clermont PL, Drobetsky E, Nepveu A: CUX1 transcription factor is required for optimal ATM/ATR-mediated responses to DNA damage. Nucleic Acids Res 2012,40(10):4483–4495.PubMedView Article
                58. Seguin L, Liot C, Mzali R, Harada R, Siret A, Nepveu A, Bertoglio J: CUX1 and E2F1 regulate coordinated expression of the mitotic complex genes Ect2, MgcRacGAP, and MKLP1 in S phase. Mol Cell Biol 2009,29(2):570–581.PubMedView Article
                59. Truscott M, Harada R, Vadnais C, Robert F, Nepveu A: p110 CUX1 cooperates with E2F transcription factors in the transcriptional activation of cell cycle-regulated genes. Mol Cell Biol 2008,28(10):3127–3138.PubMedView Article
                60. Kraus MH, Yuasa Y, Aaronson SA: A position 12-activated H-ras oncogene in all HS578T mammary carcinosarcoma cells but not normal mammary cells of the same patient. Proc Natl Acad Sci U S A 1984,81(17):5384–5388.PubMedView Article
                61. Geck P, Maffini MV, Szelei J, Sonnenschein C, Soto AM: Androgen-induced proliferative quiescence in prostate cancer cells: the role of AS3 as its mediator. Proc Natl Acad Sci U S A 2000,97(18):10185–10190.PubMedView Article
                62. Saqr HE, Omran O, Dasgupta S, Yu RK, Oblinger JL, Yates AJ: Endogenous GD3 ganglioside induces apoptosis in U-1242 MG glioma cells. J Neurochem 2006,96(5):1301–1314.PubMedView Article
                63. Sipione S, Rigamonti D, Valenza M, Zuccato C, Conti L, Pritchard J, Kooperberg C, Olson JM, Cattaneo E: Early transcriptional profiles in huntingtin-inducible striatal cells by microarray analyses. Hum Mol Genet 2002,11(17):1953–1965.PubMedView Article
                64. Weinmann AS, Farnham PJ: Identification of unknown target genes of human transcription factors using chromatin immunoprecipitation. Methods 2002,26(1):37–47.PubMedView Article
                65. Puig O, Caspary F, Rigaut G, Rutz B, Bouveret E, Bragado-Nilsson E, Wilm M, Seraphin B: The tandem affinity purification (TAP) method: a general procedure of protein complex purification. Methods 2001,24(3):218–229.PubMedView Article
                66. Ren B, Cam H, Takahashi Y, Volkert T, Terragni J, Young RA, Dynlacht BD: E2F integrates cell cycle progression with DNA repair, replication, and G(2)/M checkpoints. Genes Dev 2002,16(2):245–256.PubMedView Article
                67. Blankenberg D, Von Kuster G, Coraor N, Ananda G, Lazarus R, Mangan M, Nekrutenko A, Taylor J: Galaxy: a web-based genome analysis tool for experimentalists. Curr Protoc Mol Biol 2010, Chapter 19:Unit 19 10 11–21.
                68. Giardine B, Riemer C, Hardison RC, Burhans R, Elnitski L, Shah P, Zhang Y, Blankenberg D, Albert I, Taylor J: Galaxy: a platform for interactive large-scale genome analysis. Genome Res 2005,15(10):1451–1455.PubMedView Article
                69. Goecks J, Nekrutenko A, Taylor J: Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol 2010,11(8):R86.PubMedView Article
                70. Bailey TL: DREME: motif discovery in transcription factor ChIP-seq data. Bioinformatics 2011,27(12):1653–1659.PubMedView Article
                71. Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS: MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res 2009,37(Web Server issue):W202-W208.PubMedView Article
                72. Gupta S, Stamatoyannopoulos JA, Bailey TL, Noble WS: Quantifying similarity between motifs. Genome Biol 2007,8(2):R24.PubMedView Article
                73. da Huang W, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 2009,4(1):44–57.PubMedView Article
                74. da Huang W, Sherman BT, Lempicki RA: Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 2009,37(1):1–13.PubMedView Article
                75. Smyth GK: Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol 2004, 3:Article3.PubMed

                Copyright

                © Vadnais et al.; licensee BioMed Central Ltd. 2013

                This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://​creativecommons.​org/​licenses/​by/​2.​0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                Advertisement