Nuclear Receptor HNF4α Binding Sequences are Widespread in Alu Repeats
© Bolotin et al; licensee BioMed Central Ltd. 2011
Received: 18 March 2011
Accepted: 15 November 2011
Published: 15 November 2011
Alu repeats, which account for ~10% of the human genome, were originally considered to be junk DNA. Recent studies, however, suggest that they may contain transcription factor binding sites and hence possibly play a role in regulating gene expression.
Here, we show that binding sites for a highly conserved member of the nuclear receptor superfamily of ligand-dependent transcription factors, hepatocyte nuclear factor 4alpha (HNF4α, NR2A1), are highly prevalent in Alu repeats. We employ high throughput protein binding microarrays (PBMs) to show that HNF4α binds > 66 unique sequences in Alu repeats that are present in ~1.2 million locations in the human genome. We use chromatin immunoprecipitation (ChIP) to demonstrate that HNF4α binds Alu elements in the promoters of target genes (ABCC3, APOA4, APOM, ATPIF1, CANX, FEMT1A, GSTM4, IL32, IP6K2, PRLR, PRODH2, SOCS2, TTR) and luciferase assays to show that at least some of those Alu elements can modulate HNF4α-mediated transactivation in vivo (APOM, PRODH2, TTR, APOA4). HNF4α-Alu elements are enriched in promoters of genes involved in RNA processing and a sizeable fraction are in regions of accessible chromatin. Comparative genomics analysis suggests that there may have been a gain in HNF4α binding sites in Alu elements during evolution and that non Alu repeats, such as Tiggers, also contain HNF4α sites.
Our findings suggest that HNF4α, in addition to regulating gene expression via high affinity binding sites, may also modulate transcription via low affinity sites in Alu repeats.
As much as 50% of the ~3 billion base pairs in the human genome may be derived from repetitive DNA sequence . While repetitive DNA is often referred to as "junk" DNA, even when that term was originally coined it was hypothesized that junk DNA may play an active role in genome function . The notion that repetitive DNA may play a regulatory role and be involved in the evolution of gene regulation was also postulated early on, although it was not until recently that there was evidence to support those ideas [3–5].
A major category of repetitive DNA is short interspersed nuclear elements (SINEs), which are believed to have originated from the 7SL RNA gene that is part of the ribosome complex . In the human genome, the largest class of SINEs are Alu repeats, which at ~1.2 million copies account for ~10% of the human genome . Alu elements were first characterized as ~300 nucleotide repetitive sequences that contain an AluI restriction site (5'-AGCT-3') from the bacterium Arthrobacter luteus[7, 8]. Alu elements, which are still mobile in the human genome by virtue of the action of a LINE-1 reverse transcriptase , are a relatively recent occurrence evolutionarily. They are found exclusively in primates, including humans, and hence are postulated to have entered the mammalian genome ~60-65 million years ago .
Alu elements have been implicated in several human diseases including leukemia, hemophilia and breast cancer, suggesting that their impact on human health may be significant . There are several well characterized examples of Alu insertions affecting splicing patterns and hence protein function . A variety of transcription factor (TF) binding sites (TFBSs) have also been characterized in Alu elements, including sites for YY1 , Sp1 , tumor suppressor p53 , homeodomain and TATA binding proteins . Nuclear receptors (NR), which belong to a superfamily of ligand-dependent TFs, have also been found to have binding sites in Alu elements: retinoid acid receptor (RAR, NR1B) , estrogen receptor (ER, NR3A) [18, 19], progesterone receptor (PR, NR3C3)  and vitamin D receptor (VDR, NR1I1) . Alu insertions have also been shown to alter the expression of at least six human genes: CD8a (CD8A), keratin 18 (KRT18), parathyroid hormone (PTH), Wilm's tumor 1 (WT1), receptor for Fc fragment of IgE, high affinity I, gamma polypeptide (FCER1G) and breast cancer 1, early onset (BRCA1). Therefore, Alu sequences may regulate the level of transcripts and hence proteins in the cell, as well as the function of those proteins.
Hepatocyte nuclear factor 4 alpha, (HNF4α, NR2A1) is a member of the NR superfamily that is highly expressed in the liver, as well as the kidney, intestine (large and small), pancreas and stomach . HNF4α is best known for its role in the adult liver and pancreas, as well as in early development [24, 25]; it also has an emerging role in the gut [26–28]. The HNF4Α gene is mutated in an inherited form of type 2 diabetes, maturity onset diabetes of the young 1 (MODY1) , and was recently identified as a susceptibility locus in inflammatory bowel disease (IBD) . Mutations in HNF4α binding sites have also been directly linked to human diseases, including hemophilia and MODY3 [31, 32]. Many NRs are common drug targets ; the recent identification of the endogenous ligand of HNF4α that binds in a reversible fashion also makes HNF4α a potential drug target [34, 35].
In addition to its medical relevance, HNF4α also appears to play a unique role in the evolution of NRs. It is highly conserved across species, with 100% amino acid conservation in the DNA binding domain of all mammalian HNF4α. While HNF4α is most similar to the retinoid × receptor alpha (RXRα, NR2B1), unlike many other NRs, it does not heterodimerize with RXR. Rather, it binds DNA in the form of direct repeats separated by one nucleotide (DR1, AGGTCAxAGGTCA) exclusively as a homodimer . HNF4α has been found in every animal organism examined thus far, including sponge and coral , and has been postulated to be the ancestor of the entire NR family .
Many hundreds of HNF4α target genes have been identified by both classical promoter analysis as well as more modern genome-wide studies [32, 39–41]. During one such genomic study, we observed a very uneven frequency profile of individual HNF4α binding sequences . Specifically, we noted that a certain DNA sequence designated H4.141 (5'-AGGCTGaAGTGCA-3') was > 100-fold overrepresented compared to other HNF4α binding sites in the human, but not the mouse, genome (see additional file 1: Figure S1). In the current study, we investigate the notion that these and other HNF4α binding sequences are in Alu repeats. We use the powerful high throughput technology of protein binding micorarrays (PBMs) to show that HNF4α does indeed bind numerous sequences in Alu repeats in vitro. We perform ChIP and luciferase assays to show that HNF4α binds at least some Alu sequences in vivo and that those binding events are associated with transcriptional activation. Finally, we investigate accessibility of these sites by correlation with DNase hypsersensitivity data and evolutionary conservation by comparative genomic analysis.
HNF4α binds Alu repeats in vitro
Frequency of Alu-derived sequences bound by HNF4α in Alu repeats and the human genome.
# in Alus
# in hg18
HNF4α binds Alu repeats in the promoter region of target genes in vivo
HNF4α activates transcription from Alu elements
Frequency of HNF4α sites in Alu and non Alu repeats
Number of Alu repeats with HNF4α binding sites in the human genome (hg 18).
# of H4 sites in Alu
# of Alus
Total # of H4 sites
Alu families in human genome (hg18) with HNF4α binding sites.
# with H4
# in hg18
Alu subfamilies in human genome (hg18) with HNF4α binding sites.
# with H4
# in hg18
Non Alu repeat families in human genome (hg18) with HNF4α binding sites.
Sorted by number of HNF4α sites
Sorted by prevalence of HNF4α sites
# with H4
# in hg18
# with H4
# in hg18
Frequency of HNF4α-Alu elements in promoters and DNase hypersensitive sites
Others have shown that the region 5000 bp upstream from the TSS (+1) contains on average 3.63 Alu elements . We analyzed the same promoter region and found that every human gene has on average 2.91 HNF4α-Alu elements, consistent with the overall high proportion of Alu elements with an HNF4α site (Tables 3 and 4). To determine which Alu elements may be accessible, and hence potentially play a role in transcription regulation, we determined the number of HNF4α-Alu elements that reside within DNase hypersensitive regions using datasets from the ENCODE project [46, 47]. Genome-wide 46,129 HNF4α-Alu elements (~6.2% of all HNF4α-Alu's) are within DNase hypersensitive regions across mutliple cell lines, with 5458 genes containing one or more HNF4α-Alu/DNase sites in their 5 kb promoter region. ~7000 HNF4α-Alu elements are in DNase hypersensitive regions in HepG2 cells alone (6212 from Rep Track 1 and 8127 from Rep Track 2). While these findings may be an underestimate due to the difficulty of sequencing through repetitive elements, they nonetheless indicate that while the majority of the ~750,000 HNF4α-Alu elements may not be accessible in most cell types, a sizeable portion of HNF4α-Alu elements are in regions of open chromatin and hence may be transcriptionally active.
Age of Alu repeats in HNF4α target genes
The functional relevance of repetitive DNA such as Alu repeats in the human genome has been debated ever since they were first discovered several decades ago. In this study, we show that the nuclear receptor HNF4α binds Alu-derived 13-mers in vitro as well as Alu elements in the promoters of HNF4α target genes in vivo. We show that HNF4α sites in Alu elements can drive gene expression in luciferase assays and that HNF4α binding sites are found in ~64% of all known Alu repeats in the genome (~1.2 million HNF4α sites in ~750,000 Alu elements). Additionally, we found that while HNF4α sites are predominantly found in Alu repeats, they are also found in other repeats such as SVA elements, which contain a portion of Alu repeat , and L2, MIR and Tigger families of retrotransposons.
Functionality of HNF4α-Alu elements
Perhaps the most important question is how many of the HNF4α-Alu elements are functional. Several recent studies suggest that Alu elements may indeed play a role in regulating gene expression: Alu elements are enriched in regions with genes , particularly in housekeeping and metabolism genes. However, they are underrepresented in developmental genes , suggesting that their presence in those genes may be detrimental. Binding sites for other NRs have also been found in Alu repeats and several of those sites were found to affect transcription [17, 19–21]. To determine what types of genes contain HNF4α-Alu elements, we performed a Gene Ontology (GO) analysis of genes enriched with HNF4α-Alu elements (> 8 per 5 kb promoter region) and found RNA processing and transcription regulation genes, as well as macromolecular catabolic processes and complex assembly genes (see additional file 2: Table S6 for a full list of significant GO categories and relevant genes). RNA processing is not a category previously associated with classical HNF4α binding sites, but Alu elements have been found to play a direct role in alternative splicing .
In a detailed, genome-wide analysis of functional targets of HNF4α and binding sites, we recently found that only 30% of genes down regulated in an HNF4α RNAi experiment contained a potential classical HNF4α binding site . While the other 70% could be indirect targets, it is also possible that some of those genes are regulated by HNF4α-containing Alu elements, consistent with our finding here that on average every gene in the human genome contains ~2.91 HNF4α-Alu elements within 5000 bp upstream of the TSS. On an individual gene basis, we found that even though the HNF4α binding sites in Alu repeats are not high affinity sites compared to the majority of classical HNF4α sites, they are nonetheless capable of driving the expression of a heterologous gene on their own. In the context of the genome, however, the HNF4α-Alu elements are typically present in conjunction with other TFBS in the promoter, including other HNF4α binding sites, suggesting that they may act in more of a modulatory capacity than as the sole drivers of transcription, as we observed on the APOA4 promoter. These results are similar to those found for other NRs albeit on different binding sites within the Alu elements [19–21].
The functionality of HNF4α-Alu elements, as with any potential TFBS, will also depend on the state of the local chromatin and the accessibility of the site to HNF4α. While it has been reported that most Alu repeats in the human genome contain CpG dinucleotides that are methylated , potentially rendering them nonfunctional, the Alu elements that are hypomethylated tend to be in promoter regions, suggesting that they are accessible [52, 53]. Indeed, our analysis showed that there may be as many as ~46,000 HNF4α-Alu elements in DNase hypersensitive regions genome-wide, suggesting that they may be accessible for binding and therefore may affect transcription.
Alu repeats as a sink for HNF4α protein?
In addition to affecting transcription directly, it is tempting to speculate that the relatively large number of HNF4α-Alu elements, especially in regions of open chromatin, could act as a sink or reservoir for HNF4α protein. We have estimated by semi-quantitative immunoblotting that there may be as many as 450,000 molecules of HNF4α in the nucleus of an adult mouse hepatocyte (unpublished observation); this estimate is consistent with the fact that we originally had to purify HNF4α only ~5,000 to 10,000-fold from adult rat liver nuclei . Assuming that human hepatocytes have similar levels of HNF4α protein and keeping in mind that HNF4α binds DNA only as a dimer , this suggests that the presence of ~7000 to 46,000 HNF4α-Alu elements in accessible regions of the genome would not have a significant impact on the availability of ~225,000 HNF4α protein dimers in a normal adult hepatocyte nucleus. However, conditions that significantly alter the accessibility of the ~750,000 HNF4α-Alu elements genome-wide, or the amount of HNF4α protein, could in theory result in a situation in which the stoichiometry of HNF4α-Alu sites to HNF4α protein is indeed relevant. For example, global loss of DNA methylation has been associated with cancer progression and there is at least one report in which certain Alu elements lose methylation during tumor progression . Likewise, a decrease in the amount of functional HNF4α protein, such as that found in heterozygous MODY1 patients , activation of signaling pathways [56–61], DNA damage via p53 [62, 63], microRNAs , diet [35, 65, 66] and diseases such as colitis and cancer [67, 68] could tip the balance between HNF4α protein and potential binding sites, rendering the notion of Alu elements as a sink of HNF4α potentially relevant. The stoichiometry of HNF4α protein to total HNF4α binding sites may also differ in other tissues and developmental time points , which could alter the relevance of HNF4α-Alu elements.
The ~1.2 million HNF4α binding sites in ~750,000 Alu elements in the human genome has the potential to affect the expression of HNF4α target genes. Therefore, it will be important to keep the HNF4α-Alu elements in mind when investigating HNF4α function, especially when using non primates as models for humans and when investigating conditions, such as cancer, where there may be genome-scale alterations in chromatin accessibility. These results join the increasing number of reports of NR and other TF binding sites in Alu or other repeat elements  and support the notion that repetitive DNA may be more than just "junk" DNA.
PBM design and analysis
A custom-designed 8x15k Alu PBM (PBM3) containing 8 grids, each of which consisted of ~15,000 spots of DNA, was ordered from Agilent (Figure 1). An in silico Alu library of ~200 DNA sequences was made by extracting every unique 13-mer from every Alu element consensus from the RepBase database (http://www.girinst.org/repbase/). The human genome (hg18) was searched with the Alu library and the 100 most frequent sites were included on PBM3. The 13-mer Alu library was further searched with the support vector machine (SVM) model described in Bolotin et al . (The SVM is an algorithm trained on sequences bound by HNF4α in the PBM; it predicts the binding HNF4α binding with correlation R2 = 0.76.) The top 100 scoring potential HNF4α binding sites from the SVM search were included on PBM3 for a total of 200-derived Alu sequences. Another 704 sequences were included from permutations of three adjacent positions in every combination of the DR1 consensus (5'-AGGTCAaAGGTCA-3') and 768 sequences from similar permutations of a DR2 consensus (5'-AGGTCAaaAGGTCA-3'). Additionally, 100 randomly generated 13-mers and 50 randomly generated 14-mers were included as negative controls for the DR1s and DR2s, respectively. Finally, an additional 2,061 unique sequences were generated from an SVM search of all human genes for a total of 3802 unique DNA sequences, each of which was replicated 4 times on the PBM for a total of 15,208 DNA spots. The linker and cap sequences were the same as those described in Bolotin et al. . (See additional file 2: Table S5 for a list of all DNA sequences on PBM3 and the corresponding HNF4α binding score.)
Crude nuclear extracts of COS-7 cells transfected with human HNF4α2 or HNF4α8 expression vectors was applied to PBM3 (~400 ng HNF4α protein per grid) and visualized and analyzed as described in Bolotin et al. . The primary antibody was a mouse monoclonal that recognizes the C-terminal region of HNF4α (H1415 from R&D Systems); the secondary was NL-637 anti-Mouse IgG (NL008 from R&D Systems). PBMs were scanned using a GenePix Axon 4000B scanner (Molecular Devices, Sunnyvale, CA) at 543 nm (Cy3) dUTP and 633 nm (Cy5-conjugated secondary antibody). Since there was no significant difference between the HNF4α2 and HNF4α8 isoforms, which differ by ~30 amino acids in the N-terminal region but have identical DNA binding and dimerization/ligand binding domains, the average of the four grids (two with HNF4α2 and two with HNF4α8) were used for the final PBM3 score. The sequences with a score > 0.612 (i.e., 2 SD above the mean of the random controls, p-value < .045) were considered to be HNF4α binders.
ChIP and RNAi Expression Profiling
HNF4α ChIP from HepG2 cells was performed as described in . Quantitative-PCR (qPCR) following the ChIP was performed using BioRad IQ SYBR Green Supermix. Each 23.5-ul reaction included 12.5 ul of Supermix, 0.25 ul of 100 nmol of each primer, 0.5 ul of template and 10 ul of ddH2O. The qPCR was performed as follows: 95°C for 5 min (hot start), followed by 40 cycles 95°C for 30 sec (melt), 30 sec at the melting temperature (Tm) for annealing and extension, followed by a melt curve. The Tm was determined experimentally for each pair of primers by using a temperature gradient qPCR that was visualized on an ethidium bromide-stained agarose gel to control for product size. All qPCR was performed using BioRad iQ5 and myQ5 thermocyclers. (See additional file 2:Table S2 for a complete list of PCR primers giving a positive ChIP signal.) Affymetrix expression profiling data for the HNF4α RNAi knockdown in HepG2 cells were obtained from Bolotin et al. .
Human embryonic kidney (HEK 293T) cells were plated (0.25 × 106 cells) in 12-well plates. After 24 hr the cells were transfected using Lipofectamine 2000 according to the manufacturer's protocol (Invitrogen), with different amounts of empty vector (pcDNA3) or wild type human HNF4α2 in pcDNA3, 1 μg of the luciferase reporter and 200 ng of a CMV.βgal control. Cells were harvested after 24 hr using Triton lysis buffer (1% Triton X-100, 25 mM Gly-Gly pH 7.8, 15 mM MgSO4, 4 mM EGTA, 1 mM DTT). Luciferase and β-gal activity were measured as described earlier . Significant differences in luciferase activity between cells transfected with empty vector or human HNF4α2 were determined by the Student's t-test. APOM, PRODH2 and TTR luciferase constructs were created by cloning PCR products of the Alu elements in the respective promoters into pGL4.23 (Promega): the APOM construct used SfiI restriction sites and the PRODH2 and TTR constructs used NheI and KpnI sites. The APOA4.Luc construct was made by cloning a PCR product from the human APOA4 promoter (-1343 to +247) into the pGL4.10 vector (Promega) at HindIII and NheI sites. Site-directed mutations were introduced into the HNF4α binding sites in the Alu and PBM elements using the QuikChange kit (Stratagene). Luciferase reporter constructs with classical HNF4α response elements (RE-1 and RE-2) were made by inserting the appropriate synthetic oligonucleotides into pGL4.23. All constructs were sequence verified. (See additional file 2: Table S2 for the sequence of the PCR primers and oligonucleotides used in the constructions.)
Searches of human genome hg18 downloaded from UCSC Genome Browser (http://genome.ucsc.edu) were conducted using all of the sequences that HNF4α bound in PBM3 using Seqmap . Alu and non Alu repeats with HNF4α sites were identified by comparing the HNF4α genome-wide search results to the repeat coordinates obtained from Repeat Masker Track version 3.2.7 in UCSC Genome Browser. The results were processed using custom Perl scripts and an SQL database. To determine accessibility of HNF4α-Alu sites, we used BEDtools software package  to cross reference our list of ~750,000 HNF4α-Alu elements (Table 2) with DNase hypersensitivity tracks in the ENCODE Project in UCSC Genome Bioinformatics, allowing for one nucleotide or more of overlap. We used both the clustered track that contains data from multiple human cell lines (http://genome.ucsc.edu/cgi-bin/hgTrackUi?hgsid=211217271&g=wgEncodeRegDnaseClustered) as well as tracks for two different repetitions of HepG2 cells (http://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg18&g=wgEncodeUwDnaseSeq). Gene Ontology analysis of genes containing HNF4α-Alu elements was done using DAVID . We used as a cut off eight HNF4α-Alu elements within 5 kb upstream of +1, two SD above the average number of sites (2.91+4.22).
Acknowledgements and funding
We thank D. Mane-Padros and L. Vuong for the luciferase constructs with classical HNF4α response elements and B. Fang for predicting mutations in HNF4α binding sites. This work was funded by a PhRMA Foundation fellowship to EB, and grants to FMS from the UCR Institute for Integrative Genome Biology and the NIH (R21 MH087397, R01 DK053892). KC, W H-V, CY and JMS were supported by NIH R01 DK053892. EB was supported by NIH R21 MH087397. The funding bodies did not have any role in the study design, data collection, manuscript preparation or submission.
- Lander ES, Linton LM, Birren B, CN, et al: Initial sequencing and analysis of the human genome. Nature. 2001, 409 (6822): 860-921. 10.1038/35057062.PubMedView ArticleGoogle Scholar
- Ohno S: So much "junk" DNA in our genome. Brookhaven Symp Biol. 1972, 23: 366-370.PubMedGoogle Scholar
- van de Lagemaat LN, Landry JR, Mager DL, Medstrand P: Transposable elements in mammals promote regulatory variation and diversification of genes with specialized functions. Trends Genet. 2003, 19 (10): 530-536. 10.1016/j.tig.2003.08.004.PubMedView ArticleGoogle Scholar
- Davidson EH, Britten RJ: Organization, transcription, and regulation in the animal genome. Q Rev Biol. 1973, 48 (4): 565-613. 10.1086/407817.PubMedView ArticleGoogle Scholar
- Orgel LE, Crick FH: Selfish DNA: the ultimate parasite. Nature. 1980, 284 (5757): 604-607. 10.1038/284604a0.PubMedView ArticleGoogle Scholar
- Ullu E, Tschudi C: Alu sequences are processed 7SL RNA genes. Nature. 1984, 312 (5990): 171-172. 10.1038/312171a0.PubMedView ArticleGoogle Scholar
- Rubin CM, Houck CM, Deininger PL, Friedmann T, Schmid CW: Partial nucleotide sequence of the 300-nucleotide interspersed repeated human DNA sequences. Nature. 1980, 284 (5754): 372-374. 10.1038/284372a0.PubMedView ArticleGoogle Scholar
- Houck CM, Rinehart FP, Schmid CW: A ubiquitous family of repeated DNA sequences in the human genome. J Mol Biol. 1979, 132 (3): 289-306. 10.1016/0022-2836(79)90261-4.PubMedView ArticleGoogle Scholar
- Batzer MA, Deininger PL: Alu repeats and human genomic diversity. Nat Rev Genet. 2002, 3 (5): 370-379. 10.1038/nrg798.PubMedView ArticleGoogle Scholar
- Liu GE, Alkan C, Jiang L, Zhao S, Eichler EE: Comparative analysis of Alu repeats in primate genomes. Genome Res. 2009, 19 (5): 876-885. 10.1101/gr.083972.108.PubMedPubMed CentralView ArticleGoogle Scholar
- Deininger PL, Batzer MA: Alu repeats and human disease. Mol Genet Metab. 1999, 67 (3): 183-193. 10.1006/mgme.1999.2864.PubMedView ArticleGoogle Scholar
- Kreahling J, Graveley BR: The origins and implications of Aluternative splicing. Trends Genet. 2004, 20 (1): 1-4. 10.1016/j.tig.2003.11.001.PubMedView ArticleGoogle Scholar
- Humphrey GW, Englander EW, Howard BH: Specific binding sites for a pol III transcriptional repressor and pol II transcription factor YY1 within the internucleosomal spacer region in primate Alu repetitive elements. Gene Expr. 1996, 6 (3): 151-168.PubMedGoogle Scholar
- Oei S-L, Babich VS, Kazakov VI, Usmanova NM, Kropotov AV, Tomilin NV: Clusters of regulatory signals for RNA polymerase II transcription associated with Alu family repeats and CpG islands in human promoters. Genomics. 2004, 83 (5): 873-882. 10.1016/j.ygeno.2003.11.001.PubMedView ArticleGoogle Scholar
- Cui F, Sirotin MV, Zhurkin VB: Impact of Alu repeats on the evolution of human p53 binding sites. Biol Direct. 2011, 6 (1): 2-10.1186/1745-6150-6-2.PubMedPubMed CentralView ArticleGoogle Scholar
- Thornburg BG, Gotea V, Makaowski W: Transposable elements as a significant source of transcription regulating signals. Gene. 2006, 365: 104-110.PubMedView ArticleGoogle Scholar
- Laperriere D, Wang T-T, White JH, Mader S: Widespread Alu repeat-driven expansion of consensus DR2 retinoic acid response elements during primate evolution. BMC Genomics. 2007, 8: 23-10.1186/1471-2164-8-23.PubMedPubMed CentralView ArticleGoogle Scholar
- Norris J, Fan D, Aleman C, Marks JR, Futreal PA, Wiseman RW, Iglehart JD, Deininger PL, McDonnell DP: Identification of a new subclass of Alu DNA repeats which can function as estrogen receptor-dependent transcriptional enhancers. J Biol Chem. 1995, 270 (39): 22777-22782. 10.1074/jbc.270.39.22777.PubMedView ArticleGoogle Scholar
- Mason CE, Shu FJ, Wang C, Session RM, Kallen RG, Sidell N, Yu T, Liu MH, Cheung E, Kallen CB: Location analysis for the estrogen receptor-alpha reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements. Nucleic Acids Res. 2010, 38 (7): 2355-2368. 10.1093/nar/gkp1188.PubMedPubMed CentralView ArticleGoogle Scholar
- Jacobsen BM, Jambal P, Schittone SA, Horwitz KB: ALU repeats in promoters are position-dependent co-response elements (coRE) that enhance or repress transcription by dimeric and monomeric progesterone receptors. Mol Endocrinol. 2009, 23 (7): 989-1000. 10.1210/me.2009-0048.PubMedPubMed CentralView ArticleGoogle Scholar
- Gombart AF, Saito T, Koeffler HP: Exaptation of an ancient Alu short interspersed element provides a highly conserved vitamin D-mediated innate immune response in humans and primates. BMC Genomics. 2009, 10: 321-10.1186/1471-2164-10-321.PubMedPubMed CentralView ArticleGoogle Scholar
- Britten RJ: DNA sequence insertion and evolutionary variation in gene regulation. Proceedings of the National Academy of Sciences of the United States of America. 1996, 93 (18): 9374-9377. 10.1073/pnas.93.18.9374.PubMedPubMed CentralView ArticleGoogle Scholar
- Bolotin E, Schnabl J, Sladek F: HNF4A (Homo sapiens). Transcription Factor Encyclopedia. 2009, [http://burgundy.cmmt.ubc.ca/cgi-bin/tfe/home.pl]Google Scholar
- Hayhurst GP, Lee YH, Lambert G, Ward JM, Gonzalez FJ: Hepatocyte nuclear factor 4alpha (nuclear receptor 2A1) is essential for maintenance of hepatic gene expression and lipid homeostasis. Mol Cell Biol. 2001, 21: 1393-1403. 10.1128/MCB.21.4.1393-1403.2001.PubMedPubMed CentralView ArticleGoogle Scholar
- Watt AJ, Garrison WD, Duncan SA: HNF4: a central regulator of hepatocyte differentiation and function. Hepatology. 2003, 37: 1249-1253. 10.1053/jhep.2003.50273.PubMedView ArticleGoogle Scholar
- Babeu JP, Darsigny M, Lussier CR, Boudreau F: Hepatocyte nuclear factor 4alpha contributes to an intestinal epithelial phenotype in vitro and plays a partial role in mouse intestinal epithelium differentiation. Am J Physiol Gastrointest Liver Physiol. 2009, 297 (1): G124-134. 10.1152/ajpgi.90690.2008.PubMedView ArticleGoogle Scholar
- Cattin AL, Le Beyec J, Barreau F, Saint-Just S, Houllier A, Gonzalez FJ, Robine S, Pincon-Raymond M, Cardot P, Lacasa M, et al: Hepatocyte nuclear factor 4alpha, a key factor for homeostasis, cell architecture, and barrier function of the adult intestinal epithelium. Mol Cell Biol. 2009, 29 (23): 6294-6308. 10.1128/MCB.00939-09.PubMedPubMed CentralView ArticleGoogle Scholar
- Darsigny M, Babeu JP, Dupuis AA, Furth EE, Seidman EG, Levy E, Verdu EF, Gendron FP, Boudreau F: Loss of hepatocyte-nuclear-factor-4alpha affects colonic ion transport and causes chronic inflammation resembling inflammatory bowel disease in mice. PLoS One. 2009, 4 (10): e7609-10.1371/journal.pone.0007609.PubMedPubMed CentralView ArticleGoogle Scholar
- Yamagata K, Furuta H, Oda N, Kaisaki PJ, Menzel S, Cox NJ, Fajans SS, Signorini S, Stoffel M, Bell GI: Mutations in the hepatocyte nuclear factor-4alpha gene in maturity-onset diabetes of the young (MODY1). Nature. 1996, 384 (6608): 458-460. 10.1038/384458a0.PubMedView ArticleGoogle Scholar
- Barrett JC, Lee JC, Lees CW, Prescott NJ, Anderson CA, Phillips A, Wesley E, Parnell K, Zhang H, Drummond H, et al: Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region. Nat Genet. 2009, 41 (12): 1330-1334. 10.1038/ng.483.PubMedView ArticleGoogle Scholar
- Ryffel GU: Mutations in the human genes encoding the transcription factors of the hepatocyte nuclear factor (HNF)1 and HNF4 families: functional and pathological consequences. J Mol Endocrinol. 2001, 27 (1): 11-29. 10.1677/jme.0.0270011.PubMedView ArticleGoogle Scholar
- Sladek F, Seidel S: Hepatocyte nuclear factor 4 alpha. Nuclear Receptors and Genetic Diseases. 2001, London: Academic Press, 309-361.Google Scholar
- Overington JP, Al-Lazikani B, Hopkins AL: How many drug targets are there?. Nat Rev Drug Discov. 2006, 5 (12): 993-996. 10.1038/nrd2199.PubMedView ArticleGoogle Scholar
- Hwang-Verslues WW, Sladek FM: HNF4alpha--role in drug metabolism and potential drug target?. Curr Opin Pharmacol. 2010, 10 (6): 698-705. 10.1016/j.coph.2010.08.010.PubMedPubMed CentralView ArticleGoogle Scholar
- Yuan X, Ta TC, Lin M, Evans JR, Dong Y, Bolotin E, Sherman MA, Forman BM, Sladek FM: Identification of an endogenous ligand bound to a native orphan nuclear receptor. PLoS ONE. 2009, 4: e5609-10.1371/journal.pone.0005609.PubMedPubMed CentralView ArticleGoogle Scholar
- Jiang G, Nepomuceno L, Hopkins K, Sladek FM: Exclusive homodimerization of the orphan receptor hepatocyte nuclear factor 4 defines a new subclass of nuclear receptors. Mol Cell Biol. 1995, 15: 5131-5143.PubMedPubMed CentralView ArticleGoogle Scholar
- Sladek FM: What are nuclear receptor ligands?. Mol Cell Endocrinol. 2011, 334 (1-2): 3-13. 10.1016/j.mce.2010.06.018.PubMedView ArticleGoogle Scholar
- Bridgham JT, Eick GN, Larroux C, Deshpande K, Harms MJ, Gauthier ME, Ortlund EA, Degnan BM, Thornton JW: Protein evolution by molecular tinkering: diversification of the nuclear receptor superfamily from a ligand-dependent ancestor. PLoS Biol. 2010, 8 (10):Google Scholar
- Ellrott K, Yang C, Sladek FM, Jiang T: Identifying transcription factor binding sites through Markov chain optimization. Bioinformatics (Oxford, England). 2002, 18 (Suppl 2): S100-109. 10.1093/bioinformatics/18.suppl_2.S100.View ArticleGoogle Scholar
- Odom DT, Zizlsperger N, Gordon DB, Bell GW, Rinaldi NJ, Murray HL, Volkert TL, Schreiber J, Rolfe PA, Gifford DK, et al: Control of pancreas and liver gene expression by HNF transcription factors. Science. 2004, 303: 1378-1381. 10.1126/science.1089769.PubMedPubMed CentralView ArticleGoogle Scholar
- Odom DT, Dowell RD, Jacobsen ES, Nekludova L, Rolfe PA, Danford TW, Gifford DK, Fraenkel E, Bell GI, Young RA: Core transcriptional regulatory circuitry in human hepatocytes. Mol Syst Biol. 2006, 2: 2006 0017-PubMedPubMed CentralView ArticleGoogle Scholar
- Bolotin E, Liao H, Ta TC, Yang C, Hwang-Verslues W, Evans JR, Jiang T, Sladek FM: Integrated approach for the identification of human hepatocyte nuclear factor 4alpha target genes using protein binding microarrays. Hepatology. 2010, 51 (2): 642-653. 10.1002/hep.23357.PubMedPubMed CentralView ArticleGoogle Scholar
- Jiang G, Sladek FM: The DNA binding domain of hepatocyte nuclear factor 4 mediates cooperative, specific binding to DNA and heterodimerization with the retinoid × receptor alpha. J Biol Chem. 1997, 272: 1218-1225. 10.1074/jbc.272.2.1218.PubMedView ArticleGoogle Scholar
- Quentin Y: Origin of the Alu family: a family of Alu-like monomers gave birth to the left and the right arms of the Alu elements. Nucleic Acids Res. 1992, 20 (13): 3397-3401. 10.1093/nar/20.13.3397.PubMedPubMed CentralView ArticleGoogle Scholar
- Polak P, Domany E: Alu elements contain many binding sites for transcription factors and may play a role in regulation of developmental processes. BMC Genomics. 2006, 7: 133-10.1186/1471-2164-7-133.PubMedPubMed CentralView ArticleGoogle Scholar
- Sabo PJ, Kuehn MS, Thurman R, Johnson BE, Johnson EM, Cao H, Yu M, Rosenzweig E, Goldy J, Haydock A, et al: Genome-scale mapping of DNase I sensitivity in vivo using tiling DNA microarrays. Nat Methods. 2006, 3 (7): 511-518. 10.1038/nmeth890.PubMedView ArticleGoogle Scholar
- Sabo PJ, Hawrylycz M, Wallace JC, Humbert R, Yu M, Shafer A, Kawamoto J, Hall R, Mack J, Dorschner MO, et al: Discovery of functional noncoding elements by digital analysis of chromatin structure. Proceedings of the National Academy of Sciences of the United States of America. 2004, 101 (48): 16837-16842. 10.1073/pnas.0407387101.PubMedPubMed CentralView ArticleGoogle Scholar
- Goodman M: The genomic record of Humankind's evolutionary roots. Am J Hum Genet. 1999, 64 (1): 31-39. 10.1086/302218.PubMedPubMed CentralView ArticleGoogle Scholar
- Ostertag EM, Goodier JL, Zhang Y, Kazazian HH: SVA elements are nonautonomous retrotransposons that cause disease in humans. Am J Hum Genet. 2003, 73 (6): 1444-1451. 10.1086/380207.PubMedPubMed CentralView ArticleGoogle Scholar
- Grover D, Mukerji M, Bhatnagar P, Kannan K, Brahmachari SK: Alu repeat analysis in the complete human genome: trends and variations with respect to genomic composition. Bioinformatics (Oxford, England). 2004, 20 (6): 813-817. 10.1093/bioinformatics/bth005.View ArticleGoogle Scholar
- Kreahling J, Graveley BR: The origins and implications of Aluternative splicing. Trends Genet. 2004, 20 (1): 1-4. 10.1016/j.tig.2003.11.001.PubMedView ArticleGoogle Scholar
- Xie H, Wang M, Bonaldo Mde F, Smith C, Rajaram V, Goldman S, Tomita T, Soares MB: High-throughput sequence-based epigenomic analysis of Alu repeats in human cerebellum. Nucleic Acids Res. 2009, 37 (13): 4331-4340. 10.1093/nar/gkp393.PubMedPubMed CentralView ArticleGoogle Scholar
- Xie H, Wang M, de Andrade A, Bonaldo Mde F, Galat V, Arndt K, Rajaram V, Goldman S, Tomita T, Soares MB: Genome-wide quantitative assessment of variation in DNA methylation patterns. Nucleic Acids Res. 2011, 39 (10): 4099-4108. 10.1093/nar/gkr017.PubMedPubMed CentralView ArticleGoogle Scholar
- Sladek FM, Zhong WM, Lai E, Darnell JE: Liver-enriched transcription factor HNF-4 is a novel member of the steroid hormone receptor superfamily. Genes Dev. 1990, 4 (12B): 2353-2365. 10.1101/gad.4.12b.2353.PubMedView ArticleGoogle Scholar
- Xie H, Wang M, Bonaldo Mde F, Rajaram V, Stellpflug W, Smith C, Arndt K, Goldman S, Tomita T, Soares MB: Epigenomic analysis of Alu repeats in human ependymomas. Proceedings of the National Academy of Sciences of the United States of America. 2010, 107 (15): 6952-6957. 10.1073/pnas.0913836107.PubMedPubMed CentralView ArticleGoogle Scholar
- Sun K, Montana V, Chellappa K, Brelivet Y, Moras D, Maeda Y, Parpura V, Paschal BM, Sladek FM: Phosphorylation of a conserved serine in the deoxyribonucleic acid binding domain of nuclear receptors alters intracellular localization. Mol Endocrinol. 2007, 21 (6): 1297-1311. 10.1210/me.2006-0300.PubMedView ArticleGoogle Scholar
- Xie X, Liao H, Dang H, Pang W, Guan Y, Wang X, Shyy JY, Zhu Y, Sladek FM: Down-regulation of hepatic HNF4alpha gene expression during hyperinsulinemia via SREBPs. Mol Endocrinol. 2009, 23 (4): 434-443. 10.1210/me.2007-0531.PubMedPubMed CentralView ArticleGoogle Scholar
- Hong YH, Varanasi US, Yang W, Leff T: AMP-activated protein kinase regulates HNF4alpha transcriptional activity by inhibiting dimer formation and decreasing protein stability. J Biol Chem. 2003, 278 (30): 27495-27501. 10.1074/jbc.M304112200.PubMedView ArticleGoogle Scholar
- Leclerc I, Lenzner C, Gourdon L, Vaulont S, Kahn A, Viollet B: Hepatocyte nuclear factor-4alpha involved in type 1 maturity-onset diabetes of the young is a novel target of AMP-activated protein kinase. Diabetes. 2001, 50 (7): 1515-1521. 10.2337/diabetes.50.7.1515.PubMedView ArticleGoogle Scholar
- Viollet B, Kahn A, Raymondjean M: Protein kinase A-dependent phosphorylation modulates DNA-binding activity of hepatocyte nuclear factor 4. Mol Cell Biol. 1997, 17 (8): 4208-4219.PubMedPubMed CentralView ArticleGoogle Scholar
- Hatzis P, Kyrmizi I, Talianidis I: Mitogen-activated protein kinase-mediated disruption of enhancer-promoter communication inhibits hepatocyte nuclear factor 4alpha expression. Mol Cell Biol. 2006, 26 (19): 7017-7029. 10.1128/MCB.00297-06.PubMedPubMed CentralView ArticleGoogle Scholar
- Maeda Y, Seidel SD, Wei G, Liu X, Sladek FM: Repression of hepatocyte nuclear factor 4alpha tumor suppressor p53: involvement of the ligand-binding domain and histone deacetylase activity. Mol Endocrinol. 2002, 16 (2): 402-410. 10.1210/me.16.2.402.PubMedGoogle Scholar
- Maeda Y, Hwang-Verslues WW, Wei G, Fukazawa T, Durbin ML, Owen LB, Liu X, Sladek FM: Tumour suppressor p53 down-regulates the expression of the human hepatocyte nuclear factor 4alpha (HNF4alpha) gene. Biochem J. 2006, 400 (2): 303-313. 10.1042/BJ20060614.PubMedPubMed CentralView ArticleGoogle Scholar
- Takagi S, Nakajima M, Kida K, Yamaura Y, Fukami T, Yokoi T: MicroRNAs regulate human hepatocyte nuclear factor 4alpha, modulating the expression of metabolic enzymes and cell cycle. J Biol Chem. 2010, 285 (7): 4415-4422. 10.1074/jbc.M109.085431.PubMedView ArticleGoogle Scholar
- Selva DM, Hogeveen KN, Innis SM, Hammond GL: Monosaccharide-induced lipogenesis regulates the human hepatic sex hormone-binding globulin gene. J Clin Invest. 2007, 117 (12): 3979-3987.PubMedPubMed CentralGoogle Scholar
- Chiang JY: Regulation of bile acid synthesis: pathways, nuclear receptors, and mechanisms. J Hepatol. 2004, 40 (3): 539-551. 10.1016/j.jhep.2003.11.006.PubMedView ArticleGoogle Scholar
- Tanaka T, Jiang S, Hotta H, Takano K, Iwanari H, Sumi K, Daigo K, Ohashi R, Sugai M, Ikegame C, et al: Dysregulated expression of P1 and P2 promoter-driven hepatocyte nuclear factor-4alpha in the pathogenesis of human cancer. J Pathol. 2006, 208: 662-672. 10.1002/path.1928.PubMedView ArticleGoogle Scholar
- Ahn SH, Shah YM, Inoue J, Morimura K, Kim I, Yim S, Lambert G, Kurotani R, Nagashima K, Gonzalez FJ, et al: Hepatocyte nuclear factor 4alpha in the intestinal epithelial cells protects against inflammatory bowel disease. Inflamm Bowel Dis. 2008, 14 (7): 908-920. 10.1002/ibd.20413.PubMedPubMed CentralView ArticleGoogle Scholar
- Briancon N, Weiss MC: In vivo role of the HNF4alpha AF-1 activation domain revealed by exon swapping. Embo J. 2006, 25 (6): 1253-1262. 10.1038/sj.emboj.7601021.PubMedPubMed CentralView ArticleGoogle Scholar
- Bourque G, Leong B, Vega VB, Chen X, Lee YL, Srinivasan KG, Chew JL, Ruan Y, Wei CL, Ng HH, et al: Evolution of the mammalian transcription factor binding repertoire via transposable elements. Genome Res. 2008, 18 (11): 1752-1762. 10.1101/gr.080663.108.PubMedPubMed CentralView ArticleGoogle Scholar
- Hwang-Verslues WW, Sladek FM: Nuclear receptor hepatocyte nuclear factor 4alpha1 competes with oncoprotein c-Myc for control of the p21/WAF1 promoter. Mol Endocrinol. 2008, 22: 78-90.PubMedView ArticleGoogle Scholar
- Jiang H, Wong WH: SeqMap: mapping massive amount of oligonucleotides to the genome. Bioinformatics (Oxford, England). 2008, 24 (20): 2395-2396. 10.1093/bioinformatics/btn429.View ArticleGoogle Scholar
- Quinlan AR, Hall IM: BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics (Oxford, England). 2010, 26 (6): 841-842. 10.1093/bioinformatics/btq033.View ArticleGoogle Scholar
- Huang da W, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009, 4 (1): 44-57.PubMedView ArticleGoogle Scholar
- Jurka J: Evolutionary impact of human Alu repetitive elements. Curr Opin Genet Dev. 2004, 14 (6): 603-608. 10.1016/j.gde.2004.08.008.PubMedView ArticleGoogle Scholar
- Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Res. 2004, 14: 1188-1190. 10.1101/gr.849004.PubMedPubMed CentralView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.