Using BAC transgenesis in zebrafish to identify regulatory sequences of the amyloid precursor protein gene in humans
BMC Genomics volume 13, Article number: 451 (2012)
Non-coding DNA in and around the human Amyloid Precursor Protein (APP) gene that is central to Alzheimer’s disease (AD) shares little sequence similarity with that of appb in zebrafish. Identifying DNA domains regulating expression of the gene in such situations becomes a challenge. Taking advantage of the zebrafish system that allows rapid functional analyses of gene regulatory sequences, we previously showed that two discontinuous DNA domains in zebrafish appb are important for expression of the gene in neurons: an enhancer in intron 1 and sequences 28–31 kb upstream of the gene. Here we identify the putative transcription factor binding sites responsible for this distal cis-acting regulation, and use that information to identify a regulatory region of the human APP gene.
Functional analyses of intron 1 enhancer mutations in enhancer-trap BACs expressed as transgenes in zebrafish identified putative binding sites of two known transcription factor proteins, E4BP4/ NFIL3 and Forkhead, to be required for expression of appb. A cluster of three E4BP4 sites at −31 kb is also shown to be essential for neuron-specific expression, suggesting that the dependence of expression on upstream sequences is mediated by these E4BP4 sites. E4BP4/ NFIL3 and XFD1 sites in the intron enhancer and E4BP4/ NFIL3 sites at −31 kb specifically and efficiently bind the corresponding zebrafish proteins in vitro. These sites are statistically over-represented in both the zebrafish appb and the human APP genes, although their locations are different. Remarkably, a cluster of four E4BP4 sites in intron 4 of human APP exists in actively transcribing chromatin in a human neuroblastoma cell-line, SHSY5Y, expressing APP as shown using chromatin immunoprecipitation (ChIP) experiments. Thus although the two genes share little sequence conservation, they appear to share the same regulatory logic and are regulated by a similar set of transcription factors.
The results suggest that the clock-regulated and immune system modulator transcription factor E4BP4/ NFIL3 likely regulates the expression of both appb in zebrafish and APP in humans. It suggests potential human APP gene regulatory pathways, not on the basis of comparing DNA primary sequences with zebrafish appb but on the model of conservation of transcription factors.
It is important to understand the regulation of the Amyloid Precursor Protein (APP) gene expression because epidemiologic studies show that Alzheimer Disease (AD) is exquisitely sensitive to gene dosage , and levels of APP expression including β-peptide levels correlate with the severity and age-of-onset of AD . The severity and onset of AD is thus closely linked to expression of the APP gene. These observations suggest that controlling APP gene expression is a possible route to reducing the severity of AD. A pre-requisite for therapeutic manipulation of APP gene expression is a more complete understanding of the mechanisms that regulate APP expression in neurons. The APP gene promoter does not contain a functional TATA box but instead has long CpG islands and a strong initiator element (INR) surrounding the major transcription start site . While transcriptional regulation of APP gene has been studied extensively, most of that work has focused on the proximal ~ 1500 bp sequences of the promoter [3–13], and it is unclear to what extent APP gene is regulated by promoter sequences alone. Like most other genes it is likely that the APP promoter is modulated by distal regulatory sequences. The non-coding DNA within and surrounding the APP gene is not conserved in vertebrates, and although ~700 bp of DNA immediately upstream of the start site is conserved in mammals, this conservation does not extend to other vertebrates such as Fugu or zebrafish [3, 14]. Thus regulation of the gene by cis-acting distal sequences remains poorly understood. Although regulatory function can be conserved across species without sequence similarity [15–19], identifying such sequences that control gene expression under those circumstances is much more difficult.
We have previously shown that two discontinuous DNA regions regulate neuron-specific appb gene expression in zebrafish. One of these is an enhancer located within intron 1; in the absence of this enhancer there is no expression of a BAC transgene that contained approximately 100 kb of 5’ sequences . The second regulatory sequence mapped to a region located between approximately 28–31 kb 5’ of the transcription start site of the zebrafish appb gene. Deletion of this element shifted the expression pattern from being neuron-specific to notochord-specific, which is the default pattern observed with the basal promoter plus intron-enhancer combination. Based on these observations, we proposed that the upstream element suppressed aberrant expression (in the notochord) and activated appropriate expression in neurons. Requirement of the upstream-enhancer for expression further suggested that zebrafish appb is regulated by interaction between these distal regulatory sequences.
Here we identify the putative transcription factor binding sites that mediate activity of these regulatory regions and use the information to study the regulation of the human APP locus. Analysis of the expression of enhancer-trap BACs containing mutated intron 1 enhancers in zebrafish indicates that binding sites of at least two known transcription factors are important for function. They are the clock-regulated and immune system modulator transcription factor E4BP4/ NFIL3 and members of the Forkhead gene family (XFD1). A search of non-coding DNA in introns and the 50 kb sequence surrounding the appb gene for additional binding sites revealed a ~ 8-fold and ~ 11-fold greater than statistical frequency of E4BP4 and XFD1 sites, respectively. Amongst these is a cluster of three E4BP4 sites at −31 kb. These sites bound the E4BP4 DNA binding domains, expressed in E. coli, efficiently and selectively in vitro. Though comparison of zebrafish and human APP did not reveal substantially conserved non-coding sequences that could represent regulatory elements, we hypothesized that gene expression may be conserved via the use of the same transcription factors. Therefore, we searched for E4BP4 binding sites in the human APP locus. Remarkably, we found that putative E4BP4 sites were also over-represented in the human APP locus, though their locations differed from that seen in the zebrafish appb. One such cluster of four E4BP4 binding sites in the fourth intron of the human APP gene was marked by a peak of acetylated histones in a human neuroblastoma cell line that expresses APP. We propose that E4BP4/ NFIL3 may regulate human APP expression via binding to distal regulatory sequences.
BACs CH211-192O20 and CH211-43O16 from a zebrafish library, designated here as BACs C and D respectively, have been described . The two BACs overlap one another and contain different lengths of sequences upstream of appb gene. Both were used in order to have maximum upstream DNA, both closest to and farthest from the appb transcription start site, in the enhancer-trap BACs. This was necessitated by the ~110 kb packaging capacity of the phage P1 head used in the generation of enhancer-trap BACs (see Figure 7 of reference ).
Generating enhancer-trap BACs
Progressive truncations from either end of BAC DNA, purification and analyses of clone DNA from deletion libraries using Field Inversion Gel Electrophoresis (FIGE) was performed using procedures described before [20–22]. Sequence of the newly created end was determined in each case using primers from the Tn10 transposon end remaining after the truncation.
BAC DNA injections into zebrafish eggs
Injections of Qiagen tip-purified enhancer-trap BAC DNA into zebrafish eggs and subsequent analyses of EGFP expression in developing embryos using fluorescence microscopy were performed as reported earlier . To generate transgenic lines, enhancer-trap BAC DNA with iTol2-end insertions were co-injected with Tol2 transposase mRNA as described previously .
Mutagenesis of intron 1 enhancer in Enhancer-trap transposon plasmid
Suitable PCR primers were used to amplify segments of the 1 kb intron enhancer, and the amplified products incorporated into the enhancer-trap Tn10 transposon plasmid. Point mutations were engineered into PCR primers so that the amplified product contained mutations in putative transcription factor binding sites that overlapped such as SOX5 and E4BP4. Thus to get mutations in only SOX5 and not E4BP4, point mutations were introduced. Changes to the putative binding sites were first incorporated into the small plasmid containing the enhancer-trap Tn10 which was then inserted into appb BACs C and D, exactly as described previously to make enhancer-trap BACs with the wild type sequence of the intron enhancer .
Preparing zebrafish Forkhead and E4BP4/ NFIL3 proteins
Conserved DNA binding domains (DBD) of the zebrafish Forkhead and E4BP4 genes were amplified from zebrafish genomic DNA using primers shown in Additional file 1: Figure S1. Restriction sites were introduced in-frame to the Forkhead and E4BP4 open reading frames (ORFs) to facilitate cloning into the pET-30a(+) expression vector. A six-histidine residue tag fused to the N-terminal end of these proteins was used for purification purposes as previously reported . DBD of Forkhead and E4BP4 were purified from bacteria as previously described . Electrophoretic Mobility Shift Assays (EMSA) were performed exactly as described earlier .
Chromatin Immunoprecipitation with H3K9Ac antibody
ChIPs, real-time PCR, and data analysis were performed as described . The anti-H3K9Ac antibody was purchased from Abcam, Cambridge, MA. The control antibody anti-IgG was obtained from Millipore, Billerica, MA. Human neuroblastoma SHSY5Y cells were propagated in an undifferentiated state, cultured in DMEM medium and 10% heat inactivated FBS. H3K9Ac ChIP was performed on undifferentiated SHSY5Y cells. ChIP primers were designed to span potential E4BP4 binding sites, and are displayed in Additional file 2: Figure S2. Primers used for detecting mRNA levels of E4BP4 in the undifferentiated cell line SHSY5Y are displayed in Additional file 3: Figure S3.
Identification of putative transcription factor (TF) binding sites
The sequence in the 1 kb intron 1 enhancer of appb was analyzed using the “MotifScanner” program, and the results are shown in Additional file 2: Figure S2 of reference . The putative TF binding sites with the highest probability scores from that analysis are highlighted in the intron enhancer sequence shown here in Additional file 4: Figure S4. Mutational analyses of putative TF binding sites within the intron 1 enhancer revealed that E4BP4/ NFIL3 and XFD1 sites were required for function. Next, the genomic DNA sequence containing either the zebrafish appb gene or the human APP gene, displayed in a Microsoft Word file, were scanned by the “Find” function for the sequence representing the binding sites of E4BP4/ NFIL3 or XFD1. The reverse strand binding sites for the two transcription factors were similarly identified using the “Find” function on the sequences complementary to the sites. The total of these constituted the putative binding sites of each transcription factor.
Sequence motif frequencies
Frequencies for random occurrence of putative binding sites were calculated by raising ¼ (which represents the probability of finding a specific nucleotide at any location) to the total number of nucleotides in the consensus binding site for the specific transcription factor. The fold over-represented was deduced from the ratio of actual occurrence at the genetic locus to what would be expected if they occurred randomly.
New technology for scanning the zebrafish appb gene locus with enhancer traps using BACs was reported in an earlier study , and is illustrated in Figure 1. The enhancer-trap comprised of a basal promoter EGFP gene flanked by 0.35 kb of sequence immediately upstream of the appb transcription initiation site and ~ 1 kb of DNA containing the intron 1 enhancer of appb (shown schematically in top panel of Figure 1). The enhancer-trap is located in front of the loxP arrowhead in the Tn10 transposon that is inserted randomly into BACs C and D overlapping the appb locus. Cre-mediated recombination between this inserted enhancer-trap-loxP and the loxP endogenous to the BAC generates libraries of appb- BAC deletions in which the loxP end is brought progressively closer to the lox511 end; while simultaneously placing the enhancer-trap at the new loxP end. The set of enhancer-trap appb- BACs is first characterized by sequencing the new ends and then expressed in zebrafish embryos. Both BACs were needed for maximal representation of upstream sequences both proximal and distal to appb transcription start site in enhancer-trap BACs (see Methods, and reference ).
Results from that previous study indicated that the enhancer in intron 1 can function specifically in non-neural tissue such as the notochord, where endogenous appb is not expressed, when used with promoter proximal elements within the +0.147 to −0.35 kb of sequence surrounding the transcription start site of appb. Thus expression in notochord was observed using either the small enhancer-trap transposon plasmids containing only the proximal promoter elements, or enhancer-trap appb- BACs in which sequences −0.35 to −31 kb had been deleted. However when additional 5’ sequences extending till approximately −31 kb in the enhancer-trap appb- BACs were present, gene expression became exquisitely specific to neurons (the vertical dashed blue lines in Figure 1 demarcate the end-points of deletions in the enhancer-trap BACs that produce the two distinct expression patterns). These results suggest that the intron enhancer works with sequences in the −31 kb region to confer neuron-specific gene expression. The results also appeared to suggest that some type of transcription repression occurs by factors binding to sequences between −28 and −31 kb to suppress expression in the notochord, as deduced from the different expression patterns of BACs Δ75C and Δ72C . Simultaneous expression in the notochord and neural cells was not observed in the same fish using BAC deletions Δ74D through Δ75C (Summarized here in Figure 1 from the data shown in Figure 7, Panels A-C of reference ). We concluded that tissue-specific expression of the appb gene resembling its endogenous pattern required two separate, somewhat distant regulatory domains to cooperate in cis to confer tissue-specificity. The two domains of regulation are the ~ 1 kb of DNA within intron 1 (ZFISH7:9:29144733–29145740), and the region +0.147 to −31 kb. As indicated in the earlier study, exclusion of the ~1 kb intron element produced no expression of GFP: appb BACs with ~100 kb of 5’ sequences without the intron 1 enhancer, i.e. BACs deleted from the wild type loxP end of insert DNA with Tn-US that lacked the intron enhancer (Figures 1B, 1C, & 2 in reference ) failed to express GFP in any tissue. Absence of the region −0.35 to −31 kb on the other hand led to expression in the inappropriate tissue, the notochord, as seen with BACs Δ72C, Δ52C, Δ38C, and Δ29C.
Expression analyses of appb-BACs with mutated intron 1 enhancer: intact putative E4BP4 and XFD1 sites essential for appb expression in zebrafish
Our previous study also reported a bio-informatic analysis of putative transcription factor binding sites within the intron 1 enhancer sequence (Additional file 2: Figure S2 in reference ). Sites with the highest probability scores in that list, such as SOX5, E4BP4, XFD1, OCT1 and GATA3, are shown schematically here in the intron enhancer (IE) of the enhancer-trap transposon in Figure 2A. Deletions from either end of the intron enhancer sequence and point mutations, to distinguish between overlapping sites, were made in these putative transcription factor binding sites in the 1 kb intron enhancer of appb. Changes to the binding sites were first incorporated into the small plasmid containing the enhancer-trap Tn10 transposon (shown as the inverted triangle in Figure 2A) which was then introduced into appb BACs C and D, exactly as described previously to make enhancer-trap BACs with the wild type sequence of the intron enhancer. A schematic representation of all enhancer-trap BACs containing mutated intron enhancers that were used in this study is shown in Figure 2B. Color-coded letters indicate the locations of the wild type binding sites of transcription factors in intron enhancer (IE), while the dashed colored lines represent deletions of that particular site. The identities of mutated enhancer-trap BACs from each mutant intron enhancer, used for expression in zebrafish in this study, are indicated adjacent to the changes made. Mutant enhancer-trap BACs chosen for expression had deletion-ends within −31 kb of the appb transcription start site. Most deletion-ends were within 10 kb, and some such as Δ28D and Δ1D had deletion-ends <2 kb of the transcription start site. The deletion-ends indicate locations of the enhancer-trap in BACs, and are represented schematically with the bent lines in Figures 1 and 2. The lowest panel in Figure 2B shows a mutated enhancer-trap BAC that was again deleted from the opposite end by a lox511-iTol2kan transposon. Enhancer-trap BAC DNAs were injected into zebrafish embryos for expression as transgenes. Locations of intron enhancer sequence-changes, and the PCR primers used to construct them, are depicted schematically in Additional file 4 Figure S4. A total of 18 mutated enhancer-trap transposon-inserted BAC libraries were made, some with a mixture of enhancer-traps containing the modified intron enhancer in both orientations. Experiments injecting DNA into zebrafish eggs were repeated at least four times with BACs that expressed EGFP fluorescence in neurons, and six times with BACs that did not express EGFP. The results obtained with transient expressions are summarized in Table 1 and indicate that the putative binding sites for OCT1, GATA3, SOX5 and the CT-repeat element are dispensable for appb expression in zebrafish neurons. Clones such as Δ94C or Δ74D that contain wild type intron enhancer (Figure 2B) generate identical expression patterns as Δ5C with CT-repeat and SOX5 deleted, or Δ1D with GATA3 and OCT1 deleted (Figure 2B, Table 1). In contrast, DNA sequences that contain E4BP4 or XFD1 sites are critical for expression of appb in neurons of zebrafish. When mutated or deleted, the resulting enhancer-trap BACs do not express either in neurons or the notochord. A FIGE of representative larger than 75 kb appb BACs with mutated intron 1 enhancers are shown in panel A of Figure 3. The enhancer-traps in these BACs are located well within 31 kb of the start site of transcription of appb.
A few of the BACs that expressed in neurons were further retrofitted with iTol2-ends at the opposite lox511 end of BAC DNA for germline propagation, using the transposon pTnlox511-iTol2kan as described recently . For example the BAC clone in lane 12 of Panel A, Figure 3, (marked by yellow arrowhead), was truncated from the lox511 end of BAC DNA using lox511-iTol2kan transposon. Clone DNAs from the deletion/retrofitting library is shown in Panel B. Clone DNA from lane 11 in Panel B (indicated by blue arrowhead) was introduced into zebrafish eggs for germline propagation, and a F2 transgenic fish representative of this line is shown in Panel E. It has an iTol2kan-EGFP-appb- BAC transgene with intron 1 enhancer deleted for GATA3, OCT1 and the CT-repeat sequences (DNA of this BAC clone is shown schematically in bottom panel of Figure 2B).
Distribution of E4BP4 and XFD1 sites in zebrafish appb gene
A search of the genome database for additional E4BP4 and XFD1 sites within and 50 kb sequence surrounding the zebrafish appb gene revealed that both these sites are highly over-represented in the gene region, approximately 8-fold and 11-fold over statistical frequency, respectively. These are shown schematically in Figure 4. Amongst these is a cluster of three E4BP4 sites at −31 kb. Locations of these sites are also tabulated in the top panel of Additional file 5: Figure S5.
Sequence analysis of the new ends created by the lox511-iTol2kan transposon in BAC DNAs in lanes 20 and 21, marked by the red arrowheads in Panel B, Figure 3, indicated that the lox511-transposon had deleted the cluster of three E4BP4 sites at −31 kb (Figure 4). When injected into zebrafish embryos neither of these DNAs expressed EGFP in neurons. Instead, expression patterns were always in the notochord (Figure 3D), demonstrating that the cluster of three E4BP4 sites at ~31 kb upstream of appb, shown schematically in Figure 4, is necessary for neuron-specific expression.
Conserved DNA binding protein domains of zebrafish Forkhead and E4BP4 specifically recognize the XFD1 and E4BP4 sites in intron enhancer and −31 kb of appb gene in vitro
We next determined whether the proposed E4BP4 and XFD1 sites in these regulatory regions bound the corresponding zebrafish proteins. For this we expressed the conserved DNA binding domains (DBD) of zebrafish E4BP4 (147 aa) and Forkhead (154 aa) in E. coli. (shown schematically in Additional file 1: Figure S1). The DNA binding domains were partially purified under non-denaturing conditions using the histidine tag, and used in Electrophoretic Mobility Shift Assays (EMSA). As EMSA probes we used probes e and f from intron 1 of appb (Figure 5). Probe e, spanning site E11 in Figure 4, formed a discrete nucleoprotein complex only with recombinant E4BP4-DBD (Figure 5A lanes 5–8) indicated by the arrow, whereas probe f, spanning site X3 in Figure 4, bound only to recombinant Forkhead-DBD, (Figure 5B lanes 1–4). Thus, the bio-informatically identified sequences were indeed true E4BP4 and XFD1 binding sites. Having determined the specificity of the probes we further tested additional putative E4BP4 binding sites (probes a-d, spanning sites E4 and E5, E6, E7, respectively in Figure 4) that lie within the cluster of sites located at −31 kb. All four new probes bound recombinant E4BP4-DBD, though with different affinities (Figure 5C). We conclude that both E4BP4 and XFD1 sites in intron 1, and the cluster of three sites at −31 kb of appb bind zebrafish E4BP4 and Forkhead DBDs efficiently and specifically in vitro.
E4BP4 binding sites in the human APP gene
Sequence comparisons of non-coding DNA within and flanking the APP gene in humans and zebrafish did not reveal appreciable conservation at the nucleotide level. However, it was possible that the regulatory logic was evolutionarily conserved between these distantly related species. To determine if the regulatory information obtained in zebrafish had implications for human APP gene regulation, we first scored for E4BP4 binding sites in and around the human APP gene. We noted a cluster of four putative E4BP4 binding sites in intron 4 of human APP (Figure 6A). Locations of these sites are also indicated in the bottom panel of Additional file 5: Figure S5. As a first step towards determining whether these sites were functionally important, we determined the histone modification state around this cluster in a human neuroblastoma cell line, SHSY5Y, which expresses human APP mRNA. Expression of E4BP4 mRNA was also confirmed in the cell line using RT-PCR. We used antibodies directed against histone H3 acetylated at lysine 9 (H3K9Ac) to carry out chromatin immunoprecipitation (ChIP) assays; the co-precipitated genomic DNA was queried by quantitative PCR using primers that identified different parts of the human APP gene (shown in Additional file 2: Figure S2). We detected a modest peak of H3K9Ac activity centered over the region with clustered E4BP4 sites (Figure 6B). As a positive control we used ChIP primers located within the promoter of the ubiquitously expressed IkappaBalpha (IkBα) gene. We also assayed H3K9Ac at a region in intron 2 of human APP that has been previously shown to be enriched for H3K27Ac (http://genome.ucsc.edu/cgi-bin/hgTracks?position=chr21:27252862-27543138&hgsid=203637875&wgEncodeHaibTfbs.Peaks.vis=full), which is another activation-associated histone modification. This region was highly modified with H3K9Ac. The fold enrichment of H3K9Ac at this site and the intron 4 sites was comparable (180-fold versus 120-fold, respectively, (Figure 6B). We conclude that the E4BP4 site-containing intron 4 region is epigenetically modified in these cells.
Several previous reports have described elegant vector systems and procedures to trap enhancers in the zebrafish [27–31]. The methods usually either insert the trap directly into the chromosomes of the organism or test sequences pre-selected based on cross-species conservation, for tissue-specific enhancer activity in Tol2 vectors. Our approach using enhancer-trap BACs is not affected by genome accessibility issues of the enhancer-trap because insertion of the trap occurs in isolated pieces of chromosomal DNA in BACs in the bacterial host (advantages discussed in ). Our approach is also likely to be free of biases, as there is no prior selection of sequences to test for enhancer activity. The ability to analyze multiple discontinuous DNA domains that act in concert to regulate expression of a gene such as appb appears a likely advantage of the enhancer-trap BAC approach.
A large number of enhancer-trap BACs with deletions/ mutations in the intron 1 enhancer and the upstream −31 kb region were analyzed in zebrafish, and putative binding sites for E4BP4 and XFD1 were identified as being critical for appb expression in neurons (Figures 2 and 3, Table 1). The two zebrafish transcription factor proteins, expressed in E. coli, bound efficiently and selectively in vitro to their respective sites identified as being essential for appb expression through mutational analysis (Figure 5). These results were then used to explore whether a similar set of transcription factors could also regulate expression of the human APP gene. We noted that binding sites of E4BP4 and XFD1 were also statistically over-represented at the human APP gene locus (Figure 6A). Chromatin immunoprecipitation (ChIP) analyses with H3K9Ac antibody of a cluster of four E4BP4 sites in intron 4 of human APP indicated that they were epigenetically modified (Figure 6B). It suggests the sites are functionally important in actively transcribing chromatin and are highly likely to serve a regulatory role. SiRNA knock-down experiments to further demonstrate that E4BP4 protein is actually involved in this regulation will need to wait till protocols for efficient transfection of the SHSY5Y neuroblastoma cells are devised.
Cluster of three E4BP4 sites at −31 kb required for expression of zebrafish appb
Expression of the BAC DNAs shown in lanes 20 and 21 of Figure 3B is not in neurons, quite unlike the other clones from the same deletion series. Sequence analyses of the new BAC end created by the lox511-iTol2kan transposon insertion indicated that the cluster of three E4BP4 sites at −31 kb was deleted in both these clones. The dependence of appb expression in zebrafish neurons on upstream sequences had also been mapped to that same region in our previous study with deletions made by the enhancer-trap transposon from the opposite loxP end of BAC . The conclusion that ~28 kb of upstream sequence is required for neuronal expression was derived from the strikingly different patterns of expression of the two enhancer-trap appb BACs Δ75C and Δ72C (see Figure 7 in reference ), which had enhancer-trap locations at ~28 and >31 kb upstream, respectively, of the appb transcription start site. That the three E4BP4 sites at −31 kb are important for neuron-specific expression of appb in zebrafish is thus re-confirmed here more directly. The three E4BP4 sites at −31 kb also bind E4BP4-DBD efficiently and specifically in vitro (Figure 5).
Some members of Forkhead gene family expressed exclusively in zebrafish notochord
An earlier study  indicates the Forkhead family of transcription factors fkd1, fkd2 and fkd4 are expressed during gastrulation in the zebrafish, with high levels of fkd1 and fkd4 mRNA accumulating exclusively in the notochord during somitogenesis. It suggests that fkd1, fkd2 and fkd4 proteins are available only in the notochord and thus could explain the expression of EGFP exclusively in the notochord when the cluster of three upstream E4BP4 sites at −31 kb are deleted in the enhancer-trap appb-BACs (Figures 3, 4).
E4BP4 has repressor activity in addition to activation properties, is intricately involved with the immune system and its expression is clock-regulated
The transcription factor E4BP4, also known as NFIL3, has been known to have both transcription activation and repression activities [34, 35]. It serves in the Central Nervous System (CNS) as an anti-apoptotic factor to promote survival and growth of motor neurons . As NFIL3, it is also intricately linked with the immune system, where it is required for protecting natural killer (NK) T cells  and regulates IL-12 p40 in macrophages . Strikingly, E4BP4/ NFIL3 has recently been found to regulate the IL12b gene by acting as a repressor from a distal enhancer 10 kb upstream of the gene using the STAT3 pathway . We believe these characteristics of E4BP4/ NFIL3 are very relevant to our findings because the importance of immunological and inflammatory processes in the pathogenesis and therapy of Alzheimer's disease is well documented [40, 41].
Results presented here indicate that the cluster of three E4BP4 sites at −31 kb in the zebrafish gene (Figure 4) is critical for neuron-specific expression of EGFP from promoter elements of appb in conjunction with the intron enhancer. Earlier reports of E4BP4 having transcription repression activity [34–38], thus facilitates formulating the following working hypothesis for appb gene regulation: binding of both E4BP4 and Forkhead proteins to appb intron 1 DNA is required for appb gene expression. The dependence of neuronal appb expression on the three E4BP4 sites at −31 kb could then be explained as follows: with excess E4BP4, possibly from those bound to sequences at −31 kb, Forkhead activity is suppressed, and expression is specific to neurons. In the absence of these upstream sequences, sequestered levels of E4BP4 are low, and expression is in notochord.
Possible model for appb gene regulation
We propose a novel interplay between Fkd and E4BP4/ NFIL3, either directly or indirectly through other proteins, for restricting expression of appb to neurons. Although required, the E4BP4 bound to the lone site in the minimal intron 1 enhancer appears not to have enough repressive function to prevent expression in the notochord. Thus in the absence of the cluster of three E4BP4 sites at −31 kb, expression is exclusive to the notochord because that is where fkd1, fkd2 and fkd4 proteins are localized . When the cluster of three E4BP4 sites is present, Forkhead bound to XFD1 is suppressed by E4BP4 proteins and expression is exclusive to neurons. The DNA region 28–31 kb upstream of appb is likely to have additional activator sites that enhance neuron-specific expression.
For E4BP4/ NFIL3 and Forkhead to regulate appb gene expression in the CNS, the proteins need to be available in the zebrafish brain. Although evidence for availability of these proteins in brain is lacking for zebrafish, expression has been reported for the Forkhead protein in neurons of the mouse spinal cord , while the E4BP4/ NFIL3 protein has been shown to be expressed in the embryonic motor neurons of both rat and chicken .
Testing hypothesis for regulation of APP in humans
It is likely that APP gene regulation shares common features in zebrafish and humans. However, the lack of conservation in sequence of non-coding DNA around the APP gene in these model vertebrates has been a dilemma . We explored the hypothesis that conservation may be at the level of the transcription factors involved. A search for E4BP4 and XFD1 sites in and around the APP gene reveals a much greater than statistical frequency of both these sites, just as in the case of the zebrafish appb gene. There are 22 putative binding sites for the human E4BP4/ NFIL3, about 6-fold above statistical frequency, as shown in Figure 6A, with an additional one in exon 23 (not shown). Although there are only two sites with the XFD1 consensus sequence, sites with 8 of 9 bases identical (consecutively) to the consensus site exist far more abundantly in human APP (13 additional such sites were identified, but not shown in Figure 6A), leading one to speculate that protein complexes capable of binding to these sites might have evolved to accommodate the single end-nucleotide change. The ChIP experiments using H3K9Ac antibodies to immunoprecipitate actively transcribing chromosomal regions in the undifferentiated SHSY5Y human cell line identifies the cluster of four E4BP4 binding sites in intron 4 as active compared to three other regions in the same gene used as negative controls (shown in Figure 6 panels B and C). These negative controls are introns 15.1, 15.2 and 18 within the same APP gene. It appears likely therefore that E4BP4/ NFIL3 also regulates human APP.
Variation in levels of β-amyloid in mice brains follows a circadian pattern
The 42-amino acid β-amyloid peptide levels in brain interstitial fluid of mice have been reported to correlate directly with wakefulness . Although the study did not find a similar correlation of full length APP in total tissue homogenates, it is intriguing that expression of the transcription factor E4BP4 shown here to regulate appb/ APP expression follows circadian rhythm controls .
Extrapolating results from zebrafish to the human to formulate hypothesis
Identifying gene regulatory DNA domains with conserved function but without conserved sequence across species is a daunting task, especially when they are located differently in the gene region as noted here between appb and APP. We propose that regulation of the APP gene in humans occurs by a mechanism similar to that of the appb gene in zebrafish, using a similar set of transcription factors that bind to sites distributed differently across the gene in the two species. The zebrafish system allowed rapid identification of important gene-regulatory sequences through mutational analyses. The system also helped delineate between several candidate transcription factor proteins that could potentially bind to the same DNA sequence. A database search for proteins that contain the conserved DNA-binding domain of the Forkhead gene family of transcription factors identified several other DNA-binding proteins. Our ability to focus on the Forkhead family of proteins arose from the previous finding in zebrafish that members of the family fkd1, fkd2 and fkd4 are expressed during gastrulation, with high levels of fkd1 and fkd4 mRNA accumulating exclusively in the notochord during somitogenesis .
Here we have functionally analyzed mutations in the two discontinuous DNA domains in zebrafish appb that were shown earlier to be important for expression of the gene in neurons. Previously known transcription factor E4BP4/ NFIL3 and Forkhead binding sites are shown to be required for intron 1 enhancer function. Dependence of neuron specific expression on sequences 31 kb upstream of appb is shown to reside in a cluster of three E4BP4 sites. Both E4BP4/ NFIL3 and Forkhead sites in these regulatory domains bind the corresponding zebrafish proteins efficiently and selectively in vitro. These sites exist in non-coding DNA of both the zebrafish and human APP genes at levels much above statistical frequency. Furthermore, a cluster of four E4BP4 sites in intron 4 of human APP is shown to be epigenetically marked with H3K9Ac in a human neuroblastoma cell-line that expresses APP. Taken together these findings suggest that appb in zebrafish and APP in humans may follow the same regulatory logic using the same set of transcription factors despite a lack of sequence similarity in their regulatory DNA. It suggests potential human APP gene regulatory pathways, not on the basis of comparing DNA primary sequences with zebrafish appb but on the model of conservation of transcription factors.
Amyloid Precursor Protein (human)/
- appb :
Amyloid Precursor Protein gene (zebrafish)/
Transcription Start Site/
Field Inversion Gel Electrophoresis/
Electrophoretic Mobility Shift Assay/
DNA Binding Domain/
Central Nervous System
Fetal Bovine serum.
Dillen K, Annaert W: A two decade contribution of molecular cell biology to the centennial of Alzheimer’s disease: are we progressing toward therapy?. Int Rev Cytol. 2006, 254: 215-300.
Perneczky R, Tsolakidou A, Arnold A, Diehl-Schmid J, Grimmer T, Förstl H, Kurz A, Alexopoulos P: CSF soluble amyloid precursor proteins in the diagnosis of incipient Alzheimer disease. Neurology. 2011, 77: 35-38. 10.1212/WNL.0b013e318221ad47.
Theuns J, Brouwers N, Engelborghs S, Sleegers K, Bogaerts V, Corsmit E, De Pooter T, van Duijn CM, De Deyn PP, Van Broeckhoven C: Promoter mutations that increase amyloid precursor-protein expression are associated with Alzheimer disease. Am J Hum Genet. 2006, 78: 936-946. 10.1086/504044.
Salbaum JM, Weidemann A, Lemaire HG, Masters CL, Beyreuther K: The promoter of Alzheimer’s disease amyloid A4 precursor gene. EMBO J. 1988, 7: 2807-2813.
Yoshikai SI, Sasaki H, Dohura K, Fururya H, Sakaki Y: Genomic organization of the human amyloid beta-protein precursor gene. Gene. 1990, 87: 257-263. 10.1016/0378-1119(90)90310-N.
Lahiri DK, Robakis NK: The promoter activity of the gene encoding Alzheimer beta-amyloid precursor protein (APP) is regulated by two blocks of upstream sequences. Brain Res Mol Brain Res. 1991, 9: 253-257.
Song W, Lahiri DK: Functional identification of the promoter of the gene encoding the Rhesus monkey beta-amyloid precursor protein. Gene. 1998, 217: 165-176. 10.1016/S0378-1119(98)00340-0.
Lahiri DK, Song W, Ge YW: Analysis of the 5′-flanking region of the beta-amyloid precursor protein gene that contributes to increased promoter activity in differentiated neuronal cells. Brain Res Mol Brain Res. 2000, 77: 185-198.
Richardson JC, Kendal CE, Anderson R, Priest F, Gower E, Soden P, Gray R, Topps S, Howlett DR, Lavender D, Clarke NJ, Barnes JC, Haworth R, Stewart MG, Rupniak HTR: Ultrastructural and behavioural changes precede amyloid deposition in a transgenic model of Alzheimer’s disease. Neuroscience. 2003, 122: 213-228. 10.1016/S0306-4522(03)00389-0.
Lahiri DK, Ge YW, Maloney B: Characterization of the APP proximal promoter and 5’-untranslated regions: Identification of cell-type specific domains and implications in APP gene expression and Alzheimer’s disease. FASEB J. 2005, 19: 653-665.
Rogers JT, Randall JD, Cahill CM, Eder PS, Huang X, Gunshin H, Leiter L, McPhee J, Sarang SS, Utsuki T, et al: An iron-responsive element type II in the 5’ untranslated region of the Alzheimer’s amyloid precursor protein transcript. J. Biol Chem. 2002, 277: 518-528.
Shaw KT, Utsuki T, Rogers J, Yu QS, Sambamurti K, Brossi A, Ge YW, Lahiri DK, Greig NH: Phenserine regulates translation of beta-amyloid precursor protein mRNA by a putative interleukin-1 responsive element, a target for drug development. Proc Natl Acad Sci USA. 2001, 98: 7605-7610. 10.1073/pnas.131152998.
Maloney B, Ge Y-W, Greig N, Lahiri DK: Presence of a “CAGA box” in the APP gene unique to amyloid plaque-forming species and absent in all APP-1/2 genes: implications in Alzheimer’s disease. FASEB J. 2004, 18: 1288-1290.
Shakes LA, Malcolm TL, Allen KL, De S, Harewood KR, Chatterjee PK: Context dependent function of appb enhancer identified using enhancer trap-containing BACs as transgenes in Zebrafish. Nucleic Acids Research. 2008, 36: 6237-6248. 10.1093/nar/gkn628.
Fisher S, Grice EA, Vinton RM, Bessling SL, McCallion AS: Conservation of RET regulatory function from human to Zebrafish without sequence similarity. Science. 2006, 14: 276-279.
Chan ET, Quon GT, Chua G, Babak T, Trochesset M, Zirngibl RA, Aubin J, Ratcliffe MJ, Wilde A, Brudno M, Morris QD, Hughes TR: Conservation of core gene expression in vertebrate tissues. J Biol. 2009, 8: e33-10.1186/jbiol130.
Blow MJ, McCulley DJ, Li Z, Zhang T, Akiyama JA, Holt A, Plajzer-Frick I, Shoukry M, Wright C, Chen F, Afzal V, Bristow J, Ren B, Black BL, Rubin EM, Visel A, Pennacchio LA: ChIP-Seq identification of weakly conserved heart enhancers. Nat Genet. 2010, 42: 806-810. 10.1038/ng.650.
Kague E, Bessling SL, Lee J, Hu G, Passos-Bueno MR, Fisher S: Functionally conserved cis-regulatory elements of COL18A1 identified through zebrafish transgenesis. Dev Biol. 2010, 337: 496-505. 10.1016/j.ydbio.2009.10.028.
Taher L, McGaughey DM, Maragh S, Aneas I, Bessling SL, Miller W, Nobrega MA, McCallion AS, Ovcharenko I: Genome-wide identification of conserved regulatory function in diverged sequences. Genome Res. 2011, 7: 1139-1149.
Chatterjee PK, Yarnall DP, Haneline SA, Godlevski MM, Thornber SJ, Robinson PS, Davies HE, White NJ, Riley JH, Shepherd NS: Direct sequencing of bacterial and P1 artificial chromosome nested-deletions for identifying position-specific single nucleotide polymorphisms. Proc. Natl. Acad. Sci. (USA). 1999, 96: 13276-13281. 10.1073/pnas.96.23.13276.
Chatterjee PK: Retrofitting BACs and PACs with LoxP Transposons to Generate Nested Deletions. “Bacterial Artificial Chromosomes” vol 1, pp 231–241. Methods in Mol. Biology. Edited by: Shaying Z, Marvin S. 2004, The Humana Press Inc
Chatterjee PK, Baker JC: Preparing Nested Deletions Template DNA for Field Inversion Gel Electrophoresis Analyses and Position-Specific End Sequencing With Transposon Primers. “Bacterial Artificial Chromosomes” vol 1, pp 243–254. Methods in Mol. Biology. Edited by: Shaying Z, Marvin S. 2004, The Humana Press Inc
Shakes LA, Abe G, Eltayeb MA, Wolf HM, Kawakami K, Chatterjee PK: Generating libraries of iTol2-end insertions at BAC ends using loxP and lox511 Tn10 transposons. BMC Genomics. 2011, 12: 351-10.1186/1471-2164-12-351.
Chatterjee PK, Pruzan R, Flint SJ: Purification of an active TATA-binding protein-containing factor using a monoclonal antibody that recognizes the human TATA-binding protein. Protein Expr Purif. 1993, 5: 445-455.
Erman B, Cortes M, Nikolajczyk BS, Speck NA, Sen R: ETS-core binding factor: a common composite motif in antigen receptor gene enhancers. Mol. Cell. Biol. 1998, 18: 1322-1330.
Chakraborty T, Perlot T, Subrahmanyam R, Jani A, Goff PH, Zhang Y, Ivanova I, Alt FW, Sen R: A 220-nucleotide deletion of the intronic enhancer reveals an epigenetic hierarchy in immunoglobulin heavy chain locus activation. J. Exp. Med. 2009, 206: 1019-1027. 10.1084/jem.20081621.
Ellingsen S, Laplante MA, Konig M, Kikuta H, Furmanek T, Hoivik EA, Becker TS: Large-scale enhancer detection in the zebrafish genome. Development. 2005, 132: 3799-3811. 10.1242/dev.01951.
Kawakami K: Transposon tools and methods in zebrafish. Developmental Dynamics. 2005, 234: 244-254. 10.1002/dvdy.20516.
Nagayoshi S, Hayashi E, Abe G, Osato N, Asakawa K, Urasaki A, Horikawa K, Ikeo K, Takeda H, Kawakami K: Insertional mutagenesis by the Tol2 transposon-mediated enhancer trap approach generated mutations in two developmental genes: tcf7 and synembryn-like. Development. 2008, 135 (1): 159-169.
Bessa J, Tena JJ, de la Calle-Mustienes E, Fernández-Miñán A, Naranjo S, Fernández A, Montoliu L, Akalin A, Lenhard B, Casares F, Gómez-Skarmeta JL: Zebrafish enhancer detection (ZED) vector: a new tool to facilitate transgenesis and the functional analysis of cis-regulatory regions in zebrafish. Dev Dyn. 2009, 238: 2409-2417. 10.1002/dvdy.22051.
Royo JL, Hidalgo C, Roncero Y, Seda MA, Akalin A, Lenhard B, Casares F, Gómez-Skarmeta JL: Dissecting the transcriptional regulatory properties of human chromosome 16 highly conserved non-coding regions. PLoS One. 2011, 6: e24824-10.1371/journal.pone.0024824.
Wolf HM, Iranloye O, Norford DC, Chatterjee PK, Functionalizing Bacterial Artificial Chromosomes with Transposons to explore Gene Regulation: Bacterial Artificial Chromosomes. 2011, 45-62. InTech, (http://www.intechopen.com), Rijeka, Croatia (2011) ISBN 978-953-307-725-3
Odenthal J, Nüsslein-Volhard C: Fork head domain genes in zebrafish. Dev Genes Evol. 1998, 208: 245-258. 10.1007/s004270050179.
Cowell IG: E4BP4/NFIL3, a PAR-related bZIP factor with many roles. BioEssays. 2002, 24: 1023-1029. 10.1002/bies.10176.
Bozek K, Relogio A, Kielbasa SM, Heine M, Dame C, Kramer A, Herzel H: Regulation of clock-controlled genes in mammals. PLOS One. 2009, 4: e4882-10.1371/journal.pone.0004882.
Junghans D, Chauvet S, Buhler E, Dudley K, Sykes T, Henderson CE: The CES-2-related transcription factor E4BP4 is an intrinsic regulator of motoneuron growth and survival. Development. 2004, 131: 4425-4434. 10.1242/dev.01313.
Kamizono S, Duncan GS, Seidel MG, Morimoto A, Hamada K, Grosveld G, Akashi K, Lind EF, Haight JP, Ohashi PS, Look AT, Mak TW: Nfil3/E4bp4 is required for the development and maturation of NK cells in vivo. J Exp Med. 2009, 206: 2977-2986. 10.1084/jem.20092176.
Kobayashi T, Matsuoka K, Sheikh SZ, Elloumi HZ, Kamada N, Hisamatsu T, Hansen JJ, Doty KR, Pope SD, Smale ST, Hibi T, Rothman PB, Kashiwada M, Plevy SE: NFIL3 is a regulator of IL-12 p40 in macrophages and mucosal immunity. J. Immunol. 2011, 186: 4649-4655. 10.4049/jimmunol.1003888.
Smith AM, Qualls JE, O'Brien K, Balouzian L, Johnson PF, Schultz-Cherry S, Smale ST, Murray PJ: A distal enhancer in Il12b is the target of transcriptional repression by the STAT3 pathway and requires the basic leucine zipper (B-ZIP) protein NFIL3. J Biol Chem. 2011, 2011286: 23582-23590.
Popović M, Caballero-Bleda M, Puelles L, Popović N: Importance of immunological and inflammatory processes in the pathogenesis and therapy of Alzheimer's disease. Int J Neurosci. 1998, 95: 203-236. 10.3109/00207459809003341.
Cojocaru IM, Cojocaru M, Miu G, Sapira V: Study of interleukin-6 production in Alzheimer's disease. Rom J Intern Med. 2011, 49: 55-58.
Morikawa Y, Komori T, Hisaoka T, Senba E: Detailed expression pattern of Foxp1 and its possible roles in neurons of the spinal cord during embryogenesis. Dev Neurosc. 2009, 31: 511-522. 10.1159/000243715.
Kang J-E, Lim MM, Bateman RJ, Lee JJ, Smyth LP, Cirrito JR, Fujiki N, Nishino S, Holtzman DM: Amyloid-β dynamics are regulated by orexin and the sleep-wake cycle. Science. 2009, 326: 1005-1007. 10.1126/science.1180962.
The project described was supported by Award Number P20MD000175 from the National Center on Minority Health and Health Disparities. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Center on Minority Health and Health Disparities or the National Institutes of Health, and funds from the North Carolina Biotechnology Center. We thank Ms. Cicely Williams, Rosalind Grays, Connie Keys, Crystal McMichael, Camilla Felton and Darlene Laws for support and encouragement. PKC would like to thank Drs. Ken Harewood and Sean Kimbro for encouragement and funding support.
The authors declare that they have no competing interests.
LAS screened enhancer-trap BAC libraries by FIGE, sequenced BAC ends, injected BAC DNA into zebrafish embryos, and documented positive embryos using photo-microscopy. HD helped with construction of the Fkd expression plasmid, purified the proteins, performed EMSA assays with E4BP4 and Fkd proteins and analyzed data. HMW discovered the unique distribution of E4BP4 and XFD1 binding sites in non-coding DNA in and around the zebrafish appb and human APP genes, designed and constructed the expression plasmids for E4BP4 and Fkd proteins, and helped write the manuscript. CH screened BAC libraries using FIGE, supplied zebrafish eggs and provided animal care for the zebrafish embryos. DCN supervised zebrafish animal care, analyzed data and helped write the manuscript. PKC designed the study along with RS, helped with design and construction of the enhancer-trap transposon plasmids, generated the enhancer-trap libraries of BAC deletions, purified BAC DNA for injections using Qiagen columns, and was responsible for writing the article with RS. Both RS and PKC critically evaluated the study. PP conducted the ChIP experiments and analyzed data. All authors read and approved the final manuscript.
Leighcraft A Shakes, Hansen Du contributed equally to this work.
Electronic supplementary material
Additional file 1: Figure S1. The E. coli expressed DNA-binding protein domain common to members within gene family, shown in green (A) E4BP4/ NFIL3, or red (B) Fkd. The corresponding DNA sequence that was amplified using the PCR primers is underlined. (PDF 14 KB)
Additional file 2: Figure S2. Sequences of primers, analyzed using pDRWN32 software, used to probe chromatin immune-precipitates with H3K9Ac or control IgG antibody. (PDF 51 KB)
Additional file 3: Figure S3. Sequences of primers used to detect E4BP4/ NFIL3 mRNA levels in the human cell line SHSY5Y using RT-PCR. (PDF 75 KB)
Additional file 4: Figure S4. Sequence of intron 1 enhancer is indicated in black letters, and the sequences of predicted transcription factor (TF) binding sites indicated in colored letters. The long arrows indicate the location and directionality of PCR primers used to delete specific transcription factor binding sites. The thick short underlines within the SOX5 site indicate point mutations introduced that leave the overlapping E4BP4 site intact. (PDF 14 KB)
Authors’ original submitted files for images
About this article
Cite this article
Shakes, L.A., Du, H., Wolf, H.M. et al. Using BAC transgenesis in zebrafish to identify regulatory sequences of the amyloid precursor protein gene in humans. BMC Genomics 13, 451 (2012). https://doi.org/10.1186/1471-2164-13-451