- Research article
- Open Access
Differences in enhancer activity in mouse and zebrafish reporter assays are often associated with changes in gene expression
BMC Genomicsvolume 13, Article number: 713 (2012)
Phenotypic evolution in animals is thought to be driven in large part by differences in gene expression patterns, which can result from sequence changes in cis- regulatory elements (cis- changes) or from changes in the expression pattern or function of transcription factors (trans- changes). While isolated examples of trans- changes have been identified, the scale of their overall contribution to regulatory and phenotypic evolution remains unclear.
Here, we attempt to examine the prevalence of trans- effects and their potential impact on gene expression patterns in vertebrate evolution by comparing the function of identical human tissue-specific enhancer sequences in two highly divergent vertebrate model systems, mouse and zebrafish. Among 47 human conserved non-coding elements (CNEs) tested in transgenic mouse embryos and in stable zebrafish lines, at least one species-specific expression domain was observed in the majority (83%) of cases, and 36% presented dramatically different expression patterns between the two species. Although some of these discrepancies may be due to the use of different transgenesis systems in mouse and zebrafish, in some instances we found an association between differences in enhancer activity and changes in the endogenous gene expression patterns between mouse and zebrafish, suggesting a potential role for trans- changes in the evolution of gene expression.
In total, our results: (i) serve as a cautionary tale for studies investigating the role of human enhancers in different model organisms, and (ii) suggest that changes in the trans environment may play a significant role in the evolution of gene expression in vertebrates.
The idea that phenotypic evolution occurs mostly by changes in gene expression rather than in protein coding sequences has been gaining increasing acceptance [1–3]. However, how the expression of a particular gene evolves between lineages is less clear. Gene expression is controlled by sets of regulatory sequences (cis- regulatory elements) that are recognized by protein transcription factors (trans- factors) differentially expressed among cell types and developmental stages. The interplay between cis- elements and expressed trans- factors determines gene expression; thus, the evolution of gene expression can occur due to changes in either the cis- regulatory elements (hereafter referred to as “cis- changes”) or in the associated trans environment (“trans- changes”, which can be due to changes in the expression of the trans- factors themselves, or in their activity, specificity, etc.).
The relative contribution of cis and trans- changes to the evolution of transcriptional regulation is not well understood, in particular in the vertebrate lineage. Several studies have addressed this question comparing closely related lineages in other systems (e.g. yeasts [4–9] or Drosophilas [10, 11]). In general, these studies show that, at short evolutionary distances, cis-changes are the dominant force shaping gene transcriptional regulation. Although less information is available on vertebrate models, a similar conclusion was reached by Wilson and co-workers , demonstrating that for genes of the human chromosome 21, most of expression divergence between mice and human was associated to cis-changes. Nevertheless, the difficulty of experimental study in vertebrate models and the presence of much more complex cis-regulatory landscapes in these organisms make it currently impossible to confidently identify and compare the full cis-regulatory complement of most vertebrate genes in different species. A way to circumvent this problem has been to focus on the evolution of highly conserved non-coding elements (CNEs, [13–16]) associated to certain genes, for which orthology relationships between species can be confidently established. Functional assays of these CNEs showed that they often act as transcriptional enhancers, harboring a substantial part of the information on when, where and how much a gene must be transcriptionally active [14, 15, 17–19]. Thus, CNE comparisons allow a partial evaluation of the contribution of cis vs. trans changes between species. Along these lines, different groups have compared the behavior of a handful of orthologous CNEs near key developmental genes in their respective native hosts, often mouse and zebrafish, finding that only around a third of the orthologous enhancers may drive different developmental expression [20–24]. In these studies, however, it is not possible to tease apart the contribution of cis vs trans changes in these differences. In a recent study, Ritter and co-workers compared the expression driven by 13 human CNEs in mouse and zebrafish transgenic embryos, finding that 5/13 (39%) CNEs showed different expression in the two hosts , suggesting differences in the trans environment, consistent with previous results . However, the low number of reported cases makes it difficult to infer the real extent of trans effects in the evolution of gene expression.
Here, we have aimed to directly test the potential impact of trans- changes in the evolution of enhancer activity and of endogenous gene expression. As a proxy for potential differences in trans environments, we have compared the enhancer activity of 47 human enhancers in mouse and stable zebrafish transgenic embryos. We have found that the majority of the sequences tested (83%) show differences in reporter expression. Despite possible confounding experimental factors, many of these differences are likely to be caused by changes in the trans environment between the two distantly related vertebrate species. The frequency of such differences was high for both ancestral CNEs that are conserved in zebrafish and those that diverged beyond recognition in the teleost, representing CNEs with mild and severe cis changes, respectively. In addition, four-way comparisons of CNE-driven and associated endogenous gene expressions in zebrafish and mouse showed that the majority of observed changes in gene expression (8 out of 11 cases) are in agreement with a trans change, suggesting that trans-changes may play a role in driving these differences.
Widespread differences in enhancer activity between mouse and zebrafish
In order to assess the potential extent of evolutionary trans- changes between teleosts and mammals we compared the enhancer activity of 47 sequences in mouse and zebrafish. We used a subset of randomly selected human CNEs that were previously shown to have enhancer activity in mouse 11.5dpc (days post coitum) embryos . These exact same sequences were used to generate stable zebrafish transgenic lines driving GFP reporter expression (Figure 1), which produce much more robust results and avoid phenotypic variation typically associated with transient transgenic assays in zebrafish . GFP expression was monitored in F1 transgenic embryos, from 24 to 48 hours post-fertilization (hpf), a developmental range that includes the stage homologous to the mouse 11.5dpc stage (the phylotypic stage ). Consistent GFP expression from multiple founders for each CNE was then decomposed and annotated into major anatomical domains homologous to those used for mouse embryos. Although many of the CNEs tested have shown consistent expression between mouse and zebrafish for at least one anatomical domain (39 out of 47), many discrepancies of expression were also detected, resulting in a surprisingly high number of sequences that presented at least one major species-specific anatomical domain (39 out of 47; 83%, containing on average 2.4 expression domains in zebrafish and 2.3 in mouse per CNE; Additional file 1). Since the sequences probed were identical in both species, these divergences could result from the specific settings used for the mouse and zebrafish reporter assays (see discussion) and/or to differences between the transcription factor environments of developing mouse and zebrafish embryos, which may be interpreting the same cis- regulatory information differently. In the most extreme cases, 17/47 (36%, containing on average 2.2 expression domains in zebrafish and 1.9 in mouse per CNE) of these sequences showed dramatically different patterns (divergence of expression for 75% or more anatomical domains). One example of such global differences is the enhancer Hs608, which shows activity in dorsal root ganglia and spinal cord in mouse, but only forebrain expression in zebrafish (Figure 2A and B). In other cases the divergence was less extensive, often with few extra expression domains in mouse and/or zebrafish. For example the Hs278 enhancer drives expression to the hindbrain and spinal cord in transgenic mice (Figure 2C), whereas transgenic zebrafish have only spinal cord expression (Figure 2D). It should be noted, however, that some of these differences may also be due to the use of a different transgenesis system for each species (see Discussion). Finally, only 8/47 (17%, containing on average 2.3 expression domains in zebrafish and 2.3 in mouse) sequences showed fully consistent reporter expression patterns (Additional file 1). This is the case of the Hs123 enhancer, which shows a shared expression in mice and zebrafish in the forebrain (Figure 2E and F).
Differences in enhancer activity are frequent in ancestral CNEs with either severe or mild cis- changes
One interesting hypothesis is whether trans- changes may correlate with changes in the sequences to which they bind; in other words, they may correlate with cis- changes. This may happen, for instance, if an enhancer loses the binding site for a specific trans factor, relieving the selective pressure to keep this trans factor expressed in the corresponding tissue. The opposite may also be true: loss of a trans factor’s expression in a domain may relieve the selective constraint to preserve the corresponding binding sites in the enhancer sequence. To assess this possibility, we selected a group of phylogenetically comparable CNEs present in the gnathostome ancestor (i.e. conserved to the shark Callorhinchus milli, 37 aCNEs, see Methods) and divided them into two major groups, whether they have been conserved (20 CNEs) or diverged beyond recognition/lost (17 CNEs) in the lineage of zebrafish. Ancestral sequences that cannot be detected in zebrafish are expected to have evolved at a much faster rate or been fully deleted (severe cis- changes); in contrast, those CNEs that can be detected in fish are more likely to have experienced less evolutionary sequence change (mild cis- changes). Consistent with this idea, phyloP scores show a nearly 50% higher average rate of evolution within placental mammals for those aCNEs not detectable in zebrafish (0.159 vs. 0.107, p = 0.042, ANOVA test). Similarly, the ratio between positions with low phyloP score (score < −1, accelerated evolution) and high score (score > +1, positions under strong purifying selection) is 2.5 times higher for the CNEs absent in zebrafish (p < 0.0001, Chi-square test). Conversely, the average sequence conservation among mammals, as measured by PhastCons scores, is higher for those CNEs present in the zebrafish genome (0.699 vs. 0.794, p = 0.047). Finally, the alignment of the orthologous CNEs from human, mouse, chicken and Xenopus show higher average conservation for CNEs conserved in zebrafish (63.9% vs. 70.0%, p = 0.043).
However, despite their different evolutionary sequence history, analyses of each group separately showed that species-specific domains of enhancer activity are very frequent in both types of CNEs, with either severe or mild changes (82% and 90% respectively, p = 0.863; Additional file 2). Similarly, the fraction of CNEs with global reporter expression changes is similar (8/17 vs. 5/20), and 3/5 of the CNEs with identical expression patterns correspond to aCNEs with severe cis- changes. Since each CNE may drive reporter expression to one or more anatomical annotations, we have also quantified the percentage of the affected expression domains per CNE. Here, we also found a high proportion of affected domains in both groups, with a slightly higher (though not significantly) representation of species-specific domains in the non-conserved/lost group (60% vs 47%, p = 0.09, Fisher 1-tail test; Additional file 2). These results, although not providing evidences against a major role of cis-changes in the control of gene expression in evolution, suggest that the effect of the trans environment (and/or transgenesis system employed) may be important for CNE function, regardless of the extent of their cis-regulatory sequence divergence.
Differences in enhancer activity are associated with changes in gene expression
We have shown that many CNEs have divergent behaviors in mouse and zebrafish embryos, which could be due to possible differences in the associated trans environments in the two species in at least some instances. In this case, however, this would not imply that these putative trans- changes have played a role in the evolution of the associated endogenous gene expression patterns in the two species. To assess the potential evolutionary importance of the observed differences in enhancer activity, we next compared the expression patterns of the genes associated with the studied CNEs. We combined synteny and comparisons of expression patterns between gene and enhancer to confidently associate genes to 26 CNEs (see Methods). For these genes, expression data were generated or surveyed in available databases for both zebrafish and mouse at comparable time points. Then, we annotated the same anatomical terms as per the enhancer assays, which allowed 4-way comparisons: zebrafish endogenous gene expression (Zg), mouse endogenous gene expression (Mg), zebrafish transgenic reporter expression (Zt) and mouse transgenic reporter expression (Mt) (Additional file 3). Despite the large fraction of differences in enhancer activity, only 11/26 genes showed changes in major expression domains between the two species at the studied developmental stages (Table 1). Intriguingly, however, the CNEs associated to these 11 genes show a higher fraction of divergent expression domains in the reporter assays (an average of 67% of expression domains diverge between mouse and zebrafish) than the remaining 15 genes (average of 38%; p = 0.012, T-test; Additional file 3). Moreover, the majority (8/11, 73%) of the CNEs associated with the divergent genes show consistent changes in enhancer activity for at least one of the divergent anatomical terms (Table 1). One example of this is the enhancer Hs382, which drive reporter expression in dorsal root ganglia in mouse consistent with the associated gene, znf536 (DRG; Figure 3A and B), whereas both the reporter and the target gene are absent from this structure in zebrafish (Figure 3C and D). Therefore, these results suggest that some differences in enhancer activity between mouse and zebrafish are indeed mirroring changes in the endogenous gene expression patterns.
We have compared enhancer activity of 47 human conserved non-coding sequences between mouse and zebrafish. Surprisingly, we found that the vast majority of these sequences (39/47, 83%) show discrepancies in at least one expression domain between mouse and zebrafish, indicating a remarkably different behaviour of the same enhancer sequences when tested in these two distantly related vertebrate lineages. Since the sequences tested are the same in the two transgenesis systems, it is possible that differences in reporter expression patterns may correspond to differences in the cellular TF environment of each species. A fraction of these differences may also result from the use of different transgenesis systems in each species; however, as discussed below, these are not likely to account for the majority of cases (see below).
Differences in enhancer activity: trans-changes or experimental differences between transgenesis systems?
A major question raised by our results is whether the vast differences are mostly due to differences in trans environments between both species or to experimental differences between the two transgenesis systems, or a similar combination of both. The experimental differences between mouse and zebrafish include the use of different minimal promoters (hsp68 vs gata2a), reporter genes (LacZ vs GFP), transgenesis techniques (pronuclear injection vs Tol2 transposon) and endogenous characteristics associated to each animal model (opaque vs transparent embryos). To ensure the reproducibility of results we have performed transgenesis in zebrafish using the ZED vector  and analysed reporter expression in stable transgenic embryos (F1 generation). This vector is able to minimize the position effect usually associated to transposon mediated zebrafish transgenesis. In addition, this vector contains the gata2a minimal promoter, which has been has been shown to be able to read a wide range of enhancers in enhancer assays [19, 27, 30–40]. A similar performance for this minimal promoter was observed in a large-scale enhancer trap screen, being able to recapitulate the expression pattern of genes nearby the enhancer trap insertions with minimal noise . Similarly, the mouse hsp68 minimal promoter has been extensively used by many different research groups, for a broad range of tissue-specific enhancers [26, 42–50]. Tol2 transposon mediated transgenesis  is broadly used in zebrafish because it is very efficient, usually resulting in the integration of single to few copies of the reporter construct. This contrasts with the integration of multiple concatenated copies generated by the pronuclear injection of linearised DNA, widely applied in mouse transgenesis. Multiple copies of the reporter construct might reduce the position effect, but the consequent over-sensibility of the system might be a concern. The ability to control the enzymatic dependent revelation of the reporter gene (LacZ) in mouse reporter assays, together with the endogenous opacity of mice embryos might compensate this over-sensibility, making it a system possible to compare with the zebrafish reporter assay. Therefore, although, we cannot rule out that a fraction of the differences observed in this analysis are due to the individual features of the vectors, these are unlikely to account for the majority of the large number of differences in enhancer activity observed in our study (this would imply that most published data for enhancer activity in mouse and zebrafish is also not reliable). Furthermore, several observations from our results suggest that experimental considerations may indeed be behind only in a minority of cases: (i) A similar number of anatomical domains are detected in both enhancer activity assays (106 in mouse and 113 in zebrafish), thus “sensibility” of both assays is similar. (ii) Differences in enhancer activity due to ectopic expression in one of the species represent a comparable percentage of the total and are similarly distributed across anatomical domains (45 (54%) versus 38 (46%) cases in zebrafish and mouse, respectively; domain distribution in Additional file 4). (iii) Several differences in enhancer activity are coincident with changes in putative target genes.
Finally, several considerations were taken in order to minimize differences in enhancer activity that could be due to differences in the development and/or body plan of the two species: (i) Only wide expression in major conserved anatomical terms was considered (e.g. “hindbrain”, “spinal cord”, etc.; see Methods) to avoid cell type- and subdomain-specific differences. (ii) The comparisons were performed in homologous developmental stages corresponding to the vertebrate phylotypic stage, which shows the highest similarity between species during embryonic development, both morphologically and transcriptomically . (iii) Taking advantage of the in vivo nature of the zebrafish transgenic assays, we monitored GFP expression between 24 and 48hpf – which encompasses the most likely equivalent of the mouse 11.5dpc embryo – to minimize differences due to highly dynamic expression patterns. (It should be noted, however, that differences in the time of onset of expression of a particular TF in a given developmental domain is in itself a true trans change, with potentially important implications for the expression of the endogenous gene).
In summary, these considerations indicate that at least a significant fraction of the observed differences in reporter expression between mouse and zebrafish may correspond to changes in the trans environments (in particular those cases associated to changes in the endogenous gene expression), either by differences in the location and/or time of expression of trans factors in both species.
Discrepancies with previous studies in vertebrate lineages
Ritter and co workers estimated that ~39% of enhancers (5/13) may show differences in expression between mouse and zebrafish , about half of the frequency we report here. However, the main focus of Ritter and colleges’ work was the study of cis evolution, and therefore they investigated only a limited number of cases of trans evolution; indeed, Ritter et al’s and our results are not statistically different (p = 0.247, Chi-squared test). In addition, to avoid the high degree of mosaicism and variability associated with unstable transgenesis in zebrafish , we have performed the present study generating several stable transgenic lines for each of the studied CNEs. This approach allows more confident identification of expression domains, in particular in the case of very restricted domains or if the enhancer activity is mild or not very robust in that particular region. A clear example of such effect is observed in the Hs200 enhancer. Transient transgenic embryos always present a consistent strong expression in the forebrain (Additional file 5, and ), also observed in our four independent stable transgenic lines for this CNE. However, the stable lines showed additional – weaker, yet consistent – expression in the midbrain, hindbrain and spinal cord (Additional file 5). Therefore, our results, obtained using stable transgenic lines, are likely to be more sensitive than previous studies using transient transgenesis. In Additional file 6, we summarize other previous studies comparing enhancer activity in mouse and zebrafish. From these, 13/20 (65%) compared cases show divergent expression patterns between mouse and zebrafish, further supporting the existence of a large number of trans changes between both species [19, 21, 26, 27, 31, 43, 52–55].
Finally, it should be noted that ours and other studies in vertebrates systems differ in a substantial point with those performed in other model organisms such as yeast and Drosophila [4, 6–11]. In the latter, differences in gene expression were usually estimated as variation in relative quantitative expression, whereas in vertebrate systems only differences in spatiotemporal expression (i.e. anatomic domains at certain developmental stages) are assessed. For this reason, these studies complement each other investigating the outcome of different gene expression traits during evolution.
Changes in trans environments and the evolution of gene expression
In addition to a survey for differences in enhancer activity between zebrafish and mouse, we have observed that the genes that present changes in expression patterns are associated to enhancers that drive more divergent reporter expression between zebrafish and mouse. Indeed, eight of such cases of differences in enhancer activity readily correspond with the changes in the expression of the endogenous genes. It is therefore tempting to speculate that these differences in enhancer activity (and thus potentially trans- changes) are, at least in part, responsible for the evolutionary differences in the expression of these endogenous genes between the two studied vertebrate lineages, with potential biological consequences. Remarkably, despite the low number of cases, we find that most genes (73%) that show differences in expression patterns between fish and mammals have an associated difference in enhancer activity, suggesting that this may be a more widespread phenomenon than commonly assumed, at least for CNE-associated genes.
Another intriguing finding was the large proportion of genes that did not show differences in expression patterns in zebrafish and mouse despite their putative enhancers behave differently. The simplest explanation for this observation may be that these discrepancies in enhancer activity are due to the experimental differences discussed above. However, other biological explanations are also plausible. For instance, it is conceivable that there is a different need in the two species for other partially redundant or additive regulatory elements, not present in the tested CNE, to fully respond to the specific trans environment. In support of this idea, most trans- changes that were not associated with changes in gene expression were due to the lack of expression domains in the transgenic embryos for any of the species (14/15 in mouse and 13/16 in zebrafish, as determined by the 4-way comparisons, Additional file 3). Another, non-mutually exclusive, explanation may be the co-evolution between the enhancers and the trans environments in the different lineages, or cis-trans compensation. In line with this hypothesis, recent reports in Drosophila and mouse provide evidence that this compensation may be larger than generally considered [56, 57]. Consistently, Ritter and co workers found a better correlation for enhancer activity when testing zebrafish and human ortholog sequences in zebrafish and mice, respectively, than both orthologs in zebrafish .
Given the large number of differences in enhancer activity detected in this study, our results suggest that changes in trans environments may be more common in vertebrate evolution than previously anticipated, and that some of these changes may be associated with evolution of endogenous gene expression. Future research should focus on identifying the TFs responsible for trans- changes, and which evolutionary differences exist in those TFs (changes in gene expression caused by cis and/or trans mutations, variations in activity, specificity, etc) that are ultimately responsible for the evolution of gene expression. Finally, our results serve as a strong cautionary tale for studies investigating the regulatory function of human sequences in classical vertebrate model organisms, whether the observed variation in enhancer activity is due to specifics of each transgenesis technique, or to true biological differences in trans environments across large evolutionary time-scales.
Functional reporter analyses in zebrafish and comparison with mice reporters
Human CNEs, previously shown to have enhancer activity in mouse , were randomly selected, isolated from human genomic DNA and subcloned in a standard Gateway entry vector (pENTR/D-TOPO vector, Invitrogen). Sequences were transferred by recombination to a Gateway compatible reporter vector (the ZED vector) and zebrafish transgenesis was performed as described . Stable transgenic lines were isolated and GFP expression was assessed at 24 and 48 hours post fertilization (hpf) from embryos reared at 28°C and staged according to standard protocols . GFP was observed and documented using a stereomicroscope fitted for epifluorescence (SMZ 1500, Nikon) with a digital camera attached (F View II, Olympus). At least 3 independent stable transgenic lines were analyzed per CNE (with the exception of Hs799 for which only 2 lines were analyzed), and expression patterns were defined by presence of the reporter gene (GFP) in at least half of these lines (Additional files 7 and 8). Mice reporter data was extracted from the “VISTA enhancer browser” database (http://enhancer.lbl.gov/). Extra annotations were added when a tissue was considered positive in zebrafish and expression was observed in mice in half or more cases of the available data (Additional file 7). Comparison was only performed for anatomical regions present in both species. Five enhancers previously tested in zebrafish by Lee and coworkers (; Hs702, Hs901, Hs1043, Hs1114 and Hs1358) were included in the data set for mice and zebrafish comparison. Expression domains were compared by analyzing mouse expression at a single time point (11.5 days) and two corresponding developmental time points (24 and 48hpf) in zebrafish. This comparison is summarized in Additional files 1 and 2. The transgenic line Tg(−3.1neurog1:GFP)sb2, that shows expression in the dorsal root ganglia, has been previously reported [60, 61].
Sequence conservation analyses
Conservation of CNEs in the elephant shark C. milli was obtained from Lee et al. . To complement and update this analysis, all studied CNEs were blasted against the most up-to-date C. milli trace databases (both Sanger and 454 reads, from NCBI). Blastn hits with p < 10-5 were considered positive and incorporated to the set of ancestral CNEs (aCNEs).
Sequence conservation data for each human aCNE was extracted from the 46-way-placental PhyloP and PhastCons conservation tracks from UCSC table browser (, http://genome.ucsc.edu/cgi-bin/hgTables?command=start). PhyloP score provides a measure of the conservation or divergence of a particular alignment position (negative values indicate purifying selection, whereas high positive scores suggest accelerated evolution). PhastCons scores indicate the degree of conservation of a nucleotide position in a multi-species alignment (from fully conserved (=1) to not conserved (=0)). For both PhyloP and PhastCons we calculated the average score for each CNE and performed ANOVA tests between CNEs conserved and lost in zebrafish. In addition, we compared the proportion of highly conserved or diverged positions within each of the two groups using Chi-squared tests.
Identification of enhancers target genes and comparison of gene expression patterns
To associate the tested CNEs to their respective target genes, we first searched for genomic regulatory blocks (GRB) based on conserved synteny across vertebrate species (Additional file 9 A). Some of these GRBs were previously defined  and for the others we have used Synorth software (; http://synorth.genereg.net/). Then, one or few target genes from these GRBs containing the respective CNE were selected based on consistency of CNE and gene expression patterns in mice [26, 65–68] (Additional file 9B and C, E and F). For these cases, we searched for available in situ hybridization (ISH) and/or RT-PCR gene expression data (EMAGE ; Eurexpress ; Genepaint ; GXD ). Once we have confidently established the association between the CNE and the endogenous gene in mammals, we investigated the gene expression in the orthologous gene of zebrafish in available ISH database  (Additional file 9D and G; Additional file 10). For some genes with no available expression data (gsx2 (ENSDARG00000043322) and znf423 (ENSDARG00000059707)), we performed whole mount ISH in zebrafish embryos (Additional file 11). In the case of CNEs that have diverged beyond recognition/lost in zebrafish and for which there are two paralogs in this species, we discarded the element if the candidate genes had discrepant expression. For the set of confident gene-CNE associations in both species, we then annotated the presence or absence of gene expression in the homologous mouse and zebrafish anatomical domains as per CNEs. This way we were able to build a 4-way comparison for 26 CNEs with presence/absence of signal in transgenic mice and zebrafish and the endogenous mouse and zebrafish target genes (Additional file 3).
In addition, we found that, for four of the studied human CNEs, the target genes have no corresponding ortholog in zebrafish (Hs426-Pou3f4, Hs240 and Hs752-TLE1 and Hs312-TLE4). Consistently, none of these four CNEs could be identified in the genome of zebrafish or of any other available teleost, despite their presence in shark and tetrapod genomes. However, the associated genes could be identified in the other four teleost genomes, suggesting that these are zebrafish-specific gene losses. The most plausible evolutionary scenario, then, is that first the CNEs were lost in the lineage of teleost fish and then the gene orthologs were lost in zebrafish. Interestingly, when these regulatory modules are tested for enhancer activity in zebrafish they are still able to reproduce part of the mice transgenic expression patterns (Additional file 12).
Zebrafish whole-mount in situ hybridization
Antisense RNA probes were prepared from cDNAs using digoxigenin (Roche) as label. cDNAs were amplified by RT-PCR from total RNA extracted from different developmental stages of zebrafish embryos, treated with DNAse, and processed as described [20, 70]. Coding regions of genes were amplified using the following primers:
gsx2RV: 5′-CTTTGCCTTCTTTCTTGTGCTTAACGC - 3′
znf423FW: 5′-TGTGTGATTACTGTGAGGAAACATTC -3′
The amplification products were subcloned in pGEMT-Easy (Promega) and sequenced. Zebrafish specimens were prepared, hybridized and stained as described .
King MC, Wilson AC: Evolution at two levels in humans and chimpanzees. Science. 1975, 188: 107-116. 10.1126/science.1090005.
Halder G, Callaerts P, Gehring WJ: Induction of ectopic eyes by targeted expression of the eyeless gene in Drosophila. Science. 1995, 267: 1788-1792. 10.1126/science.7892602.
Carroll SB: Evo-devo and an expanding evolutionary synthesis: a genetic theory of morphological evolution. Cell. 2008, 134: 25-36. 10.1016/j.cell.2008.06.030.
Ronald J, Akey JM: The evolution of gene expression QTL in Saccharomyces cerevisiae. PLoS One. 2007, 2: e678-10.1371/journal.pone.0000678.
Ronald J, Brem RB, Whittle J, Kruglyak L: Local regulatory variation in Saccharomyces cerevisiae. PLoS Genet. 2005, 1: e25-10.1371/journal.pgen.0010025.
Wang D, Sung H-M, Wang T-Y, Huang C-J, Yang P, Chang T, Wang Y-C, Tseng D-L, Wu J-P, Lee T-C, Shih M-C, Li W-H: Expression evolution in yeast genes of single-input modules is mainly due to changes in trans-acting factors. Genome Res. 2007, 17: 1161-1169. 10.1101/gr.6328907.
Sung H-M, Wang T-Y, Wang D, Huang Y-S, Wu J-P, Tsai H-K, Tzeng J, Huang C-J, Lee Y-C, Yang P, Hsu J, Chang T, Cho C-Y, Weng L-C, Lee T-C, Chang T-H, Li W-H, Shih M-C: Roles of trans and cis variation in yeast intraspecies evolution of gene expression. Mol Biol Evol. 2009, 26: 2533-2538. 10.1093/molbev/msp171.
Tirosh I, Reikhav S, Levy AA, Barkai N: A yeast hybrid provides insight into the evolution of gene expression regulation. Science. 2009, 324: 659-662. 10.1126/science.1169766.
Emerson JJ, Hsieh L-C, Sung H-M, Wang T-Y, Huang C-J, Lu HH-S, Lu M-YJ WS-H, Li W-H: Natural selection on cis and trans regulation in yeasts. Genome Res. 2010, 20: 826-836. 10.1101/gr.101576.109.
Wittkopp PJ, Haerum BK, Clark AG: Evolutionary changes in cis and trans gene regulation. Nature. 2004, 430: 85-88. 10.1038/nature02698.
Wittkopp PJ, Haerum BK, Clark AG: Regulatory changes underlying expression differences within and between Drosophila species. Nat Genet. 2008, 40: 346-350. 10.1038/ng.77.
Wilson MD, Barbosa-Morais NL, Schmidt D, Conboy CM, Vanes L, Tybulewicz VLJ, Fisher EMC, Tavaré S, Odom DT: Species-specific transcription in mice carrying human chromosome 21. Science. 2008, 322: 434-438. 10.1126/science.1160930.
Sandelin A, Bailey P, Bruce S, Engström PG, Klos JM, Wasserman WW, Ericson J, Lenhard B: Arrays of ultraconserved non-coding regions span the loci of key developmental genes in vertebrate genomes. BMC Genomics. 2004, 5: 99-10.1186/1471-2164-5-99.
Shin JT, Priest JR, Ovcharenko I, Ronco A, Moore RK, Burns CG, MacRae CA: Human-zebrafish non-coding conserved elements act in vivo to regulate transcription. Nucleic Acids Res. 2005, 33: 5437-5445. 10.1093/nar/gki853.
Woolfe A, Goodson M, Goode DK, Snell P, McEwen GK, Vavouri T, Smith SF, North P, Callaway H, Kelly K, Walter K, Abnizova I, Gilks W, Edwards YJK, Cooke JE, Elgar G: Highly conserved non-coding sequences are associated with vertebrate development. PLoS Biol. 2005, 3: e7-10.1371/journal.pbio.0030007.
Venkatesh B, Kirkness EF, Loh Y-H, Halpern AL, Lee AP, Johnson J, Dandona N, Viswanathan LD, Tay A, Venter JC, Strausberg RL, Brenner S: Ancient noncoding elements conserved in the human genome. Science. 1892, 2006: 314-
Allende ML, Manzanares M, Tena JJ, Feijóo CG, Gómez-Skarmeta JL: Cracking the genome’s second code: enhancer detection by combined phylogenetic footprinting and transgenic fish and frog embryos. Methods. 2006, 39: 212-219. 10.1016/j.ymeth.2005.12.005.
Pennacchio LA, Ahituv N, Moses AM, Prabhakar S, Nobrega MA, Shoukry M, Minovitsky S, Dubchak I, Holt A, Lewis KD, Plajzer-Frick I, Akiyama J, De Val S, Afzal V, Black BL, Couronne O, Eisen MB, Visel A, Rubin EM: In vivo enhancer analysis of human conserved non-coding sequences. Nature. 2006, 444: 499-502. 10.1038/nature05295.
Navratilova P, Fredman D, Hawkins TA, Turner K, Lenhard B, Becker TS: Systematic human/zebrafish comparative identification of cis-regulatory activity around vertebrate developmental transcription factor genes. Dev Biol. 2009, 327: 526-540. 10.1016/j.ydbio.2008.10.044.
De la Calle-Mustienes E, Feijóo CG, Manzanares M, Tena JJ, Rodríguez-Seguel E, Letizia A, Allende ML, Gómez-Skarmeta JL: A functional survey of the enhancer activity of conserved non-coding sequences from vertebrate Iroquois cluster gene deserts. Genome Res. 2005, 15: 1061-1072. 10.1101/gr.4004805.
Antonellis A, Huynh JL, Lee-Lin S-Q, Vinton RM, Renaud G, Loftus SK, Elliot G, Wolfsberg TG, Green ED, McCallion AS, Pavan WJ: Identification of neural crest and glial enhancers at the mouse Sox10 locus through transgenesis in zebrafish. PLoS Genet. 2008, 4: e1000174-10.1371/journal.pgen.1000174.
Jarinova O, Hatch G, Poitras L, Prudhomme C, Grzyb M, Aubin J, Bérubé-Simard F-A, Jeannotte L, Ekker M: Functional resolution of duplicated hoxb5 genes in teleosts. Development. 2008, 135: 3543-3553. 10.1242/dev.025817.
Kleinjan DA, Bancewicz RM, Gautier P, Dahm R, Schonthaler HB, Damante G, Seawright A, Hever AM, Yeyati PL, Van Heyningen V, Coutinho P: Subfunctionalization of duplicated zebrafish pax6 genes by cis-regulatory divergence. PLoS Genet. 2008, 4: e29-10.1371/journal.pgen.0040029.
Strähle U, Rastegar S: Conserved non-coding sequences and transcriptional regulation. Brain Res Bull. 2008, 75: 225-230. 10.1016/j.brainresbull.2007.11.010.
Ritter DI, Li Q, Kostka D, Pollard KS, Guo S, Chuang JH: The importance of being cis: evolution of orthologous fish and mammalian enhancer activity. Mol Biol Evol. 2010, 27: 2322-2332. 10.1093/molbev/msq128.
Visel A, Minovitsky S, Dubchak I, Pennacchio LA: VISTA enhancer browser–a database of tissue-specific human enhancers. Nucleic Acids Res. 2007, 35: D88-D92. 10.1093/nar/gkl822.
Royo JL, Hidalgo C, Roncero Y, Seda MA, Akalin A, Lenhard B, Casares F, Gómez-Skarmeta JL: Dissecting the transcriptional regulatory properties of human chromosome 16 highly conserved non-coding regions. PLoS One. 2011, 6: e24824-10.1371/journal.pone.0024824.
Irie N, Kuratani S: Comparative transcriptome analysis reveals vertebrate phylotypic period during organogenesis. Nat Commun. 2011, 2: 248-
Bessa J, Tena JJ, De la Calle-Mustienes E, Fernández-Miñán A, Naranjo S, Fernández A, Montoliu L, Akalin A, Lenhard B, Casares F, Gómez-Skarmeta JL: Zebrafish enhancer detection (ZED) vector: a new tool to facilitate transgenesis and the functional analysis of cis-regulatory regions in zebrafish. Dev Dyn. 2009, 238: 2409-2417. 10.1002/dvdy.22051.
Navratilova P, Fredman D, Lenhard B, Becker TS: Regulatory divergence of the duplicated chromosomal loci sox11a/b by subpartitioning and sequence evolution of enhancers in zebrafish. Mol Genet Genomics. 2010, 283: 171-184. 10.1007/s00438-009-0503-1.
Komisarczuk AZ, Kawakami K, Becker TS: Cis-regulation and chromosomal rearrangement of the fgf8 locus after the teleost/tetrapod split. Dev Biol. 2009, 336: 301-312. 10.1016/j.ydbio.2009.09.029.
Naranjo S, Voesenek K, De la Calle-Mustienes E, Robert-Moreno A, Kokotas H, Grigoriadou M, Economides J, Van Camp G, Hilgert N, Moreno F, Alsina B, Petersen MB, Kremer H, Gómez-Skarmeta JL: Multiple enhancers located in a 1-Mb region upstream of POU3F4 promote expression during inner ear development and may be required for hearing. Hum Genet. 2010, 128: 411-419. 10.1007/s00439-010-0864-x.
Punnamoottil B, Herrmann C, Pascual-Anaya J, D’Aniello S, Garcia-Fernàndez J, Akalin A, Becker TS, Rinkwitz S: Cis-regulatory characterization of sequence conservation surrounding the Hox4 genes. Dev Biol. 2010, 340: 269-282. 10.1016/j.ydbio.2010.01.035.
Ragvin A, Moro E, Fredman D, Navratilova P, Drivenes Ø, Engström PG, Alonso ME, de La Calle Mustienes E, Skarmeta JLG, Tavares MJ, Casares F, Manzanares M, Van Heyningen V, Molven A, Njølstad PR, Argenton F, Lenhard B, Becker TS: Long-range gene regulation links genomic type 2 diabetes and obesity risk regions to HHEX, SOX4, and IRX3. Proc Natl Acad Sci U S A. 2010, 107: 775-780. 10.1073/pnas.0911591107.
Royo JL, Maeso I, Irimia M, Gao F, Peter IS, Lopes CS, D’Aniello S, Casares F, Davidson EH, Garcia-Fernández J, Gómez-Skarmeta JL: Transphyletic conservation of developmental regulatory state in animal evolution. Proc Natl Acad Sci U S A. 2011, 108: 14186-14191. 10.1073/pnas.1109037108.
Tena JJ, Alonso ME, De la Calle-Mustienes E, Splinter E, De Laat W, Manzanares M, Gómez-Skarmeta JL: An evolutionarily conserved three-dimensional structure in the vertebrate Irx clusters facilitates enhancer sharing and coregulation. Nat Commun. 2011, 2: 310-
Robert-moreno À, Naranjo S, De la Calle-Mustienes E, Gómez-Skarmeta JL, Alsina B: Characterization of New otic enhancers of the Pou3f4 gene reveal distinct signaling pathway regulation and spatio-temporal patterns. PLoS One. 5: e15907-
Maeso I, Irimia M, Tena JJ, González-Pérez E, Tran D, Ravi V, Venkatesh B, Campuzano S, Gómez-Skarmeta JL, Garcia-Fernàndez J: An ancient genomic regulatory block conserved across bilaterians and its dismantling in tetrapods by retrogene replacement. Genome Res. 2012, 22: 642-655. 10.1101/gr.132233.111.
Irimia M, Royo JL, Burguera D, Maeso I, Gómez-Skarmeta JL, Garcia-Fernandez J: Comparative genomics of the Hedgehog loci in chordates and the origins of Shh regulatory novelties. Sci Rep. 2012, 2: 433-
Irimia M, Tena JJ, Alexis MS, Fernandez-Miñan A, Maeso I, Bogdanovic O, De la Calle-Mustienes E, Roy SW, Gómez-Skarmeta JL, Fraser HB: Extensive conservation of ancient microsynteny across metazoans due to cis-regulatory constraints. Genome Res. 2012, 22: 2356-2367. 10.1101/gr.139725.112.
Ellingsen S, Laplante MA, König M, Kikuta H, Furmanek T, Hoivik EA, Becker TS: Large-scale enhancer detection in the zebrafish genome. Development. 2005, 132: 3799-3811. 10.1242/dev.01951.
Nóbrega MA, Zhu Y, Plajzer-Frick I, Afzal V, Rubin EM: Megabase deletions of gene deserts result in viable mice. Nature. 2004, 431: 988-993. 10.1038/nature03022.
Verma-Kurvari S, Savage T, Smith D, Johnson JE: Multiple elements regulate Mash1 expression in the developing CNS. Dev Biol. 1998, 197: 106-116. 10.1006/dbio.1998.8873.
Lee AP, Brenner S, Venkatesh B: Mouse transgenesis identifies conserved functional enhancers and cis-regulatory motif in the vertebrate LIM homeobox gene Lhx2 locus. PLoS One. 2011, 6: e20088-10.1371/journal.pone.0020088.
Potvin E, Beuret L, Cadrin-Girard J-F, Carter M, Roy S, Tremblay M, Charron J: Cooperative action of multiple cis-acting elements is required for N-myc expression in branchial arches: specific contribution of GATA3. Mol Cell Biol. 2010, 30: 5348-5363. 10.1128/MCB.00353-09.
Wasserman NF, Aneas I, Nobrega MA: An 8q24 gene desert variant associated with prostate cancer risk confers differential in vivo activity to a MYC enhancer. Genome Res. 2010, 20: 1191-1197. 10.1101/gr.105361.110.
Werner T, Hammer A, Wahlbuhl M, Bösl MR, Wegner M: Multiple conserved regulatory elements with overlapping functions determine Sox10 expression in mouse embryogenesis. Nucleic Acids Res. 2007, 35: 6526-6538. 10.1093/nar/gkm727.
May D, Blow MJ, Kaplan T, McCulley DJ, Jensen BC, Akiyama JA, Holt A, Plajzer-Frick I, Shoukry M, Wright C, Afzal V, Simpson PC, Rubin EM, Black BL, Bristow J, Pennacchio LA, Visel A: Large-scale discovery of enhancers from human heart tissue. Nat Genet. 2012, 44: 89-93.
Visel A, Akiyama JA, Shoukry M, Afzal V, Rubin EM, Pennacchio LA: Functional autonomy of distant-acting human enhancers. Genomics. 2009, 93: 509-513. 10.1016/j.ygeno.2009.02.002.
Visel A, Blow MJ, Li Z, Zhang T, Akiyama JA, Holt A, Plajzer-Frick I, Shoukry M, Wright C, Chen F, Afzal V, Ren B, Rubin EM, Pennacchio LA: ChIP-seq accurately predicts tissue-specific activity of enhancers. Nature. 2009, 457: 854-858. 10.1038/nature07730.
Kawakami K, Shima A, Kawakami N: Identification of a functional transposase of the Tol2 element, an Ac-like element from the Japanese medaka fish, and its transposition in the zebrafish germ lineage. Proc Natl Acad Sci U S A. 2000, 97: 11403-11408. 10.1073/pnas.97.21.11403.
Royo JL, Bessa J, Hidalgo C, Fernández-Miñán A, Tena JJ, Roncero Y, Gómez-Skarmeta JL, Casares F: Identification and analysis of conserved cis-regulatory regions of the MEIS1 gene. PLoS One. 2012, 7: e33617-10.1371/journal.pone.0033617.
Narlikar L, Sakabe NJ, Blanski AA, Arimura FE, Westlund JM, Nobrega MA, Ovcharenko I: Genome-wide discovery of human heart enhancers. Genome Res. 2010, 20: 381-392. 10.1101/gr.098657.109.
McGaughey DM, McCallion AS: Efficient discovery of ASCL1 regulatory sequences through transgene pooling. Genomics. 2010, 95: 363-369. 10.1016/j.ygeno.2010.02.011.
Beermann F, Kaloulis K, Hofmann D, Murisier F, Bucher P, Trumpp A: Identification of evolutionarily conserved regulatory elements in the mouse Fgf8 locus. Genesis. 2006, 44: 1-6. 10.1002/gene.20177.
Takahasi KR, Matsuo T, Takano-Shimizu-Kouno T: Two types of cis-trans compensation in the evolution of transcriptional regulation. Proc Natl Acad Sci U S A. 2011, 108: 15276-15281. 10.1073/pnas.1105814108.
Goncalves A, Leigh-Brown S, Thybert D, Stefflova K, Turro E, Flicek P, Brazma A, Odom DT, Marioni JC: Extensive compensatory cis-trans regulation in the evolution of mouse gene expression. Genome Res. 2012, 22: 2376-2384. 10.1101/gr.142281.112.
Kimmel CB, Ballard WW, Kimmel SR, Ullmann B, Schilling TF: Stages of embryonic development of the zebrafish. Dev Dyn. 1995, 203: 253-310. 10.1002/aja.1002030302.
Lee AP, Kerk SY, Tan YY, Brenner S, Venkatesh B: Ancient vertebrate conserved noncoding elements have been evolving rapidly in teleost fishes. Mol Biol Evol. 2011, 28: 1205-1215. 10.1093/molbev/msq304.
Sprague J, Bayraktaroglu L, Clements D, Conlin T, Fashena D, Frazer K, Haendel M, Howe DG, Mani P, Ramachandran S, Schaper K, Segerdell E, Song P, Sprunger B, Taylor S, Van Slyke CE, Westerfield M: The zebrafish information network: the zebrafish model organism database. Nucleic Acids Res. 2006, 34: D581-D585. 10.1093/nar/gkj086.
Blader P, Plessy C, Strähle U: Multiple regulatory elements with spatially and temporally distinct activities control neurogenin1 expression in primary neurons of the zebrafish embryo. Mech Dev. 2003, 120: 211-218. 10.1016/S0925-4773(02)00413-6.
Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, Weinstock GM, Wilson RK, Gibbs RA, Kent WJ, Miller W, Haussler D: Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 2005, 15: 1034-1050. 10.1101/gr.3715005.
Akalin A, Fredman D, Arner E, Dong X, Bryne JC, Suzuki H, Daub CO, Hayashizaki Y, Lenhard B: Transcriptional features of genomic regulatory blocks. Genome Biol. 2009, 10: R38-10.1186/gb-2009-10-4-r38.
Dong X, Fredman D, Lenhard B: Synorth: exploring the evolution of synteny and long-range regulatory interactions in vertebrate genomes. Genome Biol. 2009, 10: R86-10.1186/gb-2009-10-8-r86.
Finger JH, Smith CM, Hayamizu TF, McCright IJ, Eppig JT, Kadin JA, Richardson JE, Ringwald M: The mouse gene expression database (GXD): 2011 update. Nucleic Acids Res. 2011, 39: D835-D841. 10.1093/nar/gkq1132.
Richardson L, Venkataraman S, Stevenson P, Yang Y, Burton N, Rao J, Fisher M, Baldock RA, Davidson DR, Christiansen JH: EMAGE mouse embryo spatial gene expression database: 2010 update. Nucleic Acids Res. 2010, 38: D703-D709. 10.1093/nar/gkp763.
Diez-Roux G, Banfi S, Sultan M, Geffers L, Anand S, Rozado D, Magen A, Canidio E, Pagani M, Peluso I, Lin-Marq N, Koch M, Bilio M, Cantiello I, Verde R, De Masi C, Bianchi SA, Cicchini J, Perroud E, Mehmeti S, Dagand E, Schrinner S, Nürnberger A, Schmidt K, Metz K, Zwingmann C, Brieske N, Springer C, Hernandez AM, Herzog S, et al: A high-resolution anatomical atlas of the transcriptome in the mouse embryo. PLoS Biol. 2011, 9: e1000582-10.1371/journal.pbio.1000582.
Visel A, Thaller C, Eichele G: GenePaint.org: an atlas of gene expression patterns in the mouse embryo. Nucleic Acids Res. 2004, 32: D552-D556. 10.1093/nar/gkh029.
Bradford Y, Conlin T, Dunn N, Fashena D, Frazer K, Howe DG, Knight J, Mani P, Martin R, Moxon SAT, Paddock H, Pich C, Ramachandran S, Ruef BJ, Ruzicka L, Bauer Schaper H, Schaper K, Shao X, Singer A, Sprague J, Sprunger B, Van Slyke C, Westerfield M ZFIN: Enhancements and updates to the zebrafish model organism database. Nucleic Acids Res. 2011, 39: D822-D829. 10.1093/nar/gkq1077.
Gawantka V, Delius H, Hirschfeld K, Blumenstock C, Niehrs C: Antagonizing the Spemann organizer: role of the homeobox gene Xvent-1. EMBO J. 1995, 14: 6268-6279.
Tena JJ, Neto A, De la Calle-Mustienes E, Bras-Pereira C, Casares F, Gómez-Skarmeta JL: Odd-skipped genes encode repressors that control kidney development. Dev Biol. 2007, 301: 518-531. 10.1016/j.ydbio.2006.08.063.
We would like to thank F. Casares, for helpful discussions and comments on the manuscript. This study was supported by the Spanish and Andalusian Governments (JLGS grant numbers BFU2010-14839, CSD2007-00008 and Proyecto de Excelencia CVI-3488) and National Institute of Health (HBF grant number 1R21HG005240-01A1). AA is a FPI fellow and J.B. is a Juan de la Cierva postdoctoral fellow of the Consejo Superior de Investigaciones Cientificas.
The authors declare that they have no competing interests.
MI, JLGS and JB conceived and designed the experiments. AAC, MI and JB performed the experiments. AAC, MI, HBF and JB analysed the data. AV and LP contributed reagents and materials. HBF, AV and LP revised the paper. MI, JLGS and JB wrote the paper. All authors read and approved the final manuscript.