Skip to main content

Convergence and divergence in gene expression among natural populations exposed to pollution



Natural populations of the teleost fish Fundulus heteroclitus tolerate a broad range of environmental conditions including temperature, salinity, hypoxia and chemical pollutants. Strikingly, populations of Fundulus inhabit and have adapted to highly polluted Superfund sites that are contaminated with persistent toxic chemicals. These natural populations provide a foundation to discover critical gene pathways that have evolved in a complex natural environment in response to environmental stressors.


We used Fundulus cDNA arrays to compare metabolic gene expression patterns in the brains of individuals among nine populations: three independent, polluted Superfund populations and two genetically similar, reference populations for each Superfund population. We found that up to 17% of metabolic genes have evolved adaptive changes in gene expression in these Superfund populations. Among these genes, two (1.2%) show a conserved response among three polluted populations, suggesting common, independently evolved mechanisms for adaptation to environmental pollution in these natural populations.


Significant differences among individuals between polluted and reference populations, statistical analyses indicating shared adaptive changes among the Superfund populations, and lack of reduction in gene expression variation suggest that common mechanisms of adaptive resistance to anthropogenic pollutants have evolved independently in multiple Fundulus populations. Among three independent, Superfund populations, two genes have a common response indicating that high selective pressures may favor specific responses.


Many natural populations are continuously exposed to chemical stressors. One indication of this is that in 2004, over 4.24 billion pounds of industrial chemicals were disposed or released to the environment by facilities required to report to the EPA, including 576 million pounds of air emissions and 210 million pounds of surface water releases [1]. What are the biological consequences of chronic exposure to environmental pollution? We addressed this question using natural populations of the teleost Fundulus heteroclitus that inhabit and have adapted to highly polluted Superfund sites [25]. These populations are exposed to some of the highest concentrations of aromatic hydrocarbon pollutants of any vertebrate species [6]. Compelling evidence for adaptation in these populations is that embryos from the polluted sites are resistant to the toxic effects of the contaminated sediments while reference embryos are not [25]. Both first and second generation embryos from Superfund sites at New Bedford Harbor and Elizabeth River and first generation embryos from depurated Newark Bay fish are resistant, suggesting that differential survival is due to genetic adaptation rather than physiological induction. The central question we asked is, what differences in metabolic gene expression appear to be evolutionarily important within and among populations subject to anthropogenic stress?

To address this question we examined brain gene expression in individuals from three independent populations of Fundulus that inhabit and have adapted to highly polluted Superfund sites. The Superfund sites are highly contaminated with aromatic hydrocarbons, including polychlorinated biphenyls (PCBs), dibenzo-p-dioxins, and polycyclic aromatic hydrocarbons (PAHs) among other contaminants [79]. Gene expression was measured in field-caught fish raised for one year in common re-circulating tanks which removes physiological differences among populations induced by temperature, water quality, feeding, and other environmental factors [10]. Although many pollutants have long residual body-times, one year depuration should eliminate most pollutants (highly chlorinated PCB half-lives in trout are 70 days [11], dioxin half-life in Fundulus is <60 hours [12]). We assume that differences found after this common gardening are predominantly genetic rather than physiological.

We used a metabolic array since exposure to stress is likely to affect metabolic gene expression due to the additional energy needed to protect against harmful effects. Metabolic gene expression was examined in brains since brain tissue is metabolically active and may be highly susceptible to lipophilic pollutants: high energy demands, high lipid content, low antioxidant levels, reactive microglial cells, and inability to replicate neurons make it vulnerable to pollution and resulting oxidative stress [13, 14].

Multiple Fundulus populations that have independently evolved adaptive resistance to complex suites of pollutants provide independent contrasts for identifying genes involved in adaptation. We compared brain gene expression levels in common gardened Fundulus heteroclitus from three independent, resistant Superfund sites to individuals from six nearby reference populations. We report significant differences in gene expression that were unique to each polluted population or shared among two polluted populations as well as conserved responses in gene expression among all three Superfund populations.


Population genetics and experimental design

We found large amounts of significant variation in gene expression among individuals within each of nine populations (38–61% of genes differed among individuals within populations, p < 0.01). These results are similar to previous studies of Fundulus metabolic gene expression [1517]. Using a mixed model to test for differences between populations but without considering geographic distances between populations (relatedness) or whether a population is polluted, up to 25% of the genes have significant differences in expression among any two of the nine populations (p < 0.01). On average 12% of the genes differ between populations. Two reference populations (in CT and NJ) spanning a steep transition zone between northern and southern populations of Fundulus [18, 19] exhibit the most differences. Because most of the variation in gene expression among populations is a function of the variation within populations (r2: 0.63–0.83; see Additional file 1), these differences are consistent with a neutral or nearly-neutral model of evolution [20].

With the large number of differences in gene expression among individuals within a population, one should expect differences between genetically distinct populations. These differences are not necessarily due to evolution by natural selection since the variation in expression does not necessarily affect a biologically important phenotype. To distinguish biologically important differences, those evolving by natural selection, from among the many genes exhibiting significant differences within and among populations, we specifically tested whether gene expression in a Superfund population is statistically different from the two reference populations (Fig. 1, see Additional file 2). These three populations have similar genetic distances from each other such that the Superfund population and the two reference populations share similar numbers of neutral alleles and have similar differences in allele frequency [18]. Without the effect of natural selection, the combined variation in the two reference populations should be greater than the variation between the Superfund population and each reference population. In an analysis of variance (ANOVA), the variation between the polluted versus two reference populations will be significant if it is greater than the average variation within the polluted population and the combined reference populations. If the variation is significant, it exceeds the expectation of neutral evolutionary models [20] and thus is consistent with evolution by natural selection. Traits evolving by natural selection affect biologically important traits. Thus, this analysis of variance provides evidence for which gene expression changes are of functional significance.

Figure 1
figure 1

Sampling and Experimental Design. A) Map of sampling sites, USA, showing polluted Superfund Sites (diamonds, B, E and H) and reference sites (circles). Dashed triangles show the population comparisons illustrated in (B). Superfund sites have text labels. Site names, latitudes and longitudes are in Additional file 2. B) Loop design showing 5 individuals (1–5) from a polluted Superfund population (shaded diamond) hybridized with 5 individuals from each of two reference sites (circles). This experiment used 3 loops and 15 individuals per loop for a total of 45 individuals hybridized to 45 arrays. Each individual is labeled with both Cy3 and Cy5 dyes. Spots are replicated 4 times per array.

We also asked which gene expression levels were significantly different in all three polluted populations compared to all six reference populations. Since there is as much genetic distance among populations within these two groups (polluted or reference) as there is between groups (polluted versus reference), the variation in gene expression due to random drift is accounted for in the ANOVA. Thus, similar to the separate comparisons with the three Superfund populations, the genetic differences in gene expression most likely represent evolution by natural selection and in these comparisons, adaptation to chronic exposure to pollution.

Differences in brain gene expression between polluted and paired reference populations

A linear mixed model was used to test for differences between Superfund populations versus the two surrounding reference sites. We made three comparisons: each Superfund population versus two reference populations (Fig. 1). For the most northern Superfund population, New Bedford Harbor, MA, 37 genes are significantly differentially expressed compared to two reference populations (17%, p < 0.01). For the Superfund site in Newark Bay, NJ, 27 genes are differentially expressed (13%, p < 0.01). For the most southern Superfund site, Elizabeth River, VA, 13 genes are significantly different (5%, p < 0.01). These numbers are greater than the 2 genes expected by chance (i.e. 1% of 200 analyzed genes). After a Bonferroni correction for multiple comparisons, 9, 6 and 1 genes are significantly different in New Bedford Harbor, Newark Bay and Elizabeth River populations, respectively [see Additional file 3]. We did not find greater inter-individual variation in gene expression in the Elizabeth River and surrounding reference populations. Thus, the failure to detect as many differences in expression in the Elizabeth River population compared to the New Bedford Harbor and Newark Bay populations is not due to greater individual variation.

Common gene expression differences among all polluted populations

The Venn diagram summarizes significant genes (p = 0.01) for each comparison of a Superfund population to two reference populations (Fig. 2). The New Bedford Harbor population had 5 genes in common with the Newark Bay population and 6 genes in common with the Elizabeth River population. The Newark Bay population had 4 genes in common with the Elizabeth River population. Two genes are common to all 3 populations, a NADH-ubiquinone oxidoreductase AGGG subunit precursor (NDUB2 or NDUBF2), a subunit in oxidative phosphorylation complex I, and thioredoxin, a general protein disulfide oxidoreductase.

Figure 2
figure 2

Venn Diagram showing numbers of significant genes (p < 0.01) for each polluted versus reference sites comparison. Two genes are significantly different in all three polluted populations compared to their respective reference populations. NBH = New Bedford Harbor comparison, Newark = Newark Bay comparison, ER = Elizabeth River comparison.

We statistically compared brain gene expression patterns among individuals from all three polluted populations (New Bedford Harbor, Newark Bay, and Elizabeth River) to individuals from all six reference populations to determine if there were gene expression changes due to common pollution stress. Analyses indicate that 17% (27/170) of the genes were more similar among polluted populations and significantly different from reference populations at p = 0.01 (Fig. 3). Sixteen genes were more highly expressed and eleven genes were less expressed. After a Bonferroni correction, eleven genes remain significant.

Figure 3
figure 3

All polluted sites versus all reference sites. Heat map, p-values, and gene expression increased (red) and decreased (green) in a comparison of all polluted sites (P) versus all reference sites (R).

Table 1 shows the p-values associated with these 27 statistically significant genes between all polluted populations versus all reference populations contrasted with p-values from separate population comparisons of a polluted population versus two reference populations. NDUB2 is consistently differentially expressed at p < 0.01 in the four separate comparisons: the three separate comparisons of each Superfund site versus surrounding reference sites and the comparison of all polluted versus all reference populations. At p < 0.05, CYP1B1 also is consistently differentially expressed in all four comparisons.

Table 1 Genes and p-values for differentially regulated genes.


Metabolic gene expression differences in polluted populations

We used natural populations of Fundulus that inhabit and have adapted to highly polluted Superfund sites to identify gene expression changes involved in adaptation to pollution. Expression of 17% of the metabolic genes examined differed between the New Bedford Harbor, MA Superfund population compared to two reference populations, 12% differed in the Newark Bay, NJ comparison, and 5% differed in the Elizabeth River comparison. Genes contributing to phase I and phase II biotransformation of xenobiotics are likely candidates for adaptation at the physiological or genetic level in pollution-impacted populations. Transcripts for the phase I enzymes CYP2N2 and CYP1B1 were less expressed in the New Bedford Harbor population and CYP2N2 expression also was lower in the Newark Bay population. CYP1A induction is important in many toxic responses to dioxin-like compounds although it is refractory to induction in individuals from all of these polluted populations [2, 3, 21]. Not unexpectedly, its expression did not differ in these un-induced individuals.

Several gene expression changes are consistent with toxic effects from exposure to dioxin-like contaminants. For instance, expression of many genes involved in oxidative phosphorylation differed in the three polluted populations, as might be expected since dioxin affects the respiratory mitochondrial electron chain [22]. Additionally, in response to an increase in reactive oxygen species caused by leakage from the electron transport chain or directly via phase I and II metabolism, there is an increase in glutathione S-transferase A and glutathione peroxidase expression, as seen in other studies [23, 24]. The expression of superoxide dismutases (Cu-Zn and Mn), and glutathione S-transferases mu 3 and mu 5 did not differ. Interestingly, all three Superfund populations had differences in the expression of fatty acid binding protein genes which may be central regulators in inflammatory and metabolic signaling [25].

Common significant genes among polluted populations

An important question for adaptation in general is whether common solutions to environmental stress evolve. Recent transcriptional studies suggest multiple mechanisms to a biological end [26, 27]. In Drosophila, the number of evolved responses depends on selection strength: altered expression of different cytochrome P450s (CYP) develops in response to DDT in laboratory Drosophila, yet in resistant field isolates, over-expression of CYP6g1 predominates [28] (but see [29]). One approach to address this question is to discern significant changes in expression shared among Superfund populations (Fig. 2). Two genes are common in all 3 populations, the NADH-ubiquinone oxidoreductase AGGG subunit precursor (NDUB2), a subunit in oxidative phosphorylation complex I, and thioredoxin, a general protein disulfide oxidoreductase. However only NDUB2 has a consistent pattern with significantly more highly expressed in the Superfund populations. Thioredoxin expression was greater in one Superfund population but decreased in the other two, suggesting that the expression of this gene is evolutionarily labile.

A statistically more powerful test of a shared adaptive response in gene expression is to compare all three polluted populations to all six reference populations. In these analyses, expression levels of 17% of the genes examined were statistically significantly different in the polluted populations compared to reference populations. These differentially expressed genes suggest a shared response to pollution stress in the independent polluted populations. However, by comparing significant expression among polluted populations, many of these differences in gene expression appear to be driven by only one or two of the Superfund populations (Table 1) and might reflect the different pollutants at each site. Only NDUB2 is consistently differentially expressed at p < 0.01 in the three separate comparisons of each Superfund site versus surrounding reference sites and also in the comparison of all polluted versus all reference populations. Thus its altered expression is unlikely to be due to a type I error. If we are interested in defining the limited number of genes with a common response among all polluted populations, we are concerned with type II error (acceptance of a false null hypothesis) rather than type I error. To better control for type II error we use a less stringent significance level of p < 0.05 and stipulate that direction of expression is consistent (always up or down in polluted versus reference populations). At this significance level, CYP1B1 also is consistently differentially expressed in all four comparisons. Thus, in the comparisons of each superfund site versus its two reference sites and among the three polluted versus six reference populations comparison, two of the 170 genes (1.2%) appear to have shared derived changes in gene expression in response to anthropogenic pollution.

Evolutionarily important adaptive genes

We suggest that these two genes are of critical importance in stress response and are involved in resistance to pollution stress. For instance, NDUB2 is part of complex I in oxidative phosphorylation. Defects in this complex have been linked to Parkinson's disease and other genetic metabolic diseases, and modulation of complex 1 activity can generate oxidative stress [30, 31]. Although we analyzed 27/46 subunits of NADH-ubiquinone oxidoreductase, only NDUB2 was significantly more highly expressed in polluted populations, suggesting that its transcription may be limiting. This is surprising since transcriptional activators and coactivators such as nuclear respiratory factors NRF1 and 2 co-regulate many nuclear encoded, mitochondrial genes [32], and expression of NDUB2 was positively correlated with many, but not all, complex I subunits, suggesting a common transcriptional regulation. If there is a common transcriptional regulation for NDUB2, then one would expect many complex I genes to be significantly altered. Although we used 45 individuals, a greater sample size might have identified significant differences in expression for more complex I genes.

CYP1B1 has a physiological induced responsiveness to exposure to dioxin-like compounds [33] found in these Superfund sites. While CYP1B1 is usually induced [34], we found significantly lower expression levels. Since CYP1B1 activates chemically diverse procarcinogens [35], our analyses suggest that decreased expression may be a protective genetic response to chronic pollutant exposure. These two genes, NDUB2 and CYP1B1, are not significantly correlated (p > 0.2) suggesting that the variation in their expression is not due to a single trans-acting factor. The observation that consistent differences in these two genes have evolved independently in these three populations suggests that the changes in gene expression are evolutionarily important and thus functionally important.

If these Superfund populations were similar to breeding selection experiments, we would expect a decrease in variance compared to reference populations. Among individuals in Superfund populations, the average variation in gene expression is 2.3 ± 1.8 and in the six reference populations it is 2.2 ± 1.6, and the correlation of variances among genes is 97%. Statistically, there is a homogeneity of variance among these two groups. Why would the variances be similar? Pollution is a relatively recent event compared to the evolutionary history of this species. These polluted populations clearly have adaptive differences in survival and development that are maintained in the first and second generations [25]. Yet, we find no reduction in the variation in gene expression in polluted populations. Since most of the variation in gene expression is thought to be genetically based [15, 36, 37], maintenance of variation in gene expression most likely represents steady influx of alternative alleles by migration. Migration in this species, although empirically low, is enough to minimize genetic difference among populations in Chesapeake Bay [38], and among populations greater than 50 km apart in Georgia [18]. The maintenance of variation for gene expression in populations subjected to significant selection suggests that outbreeding and migration have mitigated the loss of genetic variation and allelic variation has been maintained. Consequently, the shared selective differences in gene expression are both biologically important and represent an advantageous and superior response among the many options.


Independent Fundulus populations provide the opportunity to discover adaptive responses to pollution evolved in a complex natural environment. In each of the Superfund populations 5 to 17% of genes are differentially expressed, yet only two genes (1.2%) have a consistent difference in expression among all polluted sites. Consistent significant differences in gene expression between polluted and reference populations, statistical analyses that indicate evolved adaptive changes, and lack of reduction in variation in gene expression suggest that multiple Fundulus populations have independently evolved common mechanisms of adaptive resistance to complex suites of pollutants. Integrating an evolutionary population genetics approach with an environmental toxicology study allowed us to differentiate genes that vary due to chronic pollutant exposure from those that vary due to random genetic drift. The shared response of two genes among these three independent populations supports the idea that high selective pressures may favor similar responses.


Field fish collection, care and sampling

Fundulus heteroclitus were collected by minnow trap from three Superfund Sites (New Bedford Harbor, MA; Newark Bay, NJ; Elizabeth River, VA), and six reference populations where pairs of these reference sites were located north and south of each polluted site (Sandwich, MA and Pt. Judith, RI for New Bedford Harbor, MA; Tuckerton NJ and Clinton, CT for Newark, NJ: and Magnatha, VA and Manteo, NC for Elizabeth River, VA; Fig. 1). Fish were kept in a common re-circulating aquarium system with controlled temperature and salinity of 20°C, 15 ppt salinity for one year before experiments. Effluent from tanks was passed through an activated charcoal filter and 20% water changes were performed weekly. Tanks were cleaned and fish checked for health status on a daily basis. Fish were fed once daily a 33% mixture of brine shrimp flake, blood meal flake and Spirulina flake (FOD, Aquatic Ecosystems). Prior to sampling, fish were subjected to pseudo-winter (6:18, light: dark cycle) for four to six weeks, then maintained for a minimum of 6 weeks with a light cycle of 16:8, light:dark. After the pseudo-winter, Fundulus came into reproductive condition and spawned, and reproductive tissues regressed. The reproductive tissues were in regression when fish were assayed.

Populations and individuals within a population were chosen randomly for sacrifice and sampled in the morning and early afternoon for two consecutive days in order to minimize physiological changes due to diurnal cycles. Five fish from each of 9 populations were sacrificed by cervical dislocation. Fish were mixed sex and ranged in weight, with an average weight of 7.28 ± 0.33 g. Neither sex nor weight were significant variables in the mixed models discussed below. Dissected organs, including brain, for subsequent RNA extraction were immediately frozen in a dry ice/ethanol bath and stored at -80°C. This experiment was performed according to an approved Institutional Animal Care and Use Committee at North Carolina State University.

RNA extraction, amplification, hybridization, scanning

Total RNA was isolated from each brain using a guanidinium thiocyanate buffer [39] followed by purification using the Qiagen RNeasy Mini kit (Qiagen Inc., Valencia, CA, USA) according to the manufacturer's protocols. Purified RNA was quantified spectrophotometrically, and RNA quality was assessed by gel electrophoresis. RNA for hybridization was prepared by one round of amplification (aRNA) using Ambion Amino Allyl MessageAmp aRNA Kit to form copy template RNA by T7 amplification. Amino-allyl UTP was incorporated into targets during T7 transcription, and resulting amino-allyl aRNA was coupled to Cy3 and Cy5 dyes (Amersham Biosciences, Piscataway, NJ, USA).

Labeled aRNA samples (5 pmol/ul) were hybridized to slides in 12 ul of hybridization buffer (50% formamide buffer, 5× SSPE, 1% sodium dodecyl sulfate, 0.2 mg/ml bovine serum albumin, 1 mg/ml denatured salmon sperm DNA (Sigma), and 1 mg/ml RNAse free poly(A) RNA (Sigma) for 42.5 hours at 42°C. Slides were prepared by blocking according to the manufacturer's recommendations with an additional treatment of 66 mM sodium borohydride to minimize background autofluorescence [40]. After hybridization, non-specifically bound probe was washed off with SSC and the slides were spun dry and scanned using a ScanArray Express 4000 (Perkin Elmer). Scanner settings used for analysis were 90% laser power for Cy3, 80% laser power for Cy5 and photomultiplier tube (PMT) setting at 70%. Resulting 16 bit Tiff Images were quantified using IconoClust® (CLONDIAG, Jena, Germany) spotfinding software.

Metabolic arrays

Amplified cDNA sequences for 384 metabolic genes from Fundulus heteroclitus heart and liver libraries were spotted onto CodeLink activated slides (GE Healthcare, Piscataway, NJ) at the University of Miami core microarray facility (slides were the kind gift of D. Crawford). Each slide contained 4 spatially separated arrays, and each array contained 4 replicates of 384 spots (genes) including controls. Printed cDNAs encode essential proteins for cellular metabolism based on KEGG (Kyoto Encyclopedia of Genes and Genomes). Sequence information, annotation and gene ontology are available for Fundulus on the FunnyBase website [41].

Of these 384 genes, not all genes were analyzed. Unanalyzed spots included negative controls (random genomic amplification or Ctenophore specific cDNAs) and genes that either saturated the photomultiplier tube or had signals less than the negative controls. The number of genes examined were 214, 216, and 248 for the New Bedford Harbor, Newark Bay, and Elizabeth River comparisons, respectively, and the analysis combining all locations (9 populations) used 170 common genes.

Experimental design

A loop design was used for the microarray hybridizations where each sample is hybridized to 2 arrays using both Cy3 and Cy5 labeled fluorophores [42]. In this experiment, each loop consisted of Cy3 and Cy5 labeled brain aRNA from 5 individuals from a polluted site (P) hybridized to Cy3 and Cy5 labeled brain aRNA from 5 individuals from each of 2 adjacent reference sites (R1 and R2), for a total among the 3 loops of 45 individuals hybridized to 45 microarrays. Each array had different combinations of individuals, and each loop formed was P1→ R1.1→ R2.1→ P2 → R1.2-→ R2.2→ P3→ R1.3→ R2.3→ P4→ R1.4→ R2.4→ P5 → R1.5→ R2.5→ P1 where each arrow represents a separate hybridization (array) with the individual at the base of the arrow labeled with Cy3 and the individual at the head of the arrow labeled with Cy5 (Fig. 1).

Statistical analysis

Log2 measures of gene expression were normalized using a linear mixed model in SAS (JMP v6.0.0 with a microarray platform beta-version in SAS v9.1.3) to remove the effects of dye (fixed effect) and array (random effect) following a joint regional and spatial Lowess transformation in MAANOVA Version 0.98.8 for R to account for both intensity and spatial bias [43]. The model was of the form yij = μ + Ai + Dj + (AxD)ij + εij, where, yij is the signal from the ith array with dye j, μ is the sample mean, Ai and Dj are the overall variation in arrays (1–15) and dyes (Cy3 and Cy5), (AxD)ij is the array × dye interaction and εij is the stochastic error [44, 45]. Residuals from this model were used for gene-by-gene analyses (below). For all loops, the coefficients of variation (CV ± StdErr) averaged across all genes were relatively low: 2.5% ± 0.28%, 4.75% ± 0.52%, 3.0 % ± 0.50% for New Bedford Harbor, Newark Bay, and Elizabeth River comparisons, respectively.

Normalized data were modeled by residual maximum likelihood on a gene-by-gene basis using a linear mixed model in SAS using PROC MIXED. To test for a treatment effect (effect of chronic exposure to pollution), normalized data (residuals from the mixed model normalization) were modeled using treatment, population nested in treatment and dye as fixed effects and array and spot nested in array as random effects according to the model rijkgm = μ + Ai + Dj + Tk + P(T)gk + S(A)mi + εijkgm, where Tk is the kth treatment (polluted or reference), P(T)gk is the gth population nested in treatment, and S(A)mi is the mth spot nested in array. To compare all nine sites (3 polluted and 6 reference populations hybridized across 3 separate loops), loop (Ll), dye by loop interactions, and array nested in loop terms were added in the mixed model normalization according to the model yijl = μ + Ai + Dj + (AxD)ij + Ll + (DxL)jl + A(L)il + εijl. To compare all nine populations, we used population as a fixed effect in the gene-by-gene model according to the model rijgm = μ + Ai + Dj + Pg+ S(A)mi + εijgm. To test for a treatment effect using all nine populations, loop, treatment × loop interactions, and population nested in loop × treatment interactions were incorporated as fixed effects into the gene-by-gene model according to the model rijklgm = μ + Ai + Dj + Tk + Ll + (LxT)lk+ P(TxL)gkl + S(A)mi + εijklgm. To examine individual variation among locations, the mixed model gene-by-gene analysis used individual nested in population, population and dye as fixed effects according to the model rijghm = μ + Ai + Dj + Pg + I(P)gh + S(A)mi + εijghm.

Mixed model analyses were performed for each loop or combined loop analysis, and we used a nominal p-value cut-off for significant genes of p < 0.01. Using this p-value reveals more genes that may differentially expressed but risks identifying genes that may be false positives. Genes identified as also being significant after false positives are reduced by a more stringent multiple comparison correction were analyzed using a Bonferroni correction (p = 0.05). Homogeneity of variance was assessed among all polluted versus reference populations using analysis of variance (ANOVA) and Brown-Forsythe's test. We also used Brown-Forsythe's test for homogeneity of variance among all populations. The heat map of differentially expressed genes using all nine populations was visualized using the Macintosh version [46] of TreeView [47]. The correlation matrix was performed using Pearson product-moment correlation in SAS.



polychlorinated biphenyls


polycyclic aromatic hydrocarbons


NADH-ubiquinone oxidoreductase AGGG subunit precursor


cytochrome P450 1B1 (CYP1B1), cytochrome P450 2N2


cytochrome P450 1A


  1. U.S. Environmental Protection Agency. []

  2. Elskus AA, Monosson E, McElroy AE, Stegeman JJ, Woltering DS: Altered CYP1A expression in Fundulus heteroclitus adults and larvae: a sign of pollutant resistance?. Aquatic Toxicology. 1999, 45 (2-3): 99-113. 10.1016/S0166-445X(98)00102-7.

    Article  CAS  Google Scholar 

  3. Meyer J DGR: Patterns of heritability of decreased EROD activity and resistance to PCB 126-induced teratogenesis in laboratory-reared offspring of killifish (Fundulus heteroclitus) from a creosote-contaminated site in the Elizabeth River, VA, USA. Mar Environ Res. 2002, 54 (3-5): 621-626. 10.1016/S0141-1136(02)00170-8.

    Article  PubMed  Google Scholar 

  4. Nacci D, Coiro L, Champlin D, Jayaraman S, McKinney R, Gleason T, Munns J, Specker JL, Cooper K: (Adaptation of wild fish populations to persistent environmental contaminants. Marine Biology. 1999, 134: 9-18. 10.1007/s002270050520.

    Article  Google Scholar 

  5. Ownby DR, Newman MC, Mulvey M, Vogelbein WK, Unger MA, Arzayus LF: Fish (Fndulus hereroclitus) populations with different exposure histories differ in tolerance of creosote-contaminated sediments. Environmental Toxicology and Chemistry. 2002, 21 (9): 1897-1902. 10.1897/1551-5028(2002)021<1897:FFHPWD>2.0.CO;2.

    Article  CAS  PubMed  Google Scholar 

  6. Wirgin I, Waldman JR: Resistance to contaminants in North American fish populations. Mutat Res. 2004, 552 (1-2): 73-100.

    Article  CAS  PubMed  Google Scholar 

  7. Crawford DW, Bonnevie NL, Wenning RJ: Sources of pollution and sediment contamination in Newark Bay, New Jersey. Ecotoxicol Environ Saf. 1995, 30 (1): 85-100. 10.1006/eesa.1995.1010.

    Article  CAS  PubMed  Google Scholar 

  8. Bergen BJ, Nelson WG, Mackay J, Dickerson D, Jayaraman S: ENVIRONMENTAL MONITORING OF REMEDIAL DREDGING AT THE NEW BEDFORD HARBOR, MA, SUPERFUND SITE. Environmental Monitoring and Assessment. 2005, 111: 257-275. 10.1007/s10661-005-8223-4.

    Article  CAS  PubMed  Google Scholar 

  9. Walker SE, Dickhut RM, Chisholm-Brause C: Polycyclic aromatic hydrocarbons in a highly industrialized urban estuary: inventories and trends. Environmental Toxicology and Chemistry. 2004, 23 (11): 2655-2664. 10.1897/03-628.

    Article  CAS  PubMed  Google Scholar 

  10. Pierce VA, Crawford DL: Phylogenetic analysis of thermal acclimation of the glycolytic enzymes in the genus Fundulus. Physiol Zool. 1997, 70 (6): 597-609.

    Article  CAS  PubMed  Google Scholar 

  11. Fisk TA, Norstrom RJ, Cymbalisty CD, Muir DCG: Dietary accumulation and depuration of hydrophobic organochlorines: bioaccumulation parameter and their relationship with the octanol/water partition coeffient. Env Tox and Chem. 1998, 17 (5): 951-961. 10.1897/1551-5028(1998)017<0951:DAADOH>2.3.CO;2.

    Article  CAS  Google Scholar 

  12. Prince R, Cooper KR: Comparisons of the the effects of 2,3,7,8-tetrachlorodibenzo-p-dioxin on chemically impacted and nonimpacted subpopulations of Fundulus heteroclitus: I. TCDD Toxicity. Environ Toxicol Chem. 1995, 14 (4): 579-587.

    Article  CAS  Google Scholar 

  13. Anthony DC, Montine TJ, Graham DG: Toxic responses of the nervous system. Casarett and Doull's toxicology the basic science of poisons. Edited by: Klaasen CD. 1996, McGraw-Hill, 463-486. 5

    Google Scholar 

  14. Calderon-Garciduenas L RW Maronpot RR, Henriquez-Roldan C, Delgado-Chavez R, Calderon-Garciduenas A, Dragustinovis I, Franco-Lira M, Aragon-Flores M, Solt AC, Altenburg M, Torres-Jardon R, Swenberg JA.: Brain inflammation and Alzheimer's-like pathology in individuals exposed to severe air pollution. Toxicol Pathol. 2004, 32 (6): 650-658. 10.1080/01926230490520232.

    Article  PubMed  Google Scholar 

  15. Whitehead A, Crawford DL: Neutral and adaptive variation in gene expression. Proc Natl Acad Sci U S A. 2006, 103 (14): 5425-5430. 10.1073/pnas.0507648103.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  16. Whitehead A, Crawford DL: Variation within and among species in gene expression: raw material for evolution. Mol Ecol. 2006, 15 (5): 1197-1211. 10.1111/j.1365-294X.2006.02868.x.

    Article  CAS  PubMed  Google Scholar 

  17. Whitehead A, Crawford DL: Variation in tissue-specific gene expression among natural populations. Genome Biol. 2005, 6 (2): R13-10.1186/gb-2005-6-2-r13.

    Article  PubMed Central  PubMed  Google Scholar 

  18. Adams SM, Lindmeier JB, Duvernell DD: Microsatellite analysis of the phylogeography, Pleistocene history and secondary contact hypotheses for the killifish, Fundulus heteroclitus. Mol Ecol. 2006, 15 (4): 1109-1123.

    Article  CAS  PubMed  Google Scholar 

  19. Powers DA, Lauerman T, Crawford D, DiMichele L: Genetic mechanisms for adapting to a changing environment. Annu Rev Genet. 1991, 25: 629-659. 10.1146/

    Article  CAS  PubMed  Google Scholar 

  20. Nei M: Molecular Evolutionary Genetics. 1987, New York, NY , Columbia University Press

    Google Scholar 

  21. Bello SM, Franks DG, Stegeman JJ, Hahn ME: Acquired Resistance to Ah Receptor Agonists in a Population of Atlantic Killifish (Fundulus heteroclitus) Inhabiting a Marine Superfund Site: In Vivo and in Vitro Studies on the Inducibility of Xenobiotic Metabolizing Enzymes. Toxicol Sci. 2001, 60 (1): 77-91. 10.1093/toxsci/60.1.77.

    Article  CAS  PubMed  Google Scholar 

  22. Senft AP, Dalton TP, Nebert DW, Genter MB, Hutchinson RJ, Shertzer HG: Dioxin increases reactive oxygen production in mouse liver mitochondria. Toxicol Appl Pharmacol. 2002, 178 (1): 15-21. 10.1006/taap.2001.9314.

    Article  CAS  PubMed  Google Scholar 

  23. Pham RT, Barber DS, Gallagher EP: GSTA is a major glutathione S-transferase gene responsible for 4-hydroxynonenal conjugation in largemouth bass liver. Mar Environ Res. 2004, 58 (2-5): 485-488. 10.1016/j.marenvres.2004.03.033.

    Article  CAS  PubMed  Google Scholar 

  24. Orbea A, Cajaraville MP: Peroxisome proliferation and antioxidant enzymes in transplanted mussels of four basque estuaries with different levels of polycyclic aromatic hydrocarbon and polychlorinated biphenyl pollution. Environmental Toxicology and Chemistry. 2006, 25 (6): 1616-1626. 10.1897/04-520R2.1.

    Article  CAS  PubMed  Google Scholar 

  25. Makowski L, Hotamisligil GS: Fatty acid binding proteins--the evolutionary crossroads of inflammatory and metabolic responses. J Nutr. 2004, 134 (9): 2464S-2468S.

    CAS  PubMed Central  PubMed  Google Scholar 

  26. Ghazalpour A, Doss S, Sheth SS, Ingram-Drake LA, Schadt EE, Lusis AJ, Drake TA: Genomic analysis of metabolic pathway gene expression in mice. Genome Biol. 2005, 6 (7): R59-10.1186/gb-2005-6-7-r59.

    Article  PubMed Central  PubMed  Google Scholar 

  27. Oleksiak MF, Roach JL, Crawford DL: Natural variation in cardiac metabolism and gene expression in Fundulus heteroclitus. Nature Genet. 2005, 37: 67-72.

    CAS  PubMed Central  PubMed  Google Scholar 

  28. Le Goff G, Boundy S, Daborn PJ, Yen JL, Sofer L, Lind R, Sabourault C, Madi-Ravazzi L, ffrench-Constant RH: Microarray analysis of cytochrome P450 mediated insecticide resistance in Drosophila. Insect Biochem Mol Biol. 2003, 33 (7): 701-708. 10.1016/S0965-1748(03)00064-X.

    Article  CAS  PubMed  Google Scholar 

  29. Pedra JH, McIntyre LM, Scharf ME, Pittendrigh BR: Genome-wide transcription profile of field- and laboratory-selected dichlorodiphenyltrichloroethane (DDT)-resistant Drosophila. Proc Natl Acad Sci U S A. 2004, 101 (18): 7034-7039. 10.1073/pnas.0400580101.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  30. Perier C, Tieu K, Guegan C, Caspersen C, Jackson-Lewis V, Carelli V, Martinuzzi A, Hirano M, Przedborski S, Vila M: Complex I deficiency primes Bax-dependent neuronal apoptosis through mitochondrial oxidative damage. Proc Natl Acad Sci U S A. 2005, 102 (52): 19126-19131. 10.1073/pnas.0508215102.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  31. Shoffner JM: Oxidative phosphorylation disease diagnosis. Ann N Y Acad Sci. 1999, 893: 42-60. 10.1111/j.1749-6632.1999.tb07817.x.

    Article  CAS  PubMed  Google Scholar 

  32. Scarpulla RC: Nuclear activators and coactivators in mammalian mitochondrial biogenesis. Biochim Biophys Acta. 2002, 1576 (1-2): 1-14.

    Article  CAS  PubMed  Google Scholar 

  33. Alexander DL, Eltom SE, Jefcoate CR: Ah receptor regulation of CYP1B1 expression in primary mouse embryo-derived cells. Cancer Res. 1997, 57 (20): 4498-4506.

    CAS  PubMed  Google Scholar 

  34. El-kady MA, Mitsuo R, Kaminishi Y, Itakura T: Isolation of cDNA of novel cytochrome P450 1B gene, CYP1B2, from Carp (Cyprinus carpio) and its induced expression in gills. Environ Sci. 2004, 11 (6): 345-354.

    CAS  PubMed  Google Scholar 

  35. Shimada T, Hayes CL, Yamazaki H, Amin S, Hecht SS, Guengerich FP, Sutter TR: Activation of chemically diverse procarcinogens by human cytochrome P-450 1B1. Cancer Res. 1996, 56 (13): 2979-2984.

    CAS  PubMed  Google Scholar 

  36. Gibson G, Weir B: The quantitative genetics of transcription. Trends Genet. 2005, 21 (11): 616-623. 10.1016/j.tig.2005.08.010.

    Article  CAS  PubMed  Google Scholar 

  37. Nuzhdin SV, Wayne ML, Harmon KL, McIntyre LM: Common pattern of evolution of gene expression level and protein sequence in Drosophila. Mol Biol Evol. 2004, 21 (7): 1308-1317. 10.1093/molbev/msh128.

    Article  CAS  PubMed  Google Scholar 

  38. Brown BL, W. CR: Gene Flow and Mitochondrial Dna Variation in the Killifish Fundulus-Heteroclitus. Evolution. 1991, 45 (5): 1147-1161. 10.2307/2409722.

    Article  Google Scholar 

  39. Chomczynski P, Sacchi N: Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal Biochem. 1987, 162 (1): 155-159. 10.1016/0003-2697(87)90021-2.

    Article  Google Scholar 

  40. Raghavachari N, Bao YP, Li G, Xie X, Muller UR: Reduction of autofluorescence on DNA microarrays and slide surfaces by treatment with sodium borohydride. Anal Biochem. 2003, 312: 101 -1105. 10.1016/S0003-2697(02)00440-2.

    Article  CAS  PubMed  Google Scholar 

  41. FunnyBase Expressed Gene Database. []

  42. Kerr MK, Churchill GA: Experimental design for gene expression microarrays. Biostatistics. 2001, 2 (2): 183-201. 10.1093/biostatistics/2.2.183.

    Article  PubMed  Google Scholar 

  43. Wu H, Kerr K, Cui X, Churchill GA: MAANOVA: a software package for the analysis of spotted cDNA microarray experiments. The Analysis of Gene Expression Data: Methods and Software. 2003, New York: Springer

    Google Scholar 

  44. Jin W, Riley RM, Wolfinger RD, White KP, Passador-Gurgel G, Gibson G: The contributions of sex, genotype and age to transcriptional variance in Drosophila melanogaster. Nature Genetics. 2001, 29 (4): 355-366. 10.1038/ng766.

    Article  Google Scholar 

  45. Wolfinger RD, Gibson G, Wolfinger ED, Bennett L, Hamadeh H, Bushel P, Afshari C, Paules RS: Assessing gene significance from cDNA microarray expression data via mixed models. J Comput Biol. 2001, 8 (6): 625-637. 10.1089/106652701753307520.

    Article  CAS  PubMed  Google Scholar 

  46. de Hoon MJ, Imoto S, Nolan J, Miyano S: Open source clustering software. Bioinformatics. 2004, 20 (9): 1453-1454. 10.1093/bioinformatics/bth078.

    Article  CAS  PubMed  Google Scholar 

  47. Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Sciences of the United States of America. 1998, 95 (25): 14863-14868. 10.1073/pnas.95.25.14863.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

Download references


Partial funding for this work was received from NIH 5 RO1 ES011588. The Authors thank Douglas L. Crawford for help with production of the metabolic arrays, Douglas L. Crawford and Charles F. Ide for review of the manuscript, the Genome Research Lab at North Carolina State University, Stan Martin for assistance with the Beta version of JMP 6.0.0, and Dr. Cavell Brownie for statistical support.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Marla A Fisher.

Additional information

Authors' contributions

MAF and MFO designed research; MAF performed research; MAF and MFO analyzed data; and MAF and MFO wrote the paper.

Electronic supplementary material


Additional File 1: Variation among and within populations. Additional file 1 shows the relationship between variation in gene expression among and within populations. (PPT 84 KB)


Additional File 2: Sites, latitude and longitude in this study. Additional file 2 provides full references for the sites depicted in Figure 1, including names, latitudes and longitudes. (DOC 32 KB)


Additional File 3: Significant Differences in Expression. Additional file 3 lists genes and p-values that are significantly different in each Superfund versu s respective reference populations. (DOC 99 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Fisher, M.A., Oleksiak, M.F. Convergence and divergence in gene expression among natural populations exposed to pollution. BMC Genomics 8, 108 (2007).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: