Genome-scale transcriptional analyses of first-generation interspecific sunflower hybrids reveals broad regulatory compatibility
© Rowe and Rieseberg; licensee BioMed Central Ltd. 2013
Received: 4 December 2012
Accepted: 15 April 2013
Published: 23 May 2013
Interspecific hybridization creates individuals harboring diverged genomes. The interaction of these genomes can generate successful evolutionary novelty or disadvantageous genomic conflict. Annual sunflowers Helianthus annuus and H. petiolaris have a rich history of hybridization in natural populations. Although first-generation hybrids generally have low fertility, hybrid swarms that include later generation and fully fertile backcross plants have been identified, as well as at least three independently-originated stable hybrid taxa. We examine patterns of transcript accumulation in the earliest stages of hybridization of these species via analyses of transcriptome sequences from laboratory-derived F1 offspring of an inbred H. annuus cultivar and a wild H. petiolaris accession.
While nearly 14% of the reference transcriptome showed significant accumulation differences between parental accessions, total F1 transcript levels showed little evidence of dominance, as midparent transcript levels were highly predictive of transcript accumulation in F1 plants. Allelic bias in F1 transcript accumulation was detected in 20% of transcripts containing sufficient polymorphism to distinguish parental alleles; however the magnitude of these biases were generally smaller than differences among parental accessions.
While analyses of allelic bias suggest that cis regulatory differences between H. annuus and H. petiolaris are common, their effect on transcript levels may be more subtle than trans-acting regulatory differences. Overall, these analyses found little evidence of regulatory incompatibility or dominance interactions between parental genomes within F1 hybrid individuals, although it is unclear whether this is a legacy or an enabler of introgression between species.
KeywordsHybridization Helianthus Introgression Gene flow Allelic bias Speciation
For organisms that reproduce sexually, biological fitness requires the successful interaction of maternal and paternal genomes within the new individual. While these interactions may take place at various points along the path from DNA to external phenotype, analyses of transcript accumulation currently provide the strongest technology to detect these interactions on a genome-wide scale. Changes in transcript levels are hypothesized to enable response to selective forces in novel environments [1–3]. Alteration of single components of regulatory machinery may have dramatic effects on transcript profiles . We therefore expect that bringing together two sets of regulatory machinery that have been separated for millions of years may lead to novel patterns of transcription that contribute to novel phenotypes in interspecific hybrids.
In plants, the effect of inter-species hybridization on transcript levels has been most extensively studied in allopolyploids, where hybridization occurs in conjunction with genome doubling. Comparison of allopolyploids with autopolyploids in several systems has provided evidence that hybridization has more dramatic effects on transcript phenotypes than increased ploidy [5–7]. In some cases, polyploidization following hybridization has been proposed as a mechanism of moderating novel transcript phenotypes generated by regulatory divergence between parental genomes . Extreme gene expression changes following hybridization, or “transcriptional shock”, have been described in early-generation allopolyploid hybrids of Arabidopsis, wheat, and cotton, as well as diploid Senecio spp. hybrids [9–14]. While in later-generation hybrids and back-crosses, changes in gene expression may be caused by genome rearrangement, segregation of parental alleles, or environmentally-mediated selection on accumulated mutation, transcription in first generation (F1) hybrids will be controlled by interaction between parental genomes mediated by transcriptional machinery. Non-additive F1 transcriptional phenotypes may be caused by differences between parental species at the transcribed locus (cis effects) or differences in trans-acting regulatory factors. In hybrids, parental genomes are exposed to a common pool of trans-acting factors, and analyses of allelic bias, or differential parental genome contributions to accumulated transcript, can provide insight into the relative contributions of cis and trans effects to inter-specific gene expression differences [15, 16].
The sunflower genus Helianthus is native to North America and contains 49 species of annual or perennial herbs. The annual sunflowers form a distinct and well-supported clade containing eleven species, including the widely-distributed species H. annuus and H. petiolaris. These species likely originated in allopatry, but their current ranges show considerable overlap. Cytological studies and genetic maps constructed from interspecific crosses suggest that chromosomal rearrangements have accumulated since the evolutionary separation of H. annuus and H. petiolaris. These species are also separated by differences in morphology, life history and habitat preference, and show poor pollen viability in hybrid offspring [17–19].
Although H. annuus and H. petiolaris are estimated to have diverged from each other nearly 2 million years ago (Sambatti et al. 2012), they have been observed to hybridize in natural settings [20, 21]. Average divergence between H. annuus and H. petiolaris is estimated to range from Fst = 0.19 (based on microsatellite variation) to Fst = 0.3 (based on sequence polymorphism in transcripts), similar to levels of intraspecific divergence among stickleback populations and between human populations from West Africa and East Asia [22–24]. This relatively low divergence is consistent with analyses of single-gene phylogenies that suggest substantial recent introgression between H. annuus and H. petiolaris. In at least three cases, hybridization between H. annuus and H. petiolaris has led to the formation of distinct hybrid species (H. anomalus, H. deserticola, and H. paradoxus), which occupy extreme habitats (active sand dunes, desert, and salt marshes respectively). It has been hypothesized, with experimental support, that hybrids bearing genotypes associated with phenotypic traits and environmental tolerances outside of the range exhibited by either parental species were able to colonize unusual ecological niches and form new species [25, 26].
Hybrids between H. annuus and H. petiolaris have also been created for research and agricultural purposes. Most prominently, H. petiolaris is the source of cytoplasmic male sterility PET1, widely used in commercial sunflower hybrid production . H. petiolaris is a potential source of useful germplasm for improvement of H. annuus cultivar resistance to stresses, particularly osmotic stresses such as drought and saline soils.
Here we investigate patterns of transcript accumulation in hybrid sunflowers generated from controlled crosses of Helianthus annuus (cmsHA89) with H. petiolaris (Pet2152). We find that the majority of transcripts accumulate to intermediate levels in the F1 hybrid, and moreover, that mean transcript levels across parental accessions are highly predictive of transcript levels observed in F1 hybrids. Few transcripts showed accumulation outside of the range observed in parental accessions. Within F1 individuals, bias in accumulation of parental alleles was detected in 20% of transcripts where parental alleles could be reliably distinguished, but the magnitude of differences in accumulation were generally lower than differences observed between parental accessions. These results suggest that both cis and trans regulatory divergence contribute to interspecific differences in transcription, yet H. annuus and H. petiolaris genomes show relatively few instances of “misregulation” or extreme phenotypes at the transcript level.
Plant growth and generation of H. annuus × H. petiolaris hybrids
We used a cultivated accession of H. annuus, rather than a wild accession that might more closely represent the parents of homoploid hybrid sunflowers, for several practical reasons, described below: (1) Male sterility and distinct morphology of the H. annuus cultivar used, cmsHA89, provided better recovery and identification of hybrids than would be expected from crosses involving wild H. annuus. (2) cmsHA89 is not self-incompatible, but requires a pollen donor to produce viable seed. This reduced the chances of self fertilization due to mentor effects in mixed pollen loads (Desrochers et al. 1998). cmsHA89 sterility is conferred by the PET1 cytoplasm; while the ultimate origins of this cytoplasm remain unclear, it was introduced into H. annuus cultivated lines via introgression from H. petiolaris. However, the cmsHA89 cytotype is extremely rare in natural populations of Helianthus (Rieseberg et al. 1994). (3) The large heads of H. annuus cultivars provide greater potential seed yield per single cross than wild accessions. (4) The relatively homozygous genome of cmsHA89 also provided greater power to identify variants between parental genomes and assign parentage to alleles within hybrid offspring. This last factor is particularly important as H. petiolaris is intolerant to inbreeding and inbred lines of H. petiolaris are not available.
Plants of H. annuus cultivar cmsHA89 (USDA PI 650572) and wild H. petiolaris accession Pet2152 (USDA PI 586920) were grown in one gallon pots under standard greenhouse conditions at the University of British Columbia Botanical Garden Nursery. Before they began to open, cmsHA89 flowers were covered with drawstring organza bags to deter unauthorized pollination. When the anther filaments of at least the outer three rings of florets were exposed, cmsHA89 flowers were pollinated with pollen from a single Pet2152 plant and re-covered. Reciprocal crosses were not performed because cmsHA89 does not produce pollen. At the same time, self-incompatible Pet2152 plants were intercrossed. Seed heads were allowed to mature and dry before removal from the plant.
While crosses between H. annuus and H. petiolaris are generally of poor fertility, one cmsHA89 × Pet2152 cross produced approximately 200 mature seeds. F1 seeds (n = 60) were scarified (removal of the top 1/3 of the seed) to improve synchrony of germination and placed on moist filter paper disks in plastic petri dishes in the dark at approximately 25°C. cmsHA89 and Pet2152 (half-siblings of the F1 seedlings) were treated similarly. After 3 days, seedlings were transferred to soil in 32-cell nursery flats in a controlled environment chamber (16:8 light:dark, 50% RH, 28°C). After 3 additional weeks, plants were transplanted into 1-gallon pots and moved to a greenhouse bench.
The hybrid identity of putative F1 plants was confirmed via examination of external phenotypes and molecular markers. Visible phenotypic markers included pigmentation at the base of the stem, leaf shape, plant branching, and production of foliar glandular trichomes. Molecular marker phenotypes were observed by extraction of genomic DNA and amplification of two loci previously determined to differ in size (distinguishable by agarose gel electrophoresis) between cmsHA89 and Pet2152. PCR primers, amplification conditions, and representative gel images are provided in Additional file 1: Supplemental Methods 1.
mRNA extraction and sequencing
At 45 days post-germination, leaf tissue was collected from 8 F1 plants and 2 plants from each parent accession. The youngest fully-expanded leaf was cut from each plant, placed into a 50 ml conical tube, and immediately frozen in liquid nitrogen. Total RNA was extracted from approximately 50 mg of ground tissue as described . Preparation of non-normalized cDNA libraries and whole transcriptome shotgun sequencing (RNA-Seq) via Illumina HiSeq 2000 were performed at the Michael Smith Genome Sciences Centre in Vancouver, British Columbia, Canada (http://www.bcgsc.ca/services). Samples were multiplexed with 3 samples per lane.
Sequence data processing and analysis
Read counts were analyzed in R using the DESeq package to compare counts of reads aligned to a given reference contig . The DESeq package uses a modified Fisher’s exact test with data fit to a negative binomial distribution to test for pair-wise differences in count data between sample classes, allowing within-transcript comparisons across a broad dynamic range. Three pairwise comparisons were performed to identify contigs that showed differences in accumulated mapped transcript reads between H. annuus cmsHA89 and H. petiolaris Pet2152, cmsHA89 and F1 samples, and Pet2152 and F1 samples.
Per-contig read counts were averaged across parental accession samples (cmsHA89, Pet2152) to generate mean parent values, and across samples from F1 hybrid plants. Linear modeling of mean transcript counts from F1 plants as a function of mean parent transcript levels was performed in R. Examination of residuals and leverage estimates for this model led us to remove 3 contigs with values of Cook’s D exceeding 1. Refitting the model without these points did not significantly alter the parameter estimates for the model. Predictions of hybrid transcript values, with 99% confidence intervals, were generated using this model. Reference contigs showing mean hybrid transcript accumulation outside the confines of the confidence interval for predictions were classed as “non-additive”.
Variance among hybrids
Variability in transcript levels among individual F1 hybrids was assessed by calculating the coefficient of variation (CV) for each reference contig. To reduce bias in the estimates of CV due to non-normal distribution of transcript level estimates, read counts were first subjected to a natural log transformation and CV was calculated using the formula CV = sqrt((eσ(ln))2 - 1), where σ(ln) is the sample standard deviation calculated from the log-transformed hybrid transcript values. Contigs with a CV greater than 2 were considered to have high variance among hybrid plants.
Allelic bias in transcript accumulation
Data from the four sequenced parental accession samples (HA89.5, HA89.9, PET.2, PET.3) were analyzed simultaneously using SAMtools ‘mpileup’ to identify single nucleotide polymorphisms (SNPs) with respect to the reference sequence. We used a custom perl script to extract loci meeting the following criteria: 1) the variant allele frequency ≠ 1 (this criterion excludes sites where samples differ from the reference, but not between accessions), 2) phred quality score ≥ 80, 3) a single allele is detected within each accession, and 4) the number of sequence reads covering the position is ≥ 5 for each sample (Additional file 3: Supplemental Methods 2). This final criterion eliminates potential false discovery of allelic bias due to failure of one parental allele to align to the reference transcript set, yet also eliminates sequences that are not transcribed (at a detectable level under our conditions of growth and sampling) in one parent genome that may show true allelic bias in the F1 offspring.
For each qualifying variant position within a contig, read depth per SNP was determined for both H. annuus and H. petiolaris-derived variants within individual F1 transcript sequence datasets. From SAMtools ‘mpileup’ output for individual hybrid plants, we extracted ‘dp4’ (read depth for: reference allele on the forward strand, reference allele on the reverse strand, alternate allele on the forward strand, alternate allele on the reverse strand) at each target site, and combined forward and reverse read counts to determine per-allele read depth. At this stage we also removed variants that were only detected on one direction of sequence read (either forward or reverse), as these are likely to represent sequencing artifacts. Read counts for each variant were compared across F1 samples using DESeq . For later gene-level analyses, significant SNP within the same contig were considered as a single significantly-differing transcript, with allelic bias estimated as an average of differences in read counts per SNP and positions showing inconsistent results (i.e. one position shows significant bias toward the H. annuus variant, while the other shows bias toward the H. petiolaris variant) flagged. To assess the general level of transcript level variation due to cis regulatory divergence between parental genomes versus trans-acting regulators, we examined the overlap between reference contigs with one or more fixed SNP showing significant differences in transcript accumulation between parental accessions and those showing significant allelic bias in F1 hybrids. We also fitted a linear model to predict the magnitude of allelic bias based on the observed difference in transcript level between parental accessions.
Classification and annotation of transcripts
Contigs with transcript accumulation patterns suggesting non-additive interactions between parental genomes within hybrid individuals, as revealed by the analyses described above, were labeled as ‘non-additive’, ‘transgressive’, ‘high variance’, or ‘allelic bias’. We identified non-additive transcripts as those showing significant deviation of mean transcript levels in hybrid plants from combined mean transcript accumulation of parental accessions (e.g. those hybrid transcript values falling outside the 99% confidence interval of the linear model associating hybrid transcript with mean parent transcript levels). Transcripts labeled ‘transgressive’ showed mean accumulation within the F1 hybrids that was significantly greater or less than the mean values observed for both H. annuus and H. petiolaris. While transgressive levels of transcript accumulation should also be described as non-additive, these two categories do not fully overlap due to the differences in analyses used to define them. Situations where the difference between H. annuus and H. petiolaris is large or there is variation in transcript abundance within parental accessions may broaden the confidence interval encompassing ‘additive’ values for F1s, despite F1 means significantly differing from both parents. ‘High variance’ contigs showed estimates of the coefficient of variation across F1s that were greater than 2. The set of reference contigs labeled ‘allelic bias’ contained at least one SNP that distinguished the two parental alleles (see criteria above) with variants represented in mapped cDNA sequence reads at a ratio significantly different from equality.
Potential functions of reference transcript contigs identified as non-additive in F1s according to any of the above criteria were explored via analysis of similarity to published protein (NCBI non-redundant protein RefSeq ) and nucleotide databases using blastx and blastn from NCBI-BLAST + (http://www.ncbi.nlm.nih.gov/books/NBK1763/), filtering results with e-values greater than 1e-10. Analyses of gene ontology (GO) for contigs of interest were performed using GOrilla, with refinement via ReviGO [38, 39]. For each gene list, contigs showing significant similarity to Arabidopsis thaliana TAIR10 sequences with GO annotations were compared to three separate similarly-sized lists of contigs randomly drawn from the H. annuus reference transcript set using a hypergeometric distribution (modified Fisher’s Exact Test). GO processes found to be significantly (FDR-adjusted p-value < 0.05) over-represented in all three analyses are discussed.
External phenotypes of H. annuus cmsHA89, H. petiolaris Pet2152, and F1 hybrid offspring
Days to floweringa
Number of flowersb
Flower diameter (mm)c
H. annuus cmsHA89
H. petiolaris Pet2152
RNA extraction and Illumina shotgun sequencing of cDNA were performed for eight F1 plants as well as two plants from each parental accession, generating an approximate average of 27 million 100 bp paired-end reads per sample. Linear modeling of sequence output showed no significant difference among accessions in the number of reads generated per sample (F = 0.0728, p = 0.93, R2 for the model = 0.0159). However, a significantly smaller percentage of Pet2152 reads were successfully mapped to the H. annuus-derived reference transcript dataset when compared to HA89: 51.94 (± 0.90) vs. 58.26 (± 0.37) percent mapped, F = 4.826, p(model) = 0.037, p(HA89-Pet2152 ≠ 0) = 0.013, R2 for the model = 0.5175. Sequence reads obtained from F1 hybrid plants mapped to the reference with intermediate success: 55.05 ± 2.27 percent mapped, p(HA89-F1 ≠ 0) = 0.077. Of 16312 contigs contained in the reference transcript set, approximately 2.5% had no reads mapped from the combined 12 samples and 7.5% (1220 contigs) had a per-sample average depth of less than ten sequence reads.
Examining the relationships among samples for transcript accumulation levels over the entire transcript reference via both Spearman correlation and principle components analyses showed that two samples, HA89.9 and F1.TA, grouped together rather than with other HA89 or F1-derived samples. The average coefficient of the pair-wise correlation between F1.TA and other F1 samples was 0.818, while the range of correlation coefficients (R2) for comparisons among F1 (excluding F1.TA) was 0.977-0.998. Similarly, while cmsHA89 samples were significantly and positively correlated (R2 = 0.755), transcript levels showed higher similarity between HA89.9 and F1.TA (R2 = 0.977). As patterns of sequence polymorphism, in addition to earlier genotyping, confirmed that these samples were identified correctly, we hypothesize that uncontrolled environmental factors influenced transcript accumulation patterns in these two plants, despite our attempts to maintain similar conditions. In particular, when compared to all other samples these plants show relatively reduced accumulation of transcripts involved in photosynthetic processes, and relative increases in accumulation of transcripts associated with defense and stress responses. To avoid excessive influence of these samples on interpretation of our data, we conducted analyses of both the complete data (n = 12) and a dataset from which HA89.9 and F1.TA had been removed (n = 10). Results presented below show the overlap between these two analyses; differences between analyses of complete and reduced datasets are shown in Additional file 4: Figure S1.
Interspecific differences in transcript accumulation
Non-additive transcript accumulation in F1 hybrids
Analysis of gene ontologies (GO) assigned to these transcripts indicated significant overrepresentation of 160 GO processes, reduced to 64 by collapsing highly redundant categories. Among these GO terms, two groups were prominent, involving photosynthesis and energy metabolism (including photosystem assembly, chlorophyll biosynthesis, plastid localization, pentose-phosphate shunt, electron transport) and defense response (including salicylic acid biosynthesis, regulation of hypersensitive response, MAPK cascades, jasmonate signaling).
Transgressive transcript accumulation in F1 hybrids
Contigs showing significantly transgressive transcript phenotypes across F1 H. annuus x H. petiolaris hybrids in both full and reduced analyses
Variance among F1 hybrids
Allelic bias in transcript accumulated in F1 hybrids
Of the 681 contigs containing at least one SNP showing significant bias in F1s, only 81 were also identified as showing significant differences in transcript accumulation between parental accessions cmsHA89 and PET2152 (Figure 5). A conservative estimate based on overlap of both full and reduced analyses indicates that 1456 (or 9% of) reference transcripts examined differ in transcript accumulation between parental accessions. Differences between parental accessions were consistent with differences between alleles in 65% of the contigs showing significant differences in both sets of analyses. All inconsistent contigs showed significantly higher levels of accumulation in H. petiolaris (compared to H. annuus) samples but significantly lower accumulation of transcripts bearing H. petiolaris alleles in samples from hybrids. Across all contigs showing significant evidence of allelic bias, almost 77% (523/681) showed higher transcript levels in H. petiolaris samples than H. annuus samples, suggesting that the larger number of trancripts observed showing bias toward the H. annuus allele in hybrid samples is not simply explained by preferential alignment of transcript sequence reads to the H. annuus-based reference transcriptome.
Gene expression changes associated with hybridization may be attributed to a variety of factors. Novel gene combinations, chromosomal rearrangements, increases in transposon activity, and changes in DNA methylation status occur in interspecific hybrids and are likely to affect gene expression [30, 31, 40–43]. The transcriptional phenotypes of first generation hybrids should predominantly reflect the basic interaction of parental genomes and their endogenous regulatory factors. Deviation of F1 transcript accumulation from midparent values (expected if parental genome contributions to hybrid transcript accumulation were purely additive) will reflect epistatic and dominance interactions between parental genomes.
The transcript patterns observed in annual sunflower hybrids in this study differ from other systems used to study homoploid hybridization in experimental settings, such as Drosophila interspecies hybrids and re-synthesized Senecio squalidus, where relatively high proportions of transcripts examined showed “misexpression” or allelic bias [14, 15, 44]. Various lines of evidence suggest that H. annuus and H. petiolaris have experienced substantial levels of recent genetic exchange, in several instances resulting in ecologically mediated formation of hybrid species [22, 25, 45–48]. While reduced divergence through introgression might be expected to increase genomic compatibility, selection for hybrid viability should also select against extreme levels of genomic misregulation. In this study, we have selected not merely for strict viability, but for growth beyond the seedling stage. It remains possible that regulatory incompatibilities have greater impact on early stages of growth and development, or specifically in reproductive tissues, and thus are not detected in this study, which, as is generally true for analyses of transcript accumulation, can only provide a snapshot of the continuous flow of transcript production and degradation. In this experiment, we also observed strong, uncontrolled environmental effects on transcript profiles that led to a loss of experimental power, most prominently affecting our ability to confidently identify transcriptional differences between H. annuus cmsHA89 and H. petiolaris PET2152 or F1 hybrids. Comparisons between H. petiolaris and F1, or within F1, are relatively unaffected. While this means that we may underestimate transcriptional divergence of F1 from the maternal parent, a broader implication is that uncontrolled environmental factors can have dramatic effects on transcription. The distribution of random effects within the generally resource-limited designs of many transcriptional profiling experiments may have profound effects on the conclusions drawn from these experiments, which would be exacerbated by genotype by environment interaction.
It is believed that formation of Helianthus hybrid species has been mediated by environmental selection on transgressive phenotypes generated through segregation of parental genomes [25, 26, 28]. At the same time, interacting parental genomes present in early generation hybrids must generate phenotypes with sufficient fitness to survive beyond the initial hybrid generation for novel segregants to appear. Naturally-occurring hybrid individuals, as well as laboratory-derived first-generation hybrids, appear to exhibit intermediate phenotypes for many morphological and phenological traits (Table 1) [19–21, 49]. This study suggests that H. annuus × H. petiolaris F1 hybrids also exhibit quantitatively intermediate phenotypes at the level of transcript accumulation, reflecting widespread compatibility between diverged parental transcript regulatory networks. The small sample sizes for parental accessions in this study may have hindered detection of transgressive transcription in F1 hybrids, through increased uncertainty regarding actual parental transcript levels. Our approach still provides an improvement in estimating parental transcription over strategies employing pooled samples, and focusing sampling effort on individual F1s has provided more reliable estimates of both the mean and variance of transcript levels in hybrids.
Although Senecio aethnensis and S. chrysanthemifolius (the parents of the homoploid hybrid S. squalidus) form a well-established hybrid zone with evidence of substantial gene flow between species, a much larger percentage of the analyzed transcript of first generation hybrids showed evidence of non-additive (4.9%) or transgressive (3.2%) accumulation [14, 50, 51]. The relative scarcity of non-additive (0.97–1.28%) or transgressive (0.06–0.7%) transcriptional phenotypes in this study might be attributed to differences in methodology. Quantification of transcript levels via sequencing techniques, rather than hybridization-based microarray platforms, allow both examination of a broader array of transcripts and greater sensitivity to detect low-level transcripts (without necessarily increasing statistical power to detect differences in these transcripts). Analysis of individual F1 plants allowed assessment of transcript variance among hybrids; distributions of hybrid transcript levels may suggest different relationships to parental transcript levels than values generated from pooled hybrid samples. Differences in the historic patterns of hybridization and selection, or in the phylogenetic distance between hybridizing species (much greater for H. annuus and H. petiolaris than between the two sister Senecio species in question), might also account for the different outcomes observed in these two hybrid systems. In particular, the female parental lineage of the hybrids examined in this study is a product of modern breeding, which has included hybridization with wild sunflowers .
Less than 1% of analyzed hybrid transcripts levels fell outside the predictive 99% confidence interval based on averaged transcript levels from parental accessions (Figure 3). Thus, non-additive transcript levels in these F1 plants are detected at a frequency indistinguishable from that expected by chance. These transcripts do, however, show significant over-representation of transcripts predicted to function in the broad categories photosynthesis/energy metabolism and response to biotic stimulus. In particular, transcripts participating in photosynthetic and energy processes are likely to be influenced by interaction with cytoplasmic components, even if the genes themselves are transmitted through nuclear inheritance. These transcripts also accumulate to high levels, potentially increasing the relative statistical power to identify variance from expected transcript values. The GO terms associated with the group of non-additively accumulated transcripts putatively involved in responses to biotic stimuli include defense response to bacterium, salicylic acid biosynthesis and metabolism, systemic acquired resistance, and MAPK cascade signaling. Misregulation or allelic incompatibility of genes involved in plant immune responses, particularly related to specific recognition of biotrophic pathogens, has been implicated in hybrid necroses (an extreme example of hybrid genome incompatibility) in Arabidopsis thaliana, lettuce, and wheat [53–55]. The hybrid plants in this study showed no obvious sign of hybrid necroses under relatively benign growth conditions, and rigorous examination of the phenotypic consequences of altered transcript levels for these immunity-associated genes will be necessary to determine whether immune incompatibilities are likely to have significant evolutionary consequences for Helianthus hybrids.
Interspecific hybridization presents the opportunity to distinguish the effects of nucleotide sequence variation associated with the transcript site (cis variation) and polymorphism in trans-acting regulatory factors. Variation in transcript accumulation between parental accessions that is caused by polymorphism in trans-acting factors should be diminished in hybrid individuals where transcription factors from both genomes are present. The allelic bias detected in F1 hybrids suggests that many differences observed between parental accessions are attributable to cis variation, although the magnitude of allelic bias is generally smaller than the difference in transcript levels observed between parental accessions. The observed expression patterns might therefore be a product of regulatory interaction within or between loci.
Analyses of gene ontology indicated that the group of transcripts showing significant allelic bias is enriched for processes including chloroplast organization, energy metabolism, translation, rRNA processing, and biosynthesis of isopentenyl diphosphate via the non-mevalonate (plastid-based) pathway. As these processes all involve cytoplasmically-inherited cellular components, it is plausible that nuclear-cytoplasmic interactions drive the allelic biases in transcript accumulation observed in hybrids. Despite H. annuus serving as the maternal parent of the hybrids, all over-represented gene groups examined contained a mixture of transcripts showing overrepresentation of H. annuus or H. petiolaris alleles.
The extent of cis regulatory differences between H. annuus and H. petiolaris transcripts is likely underestimated in the approach presented here. The criteria for selection of variants used to assign parentage to transcripts within F1 individuals excludes both loci lacking mapped transcript reads from either parental accession and loci that are polymorphic within either parental accession. While, on average, approximately 130,000 high-confidence heterozygous sites were identified per F1 individual, parentage could only be reliably assigned for a fraction of these. In addition, transcripts affected by polymorphism in cis regulatory sequences, but lacking consistent sequence polymorphism between parental accessions within the actual transcripts, will not be detected as showing allelic bias, although transcripts from such loci may be preferentially derived from one parental genome [16, 56, 57]. A relative lack of fixed polymorphism in transcripts showing expression differences between parental accessions contributes to the minimal overlap observed between transcripts showing allelic bias and those showing differences in accumulation between parental accessions. This suggests that cis and trans regulation are both influential in first generation hybrids, but confidently apportioning their relative effects will require additional data from non-coding regulatory regions.
Studies typically focus on the extreme consequences of hybridization, both the good (heterosis) and the bad (genomic incompatibilities and hybrid necroses) [53–55, 58–61]. This study, in contrast, detects few extreme transcript phenotypes in hybrid offspring of two annual sunflower species that show evidence of extensive gene flow since their divergence. Comparison of additional hybrid transcriptomes from crosses of wild sympatric and allopatric H. annuus and H. petiolaris, particularly incorporating a range of tissues and developmental stages, may clarify the role that introgression plays in transcriptional compatibility.
Availability of supporting data
RNAseq data used in this study are deposited in the NCBI Sequence Read Archive under accession numbers SRS1993196 through SRS1993207.
Coefficient of variation
Expressed sequence tag
False discovery rate
Mitogen-activated protein kinase
Polymerase chain reaction
Reads per kilobase of transcript sequence per million reads mapped from library
This work was supported by the U.S. National Institutes of Health [5F32GM089058 to H.C.R.] and Genome Canada and Genome BC to L.H.R. We thank Chris Grassa for assistance with bioinformatics, particularly in identifying fixed species variants to enable analyses of allelic bias. We additionally thank Greg Baute and Chris Grassa for providing genetic positions for reference transcripts, and Sebastien Renaut for helpful discussion and critical comments on this manuscript.
- Wray GA: The evolutionary significance of cis-regulatory mutations. Nat Rev Genet. 2007, 8: 206-216.View ArticlePubMedGoogle Scholar
- Stern DL, Orgogozo V: The loci of evolution: how predictable is genetic evolution?. Evolution. 2008, 62: 2155-2177. 10.1111/j.1558-5646.2008.00450.x.PubMed CentralView ArticlePubMedGoogle Scholar
- Carroll SB: Evo-devo and an expanding evolutionary synthesis: a genetic theory of morphological evolution. Cell. 2008, 134: 25-36. 10.1016/j.cell.2008.06.030.View ArticlePubMedGoogle Scholar
- Featherstone DE, Broadie K: Wrestling with pleiotropy: genomic and topological analysis of the yeast gene expression network. Bioessays. 2002, 24: 267-274. 10.1002/bies.10054.View ArticlePubMedGoogle Scholar
- Buggs RJA, Zhang L, Miles N, Tate JA, Gao L, Wei W, Schnable PS, Barbazuk WB, Soltis PS, Soltis DE: Transcriptomic Shock Generates Evolutionary Novelty in a Newly Formed, Natural Allopolyploid Plant. Curr Biol. 2011, 21: 551-556. 10.1016/j.cub.2011.02.016.View ArticlePubMedGoogle Scholar
- Chaudhary B, Flagel L, Stupar RM, Udall JA, Verma N, Springer NM, Wendel JF: Reciprocal silencing, transcriptional bias and functional divergence of homeologs in polyploid cotton (gossypium). Genetics. 2009, 182: 503-517. 10.1534/genetics.109.102608.PubMed CentralView ArticlePubMedGoogle Scholar
- Stupar RM, Bhaskar PB, Yandell BS, Rensink WA, Hart AL, Ouyang S, Veilleux RE, Busse JS, Erhardt RJ, Buell CR, Jiang J: Phenotypic and transcriptomic changes associated with potato autopolyploidization. Genetics. 2007, 176: 2055-2067. 10.1534/genetics.107.074286.PubMed CentralView ArticlePubMedGoogle Scholar
- Hegarty MJ, Barker GL, Wilson ID, Abbott RJ, Edwards KJ, Hiscock SJ: Transcriptome shock after interspecific hybridization in senecio is ameliorated by genome duplication. Curr Biol. 2006, 16: 1652-1659. 10.1016/j.cub.2006.06.071.View ArticlePubMedGoogle Scholar
- Comai L, Tyagi AP, Winter K, Holmes-Davis R, Reynolds SH, Stevens Y, Byers B: Phenotypic instability and rapid gene silencing in newly formed arabidopsis allotetraploids. Plant Cell. 2000, 12: 1551-1568.PubMed CentralView ArticlePubMedGoogle Scholar
- Wang J, Tian L, Madlung A, Lee H-S, Chen M, Lee JJ, Watson B, Kagochi T, Comai L, Chen ZJ: Stochastic and epigenetic changes of gene expression in Arabidopsis polyploids. Genetics. 2004, 167: 1961-1973. 10.1534/genetics.104.027896.PubMed CentralView ArticlePubMedGoogle Scholar
- He P, Friebe BR, Gill BS, Zhou J-M: Allopolyploidy alters gene expression in the highly stable hexaploid wheat. Plant Mol Biol. 2003, 52: 401-414. 10.1023/A:1023965400532.View ArticlePubMedGoogle Scholar
- Levy AA, Feldman M: Genetic and epigenetic reprogramming of the wheat genome upon allopolyploidization. Biol J Linn Soc. 2004, 82: 607-613. 10.1111/j.1095-8312.2004.00346.x.View ArticleGoogle Scholar
- Adams KL, Flagel L, Wendel JF: Responses of the Cotton Genome to Polyploidy. Genetics and Genomics of Cotton. Edited by: Paterson AH. 2009, New York, NY: Springer US, 419-429.View ArticleGoogle Scholar
- Hegarty MJ, Barker GL, Brennan AC, Edwards KJ, Abbott RJ, Hiscock SJ: Extreme changes to gene expression associated with homoploid hybrid speciation. Mol Ecol. 2009, 18: 877-889. 10.1111/j.1365-294X.2008.04054.x.View ArticlePubMedGoogle Scholar
- Wittkopp PJ, Haerum BK, Clark AG: Regulatory changes underlying expression differences within and between Drosophila species. Nat Genet. 2008, 40: 346-350. 10.1038/ng.77.View ArticlePubMedGoogle Scholar
- Fontanillas P, Landry CR, Wittkopp PJ, Russ C, Gruber JD, Nusbaum C, Hartl DL: Key considerations for measuring allelic expression on a genomic scale using high-throughput sequencing. Mol Ecol. 2010, 19 Suppl 1: 212-227.View ArticlePubMedGoogle Scholar
- Burke JM, Lai Z, Salmaso M, Nakazato T, Tang S, Heesacker A, Knapp SJ, Rieseberg LH: Comparative mapping and rapid karyotypic evolution in the genus helianthus. Genetics. 2004, 167: 449-457. 10.1534/genetics.167.1.449.PubMed CentralView ArticlePubMedGoogle Scholar
- Lai Z, Nakazato T, Salmaso M, Burke JM, Tang S, Knapp SJ, Rieseberg LH: Extensive chromosomal repatterning and the evolution of sterility barriers in hybrid sunflower species. Genetics. 2005, 171: 291-303. 10.1534/genetics.105.042242.PubMed CentralView ArticlePubMedGoogle Scholar
- Rosenthal DM, Schwarzbach AE, Donovan LA, Raymond O, Rieseberg LH: Phenotypic Differentiation between Three Ancient Hybrid Taxa and Their Parental Species. Int J Plant Sci. 2002, 163: 387-398. 10.1086/339237.View ArticleGoogle Scholar
- Heiser CB: Hybridization Between the Sunflower Species Helianthus annuus and H. petiolaris. Evolution. 1947, 1: 249-262. 10.2307/2405326.View ArticleGoogle Scholar
- Rieseberg LH, Whitton J, Gardner K: Hybrid zones and the genetic architecture of a barrier to gene flow between two sunflower species. Genetics. 1999, 152: 713-727.PubMed CentralPubMedGoogle Scholar
- Kane NC, King MG, Barker MS, Raduski A, Karrenberg S, Yatabe Y, Knapp SJ, Rieseberg LH: Comparative genomic and population genetic analyses indicate highly porous genomes and high levels of gene flow between divergent helianthus species. Evolution. 2009, 63: 2061-2075. 10.1111/j.1558-5646.2009.00703.x.PubMed CentralView ArticlePubMedGoogle Scholar
- Jones FC, Chan YF, Schmutz J, Grimwood J, Brady SD, Southwick AM, Absher DM, Myers RM, Reimchen TE, Deagle BE, Schluter D, Kingsley DM: A genome-wide SNP genotyping array reveals patterns of global and repeated species-pair divergence in sticklebacks. Curr Biol. 2012, 22: 83-90. 10.1016/j.cub.2011.11.045.PubMed CentralView ArticlePubMedGoogle Scholar
- Nelis M, Esko T, Mägi R, Zimprich F, Zimprich A, Toncheva D, Karachanak S, Piskáčková T, Balaščák I, Peltonen L, Jakkula E, Rehnström K, Lathrop M, Heath S, Galan P, Schreiber S, Meitinger T, Pfeufer A, Wichmann H-E, Melegh B, Polgár N, Toniolo D, Gasparini P, D’Adamo P, Klovins J, Nikitina-Zake L, Kučinskas V, Kasnauskienė J, Lubinski J, Debniak T: Genetic structure of Europeans: a view from the North–East. PLoS One. 2009, 4: e5472-10.1371/journal.pone.0005472.PubMed CentralView ArticlePubMedGoogle Scholar
- Gross BL, Kane NC, Lexer C, Ludwig F, Rosenthal DM, Donovan LA, Rieseberg LH: Reconstructing the origin of Helianthus deserticola: survival and selection on the desert floor. Am Nat. 2004, 164: 145-156. 10.1086/422223.PubMed CentralView ArticlePubMedGoogle Scholar
- Karrenberg S, Lexer C, Rieseberg LH: Reconstructing the History of selection during Homoploid Hybrid Speciation. Am Nat. 2007, 169: 725-737. 10.1086/516758.PubMed CentralView ArticlePubMedGoogle Scholar
- Laver HK, Reynolds SJ, Moneger F, Leaver CJ: Mitochondrial genome organization and expression associated with cytoplasmic male sterility in sunflower (Helianthus annuus). Plant J. 1991, 1: 185-193. 10.1111/j.1365-313X.1991.00185.x.View ArticlePubMedGoogle Scholar
- Lai Z, Gross BL, Zou Y, Andrews J, Rieseberg LH: Microarray analysis reveals differential gene expression in hybrid sunflower species. Mol Ecol. 2006, 15: 1213-1227. 10.1111/j.1365-294X.2006.02775.x.PubMed CentralView ArticlePubMedGoogle Scholar
- Renaut S, Grassa C, Moyers B, Kane N, Rieseberg L: The population genomics of sunflowers and genomic determinants of protein evolution revealed by RNAseq. Biology. 2012, 1: 575-596. 10.3390/biology1030575.PubMed CentralView ArticlePubMedGoogle Scholar
- Rieseberg LH, Sinervo B, Linder CR, Ungerer MC, Arias DM: Role of gene interactions in hybrid speciation: evidence from ancient and experimental hybrids. Science. 1996, 272: 741-745. 10.1126/science.272.5262.741.View ArticlePubMedGoogle Scholar
- Shaked H, Kashkush K, Ozkan H, Feldman M, Levy AA: Sequence elimination and cytosine methylation are rapid and reproducible responses of the genome to wide hybridization and allopolyploidy in wheat. Plant Cell. 2001, 13: 1749-1759.PubMed CentralView ArticlePubMedGoogle Scholar
- Bowers JE, Bachlava E, Brunick RL, Rieseberg LH, Knapp SJ, Burke JM: Development of a 10,000 Locus Genetic Map of the Sunflower Genome Based on Multiple Crosses. G3. 2012, 2: 721-729. 2012.PubMed CentralView ArticlePubMedGoogle Scholar
- Li H, Durbin R: Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009, 25: 1754-1760. 10.1093/bioinformatics/btp324.PubMed CentralView ArticlePubMedGoogle Scholar
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R: The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009, 25: 2078-2079. 10.1093/bioinformatics/btp352.PubMed CentralView ArticlePubMedGoogle Scholar
- Quinlan AR, Hall IM: BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010, 26: 841-842. 10.1093/bioinformatics/btq033.PubMed CentralView ArticlePubMedGoogle Scholar
- Anders S, Huber W: Differential expression analysis for sequence count data. Genome Biol. 2010, 11: R106-10.1186/gb-2010-11-10-r106.PubMed CentralView ArticlePubMedGoogle Scholar
- Pruitt KD, Tatusova T, Klimke W, Maglott DR: NCBI Reference sequences: current status, policy and new initiatives. Nucleic Acids Res. 2009, 37: D32-D36. 10.1093/nar/gkn721.PubMed CentralView ArticlePubMedGoogle Scholar
- Eden E, Navon R, Steinfeld I, Lipson D, Yakhini Z: GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists. BMC Bioinformatics. 2009, 10: 48-10.1186/1471-2105-10-48.PubMed CentralView ArticlePubMedGoogle Scholar
- Supek F, Bošnjak M, Škunca N, Šmuc T: REVIGO Summarizes and visualizes long lists of gene ontology terms. PLoS One. 2011, 6: e21800-10.1371/journal.pone.0021800.PubMed CentralView ArticlePubMedGoogle Scholar
- Kashkush K, Feldman M, Levy AA: Gene loss, silencing and activation in a newly synthesized wheat allotetraploid. Genetics. 2002, 160: 1651-1659.PubMed CentralPubMedGoogle Scholar
- Shan X, Liu Z, Dong Z, Wang Y, Chen Y, Lin X, Long L, Han F, Dong Y, Liu B: Mobilization of the active MITE transposons mPing and Pong in rice by introgression from wild rice (Zizania latifolia Griseb.). Mol Biol Evol. 2005, 22: 976-990. 10.1093/molbev/msi082.View ArticlePubMedGoogle Scholar
- Salmon A, Ainouche ML, Wendel JF: Genetic and epigenetic consequences of recent hybridization and polyploidy in Spartina (Poaceae). Mol Ecol. 2005, 14: 1163-1175. 10.1111/j.1365-294X.2005.02488.x.View ArticlePubMedGoogle Scholar
- Beaulieu J, Jean M, Belzile F: The allotetraploid Arabidopsis thaliana–Arabidopsis lyrata subsp. petraea as an alternative model system for the study of polyploidy in plants. Mol Genet Genomics. 2009, 281: 421-435. 10.1007/s00438-008-0421-7.View ArticlePubMedGoogle Scholar
- McManus CJ, Coolon JD, Duff MO, Eipper-Mains J, Graveley BR, Wittkopp PJ: Regulatory divergence in Drosophila revealed by mRNA-seq. Genome Res. 2010, 20: 816-825. 10.1101/gr.102491.109.PubMed CentralView ArticlePubMedGoogle Scholar
- Yatabe Y, Kane NC, Scotti-Saintagne C, Rieseberg LH: Rampant gene exchange across a strong reproductive barrier between the annual sunflowers, Helianthus annuus and H. petiolaris. Genetics. 2007, 175: 1883-1893. 10.1534/genetics.106.064469.PubMed CentralView ArticlePubMedGoogle Scholar
- Sambatti JBM, Strasburg JL, Ortiz-Barrientos D, Baack EJ, Rieseberg LH: Reconciling extremely strong barriers with high levels of gene exchange in annual sunflowers. Evolution. 2012, 66: 1459-1473. 10.1111/j.1558-5646.2011.01537.x.View ArticlePubMedGoogle Scholar
- Rieseberg LH: Homoploid reticulate evolution in helianthus (Asteraceae): evidence from ribosomal genes. Am J Bot. 1991, 78: 1218-1237. 10.2307/2444926.View ArticleGoogle Scholar
- Ludwig F, Rosenthal DM, Johnston JA, Kane N, Gross BL, Lexer C, Dudley SA, Rieseberg LH, Donovan LA: Selection on leaf ecophysiological traits in a desert hybrid helianthus species and early-generation hybrids. Evolution. 2004, 58: 2682-2692.PubMed CentralView ArticlePubMedGoogle Scholar
- Rieseberg LH: Crossing relationships among ancient and experimental sunflower hybrid lineages. Evolution. 2000, 54: 859-865.View ArticlePubMedGoogle Scholar
- Brennan AC, Bridle JR, Wang A-L, Hiscock SJ, Abbott RJ: Adaptation and selection in the Senecio (Asteraceae) hybrid zone on Mount Etna, Sicily. New Phytol. 2009, 183: 702-717. 10.1111/j.1469-8137.2009.02944.x.View ArticlePubMedGoogle Scholar
- Brennan AC, Barker D, Hiscock SJ, Abbott RJ: Molecular genetic and quantitative trait divergence associated with recent homoploid hybrid speciation: a study of Senecio squalidus (Asteraceae). Heredity (Edinb). 2012, 108: 87-95. 10.1038/hdy.2011.46.View ArticleGoogle Scholar
- Leclerq P: Une sterilite male cytoplasmique chez le tournesol. Ann Amelior Plant. 1969, 19: 99-106.Google Scholar
- Bomblies K, Lempe J, Epple P, Warthmann N, Lanz C, Dangl JL, Weigel D: Autoimmune response as a mechanism for a dobzhansky-muller-type incompatibility syndrome in plants. PLoS Biol. 2007, 5: e236-10.1371/journal.pbio.0050236.PubMed CentralView ArticlePubMedGoogle Scholar
- Jeuken MJW, Zhang NW, McHale LK, Pelgrom K, den Boer E, Lindhout P, Michelmore RW, Visser RGF, Niks RE: Rin4 Causes Hybrid Necrosis and Race-Specific Resistance in an Interspecific Lettuce Hybrid. Plant Cell. 2009, 21: 3368-3378. 10.1105/tpc.109.070334.PubMed CentralView ArticlePubMedGoogle Scholar
- Mizuno N, Hosogi N, Park P, Takumi S: Hypersensitive response-like reaction is associated with hybrid necrosis in interspecific crosses between tetraploid wheat and Aegilops tauschii Coss. PLoS One. 2010, 5: e11326-10.1371/journal.pone.0011326.PubMed CentralView ArticlePubMedGoogle Scholar
- Degner JF, Marioni JC, Pai AA, Pickrell JK, Nkadori E, Gilad Y, Pritchard JK: Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data. Bioinformatics. 2009, 25: 3207-3212. 10.1093/bioinformatics/btp579.PubMed CentralView ArticlePubMedGoogle Scholar
- Gregg C, Zhang J, Weissbourd B, Luo S, Schroth GP, Haig D, Dulac C: High-resolution analysis of parent-of-origin allelic expression in the mouse brain. Science. 2010, 329: 643-648. 10.1126/science.1190830.PubMed CentralView ArticlePubMedGoogle Scholar
- Ispolatov I, Doebeli M: Speciation due to hybrid necrosis in plant-pathogen models. Evolution. 2009, 63: 3076-3084. 10.1111/j.1558-5646.2009.00800.x.View ArticlePubMedGoogle Scholar
- Birchler JA, Yao H, Chudalayandi S, Vaiman D, Veitia RA: Heterosis. Plant Cell. 2010, 22: 2105-2112. 10.1105/tpc.110.076133.PubMed CentralView ArticlePubMedGoogle Scholar
- Lippman ZB, Zamir D: Heterosis: revisiting the magic. Trends Genet. 2007, 23: 60-66. 10.1016/j.tig.2006.12.006.View ArticlePubMedGoogle Scholar
- Veitia RA, Vaiman D: Exploring the mechanistic bases of heterosis from the perspective of macromolecular complexes. FASEB J. 2011, 25: 476-482. 10.1096/fj.10-170639.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.