Identification of a strawberry flavor gene candidate using an integrated genetic-genomic-analytical chemistry approach
© Chambers et al.; licensee BioMed Central Ltd. 2014
Received: 14 November 2013
Accepted: 17 March 2014
Published: 17 April 2014
There is interest in improving the flavor of commercial strawberry (Fragaria × ananassa) varieties. Fruit flavor is shaped by combinations of sugars, acids and volatile compounds. Many efforts seek to use genomics-based strategies to identify genes controlling flavor, and then designing durable molecular markers to follow these genes in breeding populations. In this report, fruit from two cultivars, varying for presence-absence of volatile compounds, along with segregating progeny, were analyzed using GC/MS and RNAseq. Expression data were bulked in silico according to presence/absence of a given volatile compound, in this case γ-decalactone, a compound conferring a peach flavor note to fruits.
Computationally sorting reads in segregating progeny based on γ-decalactone presence eliminated transcripts not directly relevant to the volatile, revealing transcripts possibly imparting quantitative contributions. One candidate encodes an omega-6 fatty acid desaturase, an enzyme known to participate in lactone production in fungi, noted here as FaFAD1. This candidate was induced by ripening, was detected in certain harvests, and correlated with γ-decalactone presence. The FaFAD1 gene is present in every genotype where γ-decalactone has been detected, and it was invariably missing in non-producers. A functional, PCR-based molecular marker was developed that cosegregates with the phenotype in F1 and BC1 populations, as well as in many other cultivars and wild Fragaria accessions.
Genetic, genomic and analytical chemistry techniques were combined to identify FaFAD1, a gene likely controlling a key flavor volatile in strawberry. The same data may now be re-sorted based on presence/absence of any other volatile to identify other flavor-affecting candidates, leading to rapid generation of gene-specific markers.
KeywordsRNA-seq γ-decalactone Fragaria Flavor Molecular marker Strawberry
The commercial strawberry (Fragaria x ananassa) (2n = 8x = 56) is a popular fresh and processed fruit with substantial value worldwide. It is recognized for its sweet flavors and appealing aromas. The volatile profiles of strawberry are relatively complicated among berries, with over 360 volatile compounds reported . A reduced set of approximately 20 volatiles are commonly reported to be important components of strawberry flavor [2–4]. The principle flavor compounds include esters , ketones, terpenes , furanones , aldehydes , alcohols, and sulfur-containing compounds. The concentrations of individual volatiles are highly dependent on species , environment and harvest date , [10–12], cultivar, postharvest treatment and fruit developmental stage .
One important volatile compound is γ-decalactone (γ-D; CAS 706-14-9). This volatile is described as “fruity”, “sweet”, or “peachy”  and contributes to fruit aroma [14, 15]. The volatile tends to be undetectable in some genotypes , while in others its accumulation varies greatly within and between harvest seasons . This pattern suggests that a critical biosynthetic step or substrate may be missing or limited, and under strong environmental influence. The high variability may be due to differences in expression of genes encoding enzymes linked to the process. The observation that some genotypes never produce the compound when others do presents an excellent basis to use global transcriptome profiling to identify candidate genes associated with is production or stability. Because the commercial strawberry is octoploid, F1 progeny from a volatile producer and a non-producer have led to predictions about inheritance of a given volatile [2, 16].
A number of researchers have used genomics approaches to identify marker-trait associations in polyploids using SNPs. Advances in marker discovery have been made in allohexaploid wheat (Triticum spp.) cultivars using the Illumina GoldenGate Assay , allohexaploid oat (Avena sativa L.) using Roche 454 sequencing , and allotetraploid oilseed rape (Brassica napus) using Illumina Solexa sequencing . Other approaches have focused on developmental changes in the transcriptome associated with ripening to identify gene candidates. This strategy was effective for grape (Vitis spp.) using Illumina sequencing , and recently in peach (Prunus persica L.) using microarrays .
The goal of this work is to use a transcriptome-based approach to identify the genes required for γ-D production. The approach leverages the presence/absence nature of γ-D from specific genotypes, its predictable inheritance, environmental lability, and variation during the growing season. The analysis identified one transcript from a narrow set of gene candidates that is functionally related to genes implicated in biosynthesis of this compound in certain fungi [22–24] and the related compound γ-dodecalactone . A PCR-based amplicon corresponding to the candidate sequence co-segregates with the volatile in a breeding population, corresponding backcrosses, and in select cultivars and wild accessions. We demonstrate that computational bulking of RNAseq data based on the presence or absence of a volatile can identify transcripts likely playing a direct role in volatile production.
The gene segregating with the presence of the γ-D volatile has been shown to segregate as a single dominant locus, making it a prime candidate for the approach outlined in Additional file 1: Figure S1. Briefly, a cross was constructed between Elyana, a γ-D producing cultivar, and Mara des Bois, a cultivar where γ-D has not been detected. Progeny were grown, and fruits from each individual plant were analyzed for volatiles and coincident gene expression. The fruits from each plant were analyzed and sequenced separately so that transcriptomes from producers and non-producers could be bulked computationally, with the hypothesis that candidate genes would be common to producers, while being expressed low levels or go undetected in non-producers. Results could be experimentally validated in the parental lines and in segregating progeny using gene expression analysis.
γ-D quantity is genetically and environmentally influenced
Fourteen progeny and both parents were individually analyzed by RNA-seq. The γ-D non-producers included in this analysis included ‘Mara des Bois’, 42, 89, 193, and 203. The producers were ‘Elyana’, 6, 24, 37, 51, 91, 93, 98, 103, 152, and 204. Many of the producers had higher γ-D levels than the ‘Elyana’ γ-D positive parent (Figure 2). The transcriptomes from individual lines were computationally pooled based on “producer” or “non-producer”. Over 106 million MID-tagged RNAseq reads were generated from each of the parents and progeny. The average number of filtered and mapped reads per genotype was 5.5 million per genotype and ranged from 3 million to 8.5 million. Both parents had more than 7 million filtered, mapped reads.
Approximately 17,000 out of ~31,000 annotated genes in the strawberry draft genome  were represented in the RNA-seq dataset. A cursory SNP search identified over 1.7 million SNPs total when compared to the F. vesca genome (SNP criteria: 95% minimum P not ref, 10 or greater read depth, and present in at least two of the sixteen genotypes’ datasets).
Gene expression trends in parental lines
Transcripts abundant in gamma-decalactone-producing genotypes
GO biological process
GO cellular component
GO molecular function
Omega-6 fatty acid endoplasmic reticulum isozyme 2-like
Unsaturated fatty acid biosynthesis process; oxidation-reduction process;
Endoplasmic reticulum membrane; integral to membrane;
Delta12-fatty acid dehydrogenase activity; oxidoreductase activity, acting on paired donors, with oxidation of a pair of donors resulting in the reduction of molecular oxygen to two molecules of water; omega-6 fatty acid desaturase activity;
Osmotin-like protein osm34
Defense response to bacterium, incompatible interaction; response to salt stress; defense response to fungus, incompatible interaction;
Serine threonine-protein kinase rio1-like
ATP binding; protein serine/threonine kinase activity;
Tricarboxylic acid cycle; cellular carbohydrate metabolic process; response to cadmium ion;
Cell wall; mitochondrial matrix; chloroplast;
Zinc ion binding; citrate (Si)-synthase activity; ATP binding;
PREDICTED: uncharacterized protein in LOC101300103
atp-dependent-nad h-hydrate dehydrate-like
f-box protein at5g07610-like
Diphthamide biosynthesis protein 2-like
Serine threonine protein phosphatase
Generation of precursor metabolites and energy;
Chloroplast; membrane; mitochondrion;
Computational bulking to limit candidate set
The large number of differentially-expressed transcripts could be further narrowed by analyzing transcript patterns for these genes in progeny segregating for γ-D. Pairwise comparisons were made between genotypes with high (genotypes ‘Elyana’, 91, 24, 37, 103, and 006) and non-detectable (genotypes ‘Mara des Bois’, 203, 89, and 42) γ-D levels. Gene candidates were filtered to have a modest >4-fold increase in transcript support of producers over non-producers. Using this approach, a single gene candidate was identified, gene24414 on linkage group 3 (LG3:31112418..31114643, scf0513029:129621..131846), the same abundant transcript shown in Table 1 as variable between the two parents. To illustrate how the integration of segregating progeny can separate out transcripts not common to γ-D volatile producers, all transcripts from the “lipids” category are shown in Figure 3C from the parental genotypes and several of their progeny. In general, lipid related transcripts show limited differential accumulation between any genotypes. The clear exception is the omega-6-fatty-acid desaturase (bottom row) which is not detected in ‘Mara des Bois’ (M) and in three of the progeny (green squares). The γ-D volatile was not detected in these same genotypes. Of all candidates from Table 1, the omega-6-fatty-acid desaturase was the only transcript that correlated 100% with the ability to produce γ-D. The gene was given the designation FaFAD1. A 1,128 bp open reading frame was cloned from ‘Elyana’ cDNA (Additional file 2).
Validation of key candidates
Candidate genes in fruit developmental series
Candidate genes and two environments
Molecular basis of γ-decalactone loss-of-function
γ-decalactone marker in three populations
The potential molecular marker was also tested against a set of cultivars with demonstrated present or undetectable γ-D. ‘Radiance’, ‘Albion’, ‘Winter Star’, and ‘Sweet Charlie’ were all γ-D positive and positive also for the PCR product (Figure 8C). ‘Deutsch Evern’, ‘Strawberry Festival’, ‘LF9’, and ‘Mieze Schindler’ were all negative for γ-D and also negative also for the marker. ‘Strawberry Festival’ and ‘LF9’ were additionally interesting because ‘LF9’ is a seedling from self-pollination of ‘Strawberry Festival’ . The prediction would therefore be that ‘LF9’ would be negative for the marker and γ-D phenotype. This was confirmed by volatile and marker analysis.
SSR marker development
Fruit flavor and aroma profiles are shaped by a mixture of sugars, acids and volatile organic compounds. Individual volatile components have been demonstrated to play key roles in consumer liking, as noted in tomato , and strawberry . Several reports have detailed the importance of key compounds to aroma production and the genetic loci or genes that control them [6, 16]. One component contributing to flavor in strawberry fruits is γ-decalactone (γ-D). This compound represents the recognized aromas in peaches and apricots, and is of interest to industry as a flavoring agent. The synthesis of γ-D is not well-understood in plants. However, several microorganisms can generate γ-D from fatty-acid substrates  and are used as bioreactors to produce this flavor compound.
To identify transcripts related to γ-D production, two parents varying for the volatile were crossed. ‘Elyana’ is a large, firm berry, and bred for production in Florida . ‘Mara des Bois’ is smaller and soft, and not used in wide commercial cultivation. Fruits (red, ripe fruits of similar size, age, and environment) were assayed for the volatile and also for transcripts associated with receptacle/achene ripening. The RNAseq data from producers and non-producers were computationally bulked to identify sequences common to each pool (Additional file 1: Figure S1). Strawberry fruit transcriptomes have been examined previously [32–34], and show changes in a substantial number of transcripts throughout the ripening process. The commercial strawberry is octoploid and highly heterozygous . Crossing two plants produces a myriad of progeny phenotypes due to contributions from homoeologous genes. The segregation within subgenomes likely provides additional “noise” that allows for transcripts relevant to the trait to separate clearly from others, focusing the candidate set. Transcriptomes for each plant were considered independently, so the transcriptomes of γ-D producers and non-producers could be compared in silico.
The test began with detection of γ-D. The γ-D producers show fluctuations in volatile quantity throughout the growing season (Figure 1). The genotypes with the highest concentrations were estimated to possess between 0.028 to 0.035 mM γ-D (data not shown). Gamma decalactone accumulation was not ever detected in the ‘Mara des Bois’ parent. The range of differences observed in parental fruits and progeny presented an ideal situation to assay global gene expression coinciding with volatile production.
When progeny were sorted by presence/absence of γ-D, a small subset of transcripts was significantly different between phenotypic groups. Pairwise comparisons of RNAseq data around a fruit phenotype resolved to an especially strong single candidate, a transcript encoding an omega-6-fatty acid desaturase (FaFAD1). This gene is located near a QTL previously reported to explain about 90% of the γ-D phenotype in strawberry . Another gene recently found in peach (PpFAD_1B-6, 71% identity and 82% similar to FaFAD1 at the protein level) was correlated with γ-D in ripening fruit, and demonstrated ω-6 oleate desaturase activity when overexpressed in yeast . Fatty acid desaturases catalyze the formation of double bonds into fatty acyl chains. The report from Sanchez , and the biochemistry inferred from protein sequence, each suggest a role for this gene in lactone production, but the precise biochemical steps have not been demonstrated. Future work will examine the role of this transcript in transgenic lines.
FaFAD1 transcript abundance correlated well with the presence of γ-D. The transcript accumulated with ripening, and only in certain environmental conditions (Figure 5). The ability to produce γ-D segregated as a single dominant locus, consistent with previous reports . A PCR-based survey of materials from the population indicated that the FaFAD1 genomic sequence could not be amplified from γ-D non-producers, suggesting gene deletion or radical alteration affecting PCR amplification. Eight total combinations using ten distinctly positioned primers were used to amplify regions upstream of and within the FaFAD1 gene (Shown in Figure 7). In each case, ‘Elyana’ genomic template amplified the expected fragments, but ‘Mara des Bois’ produced no amplicons. This is consistent with the idea that a deletion is responsible for the absence of FaFAD1-related sequence in the ‘Mara des Bois’ genome, and this would explain the dominant, single gene effect for the γ-D phenotype. This finding allowed for development of a PCR-based molecular marker to identify genotypes with the potential to produce γ-D. PCR primers corresponding to the 5′ sequence of FaFAD1 were used to amplify the region shown in Figure 7 (primers A and B). The presence of the PCR product co-segregated precisely with the ability to produce the γ-D, in the parental cross, in progeny, and in backcross populations (Figure 8A).
The nature of the deletion is further exemplified using simple sequence repeat (SSR) markers, tools frequently used to fingerprint strawberry germplasm [37–39]. The availability of the diploid strawberry, F. vesca, genome  makes it possible to position the microsatellite sequences within the structural context of the strawberry genome. One SSR sequence is located 11 kb adjacent to the FaFAD1 gene, and polymorphisms would be predicted to segregate with the candidate gene. The results in Figure 8 show that the presence of allele 205 is a reliable predictor of the ability to produce γ-D, at least in the subset of the population evaluated. These data are important because they provide two independent, PCR-based tests can differentiate γ-D producers and non-producers. While not tested outside of the parental genotypes and only in several progeny, this second primer set offers another potential molecular marker that may be used to verify results from the FaFAD1 sequence. The findings also indicate that the SSR is present in the ‘Elyana’ parent and not the other, suggesting that the missing FaFAD1 gene may be part of a larger deletion.
The potential utility of this molecular marker was demonstrated when it was applied to a set of unrelated germplasm that was also varying for presence/absence of γ-D. In this case, the PCR product could only be amplified in genotypes demonstrated to produce γ-D. The evaluation was extended to wild germplasm where fruit could be obtained. Even in these distant accessions, the PCR product was only amplified in lines where γ-D was detectable. The putative marker not only works within a population, but likely will work across commercial breeding populations and wild octoploid accessions. Future studies will track the origin of the gene in diploid germplasm in an attempt to reconstruct its origins, as was done with alleles to trace linalool-producing variants of FaNES1[6, 40].
There are notable limitations to the approach used in this study. The visual correlation between the γ-D phenotype and qRT-PCR results is shown in Figures 4 and 5. This quantitative trend, however, is not consistent in the RNA-seq RPKM data (data not shown). γ-D producers, irrespective of volatile amount produced, could not be distinguished quantitatively, though producers and non-producers were still easily discernible. This suggests that while the RNA-seq approach worked well for the qualitative γ-D phenotype, uncovering quantitative γ-D effects would be very challenging. This further underscores the need to incorporate independent methods (like qRT-PCR) for examining candidate genes in the early stages of discovery.
The development of two independent molecular markers that segregate with the phenotype in a wide range of germplam will permit improved parental selection and rapid screening of progeny possessing the ability to produce γ-D. A simple and robust assay can rapidly eliminate individual plants from breeding populations, saving time, fuel, space and other resources. Most importantly, the marker allows more rapid integration of a fruity volatile into advanced selections.
The results of this study demonstrate that gene candidates for strawberry fruit traits may be identified by integrating careful phenotyping and transcriptomic analysis with genetics. This approach rapidly reduced the complexity of the octoploid transcriptome down to a single candidate gene. The identified gene, FaFAD1, is functionally equivalent to genes involved in γ-D synthesis in fungi and potentially in peach . Gene identification led to the development of a gene-based marker, enabling selection for γ-D producers at the seedling stage. Moving rapidly from candidate to marker has the potential to increase breeding efficiency and reduce the downstream costs associated with maintaining plants lacking a favorable trait. Looking forward, γ-D is only one of many volatiles that can be analyzed with this approach. The same exact dataset may now be re-sorted to identify candidates for other volatiles. The bulk sorting of polyploid transcriptomes is a rapid and cost effective means to identify a testable suite of genes contributing to a given trait.
Parental lines were Fragaria x ananassa ‘Elyana’ female (γ-D producer) and ‘Mara des Bois’ male (γ-D non-producer). F1 progeny from the ‘Elyana’ x ‘Mara des Bois’ cross were clonally multiplied in a Colorado summer nursery in 2010. Two runner tips of each of approximately 130 seedlings were harvested from the nursery and grown at the Gulf Coast Research and Education Center (GCREC) in Wimauma, Florida during the 2010–11 winter season. These plants were evaluated in the field for horticultural qualities such as yield and fruit size and for volatile diversity using GC/MS. A subset of this population was selected based on flavor volatile diversity and superior plant performance. These selections were further clonally multiplied and grown again in the 2011/12 and 2012/13 seasons.
BC1 populations were made by crossing one γ-D positive progeny (progeny 98, male) to each parent (female) in 2011. Backcross progeny were increased in the summer nursery as described above. During the 2012/13 season, the progeny were transplanted to the same commercial strawberry growing system in Florida and analyzed for volatiles and other horticultural traits. Other cultivars for marker validation were maintained at GCREC. All other genotypes used in this study were obtained from the Germplasm Resource and Information Network (GRIN) repository in Corvallis, OR, and maintained in the field at GCREC.
Fruit volatile analysis
All fruiting progeny from the ‘Elyana’ x ‘Mara des Bois’ cross were analyzed for volatiles by GC/MS during the 2010/11 season. Harvest dates were January 20, February 11, February 25, and March 18, 2011. Backcross populations were harvested during the 2012/13 season and were harvested on January 13, January 31, and March 7, 2013. Data from the 2010/11 harvests were used to select genotypes segregating for volatiles of interest. Fruit processing for volatile analysis was conducted as follows. A representative ~25 g sample was collected from five to six fully ripe, clean, and normal-shaped berries from each genotype. The calyx from each berry was removed, and berries were blended with an equal weight of saturated NaCl solution (~35% NaCl in molecular biology grade water). The volatile 3-hexanone was added as an internal standard to a final concentration of 1 ppm prior to blending. Five ml aliquots were dispensed into 20 ml glass vials and sealed with magnetic crimp caps (Gerstel, Baltimore, MD, USA). Two technical replicates were processed for each genotype at each harvest. Samples were frozen at -20°C until analysis by Gas Chromatography/Mass Spectroscopy.
Gas chromatography/mass spectroscopy (GC/MS)
A 2 cm tri-phase SPME fiber (50/30 μm DVB/Carboxen/PDMS, Supelco, Bellefonte, PA, USA) was used to collect and concentrate volatiles prior to running on an Agilent 6890 GC coupled with a 5973 N MS detector (Agilent Technologies, Palo Alto, CA, USA). Before analysis, samples were held at 4°C in a Peltier cooling tray attached to a MPS2 autosampler (Gerstel). All other volatile sampling and analysis methods were as previously described . The volatile 3-hexanone was used as an internal control. An authentic γ-D standard (Sigma Aldrich, St. Louis, MO, USA) was run under the same chromatographic conditions as berry samples for verification of volatile identify. The area of each γ-D peak was normalized to the peak area of the internal standard, and normalized peak areas were compared between samples.
Estimation of γ-decalactone in strawberry fruit
A standard curve was made in half ripe fruit of F. x ananassa ‘Winterstar’ fruit puree to estimate the amount of γ-D. This approach mimics volatile detection in ripe fruit. Half red fruit were processed with saturated NaCl and 3-hexanone as described in Fruit Volatile Analysis above. Pure γ-D (Sigma; St. Louis, MO) was added to puree aliquots from 0.005 to 0.3 mM concentrations, vortexed thoroughly, and then analyzed by GCMS as with all other samples. Baseline γ-D was identified in puree sampled without pure volatile added.
Combined volatile and RNA-seq tissue
Fruit for volatile and RNA-seq analyses were harvested on December 15, 2011. Fourteen progeny and both parents (‘Elyana’ and ‘Mara des Bois’) were selected to maximize representation of segregating volatiles. Eight to ten fully-ripe fruit were collected from each genotype, cleaned, the calyx was removed, and then split longitudinally. Half of each progeny’s sample was processed for volatile analysis as described above, and the other half was flash frozen in liquid nitrogen and stored at -80°C until RNA extraction. A blank was processed in between each GC/MS sample to minimize cross-sample contamination.
Frozen berries were crushed and then ground to a fine powder in a liquid nitrogen-cooled coffee grinder (KitchenAid Blade Coffee Grinder, St. Joseph, MI, USA). RNA was extracted using a modified method . Two grams of fruit powder was used per 5 ml extraction buffer.
RNA was treated with DNase I, RNase-free (Fermentas, Waltham, MA, USA) according to the manufacturer instructions and then cleaned using the Qiagen RNeasy Mini Kit (Qiagen, Valencia, CA, USA). RNA quality was checked on a Bioanalyzer prior to RNA-seq library construction. Each sample was individually barcoded during library construction. Two lanes with eight libraries each were run on an Illumina Genome Analyzer IIx (Illumina, San Diego, CA, USA).
Reads that passed quality checks were aligned to the F. vesca genome using either a custom script or SeqMan NGen (DNASTAR version 2.3, Lasergene, Madison, WI, USA). F. vesca Genemark Hybrid version 1.1 was used for gene annotations. Gene candidates were identified in QSeq (DNASTAR) by making pairwise comparisons between high and low γ-D genotypes. Each pairwise comparison excluded genes less than 2 to 4-fold abundance when comparing high versus low γ-D genotypes. The genes remaining after these comparisons became gene candidates and were analyzed further by qRT-PCR.
A separate analysis was made using CLC Genomic Workbench software v6.5.0 (CLC Bio, Cambridge, MA, USA). The alignment parameters were adjusted to allow for expected variations in the octoploid genome relative to the diploid one (Minimum length fraction = 0.6; Minimum similarity fraction = 0.5; Maximum number of hits for a read = 5). RPKM value was used to normalize the expression data sets among individuals in the population. The data from Figure 3A overexpression data represent transcripts with a RPKM > 10 in at least one of the cultivars. A transcript was considered overexpressed when it was present at least 5 times more frequently in one genotype than the other. The functional classification of the differentially expressed genes was performed using MapMan ontology. The heatmap used for parents comparison was designed using Mapman software (Mapman version 3.6.0RC1,). A dot represents the log2 of the RPKM ratio of a transcript between the ‘Mara des Bois’ and ‘Elyana’ parents.
cDNA templates for qRT-PCR were synthesized using the Impromtu II Reverse Transcriptase kit (Promega, Madison, WI, USA) according to the manufacturer’s protocol. The cDNA was diluted 1:10 prior to qRT-PCR analysis. All qRT-PCR reactions were run in 20 ul reactions using EvaGreen qPCR Mastermix-ROX (Applied Biological Materials Inc., Richmond, BC, Canada). Each reaction contained 10 ul 2x EvaGreen mastermix, 2 μl primer mix (2 uM each), 1 μl 1:10 diluted cDNA, and 7 μl DNase/RNase free water. All qRT PCR primers were designed using qRT primer design tools available online (idtdna.com), and designed to amplify fragments between 95 and 110 base pairs. Each primer-template combination was run with three technical replicates and three biological replicates. A conserved hypothetical protein (FaCHP1 was used as a housekeeping control (5′ TGCATATATCAAGCAACTTTACACTGA 3′ forward and 5′ ATAGCTGAGATGGATCTTCCTGTGA 3′ reverse). The qRT PCR was run on an Applied Biosystems StepOnePlus Real-Time PCR System using StepOne Software (v2.0) (Applied Biosystems, Foster City, CA, USA). The qRT-PCR data was analyzed using the comparative CT method (ΔΔCT) following the manufacturer’s direction.
Candidate genes from the RNA-seq results were validated by qRT-PCR against templates from multiple genotypes, developmental stages, and/or environments. Candidates were initially validated using cDNA templates from all 16 genotypes included in the RNA-seq dataset. Further, candidates were tested for induction during ripening in ‘Elyana’ (only ripe fruits have detectable γ-D), and two environments with low or high γ-D production. The qRT-PCR sequences for FaFAD1 (gene24414) were forward 5′ GTGCCCTTACTGATAACAAACG 3′ and reverse 5′ TCGCAACCAATCCCACTC 3′, for CYTp450 (gene29958) forward 5′ ACCCAAAGGTCTATCACATGAC 3′ and reverse 5′ TGAGCTTCAGTTCCTAACCAC 3′, and for FaFAD2 (gene22642) forward 5′ AACTGGTGTCTGGGTCATTG 3′ and reverse 5′ GAAAGGAGTGAAGGATCAGGC 3′.
Cloning full length transcript for FaFAD1
The full length transcript for gene24414 was cloned using primer sequences guided by the transcriptome data. Primers were designed to include attb sites for Gateway (Invitrogen) cloning into pDONR222. Forward primer 5′ AAAAAGCAGGCTGCATGGGAGCCGATACCAAGTTCGAAGAG 3′ and a poly T reverse primer 5′ AGAAAGCTGGGTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 3′ were initially used to obtain the full length transcript including 3′ UTR. A second reverse primer was designed from the cloned sequence up to and including the stop codon from the open reading frame (reverse 5′ AGAAAGCTGGGTGTTAGTTCCGGTACCAAAAAACACCTTTGGT 3′). A second round of PCR reconstructed full length attb sites using forward 5′ GGGGACAAGTTTGTACAAAAAAGCAGGCT 3′ and reverse 5′ GGGGACCACTTTGTACAAGAAAGCTGGGT 3′ primers. Standard PCR conditions were used with GoTaq polymerase (Promega) according to the manufacturer’s recommendations. The full length sequence, predicted protein translation, and physical map coordinates are provided in Additional file 2.
Designing a functional molecular marker
A molecular marker for γ-D production was designed using the F. vesca genome sequence . Primers were designed to amplify a 500 bp fragment from the 5′ end of gene24414 into the 5′ UTR: (forward) 5′ CGGGATTAATGGTTTTGTTGTTGACCGACC 3′ and (reverse) 5′ GTAGAAGAGAGAGACCAAGACGAG 3′. BFACT045 primers were previously shown to be linked to γ-D production and these primers were used as PCR controls forward 5′ CGACAAATGTAGTTGCTAGTCTTCTCA 3′ and reverse 5′ GAGGCAGAAGTGTTTTTCGTG 3′ . The BFACT045 primers did not produce alleles that segregated with the γ-D phenotype in our populations as tested by capillary electrophoresis (data not shown). All PCR reactions (12.5 μl total volume) were run using GoTaq Hot Start polymerase (Promega) according to the manufacturer’s instructions. All thermocycler conditions were as follows: 94°C 4 min, followed by 25 cycles of 94°C 30 s, 56°C 30 s, 72°C 30 s, and with a final extension for 10 min at 72°.
The putative marker for gene24414 was tested against a subset of 17 progeny from the ‘Elyana’ x ‘Mara des Bois’ cross including parents, and two backcross populations to test segregation with the γ-D phenotype. Progeny were included in this analysis if they had three or more volatile samplings during the season to ensure accurate phenotyping of γ-D non-producers. Backcross population ‘Elyana’ x progeny 98 produced 26 progeny suitable for analysis, and backcross population ‘Mara des Bois’ x progeny 98 had 22 progeny suitable for analysis. Other octoploids suitable for consideration as breeding parents and/or for this study were also tested for marker cosegregation with the γ-D phenotype. These genotypes included ‘Radiance’, ‘Deutsch Evern’, ‘Festival’, ‘LF9’ (a self-pollination of ‘Festival’), ‘Albion’, ‘Winterstar’™ (‘FL 05-107’), ‘Mieze Schindler’, ‘Sweet Charlie’, and ‘Winter Dawn’.
A selection of wild, octoploid genotypes were also tested for the presence of the gene24414 marker, but were only included in this study if a minimum of three volatile samplings were performed. These genotypes are listed here by their Germplasm Resources Information Network Plant Inventory (PI) number and include accessions 236579, 612323, 612495, 612498, and 612499.
Statistical marker analysis
Chi-square analysis was performed on marker and volatile data for all genotypes with at least three separate harvests during a season. Due to a strong environmental component, three harvests were necessary to provide the strongest evidence for accurately identifying a γ-D non-producer.
SSR marker design
A 50 kb region of the genome surrounding gene24414 was downloaded using the Strawberry Genome Browser at the Genome Database for Rosaceae (rosaceae.org). The web version of BatchPrimer3  was used to search for Simple Sequence Repeats (SSRs) near gene24414. Primers were designed to flank a dinucleotide repeat approximately 11 kb from gene24414. The primer pair (forward 5′ TGTAAAACGACGGCCAGTGAAGAAGATGACACTAGGGACGAGGAAG 3′ and reverse 5′ GTTCTATGTGAGAACATGGGAAGAAACATGAC 3′ with fluorescently labeled 5′ 6FAM-TGTAAAACGACGGCCAGT 3′) exhibited variation between ‘Elyana’ and ‘Mara des Bois’ and segregation in the fifteen progeny tested. These primers were used for allele detection during capillary electrophoresis as previously described .
Availability of supporting data
The RNAseq reads have been deposited into the NCBI Short Read Archive and are accessible under project SRP039356 (http://www.ncbi.nlm.nih.gov/sra/?term=SRP039356). Data represent all reads from parental lines (‘Mara des Bois’ and ‘Elyana’) as well as reads from individual F1 plants.
The authors thank Dr. Jeffrey Brecht (University of Florida) for the loan of a Chemstation software license, and Paul V. Stodghill (Robert W. Holley Center for Agriculture and Health, Agriculture Research Service, USDA, Ithaca, NY) for their assistance with the project. The authors also thank Shane Alan Evans and Christy Evans for their assistance in processing fruit samples. This work was performed with funding from the University of Florida Plant Molecular Breeding Initiative, with support from the Florida Strawberry Research and Education Foundation.
- Latrasse A: Fruits. Volatile compounds in foods and beverages. Edited by: Maarse H. 1991, New York, NY: Marcel-Decker, 329-387.Google Scholar
- Olbricht K, Grafe C, Weiss K, Ulrich D: Inheritance of aroma compounds in a model population of Fragaria × ananassa Duch. Plant Breeding. 2008, 127 (1): 87-93.Google Scholar
- Schieberle P, Hofmann T: Evaluation of the character impact odorants in fresh strawberry juice by quantitative measurements and sensory studies on model mixtures. J Agric Food Chem. 1997, 45 (1): 227-232. 10.1021/jf960366o.View ArticleGoogle Scholar
- Carrasco B, Hancock J, Beaudry R, Retamales J: Chemical composition and inheritance patterns of aroma in Fragaria x ananassa and Fragaria virginiana progenies. Hort Sci. 2005, 40 (6): 1649-1650.Google Scholar
- Bood KG, Zabetakis I: The biosynthesis of strawberry flavor (II): biosynthetic and molecular biology studies. J Food Sci. 2002, 67 (1): 2-8. 10.1111/j.1365-2621.2002.tb11349.x.View ArticleGoogle Scholar
- Aharoni A, Giri AP, Verstappen FW, Bertea CM, Sevenier R, Sun Z, Jongsma MA, Schwab W, Bouwmeester HJ: Gain and loss of fruit flavor compounds produced by wild and cultivated strawberry species. Plant Cell. 2004, 16 (11): 3110-3131. 10.1105/tpc.104.023895.PubMed CentralPubMedView ArticleGoogle Scholar
- Larsen M, Poll L: Odour thresholds of some important aroma compounds in strawberries. Z Lebensm Unters Forsch. 1992, 195 (2): 120-123. 10.1007/BF01201770.View ArticleGoogle Scholar
- Pérez AG, Sanz C, Olías R, Olías JM: Lipoxygenase and hydroperoxide lyase activities in ripening strawberry fruits. J Agric Food Chem. 1998, 47 (1): 249-253.View ArticleGoogle Scholar
- Ulrich D, Komes D, Olbricht K, Hoberg E: Diversity of aroma patterns in wild and cultivated Fragaria accessions. Genet Resour Crop Evol. 2007, 54 (6): 1185-1196. 10.1007/s10722-006-9009-4.View ArticleGoogle Scholar
- Watson R, Wright CJ, McBurney T, Taylor AJ, Linforth RST: Influence of harvest date and light integral on the development of strawberry flavour compounds. J Exp Bot. 2002, 53 (377): 2121-2129. 10.1093/jxb/erf088.PubMedView ArticleGoogle Scholar
- Pelayo-Zaldivar C, Ebeler SE, Kader AA: Cultivar and harvest date effects on flavor and other quality attributes of California strawberries. J Food Quality. 2005, 28 (2005): 78-97.View ArticleGoogle Scholar
- Olbricht K, Ulrich D, Weiss K, Grafe C: Variation in the amounts of selected volatiles in a model population of Fragaria x ananassa duch. As influenced by harvest year. J Agric Food Chem. 2011, 59 (3): 944-952. 10.1021/jf1034948.PubMedView ArticleGoogle Scholar
- Forney C, Kalt W, Jordan M: The composition of strawberry aroma is influenced by cultivar, maturity, and storage. Hort Sci. 2000, 35 (6): 1022-1026.Google Scholar
- Larsen M, Poll L, Olsen CE: Evaluation of the aroma composition of some strawberry (Fragaria × ananassa Duch) cultivars by use of odour threshold values. Z Lebensm Unters Forsch. 1992, 195 (6): 536-539. 10.1007/BF01204558.View ArticleGoogle Scholar
- Jouquand C, Chandler C, Plotto A, Goodner K: A sensory and chemical analysis of fresh strawberries over harvest dates and seasons reveals factors that affect eating quality. J Am Soc Hortic Sci. 2008, 133 (6): 859-867.Google Scholar
- Zorrilla-Fontanesi Y, Rambla JLL, Cabeza A, Medina JJ, Sánchez-Sevilla JF, Valpuesta V, Botella MA, Granell A, Amaya I: Genetic analysis of strawberry fruit aroma and identification of O-methyltransferase FaOMT as the locus controlling natural variation in mesifurane content. Plant Physiol. 2012, 159 (2): 851-870. 10.1104/pp.111.188318.PubMed CentralPubMedView ArticleGoogle Scholar
- Akhunov E, Nicolet C, Dvorak J: Single nucleotide polymorphism genotyping in polyploid wheat with the Illumina GoldenGate assay. Theor Appl Genet. 2009, 119 (3): 507-517. 10.1007/s00122-009-1059-5.PubMed CentralPubMedView ArticleGoogle Scholar
- Oliver RE, Lazo GR, Lutz JD, Rubenfield MJ, Tinker NA, Anderson JM, Morehead NHW, Adhikary D, Jellen EN, Maughan PJ: Model SNP development for complex genomes based on hexaploid oat using high-throughput 454 sequencing technology. BMC Genomics. 2011, 12 (1): 77-10.1186/1471-2164-12-77.PubMed CentralPubMedView ArticleGoogle Scholar
- Trick M, Long Y, Meng J, Bancroft I: Single nucleotide polymorphism (SNP) discovery in the polyploid Brassica napus using Solexa transcriptome sequencing. Plant Biotechnol J. 2009, 7 (4): 334-346. 10.1111/j.1467-7652.2008.00396.x.PubMedView ArticleGoogle Scholar
- Zenoni S, Ferrarini A, Giacomelli E, Xumerle L, Fasoli M, Malerba G, Bellin D, Pezzotti M, Delledonne M: Characterization of transcriptional complexity during berry development in Vitis vinifera using RNA-Seq. Plant Physiol. 2010, 152 (4): 1787-1795. 10.1104/pp.109.149716.PubMed CentralPubMedView ArticleGoogle Scholar
- Sánchez G, Venegas-Calerón M, Salas JJ, Monforte A, Badenes ML, Granell A: An integrative “omics” approach identifies new candidate genes to impact aroma volatiles in peach fruit. BMC Genomics. 2013, 14 (1): 343-10.1186/1471-2164-14-343.PubMed CentralPubMedView ArticleGoogle Scholar
- Schrader J, Etschmann M, Sell D, Hilmer J-M, Rabenhorst J: Applied biocatalysis for the synthesis of natural flavour compounds–current industrial processes and future prospects. Biotechnol Lett. 2004, 26 (6): 463-472.PubMedView ArticleGoogle Scholar
- Hagedorn S, Kaphammer B: Microbial biocatalysis in the generation of flavor and fragrance chemicals. Annu Rev Microbiol. 1994, 48 (1): 773-800. 10.1146/annurev.mi.48.100194.004013.PubMedView ArticleGoogle Scholar
- Romero-Guido C, Belo I, Ta TMN, Cao-Hoang L, Alchihab M, Gomes N, Thonart P, Teixeira JA, Destain J, Waché Y: Biochemistry of lactone formation in yeast and fungi and its utilisation for the production of flavour and fragrance compounds. Appl Microbiol Biotechnol. 2011, 89 (3): 535-547. 10.1007/s00253-010-2945-0.PubMedView ArticleGoogle Scholar
- Schöttler M, Boland W: Biosynthesis of dodecano-4-lactone in ripening fruits: Crucial role of an epoxide-hydrolase in enantioselective generation of aroma components of the nectarine (Prunus persica var. nucipersica) and the strawberry (Fragaria ananassa). Helv Chim Acta. 1996, 79 (5): 1488-1496. 10.1002/hlca.19960790521.View ArticleGoogle Scholar
- Shulaev V, Sargent DJ, Crowhurst RN, Mockler TC, Folkerts O, Delcher AL, Jaiswal P, Mockaitis K, Liston A, Mane SP, Burns P, Davis TM, Slovin JP, Bassil N, Hellens RP, Evans C, Harkins T, Kodira C, Desany B, Crasta OR, Jensen RV, Allan AC, Michael TP, Setubal JC, Celton JM, Rees DJ, Williams KP, Holt SH, Ruiz Rojas JJ, Chatterjee M, et al: The genome of woodland strawberry (Fragaria vesca). Nat Genet. 2011, 43 (2): 109-116. 10.1038/ng.740.PubMed CentralPubMedView ArticleGoogle Scholar
- Folta KM, Dhingra A, Howard L, Stewart PJ, Chandler CK: Characterization of LF9, an octoploid strawberry genotype selected for rapid regeneration and transformation. Planta. 2006, 224 (5): 1058-1067. 10.1007/s00425-006-0278-0.PubMedView ArticleGoogle Scholar
- Tieman D, Bliss P, McIntyre L, Blandon-Ubeda A, Bies D, Odabasi A, Rodriguez G, van der Knaap E, Taylor M, Goulet C, Mageroy MH, Snyder DJ, Colquhoun TA, Moskowitz H, Clark DG, Sims C, Bartoshuk L, Klee HJ: The chemical interactions underlying tomato flavor preferences. Curr Biol. 2012, 22 (11): 1035-1039. 10.1016/j.cub.2012.04.016.PubMedView ArticleGoogle Scholar
- Schwieterman ML, Colquhoun TA, Bartoshuk L, Jaworski EA, Gilbert JL, Tieman DM, Odabasi AZ, Moskowitz HR, Folta KM, Klee HJ, Sims CA, Whitaker VM, Clark DG: Strawberry flavor: diverse chemical recipes, a seasonal influence and their effect on sensory perception. PLoS One. 2014, 9 (2): e88446-10.1371/journal.pone.0088446.PubMed CentralPubMedView ArticleGoogle Scholar
- Blin-Perrin C, Molle D, Dufosse L, Le-Quere J, Viel C, Mauvais G, Feron G: Metabolism of ricinoleic acid into gamma-decalactone: beta-oxidation and long chain acyl intermediates of ricinoleic acid in the genus Sporidiobolus sp. FEMS Microbiol Lett. 2000, 188 (1): 69-74.PubMedGoogle Scholar
- Chandler CK, Santos BM, Peres NA, Jouquand C, Plotto A: Florida Elyana’ strawberry. Hort Sci. 2009, 44 (6): 1775-1776.Google Scholar
- Aharoni A, Keizer LC, Bouwmeester HJ, Sun Z, Alvarez-Huerta M, Verhoeven HA, Blaas J, van Houwelingen AM, de Vos RC, van der Voet H, Jansen RC, Guis M, Mol J, Davis RW, Schena M, van Tunen AJ, O’Connell AP: Identification of the SAAT gene involved in strawberry flavor biogenesis by use of DNA microarrays. Plant Cell. 2000, 12 (5): 647-662.PubMed CentralPubMedView ArticleGoogle Scholar
- Bombarely A, Merchante C, Csukasi F, Cruz-Rus E, Caballero JL, Medina-Escobar N, Blanco-Portales R, Botella MA, Muñoz-Blanco J, Sánchez-Sevilla JF, Valpuesta V: Generation and analysis of ESTs from strawberry (Fragaria x ananassa) fruits and evaluation of their utility in genetic and molecular studies. BMC Genomics. 2010, 11: 503-10.1186/1471-2164-11-503.PubMed CentralPubMedView ArticleGoogle Scholar
- Kang C, Darwish O, Geretz A, Shahan R, Alkharouf N, Liu Z: Genome-scale transcriptomic insights into early-stage fruit development in woodland strawberry Fragaria vesca. Plant Cell Online. 2013, 25 (6): 1960-1978. 10.1105/tpc.113.111732.View ArticleGoogle Scholar
- Ashley MV, Wilk JA, Styan SM, Craft KJ, Jones KL, Feldheim KA, Lewers KS, Ashman TL: High variability and disomic segregation of microsatellites in the octoploid Fragaria virginiana Mill. (Rosaceae). Theor Appl Genet. 2003, 107 (7): 1201-1207. 10.1007/s00122-003-1370-5.PubMedView ArticleGoogle Scholar
- Sánchez G, Besada C, Badenes ML, Monforte AJ, Granell A: A non-targeted approach unravels the volatile network in peach fruit. PLoS One. 2012, 7 (6): e38992-10.1371/journal.pone.0038992.PubMed CentralPubMedView ArticleGoogle Scholar
- Monfort A, Vilanova S, Davis T, Arús P: A new set of polymorphic simple sequence repeat (SSR) markers from a wild strawberry (Fragaria vesca) are transferable to other diploid Fragaria species and to Fragaria × ananassa. Mol Ecol Notes. 2006, 6 (1): 197-200. 10.1111/j.1471-8286.2005.01191.x.View ArticleGoogle Scholar
- Brunings A, Moyer C, Peres N, Folta KM: Implementation of simple sequence repeat markers to genotype Florida strawberry varieties. Euphytica. 2010, 173 (1): 63-75. 10.1007/s10681-009-0112-4.View ArticleGoogle Scholar
- Chambers A, Carle S, Njuguna W, Chamala S, Bassil N, Whitaker VM, Barbazuk WB, Folta KM: A genome-enabled, high-throughput, and multiplexed fingerprinting platform for strawberry (Fragaria L.). Mol Breed. 2013, 31 (3): 1-15.View ArticleGoogle Scholar
- Chambers A, Whitaker VM, Gibbs B, Plotto A, Folta KM: Detection of the linalool‒producing NES1 variant across diverse strawberry (Fragaria spp.) accessions. Plant Breed. 2012, 131 (3): 437-443. 10.1111/j.1439-0523.2012.01959.x.View ArticleGoogle Scholar
- Chang S, Puryear J, Cairney J: A simple and efficient method for isolating RNA from pine trees. Plant Mol Biol Report. 1993, 11 (2): 113-116. 10.1007/BF02670468.View ArticleGoogle Scholar
- Clancy MA, Rosli HG, Chamala S, Barbazuk WB, Civello PM, Folta KM: Validation of reference transcripts in strawberry (Fragaria spp.). Mol Genet Genomics. 2013, 288 (12): 671-681. 10.1007/s00438-013-0780-6.PubMedView ArticleGoogle Scholar
- Rousseau-Gueutin M, Lerceteau-Köhler E, Barrot L, Sargent DJ, Monfort A, Simpson D, Arus P, Guérin G, Denoyes-Rothan B: Comparative genetic mapping between octoploid and diploid Fragaria species reveals a high level of colinearity between their genomes and the essentially disomic behavior of the cultivated octoploid strawberry. Genetics. 2008, 179 (4): 2045-2060. 10.1534/genetics.107.083840.PubMed CentralPubMedView ArticleGoogle Scholar
- Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol. 2000, 132 (3): 365-386.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.