Deciphering gamma-decalactone biosynthesis in strawberry fruit using a combination of genetic mapping, RNA-Seq and eQTL analyses
© Sánchez-Sevilla et al.; licensee BioMed Central Ltd. 2014
Received: 14 November 2013
Accepted: 17 March 2014
Published: 17 April 2014
Understanding the basis for volatile organic compound (VOC) biosynthesis and regulation is of great importance for the genetic improvement of fruit flavor. Lactones constitute an essential group of fatty acid-derived VOCs conferring peach-like aroma to a number of fruits including peach, plum, pineapple and strawberry. Early studies on lactone biosynthesis suggest that several enzymatic pathways could be responsible for the diversity of lactones, but detailed information on them remained elusive. In this study, we have integrated genetic mapping and genome-wide transcriptome analysis to investigate the molecular basis of natural variation in γ-decalactone content in strawberry fruit.
As a result, the fatty acid desaturase FaFAD1 was identified as the gene underlying the locus at LGIII-2 that controls γ-decalactone production in ripening fruit. The FaFAD1 gene is specifically expressed in ripe fruits and its expression fully correlates with the presence of γ-decalactone in all 95 individuals of the mapping population. In addition, we show that the level of expression of FaFAH1, with similarity to cytochrome p450 hydroxylases, significantly correlates with the content of γ-decalactone in the mapping population. The analysis of expression quantitative trait loci (eQTL) suggests that the product of this gene also has a regulatory role in the biosynthetic pathway of lactones.
Altogether, this study provides mechanistic information on how the production of γ-decalactone is naturally controlled in strawberry, and proposes enzymatic activities necessary for the formation of this VOC in plants.
Keywords: Aroma, Crop improvement, Desaturase, Flavor, Hydroxylase, Lactone, eQTL
The flavor and aroma of strawberries (Fragaria × ananassa) arise from a specific combination of sugars, acids and volatile organic compounds (VOCs) that varies widely among different cultivars and Fragaria species. More than 360 VOCs have been detected in strawberry, including esters, aldehydes, ketones, alcohols, terpenes, furanones, and sulfur compounds [2–6]. Lactones constitute a group of fatty acid-derived flavor molecules, which have γ-(4-) or δ-(5-)-lactone structures, and have been isolated from bacterial, plant and animal sources [7, 8]. Fruits are considered a particularly rich source of lactones, which confer a peach-like aroma and flavor that attracts feeders for seed dispersal [9, 10]. During strawberry maturation, the levels of compounds defined as green-volatiles decrease whereas levels of flavor compounds characteristic of ripe fruits, including esters and lactones, increase in parallel to other ripening-regulated processes such as anthocyanin accumulation. Up to 10 different lactones have been identified in strawberry [1, 3, 11] and, among them, γ-decalactone is the most abundant, reaching maximum levels in fully red ripe fruits [4, 12].
Lactones containing 8–12 carbon atoms are very potent flavor constituents in a variety of fruits such as strawberry, pineapple and peach. Biosynthetic studies indicate that several pathways originating from β-oxidation of unsaturated fatty acids are responsible for the structural diversity of lactones [9, 10, 13]. All lactones originate from their corresponding 4- or 5-hydroxy carboxylic acids, although the precise mechanism by which these substrates are produced remains elusive. However, four different mechanisms have been proposed, suggesting that the oxygen atom could be introduced by either (1) reduction of oxo acids by NAD-linked reductases, (2) hydration of unsaturated fatty acids, (3) epoxidation and hydrolysis of unsaturated fatty acids, or (4) reduction of hydroperoxides [9, 14]. To the best of our knowledge, the enzymes specifically involved in the formation of lactones have not yet been reported. However, candidate enzymatic activities, such as acyl-CoA dehydrogenase, which is the first enzyme in fatty acid β-oxidation, have been proposed to be important for lactone production. Epoxide hydrolases have been associated with γ-dodecalactone biosynthesis in peach, implying that the synthesis of this lactone could proceed through epoxidation of unsaturated fatty acids. Alternatively, the hydroxylation of unsaturated fatty acids could involve desaturases and cytochromes P450 (CYP) or other hydroxylases not related to CYPs.
The concentration of lactones in peach is controlled by multiple loci with quantitative effects, quantitative trait loci (QTL), hampering the identification of the genetic determinants controlling their biosynthesis or regulation . In contrast, the content of γ-decalactone has been shown to be controlled by one dominant locus in strawberry and, consequently, a number of strawberry cultivars lacking γ-decalactone have been reported [5, 6, 17]. We have previously reported that γ-decalactone is produced in the parental line ‘1392’ but not in ‘232’ of a strawberry mapping population and that the segregation of their F1 progeny matched a Mendelian 1:1 ratio with the locus controlling this trait mapped to the bottom arm of LG III-2 . In this context, this population represents a valuable tool to identify the gene(s) responsible for the natural variation of this VOC in strawberry, and to provide novel information about its biosynthesis in plants.
Although cultivated strawberry is an octoploid (2n = 8× = 56), and as such, four subgenomes are present in its complex genome, most loci show a disomic segregation. Furthermore, its genome has a high level of conservation with the model species Fragaria vesca (2n = 2× = 14), including an almost complete synteny and high colinearity [18–22]. Thus, the available genome sequence of the diploid F. vesca can be used as a reference for genomic and genetic studies within the genus.
RNA-seq is replacing other methods of quantifying transcript expression, including microarray platforms, as it overcomes some of their limitations, such as detection of only those transcripts that are represented on the microarray and a low dynamic range (limited upper and lower limits of detection), and thus provides more accurate quantification of differential transcript expression. A clear advantage of RNA-seq is the detection of novel non-annotated transcripts and, most relevant for highly heterozygous plants and polyploids such as F. × ananassa, the detection of the different alleles and homoeologous genes within their genomes [24, 25]. In this report, we have combined genome-wide RNA-seq analysis with a bulk segregant approach to identify a gene controlling γ-decalactone content in strawberry. Additional candidate genes of the biosynthetic pathway of lactones are also reported based on this genome-wide analysis. Altogether, this study provides information on how the content of γ-decalactone is naturally controlled in strawberry fruit and proposes enzymatic activities necessary for the formation of this VOC in plants.
The ‘232’ × ‘1392’ F1 mapping population, comprising 95 progeny lines, was used in this study. This population is derived from the cross between selection lines ‘232’ and ‘1392’ and has been described in detail previously. ‘232’ is a very productive strawberry (Fragaria × ananassa) line, whereas ‘1392’ has firmer and tastier fruits [6, 19]. The mapping population was grown in the strawberry-producing area of Huelva (Spain) under commercial conditions during the 2011/2012 season. Six plants of each line were vegetatively propagated and grown. Ripe fruits (10–15) were collected on the same day from the six plants of each line, divided into three biological replicates and independently ground in liquid nitrogen. Samples were stored at −80 °C until further analysis.
RNA isolation and RNA-seq from pooled samples
Table 1. Relative concentration of γ-decalactone in fruits of selected progeny lines producing and not producing this compound. Columns: fruit samples with γ-decalactone; fruits without γ-decalactone.
For each of the 6 (2 bulks with 3 biological replicates) samples, one paired-end library with approximately 300 bp insert size was prepared using an in-house optimized Illumina protocol at the Centro Nacional de Análisis Genómico (CNAG) facilities. Libraries were sequenced on Illumina HiSeq2000 lanes using 2 × 100 bp reads. More than 30 million reads were generated for each sample. Primary analysis of the data included base calling and quality control, with an assurance that >80% of all bases passing filter had a quality value of at least 30.
Mapping RNA-seq reads to the reference genome and generation of read counts
Raw RNA-seq reads were processed to remove low-quality nucleotides and aligned to the Fragaria vesca reference genome (v1.1) and CDS (v1.0)  using the program TopHat v2.0.6 . Default parameters of TopHat were used, allowing 40 multiple alignments per read and a maximum of 2 mismatches when mapping reads to the reference. The mapping results were then used to identify “islands” of expression, which can be interpreted as potential exons. TopHat builds a database of potential splice junctions and confirms these by comparing the previously unmapped reads against the database of putative junctions.
The aligned read files were processed by Cufflinks v2.0.2 . Reads were assembled into transcripts, their abundance estimated, and tests for differential expression and regulation between the samples were performed. Cufflinks does not make use of existing gene annotations during assembly of transcripts, but rather assembles a minimum set of transcripts that best describe the reads in the dataset. This approach allows Cufflinks to identify alternative transcription and splicing that are not described by pre-existing gene models . The normalized RNA-seq fragment counts were used to measure the relative abundances of transcripts expressed as Fragments Per Kilobase of exon per Million fragments mapped (FPKM). Confidence intervals for FPKM estimates were calculated using a Bayesian inference method .
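The FPKM unit described above can be illustrated with a minimal sketch. This is only the naive textbook normalization; Cufflinks itself estimates abundances with a statistical model and effective transcript lengths, so its values will differ. The function name and the example numbers are hypothetical.

```python
def fpkm(fragments: int, exon_length_bp: int, total_mapped_fragments: int) -> float:
    """Naive Fragments Per Kilobase of exon per Million fragments mapped.

    fragments: fragments assigned to the transcript (hypothetical input)
    exon_length_bp: summed exon length of the transcript in base pairs
    total_mapped_fragments: all mapped fragments in the library
    """
    if exon_length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("length and library size must be positive")
    # Divide by length in kb (bp / 1e3) and by library size in millions (/ 1e6).
    return fragments * 1e9 / (exon_length_bp * total_mapped_fragments)


# Hypothetical example: 500 fragments on a 2 kb transcript in a 25 M fragment library.
example_fpkm = fpkm(500, 2000, 25_000_000)
```

Under this definition, doubling either the transcript length or the library size halves the FPKM for the same raw fragment count, which is what makes values comparable across genes and samples.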
Comparison to reference annotation and differential expression analysis
Once all short read sequences were assembled with Cufflinks, the output GTF files were sent to Cuffcompare along with a reference GTF annotation file downloaded from the Genome Database for Rosaceae (GDR) (Fragaria vesca Whole Genome v1.1 Assembly & Annotation, http://www.rosaceae.org/). This classified each transcript as known or novel. Cuffcompare produced a combined.gtf file, which was passed to Cuffdiff along with the original alignment (.SAM) files produced by TopHat to identify differentially expressed transcripts between the two pools. The Cuffdiff algorithm then re-estimated the abundance of transcripts listed in the GTF file using alignments from the SAM file, and concurrently tested for differential expression between the high γ-decalactone and the no γ-decalactone pools using a rigorous statistical analysis. The significance scores were corrected for multiple testing using the Benjamini-Hochberg correction. Expression testing is done at the level of transcripts, primary transcripts and genes. By tracking changes in the relative abundance of transcripts with a common transcription start site, Cuffdiff can also identify changes in splicing.
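The Benjamini-Hochberg correction mentioned above can be sketched in a few lines. This is a generic illustration of the procedure, not Cuffdiff's own code, and the p-values in the example are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values (q-values) in input order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values for four transcripts.
q_values = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```

A transcript is then called significant when its adjusted value falls below the chosen false discovery rate.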
Visualization of mapped reads
Mapping results were visualized using a local copy of the Integrative Genomics Viewer software available at http://www.broadinstitute.org/igv/. Views of individual genes were generated by uploading TopHat-generated files containing the sequence alignment data (.bam files) to the genome browser.
Functional analysis of gene lists using BLAST2GO
The BLAST2GO v 2.4 suite was used for functional annotation of sequences, data mining and gene set enrichment analysis . The functional clustering tool was used to look for functional enrichment for genes over- and under-expressed more than two-fold between the pools. GO enrichment was derived with Fisher’s exact test and a cutoff of false discovery rate < 0.05 using the F. vesca genome annotation as reference background. A unique list of gene symbols was uploaded via the web interface. Gene Ontology Biological Process was selected as the functional annotation category for this analysis.
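For a single GO term, the one-sided Fisher's exact test used for enrichment is equivalent to a hypergeometric tail probability. The sketch below shows that calculation under stated assumptions: the function and argument names are hypothetical, the counts in the example are invented, and BLAST2GO's internal implementation may differ in detail.

```python
from math import comb


def enrichment_p_value(k: int, n: int, K: int, N: int) -> float:
    """One-sided Fisher's exact test (over-representation) for one GO term.

    k: genes in the test list annotated with the term
    n: size of the test list
    K: genes in the background annotated with the term
    N: size of the background (here, the reference gene set)
    Returns P(X >= k) for X ~ Hypergeometric(N, K, n).
    """
    if not (0 <= k <= n <= N and 0 <= K <= N):
        raise ValueError("inconsistent counts")
    total = comb(N, n)
    # comb() returns 0 when the lower index exceeds the upper one,
    # so impossible tables contribute nothing to the sum.
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(n, K) + 1))
    return tail / total


# Hypothetical toy example: 2 of 3 listed genes carry a term held by 4 of 10 background genes.
p_toy = enrichment_p_value(2, 3, 4, 10)
```

The resulting p-values, one per GO term, would then be corrected for multiple testing before applying the false discovery rate cutoff of 0.05.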
De novo assembly of Fragaria × ananassa RNA-seq reads
Since the current F. vesca genome sequence and gene models are still drafts, some RNA-seq transcript sequences appeared truncated. Therefore, we carried out a de novo assembly of the reads corresponding to the high- and no-γ-decalactone pools using Trinity to obtain the full-length transcripts expressed in F. × ananassa. The transcript contigs most similar to the F. vesca candidate genes were identified by means of BLAST searches.
Multiple sequence alignment was carried out with CLUSTALW at the default settings. Phylogenetic analyses were conducted using the neighbor-joining algorithm and Poisson model in MEGA version 5 . Protein targeting predictions were done using WoLF PSORT analysis (http://wolfpsort.org) and transmembrane domain search with the TMpred program (http://www.ch.embnet.org/software/TMPRED_form.html).
Real time qRT-PCR analysis
Total RNA was extracted from strawberry tissues as described previously for the RNA-seq experiment. First-strand cDNA was synthesized from 1 μg of total RNA using the iScript cDNA Synthesis kit (Bio-Rad) according to the manufacturer’s instructions. Gene expression was analyzed by quantitative real time polymerase chain reaction (qRT-PCR) using the fluorescent intercalating dye SYBR Green I in an iQ5 real-time PCR detection system (Bio-Rad). Three biological replicates for each line and three independent syntheses of cDNA for each RNA sample were used for qRT-PCR. Relative quantification of the expression levels for the target genes was performed using the comparative Ct method. The glyceraldehyde-3-phosphate dehydrogenase gene (GAPDH) was used as the normalizing gene. Primers are described in Additional file 1: Table S1.
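The comparative Ct method referred to above is commonly written as 2^(-ΔΔCt). The following sketch shows that arithmetic, assuming roughly 100% amplification efficiency for both the target and the reference gene; the function name and the Ct values are hypothetical and are not taken from the study.

```python
def relative_expression(ct_target_sample: float, ct_ref_sample: float,
                        ct_target_calibrator: float, ct_ref_calibrator: float) -> float:
    """Fold change of a target gene in a sample relative to a calibrator (2^-ddCt)."""
    # Normalize the target to the reference gene (e.g. GAPDH) within each condition.
    delta_sample = ct_target_sample - ct_ref_sample
    delta_calibrator = ct_target_calibrator - ct_ref_calibrator
    # Each cycle of difference corresponds to a two-fold change in template.
    return 2.0 ** -(delta_sample - delta_calibrator)


# Hypothetical Ct values: target 22 vs 27 cycles, reference 18 cycles in both pools.
fold = relative_expression(22.0, 18.0, 27.0, 18.0)
```

With these invented values the target crosses the threshold five cycles earlier after normalization, which corresponds to a 32-fold higher relative expression.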
QTL and expression quantitative trait loci (eQTL) analysis
QTL analyses were performed using MapQTL 5 as previously described. The raw relative data were first analyzed by the nonparametric Kruskal-Wallis rank-sum test, using a stringent significance level of P = 0.005 as threshold. Second, the integrated genetic linkage map and transformed data sets for most traits were used to identify and locate QTLs using Interval Mapping. Significance LOD thresholds were estimated with a 1,000-permutation test for each trait and QTLs with LOD scores greater than the genome-wide threshold at 95% were declared significant.
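The idea behind a permutation-based genome-wide LOD threshold can be sketched as follows. This is not MapQTL's algorithm: it swaps interval mapping for simple single-marker regression, assumes two genotype classes per marker, and all function names and data are hypothetical.

```python
import math
import random


def marker_lod(genotypes, phenotypes):
    """LOD score of a single marker from a one-way comparison of genotype classes."""
    n = len(phenotypes)
    mean_all = sum(phenotypes) / n
    rss_null = sum((y - mean_all) ** 2 for y in phenotypes)
    rss_marker = 0.0
    for g in set(genotypes):
        group = [y for gi, y in zip(genotypes, phenotypes) if gi == g]
        mean_g = sum(group) / len(group)
        rss_marker += sum((y - mean_g) ** 2 for y in group)
    if rss_marker == 0.0:
        return math.inf if rss_null > 0.0 else 0.0
    return (n / 2.0) * math.log10(rss_null / rss_marker)


def permutation_threshold(markers, phenotypes, n_perm=1000, quantile=0.95, seed=0):
    """Genome-wide threshold: quantile of the maximum LOD over shuffled phenotypes."""
    rng = random.Random(seed)
    shuffled = list(phenotypes)
    maxima = []
    for _ in range(n_perm):
        # Shuffling breaks any genotype-phenotype link but keeps the marker structure.
        rng.shuffle(shuffled)
        maxima.append(max(marker_lod(m, shuffled) for m in markers))
    maxima.sort()
    index = min(len(maxima) - 1, math.ceil(quantile * len(maxima)) - 1)
    return maxima[index]
```

A QTL would be declared significant when its observed LOD exceeds the value returned by `permutation_threshold` for that trait; taking the maximum over all markers in each permutation is what makes the threshold genome-wide rather than per-marker.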
In order to identify the determinants of the variation in γ-decalactone content in strawberry fruit, we aimed to identify differentially expressed genes between pools of fruits from lines contrasting in γ-decalactone content in the ‘232’ × ‘1392’ population using RNA-seq. Differentially expressed genes would then be analyzed for their mapping position. Genes meeting both conditions, i.e., highly expressed in fruits of high γ-decalactone lines and located within the QTL interval, would be considered for further analysis. RNA was extracted from bulked pools of ripe fruits from 10 progeny lines with high γ-decalactone content and from 10 lines not producing the volatile (Table 1) and used in biological triplicate for Illumina RNA sequencing. Sequencing reads were aligned to the reference Fragaria vesca Whole Genome (v1.1) and annotation (CDS v1.0) [Genome Database for Rosaceae (GDR), http://www.rosaceae.org] using TopHat. Over 218 million 100-bp reads were generated and, after removal of adaptor sequences and low-quality reads, 211.6 million clean reads remained (97% of the raw data). Between 68.7% and 70% of reads were paired for each of the 6 samples and an average of 68.2% of filtered paired reads were further mapped to the F. vesca genome. Some key metrics that allowed the assessment of the quality of mapping reads to the reference genome were extracted from the TopHat output and log files, and are shown in Additional file 1: Table S2.
After mapping the RNA-seq reads to the reference genome, transcripts were assembled and their relative abundances calculated using Cufflinks. Genes with normalized reads lower than 0.1 fragments per kilobase of exon per million fragments (FPKM) were considered as not expressed. A total of 33,458 genes/transcripts from the two F. × ananassa pools were predicted based on the reference model, and 19,833 and 19,720 were expressed in the ripe fruits of the high γ-decalactone and the no γ-decalactone pools, respectively.
Differential gene expression (DGE) between the high γ-decalactone and the no γ-decalactone pools was calculated using the ratio of FPKM values of each gene in both pools. A total of 617 predicted genes were differentially expressed between the two pools and re-annotated using Blast2GO. Of these, 403 were up-regulated and 214 were down-regulated in the high-γ-decalactone pool (Additional file 3). The observed ratios (log2 fold change) of differential expression ranged from -5.161 to 3.489, with negative and positive values indicating up- and down-regulation in the high-γ-decalactone pool, respectively. Only one gene (gene24970-v1.0-hybrid), encoding a predicted protein with similarity to cinnamyl alcohol dehydrogenase, was not expressed at all in the no-γ-decalactone pool and expressed in the high-γ-decalactone pool, albeit with a relatively low value of expression (3.79 FPKM). Among the 617 differentially expressed transcripts, 577 corresponded to annotated genes in the F. vesca gene model v.1.0, while 40 matched unannotated genome regions. Among these, 28 corresponded to predicted genes from F. vesca recently annotated in the NCBI, while the remaining 12 transcripts have not yet been annotated. Some gene families appeared over-represented in the fruits with high γ-decalactone content, such as cinnamyl alcohol dehydrogenases, with 6 differentially expressed genes, glutathione S-transferases, with 7 up-regulated genes, and cytochrome P450, with 5 up-regulated members (Additional file 3).
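The sign convention of the log2 fold change described above (negative values meaning higher expression in the high-γ-decalactone pool) can be made explicit with a small sketch. The function name is hypothetical, and the handling of zero values is an assumption made for illustration: a gene silent in the no-γ-decalactone pool is returned as negative infinity.

```python
import math


def log2_fold_change(fpkm_no_pool: float, fpkm_high_pool: float) -> float:
    """log2(no pool / high pool); negative means up-regulated in the high pool."""
    if fpkm_no_pool < 0 or fpkm_high_pool < 0:
        raise ValueError("FPKM values cannot be negative")
    if fpkm_no_pool == 0 and fpkm_high_pool == 0:
        return 0.0            # assumption: no expression in either pool, no change
    if fpkm_no_pool == 0:
        return -math.inf      # expressed only in the high pool
    if fpkm_high_pool == 0:
        return math.inf       # expressed only in the no pool
    return math.log2(fpkm_no_pool / fpkm_high_pool)


# A 17-fold higher expression in the high pool gives a log2 fold change of about -4.1.
lfc_17_fold = log2_fold_change(1.0, 17.0)
```

The absolute value of the result is the figure usually quoted in the text, so a gene described as 17-fold up-regulated in the high pool appears with a negative sign under this convention.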
Functional annotation and enrichment analysis
In order to describe gene functions in a standard and controlled vocabulary, we used the Blast2GO suite. A total of 3,757 gene ontology (GO) terms were assigned to a total of 559 differentially expressed genes, while 58 did not match any terms. Sequence distribution (at level 2, filtered by a cut-off of 60 sequences) for biological processes, molecular functions and cellular component are summarized in Additional file 1: Figure S1. Within biological processes, the most abundant categories were metabolic process (380 sequences or 27%), cellular process (350 sequences; 25%) and response to stimulus (210 sequences; 15%). The most represented molecular function was catalytic activity (299 sequences; 49%) and a large proportion of sequences (177; 22%) were associated with membrane as cellular compartment.
To investigate the biological processes associated with differences in γ-decalactone content, a GO enrichment analysis was performed using Fisher’s exact test using the sets of up-regulated and down-regulated transcripts separately in comparison to those in the reference F. vesca gene model. A total of 51 biological processes were significantly enriched for the genes up-regulated in fruits with high γ-decalactone content (Additional file 1: Table S3). Most of these 51 common ontologies are ‘descendants’ of 5 higher hierarchical nodes in the GO tree: response to stimulus (GO:0050896), cellular aromatic compound metabolic process (GO:0006725), lipid homeostasis (GO:0055088), nitrogen compound metabolic process (GO:0006807) and organic substance metabolic process (GO:0071704). Within these biological processes, the most significantly over represented term was oxidation-reduction process (GO:0055114, 92 genes within this term). One possible interpretation is that enzymes catalyzing the addition or removal of electrons are needed in the biosynthesis of γ-decalactone in strawberry.
The number of biological processes significantly enriched within the down-regulated genes (up-regulated in the no-γ-decalactone pool) was higher, 101, and more diverse (Additional file 1: Table S4). The three most significantly over represented terms were regulation of biological quality (GO:0065008), response to biotic stimulus (GO:0009607) and ion transport (GO:0006811). Globally, these data suggest that lacking γ-decalactone in strawberry fruits is associated with a wide range of different biological processes. Particularly interesting is the number of genes up-regulated in the categories of response to stimulus and biotic stress in the absence of γ-decalactone.
Identification of FaFAD1 as the gene underlying the locus controlling γ-decalactone content in LG III-2
Table 2. List of the top 25 significantly up-regulated genes in the high γ-decalactone pool compared to the no γ-decalactone pool. Columns: FPKM in the high γ-decalactone pool (FPKM H γ-DEC), FPKM in the no γ-decalactone pool (FPKM N γ-DEC) and log2 fold change. Gene annotations surviving from the table, in order of appearance:
Cinnamyl alcohol dehydrogenase-like (-1.8E+308; -1.8E+308)
Aldo keto reductase
Microsomal delta-12 oleate desaturase
Isoflavone 2'-hydroxylase-like
Salicylic acid-binding protein 2-like
Auxin-binding protein abp19a-like
Probable glutathione s-transferase-like
Probable glutathione s-transferase-like
Pathogenesis-related protein 4
Inhibitor of trypsin and hageman factor
Sieve element-occluding protein
Psbp domain-containing protein chlorop-like
All of the reads from the high γ-decalactone pool mapping to the gene24414-v1.0-hybrid (hereafter named FaFAD1 and FvFAD1 for the cultivated and wild strawberry, respectively) corresponded to one unique allele, indicating that only one allele is expressed in fruits of the 10 selected siblings. Similarly, all the reads from the no-γ-decalactone pool corresponded to the same allele as that found in the fruits with high content.
The deduced FaFAD1 protein sequence contains the Delta12 Fatty Acid Desaturase (Delta12-FADS)-like conserved domain (E-value: 1.74e-56). Membrane FADs are non-heme, iron-containing, oxygen-dependent enzymes involved in regioselective introduction of double bonds in fatty acyl aliphatic chains. These enzymes are responsible for the synthesis of 18:2 fatty acids in the endoplasmic reticulum. Six putative transmembrane domains are predicted within FaFAD1 using the TMpred program, as expected for an integral membrane protein (Figure 2). Alignment with other characterized FAD2 proteins indicated that the characteristic His-rich motifs, which contribute to the interaction with the electron donor cytochrome b5, were conserved in the deduced FaFAD1 protein. The most similar protein to FaFAD1 in Arabidopsis was the endoplasmic reticulum-localized oleate desaturase FAD2, which catalyzes the conversion of oleic acid (18:1) to linoleic acid (18:2).
To further investigate whether the down-regulation of FaFAD1 is the cause of the extremely low γ-decalactone content in strawberry fruits, we first validated the differential expression observed in the pools by qRT-PCR. As shown in Additional file 1: Figure S2A, the expression of FaFAD1 was ~30-fold higher in the high-γ-decalactone pool, the same differential expression obtained using RNA-seq (Table 2). Quantitative RT-PCR also validated the RNA-seq data for three other up-regulated genes (see below; Additional file 1: Figure S2).
Collocation of QTL for γ-decalactone content and eQTL for candidate genes
Based on their predicted function, we selected two additional candidate genes for further analyses within the top 25 highly up-regulated genes (Table 2). The fourth transcript in the list, gene17831-v1.0-hybrid, showed 17-fold (4.1 log2_fold_change) higher expression in the high-γ-decalactone pool (Table 2; Additional file 3). The predicted protein sequence has high similarity to the cytochrome P450, family 81, and contains the p450 superfamily conserved domain (E-value 8.37e-96) and the PLN02183 5-hydroxylase multidomain (E-value 1.22e-70). The gene02395-v1.0-hybrid, at the 23rd position in Table 2, showed a 5-fold up-regulation and encodes a predicted protein with high similarity to the cytochrome P450, family 79, subfamily A and thus also contains the p450 superfamily conserved domain (E-value 3.60e-40) and several hydroxylase domains such as PLN03018 (E-value 6.14e-125). The differential gene expression observed for these two transcripts was validated by qRT-PCR, obtaining a 14.8 and 6.5 fold-change in their expression between the pools for gene17831 and gene02395, respectively (Additional file 1: Figure S2). Since these two genes encode protein sequences with high homology to CYP hydroxylases, we further investigated their possible association with γ-decalactone biosynthesis by analyzing their expression in parental and all progeny lines of the ‘232’ × ‘1392’ mapping population (Additional file 1: Figure S4). A significant level of correlation between the transcript level of the F. × ananassa gene corresponding to gene17831-v1.0-hybrid (hereafter referred to as FaFAH1) and γ-decalactone content was observed (Pearson correlation = 0.45). On the contrary, no association between the transcript level of gene02395-v1.0-hybrid (hereafter referred to as FaCYP1) and γ-decalactone content was observed.
Furthermore, expression profiling in different tissues by qRT-PCR showed that FaFAH1 is expressed in leaf and ripe fruits while FaCYP1 was highly expressed in leaf and to a much lesser extent in green fruit (Figure 5B, C). These results are consistent with FaFAH1 but not FaCYP1 having a possible role in γ-decalactone accumulation in ripening strawberries.
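The Pearson correlation used above to relate transcript level to γ-decalactone content across progeny lines can be computed as in the following sketch. The function name is hypothetical and the example vectors are invented for illustration; they are not data from the mapping population.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squares of deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical expression and metabolite values for three lines.
r_example = pearson_r([1.0, 2.0, 3.0], [1.0, 3.0, 2.0])
```

A coefficient near 1 would indicate that expression and metabolite content rise together across lines, whereas intermediate values such as the one reported for FaFAH1 indicate a partial association.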
Two bulked pools of segregants representing the phenotypic extremes within a relatively large population displaying wide variation for a given trait would only differ at the locus controlling the trait. Although bulk segregant analysis (BSA) has generally been used to tag genes controlling Mendelian traits, the method can also be used to identify major QTL. The applicability of BSA to RNA-seq was recently demonstrated by mapping the maize mutant gene gl3. Here we report the combination of BSA and RNA-seq as a powerful and valid approach for quantifying differential transcript expression and for cost-efficient identification of genes underlying γ-decalactone variation in cultivated strawberry. Once we had fine-mapped the locus to the bottom of chromosome 3, the assumptions made for candidate genes were that (1) the gene must show low or no expression in fruits without γ-decalactone and high expression in fruits producing this VOC and (2) the gene must encode an enzyme involved in the biosynthesis of this volatile, based on the proposed pathways, or a regulatory protein. Out of the 33,458 analyzed transcripts, only gene24414 fulfilled both requirements. This gene encodes a protein, FaFAD1, with extensive similarity to delta-twelve fatty acid desaturases, enzymes that catalyze the regioselective introduction of a double bond at the Δ12 position during lipid biosynthesis. Therefore, the activity of this protein could supply fatty acid precursors for lactone biosynthesis. The sequence alignment of FaFAD1 with other desaturases revealed the presence of three conserved histidine boxes reported to be essential for catalysis and proposed to be the ligands for the iron atoms involved in the formation of the di-iron-oxygen complex.
Interestingly, the deduced FaFAD1 protein is shorter than the rest of FAD proteins and neither the dilysine nor the aromatic amino acid-enriched retrieval signal (-YKNKF) are present at the C-terminus of FaFAD1 (Figure 2). One of these motifs is necessary for maintaining localization of the enzymes in the endoplasmic reticulum (ER) . However, a PSORT algorithm (http://wolfpsort.org) predicts that FaFAD1 is targeted to the ER with a certainty of 8.0, consistent with the six transmembrane domains predicted for FaFAD1.
In addition to a desaturase activity, a number of FAD2 variants are known to possess diversified functionalities, catalyzing hydroxylations, epoxidations, or the formation of acetylenic and conjugated double bonds [35, 41, 42]. Some other FAD2 enzymes have bifunctional hydroxylase/desaturase or even tri-functional activities [43, 44]. A close homologue to FaFAD1 in peach, PpFAD1B-6, has been proposed to be involved in lactone production in fruits . This enzyme inserts a double bond between carbon 12 and 13 of monounsaturated oleic acid to generate polyunsaturated linoleic acid, but do not have any detectable hydroxylase activity. However, FaFAD1 is phylogenetically located in a different clade and more closely related to the castor bean hydroxylase RcFAH12 . Seven amino acid residues that differ between oleate desaturases and hydroxylases have been identified and the substitutions of alanine 148 and methionine 324 of the Arabidopsis AtFAD2 by isoleucines, as found in RcFAH12 or Lesquerella fendleri hydroxylase/desaturase (LfFAH12), caused a substantial shift in catalytic activity [45, 46]. Interestingly, these two isoleucines are conserved between FaFAD1 and hydroxylases, suggesting that the strawberry gene could encode for a bifunctional enzyme (Figure 2).
The expression profiling of FaFAD1 in different tissues showed that the gene is highly expressed and specific of red fruit of lines with high γ-decalactone content. Therefore, the expression is highly correlated with γ-decalactone biosynthesis, which occurs at the late stages of fruit ripening . In addition, the correlation of FaFAD1 expression with γ-decalactone content in the mapping population, the coincident map position between γ-decalactone and FaFAD1 and the predicted enzymatic activity of FaFAD1 protein indicate that this gene is responsible for the natural variation of this VOC in strawberry. Furthermore, it can be stated that the absence or extremely low levels of γ-decalactone in fruits of half of the population lines is a consequence of the absence or extremely low levels of FaFAD1 expression in these lines. The same FaFAD1 allele was detected in both bulked pools either using the reference genome to map the reads or after de novo assembly. The differential expression of FaFAD1 observed between both pools was alike using both methods (Additional file 1: Table S6) and was also validated by qRT-PCR (Additional file 1: Figure S2; Table S6). However, when the progeny lines were analyzed independently, FaFAD1 expression was not detected by qRT-PCR in fruits without γ-decalactone. Furthermore, different FaFAD1 primer pairs failed to amplify in genomic DNA of these lines (Additional file 1: Figure S3; see also companion manuscript), suggesting that the FaFAD1 gene may not be present in their genome. Taking these results together, the most plausible explanation is that the no γ-decalactone pools had some contamination during processing with some fruits containing the volatile.
Two other candidate genes were examined for a potential contribution to γ-decalactone production, based on their increased expression in the high γ-decalactone pool and their annotated enzymatic activities. While FaCYP1 was not associated with γ-decalactone content, FaFAH1 was up-regulated during fruit ripening. Our eQTL analysis of FaFAD1 and FaFAH1 indicates that both are associated with γ-decalactone. Whereas the association of FaFAD1 expression with γ-decalactone is complete, FaFAH1 shows a high but incomplete association. When an eQTL maps to the same genetic location as the gene whose transcript is being measured, as is the case for FaFAD1, it is generally caused by a cis-acting regulatory polymorphism in the gene (cis-eQTL), most probably a polymorphism in the promoter region that gives rise to differential expression. In contrast, eQTL that do not map to the location of the gene being assayed, such as that for FaFAH1, most likely represent trans-acting regulators (trans-eQTL) that may control the expression of a number of genes elsewhere in the genome. Based on the predicted functions of FaFAD1 and FaFAH1, we propose that the pathway for γ-decalactone biosynthesis in fruits proceeds through hydration of unsaturated fatty acids. In this proposed model, the enzyme FaFAD1 would catalyze the conversion of oleic acid (18:1) to linoleic acid (18:2) by introducing a double bond at the Δ12 position, as other FAD2 enzymes do. Additionally, FaFAD1 may possess hydroxylase activity, catalyzing the hydroxylation of oleic acid to ricinoleic acid. The fact that an eQTL for FaFAH1 expression was detected at the position where FaFAD1 maps suggests that FaFAD1 or, more likely, the product of FaFAD1 activity (i.e. linoleic acid) up-regulates the expression of FaFAH1, which may encode the enzyme catalyzing the next reaction in the biosynthetic pathway. This reaction is most probably a hydroxylation, although some CYP-related enzymes have been shown to have epoxidase activity. The ricinoleic acid derivative is then shortened by four β-oxidation cycles to form the corresponding 4-hydroxy acid. The last step in γ-decalactone biosynthesis involves cyclization of the molecule, either by an enzyme with alcohol acyltransferase activity or by spontaneous lactonization under acidic conditions.
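The carbon bookkeeping behind the chain-shortening step of the proposed model can be checked with a short sketch. It is an illustration only, not part of the study's analysis: the class and function names are hypothetical, and the model is deliberately simplified (each β-oxidation cycle removes two carbons from the carboxyl end, and the auxiliary steps needed to process the double bond of ricinoleic acid are ignored). Ricinoleic acid is taken as a C18 acid hydroxylated at C12.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HydroxyAcid:
    """Hypothetical minimal description of a hydroxy fatty acid."""
    carbons: int            # chain length, counting the carboxyl carbon as C1
    hydroxyl_position: int  # number of the carbon bearing the -OH group


def beta_oxidation_cycle(acid: HydroxyAcid) -> HydroxyAcid:
    """Remove one two-carbon unit from the carboxyl end (simplified model)."""
    if acid.hydroxyl_position <= 2:
        raise ValueError("the hydroxylated carbon would be removed by this cycle")
    # Losing C1 and C2 renumbers every remaining carbon by -2.
    return HydroxyAcid(acid.carbons - 2, acid.hydroxyl_position - 2)


def shorten(acid: HydroxyAcid, cycles: int) -> HydroxyAcid:
    """Apply the given number of beta-oxidation cycles in sequence."""
    for _ in range(cycles):
        acid = beta_oxidation_cycle(acid)
    return acid


def ring_class(acid: HydroxyAcid) -> str:
    """Name the lactone ring formed by esterifying C1 with the hydroxyl."""
    return {4: "gamma", 5: "delta"}.get(acid.hydroxyl_position, "other")


# Ricinoleic acid: 18 carbons, hydroxyl group on C12.
ricinoleic = HydroxyAcid(carbons=18, hydroxyl_position=12)
product = shorten(ricinoleic, cycles=4)

print(product)              # HydroxyAcid(carbons=10, hydroxyl_position=4)
print(ring_class(product))  # gamma
```

Under these assumptions, four cycles turn the C18 12-hydroxy acid into a C10 4-hydroxy acid, whose ring closure gives a γ-lactone with ten carbons, consistent with the 4-hydroxy acid intermediate and the γ-decalactone product described above.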
Understanding the basis of volatile organic compound (VOC) biosynthesis and regulation is of utmost importance for the genetic improvement of fruit flavor. This study provides genetic and molecular data on how the content of γ-decalactone is naturally controlled in strawberry and highlights enzymatic activities necessary for the formation of this VOC in fruits. γ-Decalactone has been shown to be a sensorially important VOC for strawberry flavor [17, 49]. However, volatiles also have other important functions: defending plants against pathogens, attracting pollinators, seed dispersers, and other beneficial animals and microorganisms, and serving as signals in plant–plant interactions. GO enrichment analysis of the genes up-regulated in fruits without γ-decalactone detected a significant enrichment in GO categories related to the response to pathogens. One plausible explanation is that this lactone has anti-pathogen activity and that, in its absence, other biotic stress response mechanisms are up-regulated to compensate for the lack of this VOC. In this context, γ-decalactone has been shown to be toxic to yeast and bacteria through its capacity to permeabilize membranes. These data suggest that this VOC might have a function in defense against pathogens, a possibility that deserves further investigation.
Availability of supporting data
The data sets supporting the results of this article are included within the article (and its additional files), and the raw RNA-seq reads are available in the European Nucleotide Archive (ENA) repository under accession PRJEB5430 (http://www.ebi.ac.uk/ena/data/view/PRJEB5430).
We are grateful to Aureliano Bombarely for advice on the bioinformatics analysis of RNA-seq data and to the Centro Nacional de Análisis Genómico (CNAG) for Illumina sequencing. This work was supported by the Spanish Ministry of Economy and Competitiveness and FEDER (grant numbers AGL2012-40066, BIO2010-15630), the EUBerry Project (EU FP7 KBBE–2010-4 Grant Agreement number 265942) and by a Marie Curie International Outgoing Fellowship within the 7th European Community Framework Programme to I.A. (IOF Flavor 328052).
- Pérez A, Sanz A: Strawberry flavor. Handbook of Fruit and Vegetable Flavors. Edited by: Hui HY. 2010, Hoboken, New Jersey: John Wiley & Sons, Inc, 437-455.
- Latrasse A: Fruits III. Volatile Compounds in Fruits and Beverages. Edited by: Maarse H. 1991, New York: Marcel Dekker, Inc, 329-387.
- Zabetakis I, Holden MA: Strawberry flavour: analysis and biosynthesis. J Sci Food Agric. 1997, 74: 421-434. 10.1002/(SICI)1097-0010(199708)74:4<421::AID-JSFA817>3.0.CO;2-6.
- Ménager I, Jost M, Aubert C: Changes in physicochemical characteristics and volatile constituents of strawberry (Cv. Cigaline) during maturation. J Agric Food Chem. 2004, 52: 1248-1254. 10.1021/jf0350919.
- Jetti RR, Yang E, Kurnianta A, Finn C, Qian MC: Quantification of selected aroma-active compounds in strawberries by headspace solid-phase microextraction gas chromatography and correlation with sensory descriptive analysis. J Food Sci. 2007, 72: S487-S496. 10.1111/j.1750-3841.2007.00445.x.
- Zorrilla-Fontanesi Y, Rambla J-L, Cabeza A, Medina JJ, Sánchez-Sevilla JF, Valpuesta V, Botella MA, Granell A, Amaya I: Genetic analysis of strawberry fruit aroma and identification of O-methyltransferase FaOMT as the locus controlling natural variation in mesifurane content. Plant Physiol. 2012, 159: 851-870. 10.1104/pp.111.188318.
- Osorio S, Muñoz C, Valpuesta V: Physiology and biochemistry of fruit flavors. Handbook of Fruit and Vegetable Flavors. Edited by: Hui HY. 2010, Hoboken, New Jersey: John Wiley & Sons, Inc, 25-43.
- Aragüez I, Valpuesta Fernández V: Metabolic engineering of aroma components in fruits. Biotechnol J. 2013, 8: 1144-1158.
- Schwab W, Davidovich-Rikanati R, Lewinsohn E: Biosynthesis of plant-derived flavor compounds. Plant J. 2008, 54: 712-732. 10.1111/j.1365-313X.2008.03446.x.
- Schöttler M, Boland W: Biosynthesis of dodecano-4-lactone in ripening fruits: crucial role of an epoxide-hydrolase in enantioselective generation of aroma components of the nectarine (Prunus persica var. nucipersica) and the strawberry (Fragaria ananassa). Helv Chim Acta. 1996, 79: 1488-1496. 10.1002/hlca.19960790521.
- Olbricht K, Grafe C, Weiss K, Ulrich D: Inheritance of aroma compounds in a model population of Fragaria × ananassa Duch. Plant Breed. 2008, 127: 87-93.
- Douillard C, Guichard E: Comparison by multidimensional analysis of concentrations of volatile compounds in fourteen frozen strawberry varieties [aroma, furaneol, mesifurane]. Sci Aliment. 1989, 9: 53-76.
- Husain Q: Chemistry and biochemistry of some vegetable flavors. Handbook of Fruit and Vegetable Flavors. Edited by: Hui HY. 2010, Hoboken, New Jersey: John Wiley & Sons, Inc, 575-625.
- Sánchez G, Venegas-Calerón M, Salas JJ, Monforte A, Badenes ML, Granell A: An integrative “omics” approach identifies new candidate genes to impact aroma volatiles in peach fruit. BMC Genomics. 2013, 14: 343. 10.1186/1471-2164-14-343.
- Xi W-P, Zhang B, Liang L, Shen J-Y, Wei W-W, Xu C-J, Allan AC, Ferguson IB, Chen K-S: Postharvest temperature influences volatile lactone production via regulation of acyl-CoA oxidases in peach fruit. Plant Cell Environ. 2012, 35: 534-545. 10.1111/j.1365-3040.2011.02433.x.
- Eduardo I, Chietera G, Pirona R, Pacheco I, Troggio M, Banchi E, Bassi D, Rossini L, Vecchietti A, Pozzi C: Genetic dissection of aroma volatile compounds from the essential oil of peach fruit: QTL analysis and identification of candidate genes using dense SNP maps. Tree Genetics & Genomes. 2013, 9: 189-204. 10.1007/s11295-012-0546-z.
- Larsen M, Poll L, Olsen C: Evaluation of the aroma composition of some strawberry (Fragaria ananassa Duch) cultivars by use of odour threshold values. Zeitschrift für Lebensmittel untersuchung und Forschung. 1992, 195: 536-539. 10.1007/BF01204558.
- Rousseau-Gueutin M, Lerceteau-Kohler E, Barrot L, Sargent DJ, Monfort A, Simpson D, Arus P, Guerin G, Denoyes-Rothan B: Comparative genetic mapping between octoploid and diploid fragaria species reveals a high level of colinearity between their genomes and the essentially disomic behavior of the cultivated octoploid strawberry. Genetics. 2008, 179: 2045-2060. 10.1534/genetics.107.083840.
- Zorrilla-Fontanesi Y, Cabeza A, Domínguez P, Medina JJ, Valpuesta V, Denoyes-Rothan B, Sánchez-Sevilla JF, Amaya I: Quantitative trait loci and underlying candidate genes controlling agronomical and fruit quality traits in octoploid strawberry (Fragaria × ananassa). Theor Appl Gen. 2011, 123: 755-778. 10.1007/s00122-011-1624-6.
- Isobe SN, Hirakawa H, Sato S, Maeda F, Ishikawa M, Mori T, Yamamoto Y, Shirasawa K, Kimura M, Fukami M, Hashizume F, Tsuji T, Sasamoto S, Kato M, Nanri K, Tsuruoka H, Minami C, Takahashi C, Wada T, Ono A, Kawashima K, Nakazaki N, Kishida Y, Kohara M, Nakayama S, Yamada M, Fujishiro T, Watanabe A, Tabata S: Construction of an integrated high density simple sequence repeat linkage map in cultivated strawberry (fragaria x ananassa) and its applicability. DNA Res. 2013, 20: 79-92. 10.1093/dnares/dss035.
- Sargent DJ, Passey T, Šurbanovski N, Lopez Girona E, Kuchta P, Davik J, Harrison R, Passey A, Whitehouse AB, Simpson DW: A microsatellite linkage map for the cultivated strawberry (Fragaria × ananassa) suggests extensive regions of homozygosity in the genome that may have resulted from breeding and selection. Theor Appl Gen. 2012, 124: 1229-1240. 10.1007/s00122-011-1782-6.
- Bombarely A, Merchante C, Csukasi F, Cruz-Rus E, Caballero JL, Medina-Escobar N, Blanco-Portales R, Botella MA, Muñoz-Blanco J, Sánchez-Sevilla JF, Valpuesta V: Generation and analysis of ESTs from strawberry (Fragaria xananassa) fruits and evaluation of their utility in genetic and molecular studies. BMC Genomics. 2010, 11: 503. 10.1186/1471-2164-11-503.
- Shulaev V, Sargent DJ, Crowhurst RN, Mockler TC, Folkerts O, Delcher AL, Jaiswal P, Mockaitis K, Liston A, Mane SP, Burns P, Davis TM, Slovin JP, Bassil N, Hellens RP, Evans C, Harkins T, Kodira C, Desany B, Crasta OR, Jensen RV, Allan AC, Michael TP, Setubal JC, Celton J-M, Rees DJG, Williams KP, Holt SH, Rojas JJR, Chatterjee M, et al: The genome of woodland strawberry (Fragaria vesca). Nature Gen. 2011, 43: 109-116. 10.1038/ng.740.
- Martin LBB, Fei Z, Giovannoni JJ, Rose JKC: Catalyzing plant science research with RNA-seq. Front Plant Sci. 2013, 4: 66.
- Higgins J, Magusin A, Trick M, Fraser F, Bancroft I: Use of mRNA-seq to discriminate contributions to the transcriptome from the constituent genomes of the polyploid crop species Brassica napus. BMC Genomics. 2012, 13: 1-1. 10.1186/1471-2164-13-1.
- Manning K: Isolation of nucleic acids from plants by differential solvent precipitation. Anal Biochem. 1991, 195: 45-50. 10.1016/0003-2697(91)90292-2.
- Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL: TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013, 14: R36. 10.1186/gb-2013-14-4-r36.
- Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L: Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat Biotechnol. 2012, 31: 46-53. 10.1038/nbt.2450.
- Conesa A, Götz S: Blast2GO: A comprehensive suite for functional analysis in plant genomics. Int J Plant Genom. 2008, 2008: 619832.
- Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, Hacohen N, Gnirke A, Rhind N, di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A: Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011, 29: 644-652. 10.1038/nbt.1883.
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
- Pfaffl MW: A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res. 2001, 29: 2002-2007.
- Salvatierra A, Pimentel P, Moya-León MA, Caligari PDS, Herrera R: Comparison of transcriptional profiles of flavonoid genes and anthocyanin contents during fruit development of two botanical forms of Fragaria chiloensis ssp. chiloensis. Phytochemistry. 2010, 71: 1839-1847. 10.1016/j.phytochem.2010.08.005.
- Okuley J, Lightner J, Feldmann K, Yadav N, Lark E, Browse J: Arabidopsis FAD2 gene encodes the enzyme that is essential for polyunsaturated lipid synthesis. Plant Cell. 1994, 6: 147-158.
- van de Loo FJ, Broun P, Turner S, Somerville C: An oleate 12-hydroxylase from Ricinus communis L. is a fatty acyl desaturase homolog. Proc Natl Acad Sci U S A. 1995, 92: 6743-6747. 10.1073/pnas.92.15.6743.
- Hernández ML, Mancha M, Martínez-Rivas JM: Molecular cloning and characterization of genes encoding two microsomal oleate desaturases (FAD2) from olive. Phytochemistry. 2005, 66: 1417-1426. 10.1016/j.phytochem.2005.04.004.
- Kliebenstein D: Quantitative genomics: analyzing intraspecific variation using global gene expression polymorphisms or eQTLs. Annu Rev Plant Biol. 2009, 60: 93-114. 10.1146/annurev.arplant.043008.092114.
- Collard BCY, Jahufer MZZ, Brouwer JB, Pang ECK: An introduction to markers, quantitative trait loci (QTL) mapping and marker-assisted selection for crop improvement: the basic concepts. Euphytica. 2005, 142: 169-196. 10.1007/s10681-005-1681-5.
- Liu S, Yeh C-T, Tang HM, Nettleton D, Schnable PS: Gene mapping via bulked segregant RNA-Seq (BSR-Seq). PLoS ONE. 2012, 7: e36406. 10.1371/journal.pone.0036406.
- McCartney AW, Dyer JM, Dhanoa PK, Kim PK, Andrews DW, McNew JA, Mullen RT: Membrane-bound fatty acid desaturases are inserted co-translationally into the ER and contain different ER retrieval motifs at their carboxy termini. Plant J. 2004, 37: 156-173. 10.1111/j.1365-313X.2004.01949.x.
- Shanklin J, Cahoon EB: Desaturation and related modifications of fatty acids 1. Annu Rev Plant Physiol Plant Mol Biol. 1998, 49: 611-641. 10.1146/annurev.arplant.49.1.611.
- Shanklin J, Guy JE, Mishra G, Lindqvist Y: Desaturases: emerging models for understanding functional diversification of diiron-containing enzymes. J Biol Chem. 2009, 284: 18559-18563. 10.1074/jbc.R900009200.
- Broun P, Boddupalli S, Somerville C: A bifunctional oleate 12-hydroxylase: desaturase from Lesquerella fendleri. Plant J. 1998, 13: 201-210. 10.1046/j.1365-313X.1998.00023.x.
- Cao S, Zhou X-R, Wood CC, Green AG, Singh SP, Liu L, Liu Q: A large and functionally diverse family of Fad2 genes in safflower (Carthamus tinctorius L.). BMC Plant Biol. 2013, 13: 5. 10.1186/1471-2229-13-5.
- Broadwater JA, Whittle E, Shanklin J: Desaturation and hydroxylation. Residues 148 and 324 of Arabidopsis FAD2, in addition to substrate chain length, exert a major influence in partitioning of catalytic specificity. J Biol Chem. 2002, 277: 15613-15620. 10.1074/jbc.M200231200.
- Broun P, Shanklin J, Whittle E, Somerville C: Catalytic plasticity of fatty acid modification enzymes underlying chemical diversity of plant lipids. Science. 1998, 282: 1315-1317.
- Pinot F, Beisson F: Cytochrome P450 metabolizing fatty acids in plants: characterization and physiological roles. FEBS J. 2010, 278: 195-205.
- Waché Y, Aguedo M, Nicaud JM, Belin JM: Catabolism of hydroxyacids and biotechnological production of lactones by Yarrowia lipolytica. Appl Microbiol Biotechnol. 2003, 61: 393-404. 10.1007/s00253-002-1207-1.
- Ubeda C, San-Juan F, Concejero B, Callejón RM, Troncoso AM, Morales ML, Ferreira V, Hernández-Orte P: Glycosidically bound aroma compounds and impact odorants of four strawberry varieties. J Agric Food Chem. 2012, 60: 6095-6102. 10.1021/jf301141f.
- Dudareva N, Pichersky E: Metabolic engineering of plant volatiles. Curr Opin Biotechnol. 2008, 19: 181-189. 10.1016/j.copbio.2008.02.011.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.