Deciphering gamma-decalactone biosynthesis in strawberry fruit using a combination of genetic mapping, RNA-Seq and eQTL analyses
BMC Genomics volume 15, Article number: 218 (2014)
Understanding the basis for volatile organic compound (VOC) biosynthesis and regulation is of great importance for the genetic improvement of fruit flavor. Lactones constitute an essential group of fatty acid-derived VOCs conferring peach-like aroma to a number of fruits including peach, plum, pineapple and strawberry. Early studies on lactone biosynthesis suggest that several enzymatic pathways could be responsible for the diversity of lactones, but detailed information on them remained elusive. In this study, we have integrated genetic mapping and genome-wide transcriptome analysis to investigate the molecular basis of natural variation in γ-decalactone content in strawberry fruit.
As a result, the fatty acid desaturase FaFAD1 was identified as the gene underlying the locus at LGIII-2 that controls γ-decalactone production in ripening fruit. The FaFAD1 gene is specifically expressed in ripe fruits and its expression fully correlates with the presence of γ-decalactone in all 95 individuals of the mapping population. In addition, we show that the level of expression of FaFAH1, with similarity to cytochrome p450 hydroxylases, significantly correlates with the content of γ-decalactone in the mapping population. The analysis of expression quantitative trait loci (eQTL) suggests that the product of this gene also has a regulatory role in the biosynthetic pathway of lactones.
Altogether, this study provides mechanistic information of how the production of γ-decalactone is naturally controlled in strawberry, and proposes enzymatic activities necessary for the formation of this VOC in plants.
The flavor and aroma of strawberries (Fragaria × ananassa) arise from a specific combination of sugars, acids and volatile organic compounds (VOCs) that varies widely among different cultivars and Fragaria species . More than 360 VOCs have been detected in strawberry, including esters, aldehydes, ketones, alcohols, terpenes, furanones, and sulfur compounds [2–6]. Lactones constitute a group of fatty acid-derived flavor molecules, which have γ-(4-) or δ-(5-)-lactone structures, and have been isolated from bacterial, plants and animal sources [7, 8]. Fruits are considered as a particularly rich source of lactones, conferring peach-like aroma and flavor in order to attract feeders for seed dispersal [9, 10]. During strawberry maturation, the levels of compounds defined as green-volatiles decrease whereas levels of flavor compounds characteristic of ripe fruits, including esters and lactones, increase in parallel to other ripening-regulated processes such as anthocyanin accumulation . Up to 10 different lactones have been identified in strawberry [1, 3, 11] and, among them, γ-decalactone is the most abundant, reaching maximum levels in fully red ripe fruits [4, 12].
Lactones containing 8–12 carbon atoms are very potent flavor constituents in a variety of fruits such as strawberry, pineapple and peach. Biosynthetic studies indicate that several pathways originating from β-oxidation of unsaturated fatty acids are responsible for the structural diversity of lactones [9, 10, 13]. All lactones originate from their corresponding 4- or 5-hydroxy carboxylic acids, although the precise mechanism by which these substrates are produced remains elusive . However, four different mechanisms have been proposed, suggesting that the oxygen atom could be introduced by either (1) reduction of oxo acids by NAD-linked reductases, (2) hydration of unsaturated fatty acids, (3) epoxidation and hydrolysis of unsaturated fatty acids, or (4) reduction of hydroperoxides [9, 14]. To the best our knowledge, the enzymes specifically involved in the formation of lactones have not yet been reported, however, candidate enzymatic activities, such as acyl-CoA dehydrogenase, which is the first enzyme in fatty acid β-oxidation, have been proposed to be important for lactone production . Epoxide hydrolases have been associated to γ-dodecalactone biosynthesis in peach, implying that the synthesis of this lactone could proceed through epoxidation of unsaturated fatty acids . Alternatively, the hydroxylation of unsaturated fatty acids could involve desaturases and cytochromes P450 (CYP) or other hydroxylases not related to CYPs .
The concentration of lactones in peach is controlled by multiple loci with quantitative effects, quantitative trait loci (QTL), hampering the identification of the genetic determinants controlling their biosynthesis or regulation . In contrast, the content of γ-decalactone has been shown to be controlled by one dominant locus in strawberry and, consequently, a number of strawberry cultivars lacking γ-decalactone have been reported [5, 6, 17]. We have previously reported that γ-decalactone is produced in the parental line ‘1392’ but not in ‘232’ of a strawberry mapping population and that the segregation of their F1 progeny matched a Mendelian 1:1 ratio with the locus controlling this trait mapped to the bottom arm of LG III-2 . In this context, this population represents a valuable tool to identify the gene(s) responsible for the natural variation of this VOC in strawberry, and to provide novel information about its biosynthesis in plants.
Although cultivated strawberry is an octoploid (2n = 8× = 56), and as such, four subgenomes are present in its complex genome, most loci show a disomic segregation. Furthermore its genome has a high level of conservation with the model species Fragaria vesca (2n = 2× = 14), including an almost complete synteny and high colinearity [18–22]. Thus, the available genome sequence of the diploid F. vesca can be used as a reference for genomic and genetic studies within the genus .
RNA-seq is replacing other methods of quantifying transcript expression, including microarray platforms , as it overcomes some of their limitations, such as detection of only those transcripts that are represented on microarrays, low dynamic range (limited upper and lower limits of detection), and thus provides more accurate quantification of differential transcript expression. A clear advantage of RNA-seq is the detection of novel non-annotated transcripts and, most relevant for highly heterozygous plants and polyploids such as F. × ananassa, the detection of the different alleles and homoeologous genes within their genomes [24, 25]. In this report, we have combined genome-wide RNA-seq analysis to a bulk segregant approach to identify a gene controlling γ-decalactone content in strawberry. Additional candidate genes of the biosynthetic pathway of lactones are also reported based on this genome-wide analysis. All together, this study provides information of how the content of γ-decalactone is naturally controlled in strawberry fruit and proposes enzymatic activities necessary for the formation of this VOC in plants.
The ‘232’ × ‘1392’ F1 mapping population, comprising 95 progeny lines, was used in this study. This population is derived from the cross between selection lines ‘232’ and ‘1392’ and is described in detail in . ‘232’ is a very productive strawberry (Fragaria × ananassa) line, whereas ‘1392’ has firmer and tastier fruits [6, 19]. The mapping population was grown in the strawberry-producing area of Huelva (Spain) under commercial conditions during the 2011/2012 season. Six plants of each line were vegetatively propagated and grown. Ripe fruits (10-15) were collected the same day from the six plants of each line, divided into three biological replicates and independently grinded in liquid nitrogen. Samples were stored at -80ºC until further analysis.
RNA isolation and RNA-seq from pooled samples
Equivalent amounts of ripe fruit tissue from 10 γ-decalactone producing and non-producing progeny lines (Table 1) were collected in triplicate and separately pooled for RNA extraction. The three biological replicates of each bulked pool were named H γ-DEC1-3 and N γ-DEC1-3 (for High and No γ-decalactone pool, respectively) and used in the analysis. Total RNA was extracted from pooled strawberry fruits using a differential 2-butoxyethanol precipitation-based method . Prior to reverse transcription, RNA was treated with DNase I (Fermentas) to remove contaminating genomic DNA. RNA quantity and quality were determined based on absorbance ratios at 260 nm/280 nm and 260 nm/230 nm using a Nanodrop. RNA integrity was confirmed by the appearance of ribosomal RNA bands and lack of degradation products after separation in agarose gel electrophoresis and ethidium bromide staining. The integrity of the RNA samples was further verified using the 2100 Bioanalyzer (Agilent, Folsom, CA) and RIN values ranged between 7.2 and 7.4 for the six samples.
For each of the 6 (2 bulks with 3 biological replicates) samples, one paired-end library with approximately 300 bp insert size was prepared using an in-house optimized Illumina protocol at the Centro Nacional de Análisis Genómico (CNAG) facilities. Libraries were sequenced on Illumina HiSeq2000 lanes using 2 × 100 bp reads. More than 30 million reads were generated for each sample. Primary analysis of the data included base calling and quality control, with an assurance that >80% of all bases passing filter had a quality value of at least 30.
Mapping RNA-seq reads to the reference genome and generation of read counts
Raw RNA-seq reads were processed to remove low-quality nucleotides and aligned to the Fragaria vesca reference genome (v1.1) and CDS (v1.0)  using the program TopHat v2.0.6 . Default parameters of TopHat were used, allowing 40 multiple alignments per read and a maximum of 2 mismatches when mapping reads to the reference. The mapping results were then used to identify “islands” of expression, which can be interpreted as potential exons. TopHat builds a database of potential splice junctions and confirms these by comparing the previously unmapped reads against the database of putative junctions.
The aligned read files were processed by Cufflinks v2.0.2 . Reads were assembled into transcripts, their abundance estimated, and tests for differential expression and regulation between the samples were performed. Cufflinks does not make use of existing gene annotations during assembly of transcripts, but rather assembles a minimum set of transcripts that best describe the reads in the dataset. This approach allows Cufflinks to identify alternative transcription and splicing that are not described by pre-existing gene models . The normalized RNA-seq fragment counts were used to measure the relative abundances of transcripts expressed as Fragments Per Kilobase of exon per Million fragments mapped (FPKM). Confidence intervals for FPKM estimates were calculated using a Bayesian inference method .
Comparison to reference annotation and differential expression analysis
Once all short read sequences were assembled with Cufflinks, the output GTF files were sent to Cuffcompare along with a reference GTF annotation file, downloaded from Genome Database for Rosaceae (GDR) database (Fragaria vesca Whole Genome v1.1 Assembly & Annotation. http://www.rosaceae.org/). This classified each transcript as known or novel. Cuffcompare produced a combined. GTF file which was passed to Cuffdiff along with the original alignment (.SAM) files produced by TopHat to identify differentially expressed transcripts between the two pools. The Cuffdiff algorithm then re-estimated the abundance of transcripts listed in the GTF file using alignments from the SAM file, and concurrently tested for differential expression between the high γ-decalactone and the no γ-decalactone pools using a rigorous statistical analysis . The significance scores were corrected for multiple testing using the Benjamini-Hochberg correction. The expression testing is done at the level of transcripts, primary transcripts and genes. By tracking changes in the relative abundance of transcripts with a common transcription start site, Cuffdiff can also identify changes in splicing.
Visualization of mapped reads
Mapping results were visualized using a local copy of the Integrative Genomics Viewer software available at http://www.broadinstitute.org/igv/. Views of individual genes were generated by uploading TopHat-generated files containing the sequence alignment data (.bam files) to the genome browser.
Functional analysis of gene lists using BLAST2GO
The BLAST2GO v 2.4 suite was used for functional annotation of sequences, data mining and gene set enrichment analysis . The functional clustering tool was used to look for functional enrichment for genes over- and under-expressed more than two-fold between the pools. GO enrichment was derived with Fisher’s exact test and a cutoff of false discovery rate < 0.05 using the F. vesca genome annotation as reference background. A unique list of gene symbols was uploaded via the web interface. Gene Ontology Biological Process was selected as the functional annotation category for this analysis.
De novo assembly of Fragaria × ananassaRNA-seq reads
Since the current F. vesca genome sequence and the gene model is still a draft, some RNA-seq transcript sequences appeared truncated. Therefore, we proceeded to a de-novo assembly of the reads corresponding to the high- and no-γ-decalactone pools to obtain the full-length transcripts expressed in F. × ananassa using Trinity. The transcript contigs most similar to the F. vesca candidate genes were identified by mean of blast search.
Multiple sequence alignment was carried out with CLUSTALW at the default settings. Phylogenetic analyses were conducted using the neighbor-joining algorithm and Poisson model in MEGA version 5 . Protein targeting predictions were done using WoLF PSORT analysis (http://wolfpsort.org) and transmembrane domain search with the TMpred program (http://www.ch.embnet.org/software/TMPRED_form.html).
Real time qRT-PCR analysis
Total RNA was extracted from strawberry tissues as described previously for the RNA-seq experiment. First-strand cDNA was synthesized from 1 μg of total RNA using the iScript cDNA Synthesis kit (Bio-Rad) according to the manufacturer’s instructions. Gene expression was analyzed by quantitative real time polymerase chain reaction (qRT-PCR) using the fluorescent intercalating dye SYBRGreen I in an iQ5 real-time PCR detection system (Bio-Rad). Three biological replicates for each line and three independent synthesis of cDNA for each RNA sample were used for qRT-PCR. Relative quantification of the expression levels for the target genes was performed using the comparative Ct method . Glyceraldehyde-3-phosphate dehydrogenase gene (GAPDH) was used as normalizing gene . Primers are described in Additional file 1: Table S1.
QTL and expression quantitative trait loci (eQTL) analysis
QTL analyses were performed using MapQTL 5 as previously described . The raw relative data was analyzed first by the nonparametric Kruskal-Wallis rank-sum test. A stringent significance level of P = 0.005 was used as threshold. Second, the integrated genetic linkage map and transformed data sets for most traits were used to identify and locate QTLs using Interval Mapping. Significance LOD thresholds were estimated with a 1,000-permutation test for each trait and QTLs with LOD scores greater than the genome-wide threshold at 95% were declared significant.
In order to increase the precision of the ‘232’ × ‘1392’ map and identify new markers closely linked to the locus controlling γ-decalactone, we first saturated the previous map with DArT-derived SNP markers and developed a saturated map. The octoploid strawberry homoeology group (HG) III, where the locus was previously mapped , is presented in Figure 1 in comparison to the F. vesca pseudochromosome 3 (fvesca_v1.1_pseudo.fna). The 4 homoelogous linkage groups (LGs), with lengths of 79.4, 106.2, 102 and 88.8 cM, could now be identified instead of 7 shorter LGs in the previous integrated map . The average marker density in HG III was increased to 0.87 cM/marker and the largest gap ranged from 5.3 cM for LG III-1 to 7.7 cM for LG III-3 (Figure 1; Additional file 2). The locus controlling γ-decalactone was fine-mapped to the bottom of Fragaria × ananassa LG III-2, closely linked to markers BFACT-45 and ChFvM140, consistent with our previous data . In addition, six new SNP markers were mapped in the ± 3 cM interval to the γ-decalactone locus (Figure 1; Additional file 2).
In order to identify the determinants of the variation in γ-decalactone content in strawberry fruit, we aimed to identify differentially expressed genes between pools of fruits from lines contrasting in γ-decalactone content in the ‘232’ × ‘1392’ population using RNA-seq. Later, differentially expressed genes would be analyzed for their mapping position. Those genes convening the two conditions, i.e., highly expressed in fruits of high γ-decalactone lines and located within the QTL interval would be considered for further analysis. RNA was extracted from bulked pools of ripe fruits from 10 progeny lines with high γ-decalactone content and from 10 lines not producing the volatile (Table 1) and used in biological triplicate for Illumina RNA sequencing. An alignment of sequencing reads was performed using the reference Fragaria vesca Whole Genome (v1.1) and annotation (CDS v1.0) [; Genome Database for Rosaceae (GDR), http://www.rosaceae.org] using TopHat . Over 218 million reads 100 bp-long were generated and after removal of adaptor sequences and low-quality reads, 211.6 million clean reads remained (97% of the raw data). Between 68.7% and 70% of reads were paired for each of the 6 samples and an average of 68.2% of filtered paired reads were further mapped to the F. vesca genome. Some key metrics that allowed the assessment of the quality of mapping reads to the reference genome were extracted from the TopHat output and log files, and are shown in Additional file 1: Table S2.
After mapping the RNA-seq reads to the reference genome, transcripts were assembled and their relative abundances calculated using Cufflinks . Genes with normalized reads lower than 0.1 fragments per kilobase of exon per million fragments (FPKM) were considered as not expressed. A total of 33,458 gene/transcripts from the two F. x ananassa pools were predicted based in the reference model and 19,833 and 19,720 were expressed in the ripe fruits of the high γ-decalactone and the no γ-decalactone pools, respectively.
Differential gene expression (DGE) between the high γ-decalactone and the no γ-decalactone pools was calculated using the ratio of FPKM values of each gene in both pools. A total of 617 predicted genes were differentially expressed between the two pools and re-annotated using Blast2go. Of these, 403 were up-regulated and 214 were down-regulated in the high-γ-decalactone pool (Additional file 3). The observed ratios (log2 fold change) of differential expression ranged from -5.161 to 3.489, with negative and positive values indicating up- and down-regulation in the high-γ-decalactone pool, respectively. Only one gene (gene24970-v1.0-hybrid) encoding for a predicted protein with similarity to cinnamyl alcohol dehydrogenase, was not expressed at all in the no-γ-decalactone pool and expressed in the high-γ-decalactone pool, albeit with a relatively low value of expression (3.79 FPKM). Among the 617 differentially expressed transcripts, 577 corresponded to annotated genes in the F. vesca gene model v.1.0  while 40 matched with not annotated genome regions. Among these, 28 corresponded to predicted genes from F. vesca recently annotated in the NCBI, while the remaining 12 transcripts have not yet been annotated. Some gene families appeared over-represented in the fruits with high γ-decalactone content such as cinnamyl alcohol dehydrogenases, with 6 differentially expressed genes, glutathione s-transferases, with 7 up-regulated genes, and cytochrome p450, with 5 up-regulated members (Additional file 3).
Functional annotation and enrichment analysis
In order to describe gene functions in a standard and controlled vocabulary, we used the Blast2GO suite. A total of 3,757 gene ontology (GO) terms were assigned to a total of 559 differentially expressed genes, while 58 did not match any terms. Sequence distribution (at level 2, filtered by a cut-off of 60 sequences) for biological processes, molecular functions and cellular component are summarized in Additional file 1: Figure S1. Within biological processes, the most abundant categories were metabolic process (380 sequences or 27%), cellular process (350 sequences; 25%) and response to stimulus (210 sequences; 15%). The most represented molecular function was catalytic activity (299 sequences; 49%) and a large proportion of sequences (177; 22%) were associated with membrane as cellular compartment.
To investigate the biological processes associated with differences in γ-decalactone content, a GO enrichment analysis was performed using Fisher’s exact test using the sets of up-regulated and down-regulated transcripts separately in comparison to those in the reference F. vesca gene model. A total of 51 biological processes were significantly enriched for the genes up-regulated in fruits with high γ-decalactone content (Additional file 1: Table S3). Most of these 51 common ontologies are ‘descendants’ of 5 higher hierarchical nodes in the GO tree: response to stimulus (GO:0050896), cellular aromatic compound metabolic process (GO:0006725), lipid homeostasis (GO:0055088), nitrogen compound metabolic process (GO:0006807) and organic substance metabolic process (GO:0071704). Within these biological processes, the most significantly over represented term was oxidation-reduction process (GO:0055114, 92 genes within this term). One possible interpretation is that enzymes catalyzing the addition or removal of electrons are needed in the biosynthesis of γ-decalactone in strawberry.
The number of biological processes significantly enriched within the down-regulated genes (up-regulated in the no-γ-decalactone pool) was higher, 101, and more diverse (Additional file 1: Table S4). The three most significantly over represented terms were regulation of biological quality (GO:0065008), response to biotic stimulus (GOO:0009607) and ion transport (GO:0006811). Globally, these data suggest that lacking γ-decalactone in strawberry fruits is associated to a wide range of different biological processes. Particularly interesting is the number of genes up-regulated in the categories of response to stimulus and biotic stress in the absence of γ-decalactone.
Identification of FaFAD1 as the gene underlying the locuscontrolling γ-decalactone content in LG III-2
The top 25 up-regulated genes in the pool with high γ-decalactone content are listed in Table 2. The third gene with the highest up-regulation between the pools corresponds to the Fragaria vesca gene ID gene24414-v1.0-hybrid (gene24414), with homology to fatty acid desaturases (FAD) and in particular to the microsomal Δ-12 oleate desaturase (FAD2). This gene shows a high expression in the pool of fruits with γ-decalactone (325.92 FPKM) and its expression is ~30-fold higher (4.8 log2-fold) than in the pool of fruits without γ-decalactone (11.41 FPKM; Table 2; Additional file 3). Most interestingly, gene24414 maps to the end of the pseudo-molecule 3 of the Fragaria vesca reference genome, at the exact position where the gene controlling γ-decalactone has been mapped in F. x ananassa (Figure 1). In addition to gene24414, only 3 other genes among the 617 differentially expressed genes mapped to the γ-decalactone content locus. Two of them, gene24411 and gene24415, at about 34 and 9 Kb from gene24414, respectively, and a third, gene14386, separated by 808 Kb. However, the fold-change between the bulked pools for these 3 genes was much smaller than for gene24414, ranging from 0.5 to 0.9 log2-fold. Two of these genes, 24411 and 14386, with similarity to callose synthase and glycerophosphoryl diester phosphodiesterase, respectively, were down-regulated in the high γ-decalactone pool. The third gene, 24415 with highest similarity to the nucleolar complex protein 2, showed higher expression in the high γ-decalactone pool (Additional file 3).
All of the reads from the high γ-decalactone pool mapping to the gene24414-v1.0-hybrid (hereafter named FaFAD1 and FvFAD1 for the cultivated and wild strawberry, respectively) corresponded to one unique allele, indicating that only one allele is expressed in fruits of the 10 selected siblings. Similarly, all the reads from the no-γ-decalactone pool corresponded to the same allele to that in the fruits with high content.
The predicted gene24414 in the strawberry draft genome sequence  contains 5 exons. However, when visualized using the integrative genomics viewer (IGV), the reads of the RNA-seq experiments only spanned the first two exons indicating that FvFAD1 was not correctly annotated in the strawberry genome sequence. A different model of the same gene that only spans the two first exons is available in the NCBI under accession number LOC101309231, and is predicted to encode a protein of 393 amino acids. However, the last 16 amino acids of the carboxylic end of the predicted protein do not have similarity to reported FAD2 proteins (data not shown). In order to unequivocally determine the Fragaria x ananassa FAD1 transcript, we performed a de novo assembly of the RNA-seq reads from the high γ-decalactone pool replicates. Only one contig of 1847 bp with high similarity (excluding 25 bp of the 3′ end of the ORF and the 3′UTR) to the F. vesca FvFAD1 gene was obtained, and contained an ORF of 1125 bp encoding a predicted protein of 375 amino acid residues. In this prediction, the C-terminus shares high similarity to FAD2 proteins except that FaFAD1 lacks the two last amino acids (Figure 2). However, the lack of these two amino acids is also found in the predicted amino acid sequence encoded by a F. × ananassa EST (accession number CO381851) available in the NCBI, indicating that this might be the correct sequence. The full length ORF of FaFAD1, including a 3′ un-translated region, was cloned from parental line ‘1392’ using primers deduced from the transcriptome assembly and confirmed the predicted sequence. In agreement to previous results using the reference F. vesca genome, the same FaFAD1 allele was obtained after de novo assembly of the RNA-seq reads from the no-γ-decalactone pool.
The deduced FaFAD1 protein sequence contains the Delta12 Fatty Acid Desaturase (Delta12-FADS)-like conserved domain (E-value: 1.74e-56). Membrane FADs are non-heme, iron-containing, oxygen-dependent enzymes involved in regioselective introduction of double bonds in fatty acyl aliphatic chains. These enzymes are responsible for the synthesis of 18:2 fatty acids in the endoplasmic reticulum. Six putative transmembrane domains are predicted within FaFAD1 using the TMpred program as expected for an integral membrane protein (Figure 2). Alignment with other characterized FAD2 proteins indicated that the characteristic His-rich motifs, which contribute to the interaction with the electron donor cytochrome b5, were conserved in the deduced FaFAD1 protein. The most similar protein to FaFAD1 in Arabidopsis was the endoplasmid reticulum localized oleate desaturase FAD2 catalyzing the conversion of oleic acid (18:1) to linoleic acid (18:2) .
Highly similar proteins to FaFAD1 were identified after a blastp search in the NCBI and the phytozome database (http://www.phytozome.net). As shown in Figure 3, the phylogenetic analysis indicates the presence of two major clusters. As expected, the FaFAD1 protein was most similar to the F. vesca predicted protein encoded by the gene24414, located at the bottom of chromosome 3, and also to a predicted FAD protein from the closely related Malus genus. Interestingly, this group of protein sequences grouped with the Ricinus communis fatty acid hydroxylase RcFAH12. This protein, which shares high sequence similarity to desaturases, has been shown to catalyze the hydroxylation of oleate to produce the hydroxy fatty acid ricinoleate . A recently identified FAD from Prunus persica, PpFAD1B-6, has been associated with lactone content in peach fruits using an integrative ‘omics’ approach . This protein grouped in a second cluster with the Ricinus communis desaturase RcFAD2 and other reported desaturases, such as the Arabidopsis AtFAD2.
To further investigate whether the down-regulation of FaFAD1 is the cause for the extremely low γ-decalactone content in strawberry fruits, we first validated the differential expression observed in the pools by qRT-PCR. As shown in Additional file 1: Figure S2A, the expression of FaFAD1 was ~30-fold higher in the high-γ-decalactone pool, the same differential expression obtained using RNA-seq (Table 2). Quantitative RT-PCR also validated the RNA-seq data for other three up-regulated genes (see below; Additional file 1: Figure S2).
We next examined the expression level of the gene by qRT-PCR in the complete ‘232’ × ‘1392’ population composed of 93 siblings and the two parental lines. 51 lines (~54%) showed no detectable expression of FaFAD1, with threshold cycles (Ct) similar (± 2) to the no-template control and relative expression levels ranging from 0.01 to 0.58 (Figure 4). The rest of the lines showed different levels of expression that ranged from 7 to 247 times the average expression in the population (Figure 4). Furthermore, co-segregation between high/no FaFAD1 transcription and γ-decalactone content was observed (Figure 4; Additional file 1: Figure S4A). Strikingly, primers used for qRT-PCR of FaFAD1 failed to amplify in genomic DNA from each line producing fruits without γ-decalactone (Additional file 1: Figure S3A).
Several FAD2 genes show a seed specific expression, having a role in seed storage fatty acids, while others have an ubiquitous expression, being then involved in the general biosynthesis of membrane fatty acids . This correlation between expression and function of other desaturases prompted us to analyze the expression pattern of FaFAD1 in order to determine whether it is correlated with the induction of γ-decalactone production during the last stages of fruit ripening. The expression in different tissues and during fruit ripening was analyzed by qRT-PCR in the commonly cultivated Chandler cultivar. In this cultivar, the expression level of FaFAD1 in red fruits was similar to that of line ‘1392’ (Additional file 1: Figure S3B, C), consistent with both genotypes producing γ-decalactone in ripe fruits. In contrast, expression of FaFAD1 was not detected in ‘232’ and ‘Camarosa’ by qRT-PCR, indicative of both cultivars not producing γ-decalactone. As shown in Figure 5A, FaFAD1 increased its expression ~150-fold between white and red fruit, consistent with the biosynthesis of γ-decalactone during the late stages of fruit ripening. Supporting a specific role of FaFAD1 in ripe fruits, no expression was detected in leaves, and very low expression of FaFAD1 was detected in roots, green and white fruits.
Collocation of QTL for γ-decalactone content and eQTL for candidate genes
Based in their predicted function, we selected two additional candidate genes for further analyses within the top 25 highly up-regulated genes (Table 2). The fourth transcript in the list, gene17831-v1.0-hybrid, showed 17-fold (4.1 log2_fold_change) higher expression in the high-γ-decalactone pool (Table 2; Additional file 3). The predicted protein sequence has high similarity to the cytochrome P450, family 81, and contains the p450 superfamily conserved domain (E-value 8.37e-96) and the PLN02183 5-hydroxylase multidomain (E-value 1.22e-70). The gene02395-v1.0-hybrid, at the 23rd position in Table 2, showed a 5-fold up-regulation and encodes for a predicted protein with high similarity to the cytochrome P450, family 79, subfamily A and thus, also contains the p450 superfamily conserved domain (E-value 3.60e-40) and several hydroxylase domains such as PLN03018 (E-value 6.14e-125). The differential gene expression observed for these two transcripts was validated by qRT-PCR, obtaining a 14.8 and 6.5 fold-change in their expression between the pools for gene17831 and gene02395, respectively (Additional file 1: Figure S2). Since these two genes encode for protein sequences with high homology to CYP hydroxylases, we further investigated their possible association to γ-decalactone biosynthesis by analyzing their expression in parental and all progeny lines of the ‘232’ × ‘1392’ mapping population (Additional file 1: Figure S4). A significant level of correlation between the transcript level of the F. × ananassa gene corresponding to gene17831-v1.0-hybrid (hereafter referred to as FaFAH1) and γ-decalactone content was observed (Pearson correlation = 0.45). On the contrary, no association between the transcript level of gene02395-v1.0-hybrid (hereafter referred to as FaCYP1) and γ-decalactone content was observed. Furthermore, expression profiling in different tissues by qRT-PCR showed that FaFAH1 is expressed in leaf and ripe fruits while FaCYP1 was highly expressed in leaf and to a much lesser extend in green fruit (Figure 5B, C). These results are consistent with FaFAH1 but not FaCYP1 having a possible role in γ-decalactone accumulation in ripening strawberries.
Transcript expression levels measured for genes in a mapping population allow them to be treated as traits for gene expression QTL (eQTL) analysis. The locations of eQTL that regulate gene expression can be correlated with those of QTL for traditional phenotypic traits and so provide additional clues as to the genetic basis of quantitative genetic variation . The complete correlation observed between high or no FaFAD1 expression and γ-decalactone content (Figure 4; Additional file 1: Figure S4A) indicates that an eQTL controlling the expression of FaFAD1 collocates with both the position of the gene and the phenotypic trait. In order to evaluate this prediction and to test whether the differential expression of FaFAH1 or FaCYP1 could be correlated with the position of the locus controlling γ-decalactone content, we next used the qRT-PCR quantitative data for eQTL analysis for the three genes. As a positive control we re-analyzed the QTL for γ-decalactone content . Interestingly, QTL analysis performed with both the non-parametric Kruskal-Wallis test and interval mapping using the integrated map of  resulted in the identification of the same eQTL for both FaFAD1 and FaFAH1 expression (Figure 6; Additional file 1: Table S5). The eQTL for FaFAD1 expression, as expected, collocated with both the position of the QTL controlling γ-decalactone content and the position of the gene FaFAD1 at the bottom of LG III-2 and accounted for 90% of the variation (Additional file 1: Table S5). Interestingly, an eQTL controlling 55% of the variation in the expression of FaFAH1 was detected also in LG III-2, at the exact same position as where FaFAD1 (and the locus controlling γ-decalactone content) is mapped. The position of FaFAH1 is predicted to be in one LG of HG I based in the F. vesca genome sequence (Table 2), implying that the eQTL at LG III, which most likely is the gene FaFAD1, is regulating the expression of FaFAH1. When we analyzed the expression of FaCYP1, an eQTL was detected at the top of a different LG belonging to the same homoeology group III (Figure 6; Additional file 1: Table S5). The position of the eQTL for FaCYP1 matches the position of the gene based in the F. vesca genome sequence (Table 2). This data is consistent with the lack of association previously found between the transcript level of FaCYP1 and γ-decalactone content.
Two bulked pools of segregants representing the phenotypic extremes within a relatively large population displaying wide variation for a given trait would only differ at the locus controlling the trait. Although bulk segregant analysis (BSA) has generally being used to tag genes controlling Mendelian traits, the method can also be used to identify major QTL . The applicability of BSA to RNA-seq was recently demonstrated by mapping the maize mutant gene gl3. Here we report the combination of BSA and RNA-seq as a powerful and valid approach for quantifying differential transcript expression and for cost-efficient identification of genes underlying γ-decalactone variation in cultivated strawberry. Once we fine mapped the locus to the bottom of chromosome 3, the assumptions made for candidate genes were that (1) the genes must show low or no expression in fruits without γ-decalactone while in fruits producing this VOC had to be high and (2) the gene must encode for an enzyme involved in the biosynthesis of this volatile, based on the proposed pathways, or should encode for a regulatory protein. Out of the 33,458 analyzed transcripts, only gene24414 fulfilled both requirements. This gene encodes for a protein, FaFAD1, with extensive similarity to delta-twelve fatty acid desaturases, enzymes that catalyze the regioselective introduction of a double bond at the Δ12 position during lipid biosynthesis . Therefore, the activity of this protein could supply fatty acid precursors for lactone biosynthesis. The sequence alignment of FaFAD1 with other desaturases revealed the presence of three conserved histidine boxes reported to be essential for the catalysis, and proposed to be the ligands for the iron atoms involved in the formation of the di-iron-oxygen complex. Interestingly, the deduced FaFAD1 protein is shorter than the rest of FAD proteins and neither the dilysine nor the aromatic amino acid-enriched retrieval signal (-YKNKF) are present at the C-terminus of FaFAD1 (Figure 2). One of these motifs is necessary for maintaining localization of the enzymes in the endoplasmic reticulum (ER) . However, a PSORT algorithm (http://wolfpsort.org) predicts that FaFAD1 is targeted to the ER with a certainty of 8.0, consistent with the six transmembrane domains predicted for FaFAD1.
In addition to a desaturase activity, a number of FAD2 variants are known to possess diversified functionalities, catalyzing hydroxylations, epoxidations, or the formation of acetylenic and conjugated double bonds [35, 41, 42]. Some other FAD2 enzymes have bifunctional hydroxylase/desaturase or even tri-functional activities [43, 44]. A close homologue to FaFAD1 in peach, PpFAD1B-6, has been proposed to be involved in lactone production in fruits . This enzyme inserts a double bond between carbon 12 and 13 of monounsaturated oleic acid to generate polyunsaturated linoleic acid, but do not have any detectable hydroxylase activity. However, FaFAD1 is phylogenetically located in a different clade and more closely related to the castor bean hydroxylase RcFAH12 . Seven amino acid residues that differ between oleate desaturases and hydroxylases have been identified and the substitutions of alanine 148 and methionine 324 of the Arabidopsis AtFAD2 by isoleucines, as found in RcFAH12 or Lesquerella fendleri hydroxylase/desaturase (LfFAH12), caused a substantial shift in catalytic activity [45, 46]. Interestingly, these two isoleucines are conserved between FaFAD1 and hydroxylases, suggesting that the strawberry gene could encode for a bifunctional enzyme (Figure 2).
The expression profiling of FaFAD1 in different tissues showed that the gene is highly expressed and specific of red fruit of lines with high γ-decalactone content. Therefore, the expression is highly correlated with γ-decalactone biosynthesis, which occurs at the late stages of fruit ripening . In addition, the correlation of FaFAD1 expression with γ-decalactone content in the mapping population, the coincident map position between γ-decalactone and FaFAD1 and the predicted enzymatic activity of FaFAD1 protein indicate that this gene is responsible for the natural variation of this VOC in strawberry. Furthermore, it can be stated that the absence or extremely low levels of γ-decalactone in fruits of half of the population lines is a consequence of the absence or extremely low levels of FaFAD1 expression in these lines. The same FaFAD1 allele was detected in both bulked pools either using the reference genome to map the reads or after de novo assembly. The differential expression of FaFAD1 observed between both pools was alike using both methods (Additional file 1: Table S6) and was also validated by qRT-PCR (Additional file 1: Figure S2; Table S6). However, when the progeny lines were analyzed independently, FaFAD1 expression was not detected by qRT-PCR in fruits without γ-decalactone. Furthermore, different FaFAD1 primer pairs failed to amplify in genomic DNA of these lines (Additional file 1: Figure S3; see also companion manuscript), suggesting that the FaFAD1 gene may not be present in their genome. Taking these results together, the most plausible explanation is that the no γ-decalactone pools had some contamination during processing with some fruits containing the volatile.
Two other candidate genes were studied on their potential contribution to γ-decalactone production based on their increased expression in the high γ-decalactone pool and the annotated enzymatic activity. While FaCYP1 was not associated to γ-decalactone content, the gene FaFAH1 was up-regulated during fruit ripening. Our eQTL analysis of FaFAD1 and FaFAH1 indicate that both are associated with γ-decalactone. While the association of FaFAD1 expression with γ-decalactone is complete, FaFAH1 only shows a high association with γ-decalactone. When one eQTL maps in the same genetic location as the gene whose transcript is being measured, as it is the case for FaFAD1, is generally caused by cis-acting regulatory polymorphisms in the gene (cis-eQTL). Most probably through a polymorphism in the promoter region, which in turn gives rise to differential expression. In contrast, eQTL that do not map to the location of the gene being assayed, such as for FaFAH1, most likely represent trans-acting regulators (trans-eQTL) that may control the expression of a number of genes elsewhere in the genome . Based in the predicted function of FaFAD1 and FaFAH1, we propose that the pathway for γ-decalactone biosynthesis in fruits proceeds through hydration of unsaturated fatty acids. In this proposed model, the enzyme FaFAD1 would catalyze the conversion of oleic acid (18:1) to linoleic acid (18:2) by the introduction of a double bond at the Δ12 position, as performed by other FAD2 enzymes. Additionally, FaFAD1 may possess hydroxylase activity, catalyzing the hydroxylation of oleic acid to ricinoleic acid. The fact that an eQTL for FaFAH1 expression was detected at the position where FaFAD1 maps suggests that FaFAD1, or most likely the product of FaFAD1 activity (i.e. linoleic acid), up-regulates the expression of FaFAH1, which may encode for the enzyme catalyzing the next reaction in the biosynthetic pathway. This reaction most probably is a hydroxylation although some CYP related enzymes have been shown to have epoxidase activity . Ricinoleic acid derivative is then shortened by four β-oxidation cycles to form the corresponding 4-hydroxy acid. The last step in γ-decalactone biosynthesis involves the cyclation of the molecule either by an enzyme with alcohol acyl-transferase activity or by spontaneous lactonisation under acid conditions .
Understanding the basis of volatile organic compound (VOC) biosynthesis and regulation is of utmost importance for the genetic improvement of fruit flavor. This study provides genetic and molecular data on how the content of γ-decalactone is naturally controlled in strawberry and highlights enzymatic activities necessary for the formation of this VOC in fruits. γ-decalactone has been shown to be a sensory important VOC for strawberry flavor [17, 49]. However, other important functions of volatiles are to defend plants against pathogens, to attract pollinators, seed dispersers, and other beneficial animals and microorganisms, and to serve as signals in plant–plant interaction . GO enrichment analysis for the genes up-regulated in fruits without γ-decalactone detected a significant enrichment in GO categories related to response to pathogens. One plausible explanation is that this lactone could have anti-pathogen activity and, in its absence, up-regulation of other mechanisms of biotic stress responses would compensate the lack of this VOC. In this context, γ-decalactone has been shown to be toxic to yeast and bacteria through its capacity for permeabilizing membranes . These data suggest that this VOC might have a function in this process, a possibility that deserves further investigation.
Availability of supporting data
The data sets supporting the results of this article are included within the article (and its additional files) and raw RNA-seq reads available in the European Nucleotide Archive (ENA) repository under accession PRJEB5430 (http://www.ebi.ac.uk/ena/data/view/PRJEB5430).
Pérez A, Sanz A: Strawberry flavor. Handbook of Fruit and Vegetable Flavors. Edited by: Hui HY. 2010, Hoboken, New Jersey: John Wiley & Sons, Inc, 437-455.
Latrasse A: Fruits III. Volatile Compounds in Fruits and Beverages. Edited by: Maarse H. 1991, New York: Marcek Dekker, Inc, 329-387.
Zabetakis I, Holden MA: Strawberry flavour: analysis and biosynthesis. J Sci Food Agric. 1997, 74: 421-434. 10.1002/(SICI)1097-0010(199708)74:4<421::AID-JSFA817>3.0.CO;2-6.
Ménager I, Jost M, Aubert C: Changes in physicochemical characteristics and volatile constituents of strawberry (Cv. Cigaline) during maturation. J Agric Food Chem. 2004, 52: 1248-1254. 10.1021/jf0350919.
Jetti RR, Yang E, Kurnianta A, Finn C, Qian MC: Quantification of selected aroma-active compounds in strawberries by headspace solid-phase microextraction gas chromatography and correlation with sensory descriptive analysis. J Food Sci. 2007, 72: S487-S496. 10.1111/j.1750-3841.2007.00445.x.
Zorrilla-Fontanesi Y, Rambla J-L, Cabeza A, Medina JJ, Sánchez-Sevilla JF, Valpuesta V, Botella MA, Granell A, Amaya I: Genetic analysis of strawberry fruit aroma and identification of O-methyltransferase FaOMT as the locus controlling natural variation in mesifurane content. Plant Physiol. 2012, 159: 851-870. 10.1104/pp.111.188318.
Osorio S, Muñoz C, Valpuesta V: Physiology and biochemistry of fruit flavors. Handbook of Fruit and Vegetable Flavors. Edited by: Hui HY. 2010, Hoboken, New Jersey: John Wiley & Sons, Inc, 25-43.
Aragüez I, Valpuesta Fernández V: Metabolic engineering of aroma components in fruits. Biotechnol J. 2013, 8: 1144-1158.
Schwab W, Davidovich-Rikanati R, Lewinsohn E: Biosynthesis of plant-derived flavor compounds. Plant J. 2008, 54: 712-732. 10.1111/j.1365-313X.2008.03446.x.
Schöttler M, Boland W: Biosynthesis of dodecano-4-lactone in ripening fruits: crucial role of an epoxide-hydrolase in enantioselective generation of aroma components of the nectarine (Prunus persica var. nucipersica) and the strawberry (Fragaria ananassa). Helv Chim Acta. 1996, 79: 1488-1496. 10.1002/hlca.19960790521.
Olbricht K, Grafe C, Weiss K, Ulrich D: Inheritance of aroma compounds in a model population of Fragaria × ananassa Duch. Plant Breed. 2008, 127: 87-93.
Douillard C, Guichard E: Comparison by multidimensional analysis of concentrations of volatile compounds in fourteen frozen strawberry varieties [aroma, furaneol, mesifurane]. Sci Aliment. 1989, 9: 53-76.
Husain Q: Chemistry and biochemistry of some vegetable flavors. Handbook of Fruit and Vegetable Flavors. Edited by: Hui HY. 2010, Hoboken, New Jersey: John Wiley & Sons, Inc, 575-625.
Sánchez G, Venegas-Calerón M, Salas JJ, Monforte A, Badenes ML, Granell A: An integrative “omics” approach identifies new candidate genes to impact aroma volatiles in peach fruit. BMC Genomics. 2013, 14: 343-10.1186/1471-2164-14-343.
Xi W-P, Zhang B, Liang L, Shen J-Y, Wei W-W, Xu C-J, Allan AC, Ferguson IB, Chen K-S: Postharvest temperature influences volatile lactone production via regulation of acyl-CoA oxidases in peach fruit. Plant Cell Environ. 2012, 35: 534-545. 10.1111/j.1365-3040.2011.02433.x.
Eduardo I, Chietera G, Pirona R, Pacheco I, Troggio M, Banchi E, Bassi D, Rossini L, Vecchietti A, Pozzi C: Genetic dissection of aroma volatile compounds from the essential oil of peach fruit: QTL analysis and identification of candidate genes using dense SNP maps. Tree Genetics & Genomes. 2013, 9: 189-204. 10.1007/s11295-012-0546-z.
Larsen M, Poll L, Olsen C: Evaluation of the aroma composition of some strawberry (Fragaria ananassa Duch) cultivars by use of odour threshold values. Zeitschrift für Lebensmittel untersuchung und Forschung. 1992, 195: 536-539. 10.1007/BF01204558.
Rousseau-Gueutin M, Lerceteau-Kohler E, Barrot L, Sargent DJ, Monfort A, Simpson D, Arus P, Guerin G, Denoyes-Rothan B: Comparative genetic mapping between octoploid and diploid fragaria species reveals a high level of colinearity between their genomes and the essentially disomic behavior of the cultivated octoploid strawberry. Genetics. 2008, 179: 2045-2060. 10.1534/genetics.107.083840.
Zorrilla-Fontanesi Y, Cabeza A, Domínguez P, Medina JJ, Valpuesta V, Denoyes-Rothan B, Sánchez-Sevilla JF, Amaya I: Quantitative trait loci and underlying candidate genes controlling agronomical and fruit quality traits in octoploid strawberry (Fragaria × ananassa). Theor Appl Gen. 2011, 123: 755-778. 10.1007/s00122-011-1624-6.
Isobe SN, Hirakawa H, Sato S, Maeda F, Ishikawa M, Mori T, Yamamoto Y, Shirasawa K, Kimura M, Fukami M, Hashizume F, Tsuji T, Sasamoto S, Kato M, Nanri K, Tsuruoka H, Minami C, Takahashi C, Wada T, Ono A, Kawashima K, Nakazaki N, Kishida Y, Kohara M, Nakayama S, Yamada M, Fujishiro T, Watanabe A, Tabata S: Construction of an integrated high density simple sequence repeat linkage map in cultivated strawberry (fragaria x ananassa) and its applicability. DNA Res. 2013, 20: 79-92. 10.1093/dnares/dss035.
Sargent DJ, Passey T, Šurbanovski N, Lopez Girona E, Kuchta P, Davik J, Harrison R, Passey A, Whitehouse AB, Simpson DW: A microsatellite linkage map for the cultivated strawberry (Fragaria × ananassa) suggests extensive regions of homozygosity in the genome that may have resulted from breeding and selection. Theor Appl Gen. 2012, 124: 1229-1240. 10.1007/s00122-011-1782-6.
Bombarely A, Merchante C, Csukasi F, Cruz-Rus E, Caballero JL, Medina-Escobar N, Blanco-Portales R, Botella MA, Muñoz-Blanco J, Sánchez-Sevilla JF, Valpuesta V: Generation and analysis of ESTs from strawberry (Fragaria xananassa) fruits and evaluation of their utility in genetic and molecular studies. BMC Genomics. 2010, 11: 503-10.1186/1471-2164-11-503.
Shulaev V, Sargent DJ, Crowhurst RN, Mockler TC, Folkerts O, Delcher AL, Jaiswal P, Mockaitis K, Liston A, Mane SP, Burns P, Davis TM, Slovin JP, Bassil N, Hellens RP, Evans C, Harkins T, Kodira C, Desany B, Crasta OR, Jensen RV, Allan AC, Michael TP, Setubal JC, Celton J-M, Rees DJG, Williams KP, Holt SH, Rojas JJR, Chatterjee M, et al: The genome of woodland strawberry (Fragaria vesca). Nature Gen. 2011, 43: 109-116. 10.1038/ng.740.
Martin LBB, Fei Z, Giovannoni JJ, Rose JKC: Catalyzing plant science research with RNA-seq. Front Plant Sci. 2013, 4: 66-
Higgins J, Magusin A, Trick M, Fraser F, Bancroft I: Use of mRNA-seq to discriminate contributions to the transcriptome from the constituent genomes of the polyploid crop species Brassica napus. BMC Genomics. 2012, 13: 1-1. 10.1186/1471-2164-13-1.
Manning K: Isolation of nucleic acids from plants by differential solvent precipitation. Anal Biochem. 1991, 195: 45-50. 10.1016/0003-2697(91)90292-2.
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL: TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013, 14: R36-10.1186/gb-2013-14-4-r36.
Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L: Differential analysis of gene regulation at transcript resolution with rNA-seq. Nat Biotechnol. 2012, 31: 46-53. 10.1038/nbt.2450.
Conesa A, Götz S: Blast2GO: A comprehensive suite for functional analysis in plant genomics. Int J Plant Genom. 2008, 2008: 619832-
Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, Hacohen N, Gnirke A, Rhind N, di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A: Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011, 29: 644-652. 10.1038/nbt.1883.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
Pfaffl MW: A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res. 2001, 29: 2002-2007.
Salvatierra A, Pimentel P, Moya-León MA, Caligari PDS, Herrera R: Comparison of transcriptional profiles of flavonoid genes and anthocyanin contents during fruit development of two botanical forms of Fragaria chiloensis ssp. chiloensis. Phytochemistry. 2010, 71: 1839-1847. 10.1016/j.phytochem.2010.08.005.
Okuley J, Lightner J, Feldmann K, Yadav N, Lark E, Browse J: Arabidopsis FAD2 gene encodes the enzyme that is essential for polyunsaturated lipid synthesis. Plant Cell. 1994, 6: 147-158.
van de Loo FJ, Broun P, Turner S, Somerville C: An oleate 12-hydroxylase from Ricinus communis L. is a fatty acyl desaturase homolog. Proc Natl Acad Sci U S A. 1995, 92: 6743-6747. 10.1073/pnas.92.15.6743.
Hernández ML, Mancha M, Martínez-Rivas JM: Molecular cloning and characterization of genes encoding two microsomal oleate desaturases (FAD2) from olive. Phytochemistry. 2005, 66: 1417-1426. 10.1016/j.phytochem.2005.04.004.
Kliebenstein D: Quantitative genomics: analyzing intraspecific variation using global gene expression polymorphisms or eQTLs. Annu Rev Plant Biol. 2009, 60: 93-114. 10.1146/annurev.arplant.043008.092114.
Collard BCY, Jahufer MZZ, Brouwer JB, Pang ECK: An introduction to markers, quantitative trait loci (QTL) mapping and marker-assisted selection for crop improvement: the basic concepts. Euphytica. 2005, 142: 169-196. 10.1007/s10681-005-1681-5.
Liu S, Yeh C-T, Tang HM, Nettleton D, Schnable PS: Gene mapping via bulked segregant RNA-Seq (BSR-Seq). PLoS ONE. 2012, 7: e36406-10.1371/journal.pone.0036406.
McCartney AW, Dyer JM, Dhanoa PK, Kim PK, Andrews DW, McNew JA, Mullen RT: Membrane-bound fatty acid desaturases are inserted co-translationally into the ER and contain different ER retrieval motifs at their carboxy termini. Plant J. 2004, 37: 156-173. 10.1111/j.1365-313X.2004.01949.x.
Shanklin J, Cahoon EB: Desaturation and related modifications of fatty acids 1. Annu Rev Plant Physiol Plant Mol Biol. 1998, 49: 611-641. 10.1146/annurev.arplant.49.1.611.
Shanklin J, Guy JE, Mishra G, Lindqvist Y: Desaturases: emerging models for understanding functional diversification of diiron-containing enzymes. J Biol Chem. 2009, 284: 18559-18563. 10.1074/jbc.R900009200.
Broun P, Boddupalli S, Somerville C: A bifunctional oleate 12-hydroxylase: desaturase from Lesquerella fendleri. Plant J. 1998, 13: 201-210. 10.1046/j.1365-313X.1998.00023.x.
Cao S, Zhou X-R, Wood CC, Green AG, Singh SP, Liu L, Liu Q: A large and functionally diverse family of Fad2 genes in safflower (Carthamus tinctorius L.). BMC Plant Biol. 2013, 13: 5-10.1186/1471-2229-13-5.
Broadwater JA, Whittle E, Shanklin J: Desaturation and hydroxylation. Residues 148 and 324 of Arabidopsis FAD2, in addition to substrate chain length, exert a major influence in partitioning of catalytic specificity. J Biol Chem. 2002, 277: 15613-15620. 10.1074/jbc.M200231200.
Broun P, Shanklin J, Whittle E, Somerville C: Catalytic plasticity of fatty acid modification enzymes underlying chemical diversity of plant lipids. Science. 1998, 282: 1315-1317.
Pinot F, Beisson F: Cytochrome P450 metabolizing fatty acids in plants: characterization and physiological roles. FEBS J. 2010, 278: 195-205.
Waché Y, Aguedo M, Nicaud JM, Belin JM: Catabolism of hydroxyacids and biotechnological production of lactones by Yarrowia lipolytica. Appl Microbiol Biotechnol. 2003, 61: 393-404. 10.1007/s00253-002-1207-1.
Ubeda C, San-Juan F, Concejero B, Callejón RM, Troncoso AM, Morales ML, Ferreira V, Hernández-Orte P: Glycosidically bound aroma compounds and impact odorants of four strawberry varieties. J Agric Food Chem. 2012, 60: 6095-6102. 10.1021/jf301141f.
Dudareva N, Pichersky E: Metabolic engineering of plant volatiles. Curr Opin Biotechnol. 2008, 19: 181-189. 10.1016/j.copbio.2008.02.011.
We are grateful to Aureliano Bombarely for advice on bioinformatics analysis of RNA-seq data and to the Centro Nacional de Análisis Genómico (CNAG) for Illumina sequencing. This work was supported by the Spanish Ministry of Economy and Competitivity and FEDER (grant numbers AGL2012-40066, BIO2010-15630), the EUBerry Project (EU FP7 KBBE–2010-4 Grant Agreement number 265942) and by a Marie Curie International Outgoing Fellowship within the 7th European Community Framework Programme to I.A (IOF Flavor 328052).
The authors would like to declare that they have no financial or non-financial competing interests in the publication of this manuscript.
JS-S carried out the RNA-seq analysis and bioinformatics studies and participated to draft the manuscript. EC-R participated in qRT-PCR analysis and gene cloning. VV participated in the sequence alignment and helped to draft the manuscript. MB participated in the design of the study and helped to draft the manuscript. IA conceived of the study, carried out the genetic analyses and RNA isolation, participated in its design and coordination and drafted the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Figure S1: Distribution of differentially expressed genes in major functional terms (GO terms) for categories Biological Process, Molecular Function and Cellular Component. Figure S2. Validation of RNA-seq data by qRT-PCR. Figure S3. Polymorphism of FaFAD1 and expression analysis in ripe fruits of different accessions of Fragaria × ananassa. Figure S4. Expression profiles of candidate genes in the ‘232’ x ‘1392’ mapping population by real time qRT-PCR in comparison to γ-decalactone content in fruits. Table S1. Primers used in qRT-PCR. Table S2. Summary of read alignments for the three high-γ-decalactone (H γ-DEC) and three not producing γ-decalactone (No γ-DEC) biological replicates. Table S3. GO enrichment analysis by mean of Fisher’s exact test with the sets of up-regulated genes/locus (highly expressed in high-γ-decalactone pool) in comparison to general model of Fragaria vesca. Table S4. GO enrichment analysis by mean of Fisher’s exact test with the sets of down-regulated genes/locus (higher expression in the No-γ-decalactone pool) in comparison to general model of Fragaria vesca. Table S5. QTL detected in the ‘232’ × ‘1392’ strawberry population controlling the content of γ-decalactone and eQTL controlling the expression of FaFAD1, FaFAH1 and FaCYP1 based on Kruskal-Wallis (K-W) and interval mapping (IM). Table S6. Expression of gene24414-v1.0-hybrid in each of the biological replicates by three different approaches. (PDF 1 MB)
Additional file 2: List of the SSR, SNP, AFLP and gene markers mapped in the HG III along with DArT sequence, SNP position and their genotypes in the 232 × 1392 population.(XLSX 251 KB)
Authors’ original submitted files for images
About this article
Cite this article
Sánchez-Sevilla, J.F., Cruz-Rus, E., Valpuesta, V. et al. Deciphering gamma-decalactone biosynthesis in strawberry fruit using a combination of genetic mapping, RNA-Seq and eQTL analyses. BMC Genomics 15, 218 (2014). https://doi.org/10.1186/1471-2164-15-218