Integrating transcriptomics and metabolomics for the analysis of the aroma profiles of Saccharomyces cerevisiae strains from diverse origins
BMC Genomics volume 18, Article number: 455 (2017)
During must fermentation thousands of volatile aroma compounds are formed, with higher alcohols, acetate esters and ethyl esters being the main aromatic compounds contributing to floral and fruity aromas. The action of yeast, in particular Saccharomyces cerevisiae, on the must components will build the architecture of the wine flavour and its fermentation bouquet. The objective of the present work was to better understand the molecular and metabolic bases of aroma production during a fermentation process. For such, comparative transcriptomic and metabolic analysis was performed at two time points (5 and 50 g/L of CO2 released) in fermentations conducted by four yeast strains from different origins and/or technological applications (cachaça, sake, wine, and laboratory), and multivariate factorial analyses were used to rationally identify new targets for improving aroma production.
Results showed that strains from cachaça, sake and wine produced higher amounts of acetate esters, ethyl esters, acids and higher alcohols, in comparison with the laboratory strain. At fermentation time T1 (5 g/L CO2 released), comparative transcriptomics of the three S. cerevisiae strains from different fermentative environments in comparison with the laboratory yeast S288c, showed an increased expression of genes related with tetracyclic and pentacyclic triterpenes metabolism, involved in sterol synthesis. Sake strain also showed upregulation of genes ADH7 and AAD6, involved in the formation of higher alcohols in the Ehrlich pathway. For fermentation time point T2 (50 g/L CO2 released), again sake strain, but also VL1 strain, showed an increased expression of genes involved in formation of higher alcohols in the Ehrlich pathway, namely ADH7, ADH6 and AAD6, which is in accordance with the higher levels of methionol, isobutanol, isoamyl alcohol and phenylethanol observed.
Our approach revealed successful to integrate data from several technologies (HPLC, GC-MS, microarrays) and using different data analysis methods (PCA, MFA). The results obtained increased our knowledge on the production of wine aroma and flavour, identifying new gene in association to the formation of flavour active compounds, mainly in the production of fatty acids, and ethyl and acetate esters.
Wine flavour is the result of the interactions between grape must components and compounds originated from microbial metabolism. Grape must is constituted by three functional groups of compounds: nutrients, flavour precursors and flavour-active non-precursors. The action of yeasts on some of these compounds, will build the architecture of the wine flavour and their fermentation bouquet. Over the past 30 years, the huge increase in the understanding of Saccharomyces cerevisiae metabolism, namely of industrial yeast strains  has revealed its crucial role in the development of the wine secondary aroma, with higher alcohols, acetate esters and ethyl esters being the main aromatic compounds contributing to a floral and fruity aroma . Generally, wine yeast strains can be responsible for “fruity”, “floral”, “neutral”, or “cheesy”–“rancid” wine aromas, depending on their capacity to produce esters, higher alcohols, and volatile fatty acids . The selection of the best wine yeast depends essentially on its oenological/phenotypic characteristics, such as fermentative rate, tolerance to ethanol and to SO2, response to temperature, flocculent characteristics, the presence of killer factor, malic acid metabolism and the production of several fermentation by-products, such as acetic acid, H2S, higher alcohols, glycerol and acetaldehyde [4,5,6,7,8]. A large variety of mechanisms, including heterozygosity, nucleotide and structural variations, introgressions, horizontal gene transfer and hybridization, contribute to the genetic and phenotypic diversity of S. cerevisiae wine yeasts [9,10,11,12], and several domestication fingerprints have been identified in their genomes . Many researchers have studied the influence in the fermentation process of manipulating single genes through their deletion or over-expression, in order to clarify or to improve pathways involved in winemaking [14,15,16,17]. Some studies showed that wine strains adapt to specific oenological environments during their selection for biotechnological purposes, which is reflected in their transcriptome, proteome and metabolome [18,19,20]. On the other hand, transcriptome studies have been implemented using industrial yeast strains under winemaking conditions. These studies include gene expression analyses during alcoholic fermentation [20,21,22,23] and during exposure to a diversity of stresses such as high ethanol concentrations , low temperature , and high-sugar concentrations . Gene expression is variable among wild-type yeast strains and it was shown that differences in gene expression during fermentation affected co-regulated genes and distinguished yeast strains . Besides, winemaking strains deal better with stress-imposing environmental conditions and are able to manage nutrient deficiencies, such as nitrogen, in a more efficient and resourceful way suggesting a better adaptation to the specific stresses imposed. In order to understand the wine yeast aromatic profile, metabolomic tools are available and are commonly used. The study of metabolome includes the analysis of a wide variety of chemical compounds, usually present at very low concentrations, which is a major barrier for appropriate bioanalytical approaches. The analysis of the metabolic profile has been performed using several analytical platforms, such as gas-chromatography (GC) or liquid-chromatography (LC) coupled to mass-spectroscopy (MS) [28,29,30], capillary electrophoresis (CE) coupled to MS [31,32,33,34], infrared and Raman spectroscopy , nuclear magnetic resonance (NMR) spectroscopy [36,37,38] and direct injection MS (DIMS) [39, 40]. GC-MS analysis has been one of the best accepted approaches to study wine metabolome, with several advantages: sensitivity, robustness, easiness of use, low cost and ample linear range [41,42,43,44]. GC-MS combines advantages of both technologies: while MS provides individual mass spectra that can differentiate between chemically diverse metabolites, GC has high separation efficiency. The integration of the several “omic” approaches could be used to understand the variability existing within S. cerevisiae strains and to explore the molecular mechanisms underlying that variability.
In the present work we performed a comparative transcriptomic analysis of four S. cerevisiae strains from different origins and/or technological applications (wine, sake, cachaça and laboratory) at two time points during a must fermentation process and analysed the aroma profile of the fermented musts at each time point, in order to establish a correlation between gene expression and metabolite production. These strains were chosen from a larger collection as being from heterogeneous origins and displaying the biggest phenotypic differences , aiming to get a clearer association between flavour compounds production and gene expression.
Yeast strains and culture media
Four Saccharomyces cerevisiae strains were used in this study, in particular the commercial strain Zymaflore® VL1 (Laffort oenologie®), the cachaça strain Z63 (kindly provided by Rogélio Brandão), the sake strain Z23 (kindly provided by Gianni Liti)  and the laboratory strain S288c. Strains were grown at 28 °C, and routinely maintained at 4 °C on YPD plates containing 2% glucose (w/v), 2% peptone (w/v), 1% yeast extract (w/v) and 2% agar (w/v), and in glycerol (30%, v/v) stocks at −80 °C.
In this study, we used a natural must and a synthetic culture medium. The natural must was harvested in 2012 in the south of France (Maccabeu), flash-pasteurized and stored under sterile conditions. It contained 211 g/L of sugar and 213 mg/L of assimilable nitrogen. As a synthetic must, the MS300 (MS) medium  was used due to the fact that it mimics the grape musts to prepare the cells for fermentation. We inoculated 50 mL flasks containing 30 mL of YPD with cells from a Petri dish with YPD and incubated them overnight at 28 °C under stirring. Cells were then transferred to 1 L flasks containing 500 mL of MS medium in a final concentration of 2 × 106 cells/mL and incubated at 28 °C with continuous stirring. The fermentation cultures in MS medium were inoculated with 2 × 106 cells/mL in 1.1 L fermentors containing 900 mL of natural must.
Fermentations were performed in 1 L fermenters (NH verre) equipped with a fermentor condenser, at 20 °C, stirred continuously (100 rpm) and linked to a mass flow meter that measured the CO2 release rate online. CO2 release was determined by automatic measurements of fermentor weight every 20 min. The rate of CO2 production, dCO2/dt, is the first derivative of the amount of CO2 produced over time and was calculated automatically by polynomial smoothing of the CO2 production curve . Fermentation experiments were performed in triplicate.
Glucose, glycerol, ethanol, pyruvate, succinic, acetic and α-ketoglutaric acids levels were analysed by high-pressure liquid chromatography (HPLC), with an Rezex ROA - Organic Acid column (Phenomenex) at 45 °C. The column was eluted with 4 mM H2SO4 at a flow rate of 0.6 mL/min. Dual detection was performed with a refractometer and a UV detector (Agilent).
Volatile aroma compounds were analyzed by GC-MS after extraction as previously described . Briefly, deuterated internal standards (100 μg/L) were added to samples (5 mL) before twice extraction using 1 mL of dichloromethane. The organic phases were dried over anhydrous sodium sulphate and concentrated under nitrogen flux. Extracts were analyzed with a Hewlett Packard (Agilent Technologies, Santa Clara, California, USA) 6890 gas chromatograph coupled to a HP 5973 mass spectrometer.
RNA isolation and sample labelling
Cells (1 × 109 cells) were harvested at two time points - 5 g/L and 50 g/L of CO2 released - by centrifugation at 1000 g for 5 min at 4 °C and the cell pellets were washed with DEPC-treated water and then frozen in methanol at −80 °C. Total RNA was extracted with Trizol reagent (Gibco BRL, Life Technologies) and was purified with the RNeasy kit (Qiagen). The quantity and the quality of the extracted RNA were checked by spectrometry (NanoDrop 1000, Thermo Scientific). We used the Agilent 8x15k gene expression microarrays (Design ID 016322, Agilent Technologies, Santa Clara, CA, USA) according to the manufacturer’s instructions. Fluorescent cRNAs were synthesized from 100 ng of total RNA using the One color RNA Spike-In kit (Agilent Technologies). Labeled cRNA was purified with the RNeasy Kit (Qiagen). Microarrays were hybridized for 17 h at 65 °C in a rotating hybridization oven (Corning), with the Gene Expression Hybridization kit (Agilent). The hybridization signal was detected with a GenePix 4000B laser scanner (Axon Instruments).
Statistical analyses were performed using R software, version 3.0.3 . To obtain a general overview of the production of volatile compounds during the fermentation for each stage of fermentation (T1 and T2), principal component analysis (PCA) was performed using the FactoMineR package .
The limma package  was used to import and normalize the global microarray data (quantile method for normalization between arrays). For each studied time of CO2 released (T1 and T2) and based on this normalized dataset of 6200 points for the 4 strains, we used a sparse partial least square – discriminant analysis (sPLS-DA), an exploratory approach in a supervised context in order to select the most important transcripts relative to the 4 strains . We tuned the number of dimensions of the sPLS-DA to 2 and the number of variables to choose on these 2 dimensions to 400.
A functional analysis was performed on the selected transcripts by time point, in order to highlight significant functional groups according to the Gene Ontology (GO) process terms using the GeneCodis program with the FDR method at a p value cutoff of 0.05 .
For each time point, a multivariate factorial analysis (MFA) was also performed to obtain an overview of the dataset, which consisted in 433 variables measured for 4 strains (S288c, VL1, cachaça, sake). The data set included a group of individuals described by two types of variables: the normalized expression of the 400 transcripts selected by the sPLA-DA according to the 4 strains, and the 33 volatile compounds produced during the fermentation by the 4 strains. The MFA takes into account the structure of the two groups of data and balances the influence of each group of variables. This enables the study of links between expression data and volatile compounds production .
Microarray data accession numbers: the complete data set is available through the Gene Expression Omnibus (GEO) database. The microarray description is under GEO accession number GPL16244.
Results and discussion
Fermentative profiles and metabolic characterization
Aiming at a better understanding of the molecular and metabolic bases of aroma production during a fermentation process, we started by characterizing fermentative profiles and metabolite production of grape must fermentations conducted by three Saccharomyces cerevisiae strains isolated from different fermentative environments, namely cachaça Z63, sake Z23 and the commercial wine yeast VL1, as well by the laboratory reference strain S288c. These strains were previously characterized genetically and phenotypically [45, 55] and were selected from a larger yeast collection based on their dissimilarities . Triplicate fermentations were carried out with each of the four strains using natural must Maccabeu. The fermentation performance of the strains is presented in Fig. 1, in which each curve represents the average debit of CO2 from the three replicates for each strain. With the exception of the laboratory strain, for which a slower fermentation and a lower maximum fermentation rate were obtained, the remaining three strains present a similar fermentative profile with a Vmax between 1.2 and 1.4 g/L/h of CO2 released.
In order to obtain a characterization of their metabolic profile, high-performance liquid chromatography (HPLC) and gas chromatography – mass spectrometry (GC-MS) analysis were performed with samples from two time points of fermentation: exponential phase (T1, 5 g/L of CO2 released) and stationary phase (T2, 50 g/L CO2 released). Thirty-eight compounds were quantified including 11 ethyl esters, 7 acetate esters, 4 organic acids, 5 higher alcohols, 10 volatile fatty acids and propanol (Additional file 1: Table S1).
PCA analysis based on the compounds quantified both by HPLC and GC-MS (Fig. 2) showed intra-strain differences, with a discrimination of the laboratory strain from the other three strains at T1 (Fig. 2a) and T2 (Fig. 2c). Circles of correlation (Figs. 2b, d) show the contribution of each quantified metabolic compound to the separation of the strains in the scores plot. Only the first two components were considered, since they explain a high percentage of the variability found between isolates and between compounds: 83.7% and 84.3% for T1 and T2, respectively. At T1 (Figs. 2a and b), a clear differentiation between laboratory strain and the other three strains was obtained according to the first axis. Productions of acetate esters (green) and of some higher alcohols (blue) had positive contributions to this axis while formation of medium chain fatty acids (hexanoic, octanoic and decanoic acids) was negatively involved. Strain Z63, having its origin in the fermentative beverage cachaça, distinguished along the second axis by a higher production of ethyl decanoate, ethyl octanoate and ethyl butanoate, compared with other tested strains.
At time-point T2, corresponding to the stationary phase of fermentation, a similar scenario was observed, with a clear separation of laboratory strain S288c from the others according to the first axis, and a separation of strain Z6 3 (cachaça) from strains Z23 and VL1 along the second one. However, the major contributors to the two axes differed between the two time points. During the stationary phase, fermentation by strains Z63, Z23 and VL1 produced higher amounts of almost all metabolites assessed, in comparison with the laboratory strain: acetate esters, ethyl esters, the majority of the acids apart from decanoic and propanoic acids and most of higher alcohols except propanol (first axis). From the three ethyl esters produced highly by cachaça strain at T1, only ethyl butanoate was again responsible for the separation of this strain from strains VL1 and Z23 (second axis).
Our results show that at the two time points considered in this work, the compounds contributing the most to the strains separation in comparison with S288c were the acetate and ethyl esters and the higher alcohols. It is well known that higher alcohols have positive effect on wine aroma as well [3, 56]. In the same way esters, produced by yeasts during alcoholic fermentation, have a significant influence on the fruity aromas of the final product, both in the case of ethyl fatty acid esters and acetate esters [57, 58]. So the results indicate that must fermentations carried with yeasts isolated from any of the three wild fermentative environments will be characterized by a higher development of the “yeast bouquet” and originate wines with much more complex aroma and flavour, than the laboratory strain used as reference. In addition, the aroma profile of sake strain will be closer to the one of the wine strain. In the case of volatile fatty acids, their concentration varied from 82 to 220 mg/L at T1 and 81 to 289 mg/L at T2, influencing also the PCA position of the analysed strains. The concentration of volatile acids is of particular relevance since in concentrations above 300 mg/L they are associated with unpleasant odors and tastes, such as a pungent smell and taste. In concentrations below that level, volatile acids can have a positive impact with fruity and floral aromas , mainly due to the inhibition of their esters hydrolysis.
Comparative transcriptomics of the three S. cerevisiae strains isolated from the different fermentative environments in comparison with the reference yeast S288c was conducted using Agilent 8x15k microarrays. mRNA samples were collected at the two time points T1 and T2, as explained in the previous section.
Tables 1, 2, 3 and 4 summarize the main findings obtained with transcriptomic characterization of the three fermentation isolates, in comparison with laboratory strain S288c. Results were analysed using Funspec with Bonferroni correction (p < 0.05), and down or upregulated genes are indicated for the three strains in comparison with S288c, both at T1 (Tables 1, 2) and T2 (Tables 3, 4). Genes were categorized in accordance with MIPS Functional Catalogue , and the ones common to the three strains are underlined.
As to time point 1 (T1), analysis of Table 1 shows that one group of genes related with the functions “pheromone response, mating-type determination, sex-specific proteins”, was downregulated in all three strains. Since the 3 isolates used in the present work are diploid [46, 55, 61], and the laboratory strain S288c used for comparison is haploid , differences in ploidy could thus underlie the differences in expression of the genes related with the mating and the pheromone response. Genes involved in the degradation of asparagine/metabolism of aspartate (ASP3–1, ASP3–2, ASP3–3 and ASP3–4) appeared as downregulated in the three isolates, and ASP1 coding for cytosolic L-asparaginase was downregulated in Z23 and VL1 strains. This is likely related with the fact that some S. cerevisiae strains, including some wine and sake strains, had lost the ASP3 locus .
Genes with significantly increased expression at T1, include a group of genes related with tetracyclic and pentacyclic triterpenes metabolism (cholesterin, steroids and hopanoids) that was upregulated in the 3 strains comparatively to the laboratory strain (Table 2). Most of these genes are involved in sterol synthesis namely ergosterol, which by contributing to the fluidity of the yeast membrane, allows a more efficient activity of membrane transporters and increased tolerance to ethanol , correlating with the superior fermentation performances of strains. The higher sterol biosynthesis could also divert acetyl CoA from fatty acid biosynthesis, so the lower levels of these genes in S288c strain could explain the higher production of medium chain fatty acids (MCFA) by this strain (Fig 2b). Several genes involved in aerobic respiration, electron transport and mitochondrion were also upregulated in the three mentioned strains in comparison with S288c (Table 2), suggesting a less strict glucose repression in the strains isolated from the fermentative environments. The higher respiratory capacity might also be associated with the higher production of fusel acids (Fig. 2), due to lower need to reoxidize NADH through the Ehrlich pathway . Also, at T1, the increased expression in Z23 of genes related with aldehyde oxidation, namely AAD4, AAD6, AAD16 and ADH7, might relate with the higher production of fusel alcohols in this strain especially of isoamylalcohol, phenylethanol, isobutanol and methionol (marked in blue in Fig. 2b).
Regarding time point T2 (Table 3), there were no common downregulated genes in the three characterized strains. Genes related with ribosomal proteins were downregulated only in sake strain (Table 3). The differences in the expression of these genes, observed also at T1 for Z23 and VL1 strains, may originate from the different fermentative profile and the different metabolic stage of each strain, at this time point. Regarding upregulated genes (Table 4), a group of genes involved in the synthesis of sterols was still upregulated for the cachaça (Z63) and wine (VL1) strains. For the sake strain (Z23) these genes were similarly expressed when compared to the laboratory strain suggesting that sake strain could be in an less active metabolic stage, in comparison with the other strains, requiring less sterol synthesis, which is also in agreement with the observed repression of ribosomal genes. Also at T2 it is visible that some genes upregulated in strains Z23 and VL1 (ADH7, ADH6 and AAD6) are involved in the Ehrlich pathway and so related with the formation of specific compounds, such as higher alcohols. In accordance with these results, metabolic analysis showed an increase of the same higher alcohols for T2 in comparison with T1, namely: methionol, isobutanol, isoamyl alcohol and phenylethanol. The only alcohols that seem not to be included in this association are amylalcohol and propanol, which were equal or less produced, respectively, in these strains in relation to S288c. The differential production of acetate esters by the two groups of strains (marked in orange in Figs. 2b and d) could be related with the differences in expression of ALD6 , which was overexpressed in strains Z23 and VL1. This gene is involved in the formation of acetic acid that can then be converted into acetyl-CoA and subsequently incorporated in acetate esters.
Similarly to the downregulated genes, at T2 there were no common upregulated genes for the three strains. This is opposite to the observed at T1 and may reflect that the differentiation of the strains, isolated from different fermentation processes, is especially important enduring the multistress stationary phase of fermentation where each strain developed different adaptive mechanisms in response to the specific fermentation conditions .
Combined transcriptomics and metabolomics analysis
Aiming to unravel new associations between genes and aromatic compounds production we next performed a combined analysis of transcriptomic and metabolic data sets. A supervised exploratory approach sPLS-DA was carried out from gene expression data in order to select the 400 most differential expressed genes (200 for each axis) at each time point (from the 6200 S. cerevisiae probes present in the microarray). At the two time points, multiple factorial analysis (MFA) was then performed from expression levels of the 400 chosen genes and the 38 metabolic variables (Figs. 3 and 4). The 400 genes clustered into four main groups together with metabolites, allowing a clear separation of the strains on the basis of their gene expression and metabolic profiles. GeneCodis [54, 66, 67] was used to determine biological annotations with statistical relevance associated with the genes present in each group (Additional files 2 and 3: Tables S2 and S3).
During the growth phase (T1, Fig. 3), the reference strain S288c differed from the other yeasts (sake, cachaça and wine strains) by a higher expression level of genes of group 3 associated with an important production of propanol, glycerol and medium chain fatty acid, and conversely, a lower expression of genes of group 1, connected with a limited formation of isobutanol, methionol, isobutylacetate and phenylethanol. Genes of group 1 were identified as coding for ribosomal proteins (RPL14B, RPS24A, RPS25B, RPL30, RPS26B, MRPL23, RPS17B, RPL40B and RPL26A), involved in the structural integrity of ribosome. The association of genes coding for ribosomal proteins, with the differential production of higher alcohols and the ester isobutyl acetate (Additional file 2: Table S2), could suggest an impact of higher growth rates on the production of these compounds. It is well known that the formation of higher alcohols depends of the reduction from the respective aldehyde with the oxidation of NADH into NAD+ . Consequently, the need for rapid production of oxidised NAD+ could have an important regulatory role in the formation of these compounds, explaining their higher formation by cachaça, wine and sake strains compared with the laboratory yeast. Regarding group 3, it contains genes associated with MAPK signalling pathway, cysteine and methionine metabolism and ABC transporters. The presence in this group of ATM1, coding for a mitochondrial exporter of Fe-S clusters and of genes from metabolism of cysteine, usually the limiting component in glutathione synthesis, suggests a more important response of S288c to oxidative stress compared with the other yeasts, generating a limitation of reductive power in this strain. This decrease may be the driving factor of the formation of several volatile fatty acids such as octanoic acid, decanoic acid, hexanoic acid, butyric acid and dodecanoic acid, which was increased in the laboratory strain. It is also tempting to speculate that PDR5 may be involved in the export of the fatty acids. MFA also revealed that cachaça yeast (Z63) differentiated from the other strains by an increased production of ethyl esters, namely ethylbutanoate, ethyldecanoate and ethyloctanoate while VL1 and Z23 exhibited higher capacities of production of hexylacetate, propylacetate, 2-phenylethylacetate, amylalcohol, isovaleric acid, isoamylacetate, amylacetate, ethilpropionate, propanoic acid and isoamylalcohol (Additional file 2: Table S2). Interestingly, genes that were more expressed specifically in Z63 are related with metabolism of butanoate, tyrosine, beta-alanine and fatty acids, and also associated with glycolysis and gluconeogenesis. Thus, the overexpression of genes involved in the butanoate and more general in fatty acid metabolism, may directly explain the increased production of ethylbutanoate and of the other ethyl esters. Finally, no relevant biological annotation was found among the genes overexpressed in wine and sake yeast (group 4), pointing to a role of each of the genes individually.
At T2 (Fig. 4), a clear separation was also observed between strain S288c and the other strains, being this related with overexpression of genes from groups 1 and 2 versus downregulation of those of group 3 and 4 in the lab strain. In addition, S288c is characterised by an important formation of unpleasant or neutral compounds, in particular acids that contribute with unpleasant odors to wine. Genes from group 1, such as TDH3, FBP26, SLT2, MIG2 and GDH1, which clustered with acids formation, were associated with central carbon metabolism and its regulation, cation transport and cell wall. Thus, the maintenance of ionic homeostasis in the interaction with the environment may appear as a determining factor in the production of the unpleasant acids. Consequently, the manipulation of specific cation homeostasis and cell wall integrity pathway could be a way of avoiding/reducing their production. Genes from group 2 included once again the term “ribosomes” but associated with the formation of alpha-ketoglutarate and pyruvate in addition to the production of higher alcohols (propanol, amylalcohol), as evidenced at T1. The other biological annotations associated with group 2 genes included purine or pyrimidine metabolism, and no clear scenario could be established between gene functions and the compounds produced. Genes from groups 3 and 4 were clearly related with the central carbon metabolism and formation of aroma compounds and are associated with marked increased concentrations of higher alcohols and ethyl and acetate esters for the fermentative yeasts, including several acetate and ethyl esters that contribute to the “floral” and “fruity” characteristics of wine (Additional file 3: Table S3). Specifically, VL1 and Z63 strains were characterised by an overexpression of genes from group 3 combined with a downregulation of those of group 2. Group 3 included a set of 17 genes related with biosynthesis of secondary metabolites, which clearly related with the production of the metabolic compounds, being more specifically associated with the terms “steroid biosynthesis”, “propanoate metabolism” (ALD6, ACS2 and ERG10), “valine, leucine, isoleucine and lysine degradation” (ALD6, ERG10, ERG13), and “fatty acid metabolism” (FAA1, ALD6 and ERG10). This could be associated to an increase production of valeric acid but also succinate, methionol and isobutanol. Group 4 genes, which differentiated strain Z23 from the others, were mainly associated with the production of a high variety of acetate and ethyl ethers. Functional categories more significantly associated with this group of genes were c-compound metabolism and oxidation-reduction process.
In this work we performed the transcriptomic and metabolic characterization of four S. cerevisiae strains, with different origins and technological applications and unravelled new associations between genes and aromatic compounds production. Results showed differences between cachaça, sake and wine strains metabolism and gene expression, significant differences being found mainly between cachaça and sake strains, in comparison with the wine strain. However, although each strain comes from a different industrial application, we must caution that it may not be a standard representative of that industry, as strain differences are often found for the same industrial application . At T1 of fermentation, strain Z63 (cachaça) showed major differences from sake and wine strains, mainly regarding the production of the ethyl esters, ethyl decanoate and ethyl octanoate. These differences were associated with the expression of genes related with the metabolism of butanoate, tyrosine, beta-alanine and fatty acids. At T2, a different scenario was found in which the sake strain (Z23) had the most distinctive behaviour when considering both metabolites produced and transcription results. At this point this strain showed a higher production of several acetate and ethyl esters and an increase in the expression of genes of c-compound metabolism and oxidation-reduction process. On the contrary, wine and cachaça strains showed an upregulation of genes related with steroid biosynthesis, propanoate metabolism, valine, leucine, isoleucine and lysine degradation, and fatty acid metabolism.
In summary, the integration of several technologies (HPLC, GC-MS, microarrays) applied to fermentation results of four strains with diverse origins and technological applications, analysed using several data analysis methods (PCA, MFA) revealed successful to understand and clarify the genes and the pathways that lead to the formation of metabolic compounds that contribute to the wine aroma and flavour. The results also show that the use of Z23 strain in a wine fermentation will produce a major amount of ethyl acetate which contributes to the fruity and floral characteristics of wine. The knowledge here obtained has the potential to be deeply explored and extended to other strains and other metabolic pathways, within an approach using aroma production as the primary selection criteria. The majority of the genes identified in this work as having their expression changed in correlation with the aroma compounds produced, play a central role in the metabolism of S. cerevisiae, namely ADH6, ADH7, AAD6, ALD2, ALD6, FAA1, ACS2, ERG10 and ERG13. These genes are potential targets for gene deletion/overexpression programs using these and/or other strains, in order to better understand their role and their correlation with the aroma production network of S. cerevisiae. Moreover, the information now obtained may be useful in breeding programs to drive the selection of yeast strains with improved aromatic properties.
Direct injection mass spectrometry
False discovery rate
Gene expression omnibus
High performance liquid chromatography
Mitogen-activated protein kinase
Medium-chain fatty acid
Multivariate factorial analysis
- NAD+ :
Nicotinamide adenine dinucleotide
Nicotinamide adenine dinucleotide (reduced form)
Nuclear magnetic resonance
Principal componente analysis
Revolutions per minute
Sparse partial least square – discriminant analysis
Chambers PJ, Pretorius IS. Fermenting knowledge: the history of winemaking, science and yeast research. EMBO Rep 2010;11:914–920.
Lambrechts MG, Pretorius IS. Yeast and its importance to wine aroma - a review. S Afr J Enol Vitic. 2000;21:97–129.
Cordente AG, Curtin CD, Varela C, Pretorius IS. Flavour-active wine yeasts. Appl Microbiol Biotechnol. 2012;96:601–18.
Robinson J. The Oxford companion to wine. Oxford: Oxford University Press Oxford; 1994.
Mannazzu I, Clementi F, Ciani M. Strategies and criteria for the isolation and selection of autochthonous starter. In: Ciani M, editor. Biodivers. Biotechnol. wine yeasts. Trivandrum: Research Signpost; 2002. p. 19–35.
Schuller D. Better yeast for better wine - genetic improvement of Saccharomyces cerevisiae wine strains. In: Rai M, Koevics G, editors. Prog. Mycol. Jodhpur. India: Scientific Publishers; 2010. p. 1–51.
Bird D. Understanding wine technology - the science of wine explained. J Wine Res. 2013;24:156–64.
Dequin S. The potential of genetic engineering for improving brewing, wine-making and baking yeasts. Appl Microbiol Biotechnol. 2001;56:577–88.
Borneman AR, Desany BA, Riches D, Affourtit JP, Forgan AH, Pretorius IS, et al. The genome sequence of the wine yeast VIN7 reveals an allotriploid hybrid genome with Saccharomyces cerevisiae and Saccharomyces kudriavzevii origins. FEMS Yeast Res. 2012;12:88–96.
Borneman AR, Desany BA, Riches D, Affourtit JP, Forgan AH, Pretorius IS, et al. Whole-genome comparison reveals novel genetic elements that characterize the genome of industrial strains of Saccharomyces cerevisiae. PLoS Genet. 2011;7:e1001287.
Pretorius IS. Tailoring wine yeast for the new millennium: novel approaches to the ancient art of winemaking. Yeast. 2000;16:675–729.
Novo M, Bigey F, Beyne E, Galeote V, Gavory F, Mallet S, et al. Eukaryote-to-eukaryote gene transfer events revealed by the genome sequence of the wine yeast Saccharomyces cerevisiae EC1118. Proc Natl Acad Sci U S A. 2009;106:16333–8.
Marsit S, Dequin S. Montpellier F-, Supagro M, Montpellier F-, Montpellier F-. Diversity and adaptive evolution of Saccharomyces wine yeast : a review. FEMS Yeast Res. 2015:1–12.
Gómez-Pastor R, Pérez-Torrado R, Cabiscol E, Ros J, Matallana E. Reduction of oxidative cellular damage by overexpression of the thioredoxin TRX2 gene improves yield and quality of wine yeast dry active biomass. Microb Cell Factories. 2010;9:9.
López-Malo M, Chiva R, Rozes N, Guillamon JM. Phenotypic analysis of mutant and overexpressing strains of lipid metabolism genes in Saccharomyces cerevisiae: implication in growth at low temperatures. Int J Food Microbiol. 2013;162:26–36.
Teixeira MC, Raposo LR, Mira NP, Lourenço AB, Sá-Correia I. Genome-wide identification of Saccharomyces cerevisiae genes required for maximal tolerance to ethanol. Appl Environ Microbiol. 2009;75:5761–72.
Si T, Luo Y, Xiao H, Zhao H. Utilizing an endogenous pathway for 1-butanol production in Saccharomyces cerevisiae. Metab Eng. 2014:1–9.
Rossouw D, Næs T, Bauer FF, Naes T. Linking gene regulation and the exo-metabolome: a comparative transcriptomics approach to identify genes that impact on the production of volatile aroma compounds in yeast. BMC Genomics. 2008;9:530.
Rossouw D, van den Dool AH, Jacobson D, Bauer FF. Comparative Transcriptomic and Proteomic Profiling of Industrial Wine Yeast Strains. Appl Environ Microbiol. 2010;76:3911–23.
Rossouw D, Olivares-hernandes R, Nielsen J, Bauer FF. Comparative transcriptomic approach to investigate differences in wine yeast physiology and metabolism during fermentation. Appl Env. Microbiol. 2009;75:6600–12.
Rossignol T, Dulau L, Julien A, Blondin B. Genome-wide monitoring of wine yeast gene expression during alcoholic fermentation. Yeast. 2003;20:1369–85.
Varela C, Javier C, Melo F, Agosin E, Cárdenas J, Cardenas J. Quantitative analysis of wine yeast gene expression profiles under winemaking conditions. Yeast. 2005;22:369–83.
Marks VD, Ho Sui SJ, Erasmus D, van der Merwe GK, Brumm J, Wasserman WW, et al. Dynamics of the yeast transcriptome during wine fermentation reveals a novel fermentation stress response. FEMS Yeast Res. 2008;8:35–52.
Alexandre H, Ansanay-Galeote V, Dequin S, Blondin B. Global gene expression during short-term ethanol stress in Saccharomyces cerevisiae. FEBS Lett. 2001;498:98–103.
Pizarro FJ, Jewett MC, Nielsen J, Agosin E. Growth temperature exerts a differential physiological and transcriptional response in laboratory and wine strains of Saccharomyces cerevisiae. Appl Env Microbiol. 2008;74:6358–68.
Erasmus DJ, van der Merwe GK, van Vuuren HJ, Vandermerwe G, Vanvuuren H, Merwe GK Van Der, et al. Genome-wide expression analyses: Metabolic adaptation of Saccharomyces cerevisiae to high sugar stress. FEMS Yeast Res 2003;3:375–399.
Carreto L, Eiriz MF, Domingues I, Schuller D, Moura GR, Santos M a S, et al. Expression variability of co-regulated genes differentiates Saccharomyces cerevisiae strains. BMC Genomics. 2011;12:201.
Birkemeyer C, Kolasa A, Kopka J. Comprehensive chemical derivatization for gas chromatography-mass spectrometry-based multi-targeted profiling of the major phytohormones. J Chromatogr A. 2003;993:89–102.
Kleijn RJ, Geertman J-MA, Nfor BK, Ras C, Schipper D, Pronk JT, et al. Metabolic flux analysis of a glycerol-overproducing Saccharomyces cerevisiae strain based on GC-MS, LC-MS and NMR-derived C-labelling data. FEMS Yeast Res. 2007;7:216–31.
Fiehn O. Extending the breadth of metabolite profiling by gas chromatography coupled to mass spectrometry. Trends Analyt Chem. 2008;27:261–9.
Soga T, Ohashi Y, Ueno Y, Naraoka H, Tomita M, Nishioka T. Quantitative metabolome analysis using capillary electrophoresis mass spectrometry. J Proteome Res. 2003;2:488–94.
Monton MRN, Soga T. Metabolome analysis by capillary electrophoresis-mass spectrometry. J Chromatogr A. 2007;1168:237–46.
Tanaka Y, Higashi T, Rakwal R, Wakida S, Iwahashi H. Quantitative analysis of sulfur-related metabolites during cadmium stress response in yeast by capillary electrophoresis-mass spectrometry. J Pharm Biomed Anal. 2007;44:608–13.
Ramautar R, Somsen GW, de Jong GJ. CE-MS in metabolomics. Electrophoresis. 2009;30:276–91.
Ellis DI, Goodacre R. Metabolic fingerprinting in disease diagnosis: biomedical applications of infrared and Raman spectroscopy. Analyst. 2006;131:875–85.
Salek RM, Maguire ML, Bentley E, Rubtsov DV, Hough T, Cheeseman M, et al. A metabolomic comparison of urinary changes in type 2 diabetes in mouse, rat, and human. Physiol. Genomics. 2007;29:99–108.
Barton RH, Nicholson JK, Elliott P, Holmes E. High-throughput 1H NMR-based metabolic analysis of human serum and urine for large-scale epidemiological studies: validation study. Int. J. Epidemiol. 2008;37 Suppl 1:i31–40.
Bjerrum JT, Nielsen OH, Hao F, Tang H, Nicholson JK, Wang Y, et al. Metabonomics in ulcerative colitis: diagnostics, biomarker identification, and insight into the pathophysiology. J Proteome Res. 2010;9:954–62.
Allen J, Davey HM, Broadhurst D, Heald JK, Rowland JJ, Oliver SG, et al. High-throughput classification of yeast mutants for functional genomics using metabolic footprinting. Nat Biotechnol. 2003;21:692–6.
DA MK, Defernez M, Dunn WB, Brown M, Fuller LJ, SRMS d H, et al. Relatedness of medically important strains of Saccharomyces cerevisiae as revealed by phylogenetics and metabolomics. Yeast. 2008;25:501–12.
Villas-Bôas SG, Moxley JF, Akesson M, Stephanopoulos G, Nielsen J, Villas-Boas SG. High-throughput metabolic state analysis: the missing link in integrated functional genomics of yeasts. Biochem J. 2005;388:669–77.
Hollywood K, Brison DR, Goodacre R. Metabolomics: Current technologies and future trends. Proteomics. 2006;6:4716–23.
Dettmer K, Aronov PA, Hammock BD. Mass spectrometry-based metabolomics. Mass Spectrom Rev. 2007;26:51–78.
Garcia DE, Baidoo EE, Benke PI, Pingitore F, Tang YJ, Villa S, et al. Separation and mass spectrometry in microbial metabolomics. Curr Opin Microbiol. 2008;11:233–9.
Mendes I, Franco-Duarte R, Umek L, Fonseca E, Drumonde-Neves J, Dequin S, et al. Computational models for prediction of yeast strain potential for winemaking from phenotypic profiles. Schacherer J, editor. PLoS One. 2013;8:e66523.
Liti G, Carter DM, Moses AM, Warringer J, Parts L, James SA, et al. Population genomics of domestic and wild yeasts. Nature. 2009;458:337–41.
Bely M, Sablayrolles JM, Barre P. Description of Alcoholic Fermentation Kinetics - Its Variability and Significance. Am J Enol Vitic. 1990;41:319–24.
Sablayrolles J, Barre P, Grenier P. Design of laboratory automatic system for studying alcoholic fermentations in anisothermal enological conditions. Biotechnol Tech. 1987;1:181–4.
Rollero S, Bloem A, Camarasa C, Sanchez I, Ortiz-Julien A, Sablayrolles J-M, et al. Combined effects of nutrients and temperature on the production of fermentative aromas by Saccharomyces cerevisiae during wine fermentation. Appl Microbiol Biotechnol. 2014;99:2291–304.
The R. Core Team. R : A Language and Environment for Statistical Computing. 2013;
Husson F, Josse J, Le S, Mazet J. FactoMineR: multivariate exploratory data analysis and data mining with R. R package version 1.18. 2012.
Smyth GK, Speed T. Normalization of cDNA microarray data. Methods. 2003;31:265–73.
Lê Cao K-A, Boitard S, Besse P. Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems. BMC Bioinformatics. 2011;12:253.
Nogales-Cadenas R, Carmona-Saez P, Vazquez M, Vicente C, Yang X, Tirado F, et al. GeneCodis: interpreting gene lists through enrichment analysis and integration of diverse biological information. Nucleic Acids Res. 2009;37:W317–22.
Franco-Duarte R, Mendes I, Umek L, Drumonde-Neves J, Zupan B, Schuller D. Computational models reveal genotype-phenotype associations in Saccharomyces cerevisiae. Yeast. 2014:265–77.
Swiegers JHH, Bartowsky EJJ. Henschke P a. A, Pretorius ISS. Yeast and bacterial modulation of wine aroma and flavour. Aust. J. Grape Wine Res. 2005;2:139–73.
Mason AB, Dufour JJ. Alcohol acetyltransferases and the significance of ester synthesis in yeast. Yeast. 2000;16:1287–98.
Ribéreau-Gayon P, Dubourdieu D, Doneche B, Lonvaud A. Handbook of Enology Volume 1 The Microbiology of Wine and Vinifications. 2nd ed. Chichester: Wiley; 2000.
González-Álvarez J, Blanco-Gomis D, Arias-Abrodo P, Díaz-Llorente D, Ríos-Lombardía N, Busto E, et al. Characterization of hexacationic imidazolium ionic liquids as effective and highly stable gas chromatography stationary phases. J Sep Sci. 2011:273–9.
Ruepp A, Zollner A, Maier D, Albermann K, Hani J, Mokrejs M, et al. The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes. Nucleic Acids Res. 2004;32:5539–45.
Schuller D, Pereira L, Alves H, Cambon B, Dequin S, Casal M. Genetic characterization of commercial Saccharomyces cerevisiae isolates recovered from vineyard environments. Yeast. 2007;24:625–36.
Goffeau A, Barrell BG, Bussey H, Davis RW, Dujon B, Feldmann H, et al. Life with 6000 Genes. Science. 1996;274:563–7.
League GP, Slot JC, Rokas A. The ASP3 locus in Saccharomyces cerevisiae originated by horizontal gene transfer from Wickerhamomyces. FEMS Yeast Res. 2012;12:859–63.
Alexandre H, Rousseaux I, Charpentier C. Ethanol adaptation mechanisms in Saccharomyces cerevisiae. Biotechnol Appl Biochem. 1994;20:173–83.
Saint-Prix F, Bönquist L, Dequin S. Functional analysis of the ALD gene family of Saccharomyces cerevisiae during anaerobic growth on glucose: the NADP+−dependent Ald6p and Ald5p isoforms play a major role in acetate formation. Microbiology. 2004;150:2209–20.
Tabas-Madrid D, Nogales-Cadenas R, Pascual-Montano A. GeneCodis3: a non-redundant and modular enrichment analysis tool for functional genomics. Nucleic Acids Res. 2012;40:W478–83.
Carmona-Saez P, Chagoyen M, Tirado F, Carazo JM, Pascual-Montano A. GENECODIS: a web-based tool for finding significant concurrent annotations in gene lists. Genome Biol. 2007;8:R3.
Ehrlich F. Uber das naturliche Isomere des Leucins. Berichte der Dtsch Chem Gesellschaft. 1907;40:2538–62.
Barbosa EA, Souza MT, Diniz RHS, Godoy-Santos F, Faria-Oliveira F, Correa LFM, et al. Quality improvement and geographical indication of cachaça (Brazilian spirit) by using locally selected yeast strains. J Appl Microbiol. 2016;121:1038–51.
The authors would like to thank all the researchers that kindly provided yeast strains: Gianni Liti, Institute of Genetics UK, Rogelio Brandão, Laboratório de Fisologia e Bioquı́mica de Microorganismos Brazil. Authors would like to thank also to Pierre Delobel, Christian Picou and Jean-Roch Mouret that kindly help in the microarrays and fermentations experiments.
Inês Mendes is recipient of a fellowship from the Portuguese Science Foundation, FCT (SFRH/BD/74798/2010). This work was supported by FCT through grant (PTDC/AGR-ALI/121062/2010) and by the strategic programme UID/BIA/04050/2013 (POCI-01-0145-FEDER-007569) funded by national funds through the FCT I.P. and by the ERDF through the COMPETE2020 - Programa Operacional Competitividade e Internacionalização (POCI).
Availability of data and materials
IM, RFD, DS and SD designed the experiments; IM, CC and IS performed the experiments and data analysis; IM, RFD, MJS, CC and SD wrote the manuscript; SD, DS and MJS supervised the work. All authors contributed to the discussion of the research and read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Concentration (mg/L) of aromatic compounds determined by GC-MS and HPLC for the four Saccharomyces cerevisiae strains and at two time points. (DOCX 26 kb)
List of genes present in each group of Fig. 3, together with their function, obtained after GeneCodis analysis regarding biological annotations with statistical relevance at T1. (XLSX 9 kb)
About this article
Cite this article
Mendes, I., Sanchez, I., Franco-Duarte, R. et al. Integrating transcriptomics and metabolomics for the analysis of the aroma profiles of Saccharomyces cerevisiae strains from diverse origins. BMC Genomics 18, 455 (2017). https://doi.org/10.1186/s12864-017-3816-1