Genome sequencing and genetic breeding of a bioethanol Saccharomyces cerevisiae strain YJS329
© Zheng et al.; licensee BioMed Central Ltd. 2012
Received: 3 May 2012
Accepted: 14 August 2012
Published: 15 September 2012
Environmental stresses and inhibitors encountered by Saccharomyces cerevisiae strains are the main limiting factors in bioethanol fermentation. Strains with different genetic backgrounds usually show diverse stress tolerance responses. An understanding of the mechanisms underlying these phenotypic diversities within S. cerevisiae populations could guide the construction of strains with desired traits.
We explored the genetic characteristics of the bioethanol S. cerevisiae strain YJS329 and elucidated how genetic variations in its genome were correlated with specified traits compared to similar traits in the S288c-derived strain, BYZ1. Karyotypic electrophoresis combined with array-comparative genomic hybridization indicated that YJS329 was a diploid strain with a relatively constant genome as a result of the fewer Ty elements and lack of structural polymorphisms between homologous chromosomes that it contained. By comparing the sequence with the S288c genome, a total of 64,998 SNPs, 7,093 indels and 11 unique genes were identified in the genome of YJS329-derived haploid strain YJSH1 through whole-genome sequencing. Transcription comparison using RNA-Seq identified which of the differentially expressed genes were the main contributors to the phenotypic differences between YJS329 and BYZ1. By combining the results obtained from the genome sequences and the transcriptions, we predicted how the SNPs, indels and chromosomal copy number variations may affect the mRNA expression profiles and phenotypes of the yeast strains. Furthermore, some genetic breeding strategies to improve the adaptabilities of YJS329 were designed and experimentally verified.
Through comparative functional genomic analysis, we have provided some insights into the mechanisms underlying the specific traits of the bioethanol strain YJS329. The work reported here has not only enriched the available genetic resources of yeast but has also indicated how functional genomic studies can be used to improve genetic breeding in yeast.
Keywords: Bioethanol; Saccharomyces cerevisiae; Stress; Genome; RNA-Seq
Bioethanol is an important adjunct to fossil fuel because it is renewable, relatively environmentally innocuous, and compatible with the current fuel transport facilities. To date, bioethanol is mainly produced through the yeast-based fermentation of carbohydrates at about 33°C to give a final product concentration of 8–15% (v/v) [1, 2]. Some novel processes, including high-gravity fermentation, high-temperature fermentation, and production from cellulose, intended to increase the economic and social benefits of ethanol, have been proposed and widely studied [2–6]. These processes, however, share the problem that they impose severe environmental stresses or inhibitors on yeast cells which greatly reduces their production efficiency. In addition, these stresses induce the formation of more by-products (mainly glycerol and acetic acid), consuming up to 5% of the carbon source [2–5].
The genome of the Saccharomyces cerevisiae strain S288c was, in 1996, the first eukaryotic genome to be sequenced. In the 15 years that have passed since then, many functional genomic studies using the S288c genome as a reference sequence have greatly enriched our knowledge of how yeast cells respond to and resist various environmental stresses [8–16]. However, the information that has been produced cannot always be extrapolated to other yeast strains because of their diverse genomes and phenotypes [8, 17, 18]. Compared with laboratory strains, industrial strains generally show higher adaptability to specific environments; however, the genetic basis for their improved characteristics is not well understood. Comparisons of the genomes of strains with different backgrounds should help identify the sequence changes that play important roles in the tolerance of particular stresses. Because of the progress in genome sequencing technology, some industrial yeast strains, including AWRI1631, EC1118, JAY270, Vin13 and FostersO, have now been sequenced [19, 20]. Comparisons of the publicly available S. cerevisiae genome sequences have revealed the clear signatures (single nucleotide polymorphisms (SNPs), insertions and deletions (indels), and novel ORFs) of different strains [18, 20, 21]. However, further studies are needed to explore how the genetic variations confer the specific phenotype of each strain. Of these industrial strains, JAY270 (PE-2 derived), which uses sugar cane as feedstock, is the only bioethanol strain. Little is known about the genome structure and characteristics of other bioethanol strains.
In this study, we investigated the genetic characteristics of a bioethanol strain, YJS329, and the molecular mechanisms that underlie its phenotypic differences from the laboratory strain, BYZ1 (S288c-derived). YJS329 exceeded BYZ1 in fermentation rate and ethanol yield under different stress conditions, consistent with its greater tolerance of multiple stresses. Comparative genomic hybridization array and whole genome sequencing revealed many differences in the genomes of these two strains, including SNPs, indels, novel ORFs and changes in chromosome structure. Finally, we used RNA-Seq to determine how the genetic differences might affect the transcriptional profile and physiological metabolism of the two strains. Our study enriches the genetic resources for S. cerevisiae and deepens our knowledge of the effects of genetic variation on phenotypic diversity.
Phenotypic and physiological characteristics of YJS329
Performance of the yeast strains, BYZ1 and YJS329, under different fermentation conditions
Ethanol produced (g/L) | Glycerol produced (g/L) | Acetic acid produced (g/L) | Residual glucose (g/L) | Dry weight (g/L)
71.89 ± 0.67 | 3.95 ± 0.08 | 1.06 ± 0.02 | 0.13 ± 0.00 | 9.4 ± 0.14
71.15 ± 0.72 | 4.36 ± 0.07a | 0.44 ± 0.01A | 0.03 ± 0.00 | 9.1 ± 1.02
60.18 ± 0.53 | 3.68 ± 0.09 | 1.21 ± 0.02 | 25.07 ± 0.34 | 3.9 ± 0.11
70.16 ± 0.55B | 4.90 ± 0.07B | 0.78 ± 0.01B | 1.11 ± 0.04B | 3.5 ± 0.09b
111.58 ± 1.19 | 5.93 ± 0.04 | 1.50 ± 0.04 | 21.45 ± 0.25 | 9.2 ± 0.13
- | 9.04 ± 0.14C | 0.61 ± 0.02C | 1.24 ± 0.01C | 8.8 ± 0.08c
We compared YJS329 and BYZ1 using some of the main anti-stress indicators, including trehalose accumulation, antioxidation factors, HMF reductase, and membrane composition. YJS329 accumulated 1.29-fold (t test, P <0.05) more intracellular trehalose, a nonspecific protectant that can maintain the function of macromolecules and membrane integrity under multiple stresses (Figure 1B). Consistent with its better menadione tolerance, YJS329 showed 1.32-fold (t test, P <0.05) higher glutathione content and 5-fold (t test, P <0.001) higher catalase (CAT) activity than BYZ1. In yeast cells, glutathione and CAT are important for the elimination of the reactive oxygen species that are caused by oxidizing agents or by other stresses. HMF is formed as a result of hexose degradation during the process of lignocellulosic hydrolysis. The chemical toxicity of HMF can be reduced by HMF reductase, which converts the aldehyde functional group into an alcohol group in yeast cells [24, 25]. Compared to BYZ1, the higher intracellular HMF reductase activity (t test, P <0.05; Figure 1B) of YJS329 might partly contribute to its increased resistance to HMF. The results in Figure 1B show that, of the various membrane compounds, more ergosterol, palmitoleic acid (C16:1), oleic acid (C18:1), and linoleic acid (C18:2) were detected in YJS329 (t test, P <0.05). These findings indicated that there was significant variation in cellular components and physiological state between the YJS329 and BYZ1 strains.
Genome structure of YJS329
Whole genome sequencing of YJS329
To investigate the genetic traits of YJS329, we isolated the haploid strain YJSH1 which, under certain conditions, is indistinguishable in ethanol yields from its parent strain YJS329 (See Additional file 4), for whole genome sequencing (See Additional file 5).
Based on the consensus YJSH1 genomic sequence, 412,794 bp that were absent in YJSH1 were identified in the S288c genome and 174,269 bp that were absent in S288c were identified in the YJSH1 genome (the locations of the indels and their annotations are listed in Additional file 6). This analysis confirmed that some of the underrepresented regions in the YJS329 genome (Figure 2B) were sequences that were either lost in this industrial strain or acquired in S288c. For example, the YJS329 genome had only one copy of CUP1 and ENA1, and none of the ASP3 genes found in S288c. We also identified 21 Ty elements in the YJS329 assembly (9 Ty1, 6 Ty2, 4 Ty3, 1 Ty4, and 1 Ty5), whereas 50 Ty elements have been identified in the S288c genome. The amplification of the Ty3 elements was consistent with the results of comparative genome hybridization for YJS329 (See Additional file 2).
A total of 5,602 ORFs (common to S288c and excluding dubious ORFs) were predicted for the nuclear genome of YJS329 (the locations of the ORFs and their annotations are listed in Additional file 7). Predictions indicated that 142 ORFs had in-frame stop codons, 129 ORFs were affected by frame shifts, and 27 ORFs had lost start or stop codons because of the presence of SNPs or indels. For example, the HO gene of YJS329 had both an in-frame termination (the C at 238 bp was changed to T) and a frame shift (the C at 413 bp was missing) (verified by PCR using YJS329 DNA as the template), which explained the heterothallic life cycle of YJS329. In addition, the YJS329 genome has some ORF sequences that are not present in S288c (Additional file 7); however, nearly all of these ORFs could be found in the genomes of other S. cerevisiae strains. One such example is the ORF EPH1, which encodes an epoxide hydrolase that catalyzes the hydration of chemically reactive epoxides to their corresponding dihydrodiol products. A recent study suggested that EPH1 in the S. cerevisiae genome was the result of an introgression event from S. paradoxus and that the S. paradoxus EPH1 gene may itself be a result of horizontal transfer from bacteria.
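The ORF screening described above (in-frame stop codons and frame shifts) can be illustrated with a minimal Python sketch. It scans a coding sequence codon by codon, reports the first stop codon that occurs before the final codon, and flags a length that is no longer a multiple of three, as a single-base deletion would cause. The function names and the toy sequences are hypothetical, and the sketch assumes 1-based coordinates counted from the first base of the start codon; it is not the prediction pipeline used in the study.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codon_number(position):
    """1-based codon index that contains a 1-based nucleotide position."""
    return (position - 1) // 3 + 1


def first_premature_stop(cds):
    """Return the 1-based position of the first in-frame stop codon found
    before the final complete codon, or None if there is none."""
    cds = cds.upper()
    last_codon_start = len(cds) - len(cds) % 3 - 3
    for i in range(0, last_codon_start, 3):
        if cds[i:i + 3] in STOP_CODONS:
            return i + 1
    return None


def is_frameshifted(cds):
    """True when the sequence length is not a multiple of three."""
    return len(cds) % 3 != 0


# Toy sequences (hypothetical, not the HO gene): ATG CAA GGT TAA
reference = "ATGCAAGGTTAA"
stop_mutant = "ATGTAAGGTTAA"   # C -> T at position 4 turns CAA into TAA
deletion_mutant = "ATGCAAGTTAA"  # one G removed, length 11

print(first_premature_stop(reference))    # no premature stop
print(first_premature_stop(stop_mutant))  # position of the new stop codon
print(is_frameshifted(deletion_mutant))
print(codon_number(238))                  # codon hit by a change at 238 bp
```

Under the coordinate assumption stated above, position 238 is the first base of codon 80, which is consistent with a C-to-T change creating a stop codon, because CAA, CAG and CGA become TAA, TAG and TGA, respectively.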
Compared to the strictly diploid S. cerevisiae S288c, many industrial yeast strains display chromosomal copy number variations (CNVs). Whole-chromosome amplifications have been observed in the AWRI796, VL3, FostersO and FostersB strains. Although no large chromosomal aneuploidy or length polymorphisms were observed in the genome of YJSH1, some chromosomal rearrangement events were observed. The largest indel in the YJS329 genome was the 12.5-kb deletion in chromosome 1 (region 11,872–24,331 bp; Figure 3C). The 5’ end of chromosome 2 in YJS329 was apparently subjected to constant remodeling (verified by PCR using YJS329 DNA as the template). In this region of YJSH1 (1–23,308 bp; Figure 3D), we found two elements from the S288c genome, chromosome 10 (729,223–727,336 bp) and chromosome 3 (315,506–307,348 bp), and a region that is absent in the S288c genome (a BLASTN search showed that this region contained a MEL1 gene that has been found in S. carlsbergensis and in other S. cerevisiae strains).
Comparison of BYZ1 and YJS329 transcription using RNA-Seq
To investigate transcription differences at single-nucleotide resolution between BYZ1 and YJS329, poly(A)-enriched mRNAs from BYZ1 and YJS329 were used for high-throughput Illumina sequencing. Overall, 90.9% of the reads mapped to unique genomic regions; 81% mapped to known reference genes when 2-bp mismatches were allowed (Additional file 8). Compared to BYZ1, 888 of the YJS329 genes were up-regulated and 1,433 were down-regulated (P <0.001; Additional file 9). The functions of the up-regulated genes mainly fell within the oxidoreductase, peptidase activity and transporter-related processes categories (Additional file 10). For example, SFA1, which is involved in the detoxification of formaldehyde and in the formation of long-chain and complex alcohols [24, 27], displayed more than a 15-fold increase in mRNA abundance in YJS329. The considerable number of up-regulated genes involved in transport processes in the YJS329 sample suggested that this strain might have higher adaptability to multiple nutrient shortages than BYZ1. The down-regulated genes were mainly involved in the functional categories of DNA/protein binding, ribosome biogenesis, and structural molecules (Additional file 10).
As well as the destruction of binding motifs in transcription factors, SNPs can also create new binding motifs. The Msn2/4p and Cat8p binding sites in the promoter of SFA1 from YJS329 are examples of new motifs that may strengthen the expression of the SFA1 gene (Figure 4B and Additional file 12), which plays a role in the detoxification of furan derivatives. Indels were also important contributors to the transcriptional differences between the two strains. An obvious example in BYZ1 is the interruption of CTR3 (which encodes a high-affinity copper transporter responsible for copper uptake when environmental copper is low) by the insertion of a Ty2 element. This insertion might explain the much lower expression activity of CTR3 in BYZ1 compared to YJS329 (Figure 4C). Further, small indels in the trans-elements can directly modify mRNA expression and phenotypic traits in different strains. The down-regulated expression of ALD6 in YJS329 (whether grown in YPD medium or under fermentation conditions and verified by RT-qPCR; t test, P <0.001), a major gene in acetic-acid formation, probably resulted partly from the insertion of two bases in the Adr1p binding motif in the ALD6 promoter (Figure 4D and Additional file 12). In BYZ1, when the two copies of ALD6 were deleted, the strain produced 56% less acetic acid and 17% more glycerol under the normal fermentation conditions (Additional file 13). This result indicated that the lower expression of ALD6 in YJS329 could be one of the causes of the different patterns of by-product (acetic acid and glycerol) production in YJS329 and BYZ1. Chromosomal aneuploidy accompanied by CNVs in large DNA regions is a ubiquitous phenomenon in yeast populations [20, 31]. As indicated in Figure 4E, the expression levels of regions with CNVs apparently depended on gene dosage. The average read depth of the amplified region on chromosome 4 of BYZ1 was 1.59 times that in YJS329, close to the increased DNA dosage.
Using RNA-Seq, we detected the expression of the unique ORFs at the whole-transcription-profile level. Among these ORFs (annotation details are in Additional file 7), MEL1 had the highest RPKM; others, such as YJM-GNAT, showed minimal expression. Additional file 13 shows the expression level and boundary of the predicted ORF chr06.orf003, which provides further evidence of the existence of this novel ORF, which is absent in other S. cerevisiae strains. RT-qPCR analyses revealed that the expression of some unique ORFs depended on the growth phase and other conditions (Additional file 14). When grown in YPD medium, all five of the selected genes (especially BIO6) showed the highest expression at the exponential phase. The ORFs YJS-HE and MEL1 were significantly up-regulated under ethanol fermentation, whereas the others were down-regulated, indicating the different physiological roles of these unique genes.
Genetic breeding strategies for YJS329
More glycerol might improve the taste of alcoholic beverages but is undesirable for bioethanol production. When FPS1 (involved in efflux of glycerol; this gene showed lower expression in YJS329 compared with BYZ1) was deleted in YJS329 to produce the YJSΔFPS1 strain, the production of glycerol and acetic acid decreased and the conversion rate of glucose to ethanol improved by 1% compared with YJS329; however, the final concentration of ethanol was slightly less than in YJS329 because of the higher residual sugar in YJSΔFPS1 (t test, P <0.05; Figure 5B). Inspired by the different regulatory roles of ALD6 in YJS329 and BYZ1, we explored the possibility of further reducing the production of glycerol in YJSΔFPS1 by overexpressing ALD6. Unexpectedly, strain YJSΔFPS1ALD6 produced similar amounts of glycerol but 1.3% more ethanol (t test, P <0.05) than YJSΔFPS1 as a result of consuming more sugar than YJSΔFPS1. We found that the overexpression of ALD6 could enhance the tolerance of ethanol in both YJS329 and YJSΔFPS1 (t test, P <0.05; Figure 5C), which may explain the higher fermentation ability of strain YJSΔFPS1ALD6. In addition, the overexpression of ALD6 and deletion of FPS1 significantly improved the tolerance of lignocellulosic hydrolysate (LH, which contains the inhibitors acetic acid, furan, and 5-HMF) in YJS329 (t test, P <0.05; Figure 5C), suggesting that this strategy may be useful for breeding industrial yeast strains with the ability to increase ethanol production from lignocellulosic biomass.
The genomic structural analysis (DNA content, PFGE, and aCGH analysis) indicated that YJS329 retained a diploid karyotype and had much lower structural polymorphisms than the bioethanol strain JAY270 and some other industrial strains [1, 20]. We also sequenced the genome of YJSH2 (a haploid spore derived from the same tetrad as YJSH1) using the Illumina paired-ends method. After mapping the reads of YJSH2 to the YJSH1 genome, we estimated that the YJS329 genome had about 0.6 SNP/kb between allelic regions in homologous chromosomes (unpublished data). These results indicated that the YJS329 strain was genetically very stable, a desirable phenotype for industry practice. Although S288c has been widely used in scientific research, because of the high number of Ty elements, its genome seems to be more plastic [31, 34]. High expression activity of Ty elements in genes was confirmed in the S288c-derived strain BYZ1 as a result of a dose effect (Additional file 4). The duplicated region on chromosome 4 in BYZ1 is probably the result of chromosomal translocations by ectopic recombination mediated by the flanking Ty elements. Strikingly, no dosage-compensation mechanisms acted to normalize the expression from each gene because the higher expression (1.59-fold) of this duplicated region almost matched the higher gene dose (1.5-fold). These results indicated that spontaneous Ty-driven rearrangements could be quite common and, if ignored, could easily lead to incorrect experimental results in genetic studies, especially for the S288c-derived strains.
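The dosage argument above can be made explicit with a short calculation. The sketch below assumes that the 1.5-fold gene dose corresponds to three copies of the region against a two-copy baseline (an assumption; the text gives only the ratio) and uses a simple linear model, introduced here purely for illustration, in which a compensation value of 0 means expression scales with copy number and 1 means expression is fully buffered.

```python
def expected_expression_ratio(copies, baseline_copies, compensation=0.0):
    """Expected expression ratio under a linear dosage model.

    compensation = 0.0: expression follows gene dose exactly.
    compensation = 1.0: expression is fully buffered (ratio of 1).
    """
    dose = copies / baseline_copies
    return 1.0 + (dose - 1.0) * (1.0 - compensation)


observed = 1.59  # read-depth ratio reported for the duplicated region

no_compensation = expected_expression_ratio(3, 2)
full_compensation = expected_expression_ratio(3, 2, compensation=1.0)

# Which of the two extreme models lies closer to the observed ratio?
closer_to_no_compensation = (
    abs(observed - no_compensation) < abs(observed - full_compensation)
)
print(no_compensation, full_compensation, closer_to_no_compensation)
```

In this toy model the observed 1.59-fold ratio lies much closer to the uncompensated expectation than to the fully buffered one, which is the comparison the paragraph draws.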
Second-generation sequencing technology has proven to be an effective tool for the investigation of the genome sequences and structures of yeast strains and has provided many new insights into genome evolution and phenotypic effects [1, 17, 20, 21, 35, 36]. The level of nucleotide polymorphisms between YJSH1 and S288c (0.57%) is very similar to the level separating S288c and AWRI1631 (wine strain), YJM789 (pathogenic strain), M22 (vineyard strain) or YPS163 (oak tree strain) [21, 36], but, interestingly, YJSH1 was grouped closely with sake strains, consistent with their geographical distributions. To the best of our knowledge, YJS329 is the first bioethanol strain for which a high-quality assembled genome has been completed. The SNPs and indels that we have identified in the aligned regions of YJSH1 and S288c constitute the main genome mutations in these two strains. Mutation frequencies were found to be higher in the intergenic regions than in the coding regions: up to 40% of the SNPs and 88% of the indels were located in intergenic sequences, which account for about 27% of the genome. This pattern could arise from the sequence characteristics of intergenic regions (for example, the abundance of repeated sequences). However, we also observed a considerable number of mutations in the ORFs that play important roles in specified physiological activities. Remedying some of these mutations may improve the capabilities or change the specified phenotype of YJS329. A total of 11 ORFs were predicted in the YJS329 genome that are absent from the S288c genome. Remarkably, some of these ORFs may be very similar to those in other Saccharomyces species, including S. paradoxus, S. carlsbergensis, and S. mikatae. Therefore, during the evolution of the YJS329 genome, repeated yeast hybridization events that were followed by the gradual loss of one of the contributing genomes might have occurred.
Undoubtedly, the genotypic characteristics of YJS329 that have been revealed in the present study will enrich the genetic resources of this species, which will be valuable for breeding strains with the desired phenotypes.
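The polymorphism figures quoted in this part of the discussion can be cross-checked with a back-of-the-envelope calculation. The sketch below takes the 64,998 SNPs reported in the abstract and the 0.57% polymorphism level to estimate the aligned length they imply, and converts the intergenic shares (up to 40% of SNPs and 88% of indels in about 27% of the genome) into relative density ratios. The function names are hypothetical, and it is an assumption that the 0.57% figure is SNPs per aligned base; the text does not state the denominator.

```python
def implied_aligned_length(snp_count, snp_fraction):
    """Aligned length (bp) implied by a SNP count and a per-base SNP rate."""
    return snp_count / snp_fraction


def density_ratio(share_of_variants, share_of_genome):
    """Variant density inside a region relative to the rest of the genome."""
    inside = share_of_variants / share_of_genome
    outside = (1 - share_of_variants) / (1 - share_of_genome)
    return inside / outside


aligned_bp = implied_aligned_length(64998, 0.0057)
snp_ratio = density_ratio(0.40, 0.27)    # intergenic vs. non-intergenic SNPs
indel_ratio = density_ratio(0.88, 0.27)  # intergenic vs. non-intergenic indels

print(round(aligned_bp / 1e6, 2), "Mb aligned (implied)")
print(round(snp_ratio, 2), "x SNP density in intergenic sequence")
print(round(indel_ratio, 2), "x indel density in intergenic sequence")
```

Because the 40% and 88% shares are reported as upper bounds, the two density ratios should be read as upper estimates as well.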
The recently developed RNA-Seq approach was used to explore the transcription profiles of the YJS329 and BYZ1 S. cerevisiae strains. Among the 2,611 differentially expressed genes in these two strains, many were involved in the trehalose metabolism pathways, antioxidative factors, and membrane composition biosynthesis that are closely related to multiple stress-tolerance and fermentation characteristics. For example, consistent with the higher oleic acid content of membranes, the genes encoding the subunits of fatty acid synthetase (FAS1 and FAS2), the acetyl-CoA carboxylase gene (ACC1), and the genes that function in fatty-acid desaturation and elongation (ELO1 and OLE1) were considerably up-regulated in YJS329. Our results indicated that most of the differences in the physiological factors were consistent with the mRNA transcription differences between these two strains. Transcription-regulatory network analyses revealed that the transcription factors Msn2/4p, Hap1p, Hsf1p, and Arr1p might contribute prominently to the differentially expressed genes and phenotypic differences between the two strains. This result was consistent with the observation that trans variation is more common in expression polymorphism in yeast [37–39]. In spite of this, the contributions of cis variations to the divergence of mRNA expression and physiological metabolism should not be neglected because our results confirmed that mutations in the promoters of some important transcription factors and genes could directly affect promoter efficiency. Overall, the molecular mechanisms underlying the mRNA expression differences between YJS329 and BYZ1 might involve: (i) SNPs and indels in the cis-acting elements that affect the expression efficiency of the genes; (ii) the inactivation of transcription factors by SNPs or indels; and (iii) changes in gene copy number.
Remarkably, the discrepancies between the transcriptional profile (for example, of Hap1p) and the phenotype in the two strains might reflect variations in the activities of homologous proteins or posttranscriptional regulation, which deserve further assessment. In addition, here, for the first time, the expression activities of some novel ORFs under different conditions have been determined. Our study shows that whole-genome sequencing combined with RNA-Seq is a powerful tool for linking genotypes and phenotypes in functional genomic studies.
A thorough understanding of the genetic variations and how these variations contribute to phenotypic diversities is vital for the development of excellent yeasts for industrial applications. In this study, functional genomics has revealed the genetic characteristics of a bioethanol strain YJS329 and compared it to the laboratory strain BYZ1. From the results of this study, targeted genetic strategies for YJS329 could be constructed. These strategies might include the introduction of wild type genes to remedy deleterious mutations in some of the strains, a heightening of the effects of beneficial mutations by gene deletion or overexpression, and the expression of novel genes to obtain specified functions. We expect that functional genomics studies of industrial microorganisms, such as those reported here, will, in the future, provide more effective means of improving breeding strategies to obtain the desired production traits.
Yeast strains and culture conditions
The S288c-isogenic strain BYZ1 (MATa/MATα his3 Δ1/his3 Δ1 leu2 Δ0/leu2 Δ0 lys2 Δ0/+ met15 Δ0/+ ura3 Δ0/ura3 Δ0) was generated from a cross between BY4741 and BY4742 (gift from Oliver Valerius, University of Göttingen, Germany). The yeast strain YJS329 (CCTCC 2011275) was isolated from a soil sample and was used for bioethanol production in Henan Tianguan Group Co., Ltd., China. Strain ZTW3 is a triploid strain that is stored in our laboratory. The growth medium (YPD) contained 10 g/L yeast extract, 20 g/L peptone, and 20 g/L glucose and had a pH of 5.5.
The fermentation medium contained 10 g/L yeast extract, 20 g/L peptone, and 160 or 280 g/L glucose. Yeast cells were precultured in YPD for 20 h at 30°C and transferred to the fermentation medium with an initial OD600 of 1. Three fermentation conditions were used: (i) 160 g/L glucose at 30°C; (ii) 160 g/L glucose at 40°C; and (iii) 280 g/L glucose at 30°C. Glucose and ethanol were measured as previously described.
Analyses of physiological and biochemical factors
Yeast cells were cultured in 25 mL YPD with an initial OD600 of 0.05 and then collected at the early stationary phase (18 h; most genes involved in the stress response are induced at this phase). Trehalose, catalase, superoxide dismutase, and ergosterol were measured as previously described. Glutathione was measured using a Glutathione Assay Kit according to the manufacturer's instructions (Nanjing Jiancheng Bioengineering Institute, China). Fatty acids were extracted by the method of Hama et al. and then analyzed with a FOCUS GC Gas Chromatograph.
PFGE and Array-comparative genomic hybridization
Total genomic DNA from BYZ1 and YJS329 was isolated with the yeast DNA kit (OMEGA, GA, USA) and then sonicated. The sheared DNA (200–1000 bp) was labeled with Cy5/Cy3 and hybridized to S. cerevisiae CGH 385 K Whole-Genome Tiling Arrays (NimbleGen). Scanning was performed with the Axon GenePix 4000B Microarray Scanner (Axon, USA). Raw data were extracted as pair files using NimbleScan software. Log2-ratio data were calculated and normalized by spatial correction and qspline fit normalization. DNA segments that contained three or more continuous probes with CNVs (|Log2-ratio| ≥0.35) were considered over- or under-represented regions. The microarray data have been deposited in the NCBI Gene Expression Omnibus [GEO:GSE31872].
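The segment-calling rule stated above (three or more continuous probes with |Log2-ratio| ≥0.35) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the NimbleScan procedure: the function name is hypothetical, probes are assumed to be ordered by chromosomal position, and over- and under-represented probes are treated as separate runs, which the text does not specify.

```python
def call_cnv_segments(log2_ratios, threshold=0.35, min_probes=3):
    """Return (start, end, kind) tuples for runs of at least `min_probes`
    consecutive probes with log2 ratio >= threshold ("over") or
    <= -threshold ("under"). Indices are 0-based; `end` is exclusive."""
    segments = []
    run_start, run_kind = None, None
    # A trailing 0.0 acts as a sentinel that closes a run at the array end.
    for i, value in enumerate(list(log2_ratios) + [0.0]):
        if value >= threshold:
            kind = "over"
        elif value <= -threshold:
            kind = "under"
        else:
            kind = None
        if kind != run_kind:
            if run_kind is not None and i - run_start >= min_probes:
                segments.append((run_start, i, run_kind))
            run_start, run_kind = i, kind
    return segments


# Hypothetical probe values for illustration only.
example = [0.1, 0.4, 0.5, 0.36, 0.0, -0.4, -0.5, 0.2, -0.6, -0.7, -0.35]
segments = call_cnv_segments(example)
print(segments)
```

In this example the two-probe run at indices 5 and 6 is discarded because it is shorter than three probes, whereas the runs at indices 1 to 3 and 8 to 10 are reported.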
Whole genome sequencing and data analysis
Strain YJS329 was previously cultured in sporulation medium for 5 days, and an ascus with four ascospores was dissected to obtain four haploid strains (named YJSH1-4). YJSH1 was chosen for genome sequencing. Whole genome sequencing was performed on the 454 Life Sciences Genome Sequencer FLX (Roche) platform according to the manufacturer’s standard recommended sample preparation procedures. A shotgun sequencing library was constructed and a total of 718,904 reads were generated. 98.01% of the reads were assembled into 314 contigs using the Newbler software with the default parameters (minimum overlap length 40, minimum overlap identity 90%). The assembled sequences were manually checked, and some of the gaps were closed by Sanger sequencing reactions (contigs were first mapped to the corresponding chromosome and the sequences in gaps were amplified by PCR) to build the scaffolds. The 16 nuclear YJSH1 chromosomes were covered by 16 scaffolds including 30 contigs (Additional file 5). The sequences of the final contigs and scaffolds have been deposited with DDBJ/EMBL/GenBank under the Whole Genome Shotgun project [GenBank:AGAW00000000]. The version of the sequences described here is the first version of the sequences [GenBank:AGAW01000000].
SNPs were detected using the public BLASTN software after the YJSH1 contig sequences were aligned to the individual S288c chromosome sequences. The BLASTN parameters were adjusted as match = 4, mismatch = −5, gapopen = 3, gapextend = 5. Indels between the YJSH1 scaffolds and S288c chromosomes were detected using BLAT (with default parameters) to reveal the physical gaps. The sizes and types (deletion or insertion) of indels were identified using the block sizes, qstarts, and tstarts information in the BLAT results file. Potential ORFs were predicted in two steps: (i) direct mapping of S288c ORFs from the Saccharomyces genome database by BLAT with the match length >95%, and (ii) using the Glimmer software (with the default parameters) to predict the ORFs located in unaligned regions of the YJSH1 contigs and S288c chromosomes. The predicted ORFs were annotated by searching for their homologs in the NCBI non-redundant protein database. To predict structural variations, the YJSH1 scaffolds were aligned to the S288c chromosomes using the Artemis Comparison Tool. The YJSH1 sequences that could not be aligned to the S288c genome were then compared against the contigs in the Whole Genome Shotgun database using BLASTN. Finally, PCRs were used to verify the predicted structural variations.
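The way indel sizes and types can be read from the block sizes, qStarts and tStarts of a BLAT alignment is sketched below. A gap on only the target side between two consecutive aligned blocks means sequence is missing from the query, and a gap on only the query side means extra sequence in the query. The function names and example coordinates are hypothetical, and it is an assumption that YJSH1 was the query and S288c the target; the text does not say which sequence played which role.

```python
def parse_psl_list(field):
    """Parse a comma-separated PSL list field such as '100,50,30,'."""
    return [int(x) for x in field.rstrip(",").split(",")]


def indels_from_psl_blocks(block_sizes, q_starts, t_starts):
    """Return (kind, target_position, size) for each gap between
    consecutive aligned blocks of a single PSL record.

    Positions are 0-based target coordinates, as in the PSL format.
    """
    events = []
    for i in range(len(block_sizes) - 1):
        q_end = q_starts[i] + block_sizes[i]
        t_end = t_starts[i] + block_sizes[i]
        q_gap = q_starts[i + 1] - q_end
        t_gap = t_starts[i + 1] - t_end
        if q_gap > 0 and t_gap == 0:
            events.append(("insertion_in_query", t_end, q_gap))
        elif t_gap > 0 and q_gap == 0:
            events.append(("deletion_in_query", t_end, t_gap))
        elif q_gap > 0 and t_gap > 0:
            # Unaligned sequence on both sides: not a simple indel.
            events.append(("complex", t_end, (q_gap, t_gap)))
    return events


# Hypothetical record with three aligned blocks.
sizes = parse_psl_list("100,50,30,")
q_starts = parse_psl_list("0,100,155,")
t_starts = parse_psl_list("0,112,162,")
events = indels_from_psl_blocks(sizes, q_starts, t_starts)
print(events)
```

For alignments on the minus strand, PSL reports qStarts in reverse-complemented query coordinates, so the gap sizes computed here remain valid but query positions would need converting before being reported.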
The total RNA of each sample (three individual cultures of yeast cells) was extracted by the hot phenol method (growth conditions and time of extraction were identical to those used in the physiological factor analysis). cDNA libraries were prepared using the methods described by Pan and co-workers. The cDNA library products were sequenced on the Illumina HiSeq™ 2000. The raw Illumina sequencing data have been deposited in NCBI’s GEO database [GEO:GSE31601]. After removing reads containing sequencing adapters and reads of low quality (reads in which the percentage of low quality bases (quality value ≤5) was more than 50%), the remaining clean reads were aligned to the S. cerevisiae S288c or YJSH1 genes with SOAPAligner. The expression level was normalized by reads per kilobase of exon region per million mapped reads (RPKM). Screening of differentially expressed genes and P-value calculations were performed using the method proposed by Audic and Claverie. The accuracy of the RNA-Seq experiment was verified by RT-qPCR.
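The read filter and the RPKM normalization described above can be written down compactly. The sketch below is an illustration only: the function names and example numbers are hypothetical, the quality values are assumed to be already decoded to integers, and an empty read is rejected by convention. RPKM is computed as the read count multiplied by 10^9 and divided by the product of the gene length in bp and the total number of mapped reads.

```python
def passes_quality_filter(quality_values, low_cutoff=5, max_low_fraction=0.5):
    """Keep a read unless more than `max_low_fraction` of its bases have a
    quality value <= `low_cutoff`."""
    if not quality_values:
        return False
    low = sum(1 for q in quality_values if q <= low_cutoff)
    return low / len(quality_values) <= max_low_fraction


def rpkm(read_count, gene_length_bp, total_mapped_reads):
    """Reads per kilobase of exon region per million mapped reads."""
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


# Hypothetical example: 500 reads on a 2,000-bp gene, 10 million mapped reads.
example_rpkm = rpkm(500, 2000, 10_000_000)
print(example_rpkm)

print(passes_quality_filter([30, 30, 2, 2]))  # exactly half low quality: kept
print(passes_quality_filter([30, 2, 2, 2]))   # three quarters low: removed
```

Dividing by gene length and library size makes expression values comparable between genes of different lengths and between the BYZ1 and YJS329 libraries.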
Promoter efficiency evaluation
The promoters of HSF1 (826 bp), SFA1 (1250 bp), and ALD6 (1199 bp) from BYZ1 were cloned into the Sac I and Xho I sites upstream of the Cre gene of plasmid pSH47 [GenBank:AF298782.1]. Inverse PCR was used to introduce the sequence mutations of YJS329 shown in Figure 4. The efficiency of the promoters was evaluated by the expression activity (RT-qPCR) of the reporter gene Cre. The values were represented by the log2 ratio of YJS329/BYZ1. The primers that were used for promoter cloning and RT-qPCR are listed in Additional file 15.
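The log2 ratio used to report promoter efficiency can be illustrated with a short sketch. The text does not describe how the RT-qPCR readings were converted to expression values, so the Ct-based helper below is an assumption: it uses the common 2^-ΔCt approximation with a hypothetical internal reference gene and perfect amplification efficiency. All names and Ct values are illustrative.

```python
import math


def relative_expression_from_ct(ct_target, ct_reference_gene):
    """Relative expression as 2^-(Ct_target - Ct_reference), assuming
    100% amplification efficiency (an assumption, not from the source)."""
    return 2.0 ** (ct_reference_gene - ct_target)


def log2_promoter_ratio(variant_expr, reference_expr):
    """log2 of reporter expression driven by the YJS329-type promoter
    over expression driven by the BYZ1 promoter."""
    if variant_expr <= 0 or reference_expr <= 0:
        raise ValueError("expression values must be positive")
    return math.log2(variant_expr / reference_expr)


# Hypothetical Ct values for the Cre reporter and a reference gene.
yjs_expr = relative_expression_from_ct(ct_target=22, ct_reference_gene=18)
byz_expr = relative_expression_from_ct(ct_target=24, ct_reference_gene=18)
ratio = log2_promoter_ratio(yjs_expr, byz_expr)
print(ratio)
```

A positive value indicates that the mutated promoter drives stronger reporter expression than the BYZ1 promoter, and a negative value indicates weaker expression.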
The full-length HSF1 ORF along with 807 bp of the sequence upstream of the ORF was cloned into the CEN6 plasmid, pGFP-ble (derived from pGFP-N-FUS; the URA3 marker was replaced by ble r ). Deletion of the two copies of FPS1 in YJS329 was performed as previously described . In all cases, homozygous gene deletions were confirmed by diagnostic PCR. Overexpression of ALD6 was carried out by cloning the ALD6 ORF plus 1,005 bp of upstream sequence and 407 bp of downstream sequence into plasmid pYZ, which is derived from pYES2 (Invitrogen) but with ble r replacing the URA3 marker.
We thank Ming-Guan Feng, Hai-Chun Gao, Xiao-Hang Ma, Gen-Fu Wu and Zhen-Mei Lv (Institute of Microbiology, Zhejiang University, China); and Mu-Yuan Zhu (Institute of Genetics, Zhejiang University, China) for excellent technical assistance. This work was supported in part by a grant from the National Natural Science Foundation of China (No.31101339).
- Argueso JL, Carazzolle MF, Mieczkowski PA, Duarte FM, Netto OVC, Missawa SK, Galzerani F, Costa GGL, Vidal RO, Noronha MF, Dominska M, Andrietta MGS, Andrietta SR, Cunha AF, Gomes LH, Tavares FCA, Alcarde AR, Dietrich FS, McCusker JH, Petes TD, Pereira GAG: Genome structure of a Saccharomyces cerevisiae strain widely used in bioethanol production. Genome Res. 2009, 19 (12): 2258-2270. 10.1101/gr.091777.109.
- Kollaras A, Kavanagh JM, Bell GL, Purkovic D, Mandarakas S, Arcenal P, Ng WS, Routledge KS, Selwood DH, Koutouridis P, Paras FE, Milic P, Tirado-Escobar ES, Moore MJ, Bell PJ, Attfield PV: Techno-economic implications of improved high gravity corn mash fermentation. Bioresour Technol. 2011, 102 (16): 7521-7525. 10.1016/j.biortech.2011.04.094.
- Zheng DQ, Wu XC, Tao XL, Wang PM, Li P, Chi XQ, Li YD, Yan QF, Zhao YH: Screening and construction of Saccharomyces cerevisiae strains with improved multi-tolerance and bioethanol fermentation performance. Bioresour Technol. 2010, 102 (3): 3020-3027.
- Abdel-Banat BMA, Hoshida H, Ano A, Nonklang S, Akada R: High-temperature fermentation: how can processes for ethanol production at high temperatures become superior to the traditional process using mesophilic yeast? Appl Microbiol Biotechnol. 2010, 85 (4): 861-867. 10.1007/s00253-009-2248-5.
- Almeida JRM, Runquist D, Nogue VSI, Liden G, Gorwa-Grauslund MF: Stress-related challenges in pentose fermentation to ethanol by the yeast Saccharomyces cerevisiae. Biotechnol J. 2011, 6 (3): 286-299. 10.1002/biot.201000301.
- Nakamura T, Watanabe T, Srichuwong S, Arakane M, Tamiya S, Yoshinaga M, Watanabe I, Yamamoto M, Ando A, Tokuyasu K: Selection of stress-tolerant yeasts for simultaneous saccharification and fermentation (SSF) of very high gravity (VHG) potato mash to ethanol. Bioresour Technol. 2010, 101 (24): 9710-9714. 10.1016/j.biortech.2010.07.079.
- Goffeau A, Barrell BG, Bussey H, Davis RW, Dujon B, Feldmann H, Galibert F, Hoheisel JD, Jacq C, Johnston M, Louis EJ, Mewes HW, Murakami Y, Philippsen P, Tettelin H, Oliver SG: Life with 6000 genes. Science. 1996, 274 (5287): 546-567. 10.1126/science.274.5287.546.
- Kvitek DJ, Will JL, Gasch AP: Variations in stress sensitivity and genomic expression in diverse S. cerevisiae isolates. PLoS Genet. 2008, 4 (10): e1000223. 10.1371/journal.pgen.1000223.
- Ma MG, Liu ZL: Comparative transcriptome profiling analyses during the lag phase uncover YAP1, PDR1, PDR3, RPN4, and HSF1 as key regulatory genes in genomic adaptation to the lignocellulose derived inhibitor HMF for Saccharomyces cerevisiae. BMC Genomics. 2010, 11: 660. 10.1186/1471-2164-11-660.
- Causton HC, Ren B, Koh SS, Harbison CT, Kanin E, Jennings EG, Lee TI, True HL, Lander ES, Young RA: Remodeling of yeast genome expression in response to environmental changes. Mol Biol Cell. 2001, 12 (2): 323-337.
- Capaldi AP, Kaplan T, Liu Y, Habib N, Regev A, Friedman N, O'Shea EK: Structure and function of a transcriptional network activated by the MAPK Hog1. Nat Genet. 2008, 40 (11): 1300-1306. 10.1038/ng.235.
- Gasch AP, Spellman PT, Kao CM, Carmel-Harel O, Eisen MB, Storz G, Botstein D, Brown PO: Genomic expression programs in the response of yeast cells to environmental changes. Mol Biol Cell. 2000, 11 (12): 4241-4257.
- Hahn JS, Hu ZZ, Thiele DJ, Iyer VR: Genome-wide analysis of the biology of stress responses through heat shock transcription factor. Mol Cell Biol. 2004, 24 (12): 5249-5256. 10.1128/MCB.24.12.5249-5256.2004.
- Auesukaree C, Damnernsawad A, Kruatrachue M, Pokethitiyook P, Boonchird C, Kaneko Y, Harashima S: Genome-wide identification of genes involved in tolerance to various environmental stresses in Saccharomyces cerevisiae. J Appl Genet. 2009, 50 (3): 301-310. 10.1007/BF03195688.
- Giaever G, Chu AM, Ni L, Connelly C, Riles L, Veronneau S, Dow S, Lucau-Danila A, Anderson K, Andre B, Arkin AP, Astromoff A, El Bakkoury M, Bangham R, Benito R, Brachat S, Campanaro S, Curtiss M, Davis K, Deutschbauer A, Entian KD, Flaherty P, Foury F, Garfinkel DJ, Gerstein M, Gotte D, Guldener U, Hegemann JH, Hempel S, Herman Z, et al: Functional profiling of the Saccharomyces cerevisiae genome. Nature. 2002, 418 (6896): 387-391.
- Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA: Transcriptional regulatory code of a eukaryotic genome. Nature. 2004, 431 (7004): 99-104. 10.1038/nature02800.
- Liti G, Carter DM, Moses AM, Warringer J, Parts L, James SA, Davey RP, Roberts IN, Burt A, Koufopanou V, Tsai IJ, Bergman CM, Bensasson D, O'Kelly MJT, van Oudenaarden A, Barton DBH, Bailes E, Ba ANN, Jones M, Quail MA, Goodhead I, Sims S, Smith F, Blomberg A, Durbin R, Louis EJ: Population genomics of domestic and wild yeasts. Nature. 2009, 458 (7236): 337-341. 10.1038/nature07743.
- Dowell RD, Ryan O, Jansen A, Cheung D, Agarwala S, Danford T, Bernstein DA, Rolfe PA, Heisler LE, Chin B, Nislow C, Giaever G, Phillips PC, Fink GR, Gifford DK, Boone C: Genotype to phenotype: a complex problem. Science. 2010, 328 (5977): 469-469. 10.1126/science.1189015.
- Akao T, Yashiro I, Hosoyama A, Kitagaki H, Horikawa H, Watanabe D, Akada R, Ando Y, Harashima S, Inoue T, Inoue Y, Kajiwara S, Kitamoto K, Kitamoto N, Kobayashi O, Kuhara S, Masubuchi T, Mizoguchi H, Nakao Y, Nakazato A, Namise M, Oba T, Ogata T, Ohta A, Sato M, Shibasaki S, Takatsume Y, Tanimoto S, Tsuboi H, Nishimura A, et al: Whole-genome sequencing of sake yeast Saccharomyces cerevisiae Kyokai no. 7. DNA Res. 2011, 18 (6): 423-434. 10.1093/dnares/dsr029.
- Borneman AR, Desany BA, Riches D, Affourtit JP, Forgan AH, Pretorius IS, Egholm M, Chambers PJ: Whole-genome comparison reveals novel genetic elements that characterize the genome of industrial strains of Saccharomyces cerevisiae. PLoS Genet. 2011, 7 (2): e1001287. 10.1371/journal.pgen.1001287.
- Borneman AR, Forgan AH, Pretorius IS, Chambers PJ: Comparative genome analysis of a Saccharomyces cerevisiae wine strain. FEMS Yeast Res. 2008, 8 (7): 1185-1195. 10.1111/j.1567-1364.2008.00434.x.
- Gancedo C, Flores CL: The importance of a functional trehalose biosynthetic pathway for the life of yeasts and fungi. FEMS Yeast Res. 2004, 4 (4–5): 351-359.
- Ikner A, Shiozaki K: Yeast signaling pathways in the oxidative stress response. Mutat Res. 2005, 569 (1–2): 13-27.
- Petersson A, Almeida JRM, Modig T, Karhumaa K, Hahn-Hägerdal B, Gorwa-Grauslund MF, Liden G: A 5-hydroxymethyl furfural reducing enzyme encoded by the Saccharomyces cerevisiae ADH6 gene conveys HMF tolerance. Yeast. 2006, 23 (6): 455-464. 10.1002/yea.1370.
- Liu ZL: Molecular mechanisms of yeast tolerance and in situ detoxification of lignocellulose hydrolysates. Appl Microbiol Biotechnol. 2011, 90 (3): 809-825. 10.1007/s00253-011-3167-9.
- Dunn B, Richter C, Kvitek DJ, Pugh T, Sherlock G: Analysis of the Saccharomyces cerevisiae pan-genome reveals a pool of copy number variants distributed in diverse yeast strains from differing industrial environments. Genome Res. 2012, 22 (5): 908-924. 10.1101/gr.130310.111.
- Wehner EP, Rao E, Brendel M: Molecular-structure and genetic-regulation of Sfa, a gene responsible for resistance to formaldehyde in Saccharomyces-cerevisiae, and characterization of its protein product. Mol Gen Genet. 1993, 237 (3): 351-358.
- Gaisne M, Bécam AM, Verdiere J, Herbert CJ: A 'natural' mutation in Saccharomyces cerevisiae strains derived from S288c affects the complex regulatory gene HAP1 (CYP1). Curr Genet. 1999, 36 (4): 195-200. 10.1007/s002940050490.
- Peña MMO, Puig S, Thiele DJ: Characterization of the Saccharomyces cerevisiae high affinity copper transporter Ctr3. J Biol Chem. 2000, 275 (43): 33244-33251. 10.1074/jbc.M005392200.
- Knight SA, Labbe S, Kwon LF, Kosman DJ, Thiele DJ: A widespread transposable element masks expression of a yeast copper transport gene. Genes Dev. 1996, 10 (15): 1917-1929. 10.1101/gad.10.15.1917.
- Chan JE, Kolodner RD: A genetic and structural study of genome rearrangements mediated by high copy repeat Ty1 elements. PLoS Genet. 2011, 7 (5): e1002089. 10.1371/journal.pgen.1002089.
- Wei W, McCusker JH, Hyman RW, Jones T, Ning Y, Cao Z, Gu Z, Bruno D, Miranda M, Nguyen M, Wilhelmy J, Komp C, Tamse R, Wang X, Jia P, Luedi P, Oefner PJ, David L, Dietrich FS, Li Y, Davis RW, Steinmetz LM: Genome sequencing and comparative analysis of Saccharomyces cerevisiae strain YJM789. Proc Natl Acad Sci USA. 2007, 104 (31): 12825-12830. 10.1073/pnas.0701291104.
- Eastmond DL, Nelson HCM: Genome-wide analysis reveals new roles for the activation domains of the Saccharomyces cerevisiae heat shock transcription factor (Hsf1) during the transient heat shock response. J Biol Chem. 2006, 281 (43): 32909-32921. 10.1074/jbc.M602454200.
- Mieczkowski PA, Lemoine FJ, Petes TD: Recombination between retrotransposons as a source of chromosome rearrangements in the yeast Saccharomyces cerevisiae. DNA Repair. 2006, 5 (9–10): 1010-1020.
- Novo M, Bigey F, Beyne E, Galeote V, Gavory F, Mallet S, Cambon B, Legras JL, Wincker P, Casaregola S, Dequin S: Eukaryote-to-eukaryote gene transfer events revealed by the genome sequence of the wine yeast Saccharomyces cerevisiae EC1118. Proc Natl Acad Sci USA. 2009, 106 (38): 16333-16338. 10.1073/pnas.0904673106.
- Doniger SW, Kim HS, Swain D, Corcuera D, Williams M, Yang SP, Fay JC: A catalog of neutral and deleterious polymorphism in yeast. PLoS Genet. 2008, 4 (8): e1000183. 10.1371/journal.pgen.1000183.
- Emerson JJ, Hsieh LC, Sung HM, Wang TY, Huang CJ, Lu HH, Lu MY, Wu SH, Li WH: Natural selection on cis and trans regulation in yeasts. Genome Res. 2010, 20 (6): 826-836. 10.1101/gr.101576.109.
- Yvert G, Brem RB, Whittle J, Akey JM, Foss E, Smith EN, Mackelprang R, Kruglyak L: Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat Genet. 2003, 35 (1): 57-64.
- Brem RB, Yvert G, Clinton R, Kruglyak L: Genetic dissection of transcriptional regulation in budding yeast. Science. 2002, 296 (5568): 752-755. 10.1126/science.1069516.
- Hama S, Yamaji H, Kaieda M, Oda M, Kondo A, Fukuda H: Effect of fatty acid membrane composition on whole-cell biocatalysts for biodiesel-fuel production. Biochem Eng J. 2004, 21 (2): 155-160. 10.1016/j.bej.2004.05.009.
- Tao XL, Zheng DQ, Liu TZ, Wang PM, Zhao WP, Zhu MY, Jiang XH, Zhao YH, Wu XC: A Novel Strategy to Construct Yeast Saccharomyces cerevisiae Strains for Very High Gravity Fermentation. PLoS One. 2012, 7 (2): e31235. 10.1371/journal.pone.0031235.
- Argueso JL, Westmoreland J, Mieczkowski PA, Gawel M, Petes TD, Resnick MA: Double-strand breaks associated with repetitive DNA can reshape the genome. Proc Natl Acad Sci USA. 2008, 105 (33): 11845-11850. 10.1073/pnas.0804529105.
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.
- Kent WJ: BLAT - The BLAST-like alignment tool. Genome Res. 2002, 12 (4): 656-664.
- Delcher AL, Bratke KA, Powers EC, Salzberg SL: Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics. 2007, 23 (6): 673-679. 10.1093/bioinformatics/btm009.
- Carver TJ, Rutherford KM, Berriman M, Rajandream MA, Barrell BG, Parkhill J: ACT: the Artemis comparison tool. Bioinformatics. 2005, 21 (16): 3422-3423. 10.1093/bioinformatics/bti553.
- Wang B, Guo GW, Wang C, Lin Y, Wang XN, Zhao MM, Guo Y, He MH, Zhang Y, Pan L: Survey of the transcriptome of Aspergillus oryzae via massively parallel mRNA sequencing. Nucleic Acids Res. 2010, 38 (15): 5075-5087. 10.1093/nar/gkq256.
- Li RQ, Yu C, Li YR, Lam TW, Yiu SM, Kristiansen K, Wang J: SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics. 2009, 25 (15): 1966-1967. 10.1093/bioinformatics/btp336.
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008, 5 (7): 621-628. 10.1038/nmeth.1226.
- Audic S, Claverie JM: The significance of digital gene expression profiles. Genome Res. 1997, 7 (10): 986-995.
- Zheng DQ, Wu XC, Wang PM, Chi XQ, Tao XL, Li P, Jiang XH, Zhao YH: Drug resistance marker-aided genome shuffling to improve acetic acid tolerance in Saccharomyces cerevisiae. J Ind Microbiol Biot. 2011, 38 (3): 415-422. 10.1007/s10295-010-0784-8.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.