Genome-wide transcriptional response of Trichoderma reesei to lignocellulose using RNA sequencing and comparison with Aspergillus niger
© Ries et al.; licensee BioMed Central Ltd. 2013
Received: 5 April 2013
Accepted: 6 August 2013
Published: 9 August 2013
A major part of second generation biofuel production is the enzymatic saccharification of lignocellulosic biomass into fermentable sugars. Many fungi produce enzymes that can saccarify lignocellulose and cocktails from several fungi, including well-studied species such as Trichoderma reesei and Aspergillus niger, are available commercially for this process. Such commercially-available enzyme cocktails are not necessarily representative of the array of enzymes used by the fungi themselves when faced with a complex lignocellulosic material. The global induction of genes in response to exposure of T. reesei to wheat straw was explored using RNA-seq and compared to published RNA-seq data and model of how A. niger senses and responds to wheat straw.
In T. reesei, levels of transcript that encode known and predicted cell-wall degrading enzymes were very high after 24 h exposure to straw (approximately 13% of the total mRNA) but were less than recorded in A. niger (approximately 19% of the total mRNA). Closer analysis revealed that enzymes from the same glycoside hydrolase families but different carbohydrate esterase and polysaccharide lyase families were up-regulated in both organisms. Accessory proteins which have been hypothesised to possibly have a role in enhancing carbohydrate deconstruction in A. niger were also uncovered in T. reesei and categories of enzymes induced were in general similar to those in A. niger. Similarly to A. niger, antisense transcripts are present in T. reesei and their expression is regulated by the growth condition.
T. reesei uses a similar array of enzymes, for the deconstruction of a solid lignocellulosic substrate, to A. niger. This suggests a conserved strategy towards lignocellulose degradation in both saprobic fungi. This study provides a basis for further analysis and characterisation of genes shown to be highly induced in the presence of a lignocellulosic substrate. The data will help to elucidate the mechanism of solid substrate recognition and subsequent degradation by T. reesei and provide information which could prove useful for efficient production of second generation biofuels.
KeywordsTrichoderma reesei Aspergillus niger Glycoside hydrolases Carbohydrate esterases Antisense RNA RNA-sequencing
The burning of fossil fuels is accompanied by many recognised problems affecting the global economy and environmental health. Replacing fossil fuels with biofuels will help to reduce global CO2 emissions, produce a more favourable greenhouse gas profile, decrease dependence on diminishing oil resources and promote local economies [1, 2]. The production of biofuels from plant biomass such as grasses , woods [4, 5] and lignocellulosic wastes , does not compete with food production and is called second generation (2G) biofuel .
In nature, the degradation of lignocellulosic biomass is catalysed by enzymes from various microorganisms such as saprobic fungi and bacteria . Enzymes for biomass deconstruction, and many other industrial applications, are often derived from fungi from the Trichoderma and Aspergillus genera . The overall aim of this study was to investigate the strategy that Trichoderma reesei employs to degrade complex lignocellulosic polysaccharides and compare this to the mechanism used by Aspergillus niger in order to provide novel insights which may prompt the development of new approaches for the production of 2G biofuels.
Trichoderma spp. and Aspergillus spp. have many industrial applications due to their production of very high levels of secreted enzymes [1, 3, 8]. This has led to the development of a wide selection of genetic tools in T. reesei, including random and targeted mutagenesis to create cellulase hyper-producing mutants [9, 10], to elucidate regulatory mechanisms of pathways concerning the metabolism of simple sugars [11, 12] and to target the T. reesei secretion system in order to produce higher protein yields through engineering more efficient and thermostable enzymes [2, 9, 13–17]. To date, the genome of T. reesei has been found to encode a total of 228 polysaccharide-degrading enzymes that represent 61 enzyme families . This is similar to the total number (ca. 280) of carbohydrate-degrading enzymes in A. niger.
This study characterises the transcriptional changes associated with exposure to wheat straw using Next Generation RNA sequencing (RNA-seq) technology, with the aim of gaining an understanding of the steps leading to the deconstruction this complex lignocellulosic substrate. Comparing them to the mechanisms employed by A. niger previously described , will uncover relevant differences and similarities in lignocellulose degradation between both industrially important organisms. The cost of enzymes presents a major challenge in the cost-effectiveness of biofuel production , and the cost of enzymes can be reduced by a combination of factors. Firstly, the yield of enzymes from the fungal source should be maximised and, secondly, the most effective mix of functionalities is required. Other aspects such as the site of production of the enzymes are also important. Ground and autoclaved, but otherwise un-treated, wheat straw was used in this study but a variety of pre-treatments are possible when digesting lignocellulose for the generation of 2G biofuels. That will inevitably alter the fungal responses but an understanding of the mechanistic basis of those responses requires a base-line study with non-pre-treated material, as described here.
Results and discussion
The wheat straw –induced transcriptome of T. reesei QM6a
The T. reesei genome is 33.9 Mb in size with 9,126 predicted genes . Transcriptomes were sequenced from replicated independent cultures under 3 different sets of conditions: 1) after growth from conidia for 48 h in the presence of glucose as sole carbon source, a monosaccharide which represses expression of many genes involved in plant cell wall degradation, 2) 24 h after transfer of washed mycelia from 1) into media containing ground wheat straw as the sole carbon source to monitor the induction of genes involved in polysaccharide deconstruction and 3) 5 h after addition of glucose to the straw cultures from 2) to determine genes responsive to carbon catabolite repression. The ball milled wheat straw used in this study contained 37% cellulose, 32% hemicelluloses, 22 ± 0.1% lignin and was 25% crystalline . Statistical tests [21–23] were applied to enable us to identify all genes which were significantly differentially expressed (p-value of <0.001 for all three tests) between the three conditions studied (see Additional file 1). RPKM values were calculated for each of the biological replicates as well as for the combined mapping of all replicates at 48 h, 24 h straw and 5 h glucose (see Additional file 1). The results shown in this study are from the combined mapping scores and only inductions showing a significant score in all statistical tests are discussed.
Expression of CAZy genes in T. reesei and comparison with A. niger
The degradation of plant cell wall carbohydrates is mediated by enzymes of four different classes: the carbohydrate esterases (CEs), the polysaccharide lyases (PLs), the glycoside hydrolases (GHs) and the auxiliary activities (AAs). These enzymes are classed, based on their primary amino acid sequence and related activity, into families in the Carbohydrate Active Enzyme database (CAZy) (http://www.cazy.org) . Analysis of the T. reesei QM6a genome identified 22 CE-encoding genes representing 8 families, 5 PL-encoding genes representing 3 families, 195 GH-encoding genes, representing 49 families and 6 AA-encoding genes, representing 1 family . The A. niger ATCC 1015 genome contains 25 CEs representing 9 families, 8 PLs representing 2 families, 239 predicted GHs representing 50 families and 7 AAs representing 1 family . There are differences in the families of CEs, PLs and GHs encoded by the genomes of both fungi [18, 20, 25]. PLs are not as important as GHs and CEs for wheat straw degradation, as PLs mainly target pectin, a structure which is also degraded by enzymes of many GH families, including GH family 28 . The family of AAs encoded by the genomes of T. reesei and A. niger (AA family 9) were formerly known as GH61s but were shown to be copper-dependent oxidases and have a different catalytic mechanism to the GHs . Enzymes of AA family 9 play important accessory roles in enhancing lignocellulose degradation .
Carbohydrate esterases which play a role in lignocellulose degradation and which are encoded by T. reesei but not by A. niger belong to CE family 15. The genome of T. reesei encodes one CE family 15 glucuronoyl esterase [JGI:123940], also known as CIP2, which contains a cellulose binding module (CBM1) and which plays an important role in dissociating lignin from hemicelluloses through targeting the ester bonds between the aromatic alcohols of lignin and the glucuronic acid residues from the xylose backbone in hemicelluloses . CE families 8 and 12 are present in A. niger but not in T. reesei[18, 25] and the genome of A. niger encodes 3 CE family 8 pectin methylesterases involved in the de-esterification of pectin , and two CE family 12 rhamnogalacturonan acetylesterases involved in the deconstruction of plant cell wall pectin .
Glycoside hydrolases involved in lignocellulose deconstruction and encoded by T. reesei and not A. niger belong to GH families 39, 115 and 45 and assist in the degradation of xylan (GHs 39, 115) and cellulose (GH 45) . In A. niger, proteins from GH families 26 and 51, which are not encoded by T. reesei, are involved in the degradation of hemicellulosic mannan and arabinan residues . Furthermore, the genome of A. niger encodes proteins of GH families 53 and 88, which are involved in the degradation of pectin [25, 26, 32], inulinases and invertases belonging to GH family 32 and which degrade polysaccharides containing fructose and sucrose .
Straw-induced CAZy genes
48 h Glucose RPKM
24 h Straw RPKM
5 h Glucose RPKM
Acetyl xylan esterase, axe1
Copper-dependent monooxygenase, cel61a
Swollenin, contains CBM1, swo1
Copper-dependent monooxygenase, cel61b
Acetyl Esterase, aes1
Candidate cellulose binding protein, CBM1, cip1
Acetyl xylan esterase
Candidate exo-rhamnogalacturonase, rgx1
A;lpha-L-arabinofuranosidase I, abf1
Glucuronoyl esterase, cip2
Candidate endo-β-1,4-xylanase, xyn5
Although both fungi seem to use a similar array of GHs, they up-regulate the expression of different CE-encoding genes. Whereas transcripts from CE family 1 were most abundant in A. niger (~10% of total CAZy mRNA), transcript levels of genes encoding proteins in CE family 5 were highest in T. reesei (~5% of total CAZy mRNA). This is an agreement with previous microarray studies . The genome of T. reesei encodes 3 acetyl xylan esterases and 1 cutinase, all belonging to CE family 5 . The highest expression values were recorded for 2 acetyl xylan esterases [JGI:73632, JGI:54219] one of which contains a CBM1 module [JGI:73632]. Acetyl xylan esterases remove acetyl groups at O-2 and O-3 positions of the xylose chain in arabinoglucuronoxylans , a process which has been shown to significantly enhance subsequent hydrolysis of xylans and cellulose . The genome of A. niger encodes 3 CE family 1 members: one acetyl xylan esterase, one feruloyl esterase and one un-defined esterase . Expression values of the acetyl xylan esterase and the feruloyl esterase were very high in the presence of straw . Feruloyl esterases cleave ferulic acid groups which are esterified to the 5′-OH of arabinofuranosyl groups (arabinose residues linked to O-2 or O-3 of xylose) and which can be covalently linked to lignin or other ferulic acid groups in xylans . It appears that the enzyme mix secreted by A. niger aids in loosening the lignin-hemicellulose structure in addition to de-acetylating the xylan backbone in order to allow access of other CAZymes to the underlying hemicellulose and cellulose polysaccharides. Thus the bulk of GH and AA enzymes used to degrade straw are from the same GH and AA families in both organisms, whereas different CE family members are used suggesting that both fungi specialised also in the cleavage of different bonds found within plant cell walls.
Not surprisingly, transcript abundance of PL-encoding genes was very low in both fungi (~0.012% of total CAZy mRNA in T. reesei and ~0.5% of total CAZy mRNA in A. niger), confirming that PLs do not play an important role in wheat straw degradation (as mentioned above). Similar results were obtained through previous microarray studies .
T. reesei also highly induced the transcription of genes encoding proteins other than hydrolases, which have been proposed to be involved in enhancing cellulose degradation (Table 1). One such enzyme is the expansin-like swollenin, swo1 (Table 1), thought to play a role in the loosening of the plant cell wall by disrupting hydrogen bonds between plant polysaccharides, thus increasing cell wall area and access of hydrolytic enzymes (such as cellulases) to the underlying polymers . Another enzyme, CIP1 (Table 1), which contains a CBM belonging to family 1, is thought to enhance cellulose hydrolysis . Transcript levels of cip1 were also detected in the presence of sophorose and regulation of this gene is the same as for other well characterised cellulases (e.g. cbh1), indicating a potential role for this protein in cellulose degradation . Our results show that genes encoding both enzymes are highly induced in the presence of straw and that they are regulated similarly to CAZy enzymes-encoding genes, suggesting that these enzymes could be key players in wheat straw degradation. Genes encoding CIP1 and swollenins are absent from the genome of A. niger.
Addition of glucose to the straw cultures exerted strong carbon catabolite repression of the CAZy-encoding genes, and CAZy transcript abundance decreased to 0.82% in T. reesei (1.4% in A. niger, Figure 1) of the total cellular mRNA, with members from GH families 16, 17, 67 and 72 (glucosidases, glucuronidase and glucanosyltransferases) (and GH15 in A. niger) being the most expressed CAZy genes under this condition in T. reesei (Figure 2).
Expression of non-CAZy genes in T. reesei and comparison with A. niger
Straw-induced non-CAZy genes
48 h Glucose RPKM
24 h Straw RPKM
5 h Glucose RPKM
Ceramidase family protein, associated to cellulase signal transduction
Surface interacting proteins
HFB2, class II hydrophobin
QI74 orthologue, cell wall protein
HFB3, class II hydrophobin
Cell wall protein, contains HsbA conserved domain
Cell wall protein with a CFEM domain
Enzymes of carbon and nitrogen metabolism
Xylose reductase XYL1
Xylitol dehydrogenase XDH1
L-arabinitol 4-dehydrogenase LAD1
MFS (major facilitator superfamily) permease
Predicted MFS transporter
SNF2 family helicase
Peptidyl arginine deiminase
C2H2 transcriptional regulator
Alcohol oxidase AOX1
In T. reesei, one lipase-encoding gene had a similar transcriptional pattern to many GH and CE-encoding genes. The gene contains a secretion signal and encodes a protein belonging to the neutral/alkaline non-lysosomal ceramidase family which hydrolyse the sphingolipid ceramide into sphingosine and free fatty acid, bioactive lipids serving as cellular messengers . The alkaline ceramidase described here is thought to be involved in cellulase signal transduction . NCBI BLASTP analysis revealed the protein encoded by this gene [JGI:64397] to be highly similar to a ceramidase [JGI:120161] in A. niger, which was also highly induced in the presence of straw . It is possible that this ceramidase is secreted with, and maybe regulated in a similar manner to the CAZy enzymes and participates in wheat straw deconstruction, although expression of this gene in the presence of other lignocellulosic substrates requires further characterisation. This gene presents an interesting candidate for future studies on enzymes which could be involved in plant cell wall degradation.
Hydrophobic surface interacting proteins
Five genes encoding two hydrophobins and three cell wall proteins were induced in T. reesei when switching from glucose to straw and all genes were repressed when glucose was added to the cultures. These genes have a transcriptional profile similar to many genes of the CAZy group. Hydrophobins can adsorb to hydrophobic surfaces and to interfaces between hydrophobic and hydrophilic phases, thus mediating the interaction of the fungus with its environment [42, 43]. In Aspergillus oryzae, the hydrophobin RolA recruits the esterase CutL to the synthetic polyester polybutylene succinate-co-adipate (PBSA) and promotes its degradation . A similar role for the two highly expressed hydrophobins of T. reesei could therefore be envisaged, as has been proposed for two A. niger hydrophobin-encoding genes which were also highly induced in the same conditions . The other three highly induced cell wall protein-encoding genes are likely to also have a role in mediating interactions of the fungus with a solid substrate. The QI74 cell wall protein-encoding gene has been shown to be induced in Trichoderma harzianium when replacing glucose medium with chitin, simulating mycoparasitic conditions . NCBI BLASTP analysis of the protein encoded by the gene with transcript ID 104277 (Table 2) revealed the presence of putative conserved domain which is also found in the HsbA (hydrophobic surface binding protein A) superfamily; an enzyme family which differs in structure and mechanism of surface interaction, when compared to hydrophobins . In A. oryzae, HsbA also recruits the cutinase CutL to PBSA to promote its degradation . The third gene [JGI:124295] described here, encodes a cell wall protein with a predicted extracellular CFEM (conserved fungal specific extracellular membrane-spanning) domain. Some Pth11-like GPCRs (G-protein coupled receptors) have associated CFEM domains and are important for sensing surface cues [45, 46]. Pth11-like GPCRs were first described in Magnaporthe grisea, where they play an essential role in plant host invasion and pathogenicity [45, 46]. The observation of the induction of hydrophobic surface interacting protein-encoding genes in both T. reesei and A. niger suggests that the recognition of solid surfaces is an important step in the fungal response to the plant cell wall and further experiments are required to confirm this. Nevertheless, genes encoding hydrophobic surface interacting proteins present interesting candidates which potential roles in enhancing lignocellulose degradation.
Eleven genes, encoding seven transporters of the m ajor f acilitator s uperfamily (MFS), one xylose transporter, two oligo-peptide transporters and one iron transporter were highly transcribed in straw and repressed in glucose-rich conditions. The MFS superfamily is a large family of transporters which can be divided into a further 17 families of which families 1, 5 and 7 mediate monosaccharide (hexoses, pentoses) and oligosaccharide transport into the cell by coupling it to proton symport or antiport . The first gene listed in Table 2 [JGI:3405] is possibly involved in hexose and disaccharide transport as NCBI BLASTP analysis of the encoded protein [JGI:3405] revealed 75% amino acid sequence identity to a hexose transporter from Glomerella graminicola and 74% identity to a lactose permease from Verticillium dahlia. This transporter may belong to family 1 of MFS transporters which couple sugar uptake to proton symport and which are involved in the uptake of galactose, quinate, lactose, maltose and α-glycosides . NCBI BLASTP analysis of the protein [JGI:50894] encoded by the second gene listed in Table 2 [JGI:50894] revealed 76% sequence identity to a high affinity glucose transporter from Gaeumannomyces graminis and it may belong to family 7 of MFS transporters which couple hexose to proton import and are involved in the uptake of fucose, galactose and glucose . The induction of transporter-encoding genes after 24 h in the presence of straw indicates that the cellulose and hemicellulose fractions of the wheat straw are being degraded, subsequently releasing simple sugars which are taken up by the fungus.
As in A. niger, genes of the xylose utilisation pathway such as xylose reductase XYL1 (Table 2) and xylitol dehydrogenase XDH1 (Table 2) were up-regulated more than 20-fold (Table 2) when switching from glucose to straw. XYL1 also reduces the pentose sugar L-arabinose to L-arabinitol which is then further oxidised to L-xylulose by the L-arabinitol dehydrogenase LAD1 (Table 2) . In addition, LAD1 is also involved in the alternative pathway of D-galactose metabolism whereas aldose-1-epimerase (Table 2) catalyses the first step of the Leloir pathway of D-galactose catabolism . This indicates that after 24 h, T. reesei had internalised hemicellulosic sugars such as xylose, arabinose and galactose which had then entered their respective metabolic pathways.
Regulation of gene expression can also occur at the post-transcriptional level through regulatory RNAs. Natural antisense transcripts (NATs) are non-protein coding, fully processed mRNAs, which can partially overlap the protein-coding transcripts and which have many regulatory roles . NATs have been found in several fungi including A. niger and N. crassa[3, 52]. To calculate the levels within our T. reesei transcriptomes, the number of reads corresponding to the non-coding strand was counted for each gene and AS (antisense) RPKM values were calculated in each condition. Approximately 1.82%, 1.47% and 2.79% of all reads in T. reesei were antisense reads when mycelia were grown in 48 h glucose, 24 h straw and 24 h straw + 5 h glucose respectively. Thus, AS transcription was detected in T. reesei and 630 genes had an AS RPKM > 1 in at least one condition (see Additional file 3). The 630 genes encoded proteins involved in a wide variety of cellular functions. The% of AS transcripts was similar in T. reesei when compared to A. niger.
The ratio of antisense:sense expression under glucose 48 h and straw 24 h conditions was calculated for these 630 genes in order to find genes with AS transcripts which change between the two conditions. Most of these genes have a low AS/S ratio on 24 h straw indicating that sense transcripts dominate over AS reads in this condition.
This study explored the mechanisms used by T. reesei to degrade an industrially-relevant substrate (wheat straw) for the generation of biofuels and made comparison with the strategies employed by A. niger. Both fungi encode a mix of predicted GHs, CEs and AAs and some families are species-specific. In the presence of wheat straw, many of the CAZy protein-encoding genes were up-regulated in the two fungi and repressed in both glucose conditions. T. reesei and A. niger use a set of core enzymes from the same GH (3, 7, 11, 30 and 67) and AA (9) families but from different CE families for wheat straw degradation. The amount of CAZy mRNA (as a proportion of total cellular RNA) in T. reesei after 24 h incubation in straw was less than that in A. niger (number of CAZy-encoding genes represent ~ 2.5% of the coding genome in both organisms). Thirty-two genes encoding non-CAZy enzymes in T. reesei had an expression pattern similar to the CAZy-encoding genes. The majority of these genes could be classed into functional categories which were also described in A. niger, suggesting a similar approach for both fungi to the degradation of a solid, lignocellulosic substrate. Furthermore, as described in A. niger, the existence of NATs (a type of regulatory RNA) were also shown to be present in T. reesei. This presents an interesting area for future research as regulatory RNAs may be involved in the post-transcriptional regulation of genes encoding enzymes involved in carbohydrate degradation. The use of Next Generation RNA sequencing has allowed us to gain preliminary insights into the global gene expression profile in response to a complex lignocellulosic substrate and to identify genes in T. reesei, which encode enzymes which have previously not been associated to lignocellulose deconstruction. Furthermore we have confirmed the presence of a type of regulatory RNAs which have not previously been described in T. reesei yet.
Strains and growth conditions
T. reesei QM6a [53, 54] was used throughout this project. Conidia were produced from glycerol stocks of T. reesei grown on potato dextrose agar medium (PDA: 4.0 g/L potato extract, 15.0 g/L agar, 20.0 g/L) at 28°C. Conidia were harvested with 2 ml 0.01% (w/v) Tween 80. Liquid batch cultures were inoculated at a concentration of 105 spores/mL.
Strains were cultured in 100 ml Trichoderma Minimal Media [TMM: 15 g/L KH2PO4, 5 g/L (NH4)2SO4, 10 g/L of the respective carbon source, 0.005 g/L FeSO4.7H20, 0.0016 g/L MnSO4.H20, 0.0014 g/L Zn.SO4.H20, 0.0037 g/L CoCl12.6H20, 0.6 g/L MgSO4, 0.6 g CaCl2] in 250 mL Erlenmeyer flasks at 28°C, shaking at 150 rpm. Mycelia were grown for 48 h in 1% (w/v) glucose, after which they were removed by filtration through Miracloth (Merck), washed with double-distilled water (ddH2O), and transferred to fresh media supplemented with the relevant carbon source at 1% w/v. Three different sets of conditions were distinguished: 1) growth from conidia for 48 h in the presence of glucose as the sole carbon source (48 h glucose), 2) 24 h after transfer of washed mycelia from 1) into media containing ground wheat straw as the sole carbon source (24 h straw) and 3) 5 h after addition of glucose to the straw cultures from 2) (5 h glucose). Transcriptomes were analysed from triplicate cultures of 48 h glucose and 24 h straw and from duplicate cultures of 5 h glucose.
Ball milling, sugar, lignin and crystallinity analysis of the wheat straw used in this study can be found elsewhere .
Mycelia from each condition were snap-frozen and ground to a fine powder under liquid N2 using a mortar and pestle. 100 mg of mycelial powder was used for RNA extraction, the procedure of which was described elsewhere . Briefly, total RNA was extracted from mycelial powder using TriZol reagent (Invitrogen) according to manufacturer’s instructions. Extracted RNA was purified using the Qiagen RNeasy Mini Kit following the manufacturer’s instructions of the RNA clean-up protocol with on-column DNA digestion.
RT-PCR and qRT-PCR
The synthesis of cDNA was carried out as described by . PCR reactions were performed using RedTaq DNA Polymerase (Sigma) and 1 μL of cDNA in a 20 μL reaction. PCR conditions were 30 cycles of denaturation at 94°C for 1 min, annealing at 60°C for 2 min and extension at 72°C for 3 min. Strand-specific PCRs (ssPCRs; as described in ) were carried out at an annealing temperature of 64°C using the primer pairs indicated in the Additional file 4.
qRT-PCR reactions were run using the same system as described by  and carried out for 40 cycles with denaturation at 95°C for 30 s and annealing at 64°C for 30 s and extension at 60°C for 60 s. Briefly, qRT-PCR reactions were run in triplicates per gene in each condition in a total reaction volume of 20 μL containing 11.0 μl Fast SYBR green master mix (Applied Biosystems), 0.11 μL/primer (175 nM final concentration), 2.2 μL of cDNA and 8.58 μL ddH2O. Gene expression values were calculated against a standard curve of known genomic DNA concentrations. All primer pair sequences are listed in the Additional file 4.
RNA-seq and subsequent data analysis
Ribosomal RNA was degraded in 10 μg of total RNA using the Ribominus Eukaryotic kit (Invitrogen). SOLiD whole transcriptome libraries were made according to the SOLiD Whole transcriptome kit protocol (Applied Biosystems) and the library concentrations were measured with the Quant-it HS dsDNA assay kit (Invitrogen). Libraries were pooled to equimolar amounts (Invitrogen) and gel purified using 2% size-select E-gels to 200–300 bp (Invitrogen). Emulsion PCR (0.5 M final concentration of pooled libraries) and bead-based enrichment was done according to the SOLiD 4 Templated bead preparation guide containing library. Sequencing was carried out on a SOLiD 4 ABi sequencer platform according to manufacturer’s instructions to generate single fragment 50 bp colour space reads.
Using the BioScope version 1.3 Whole Transcriptome Pipeline (LifeTechnologies), reads from each SOLiD 4 libraries were initially filtered against sequencing library adaptors and other sequencing artefacts. Reads were then mapped independently to the entire unmasked and masked versions of the annotated genome assembly of T. reesei, and to defined gene transcript sequences (JGI T. reesei assembly version 2, annotation filtered models version 2.0). For mapping against the genome assemblies, it was also possible to align reads against a library of exon junctions derived from the exon coordinates detailed in the genome annotation. This allowed reads spanning exon boundaries to be included in alignment results and be recorded in BAM format. Subsequently, the reads corresponding to rRNA encoding gene regions were removed from the BAM records before further downstream analysis. The resulting BAM file of mapped reads from each sample against the unmasked and masked genomes were processed with HTseq-count  to generate read counts per gene from uniquely aligned reads. These counts were determined for both the sense strand only and to both strands (unstranded). Summary metrics for these results are shown in Additional file 5.
When comparing the read alignment metrics between the unmasked and masked genome assemblies a negligible difference of less than 1% is seen for total mapped reads, uniquely mapped reads and for reads mapped within annotated genes. This suggests that the masked regions of the genome do not correspond to transcribed genomic regions. The percentage of reads that were uniquely mapped to the unmasked assembly, as a proportion of total mapped reads, was 63.60% for all 48 h glucose replicates, 69.64% for all 24 h straw replicates and 52.95% for the 5 h glucose duplicates (Additional file 5).
For transcript sequence read mapping, less than half the total number of mapped reads compared to the mapping against the annotated genome was observed for all conditions. This can be explained by the difference in reference sequence space used in mapping. Transcript sequences were derived from the annotated exon coordinates within the genome sequence; therefore it would not be possible to map reads corresponding to transcribed regions outside of these defined coordinates. These unmapped reads could be attributed to incomplete annotation of the genome and/or to reads that are not mRNA coding. This is supported by a similar number of uniquely mapped reads to the transcripts within gene coordinates of the genome mapping when compared to the transcripts. The percentage of reads mapped uniquely to transcripts sequences, as a proportion of total transcript mapped reads, was 38.49%, 54.93% and 33.83% for the conditions 48 h glucose, 24 h straw and 5 h glucose respectively (Additional file 5).
For gene expression analysis the mapped read counts per gene calculated against the unmasked genome. Counts were determined for both sense strand only and to both strands (unstranded), as described previously. Antisense read counts per gene were calculated by subtracting the sense counts from the other. Read counts were then normalized to RPKM (Reads Per Kilobase per Million mapped reads) expression values for each gene  and visualised with the Integrative Genome Viewer (IGV 1.5) programme .
Differential expression values were determined using DEGseq  using sense read counts per gene for each experimental condition. DEGseq implements three independent statistical tests (Fisher’s Exact Test , Likelihood Ratio Test  and an MA-plot-based method with the Random Sampling method ).
Availability of supporting data
The raw and processed RNA-sequencing data sets supporting the results of this article are available in the NCBI’s Gene Expression Omnibus  repository under GEO accession number GSE44648 [http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE44648].
Carbohydrate Active enzyme
Cellulose Binding Module
Conserved fungal specific extracellular membrane-spanning
G-Protein Coupled Receptor
Major Facilitator Superfamily
Natural Antisense Transcript
Reads Per Kilobase of exon model per Million mapped reads
This work was supported by grants from the BBSRC CASE studentship, Roal Oy (Finland) and LACE/BSBEC. The authors would also like to thank Christian Kubicek and his team for the annotation of the Trichoderma reesei genome.
- Banerjee G, Scott-Craig JS, Walton JD: Improving enzymes for biomass conversion: a basic research perspective. Bioen Res. 2010, 3: 82-92. 10.1007/s12155-009-9067-5.View ArticleGoogle Scholar
- Fukuda H, Kondo A, Tamalampudi S: Bioenergy: Sustainable fuels from biomass by yeast and fungal whole-cell biocatalysts. Biochem Eng J. 2009, 44: 2-12. 10.1016/j.bej.2008.11.016.View ArticleGoogle Scholar
- Delmas S, Pullan S, Gaddipati S, Kokolski M, Malla S, Blythe MJ, Ibbett R, Campbell M, Liddell S, Aboobaker A, Tucker GA, Archer DB: Uncovering the genome-wide transcriptional responses of the filamentous fungus Aspergillus niger to lignocellulose using RNA sequencing. PLoS Genet. 2012, 8: e1002875-10.1371/journal.pgen.1002875.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhu JY, Pan X, Zalesny RS: Pre-treatment of woody biomass for biofuel production: energy efficiency, technologies and recalcitrance. Appl Microbiol Biotechnol. 2010, 87: 847-857. 10.1007/s00253-010-2654-8.View ArticlePubMedGoogle Scholar
- Dodd D, Cann IKO: Enzymatic deconstruction of xylan for biofuel production. GCB Bioen. 2009, 1: 2-17. 10.1111/j.1757-1707.2009.01004.x.View ArticleGoogle Scholar
- Dashtban M, Schraft H, Qin W: Fungal bioconversion of lignocellulosic residues; opportunities and perspectives. Int J Biol Sci. 2009, 5: 578-595.PubMed CentralView ArticlePubMedGoogle Scholar
- Alvira P, Tomas-Pejo E, Ballesteros M, Negro MJ: Pretreatment technologies for an efficient bioethanol production process based on enzymatic hydrolysis: a review. Bioresource Technol. 2010, 101: 4851-4861. 10.1016/j.biortech.2009.11.093.View ArticleGoogle Scholar
- Elkins JG, Raman B, Keller M: Engineered microbial systems for enhanced conversion of lignocellulosic biomass. Curr Opin Biotech. 2010, 21: 657-662. 10.1016/j.copbio.2010.05.008.View ArticlePubMedGoogle Scholar
- Schuster A, Schmoll M: Biology and biotechnology of Trichoderma. Appl Microbiol Biot. 2010, 87: 787-799. 10.1007/s00253-010-2632-1.View ArticleGoogle Scholar
- Ike M, Park J-Y, Tabuse M, Tokuyasu K: Cellulase production on glucose-based media by the UV-irradiated mutants of Trichoderma reesei. Appl Microbiol Biot. 2010, 87: 2059-2066. 10.1007/s00253-010-2683-3.View ArticleGoogle Scholar
- Kubicek CP, Mikus M, Schuster A, Schmoll M, Seiboth B: Metabolic engineering strategies for the improvement of cellulase production by Hypocrea jecorina. Biotechnol Biofuels. 2009, 2: 6834-6848.View ArticleGoogle Scholar
- Portnoy T, Margeot A, Seidl-Seiboth V, Le Crom S, Chaabane FB, Linke R, Seiboth B, Kubicek CP: Differential regulation of the cellulase transcription factors XYR1, ACE2 and ACE1 in Trichoderma reesei strains producing high and low levels of cellulase. Eukaryot Cell. 2011, 10: 262-271. 10.1128/EC.00208-10.PubMed CentralView ArticlePubMedGoogle Scholar
- Ayrinhac C, Margeot A, Ferreira NL, Chaabane FB, Monot F, Ravot G, Sonet J-M, Fourage L: Improved saccharification of wheat straw for biofuel production using an engineered secretome of Trichoderma reesei. Org Process Res Dev. 2011, 15: 275-278. 10.1021/op100218a.View ArticleGoogle Scholar
- Jeoh T, Michener W, Himmel ME, Decker SR, Adney WS: Implications of cellobiohydrolase glycosylation for use in biomass conversion. Biotechnol Biofuels. 2008, 1: 1186-1198.View ArticleGoogle Scholar
- Liu T, Wang T, Li X, Liu X: Improved heterologous gene expression in Trichoderma reesei by cellobiohydrolase I gene (cbh1) promoter optimisation. ABBS. 2008, 40: 158-165.PubMedGoogle Scholar
- Mathew GM, Sukumaran RK, Singhania RR, Pandey A: Progress in research on fungal cellulases for lignocelluloses degradation. J Sci Ind Res India. 2008, 67: 898-907.Google Scholar
- Rahman Z, Shida Y, Furukawa T, Suzuki Y, Okada H, Ogasawara W, Morikawa Y: Application of Trichoderma reesei cellulase and xylanase promoters through homologous recombination for enhanced production of extracellular β-glucosidase I. Biosci Biotech Bioch. 2009, 73: 1083-1089. 10.1271/bbb.80852.View ArticleGoogle Scholar
- Häkkinen M, Arvas M, Oja M, Aro N, Penttilä M, Saloheimo M, Pakula TM: Re-annotation of the CAZy genes of Trichoderma reesei and transcription in the presence of lignocellulosic substrates. Microb Cell Fact. 2012, 11: 134-160. 10.1186/1475-2859-11-134.PubMed CentralView ArticlePubMedGoogle Scholar
- Klein-Marcuschamer D, Oleskowicz-Popiel P, Simmons BA, Blanch HW: The challenge of enzyme cost in the production of lignocellulosic biofuels. Biotechnol Bioeng. 2012, 109: 1083-1087. 10.1002/bit.24370.View ArticlePubMedGoogle Scholar
- Martinez D, Berka RM, Henrissat B, Saloheimo M, Arvas M, Baker SE, Chapman J, Chertkov O, Coutinho PM, Cullen D, Danchin EGJ, Grigoriev IV, Harris P, Jackson M, Kubicek C, Han CS, Ho I, Larrondo LF, Lopez de Leon A, Magnuson JK, Merino S, Misra M, Nelson B, Putnam N, Robbertse B, Salamov AA, Schmoll M, Terry A, Thayer N, Westerholm-Parvinen A: Genome sequencing and analysis of the biomass-degrading fungus Trichoderma reesei (syn. Hypocrea jecorina). Nat Biotechnol. 2008, 26: 553-560. 10.1038/nbt1403.View ArticlePubMedGoogle Scholar
- Bloom JS, Khan Z, Kruglyak L, Singh M, Caudy AA: Measuring differential gene expression by short read sequencing: quantitative comparison to 2-channel expression microarrays. BMC Genomics. 2009, 10: 221-230. 10.1186/1471-2164-10-221.PubMed CentralView ArticlePubMedGoogle Scholar
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-seq. Nat Methods. 2008, 5: 621-628. 10.1038/nmeth.1226.View ArticlePubMedGoogle Scholar
- Wang L, Feng Z, Wang X, Wang X, Zhang X: DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics. 2010, 26: 136-138. 10.1093/bioinformatics/btp612.View ArticlePubMedGoogle Scholar
- Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V, Henrissat B: The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Res. 2009, 37: D233-D238. 10.1093/nar/gkn663.PubMed CentralView ArticlePubMedGoogle Scholar
- Pel HJ, De Winde JH, Archer DB, Dyer PS, Hofmann G, Schaap PJ, Turner G, De Vries RP, Albang R, Albermann K, Andersen MR, Bendtsen JD, Benen JAE, Van den Berg M, Breestraat S, Caddick MX, Contreras R, Cornell M, Coutinho PM, Danchin EGJ, Debets AJM, Dekker P, Van Dijck PWM, Van Dijk A, Dijkhuizen L, Driessen AJM, D’Enfert C, Geysens S, Goosen C, Groot GSP: Genome sequencing and analysis of the versatile cell factory Aspergillus niger CBS 513.88. Nat Biotechnol. 2007, 25: 221-231. 10.1038/nbt1282.View ArticlePubMedGoogle Scholar
- van den Brink J, de Vries RP: Fungal enzyme sets for plant polysaccharide degradation. Appl Microbiol Biotechnol. 2011, 91: 1477-1492. 10.1007/s00253-011-3473-2.PubMed CentralView ArticlePubMedGoogle Scholar
- Quinlan RJ, Sweeney MD, Leggio LL, Otten H, Poulsen J-CN, Johansen KS, Krogh KBRM, Jørgensen CI, Tovborg M, Anthonsen A, Tryfona T, Walter CP, Dupree P, Xu F, Davies GJ, Walton PH: Insights into the oxidative degradation of cellulose by a copper metalloenzyme that exploits biomass components. P Natl Acad Sci USA. 2011, 108: 15079-15084. 10.1073/pnas.1105776108.View ArticleGoogle Scholar
- Harris PV, Welner D, McFarland KC, Re E, Poulson J-CN, Brown K, Salbo R, Ding H, Vlasenko E, Merino S, Xu F, Cherry J, Larsen S, Leggio LL: Stimulation of lignocellulosic biomass hydrolysis by proteins of glycoside hydrolase family 61: structure and function of a large, enigmatic family. Biochemistry. 2010, 49: 3305-3316. 10.1021/bi100009p.View ArticlePubMedGoogle Scholar
- Li X-L, Špániková S, de Vries RP, Biely P: Identification of genes encoding microbial glucuronoyl esterases. FEBS Lett. 2007, 581: 4029-4035. 10.1016/j.febslet.2007.07.041.View ArticlePubMedGoogle Scholar
- Kester HC, Benen JAE, Visser J, Warren ME, Orlando R, Bergmann C, Magaud D, Anker D, Doutheau A: Tandem mass spectrometric analysis of Aspergillus niger pectin methylesterase: mode of action on fully methyl-esterified oligogalacturonates. Biochem J. 2000, 346: 469-474. 10.1042/0264-6021:3460469.PubMed CentralView ArticlePubMedGoogle Scholar
- Mølgaard A, Kauppinen S, Larsen S: Rhamnogalacturonan acetylesterase elucidates the structure and function of a new family of hydrolases. Structure. 2000, 8: 373-383. 10.1016/S0969-2126(00)00118-0.View ArticlePubMedGoogle Scholar
- De Vries RP, Parenicova L, Hinz SWA, Kester HCM, Beldman G, Benen JAE, Visser J: The beta-1,4-endogalactanase A gene from Aspergillus niger is specifically induced on arabinose and galacturonic acid and plays an important role in the degradation of pectic hairy regions. Eur J Biochem. 2002, 269: 4985-4993. 10.1046/j.1432-1033.2002.03199.x.View ArticlePubMedGoogle Scholar
- Fujii T, Murakami K, Sawayama S: Cellulase hyperproducing mutants derived from the fungus Trichoderma reesei QM9414 produced large amounts of cellulases at the enzymatic and transcriptional levels. Biosci Biotechnol Biochem. 2010, 74: 419-422. 10.1271/bbb.90655.View ArticlePubMedGoogle Scholar
- Ganzlin M, Rinas U: In-depth analysis of the Aspergillus niger glucoamylase (glaA) promoter performance using high-throughput screening and controlled bioreactor cultivation techniques. J Biotechnol. 2008, 135: 266-271. 10.1016/j.jbiotec.2008.04.005.View ArticlePubMedGoogle Scholar
- Zhou Q, Zu J, Kou Y, Lv X, Zhang X, Zhao G, Zhang W, Chen G, Liu W: Differential involvement of β-glucosidases from Hypocrea jecorina in rapid induction of cellulase genes by cellulose and cellobiose. Eukaryot Cell. 2012, 11: 1371-1381. 10.1128/EC.00170-12.PubMed CentralView ArticlePubMedGoogle Scholar
- Wang Z, Gerstein M, Snyder M: RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009, 10: 57-63. 10.1038/nrg2484.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhang J, Siika-aho M, Tenkanen M, Viikari L: The role of acetyl xylan esterase in the solubilisation of xylan and enzymatic hydrolysis of wheat straw and giant reed. Biotechnol Biofuels. 2012, 4: 60-69.View ArticleGoogle Scholar
- Saloheimo M, Paloheimo M, Hakola S, Pere J, Swanson B, Nyyssönen E, Bhatia A, Ward M, Penttilä M: Swollenin, a Trichoderma reesei protein with sequence similarity to the plant expansions, exhibits disruption activity on cellulosic materials. Eur J Biochem. 2002, 269: 4202-4211. 10.1046/j.1432-1033.2002.03095.x.View ArticlePubMedGoogle Scholar
- Foreman PK, Brown D, Dankmeyer L, Dean R, Diener S, Dunn-Coleman NS, Goedegebuur F, Houfek TD, England GJ, Kelley AS, Meerman HJ, Mitchell T, Mitchinson C, Olivares HA, Teunissen PJM, Yao J, Ward M: Transcriptional regulation of biomass-degrading enzymes in the filamentous fungus Trichoderma reesei. J Biol Chem. 2003, 278: 31988-31997. 10.1074/jbc.M304750200.View ArticlePubMedGoogle Scholar
- Mao C, Obeid LM: Ceramidases: regulators of cellular responses mediated by ceramide, sphingosine and sphingosine-1-phosphate. Biochim Biophys Acta. 2008, 1781: 424-434. 10.1016/j.bbalip.2008.06.002.PubMed CentralView ArticlePubMedGoogle Scholar
- Schmoll M, Zeilinger S, Mach RL, Kubicek CP: Cloning of genes expressed early during cellulase induction in Hypocrea jecorina by a rapid substraction hybridisation approach. Fungal Genet Biol. 2004, 41: 877-887. 10.1016/j.fgb.2004.06.002.View ArticlePubMedGoogle Scholar
- Linder MB, Szilvay GR, Nakari-Setälä T, Penttilä M: Hydrophobins: the protein-amphiphiles of filamentous fungi. FEMS Microbiol Rev. 2005, 29: 877-896. 10.1016/j.femsre.2005.01.004.View ArticlePubMedGoogle Scholar
- Ohtaki S, Maeda H, Takahashi T, Yamagata Y, Hasegawa F, Gomi K, Nakajima T, Abe K: Novel hydrophobic surface binding protein, HsbA, produced by Aspergillus oryzae. Appl Environ Microb. 2006, 72: 2407-2413. 10.1128/AEM.72.4.2407-2413.2006.View ArticleGoogle Scholar
- Rey M, Ohno S, Pintor-Toro JA, Llobell A, Benitez T: Unexpected homology between inducible cell wall protein QID74 of filamentous fungi and BR3 salivary protein of the insect Chironomus. Proc Natl Acad Sci U S A. 1998, 95: 6212-6216. 10.1073/pnas.95.11.6212.PubMed CentralView ArticlePubMedGoogle Scholar
- Dean RA, Talbot NJ, Ebbole DJ, Farman ML, Mitchell TK, Orbach MJ, Thon M, Kulkarni R, Xu J-R, Pan H, Read ND, Yong-Hwan L, Carbone I, Brown D, Oh YY, Donofrio N, Jeong JS, Soanes DM, Djonovic S, Kolomiets E, Rehmeyer C, Li W, Harding M, Kim S, Lebrun M-H, Bohnert H, Coughlan S, Butler J, Calvo S, Ma L-J: The genome sequence of the rice blast fungus Magnaporthe oryzae. Nature. 2005, 434: 980-986. 10.1038/nature03449.View ArticlePubMedGoogle Scholar
- Wilson RA, Talbot NJ: Under pressure: investigating the biology of plant infection by Magnaporthe oryzae. Nat Rev. 2009, 7: 185-195. 10.1038/nrmicro2032.Google Scholar
- Pao SS, Paulsen IT, Saier MH: Major facilitator Superfamily. Microbiol Mol Biol R. 1998, 62: 1-34.Google Scholar
- Lingner U, Münch S, Deising HB, Sauer N: Hexose transporters of a hemibiotrophic plant pathogen: Functional variations and regulatory differences at different stages of infection. J Biol Chem. 2011, 286: 20913-20922. 10.1074/jbc.M110.213678.PubMed CentralView ArticlePubMedGoogle Scholar
- Mach-Aigner A, Gudynaite-Savitch L, Mach RL: L-arabitol is the actual inducer of xylanase expression in Hypocrea jecorina (Trichoderma reesei). Appl Environ Microbiol. 2011, 77: 5988-5994. 10.1128/AEM.05427-11.PubMed CentralView ArticlePubMedGoogle Scholar
- Seiboth B, Pakdaman BS, Hartl L, Kubicek CP: Lactose metabolism in filamentous fungi: how to deal with an unknown substrate. Fungal Biol Rev. 2007, 21: 42-48. 10.1016/j.fbr.2007.02.006.View ArticleGoogle Scholar
- Faghihi MA, Wahlestedt C: Regulatory roles of natural antisense transcripts. Nature Rev. 2009, 10: 637-643. 10.1038/nrm2738.View ArticleGoogle Scholar
- Dang Y, Yang Q, Xue Z, Liu Y: RNA interference in fungi: pathways, functions and applications. Eukaryot Cell. 2011, 10: 1148-1155. 10.1128/EC.05109-11.PubMed CentralView ArticlePubMedGoogle Scholar
- Druzhinina IS, Schmoll M, Seiboth B, Kubicek CP: Global carbon utilisation profiles of wild-type, mutant, and transformant strains of Hypocrea jecorina. Appl Environ Microb. 2006, 72: 2126-2133. 10.1128/AEM.72.3.2126-2133.2006.View ArticleGoogle Scholar
- Nakari-Setälä T, Paloheimo M, Kallio J, Vehmaanperä J, Penttilä M, Saloheimo M: Genetic modification of carbon catabolite repression in Trichoderma reesei for improved protein production. Appl Environ Microb. 2009, 75: 4853-4860. 10.1128/AEM.00282-09.View ArticleGoogle Scholar
- Edgar R, Domrachev M, Lash AE: Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002, 30: 207-210. 10.1093/nar/30.1.207.PubMed CentralView ArticlePubMedGoogle Scholar