- Research article
- Open Access
Genetic complexity of miscanthus cell wall composition and biomass quality for biofuels
BMC Genomics volume 18, Article number: 406 (2017)
Miscanthus sinensis is a high yielding perennial grass species with great potential as a bioenergy feedstock. One of the challenges that currently impedes commercial cellulosic biofuel production is the technical difficulty to efficiently convert lignocellulosic biomass into biofuel. The development of feedstocks with better biomass quality will improve conversion efficiency and the sustainability of the value-chain. Progress in the genetic improvement of biomass quality may be substantially expedited by the development of genetic markers associated to quality traits, which can be used in a marker-assisted selection program.
To this end, a mapping population was developed by crossing two parents of contrasting cell wall composition. The performance of 182 F1 offspring individuals along with the parents was evaluated in a field trial with a randomized block design with three replicates. Plants were phenotyped for cell wall composition and conversion efficiency characters in the second and third growth season after establishment. A new SNP-based genetic map for M. sinensis was built using a genotyping-by-sequencing (GBS) approach, which resulted in 464 short-sequence uniparental markers that formed 16 linkage groups in the male map and 17 linkage groups in the female map. A total of 86 QTLs for a variety of biomass quality characteristics were identified, 20 of which were detected in both growth seasons. Twenty QTLs were directly associated to different conversion efficiency characters. Marker sequences were aligned to the sorghum reference genome to facilitate cross-species comparisons. Analyses revealed that for some traits previously identified QTLs in sorghum occurred in homologous regions on the same chromosome.
In this work we report for the first time the genetic mapping of cell wall composition and bioconversion traits in the bioenergy crop miscanthus. These results are a first step towards the development of marker-assisted selection programs in miscanthus to improve biomass quality and facilitate its use as feedstock for biofuel production.
Miscanthus is a perennial C4 grass capable of producing high biomass yields in temperate climates . It is a crop characterized by high resource-use efficiency owing to its early spring emergence and long vegetative phase, as well as its rhizomatous growing habit, which allows the recycling of nutrients between growing seasons [2,3,4]. These characteristics make miscanthus an interesting lignocellulose feedstock for the production of cellulosic biofuels .
So far, M. × giganteus is the only species of the genus Miscanthus that is commercially exploited for biomass production [6, 7]. M. × giganteus (2n = 3x = 57) is derived from a natural cross between the diploid M. sinensis (2n = 2 × = 38) and the Japanese allotetraploid species M. ogiformis (2n = 4 × = 76), which is often erroneously referred to as tetraploid M. sacchariflorus [8, 9]. Its success is mainly due to its high productivity. In a quantitative review of biomass yields of M. × giganteus across 100 diverse field trial locations, the average dry matter yield was 22 t ha−1 yr−1 . However, the genetic variation in this triploid clone is extremely limited due to its sterility, which poses risks upon large-scale cultivation and significantly limits further progress through breeding [6, 9, 11,12,13]. In contrast, great and largely untapped genetic diversity is harboured within and among natural populations of M. sacchariflorus and M. sinensis, which have adapted to a wide range of geographical conditions [6, 13].
One of the key challenges that currently impedes the wide-scale commercialization of cellulosic ethanol production resides is our inability to efficiently deconstruct plant lignocellulose into fermentable sugars. The development of feedstocks with improved biomass quality is envisioned to contribute to the economic feasibility of cellulosic biofuel technologies [5, 14,15,16]. Lignocellulosic feedstocks are composed of cellulose, hemicellulosic polysaccharides and lignin . High contents of cellulose and hemicellulosic polysaccharides are desirable, as these constituents can be hydrolyzed and subsequently fermented to produce biofuels. Lignin, on the other hand, cross-links to hemicellulosic polysaccharides and forms a highly impermeable and complex matrix that shields cell wall polysaccharides from degradation, and impedes the extraction of fermentable sugars from the cell wall [18,19,20,21]. Genotypic variation in cell wall composition has been reported in M. sinensis and M. sacchariflorus, providing ample scope for improving biomass quality in these species through breeding [22, 23].
Compared to annual crops, progress in breeding of perennials, such as miscanthus, is slowed-down by the need to evaluate genotype performance in multi-year field trials. Miscanthus typically matures in 3 years and selection at a premature stage, specifically during its first year of establishment, has proven unreliable . Therefore, the application of marker-assisted selection could substantially increase the efficiency of breeding in miscanthus, as selections could be done at the seedling stage using marker data. Genetic maps form the basis for finding marker-trait associations, but their construction in miscanthus is complicated by the large genome size and the high levels of heterozygosity inherent to its obligate outcrossing nature [11, 13]. Nonetheless, a few genetic maps of miscanthus have been published to date [25,26,27,28,29].
So far three of these genetic maps have been used for the identification of quantitative trait loci (QTL) for different traits of interest, but none of these studies focused on biomass quality for biofuel production. The randomized amplified polymorphic DNA (RAPD) marker-based map by Atienza et al. has been used for identification of QTL associated with agronomic performance and combustion quality [30,31,32,33,34]. The simple-sequence repeat (SSR) marker-based map by Swaminathan et al. was used for the identification of QTL associated with agronomic performance . This map was recently extended with simple nucleotide polymorphism (SNP) markers, obtained through restriction site-associated DNA (RAD) sequencing, and was used for the identification of QTL underlying the zebra stripe phenotype that is desirable for the use of miscanthus as an ornamental grass . Currently, no marker-trait associations have been reported in miscanthus for traits relating to cell wall composition or biomass quality for the production of cellulosic biofuels.
Here we report the construction of a new genetic map for M. sinensis using SNP markers obtained through a genotyping-by-sequencing (GBS) approach. The mapping population used in this study segregates for biomass quality traits, as it was derived from a cross between two parental lines with contrasting cell wall composition. The objectives of this study were (1) to detect QTL for biomass composition and quality in miscanthus regarding its use as a lignocellulose feedstock for biofuel production and (2) to align marker sequences to the sorghum reference genome to facilitate cross-species comparisons.
A mapping population of 182 F1 progeny was generated by crossing two M. sinensis genotypes with contrasting cell wall composition. The male parent, hereafter referred to as P1, was a genotype (H0227) originating from the miscanthus collection of Wageningen University and Research (WUR). The female parent, hereafter referred to as P2, was derived from a cross between two genotypes from the BIOMIS mapping population (H0012 × H0163) described by Atienza et al., . Both H0012 and H0163 (grandparents) were also included in the field trial and are hereafter referred to as G1-P2 (H0012) and G2-P2 (H0163), respectively. A random sample of seeds was sown in August 2011 in trays in a heated greenhouse; seedlings were subsequently potted and raised to vigorous plants by the end of the winter of 2011/2012. These were split by the end of May 2012 into four roughly equally sized clonal pieces (ramets). Three ramets of each genotype were immediately used to establish a field trial in May 2012; one spare ramet per genotype was potted to replace possible fall-outs. The trial was located at an experimental site of WUR at Wageningen (The Netherlands) and had a randomized block design with the individual ramets used as experimental units. The ramets were planted in rows with a distance between and within rows of 75 cm. The trial was surrounded by two rows of medium-sized M. sinensis plants in order to minimize possible border effects. In the second and third growth season, heading date was scored per plant. At the end of the second and third growth season (December 2013 and 2014), all plants were harvested separately, dried to constant weight using ventilated air (dm% ~ 92%) and weighed. A random sample of each plant was subsequently taken, from which leaves and inflorescences were separated from the stem material. The stem fraction of each plant was then chopped into ~2 cm chips, and air-dried at 60 °C for 72 h in a forced-air oven. Stem samples (n = 186 genotypes × 3 replicates × 2 years = 1104, minus fall-outs) were ground using a hammer mill with a 1-mm screen and used for biomass quality analyses.
Biomass quality analysis
Neutral detergent fiber (NDF) and acid detergent fiber (ADF) contents of stem dry matter were determined by detergent fiber analysis using an ANKOM 2000 Fiber Analyzer (ANKOM Technology Corporation, Fairpoint, NY). Acid detergent lignin (ADL) contents were determined after 3-h hydrolysis of the ADF residue in 72% H2SO4 with continuous shaking. All analyses were performed in triplicate and fiber fractions were expressed in gram per kg dry matter. Fiber fractions were used to calculate the concentrations (in g/kg dm) of cell wall (NDF), cellulose (CEL, equals ADF – ADL), hemicellulosic polysaccharides (HEM, equals NDF – ADF) and acid detergent lignin (ADL) on a dry matter basis. The residual NDF material of the replicated fiber analyses was pooled per sample and used for the determination of neutral sugar and Klason lignin (KL) content as described previously . Briefly, 30 mg of NDF material was hydrolysed for 1 h in 0.3 ml 72% H2SO4 at 30 °C, after which the acid concentration was diluted to 4% and samples were autoclaved for 60 min at 121 °C. Autoclaved samples were cooled and centrifuged, after which the supernatant was used for determination of glucose (GLU), xylose (XYL) and arabinose (ARA) contents using high performance anion exchange chromatography (HPAEC) on a Dionex system (Dionex, Sunnydale, CA) equipped with a CarboPac PA1 column and a pulsed amperometric detector. The pellet remaining after centrifugation was vacuum-filtered through a pre-weighed glass fibre filter (AP25, Fischer Scientific, Loughborough, UK). The residue was dried overnight at 103 °C and weighed for the determination of KL.
Separate analyses of ground stem samples were performed for the characterization of saccharification efficiency by two different methods. The first method was used for the high-throughput, small-scale quantification of the rate of glucose release during enzymatic hydrolysis of hot-water pretreated samples, as described previously . The release of glucose was expressed as the concentration in nmol of reducing sugars released per mg of biomass per hour of digestion; hereafter referred to as saccharification rate (SacR). The second method was aimed at quantifying the final yield of fermentable sugars using a highly controlled lab-scale alkaline pretreatment and enzymatic saccharification setup, as described by van der Weijde et al. . The released amounts of glucose and xylose are expressed either (1) as a weight percentage of the amount of glucose and xylose present in the untreated sample as determined by neutral sugar analysis (i.e., referred to as glucose conversion (GC) or xylose conversion (XC)) or (2) as a weight percentage of the amount of cellulose and hemicellulose present in the untreated sample as determined by fiber analysis (i.e., referred to as cellulose conversion (CC) and hemicellulose conversion (HC)).
To allow high-throughput analysis of all biomass quality traits we used near-infrared spectroscopy (NIRS) technology. Multivariate prediction models that combined near-infrared (NIR) spectral data and biochemical data were developed for all traits except for SacR. Near-infrared absorbance spectra of stem samples were obtained using a Foss DS2500 near-infrared spectrometer (Foss, Hillerød, Denmark) and processed by weighted multiplicative scatter correction and mathematical derivatization and smoothing treatments (2,6,4,1) using WinISI 4.9 statistical software (Foss, Hillerød, Denmark). Different prediction models were developed for different traits, depending on the number of samples that could be biochemically analyzed and on the availability of existing data for creating robust prediction models (containing a range of miscanthus samples from different experiments) (Table 1). All models contained at least 140 samples from the first growing season of the mapping population. The quality of the prediction models was validated using the squared Pearson coefficient of correlation (r 2) between predicted and biochemical data and by evaluating for these samples the standard error of cross-validation (SECV) for each of the traits (Table 1). Subsequently, the developed prediction models were used to determine biomass composition and conversion efficiency of all 1104 stem samples (minus fall-outs).
Genomic DNA from young leaf tissues was extracted following a CTAB based protocol . DNA concentration and quality were checked using a NanoDrop ND-1000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA) and standardized using a Qubit fluorometer (Thermo Fisher Scientific, Waltham, MA). DNA integrity was confirmed on 1% agarose gels. Libraries were prepared for GBS using the restriction endonuclease ApeKI (five-cutter) to digest the genomic DNA for complexity reduction. Each digested DNA sample was ligated to a set of uniquely barcoded sequencing adaptor pairs, following PCR amplification with adapter-specific primers, and amplicons between 300 and 500 bp were extracted from an agarose gel and sequenced in four single lanes of Illumina HiSeq2000 using a 100 bp paired-end protocol. DNA digestion, adapter ligation, library construction, and sequencing were carried out by the Beijing Genomics Institute (BGI), China.
The de-multiplexed sequence reads obtained from BGI were filtered by removing those reads that did not start with the 5’-CWCG-3’ site pattern, typically resulting from ApeKI digestion, or that contained undefined (‘N’) nucleotides. Reads were right-trimmed to a length of 82 nucleotides and clustered in order to count the number copies per unique read sequence. Note that this clustering was not only done for each sample individually, but also separately for the forward and reverse reads. Only unique reads that occurred at least four times were kept. Unique reads from all samples were jointly clustered using the RADSNP program (RADNPGTv1.1 package, BGI, China). Our initial approach to classify genotypes was to assign a genotypic score to the studied genotypes with a cluster size of at least five reads by applying a set of classification rules to separate clustered reads. The first classification rule was that if the genotype had a frequency of 0.8 or higher for the most abundant read in the cluster, this was considered to be present in homozygous condition. The second classification rule was applied when the two most abundant reads in a cluster if both had frequencies of at least 0.2. The genotype was then classified to be heterozygous. If for a particular cluster neither rule 2 nor 3 held true, no genotypic assignment was given. Unfortunately this approach did not result in acceptable data for map construction, because the average cluster size was too small to allow for a proper genotypic classification due to insufficient sequencing depth. Therefore we refrained from this approach and focused on segregation analyses for single reads. The number of reads for each selected sequence was in this case the basis for genotypic classification using a dominant way of scoring. Genotypes with one or more reads were considered to be either homozygous dominant or heterozygous for this short-sequence marker, whereas the ones showing no reads were supposed to be homozygous recessives. A missing value was assigned to genotype-marker combinations when both the number of reads for this marker over genotypes as well as the average number reads over all markers for a genotype was low. This was done to prevent misclassification of genotypes.
A genetic map was constructed following the two-way pseudo test-cross strategy , using the dominantly scored SNP markers. To this end, suitable markers were first filtered out of all available markers (49102) based on segregation ratio, with only uniparental single-dose markers, i.e., markers that segregated in a 1:1 ratio in the population, used for further analysis. A total of 1145 markers remained and were coded according to segregation type following the coding scheme for cross pollinated populations as used in JoinMap . Male simplex × female nulliplex markers were classified as lm × ll, while male nulliplex × female simplex markers were classified as nn × np. Markers were imported into JoinMap 4.1 (Kyazma, Wageningen, Netherlands) and after elimination of segregation distorted markers and markers that had high similarity (>0.99) to other markers, a total of 1003 markers were used for linkage analyses. These markers were separated into linkage groups using JoinMap grouping analysis with a maximum recombination threshold of 0.25 and a minimum independence logarithm of odds (LOD) score of 2. Markers resolved into 33 linkage groups, 16 linkage groups for the male map and 17 linkage groups for the female map. Marker order within each linkage group was then determined using Haldane’s regression mapping algorithm in JoinMap with a maximum recombination threshold of 0.40 and a minimum independence logarithm of odds (LOD) score of 1. This procedure built a map by adding loci one by one, starting from the most informative pair of loci. Each locus was added at its best position according to a goodness-of-fit measure or removed again until all loci are handled two times. The male map spanned 2139.7 cM and consisted of 242 markers with a median inter-marker spacing of 8.0 cM. The female map spanned 2479.5 cM and consisted of 322 markers with a median inter-marker spacing of 6.7 cM.
Statistical analysis and QTL mapping
General analyses of variance (ANOVA) were performed to determine the significance of genotype differences (p < 0.05) in the mapping population for cell wall composition and saccharification efficiency. Variance analyses were performed separately for both growing seasons, taking into account the randomized complete block design of the trial. Estimates of genotypic (σg 2) and residual (σe 2) variance were used to calculate broad sense heritability (h 2) estimates following h 2 = σg 2/(σg 2 + σe 2). To visualize associations amongst traits, a principal component analysis was performed on genotype means for all traits evaluated in both growth seasons. Origin centered, normalized scores for the first two principal components were plotted in a principal-component biplot. All statistical analyses were performed using Genstat for Windows, 18th edition software package (VSN International, Hemel Hempstead, UK).
Quantitative trait loci (QTL) analysis was performed with MapQTL 6.0 (Kyazma, Wageningen, Netherlands) using a maximum likelihood mixture model. An interval mapping approach was used with a step size of 1.0 cM. Significance of a QTL was called based on a LOD score higher than a genome-wide significance threshold based on 1000 permutations , which was determined to be 3.561 for the male and 3.655 for the female map. One-LOD and two-LOD support intervals were determined to show the uncertainty on the QTL position. The percentage of variance explained (PVE) by the QTL was calculated by 100 × ([residual variance with no QTL fitted – residual variance with QTL fitted]/population variance) .
Results and discussion
Genotypic variation for biomass quality traits
Significant heritable variation was observed in the mapping population for all stem biomass quality traits determined after the second and third growth seasons, as shown by the population statistics and parental and grand-parental values summarized in Table 2. Cell wall material (NDF) comprised the largest fraction of biomass and ranged from ~815 to 911 g/kg dm in the second and from ~877 to 918 g/kg dm in the third growth season. The main cell wall components were CEL and HEM, with variation in the population in the second growth season ranging from ~446–527 and ~304–365 g/kg dm, respectively. In the third growth season plants had on average higher CEL and lower HEM contents compared to the second growth season and ranged from ~474–532 and ~282–345 g/kg dm, respectively. Particularly large variation in cell wall glucose content (GLU) was also found, ranging from ~35 to 50% of the cell wall fraction in the second and from ~21 to 39% in the third growth season.
Variation in ADL ranged from ~42 to 82 g/kg dm in the second and from ~75 to 110 g/kg dm in the third growth season. ADL/cw and KL ranged from ~5–9% and ~12–15% of the cell wall in the second and from ~8–12% and 12–16% in the third growth season, respectively. Variation in lignin content is of particular interest for improving biomass quality of miscanthus, and variation in both ADL and KL was extensive. KL values are higher than ADL values, as during the quantification of ADL, detergents are used that likely dissolve a fraction of the total lignin. However, KL values might overestimate lignin as it is more likely to be contaminated with protein [43, 44]. Both methods provide different but valuable insights into biomass quality .
The mapping population also harbored extensive variation in conversion efficiency. Particularly for SacR, GC and XC, considerable variation was observed among genotypes. Variation in SacR ranged from ~11 to 24 nmol reducing sugars per mg biomass per hour. Variation in GC ranged from ~42 to 55% in the second and from ~33 to 46% in the third growth season. These ranges are comparable with the ranges observed in other highly diverse sets of miscanthus genotypes [23, 36, 45,46,47], indicating that variation in conversion efficiency in this population created by crossing two highly compositionally distinct parents is substantial. Conversion efficiency values in the third growth season were substantially lower than those found in the second, which is presumably associated with the increase in lignin content observed with increasing plant age (Table 2).
Genotype performance for most of the evaluated traits was highly reproducible across replicated blocks. As a result, for most traits a high heritability (h 2 > 0.5) was observed, with the highest heritability for quality traits across years observed for lignin (ADL/cw) (h 2 = 0.62–0.72). For all traits, heritability estimates in the third growth season were reduced compared to those in the second. The lower heritabilities in the third growth season can be caused by the lower range of observed genetic variation, environmental effects, errors in biochemical analyses, and/or lower NIRS prediction accuracy. The heritability estimates for compositional and conversion efficiency characters are consistent with values observed by others in maize and sorghum mapping studies [48,49,50].
Frequency distributions of all traits evaluated in the third growth season were reasonably uniform and showed continuous unimodal histograms (Fig. 1). For all traits, with the exception of CEL, parental and grand-parental performance were contrasting and for most traits population variation extended beyond parental and grand-parental values in both directions. For KL and GLU, the performance of P1 was very near the low-end population extreme; hence genetic variation leading to concentrations lower than those observed for P1 for these traits is not expected in this population.
Principal components analysis revealed that approximately 58% of the observed genotypic variation in biomass quality resolved into two composite variables (Fig. 2). The first principal component summarized 32% of the observed genotypic variation and predominantly discriminated genotypes based on differences in the content of cellulosic and hemicellulosic polysaccharides. The second component, which summarized 26% of observed variation, discriminated genotypes mostly based on differences in lignin and conversion efficiency characters. As the angle between vectors is representative of correlations between traits, from this plot it can be deduced that the different conversion efficiency characters are positively associated to each other and negatively associated with lignin. It is also evident that SacR was more strongly correlated with the content of cellulosic polysaccharides than was cellulose conversion. These trait associations are consistent with other reports on miscanthus biomass composition and quality for biofuel production [36, 45, 47].
Synteny with Sorghum bicolor and coding of linkage groups
The DNA sequences of the mapped markers were aligned to the Sorghum bicolor (L.) Moench genome (version ‘sbi1’) from the Plant Genome Database using NCBI BLASTN . Only hits with an identity score greater than 85% and an alignment length of at least 50 nucleotides were retained and used to label the miscanthus linkage groups according to which sorghum chromosome the markers in each linkage group mapped (Additional file 1: Figure S1). Linkage groups of the female map were designated by the corresponding Sorghum bicolor chromosome numbers, followed by an ‘a’ or ‘b’, as well as linkage groups of the male map, but followed by a ‘c’ or ‘d’. These suffixes were randomly appointed to the two homologous miscanthus linkage groups of each map that are syntenic to each sorghum chromosome, as the genome of M. sinensis consists of two sub-genomes with a high level of synteny to the sorghum genome [26, 27]. In both, the male and the female map, there was one linkage group that aligned with two sorghum chromosomes; these groups were designated ‘4b7b’ and ‘4d7d’. The occurrence of this phenomenon in miscanthus has been reported previously and is ascribed to an ancient chromosome fusion or translocation event between two miscanthus chromosomes syntenic to sorghum chromosomes 4 and 7. This event explains why miscanthus has a basic chromosome number of 19 and not 20 (twice the basic chromosome number of sorghum) [27, 28].
QTL mapping of miscanthus biomass quality traits
QTL analysis was performed to investigate associations between genomic regions and stem composition and conversion traits. In a combined QTL analysis carried out on the male and female map simultaneously a total of 86 QTLs were found to be associated with cell wall composition and conversion efficiency characters with LOD scores ranging from 3.58 to 9.02 (Table 3). Heterozygosity was uncovered in 58 loci of the male parent and 28 loci of the female parent, but these may be partly the same loci if the male and the female map would be combined. Twenty out of 86 QTLs were found in both growth seasons. In the combined analysis, significant QTLs were located across 21 out of the total of 33 male and female linkage groups (Fig. 3). For several traits, QTLs were observed to be present in roughly the same genomic position in presumably homeologous linkage groups in both parental maps (e.g., QTLs for ARA on groups 2c and 2d).
Out of the 86 QTLs that were observed, 9 were associated with stem cell wall, 5 with cellulose, 6 with hemicellulosic polysaccharides, 22 with lignin and 23 with neutral sugar contents (Table 3).. The QTLs on the male map for ARA were numerous; this high number was likely due to factors such as the inherent imprecision of the QTL mapping approach, marker multicollinearity and the height of the LOD score thresholds used. The large number of QTLs found to be associated with lignin content could be partly explained by the fact that three different lignin characters (ADL, ADL/cw and KL) were evaluated. Notably, QTLs associated with KL did not co-localize with QTLs for ADL or ADL/cw (Fig. 3). Two major-effect QTLs were identified for CEL/cw (CEL/cw 3 and CEL/cw 4) in linkage groups of the male parent, each respectively accounting for 29% and ~17% of the observed genotypic variation during the third growth season (Table 3). These may be interesting targets for further study.
A total of 20 QTLs were found for conversion efficiency characters with LOD-scores ranging from 3.75 to 7.00, among which 7 for SacR, 4 for cellulose conversion, 2 for hemicellulose conversion and 7 for glucose conversion (Table 3).. QTLs for SacR and GC co-localized on linkage group 3c, QTLs for SacR and HC co-localized on linkage groups 6b and 6c (potentially homologous groups) and QTLs for CC and GC co-localized in linkage group 8c. However, many QTLs for the different conversion characters did not co-localize and seem to be independently controlled characters (Fig. 3). On linkage groups 1b, 1c, 3c, 4d7d, 6c, 6d and 8c QTLs for conversion efficiency characters co-localized with QTLs for lignin characters. Particularly strong evidence for co-localization of QTLs for these traits was found on linkage group 1b and 8c, where QTLs for lignin (KL, ADL and ADL/cw) and conversion characters (CC and GC) co-localized in both growth seasons. On linkage groups 4b7b, 4d7d, 6b, 6c and 8c QTLs for conversion efficiency characters co-localized with QTLs for accumulation of hemicellulosic polysaccharides. A big clustering of co-localized QTLs were observed on linkage groups 6b and 6c, possibly indicating the presence of a master-regulator affecting cell wall biosynthesis. QTLs for the same traits co-localized in both clusters, suggesting that 6b and 6c are homologous linkage groups. Several QTLs for conversion efficiency characters did not co-localize with any of the QTLs for compositional characters evaluated in this study (e.g., SacR on 3b and 10b), suggesting that other, unidentified compositional characters are affecting conversion efficiency. One such character, for example, could be the content of hydroxycinnamic acids, such as para-coumaric or ferulic acids, which were recently identified as key factors affecting the conversion efficiency of miscanthus biomass .
Comparative analysis of QTL in miscanthus and sorghum
In addition to identifying QTLs for miscanthus biomass composition and conversion characters, an objective of this study was to demonstrate that by aligning the genetic map of miscanthus to the physical map of Sorghum bicolor, the exchange of information from genetic studies across species is facilitated and a wealth of information becomes available for the genetic improvement of miscanthus. For this particular objective, the heading date of the genotypes used in this study was scored in both growth seasons, as this is a trait that normally has a high heritability in miscanthus and was previously mapped in miscanthus . Due to the high level of synteny between miscanthus and sorghum, QTL found in one species might have corresponding QTL in homologous regions in the other. In this study, 3 QTLs were identified for heading date, located on linkage groups 1c, 3c and 6d (Table 3). A QTL for heading date on the linkage group that aligns with Sb03 was also identified by Gifford et al.,  on the same position at the end of the chromosome arm (position 6–9 cM) as HD2 in this study. In addition, a QTL for heading date was consistently reported in sorghum on the end of the chromosome arm of Sb06 [50, 52, 53], which is in accordance with HD3 found in this study.
Similarly, QTLs for NDF are reported in sorghum on chromosomes Sb02, Sb03, Sb04 and Sb06 [50, 54], which may correspond to QTLs for NDF found in this study on the corresponding linkage groups 2c, 3a, 3c, 4c and 6c (Table 3, Fig. 3). The QTL on chromosome Sb03 was reported to have a strong effect and explained a large fraction of the observed variation in a sorghum mapping population . The strong effect of this QTL in sorghum may explain why the presumably corresponding QTL was detected on both the female and the male map in both growth seasons (NDF2 on linkage group 3a and NDF6 on linkage group 3d). QTLs for ADL were identified on Sb03, Sb04, Sb06, Sb07 and Sb08 in sorghum [50, 54], which may correspond to QTLs for ADL in this study, which were observed on all of the corresponding linkage groups (Table 3, Fig. 3). Similar to the clusters of QTLs for different traits that co-localized on miscanthus linkage groups 6b and 6c, a cluster of co-localizing QTLs, including QTLs for cellulose and hemicellulosic polysaccharide accumulation, was observed in sorghum chromosome Sb06 . In a number of genetic studies in sorghum that mapped conversion efficiency characters, QTLs for conversion efficiency repeatedly mapped to chromosome Sb03, Sb04 and Sb07 [55,56,57]. In this study, QTLs for SacR and GC were located on corresponding linkage groups 3b, 3c, 4b7b and 4d7d. However, several QTLs also mapped to linkage groups that correspond to sorghum chromosome Sb06, for which no QTL associated with conversion efficiency were detected in sorghum so far. These could represent previously unidentified loci affecting conversion efficiency in sorghum.
The fact that (1) several QTLs were identified in both growth seasons and (2) that several QTLs mapped to syntenous chromosomal segments in sorghum provides some indications that these QTLs contain genetic determinants for the traits of interest. Characterization of these QTLs, however, needs further validation. The alignment of this miscanthus genetic map to the Sorghum bicolor physical map facilitates the exchange of information between the two species, as well as to other grass species with a syntenic relationship to sorghum. Novel tools, such as the Orphan Crop Genome Browser provide excellent opportunities to exploit such phylogenetic relationships to annotate the genome of miscanthus . Using this tool the regions in the sorghum genome that are homeologous to the QTLs mapped in miscanthus in this study can be easily examined for putative orthologous genes that are reported to affect cell wall compositional characters in crops such as sorghum, maize or rice.
To our knowledge this is the first report of QTLs for biomass composition and conversion efficiency characters in miscanthus. The large number (86) of identified QTLs highlights the genetic complexity and highly quantitative genetic control of such traits. The alignment of this miscanthus genetic map to the Sorghum bicolor physical map facilitates cross-species comparisons of mapped traits and may expedite our understanding of the genetic control of important biomass quality traits in the large, complex and largely unexplored genome of the bioenergy crop miscanthus. These results are a first step towards the development of marker-assisted selection programs in miscanthus to improve biomass quality for biofuel production.
Heaton EA, Dohleman FG, Miguez AF, Juvik JA, Lozovaya V, Widholm J, Zabotina OA, McIsaac GF, David MB, Voigt TB, et al. Miscanthus. A promising biomass crop. Adv Bot Res. 2010;56:76–137.
Lewandowski I, Scurlock JMO, Lindvall E, Christou M. The development and current status of perennial rhizomatous grasses as energy crops in the US and Europe. Biomass Bioenergy. 2003;25(4):335–61.
Long SP, Beale CV, Farage PK. Resource capture by miscanthus. In: Jones B, Walsh M, editors. Miscanthus for energy and fibre. London: James & James (Science Publishers) Ltd; 2001. p. 10–21.
Heaton EA, Flavell RB, Mascia PN, Thomas SR, Dohleman FG, Long SP. Herbaceous energy crop development: recent progress and future prospects. Curr Opin Biotechnol. 2008;19(3):202–9.
van der Weijde T, Alvim Kamei CL, Torres AF, Vermerris W, Dolstra O, Visser RGF, Trindade LM. The potential of C4 grasses for cellulosic biofuel production. Front Plant Sci. 2013;4:107.
Clifton-Brown JC, Chiang YC, Hodkinson TR. Miscanthus: genetic resources and breeding potential to enhance bioenergy production. In: Vermerris W, editor. Genetic improvement of bioenergy crops. New York: Springer Science + Business Media, LLC; 2008. p. 295–308.
Lewandowski I, Clifton-Brown JC, Scurlock JMO, Huisman W. Miscanthus: European experience with a novel energy crop. Biomass Bioenergy. 2000;19(4):209–27.
Lafferty J, Lelley T. Cytogenetic studies of different Miscanthus species with potential for agricultural use. Plant Breed. 1994;113(3):246–9.
Sacks EJ, Juvik JA, Lin Q, Stewart JR, Yamada T. The gene pool of miscanthus species and its improvement. In: Paterson AH, editor. Genomics of the Saccharinae, vol. 11. New York: Springer New York; 2013. p. 73–101.
Heaton E, Voigt T, Long SP. A quantitative review comparing the yields of two candidate C4 perennial biomass crops in relation to nitrogen, temperature and water. Biomass Bioenergy. 2004;27(1):21–30.
Głowacka K. A review of the genetic study of the energy crop Miscanthus. Biomass Bioenergy. 2011;35(7):2445–54.
Yan J, Chen W, Luo FAN, Ma H, Meng A, Li X, Zhu M, Li S, Zhou H, Zhu W, et al. Variability and adaptability of Miscanthus species evaluated for energy crop domestication. GCB Bioenergy. 2012;4(1):49–60.
Hodkinson TR, Klaas M, Jones MB, Prickett R, Barth S. Miscanthus: a case study for the utilization of natural genetic variation. Plant Genet Resour. 2015;13(03):219–37.
Torres AF, Slegers PM, Noordam-Boot CMM, Dolstra O, Vlaswinkel L, Boxtel AJB, Visser RGF, Trindade LM. Maize feedstocks with improved digestibility reduce the costs and environmental impacts of biomass pretreatment and saccharification. Biotechnol Biofuels. 2016;9(63):1–15.
Torres AF, van der Weijde T, Dolstra O, Visser RGF, Trindade LM. Effect of maize biomass composition on the optimization of dilute-acid pretreatments and enzymatic saccharification. Bioenergy Res. 2013;6(3):1038–51.
Wyman CE. What is (and is not) vital to advancing cellulosic ethanol. Trends Biotechnol. 2007;25(4):153–7.
Pauly M, Keegstra K. Plant cell wall polymers as precursors for biofuels. Curr Opin Plant Biol. 2010;13(3):304–11.
Grabber JH, Ralph J, Lapierre C, Barrière Y. Genetic and molecular basis of grass cell-wall degradability. I. Lignin–cell wall matrix interactions. Comptes Rendus Biologies. 2004;327(5):455–65.
Grabber JH. How Do lignin composition, structure, and cross-linking affect degradability? A review of cell wall model studies. Crop Sci. 2005;45(3):820–31.
Zhao X, Zhang L, Liu D. Biomass recalcitrance. Part I: the chemical compositions and physical structures affecting the enzymatic hydrolysis of lignocellulose. Biofuels Bioprod Biorefin. 2012;6(4):465–82.
Himmel ME, Ding S-Y, Johnson DK, Adney WS, Nimlos MR, Brady JW, Foust TD. Biomass recalcitrance: engineering plants and enzymes for biofuels production. Science. 2007;315(5813):804–7.
Allison GG, Morris C, Clifton-Brown J, Lister SJ, Donnison IS. Genotypic variation in cell wall composition in a diverse set of 244 accessions of Miscanthus. Biomass Bioenergy. 2011;35(11):4740–7.
Zhao H, Li Q, He J, Yu J, Yang J, Liu C, Peng J. Genotypic variation of cell wall composition and its conversion efficiency in Miscanthus sinensis, a potential biomass feedstock crop in China. GCB Bioenergy. 2014;6(6):768–76.
Arnoult S, Mansard M-C, Brancourt-Hulmel M. Early prediction of biomass production and composition based on the first Six years of cultivation. Crop Sci. 2015;55(3):1104–16.
Atienza SA, Satovic ZS, Petersen KP, Dolstra OD, Martín AM. Preliminary genetic linkage map of Miscanthus sinensis with RAPD markers. Theor Appl Genet. 2002;105(6):946–52.
Kim C, Zhang D, Auckland S, Rainville L, Jakob K, Kronmiller B, Sacks E, Deuter M, Paterson A. SSR-based genetic maps of Miscanthus sinensis and M. sacchariflorus, and their comparison to sorghum. Theor Appl Genet. 2012;124(7):1325–38.
Ma X-F, Jensen E, Alexandrov N, Troukhan M, Zhang L, Thomas-Jones S, Farrar K, Clifton-Brown J, Donnison I, Swaller T, et al. High resolution genetic mapping by genome sequencing reveals genome duplication and tetraploid genetic structure of the diploid Miscanthus sinensis. PLoS ONE. 2012;7(3):e33821.
Swaminathan K, Chae W, Mitros T, Varala K, Xie L, Barling A, Glowacka K, Hall M, Jezowski S, Ming R, et al. A framework genetic map for Miscanthus sinensis from RNAseq-based markers shows recent tetraploidy. BMC Genomics. 2012;13(1):142.
Liu S, Clark LV, Swaminathan K, Gifford JM, Juvik JA, Sacks EJ. High-density genetic map of Miscanthus sinensis reveals inheritance of zebra stripe. GCB Bioenergy. 2015;n/a-n/a.
Atienza SG, Ramirez MC, Martin A. Mapping QTLs controlling flowering date in Miscanthus sinensis Anderss. Cereal Res Commun. 2003;31(3/4):265–71.
Atienza SG, Satovic Z, Petersen KK, Dolstra O, Martin A. Influencing combustion quality in Miscanthus sinensis Anderss.: identification of QTLs for calcium, phosphorus and sulphur content. Plant Breed. 2003;122(2):141–5.
Atienza SG, Satovic Z, Petersen KK, Dolstra O, Martín A. Identification of QTLs associated with yield and its components in Miscanthus sinensis Anderss. Euphytica. 2003;132(3):353–361.
Atienza SG, Satovic Z, Petersen KK, Dolstra O, Martín A. Identification of QTLs influencing agronomic traits in Miscanthus sinensis Anderss. I. Total height, flag-leaf height and stem diameter. Theor Appl Genet. 2003;107(1):123–9.
Atienza SG, Satovic Z, Petersen KK, Dolstra O, Martín A. Identification of QTLs influencing combustion quality in Miscanthus sinensis Anderss. II. Chlorine and potassium content. Theor Appl Genet. 2003;107(5):857–63.
Gifford JM, Chae WB, Swaminathan K, Moose SP, Juvik JA. Mapping the genome of Miscanthus sinensis for QTL associated with biomass productivity. GCB Bioenergy. 2015;7(4):797–810.
van der Weijde T, Torres AF, Dolstra O, Dechesne A, Visser RGF, Trindade LM. Impact of different lignin fractions on saccharification efficiency in diverse species of the bioenergy crop Miscanthus. BioEnergy Research. 2016;9(1):146–56.
Gomez LD, Whitehead C, Roberts P, McQueen-Mason SJ. High-throughput saccharification assay for lignocellulosic materials. J Vis Exp. 2011;53:e3240.
Tai TH, Tanksley SD. A rapid and inexpensive method for isolation of total DNA from dehydrated plant tissue. Plant Mol Biol Rep. 1990;8(4):297–303.
Grattapaglia D, Sederoff R. Genetic linkage maps of Eucalyptus grandis and Eucalyptus urophylla using a pseudo-testcross: mapping strategy and RAPD markers. Genetics. 1994;137(4):1121–37.
Van Ooijen JW. JoinMap 4, Software for the calculation of genetic linkage maps in experimental populations. Wageningen: Kyazma BV; 2006.
Churchill GA, Doerge RW. Empirical threshold values for quantitative trait mapping. Genetics. 1994;138(3):963–71.
Van Ooijen J. MapQTL® 6, Software for the mapping of quantitative trait in experiment populations of diploid species. Wageningen: Kyazma B V; 2009.
Hatfield RD, Jung H-JG, Ralph J, Buxton DR, Weimer PJ. A comparison of the insoluble residues produced by the Klason lignin and acid detergent lignin procedures. J Sci Food Agric. 1994;65(1):51–8.
Hatfield R, Fukushima RS. Can lignin be accurately measured? Crop Sci. 2005;45(3):832–9.
Xu N, Zhang W, Ren S, Liu F, Zhao C, Liao H, Xu Z, Huang J, Li Q, Tu Y, et al. Hemicelluloses negatively affect lignocellulose crystallinity for high biomass digestibility under NaOH and H2SO4 pretreatments in Miscanthus. Biotechnol Biofuels. 2012;5(1):58.
Li F, Ren S, Zhang W, Xu Z, Xie G, Chen Y, Tu Y, Li Q, Zhou S, Li Y, et al. Arabinose substitution degree in xylan positively affects lignocellulose enzymatic digestibility after various NaOH/H2SO4 pretreatments in Miscanthus. Bioresour Technol. 2013;130:629–37.
van der Weijde T, Kiesel A, Iqbal Y, Muylle H, Dolstra O, Visser RGF, Lewandowski I, Trindade LM. Evaluation of Miscanthus sinensis biomass quality as feedstock for conversion into different bioenergy products. GCB Bioenergy. 2016;9:176–90.
Lorenzana RE, Lewis MF, Jung HJG, Bernardo R. Quantitative trait loci and trait correlations for maize stover cell wall composition and glucose release for cellulosic ethanol. Crop Sci. 2010;50(2):541–55.
Torres A, Noordam-Boot CM, Dolstra O, van der Weijde T, Combes E, Dufour P, Vlaswinkel L, Visser RF, Trindade L. Cell wall diversity in forage maize: genetic complexity and bioenergy potential. Bio Energy Res. 2014;8(1):187–202.
Murray SC, Rooney WL, Mitchell SE, Sharma A, Klein PE, Mullet JE, Kresovich S. Genetic improvement of sorghum as a biofuel feedstock: II. QTL for stem and leaf structural carbohydrates. Crop Sci. 2008;48(6):2180–93.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215(3):403–10.
Zou G, Zhai G, Feng Q, Yan S, Wang A, Zhao Q, Shao J, Zhang Z, Zou J, Han B, et al. Identification of QTLs for eight agronomically important traits using an ultra-high-density map based on SNPs generated from high-throughput sequencing in sorghum under contrasting photoperiods. J Exp Bot. 2012;63(15):5451–62.
Felderhoff TJ, Murray SC, Klein PE, Sharma A, Hamblin MT, Kresovich S, Vermerris W, Rooney WL. QTLs for Energy-related Traits in a Sweet × Grain Sorghum [Sorghum bicolor (L.) Moench] Mapping Population. Crop Sci. 2012;52(5):2040–9.
Shiringani AL, Friedt W. QTL for fibre-related traits in grain × sweet sorghum as a tool for the enhancement of sorghum as a biomass crop. Theor Appl Genet. 2011;123(6):999–1011.
Vandenbrink JP, Goff V, Jin H, Kong W, Paterson AH, Alex Feltus F. Identification of bioconversion quantitative trait loci in the interspecific cross Sorghum bicolor × Sorghum propinquum. Theor Appl Genet. 2013;126(9):2367–80.
Wang Y-H, Acharya A, Burrell AM, Klein RR, Klein PE, Hasenstein KH. Mapping and candidate genes associated with saccharification yield in sorghum. Genome. 2013;56(11):659–65.
Wang Y-H, Poudel DD, Hasenstein KH, Van Deynze A. Identification of SSR markers associated with saccharification yield using pool-based genome-wide association mapping in sorghum. Genome. 2011;54(11):883–9.
Kamei CLA, Severing EI, Dechesne A, Furrer H, Dolstra O, Trindade LM. Orphan crops browser: a bridge between model and orphan crops. Mol Breed. 2016;36(9). doi:10.1007/s11032-015-0430-2.
We gratefully acknowledge the help of Gerrit Huisman and Herman Meurs for establishment and harvest of the field trial, as well as the help of Eyup Oztutuncu, Juan Carlos Rivas Baeza, Heleen Fürrer, Jan Leunissen and Annemarie Dechesne for technical assistance during laboratory analyses.
The presented research has received funding from the European Union consortium SUNLIBB (project ID 251132). Tim van der Weijde and Luisa Trindade further acknowledge funding from European Union consortium OPTIMISC (project ID 289159) under grant agreement n° 289159. Both consortia operated under the European Union Seventh Framework Programme (FP7/2007–2013). We further acknowledge Genencor International B.V./DuPont Industrial Biosciences for kindly supplying us with their commercial Accellerase 1500 enzyme cocktail used in this study.
Availability of data and materials
All data generated or analysed during this study are included in this published article (and its supplementary information files).
OD and LMT created the mapping population and designed the field experiment. CLAK performed DNA extractions. CLAK and EIS guided the sequencing experiment and analyzed and transformed sequence data into short-sequence haplotype markers. OD and TvdW created the genetic map. TvdW, AT, LMT, LDG and SJMM guided and performed biochemical and NIR analysis of stem samples. TvdW, OD and CAM mapped the traits on the genetic map. TvdW wrote the manuscript, with contributions from the coauthors in the methods section. LM, OD, RGFV, AT, CLAK and EIS have edited the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Synteny map depicting the alignment and localization of M. sinensis mapped markers to the Sorghum bicolor (L.) Moench genome. Linkage groups of the female map are designated as ‘a’ or ‘b; linkage groups of the male map are designated as ‘c’ or ‘d’. The position of mapped QTLs is also shown. For each QTL, colored boxes indicate 1-LOD support intervals while extension bars delimit 2-LOD support intervals. (PDF 5499 kb)
About this article
Cite this article
van der Weijde, T., Kamei, C.L.A., Severing, E.I. et al. Genetic complexity of miscanthus cell wall composition and biomass quality for biofuels. BMC Genomics 18, 406 (2017). https://doi.org/10.1186/s12864-017-3802-7
- Quantitative trait loci (QTL)
- Genetic map
- Biomass quality
- Cell wall composition
- Saccharification efficiency
- Conversion efficiency