- Research article
- Open Access
Transciptome analysis reveals flavonoid biosynthesis regulation and simple sequence repeats in yam (Dioscorea alata L.) tubers
© Wu et al.; licensee BioMed Central. 2015
- Received: 31 October 2014
- Accepted: 17 April 2015
- Published: 30 April 2015
Yam (Dioscorea alata L.) is an important tuber crop and purple pigmented elite cultivar has recently become popular because of associated health benefits. Identifying candidate genes responsible for flavonoid biosynthesis pathway (FBP) will facilitate understanding the molecular mechanism of controlling pigment formation in yam tubers. Here, we used Illumina sequencing to characterize the transcriptome of tubers from elite purple-flesh cultivar (DP) and conventional white-flesh cultivar (DW) of yam. In this process, we also designed high quality molecular markers to assist molecular breeding for tuber trait improvement.
A total of 125,123 unigenes were identified from the DP and DW cDNA libraries, of which about 49.5% (60,020 unigenes) were annotated by BLASTX analysis using the publicly available protein database. These unigenes were further annotated functionally and subject to biochemical pathway analysis. 511 genes were identified to be more than 2-fold (FDR < 0.05) differentially expressed between the two yam cultivars, of which 288 genes were up-regulated and 223 genes were down-regulated in the DP tubers. Transcriptome analysis detected 61 unigenes encoding multiple well-known enzymes in the FBP. Furthermore, the unigenes encoding chalcone isomerase (CHS), flavanone 3-hydroxylase (F3H), flavonoid 3′-monooxygenase (F3’H), dihydroflavonol 4-reductase (DFR), leucoanthocyanidin dioxygenase (LDOX), and flavonol 3-O-glucosyltransferase (UF3GT) were found to be significantly up-regulated in the DP, implying that these genes were potentially associated with tuber color formation in this elite cultivar. The expression of these genes was further confirmed by qRT-PCR. Finally, 11,793 SSRs were successfully identified with these unigenes and 6,082 SSR markers were developed using Primer 3.
This study provides the first comprehensive transcriptomic dataset for yam tubers, which will significantly contribute to genomic research of this and other related species. Some key genes associated with purple-flesh trait were successfully identified, thus providing valuable information about molecular process of regulating pigment accumulation in elite yam tubers. In the future, this information might be directly used to genetically manipulate the conventional white-fleshed tuber cultivars to enable them to produce purple flesh. In addition, our SSR marker sets will facilitate identification of QTLs for various tuber traits in yam breeding programs.
- Flavonoid biosynthesis
- Dioscorea alata L
- Tuber color
- Differentially expressed genes
- Microsatellite markers
Yam (Dioscorea alata L.) is an important tuber crop valued for its dietary carbohydrate, amino acids and essential minerals. It is widely cultivated in tropical and subtropical regions. Most D. alata tubers have white flesh, but occasionally, purple-flesh tubers with high anthocyanidin content are produced because of spontaneous variation . Anthocyanins are responsible for the deep purple to red pigmentation of various flowers, fruits, leaves, and other plant tissues . Anthocyanins are perhaps the best characterized flavonoids with studies indicating their important role in plant physiology, in particular, plant defense against herbivores and pathogens. They have also been shown to have multiple health benefits for humans including immunomodulatory, anticancer, cardio-protective, vasodilation, antithrombotic, and UV-protection due to their antioxidant, and anti-inflammatory properties . Therefore, it is no surprise that the purple-flesh yam tubers have recently been selling at a premium price owing to consumer awareness about its health benefits. The current study is aimed at understanding how spontaneous variation leads to anthocyanin synthesis in certain yam strains. Understanding the molecular mechanism of triggering anthocyanin biosynthesis and accumulation in these strains makes it possible to transfer the purple pigment trait to conventional white-flesh cultivar, thus improving the tuber quality and market value.
In the past decade, the flavonoid biosynthesis pathway (FBP) has been well characterized genetically and biochemically in model and non-model plants [4,5]; and a number of genes encoding important enzymes and transcription factors responsible for the FBP have been cloned from a dozen organisms such as Arabidopsis , grapevine , Petunia , and maize . Nevertheless, the complicated mechanism that controls anthocyanin catabolism in different plant species and tissues is far from conclusive. It is reasonable to expect that the loss or accumulation-of-color adaptations in yams are relatively unconstrained because they can be obtained in various ways , and affected by multiple intracellular factors such as co-pigmentation , pH  and metal-chelation in vacuoles . The promotion or suppression of any one of the enzymes catalyzing a series of reactions that make up a pathway will change its final product. For example, Chen et al.  found that for independent events causing accumulation of red pigments in variegated peach flowers, a particular subset of genes (C4H, CHS, CHI and F3H) were enhanced and co-regulated in the FBP. Lou et al.  also revealed that the loss of delphinidin (blue pigment) resulted from the gene suppression of FLS and DFR in grape hyacinth.
Recently, transcriptomic analysis based on next generation sequencing (NGS) technology has emerged as an extremely powerful method for identifying novel genes associated with biosynthesis of various secondary metabolites in non-model plant species [16,17]. Specially, it has been widely applied to investigate molecular mechanisms of color variation in plant species such as blueberry , grape , Brassica Juncea , and potato . In yam, transcription profiles of leaf tissues from one anthracnose susceptible (TDa 95–0310) and two resistant yam genotypes (TDa 87–01091, TDa 95–0328) were analyzed upon infection with the anthracnose fungus; a set of genes involved in defense against anthracnose were identified . Anthocyanins are considered as an important quality trait in yam . However, to date, no effort has been made to uncover molecular basis of different color formation in yam tubers by using RNA-Seq. A previous study by Zhou et al.  through RACE technology and RT-PCR analysis, reported that DaANS1 (a member of ANS genes in FBP) controls anthocyanin accumulation of purple-flesh tubers based on its regulation at transcription level. However, a limitation in this study was the use of single cultivar (purple-flesh tuber) and study of one gene (ANS). Without comparing the global transcriptional differences between the purple-flesh cultivar and conventional white-flesh cultivar, it is impossible to separate candidate genes related to color formation. Therefore, the molecular mechanism underlying the purple-flesh formation has not yet been fully understood in yam.
Further, being a non-model species, there is lack of genomic resources, in particular, information on SSR markers for marker-assisted breeding (MAS) of yam. Previous genetic inheritance study revealed that some important traits (such as resistance) are controlled by a single dominant locus in yam . Genic-SSR markers appear to be tightly linked to specific gene functions and perhaps even play a direct role in controlling important traits [25,26]. Recent studies have demonstrated the variability in cultivated yam accessions in terms of tuber shape, color, taste and yield [27,28]. However, very limited knowledge is available on the genetic regions associated with these variations, in particular SSR or SNP markers for important genes [29,30]. Therefore, identification of SSR markers from yam transcriptome is crucial for the future of marker-assisted breeding programs.
Here, we report use of RNA-Seq to investigate the transcriptomic differences between yam tubers of a purple-flesh cultivar (DP) and conventional white-flesh cultivar (DW). Differentially expressed genes and their expression patterns were analyzed, and some potential candidate genes responsible for the FBP were successfully identified. We expect this genome-wide transcriptome comparison to provide a novel resource to understand the molecular mechanisms underlying the purple-flesh trait. Moreover, transcriptomic datasets were further exploited to identify a large number of gene-based SSR markers that enable linkage mapping and marker assisted breeding of yams.
Sequencing statistics and assembly
Number of mapped reads to the assembled unigenes of yam
Raw bases (bp)
Clean bases (bp)
Raw reads (N)
Clean reads (N)
Total alignment (percent of clean reads) (N, %)
Unique matches (percent of clean reads) (N, %)
Multi-position match (percent of clean reads) (N, %)
Currently, the yam EST library found in Genbank database (http://www.ncbi.nlm.nih.gov/nucest/?term=Dioscorea+alata ) contains 44,134 ESTs from leaves of three genotypes differing in resistance to anthracnose disease . To estimate the level of transcript coverage in this study, we downloaded these ESTs from GenBank and compared them to our transcriptome unigenes using BLASTN (e ≤ 1.00 × 10−7). Only 32.88% ESTs (14,512,) from GenBank matched to 23,874 unigenes (Additional file 2). This was probably associated with different tissues used for transcriptome analysis. It also highlights the high level of sequencing depth achievable through NGS compared to low coverage obtained using conventional cDNA library sequencing. Furthermore, 97,379 novel yam unigenes were discovered, some of which may be specifically expressed in yam tuber tissue. These novel unigenes may serve as a crucial genomic resource for future studies, such as gene identification, cloning and functional analysis.
Annotation, functional classification and KEGG pathway analysis of the unigenes
Summary statistics of functional annotation for yam tuber unigenes in public databases
Public protein database
NO. of unigene hit
In addition, to identify which metabolic pathways were enriched, a pathway-based analysis was conducted through the KEGG pathway database using BLASTX with an E-value cutoff of <10−5. In total, 24,289 unigenes were assigned to 279 KEGG pathways (Additional file 3). The result reveals that metabolic pathway (Ko01100) was the most enriched (4,064 unigenes), followed by biosynthesis of secondary metabolites (Ko1110; 1,687 unigenes) and microbial metabolism in diverse environments (Ko1120; 1,020 unigenes). The focus of this study was differential anthocyanin accumulation in the DP cultivar. Therefore, genes associated with two secondary metabolic pathways including flavonoid biosynthesis, flavone and flavonol biosynthesis were separately analyzed. A total of 61 genes were found to be directly or indirectly involved in flavonoid biosynthesis, and they were mapped and highlighted in this pathway (Ko00941) (Additional file 4). In contrast, relatively few genes (25) were found to encode key enzymes in the flavone and flavonol biosynthesis (Ko00944) (Additional file 5). Overall, these findings provide useful information to further uncover the molecular mechanism of anthocyanin accumulation in yam tuber.
Identification of differentially expressed genes (DEGs) in yam tubers of the purple-flesh and white-flesh cultivars
To profile gene expression, the expression levels were measured as Fragments Per Kilobase of transcript per Million fragments mapped (FPKM), with FPKM values ranging from 0 to 104 . As a result, 63,040 and 100,140 unigenes were discovered in the DP and DW libraries, respectively. Among them, 15,048 unigenes specifically expressed in the tuber of DP, 52,148 unigenes only expressed in the DW, and 47,992 unigenes expressed in both cultivars. This indicated that some unique genes may play an important role in the accumulation of purple pigment.
Based on the false discovery rate (FDR) ≤ 0.05, and fold change (FC) ≥ 1, 511 DEGs were identified from the two libraries, among which 288 genes were up-regulated, and 223 genes were down-regulated in DP versus DW. For a detailed comparison, see Additional file 6. There were more up-regulated genes than down-regulated ones, suggesting that many genes were positively regulated for biosynthesis of anthocyanins. Similar results were also reported in other species [19,20]. Annotation of differentially expressed unigenes revealed that 433 unigenes were grouped into 45 GO groups while the remaining 78 unigenes could not be classified (not shown). The most common categories were “intracellular part” (27 up-regulated and 17 down-regulated) and “protein binding” (24 up-regulated and 17 down-regulated), followed by “cellular macromolecule metabolic process”, and “intracellular organelle”.
Identification of candidate genes associated with the flavonoid biosynthesis pathway
Candidate genes associated with anthocyanin pigmentation in yam tuber.
KO id (EC No.)
NO. all a
NO. up b
NO. down c
Shikimate hydroxycinnamoyl transferase
As shown in Figure 5, flavonoids are initially derived from cinnamate and converted to chalcone via the phenylpropanoid pathway by cinnamate 4-monooxygenase (C4H) (EC 18.104.22.168; 5 annotated unigenes) and CHS (EC 22.214.171.124; 17 unigenes). Subsequently, CHI (EC 126.96.36.199; 1 unigene) catalyzes the stereo-specific cyclization of chalcones into naringenin. Furthermore, naringenin can be converted through F3H (EC 188.8.131.52; 2 unigenes) and F3’H (EC 184.108.40.206; 8 unigenes) to produce dihydroxyflavonols including dihydrokaempferol (DHK) and dihydroquercetin (DHQ). These flavanones serve as the lead compounds for conversion into almost all flavonoids. Following the above reaction, DFR (EC 220.127.116.11; 2 unigenes) further catalyzes the divergent conversion of dihydroflavonols to produce colorless procyanidins including leucopelargonidin, and leucocyanidin. They are the direct precursors for production of colored anthocyanidins (pelargonidin, and cyanidin) by LODX (EC18.104.22.168; 2 unigenes) catalysis. In the end, two glucosyltransferases [UGT75C1 (EC 22.214.171.1248; 1 unigene) and UF3GT (EC 126.96.36.199; 3 unigenes)] catalyze the glucosylation of anthocyanidins to produce stable molecules of the FBP. Notably, the formation of (−) -epiflavan 3-ols (such as epicatechin and epigallocatechin) is also achieved by a two-step conversion of leucoanthocyanidin by leucoanthocyanidin reductase [LAR, (EC 188.8.131.52; 1 unigene)] and anthocyanidin reductase [ANR, (EC 184.108.40.206; 1 unigene)] (not shown in Figure 5), suggesting that flavonoid biosynthesis looks more like a complex metabolic grid than a linear pathway .
Considering the anthocyanin accumulation in purple-flesh tuber is associated with specific molecular functions, we compared the differences in gene expression profile of the purple and white flesh tubers to identify putative genes co-expressed with anthocyanin accumulation. Among the above described genes involved in the FBP (Table 3), one CHS (unigene 003987), one F3H (unigene005154), one DFR (unigene004195), one UF3GT (unigene02509), two F3’H (unigene014794, unigene004018), and two LDOX (unigene028912, unigene017716) homologous sequences were significantly up-regulated in the purple-flesh tuber. In contrast, one O-methyltransferase (FOMT, unigene065894) was significantly down-regulated. These up-regulated genes code for important proteins and their expression directly correlated with anthocyanin biosynthesis (Figure 5). For example in the upstream of the FBP, the up-regulated CHS, F3H and F3’H in purple-flesh tubers can increase functional redundancies for forming primary precursor (chalcone) and lead compounds (DHK, DHQ) of all flavonoids. Similar results were also observed during the differential pigment deposition in potato tubers , peach and grape flowers [14,15].
On the other hand, in the downstream of FBP, three up-regulated genes (DFR, LDOX, UF3GT) also play a critical role during formation of colored anthocyanins. We found that DFR and LDOX unigenes were not expressed in the white-flesh tubers, whereas two LDOX unigenes were expressed at high levels in the purple-flesh tubers (Additional file 6). In addition, the up-regulated glycosyltransferase (UF3GT) can potentially make structural modifications to anthocyanins. Two anthocyanins (cyanidin-3- O-glucoside and pelargonidin-3- O-glucoside) in the purple-flesh tuber are glycosylated at the 3-postion of the C-ring by this enzyme. Similar results were also reported in previous studies [35-37]. For instance, two glycosyltransferases, UGT79B1 and UGT84A2 were found to cause high levels of anthocyanin modifications (3-O-glucosylated anthocyanidins) in Arabidopsis flavonoid biosynthesis , whilst anthocyanins were drastically reduced in the UGT79B1 and UGT84A2 knockout mutants. Besides, the O-methyltransferase is one of the most important modification reactions of flavonoids and the resulting O-methylated flavonoids have been shown to display new biological activities . In this study, the down-regulated O-methyltransferase (FOMT) was assigned to code quercetin-3-O-methyltransferase protein and may have redundant function in the FBP. Taken together, these results indicate that key genes responsible for the FBP have a higher expression level in the purple-flesh tubers of yam. This finding is an important explanation of well-known higher antioxidant activity in pigmented tissues found in a number of tissues.
Identification of genes associated with transcription factors (TFs)
Besides structural genes, it is well known that transcription factors play an essential role in regulating the overall activity of flavonoid biosynthesis. In most species, the anthocyanin branch within the FBP is controlled by a ternary complex of MYB-bHLH-WD40 TFs [5,39], which generally regulate expression of many structural genes. In our transcriptome database, a total of 183, 146, 95 unigenes were respectively predicted to code bHLH, MYB, and WD40 proteins including a large number of its members. Of these genes, the transcriptomic analysis detected four TFs that were differentially expressed between the two cultivars of yam tubers, including three WD40 repeat proteins with one up-regulated (unigene050252) and two down-regulated (unigene041043, unigene056944) in purple-flesh tubers. Further, one MYB4R1 protein (unigene029894) was also found to be up-regulated in purple-flesh tubers (Additional file 6). The high variation in expression of structural genes associated with the FBP in the purple-flesh tubers may most likely be regulated by one or more of these TFs. However, the specific function of these TFs in the FBP of yams still needs to be validated using a functional genomics approach.
Gene validation and expression analysis
Identification of simple sequence repeats (SSRs) in yam
Usually, gene-derived SSRs are more transferable between species than random genomic SSRs. This is perhaps because they are associated with functional genetic variation, as opposed to non-coding SSRs, with presence in transcribed regions potentially influencing gene function, transcription or translation . In this study, transcriptome analysis of the two yam tuber cultivars (DP and DW) led to identification of 11,793 SSRs within 121,253 unigenes, of which, 977 sequences contained more than 1 SSR, and 1706 SSRs were present in compound form. The observed frequency of unigenes was 8.6% (10,426); considering that approximately 71,983 kb total size was analyzed, and least one SSR per 6.1 kb could be detected in the expressed sequences of yam.
Frequencies of repeat types with repeat number in the cSSRs of yam
The focus of this study was use of NGS-based Illumina paired-end sequencing platform to characterize the gene expression differences between an elite purple-flesh tuber and conventional white-flesh tuber of yam. A total of 125,123 unigenes were identified from the two cDNA libraries, which will contribute significantly to further genome-wide research and analyses of this species and other related species. Analysis of the transcriptome data revealed a number of candidate genes which are possibly involved in purple-flesh tuber formation. The candidates include not only structural genes such as CHS, F3H, F3’H, DFR, LDOX and UF3GT, but also some transcription factors (bHLH, MYB, and WD40) that potentially regulate development of purple-flesh in yam tubers. Such knowledge can be used to genetically enhance tuber color of conventional white-flesh cultivar. In addition, we also used transcriptomic data as a resource to develop new SSR markers. These marker sets will facilitate identification of quantitative traits loci (QTL) associated with yam tuber quality in future.
The elite purple-flesh tubers and the conventional white-flesh tubers of yam (D. alata) were cultivated in a yam producing region (Wenzhou city, Zhejiang province, China; 121°09′48.82″ E and 28°27′53.62 N). Both were planted at the same time and cultivated in similar conditions. Tubers were harvested 10 days after new tuber emergence (DAM) and used for transcriptome analysis. Tubers of each cultivar were collected from five different plants, with a total of 15 tubers per cultivar. The tubers were washed and their skin was peeled off. The samples were labeled as DP (purple-flesh tuber) and DW (white-flesh tuber), then immediately frozen in liquid nitrogen, and stored at −80°C prior to use.
RNA extraction, cDNA library construction and Illumina deep sequencing
Total RNA from the DP and DW samples was extracted using the RNAiso kit for polysaccharide-rich plant tissue (Takara Biotechnology (Dalian) Co., Ltd.) and purified using RNeasy plant mini kit (Qiagen, Valencia, CA) to avoid DNA contamination. The RNA quality was analysed by measuring the absorbance at 260 nm/280 nm (A260/A280) using a ND-1000 spectrophotometer (Nano-Drop Technologies, Wilmington, DE, USA). Further, RNA Integrity Number (RIN) values were determined using a Bioanalyzer 2100 (Aligent Technologies, Santa Clara, CA) to make sure all samples had a RIN greater than 8.5. Two separate RNA pools for the DP and DW cultivars were prepared for cDNA library construction, each comprising 15 RNA samples from 15 tubers of five plants per cultivar.
Two sequencing libraries were constructed using a cDNA Synthesis kit (Illumina Inc., San Digo, CA, USA) following the manufacturer’s instructions. Paired-end (2 × 150 bp) sequencing of the cDNA libraries was performed on the Illumina HiSeq 2000 (Illumina Inc., San Diego, CA, USA). Libraries from both the cultivars yielded more than 4 GB of clean data. Sequencing was completed by the Hangzhou Woosen Bio-technology Co. Ltd. (Hangzhou, China).
Reads assembly and transcriptome annotation
The clean reads were obtained by read trimming of raw data by removing adaptors, reads in which unknown bases were more than 10%, low-quality reads with quality scores less than Q30, and low-quality bases less than (Q30) at the 3′ end. Next, the high-quality filtered reads were further assembled using a de novo assembly program Trinity (released 2011-05-19, http://trinityrnaseq.sourceforge.net/) with the main parameters “K-mer = 25, group_pairs_distance = 500, min_glue = 2, min_kmer_cov = 1” . .Briefly, for each library (DP and DW), short reads were first assembled into longer contiguous sequences (contigs) according to their overlap regions, and then these reads were mapped back to the contigs based on their paired-end information. With paired-end reads it is possible to detect the contigs from the same transcript as well as the distances between these contigs. Afterwards, the contigs were further assembled, and the assembled sequences that could not be extended on either end and were defined as unigenes. Finally, the potential unigenes from DP and DW library were clustered using the TGICL clustering tool  to acquire a single set of non-redundant unigenes. In addition, to obtain assembly statistics profile about reads that could be mapped back to the assembled unigenes, TopHat (version 2.0.8) (released 2013-02-26, http://tophat.cbcb.umd.edu/)  with the parameter “mate-inner-dist = 250”, was used to align short reads to the constructed transcripts by de novo assembling.
All assembled unigenes were annotated by matching against the NCBI non-redundant protein (NR), the Arabidopsis thaliana protein dataset of NR (ATNR), Gene Ontology (GO), and the Kyoto Encyclopedia of Genes and Genomes (KEGG) using the BLASTX analysis with a cut-off E-value of 10−5. Based on NR annotation, the Blast2GO software (version 2.3.5) was used to obtain GO annotations according to molecular function, biological process and cellular component ontologies (http://www.geneontology.org) . The unigene sequences were subsequently matched against the COG database to predict and classify possible functions. The KEGG pathway annotation was also performed by comparison against the KEGG database using the online KEGG Automatic Server (KAAS) (http://www.genome.jp/tools/kaas/) [46,47].
Differentially expressed genes (DEGs) between the DP and DW tubers
In order to assess the differential expression between the two investigated yam cultivars, TopHat (version 2.0.8) was first used to match against the assembled unigenes, which was followed by estimation of total mapped reads . After the alignment, cufflinks (version 2.1.1) (released 2013-04-11, http://cufflinks.cbcb.umd.edu) was used to estimate the abundances of unigenes as Fragments Per Kilobase of transcript per Million fragments mapped (FPKM) ; and cuffdiff was carried out to perform pairwise comparisons between different investigated cultivars. Differentially expressed genes (DEGs) were further characterized and estimated using the R software module edge R (R v2.14; edgeRv 2.3.52) in term of the results from cufflinks . False discovery rate (FDR) <0.05 and an estimated absolute log2 fold-change (log2 FC) ≥ 1 were used as threshold for determining significant difference in gene expression between the purple- and white flesh tubers of yam. Moreover, all DEGs were mapped to terms in the KEGG database and searched for KEGG terms to identify pathways related to purple-flesh trait in yam tubers.
Total RNA was extracted from the white and purple flesh-tubers of yam as described above. Approximately 2 mg of total RNA per sample was treated with DNaseI (Takara), and reverse transcribed into cDNA using Promega A3500 reverse transcription system. Eight DEGs were selected for Quantitative real-time PCR (qRT-PCR) analysis to verify the expression patterns revealed by the RNA-seq analysis. Gene specific qRT-PCR primers (Additional file 7) were designed using Premier 5.0 software (Premier Biosoft International, Palo Alto, CA). qRT-PCR was performed using SybrGreen qRT-PCR Master Mix (Ruian Biotechnologies, Shanghai, China) in an ABI 7500 FAST Real-Time PCR System (Applied Biosystems, Foster City, CA, USA). Amplification program comprised an initial denaturation step at 95°C for 2 min, followed by 40 cycles of denaturation at 95°C for 10 s and annealing at 60°C for 30 s. Three replicates were performed, and the amplicons were subject to melting curve analysis to determine amplification specificity. The relative expression level of the selected unigenes were normalized to UBC gene and calculated using the 2-ΔΔCt method .
Simple sequence repeats (SSRs) identification and primer design
Simple sequence repeats (SSRs) in unigene sequences were identified using MIcroSAtellite package (MISA, http://pgrc.ipk-gatersleben.de/misa). The SSR search parameters were defined to identify di-, tri-, tetra-, penta- and hexa-nucleotide motifs with a minimum of 6, 5, 4, 5 and 5 repeats, respectively. Subsequently, primer pairs were designed for genes with SSRs using the Primer3 (version 2.23) (http://sourceforge.net/projects/primer3/) with default settings , and the PCR product size ranging from 100 to 280 bp.
Availability of supporting data
All clean reads generated by Illumina sequencing have been deposited in the Sequence Read Archive (SRA) database (http://trace.ncbi.nlm.nih.gov/Traces/sra/) under the accession ID SRX652481 for DP, and SRX652483 for DW.
The present study was financially supported by Zhejiang Provincial Natural Science Foundation of China (Grant No. LY14C130002), Zhejiang Provincial Key Laboratory for Genetic Improvement and Quality Control of Medicinal Plants (Grant No. 2011E10015), and the Innovation and Improvement Program in Zhejiang Academy of Agricultural Sciences (Grant No. 2014CX021).
- Arnau G, Abraham K, Sheela MN, Chair H, Sartie A, Asiedu R. Yams. In: Bradshaw JE, editor. Root and Tuber Crops. New York, USA: Springer-Verlag Inc; 2010.Google Scholar
- Tanaka Y, Sasaki N, Ohmiya A. Biosynthesis of plant pigments: anthocyanins, betalains and carotenoids. Plant J. 2008;54:733–49.View ArticlePubMedGoogle Scholar
- Tsuda T. Dietary anthocyanin-rich plants: Biochemical basis and recent progress in health benefits studies. Mol Nutr Food Res. 2012;56:159–70.View ArticlePubMedGoogle Scholar
- Hichri I, Barrieu F, Bogs J, Kappel C, Delrot S, Lauvergeat V. Recent advances in the transcriptional regulation of the flavonoid biosynthetic pathway. J Exp Bot. 2011;62:2465–83.View ArticlePubMedGoogle Scholar
- Petroni K, Tonelli C. Recent advances on the regulation of anthocyanin synthesis in reproductive organs. Plant Sci. 2011;181:219–29.View ArticlePubMedGoogle Scholar
- Saito K, Yonekura-Sakakibara K, Nakabayashi R, Higashi Y, Yamazaki M, Tohge T, et al. The flavonoid biosynthetic pathway in Arabidopsis: structural and genetic diversity. Plant Physiol Biochem. 2013;72:21–34.View ArticlePubMedGoogle Scholar
- Czemmel S, Heppel SC, Bogs J. R2R3 MYB transcription factors: key regulators of the flavonoid biosynthesis pathway in grapevine. Protoplasma. 2012;249:109–18.View ArticleGoogle Scholar
- Kroon J, Souer E, de Graaff A, Xue Y, Mol J, Koes R. Cloning and structural analysis of the anthocyanin pigmentation locus Rt of Petunia hybrida: Characterization of insertion sequences in two mutant alleles. Plant J. 1994;5:69–80.View ArticlePubMedGoogle Scholar
- Selinger DA, Chandler VL. A mutation in the pale aleurone color1 gene identifies a novel regulator of the maize anthocyanin pathway. Plant Cell. 1999;11:5–14.View ArticlePubMed CentralPubMedGoogle Scholar
- Clark ST, Verwoerd WS. A systems approach to identifying correlated gene targets for the loss of colour pigmentation in plants. BMC Bioinformatics. 2011;12:343.View ArticlePubMed CentralPubMedGoogle Scholar
- Bloor SJ, Falshwa R. Covalently linked anthocyanin-flavonol pigments from blue Agapanthus flowers. Phytochemistry. 2000;53:575–9.View ArticlePubMedGoogle Scholar
- Yoshida K, Toyama-Kato Y, Kameda K, Kondo T. Sepal color variation of Hydrangea macrophylla and vacuolar pH measured with a proton-selective microelectrode. Plant Cell Physiol. 2003;44:262–8.View ArticlePubMedGoogle Scholar
- Yoshida K, Kitahara S, Ito D, Kondo T. Ferric ions involved in the flower color development of the Himalayan blue poppy, Meconopsis grandis. Phytochemistry. 2006;67:992–8.View ArticlePubMedGoogle Scholar
- Chen YN, Mao Y, Liu HL, Yu FX, Li SX, Yin TM. Transcriptome analysis of differentially expressed genes relevant to variegation in peach flowers. PLoS One. 2014;9:e90842.View ArticlePubMed CentralPubMedGoogle Scholar
- Lou Q, Liu YL, Qi YY, Jiao SZ, Tian FF, Jiang L, Wang YJ: Transcriptome sequencing and metabolite analysis reveals the role of delphinidin metabolism in flower colour in grape hyacinth. J Exp Bot 2014;65:3157-3164.Google Scholar
- Costa V, Angelini C, De Feis I, Ciccodicola A. Uncovering the complexity of transcriptomes with RNA-Seq. J Biomed Biotechnol. 2010;2010:853–916.View ArticleGoogle Scholar
- Ward JA, Ponnala L, Weber CA. Strategies for transcriptome analysis in nonmodel plants. Am J Bot. 2012;99:267–76.View ArticlePubMedGoogle Scholar
- Li XY, Sun HY, Pei JB, Dong YY, Wang FW, Chen H, et al. De novo sequencing and comparative analysis of the blueberry transcriptome to discover putative genes related antioxidants. Gene. 2012;511:54–61.View ArticlePubMedGoogle Scholar
- Liu XJ, Lu Y, Yuan YH, Liu SY, Guan CY, Chen SY, et al. De Novo transcriptome of Brassica juncea seed coat and identification of genes for the biosynthesis of flavonoids. PLoS One. 2013;89, e7110.Google Scholar
- Stushnoff C, Ducreux LJM, Hancock RD, Hedley PE, Holm DG, McDougall GJ, et al. Flavonoid profiling and transcriptome analysis reveals new gene-metabolite correlations in tubers of Solanum tuberosum L. J Exp Bot. 2010;61:1225–38.View ArticlePubMed CentralPubMedGoogle Scholar
- Narina SS, Buyyarapu R, Kottapalli KR, Sartie AM, Ali MI, Robert A, et al. Generation and analysis of expressed sequence tags (ESTs) for marker development in yam (Dioscorea alata L.). BMC Genomics. 2011;12:100.View ArticlePubMed CentralPubMedGoogle Scholar
- Fang ZX, Wu D, Yü D, Ye XQ, Liu DH, Chen JC. Phenolic compounds in Chinese purple yam and changes during vacuum frying. Food Chem. 2013;128:943–8.View ArticleGoogle Scholar
- Zhou SM, Wang LP, Xiang X, Wei BH, Li LZ, Li YR, et al. Cloning and molecular characteristics of ANS gene and its correlations with anthocyanin accumulation in yam. Acta Hort Sin. 2009;36:1317–26.Google Scholar
- Petro D, Onyeka TJ, Etienne S, Rubens S. An intraspecific genetic map of water yam (Dioscorea alata L.) based on AFLP markers and QTL analysis for anthracnose resistance. Euphytica. 2011;179:405–16.View ArticleGoogle Scholar
- Gupta PK, Rustgi S. Molecular markers from the transcribed/expressed region of the genome in higher plants. Funct Integr Genomic. 2004;4:139–62.Google Scholar
- Tranbarger TJ, Kluabmongkol W, Sangsrakru D, Morcillo F, Tregear JW, Tragoonrung S, et al. SSR markers in transcripts of genes linked to post-transcriptional and transcriptional regulatory functions during vegetative and reproductive development of Elaeis guineensis. BMC Plant Biol. 2012;12:1.View ArticlePubMed CentralPubMedGoogle Scholar
- Mengesha WA, Demissew S, Fay MF, Smith RJ, Nordal I, Wilkin P. Genetic diversity and population structure of Guinea yams and their wild relatives in South and South West Ethiopia as revealed by microsatellite markers. Genet Resour Crop Evol. 2013;60:529–41.View ArticleGoogle Scholar
- Wu ZG, Li XX, Lin XC, Jiang W, Tao ZM, Mantri N, et al. Genetic diversity analysis of yams (Dioscorea spp.) cultivated in China using ISSR and SRAP markers. Genet Resour Crop Evol. 2014;61:639–50.View ArticleGoogle Scholar
- Tostain S, Agbangla C, Scarcelli N, Mariac C, Daïnou O, Berthaud J, et al. Genetic diversity analysis of yam cultivars (Dioscorea rotundata Poir.) in Benin using simple sequence repeat (SSR) markers. Plant Genet Resour. 2007;5:71–81.View ArticleGoogle Scholar
- Nascimento WF, Rodrigues JF, Koehler S, Gepts P, Veasey EA. Spatially structured genetic diversity of the Amerindian yam (Dioscorea trifida L.) assessed by SSR and ISSR markers in Southern Brazil. Genet Resour Crop Evol. 2014;60:2405–20.View ArticleGoogle Scholar
- Hua WP, Zhang Y, Song J, Zhao LJ, Wang ZZ. De novo transcriptome sequencing in Salvia miltiorrhiza to identify genes involved in the biosynthesis of active ingredients. Genomics. 2011;98:272–9.View ArticleGoogle Scholar
- Yates SA, Swain MT, Hegarty MJ, Chernukin I, Matthew L, Allison GG, et al. De Novo assembly of red clover transcriptome based on RNA-Seq data provides insight into drought response, gene discovery and marker identification. BMC Genomics. 2014;15:453.View ArticlePubMed CentralPubMedGoogle Scholar
- Mortazavi A, Williams BA, McCue K, Scaeffe L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5:621–8.View ArticlePubMedGoogle Scholar
- Winkel-Shirley B. Flavonoid biosynthesis. A colorful model for genetics, biochemistry, cell biology, and biotechnology. Plant Physiol. 2011;126:485–93.View ArticleGoogle Scholar
- Yonekura-Sakakibara K, Fukushima A, Nakabayashi R, Hanada K, Matsuda F, Sugawara S, et al. Two glycosyltransferases involved in anthocyanin modification delineated by transcriptome independent component analysis in Arabidopsis thaliana. Plant J. 2012;69:154–67.View ArticlePubMed CentralPubMedGoogle Scholar
- Montefiori M, Espley RV, Stevenson D, Cooney J, Datson PM, Saiz A, et al. Identification and charactherisation of F3GT1 and F3GGT1, two glycosyltransferases responsible for anthocyanin biosynthesis in red-fleshed kiwifruit (Actinidia chinensis). Plant J. 2011;65:106–18.View ArticlePubMedGoogle Scholar
- Kovinich N, Saleem A, Arnason JT, Miki B. Combined analysis of transcriptome and metabolite data reveals extensive differences between black and brown nealy-isogenic soybean (Glycine max) seed coats enabling the identification of pigment isogenes. BMC Genomics. 2011;12:381.View ArticlePubMed CentralPubMedGoogle Scholar
- Schmidt A, Li C, Jones DA, Pichersky E. Characterization of a flavonol 3-O-methltransferase in the trichomes of the wild tomato species Solanum habrochaites. Planta. 2012;236:839–49.View ArticlePubMedGoogle Scholar
- Laitinen RAE, Ainasoja M, Broholm SK, Teeri TH, Elomaa P. Identification of target genes for a MYB-type anthocyanin regulator in Gerbera hybrid. J Exp Bot. 2008;59:3691–703.View ArticlePubMed CentralPubMedGoogle Scholar
- Li HY, Dong YY, Yang J, Liu XM, Wang YF, Yao N, et al. De Novo transcriptome of Safflower and the identification of putative genes for Oleosin and the biosynthesis of flavonoids. PLos One. 2012;7:e30987.View ArticlePubMed CentralPubMedGoogle Scholar
- Shi SG, Yang M, Zhang M, Wang P, Kang YX, Liu JJ. Genome-wide transcriptome analysis of genes involved in flavonoid biosynthesis between red and white strains of Magnolia sprengeri pamp. BMC Genomics. 2014;15:706.View ArticlePubMed CentralPubMedGoogle Scholar
- Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a references genome. Nat Biotechnol. 2011;29:644–52.View ArticlePubMed CentralPubMedGoogle Scholar
- Pertea G, Huang XQ, Liang F, Antonescu V, Sultana R, Karamycheva S, et al. TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics. 2003;19:651–2.View ArticlePubMedGoogle Scholar
- Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-seq. Bioinformatics. 2009;25:1105–11.View ArticlePubMed CentralPubMedGoogle Scholar
- Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21:3674–6.View ArticlePubMedGoogle Scholar
- Aoki-Kinoshita KF, Kanehisa M. Gene annotation and pathway mapping in KEGG. Methods Mol Biol. 2007;396:71–91.View ArticlePubMedGoogle Scholar
- Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 2007;35:D182–5.View ArticleGoogle Scholar
- Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–40.View ArticlePubMed CentralPubMedGoogle Scholar
- Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2-DDCT method. Methods. 2001;25:402–8.View ArticlePubMedGoogle Scholar
- Rozen S, Skaletsky H. Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol. 2000;132:365–86.PubMedGoogle Scholar
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.