Transcriptomic analysis of Chinese bayberry (Myrica rubra) fruit development and ripening using RNA-Seq
- Chao Feng†1,
- Ming Chen†2,
- Chang-jie Xu1,
- Lin Bai2,
- Xue-ren Yin1,
- Xian Li1,
- Andrew C Allan3,
- Ian B Ferguson1, 3 and
- Kun-song Chen1Email author
© Feng et al; licensee BioMed Central Ltd. 2012
Received: 19 September 2011
Accepted: 13 January 2012
Published: 13 January 2012
Chinese bayberry (Myrica rubra Sieb. and Zucc.) is an important subtropical fruit crop and an ideal species for fruit quality research due to the rapid and substantial changes that occur during development and ripening, including changes in fruit color and taste. However, research at the molecular level is limited by a lack of sequence data. The present study was designed to obtain transcript sequence data and examine gene expression in bayberry developing fruit based on RNA-Seq and bioinformatic analysis, to provide a foundation for understanding the molecular mechanisms controlling fruit quality changes during ripening.
RNA-Seq generated 1.92 G raw data, which was then de novo assembled into 41,239 UniGenes with a mean length of 531 bp. Approximately 80% of the UniGenes (32,805) were annotated against public protein databases, and coding sequences (CDS) of 31,665 UniGenes were determined. Over 3,600 UniGenes were differentially expressed during fruit ripening, with 826 up-regulated and 1,407 down-regulated. GO comparisons between the UniGenes of these two types and interactive pathways (Ipath) analysis found that energy-related metabolism was enhanced, and catalytic activity was increased. All genes involved in anthocyanin biosynthesis were up-regulated during the fruit ripening processes, concurrent with color change. Important changes in carbohydrate and acid metabolism in the ripening fruit are likely associated with expression of sucrose phosphate synthase (SPS) and glutamate decarboxylase (GAD).
Mass sequence data of Chinese bayberry was obtained and the expression profiles were examined during fruit ripening. The UniGenes were annotated, providing a platform for functional genomic research with this species. Using pathway mapping and expression profiles, the molecular mechanisms for changes in fruit color and taste during ripening were examined. This provides a reference for the study of complicated metabolism in non-model perennial species.
Chinese bayberry (Myrica rubra Sieb. and Zucc.) is an economically important subtropical fruit crop native to southern China and other Asian countries . The fruit is popular for its appealing color, delicious taste, and essential micro-nutrients and bioactive constituents such as antioxidants  and anti-tumor . The fruit can be eaten fresh, canned, dried, and is widely used in wine-making and juice-making. Chinese bayberry is widely planted not only as a fruit crop, but also for forestry . There is a large germplasm resource for Chinese bayberry and fruit characteristics vary widely among different cultivars [1, 4, 5]. Moreover, bayberry fruit undergoes rapid changes in pigments, sugars, organic acids and many other quality components during fruit development and ripening, which provides a very useful model for studying fruit quality, particularly properties associated with color, taste and health-associated nutrients.
Despite the economic and ecological importance of Chinese bayberry, there is no genomic resource for this non-model genus, with only 76 nucleotide sequences deposited in GenBank (as at the 30th of November 2011). Current biological studies on this plant, including photosynthesis and stress responses , growth regulation , and postharvest responses [8–10] have mostly been carried out at the physiological level. To date, the genes involved in anthocyanin biosynthesis as well as a transcription factor gene MrMYB1 have been obtained  by our group, showing the potential for functional genomic research for this crop. However, a lack of sequence data has become a limitation for extensive and intensive research on this fruit crop.
For woody plants, especially those of high heterozygosity, such as Chinese bayberry, whole genome sequencing requires long-term and expensive investment, and therefore is currently limited to only few species. Instead, it has been more useful to obtain information of UniGenes through transcriptome sequencing [12, 13]. The recent RNA-Seq based on NGS (next-generation sequencing) enables studies to be carried out on species without corresponding sequenced genome information as a reference . It has become widely applied to model as well as non-model organisms to obtain mass sequence data for molecular marker development, gene discovery and transcriptional analysis [14–21]. Compared with traditional laboratory methods, RNA-Seq is a high throughput technology, overcoming the weakness of microarrays in exploring unknown genes. Furthermore, it has great advantages in examining transcriptome fine structure, such as detection of allele-specific expression and splice junction variation .
In the present work, an RNA-Seq project for Chinese bayberry was initiated (NCBI BioProject Accession: PRJNA77861, http://www.ncbi.nlm.nih.gov/bioproject/77861). Four RNA samples containing various tissues and fruit of different development and ripening stages were sequenced using the latest Illumina deep sequencing technique. The objective of the present study is to gain an understanding of molecular mechanisms of fruit quality changes during ripening and establish a sound foundation for future molecular studies based on high throughput sequence and expression data.
Results and Discussion
Throughput and quality of RNA-Seq of Chinese bayberry cv. Biqi libraries
Total nucleotides (nt)
In previous works, take Illumina platform as an example, PE reads of 75 bp or 90 bp were de novo assembled to obtain transcriptome reference, and then 21 bp-tags or 50 bp single-end (SE) reads, from other runs of deep sequencing, were mapped back to the reference for expression annotation in different transcripts [19, 20]. In this study, however, a novel experimental design was applied. Four RNA samples, each ligated with a different adapter, were sequenced altogether in a same run. The data produced from the mixed samples were used to construct the whole transcriptome file, and meanwhile, they were used as the reference and combine with data from each separate sample to do further gene expression analysis. In addition to being less costly, longer reads were easier and more probably to be mapped into correct transcript sequences than shorter ones, especially for the higher plant species which contain more gene family members. This strategy can be generally ideal for species without a good reference.
De novo assembly
Short reads from four libraries were assembled into 199,438, 139,918, 137,722 and 174,012 contigs with a mean length of 147, 159, 156 and 148 bp, respectively. These were assembled into Scaffolds and UniGenes, taking the distance of PE reads into account. MR1 contained approximately 38,000 UniGenes, while the other libraries had around 30,000, with a mean length of UniGenes from each library ranging from 406 to 437 bp (Additional File 1). All these sequences were assembled to give 41,239 non-redundant UniGenes with a mean length of 531 bp. A N50 length of 708 bp, i.e., half of the assembled bases were incorporated into UniGenes with a length at least 708 bp, was obtained, with one third (13,115 UniGenes) having a length over 500 bp (Additional File 2A). The average gap percentage (ratio of number of 'N' to UniGene length) was 1.4%. 76.6% of the UniGenes had no gap, and less than 2% of the UniGenes had gap over 20% (Additional File 2B). The mean depth of UniGenes was 52.9, while 44.3% and 5.5% of UniGenes had the depth over 10 and 100, respectively (Additional File 2C).
There were 14,172 UniGenes mapped into 125 KEGG pathways. The maps with highest UniGene representation were Plant-pathogen interaction (Ko04626, 1,010 UniGenes, 7.13%), followed by Spliceosome (Ko03040, 897 UniGenes, 6.33%) and Ribosome (Ko03010, 424 UniGenes, 2.99%). The pathways with highest representation were purine metabolism (Ko00230, UniGenes, 2.99%), starch and sucrose metabolism (Ko00500, 389 UniGenes, 2.75%) and pyrimidine metabolism (Ko00240, 366 UniGenes, 2.58%), respectively (Additional File 4). From these pathways, information on Chinese bayberry metabolism can be obtained. As an example, biosynthesis of ascorbic acid (AsA, vitamin C), an important component of bayberry fruit, was examined. There are four biosynthetic pathways for AsA in plants, the Smirnoff-Wheeler (S-W), galactonate, glucose, and myo-inositol pathways [23–26]. In Chinese bayberry, the sequences for all 9 genes in the S-W pathway were assembled; however, this was not the case for the other three pathways (Additional File 5). Therefore, it is reasonable to conclude that S-W pathway is the main pathway for the biosynthesis of AsA in Chinese bayberry, unlike that in kiwifruit, which possesses both S-W and the myo-inositol pathways  and in strawberry, where the galactonate pathway was firstly discovered . Such annotations provide a good platform for further research into understanding metabolic pathways in this species. In contrast, the KO (KEGG Orthology) ids were used in Interactive pathways (Ipath) analysis and were helpful for the study of fruit quality related metabolism (see below).
Although only 62.4% of all annotated UniGenes were found in the SwissProt database (considerably less than in the nr database), the functional annotation provided by this database may be more reliable due to manually reviewed annotation. It was noteworthy that all the 9,509 UniGenes annotated against COG were included in SwissProt annotated UniGenes (Figure 2A). The COG database represented major phylogenetic lineages of Chinese bayberry, as shown in Additional File 6. This data would be useful in researching protein classification and evolution rates. The fact that 15.3% (2,586) of 16,916 UniGenes share homology with signal transduction (group T) or transcription factors (group K), which is higher than those in tomato , confirms that bayberry fruit development and ripening requires complex regulatory processes.
Through BLASTx against the databases mentioned above, the direction and region of CDS (coding sequences) were extracted from the sequences. 31,665 CDS were translated into peptide sequences, over 10% of which (3,714) had a length over 300 AA, with the 34 longest UniGenes having a length over 1000 AA. The detailed length distribution is shown in Additional File 2D.
Expression profiling during ripening
The proportions and comparisons between up-regulated and down-regulated UniGenes were summarized in three main functional categories (Figure 3C). Metabolism and catalytic activity were major responsive classes, where the number of up-regulated UniGenes was significantly higher than those down-regulated. This suggests that metabolic processes are enhanced and catalytic activity increases during fruit ripening.
Metabolic pathway analysis during fruit ripening
Interactive pathways (Ipath) is an open-access online tool that integrates 123 KEGG maps from 183 species . It has been used in association with RNA-Seq transcriptomics , for example, in the analysis of Solanum glandular trichomes , and global metabolic map analysis of horned beetles . In the present study, the Ipath application provided an interactive metabolic net associated with gene expression changes during fruit ripening in Chinese bayberry. A focus was then made on two important fruit quality traits: color and taste.
Genes related to color development
Candidate genes involved in bayberry fruit quality related metabolism
KO id (EC-No.)
Genes related to color development
Genes related to taste development
Sucrose phosphate synthase
The correlated expression of anthocyanin biosynthetic genes with color change during ripening is consistent with coordinated regulation by a MYB transcription factor, as reported in many other fruits . Of the 55 R2R3-MYBs present in the Chinese bayberry data-set, two UniGenes (11,792 and 33,051) were clustered together with AtMYB75 and AtMYB90, which were identified as transcription factors related to anthocyanin biosynthesis (Figure 5C). The UniGene 11792 had 100% similarity in amino acid sequence to MrMYB1, which has been reported to control bayberry fruit anthocyanin biosynthesis . However, the UniGene 33051 showed a premature translation termination codon, which would result in a peptide of only 73 amino acids, rather than around 250 amino acids for MrMYB1, and a very low expression magnitude, and is therefore unlikely to have a role in regulating anthocyanin biosynthesis. Overexpression of MYB transcription factors can lead to increased anthocyanin production in transgenic tomato, apple and strawberry [36–38], and similar phenotypic changes were observed in tobacco and Arabidopsis transformed with MrMYB1 (data not shown).
Genes related to taste quality
The pH value of bayberry fruit juice increased from 1.4 to 2.5 during fruit ripening. The content of citric acid, the main organic acid, which accounts for over 95% of total organic acids, decreased from 28.6 mg/g to 8.4 mg/g (Figure 6A). The genes encoding the TCA cycle showed no significant change in expression during fruit ripening (Figure 4). As to the GABA shunt, gene expression of all three UniGenes encoding glutamate decarboxylase (GAD) were increased, two of which (GenBank accession no.: JR053944 and JR053945) were up-regulated significantly (Table 2, Figure 6B and Additional File 10), which coincided with the q-PCR results (Figure 7). It has been reported that GAD activity participated in regulating cytosolic pH . Therefore, the up-regulation of GAD expression may be one of the key factors in citric acid degradation. Meanwhile, one UniGene encoding glutamine synthase (GS) (GenBank accession no.: JR053946) increased significantly (Table 2, Figure 6B and Additional File 10), with the expression magnitude of this gene always high (Additional File 9). Therefore, glutamate might be utilized by other pathways, causing the loss of accumulation of citric acid. A similar viewpoint was proposed by Cercos in citrus .
It is widely known that citrate can be transported, mediated by a vacuolar H+-ATPase or a Citrate/H+ symporter, between the vacuole and cytoplasm to maintain homeostasis . It was suggested that a large influx of protons is required and that this is mediated by the vacuolar H+-ATPase [45, 46], a multi-heteromeric complex of at least 11 different subunits . During fruit ripening, the expression of vacuolar H+-ATPase A subunit was increased in tomato  and grape , while a number of other subunits are still largely uncharacterized at the molecular level. In the present study, 41 UniGenes encoding vacuolar H+-ATPase subunits, with one encoding subunit A, were found. Three UniGenes, but none of them encoding subunit A, showed up-regulated expression (Table 2). Further more detailed work is needed to elucidate whether and how vacuolar H+-ATPase participates in regulation of organic acid accumulation in Chinese bayberry fruit. On the other hand, citrate may also be removed from the vacuole via a Citrate/H+ symporter. The expression of a citrus symporter (CsCit1) decreased in line with the reduction in acidity during fruit ripening . In Chinese bayberry, however, only one UniGene encoding a Citrate/H+ symporter was detected but this showed no change in gene expression during fruit ripening (Table 2).
Transcription factors play important roles in regulation of plant growth, including fruit development and ripening. Additional File 5 shows 1,508 UniGenes classified as transcription factors based on annotation by the COG database, accounting for 8.9% of the annotated UniGenes. Through BLASTx to TFs downloaded from PlantTFDB with an E value threshold of 10-20, 2,284 UniGenes encoding transcription factors from 57 families were obtained. In future work these transcription factors could be used to build a network between TFs and their target genes, based on their co-expression relationships.
Chinese bayberry has rich germplasm resources, with variable fruit characteristics in size, color, taste, texture and bioactive components [1–5, 8–11]. With the data provided by this study, critical genes involved in development and regulation of these fruit quality attributes can be characterized and the mechanisms elucidated at the molecular level, which can further contribute to understanding of fruit biology as well as exploring manipulations for fruit production and quality improvement. Currently, we are carrying out further RNA-Seq of fruit from various cultivars with different contents of organic acids. Based on this information and related physiological data, we hope to build a regulatory network of citric acid metabolism in Chinese bayberry fruit.
Based on mass sequence data of Chinese bayberry obtained by RNA-Seq, many UniGenes were identified and annotated, which provides an excellent platform for future genetic and functional genomic research. Genes related to fruit quality and their expression profiles through three fruit ripening stages were analysed further. This offered new insights into the molecular mechanisms underlying Chinese bayberry fruit characteristics. The up-regulated UniGenes encoding anthocyanin biosynthesis and MrMYB1 played roles in accumulation of anthocyanin and the development of the deep dark-red color. The up-regulated expression of SPS could be associated with the increase in sweetness, while GAD may participate in accumulation and loss of organic acids during fruit ripening. The study provides a platform for further functional genomic research on this fruit crop, and a reference for study of complicated metabolism in non-model perennial species.
All samples were collected from the same Chinese bayberry (Myrica rubra Sieb. and Zucc. cv. Biqi) tree growing in Yuyao County, Zhejiang Province, China. Figure 1 shows different tissues and fruit of three different ripening stages sampled. All samples were transported to the laboratory within 4 hours after picking and fruit were screened for uniform size and absence of mechanical damage, before freezing in liquid nitrogen and storing at -80°C.
RNA extraction, library construction and RNA-Seq
Total RNA was extracted as described previously  and treated with DNase (Takara, China). The MR1 library was constructed by mixing equal quantities of RNA from stems and leaves, flowers, buds, and young fruit. The MR2-4 libraries consisted of separate RNA samples from fruit of three different ripening stages, i.e., 75 DAF (the breaker stage fruit), 80 DAF (red ripe stage fruit), and 85 DAF (dark red ripe stage fruit). The fruit RNA used for RNA-Seq was extracted from mixed samples of 15 fruit picked from the same tree. Enrichment of mRNA, fragment interruption, addition of adapters, size selection and PCR amplification and RNA-Seq were performed by staff at Beijing Genome Institute (BGI) (Shenzhen, China). mRNA was isolated with Oligo(dT) cellulose, and then broken into short fragments. Taking these short fragments as templates, first-strand cDNA and second-strand cDNA were synthesized. Sequencing adapters were ligated to short fragments after purifying with QiaQuick PCR extraction kit, which were used to distinguish different sequencing samples. Fragments with lengths ranging from 200 to 700 bp were then separated by agarose gel electrophoresis and selected for PCR amplification as sequencing templates. Finally, the four libraries were sequenced using Illumina HiSeq™ 2000.
De novo assembly
The raw reads were first filtered by removing the adapter sequences and low quality sequences, which included the reads with N percentage (i.e., the percentage of nucleotides in read which could not be sequenced) over 5% and ones containing more than 20% nucleotides in read with Q-value ≤ 10. The Q-value represents the sequencing quality of related nucleotides. Clean reads were used in de novo assembly and read-mapping to the transcriptome. RNA-Seq data was de novo assembled using the SOAPdenovo assembly program (version 1.04)  at the parameters of "-K 29 -M 2 -L 50" by BGI. The meaning and selection principles of the parameters were available on Internet (http://soap.genomics.org.cn/soapdenovo.html). For each library, short reads were first assembled into longer but gapless contigs. Then the reads were mapped back to contigs, taking the distance of PE reads as frame, unknown sequences were replaced with 'N's, and then Scaffolds were made. The gaps of Scaffolds were filled by PE reads in order to get the sequence with least 'N's, and thus sequence of the UniGene produced. After that, UniGenes from four libraries were further spliced and assembled to obtain non-redundant UniGenes by TGICL with the minimum overlap length of 100 bp , and this was used for further analysis in this study.
Function of UniGenes was annotated by BLASTxing with E-value threshold of 10-5 to protein databases including the NCBI non-redundant (nr) database, the Swiss-Prot protein database, the Kyoto Encyclopedia of Genes and Genomes (KEGG) database , and the Clusters of Orthologous Groups of proteins (COG) database. The proteins with the highest sequence similarity were retrieved for analysis. KEGG produced annotation of metabolic pathways, while COG matched each annotated sequences to an ancient conserved domain, and represented major phylogenetic lineages of Chinese bayberry. Based on nr annotation, 25 top-hit species were identified and Gene ontology (GO) classification was obtained by WEGO  (http://wego.genomics.org.cn/cgi-bin/wego/index.pl) via GO id annotated by Blast2GO (Version 2.3.4) . Moreover, coding sequence (CDS) regions and direction were determined based on those of similar sequences from other plants. Phylogenetic analysis, based on amino acid sequences of MrMYB1 (GenBank accession no. GQ340767), R2R3-MYBs projected in this work and R2R3-MYBs in Arabidopsis were performed by MEGA (version 5.02)  using Neighbor-joining methods with 1000 bootstrap replicates.
An alignment package, SOAPaligner (Version 2.20) was used to map reads back to the transcriptome at the parameters of "-m 0 -x 1000 -s 40 -l 35 -v 3 -r 2". The meaning and selection principles of the parameters were available on Internet (http://soap.genomics.org.cn/soapaligner.html). Then the number of mapped clean reads for each UniGene was counted and then normalized into RPKM value (Reads Per kb per Million reads), which was widely used to calculate the UniGene expression . P value was used to identify genes expressed differentially between each samples following the formula below via our PERL program (Additional File 13). N1 represented the total clean reads in Sample 1, while the total clean reads in Sample 2 were noted as N2, and gene A contained × and y clean reads mapped to Sample 1 and 2, respectively .
FDR (false discovery rate) was applied to identify the threshold of the P Value in multiple tests and analyses , and this value was calculated via SAS (version 9.1.3). In our work, the differentially expressed UniGenes between each of two samples were screened with the threshold of FDR < 0.001 and the absolute value of log2Ratio ≥ 1 . Overall expression patterns of UniGenes, excluding those with no significant expression changes as determined by FDR analysis or with changes over 32 fold, were visualized by 3D using MATLAB (version 7.0) after clustering via cluster 3.0. Detailed expression profiles I (up-regulation), II (irregularly regulated), III (irregularly regulated), IV (down-regulation) were distinguished for the differentially expressed UniGenes via value of log2 (RPKM 80 DAF /RPKM 75 DAF) and log2 (RPKM 85 DAF /RPKM 80 DAF), where I = (the former one ≥ 0 and the latter one ≥ 0), II = (the former one ≥ 0 and the latter one ≤ 0), III = (the former one ≤ 0 and the latter one ≥ 0), IV = (the former one ≤ 0 and the latter one ≤ 0). Furthermore, the GO classifications were compared between up-regulation and down-regulation UniGenes using WEGO , with * and ** indicating significant difference at 5% and 1%, respectively.
Based on RPKM value of differentially expressed UniGenes, clustering analysis was performed via MultiExperiment Viewer (MeV) (version 4.6.2)  using the algorithm of "K Means Clustering". Co-expressed gene networks were built using the following steps: Firstly, UniGenes with RPKM value equal to 0 were filtered, then PCC (Pearson's correlation coefficients) value of a pair of UniGene expression patterns, considering sample redundancy, were calculated following the formula of the online help page (http://atted.jp/help/coex_cal.shtml) and further transformed into Mutual Rank (MR) with the method described (http://atted.jp/help/mr.shtml) . After that, co-expressed gene relationships were generated from the 12 most strongly correlated genes for each gene. Finally, the branch of the network related with UniGenes in the anthocyanin biosynthesis pathway was examined using Cytoscape (version 2.8.2) by drawing a line between co-expressed genes.
Metabolic pathways analysis
Interactive Pathways (ipath) analysis was carried out via interactive pathways explorer (version 2.0) (http://pathways.embl.de/). Through KO (KEGG Orthology) id, the expression of a specific gene family was summed from all family members encoding the gene. To understand the dynamic changes and absolute expression magnitude during fruit ripening, ten different colors were applied to indicate different RPKM values of UniGenes. For obtaining accurate relative expression magnitude, another ipath figure was generated, where the standard of change was FDR < 0.001 and the absolute value of log2Ratio ≥ 1 as mentioned above . Metabolic pathways related with fruit color, sugar and organic acids were produced manually. The sequences of related UniGenes (described in Figure 5B and Figure 6B) were deposited in the Transcriptome Shotgun Assembly Sequence Database (TSA) at NCBI with accession number from JR053922 to JR053947. Expression and detailed functional annotation are shown in Additional File 10.
RNA extraction, first strand cDNA synthesis and q-PCR analysis
Total RNA used for q-PCR analysis was extracted from fruit of three ripening stages (75 DAF, 80 DAF and 85 DAF), using three biological replicates of 15 fruit. After RNA extraction and DNase treatment described above, the first-strand cDNA was synthesized from 1.0 μg DNA-free RNA using Revert Aid™ First Strand cDNA Synthesis Kit (Fermentas, USA) according to the manufacturer's protocol. The cDNA was diluted tenfold and used as the template for q-PCR.
Primers for q-PCR analysis
GenBank accession no.
Forward primer (5' to 3')
Reverse primer (5' to 3')
MrSPS 2 (9305)
MrGAD 1 (113)
Color, TSS (total soluble solids) and pH Measurement
Fruit surface color at different ripening stages was measured using a Hunter Lab Mini Scan XE Plus colorimeter (Hunter Associates Laboratory, Inc., USA). The CIE L*a*b* color scale was adopted, and the raw data as L*, a*, b* were obtained. The CIRG, a comprehensive indicator of the color index of red grapes, was calculated according to CIRG = (180-H)/(L*+C), while C = (a*2 +b*2)0.5 and H = arctan(b*/a*) [8, 62]. Four random measurements were made for each fruit and a mean value was obtained from the measurements of ten fruit.
After color measurement, the fruit was used to measure the TSS and pH with refractometer PR101-α (Atago, Japan) and pHTestr 30 (Eutech, USA) following the manufacturers' protocols. Two measurements were made for each fruit and a mean was obtained from the measurements of ten fruit.
The extraction of sugars and organic acids was performed according to previous methods . Briefly, three grams of the sample was ground into fine powder in liquid N2, and then homogenized in 6.0 ml of ethanol (80%) solution, shaken for 10 min at 35°C and centrifuged at 10,000 rpm for 10 min. The supernatant was collected and the precipitate homogenized again with ethanol (80%). This procedure was repeated, the supernatants were made up to a constant volume of 25 ml. The extract (1 ml) was dried into powder by a Termovap Sample Concentrator (Eyela, Japan), and then the volume was made up to 500 μl with distilled water, which was used for sugar and organic acid analysis.
Soluble sugars were analysed by HPLC (Beckman, USA) following Koch's method . Acetonitrile: water (80: 20) was used as the mobile phase at a flow rate of 1.0 ml/min. A NH2 (4.6 mm × 250 mm) column (GL Sciences Inc., Japan) and a refractive index detector RI-1530 (Jasco, Japan) were used.
Organic acids were analysed by HPLC (Beckman, USA) following the method of Shiomi . 50 mM (NH4)2HPO4 (pH = 2.7) was used as the mobile phase at flow rate of 0.5 ml/min, using a Beckman ODS C18 (4.6 mm × 250 mm) column (Beckman, USA) and a 166 UV-vis detector (Beckman, USA).
List of abbreviations
Ascorbic acid, vitamin C
Beijing Genome Institute
Basic local alignment search tool
Color index of red grapes
The clusters of orthologous groups of proteins database
Days after flowering
Expressed sequence tag
False discovery rate
The Kyoto encyclopedia of genes and genomes database
KEGG Orthology ids
National center for biotechnology information
NCBI non-redundant database
Practical extraction and report language
Real-time quantitative PCR
Reads per kb per million reads
Short oligonucleotide analysis package
Sucrose phosphate synthase
Tricarboxylic acid cycle
Total soluble solids
UDP-glucose: flavonoid 3-O-glucosyltransfersae
Acknowledgements and Funding
We would like to thank Prof. Don Grierson for his critical review and advice on data analysis, Prof. Long-jiang Fan for his valuable suggestions on RNA-Seq. This work was supported by the National Basic Research Program of China (2011CB100602), the Special Fund for Agro-scientific Research in the Public Interest (201203089), and the Science and Technology Project of Zhejiang Province (2009C14023).
- Chen K, Xu C, Zhang B, Ferguson IB: Red bayberry: botany and horticulture. Hortic Rev. 2004, 30: 83-114.Google Scholar
- Zhang B, Kang M, Xie Q, Xu B, Sun C, Chen K, Wu Y: Anthocyanins from Chinese bayberry extract protect β cells from oxidative stress-mediated injury via HO-1 up-regulation. J Agric Food Chem. 2010, 59 (2): 537-545.View ArticlePubMedGoogle Scholar
- Sun C, Zheng Y, Chen Q, Tang X, Jiang M, Zhang J, Li X, Chen K: Purification and anti-tumour activity of cyaniding-3-O-glucoside from Chinese bayberry fruit. Food Chem. 2012, 131: 1287-1294. 10.1016/j.foodchem.2011.09.121.View ArticleGoogle Scholar
- Zhang SM, Gao ZS, Xu CJ, Chen KS: Genetic diversity of Chinese bayberry (Myrica rubra Sieb. et Zucc.) accessions revealed by amplified fragment length polymorphism. Hortscience. 2009, 44 (2): 487-491.Google Scholar
- Zhang SM, Xu CJ, Gao ZS, Chen KS, Wang GY: Development and characterization of microsatellite markers for Chinese bayberry (Myrica rubra Sieb. & Zucc.). Conserv Genet. 2009, 10 (5): 1605-1607. 10.1007/s10592-008-9804-x.View ArticleGoogle Scholar
- Guo YP, Guo DP, Zhou HF, Hu MJ, Shen YG: Photoinhibition and xanthophyll cycle activity in bayberry (Myrica rubra) leaves induced by high irradiance. Photosynthetica. 2006, 44 (3): 439-446. 10.1007/s11099-006-0048-z.View ArticleGoogle Scholar
- Meng CF, Cao ZH, Jiang PK, Zhou GM, Lin XG, Xu QF: Effects of application of boron on growth, yields, and quality of red bayberry. J Plant Nutr. 2007, 30 (7): 1047-1058. 10.1080/01904160701394444.View ArticleGoogle Scholar
- Zhang WS, Chen KS, Zhang B, Sun CD, Cai C, Zhou CH, Xu WP, Zhang WQ, Ferguson IB: Postharvest responses of Chinese bayberry fruit. Postharvest Biol Technol. 2005, 37 (3): 241-251. 10.1016/j.postharvbio.2005.05.005.View ArticleGoogle Scholar
- Zhang WS, Li X, Wang XX, Wang GY, Zheng HT, Abeysinghe DC, Ferguson IB, Chen KS: Ethanol vapour treatment alleviates postharvest decay and maintains fruit quality in Chinese bayberry. Postharvest Biol Tec. 2007, 46 (2): 195-198. 10.1016/j.postharvbio.2007.05.001.View ArticleGoogle Scholar
- Zhang WS, Li X, Zheng JT, Wang GY, Sun CD, Ferguson IB, Chen KS: Bioactive components and antioxidant capacity of Chinese bayberry (Myrica rubra Sieb. and Zucc.) fruit in relation to fruit maturity and postharvest storage. Eur Food Res Technol. 2008, 227 (4): 1091-1097. 10.1007/s00217-008-0824-z.View ArticleGoogle Scholar
- Niu SS, Xu CJ, Zhang WS, Zhang B, Li X, Lin-Wang K, Ferguson IB, Allan AC, Chen KS: Coordinated regulation of anthocyanin biosynthesis in Chinese bayberry (Myrica rubra) fruit by a R2R3 MYB transcription factor. Planta. 2010, 231 (4): 887-899. 10.1007/s00425-009-1095-z.View ArticlePubMedGoogle Scholar
- Newcomb RD, Crowhurst RN, Gleave AP, Rikkerink EHA, Allan AC, Beuning LL, Bowen JH, Gera E, Jamieson KR, Janssen BJ, Laing WA, McArtney S, Nain B, Ross GS, Snowden KC, Souleyre EJF, Walton EF, Yauk YK: Analyses of expressed sequence tags from apple. Plant Physiol. 2006, 141 (1): 147-166. 10.1104/pp.105.076208.PubMed CentralView ArticlePubMedGoogle Scholar
- Crowhurst RN, Gleave AP, MacRae EA, Ampomah-Dwamena C, Atkinson RG, Beuning LL, Bulley SM, Chagne D, Marsh KB, Matich AJ, Montefiori M, Newcomb RD, Schaffer RJ, Usadel B, Allan AC, Boldingh HL, Bowen JH, Davy MW, Eckloff R, Ferguson AR, Fraser LG, Gera E, Hellens RP, Janssen BJ, Klages K, Lo KR, MacDiarmid RM, Nain B, McNeilage MA, Rassam M, Richardson AC, Rikkerink EHA, Ross GS, Schroder RA, Snowden KC, Souleyre EJF, Templeton MD, Walton EF, Wang D, Wang MY, Wang YY, Wood M, Wu R, Yauk YK, Laing WA: Analysis of expressed sequence tags from Actinidia: applications of a cross species EST database for gene discovery in the areas of flavor, health, color and ripening. BMC Genomics. 2008, 9 (1): 351-10.1186/1471-2164-9-351.PubMed CentralView ArticlePubMedGoogle Scholar
- Brautigam A, Mullick T, Schliesky S, Weber APM: Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C(3) and C(4) species. J Exp Bot. 2011, 62 (9): 3093-3102. 10.1093/jxb/err029.View ArticlePubMedGoogle Scholar
- Parchman TL, Geist KS, Grahnen JA, Benkman CW, Buerkle CA: Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics. 2010, 11 (1): 180-10.1186/1471-2164-11-180.PubMed CentralView ArticlePubMedGoogle Scholar
- Shi CY, Yang H, Wei CL, Yu O, Zhang ZZ, Jiang CJ, Sun J, Li YY, Chen Q, Xia T, Wan XC: Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds. BMC Genomics. 2011, 12 (1): 131-10.1186/1471-2164-12-131.PubMed CentralView ArticlePubMedGoogle Scholar
- Hahn DA, Ragland GJ, Shoemaker DD, Denlinger DL: Gene discovery using massively parallel pyrosequencing to develop ESTs for the flesh fly Sarcophaga crassipalpis. BMC Genomics. 2009, 10 (1): 234-10.1186/1471-2164-10-234.PubMed CentralView ArticlePubMedGoogle Scholar
- Dubey A, Farmer A, Schlueter J, Cannon SB, Abernathy B, Tuteja R, Woodward J, Shah T, Mulasmanovic B, Kudapa H, Raju N, Gothalwal R, Pande S, Xiao Y, Town CD, Singh NK, May GD, Jackson S, Varshney RK: Defining the transcriptome assembly and its use for genome dynamics and transcriptome profiling studies in Pigeonpea (Cajanus cajan L.). DNA Res. 2011, 18 (3): 153-164. 10.1093/dnares/dsr007.PubMed CentralView ArticlePubMedGoogle Scholar
- Wang XW, Luan JB, Li JM, Bao YY, Zhang CX, Liu SS: De novo characterization of a whitefly transcriptome and analysis of its gene expression during development. BMC Genomics. 2010, 11: 400-10.1186/1471-2164-11-400.PubMed CentralView ArticlePubMedGoogle Scholar
- Xiang LX, He D, Dong WR, Zhang YW, Shao JZ: Deep sequencing-based transcriptome profiling analysis of bacteria-challenged Lateolabrax japonicus reveals insight into the immune-relevant genes in marine fish. BMC Genomics. 2010, 11: 472-10.1186/1471-2164-11-472.PubMed CentralView ArticlePubMedGoogle Scholar
- Wu J, Zhang Y, Zhang H, Huang H, Folta K, Lu J: Whole genome wide expression profiles of Vitis amurensis grape responding to downy mildew by using Solexa sequencing technology. BMC Plant Biol. 2010, 10 (1): 234-10.1186/1471-2229-10-234.PubMed CentralView ArticlePubMedGoogle Scholar
- Malone JH, Oliver B: Microarrays, deep sequencing and the true measure of the transcriptome. BMC Biol. 2011, 9 (1): 34-10.1186/1741-7007-9-34.PubMed CentralView ArticlePubMedGoogle Scholar
- Wheeler GL, Jones MA, Smirnoff N: The biosynthetic pathway of vitamin C in higher plants. Nature. 1998, 393 (6683): 365-369. 10.1038/30728.View ArticlePubMedGoogle Scholar
- Agius F, Gonzalez-Lamothe R, Caballero JL, Munoz-Blanco J, Botella MA, Valpuesta V: Engineering increased vitamin C levels in plants by overexpression of a D-galacturonic acid reductase. Nat Biotechnol. 2003, 21 (2): 177-181. 10.1038/nbt777.View ArticlePubMedGoogle Scholar
- Wagner C, Sefkow M, Kopka J: Construction and application of a mass spectral and retention time index database generated from plant GC/EI-TOF-MS metabolite profiles. Phytochemistry. 2003, 62 (6): 887-900. 10.1016/S0031-9422(02)00703-3.View ArticlePubMedGoogle Scholar
- Lorence A, Chevone BI, Mendes P, Nessler CL: myo-inositol oxygenase offers a possible entry point into plant ascorbate biosynthesis. Plant Physiol. 2004, 134 (3): 1200-1205. 10.1104/pp.103.033936.PubMed CentralView ArticlePubMedGoogle Scholar
- Alba R, Payton P, Fei ZJ, McQuinn R, Debbie P, Martin GB, Tanksley SD, Giovannoni JJ: Transcriptome and selected metabolite analyses reveal multiple points of ethylene control during tomato fruit development. Plant Cell. 2005, 17 (11): 2954-2965. 10.1105/tpc.105.036053.PubMed CentralView ArticlePubMedGoogle Scholar
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008, 5 (7): 621-628. 10.1038/nmeth.1226.View ArticlePubMedGoogle Scholar
- Letunic I, Yamada T, Kanehisa M, Bork P: iPath: interactive exploration of biochemical pathways and networks. Trends Biochem Sci. 2008, 33 (3): 101-103. 10.1016/j.tibs.2008.01.001.View ArticlePubMedGoogle Scholar
- Cantacessi C, Jex AR, Hall RS, Young ND, Campbell BE, Joachim A, Nolan MJ, Abubucker S, Sternberg PW, Ranganathan S, Mitreva M, Gasser RB: A practical, bioinformatic workflow system for large data sets generated by next-generation sequencing. Nucleic Acids Res. 2010, 38 (17): e171-10.1093/nar/gkq667.PubMed CentralView ArticlePubMedGoogle Scholar
- McDowell ET, Kapteyn J, Schmidt A, Li C, Kang JH, Descour A, Shi F, Larson M, Schilmiller A, An LL, Jones AD, Pichersky E, Soderlund CA, Gang DR: Comparative functional genomic analysis of Solanum glandular trichome types. Plant Physiol. 2011, 155 (1): 524-539. 10.1104/pp.110.167114.PubMed CentralView ArticlePubMedGoogle Scholar
- Choi JH, Kijimoto T, Snell-Rood E, Tae H, Yang Y, Moczek AP, Andrews J: Gene discovery in the horned beetle Onthophagus taurus. BMC Genomics. 2010, 11 (1): 703-10.1186/1471-2164-11-703.PubMed CentralView ArticlePubMedGoogle Scholar
- Castellarin S, Di Gaspero G: Transcriptional control of anthocyanin biosynthetic genes in extreme phenotypes for berry pigmentation of naturally occurring grapevines. BMC Plant Biol. 2007, 7 (1): 46-10.1186/1471-2229-7-46.PubMed CentralView ArticlePubMedGoogle Scholar
- Espley RV, Hellens RP, Putterill J, Stevenson DE, Kutty-Amma S, Allan AC: Red colouration in apple fruit is due to the activity of the MYB transcription factor, MdMYB10. Plant J. 2007, 49 (3): 414-427. 10.1111/j.1365-313X.2006.02964.x.PubMed CentralView ArticlePubMedGoogle Scholar
- Wei YZ, Hu FC, Hu GB, Li XJ, Huang XM, Wang HC: Differential expression of anthocyanin biosynthetic genes in relation to anthocyanin accumulation in the pericarp of Litchi Chinensis Sonn. PLoS One. 2011, 6 (4): e19455-10.1371/journal.pone.0019455.PubMed CentralView ArticlePubMedGoogle Scholar
- Lin-Wang K, Bolitho K, Grafton K, Kortstee A, Karunairetnam S, McGhie TK, Espley RV, Hellens RP, Allan AC: An R2R3 MYB transcription factor associated with regulation of the anthocyanin biosynthetic pathway in Rosaceae. BMC Plant Biol. 2010, 10 (1): 50-10.1186/1471-2229-10-50.PubMed CentralView ArticlePubMedGoogle Scholar
- Butelli E, Titta L, Giorgio M, Mock HP, Matros A, Peterek S, Schijlen E, Hall RD, Bovy AG, Luo J, Martinet C: Enrichment of tomato fruit with health-promoting anthocyanins by expression of select transcription factors. Nat Biotechnol. 2008, 26 (11): 1301-1308. 10.1038/nbt.1506.View ArticlePubMedGoogle Scholar
- Espley RV, Brendolise C, Chagne D, Kutty-Amma S, Green S, Volz R, Putterill J, Schouten HJ, Gardiner SE, Hellens RP, Allan AC: Multiple repeats of a promoter segment causes transcription factor autoregulation in red apples. Plant Cell. 2009, 21 (1): 168-183. 10.1105/tpc.108.059329.PubMed CentralView ArticlePubMedGoogle Scholar
- Komatsu A, Takanokura Y, Moriguchi T, Omura M, Akihama T: Differential expression of three sucrose-phosphate synthase isoforms during sucrose accumulation in citrus fruits (Citrus unshiu Marc.). Plant Sci. 1999, 140 (2): 169-178. 10.1016/S0168-9452(98)00217-9.View ArticleGoogle Scholar
- Verma AK, Upadhyay SK, Verma PC, Solomon S, Singh SB: Functional analysis of sucrose phosphate synthase (SPS) and sucrose synthase (SS) in sugarcane (Saccharum) cultivars. Plant Biol. 2011, 13 (2): 325-332. 10.1111/j.1438-8677.2010.00379.x.View ArticlePubMedGoogle Scholar
- Choudhury SR, Roy S, Sengupta DN: A comparative study of cultivar differences in sucrose phosphate synthase gene expression and sucrose formation during banana fruit ripening. Postharvest Biol Tec. 2009, 54 (1): 15-24. 10.1016/j.postharvbio.2009.05.003.View ArticleGoogle Scholar
- Bouché N, Fromm H: GABA in plants: just a metabolite?. Trends Plant Sci. 2004, 9 (3): 110-115. 10.1016/j.tplants.2004.01.006.View ArticlePubMedGoogle Scholar
- Cercós M, Soler G, Iglesias D, Gadea J, Forment J, Talón M: Global analysis of gene expression during development and ripening of citrus fruit flesh. A proposed mechanism for citric acid utilization. Plant Mol Biol. 2006, 62 (4): 513-527. 10.1007/s11103-006-9037-7.View ArticlePubMedGoogle Scholar
- Terol J, Soler G, Talon M, Cercos M: The aconitate hydratase family from Citrus. BMC Plant Biol. 2010, 10: 222-10.1186/1471-2229-10-222.PubMed CentralView ArticlePubMedGoogle Scholar
- Canel C, Bailey-Serres JN, Roose ML: In Vitro [14C] Citrate uptake by tonoplast vesicles of acidless Citrus juice cells. J Am Soc Hortic Sci. 1995, 120 (3): 510-514.Google Scholar
- Müller ML, Irkens-Kiesecker U, Kramer D, Taiz L: Purification and reconstitution of the vacuolar H+-ATPases from lemon fruits and epicotyls. J Biol Chem. 1997, 272 (19): 12762-12770. 10.1074/jbc.272.19.12762.View ArticlePubMedGoogle Scholar
- Dietz KJ, Tavakoli N, Kluge C, Mimura T, Sharma SS, Harris GC, Chardonnens AN, Golldack D: Significance of the V-type ATPase for the adaptation to stressful growth conditions and its regulation on the molecular and biochemical level. J Exp Bot. 2001, 52 (363): 1969-1980. 10.1093/jexbot/52.363.1969.View ArticlePubMedGoogle Scholar
- Amemiya T, Kanayama Y, Yamaki S, Yamada K, Shiratake K: Fruit-specific V-ATPase suppression in antisense-transgenic tomato reduces fruit growth and seed formation. Planta. 2006, 223 (6): 1272-1280. 10.1007/s00425-005-0176-x.View ArticlePubMedGoogle Scholar
- Terrier N, Sauvage F-X, Ageorges A, Romieu C: Changes in acidity and in proton transport at the tonoplast of grape berries during development. Planta. 2001, 213 (1): 20-28. 10.1007/s004250000472.View ArticlePubMedGoogle Scholar
- Shimada T, Nakano R, Shulaev V, Sadka A, Blumwald E: Vacuolar citrate/H+ symporter of citrus juice cells. Planta. 2006, 224 (2): 472-480. 10.1007/s00425-006-0223-2.View ArticlePubMedGoogle Scholar
- Shan LL, Li X, Wang P, Cai C, Zhang B, Sun CD, Zhang WS, Xu CJ, Ferguson I, Chen KS: Characterization of cDNAs associated with lignification and their expression profiles in loquat fruit with different lignin accumulation. Planta. 2008, 227 (6): 1243-1254. 10.1007/s00425-008-0696-2.View ArticlePubMedGoogle Scholar
- Li R, Zhu H, Ruan J, Qian W, Fang X, Shi Z, Li Y, Li S, Shan G, Kristiansen K, Li S, Yang H, Wang J, Wang J: De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 2010, 20 (2): 265-272. 10.1101/gr.097261.109.PubMed CentralView ArticlePubMedGoogle Scholar
- Pertea G, Huang XQ, Liang F, Antonescu V, Sultana R, Karamycheva S, Lee Y, White J, Cheung F, Parvizi B, Tsai J, Quackenbush J: TIGR gene indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics. 2003, 19 (5): 651-652. 10.1093/bioinformatics/btg034.View ArticlePubMedGoogle Scholar
- Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M: The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004, 32: D277-D280. 10.1093/nar/gkh063.PubMed CentralView ArticlePubMedGoogle Scholar
- Ye J, Fang L, Zheng HK, Zhang Y, Chen J, Zhang Z, Wang J, Li S, Li R, Bolund L: WEGO: a web tool for plotting GO annotations. Nucleic Acids Res. 2006, 34: W293-W297. 10.1093/nar/gkl031.PubMed CentralView ArticlePubMedGoogle Scholar
- Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M, Robles M: Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005, 21 (18): 3674-3676. 10.1093/bioinformatics/bti610.View ArticlePubMedGoogle Scholar
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28 (10): 2731-2739. 10.1093/molbev/msr121.PubMed CentralView ArticlePubMedGoogle Scholar
- Audic S, Claverie JM: The significance of digital gene expression profiles. Genome Res. 1997, 7 (10): 986-995.PubMedGoogle Scholar
- Benjamini Y, Drai D, Elmer G, Kafkafi N, Golani I: Controlling the false discovery rate in behavior genetics research. Behav Brain Res. 2001, 125 (1-2): 279-284. 10.1016/S0166-4328(01)00297-2.View ArticlePubMedGoogle Scholar
- Saeed AI, Sharov V, White J, Li J, Liang W, Bhagabati N, Braisted J, Klapa M, Currier T, Thiagarajan M, Sturn A, Snuffin M, Rezantsev A, Popov D, Ryltsov A, Kostukovich E, Borisovsky I, Liu Z, Vinsavich A, Trush V, Quackenbush J: TM4: a free, open-source system for microarray data management and analysis. Biotechniques. 2003, 34 (2): 374-378.PubMedGoogle Scholar
- Obayashi T, Kinoshita K, Nakai K, Shibaoka M, Hayashi S, Saeki M, Shibata D, Saito K, Ohta H: ATTED-II: a database of co-expressed genes and cis elements for identifying co-regulated gene groups in Arabidopsis. Nucleic Acids Res. 2007, 35 (suppl 1): D863-D869.PubMed CentralView ArticlePubMedGoogle Scholar
- Carreño J, Martínez A, Almela L, Fernández-López JA: Proposal of an index for the objective evaluation of the color of red table grapes. Food Res Int. 1995, 28 (4): 373-377. 10.1016/0963-9969(95)00008-A.View ArticleGoogle Scholar
- Koch KE, Avigne WT: Postphloem, nonvascular transfer in citrus: kinetics, metabolism, and sugar gradients. Plant Physiol. 1990, 93 (4): 1405-1416. 10.1104/pp.93.4.1405.PubMed CentralView ArticlePubMedGoogle Scholar
- Shiomi S, Kubo Y, Wamocho LS, Koaze H, Nakamura R, Inaba A: Postharvest ripening and ethylene biosynthesis in purple passion fruit. Postharvest Biol Technol. 1996, 8 (3): 199-207. 10.1016/0925-5214(95)00073-9.View ArticleGoogle Scholar