- Research article
- Open Access
De novo transcriptome analysis using 454 pyrosequencing of the Himalayan Mayapple, Podophyllum hexandrum
BMC Genomics volume 14, Article number: 748 (2013)
The Himalayan or Indian Mayapple (Podophyllum hexandrum Royle) produces podophyllotoxin, which is used in the production of semisynthetic anticancer drugs. High throughput transcriptome sequences or genomic sequence data from the Indian Mayapple are essential for further understanding of the podophyllotoxin biosynthetic pathway.
454 pyrosequencing of a P. hexandrum cell culture normalized cDNA library generated 2,667,207 raw reads and 1,503,232 high quality reads, with an average read length of 138 bp. The denovo assembly was performed by Newbler using default and optimized parameters. The optimized parameter generated 40, 380 assembled sequences, comprising 12,940 contigs and 27,440 singlets which resulted in better assembly as compared to default parameters. BLASTX analysis resulted in the annotation of 40,380 contigs/singlet using a cut-off value of ≤1E-03. High similarity to Medicago truncatula using optimized parameters and to Populus trichocarpa using default parameters was noted. The Kyoto encyclopedia of genes and genomes (KEGG) analysis using KEGG Automatic Annotation Server (KAAS) combined with domain analysis of the assembled transcripts revealed putative members of secondary metabolism pathways that may be involved in podophyllotoxin biosynthesis. A proposed schematic pathway for phenylpropanoids and podophyllotoxin biosynthesis was generated. Expression profiling was carried out based on fragments per kilobase of exon per million fragments (FPKM). 1036 simple sequence repeats were predicted in the P. hexandrum sequences. Sixty-nine transcripts were mapped to 99 mature and precursor microRNAs from the plant microRNA database. Around 961 transcripts containing transcription factor domains were noted. High performance liquid chromatography analysis showed the peak accumulation of podophyllotoxin in 12-day cell suspension cultures. A comparative qRT-PCR analysis of phenylpropanoid pathway genes identified in the present data was performed to analyze their expression patterns in 12-day cell culture, callus and rhizome.
The present data will help the identification of the potential genes and transcription factors involved in podophyllotoxin biosynthesis in P. hexandrum. The assembled transcripts could serve as potential candidates for marker discovery and conservation, which should form the foundations for future endeavors.
Podophyllum hexandrum Royle, commonly referred to as the Himalayan/Indian Mayapple, is an endangered perennial herb belonging to the family Berberidaceae, which is distributed on the lower slopes of the Himalayas in scrub and forest, from Afghanistan to central China. Roots and rhizomes of P. hexandrum contain lignans, such as podophyllotoxin and other related aryltetralin lignans, which are present in Podophyllum spp. but are not restricted to this genus. Extracts of Podophyllum spp. have been used by diverse cultures as antidotes against poisons and as cathartic, purgative, anthelmintic, vesicant and suicidal agents. A crude resin extract of Podophyllum spp., podophyllin, was included in the US Pharmacopoeia in 1820, and this resin has been prescribed to treat venereal warts. Podophyllotoxin has been used as the starting compound for the production of the semi-synthetic drugs etoposide (VP-16-213), teniposide (VM-26) and etopophos, which are used to treat lung and testicular cancers, leukaemia and rheumatoid arthritis. A recent review stated that species containing podophyllotoxin were traditionally used as folk-remedies in China, Japan, India and the United States to treat several illnesses, including gout, tuberculosis, syphilis, warts and various tumors. The Indian species, P. Hexandrum, contains three times more Podophyllotoxin than its American counterpart, P. peltatum, which contains other lignans, such as α- and β-peltatins[7, 8]. However, peltatins do not contribute to the anti-cancer properties of the plant.
To meet the commercial demand, podophyllotoxin has been extracted from the rhizomes of P. hexandrum and P. peltatum collected in the wild. The chemical synthesis of podophyllotoxin is possible, but not economically feasible. Therefore, large quantities of rhizomes have been collected indiscriminately to meet the ever-increasing demands of modern medicine. Severe habitat destruction and over-collection has created acute depletion in the population of this herb. Together with a lack of organized cultivation, this has led to P. hexandrum being classified as a critically endangered species in the Himalayan region[11, 12].
In addition to this genus, other plants, including Linum album, Juniperus chinensis and Callitris drummondii, have been investigated for the in vitro production of podophyllotoxin and its derivatives. However, the production of podophyllotoxin using cell cultures may not be sufficient for biotechnological production systems.
The complete sequences of dirigent protein oxidase (DPO), secoisolariciresinol dehydrogenase (SDG) and cinnamyl alcohol dehydrogenase (CAD) from P. hexandrum have been deposited in the National Centre for Biotechnology Information (NCBI). Lignan biosynthesis involves mechanisms for enantioselective dimerization. DPO affects the selective coupling of the coniferyl alcohol radical to produce (+)-pinoresinol and pinoresinol reductase converts pinoresinol to secoisolariciresinol via lariciresinol. Then, (-)-secoisolariciresinol is converted to (-)-matairesinol by SDG. Matairesinol is a starting point for the biosynthesis of podophyllotoxin. One possible pathway is that matairesinol is converted to yatein and then to podophyllotoxin via deoxypodophyllotoxin. Another direct pathway to podophyllotoxin from matairesinol via thujaplicatin has been proposed. Although the podophyllotoxin biosynthetic pathway is reasonably well characterized and several cDNAs have been reported, the transformation from matairesinol to podophyllotoxin involves hydroxylation, methylations and methylenedioxy bridge formation, and these late steps are yet to be characterized. A recent report revealed two cytochrome P450 enzymes in P. hexandrum and P. peltatum that are capable of converting (-)-matairesinol into (-)-pluviatolide by catalyzing the formation of a methylenedioxy bridge.
De novo transcriptome analysis of next-generation sequencing data is an appropriate technique for identifying unknown genes in non-model organisms. Expressed sequence tag (EST) sequencing, which excludes non-coding and repetitive DNA components, is a cost-effective and frequently used strategy to analyze the transcriptome. Here, we sequenced the transcriptome of P. hexandrum cell culture using the 454 GS-FLX Titanium technology, assembled the raw reads using three assemblers, and chose an assembler with the best performance. Finally, functional annotation, FPKM value, domain analysis, transcription factors (TFs) and simple sequence repeat (SSR) identification, and miRNA targeted transcript identification, were determined. Domains from the identified transcripts that could represent downstream genes encoding enzymes that catalyze the late steps in podophyllotoxin biosynthesis were also identified. The data from this study will form the basis for future studies towards the isolation and characterization of the podophyllotoxin biosynthetic pathway genes from P. Hexandrum.
Results and discussion
454 sequencing of the Mayapple cell culture transcriptome
Clonally amplified cDNA library beads obtained from emulsion-based clonal amplification (emPCR amplification) reactions were subjected to two experimental runs on a Pico Titre Plate (PTP) for sequencing using Roche 454 GS FLX pyrosequencing chemistry. A total of 2,667,207 raw reads (Table 1) were obtained, and the low quality reads, adapters and primer sequences were removed using PRINSEQ. After quality filtration and adapter trimming of raw reads, 1,503,232 high quality (HQ) reads with an average read length of 138 bp was obtained. The high quality reads were uniqued and mapped to Rfam, non coding RNA database. Approximately, 50% filtered reads were obtained and used for further analysis.
Comparison between default and optimized parameters of Newbler
Here we present a simple workflow for 454 sequencing, assembly, annotation and other analyses (Figure 1). Newbler is frequently used in de novo pyrosequencing projects. The comparative denovo assembly was carried out using Newbler with default and optimized parameters. The optimized parameter generated 40,380 assembled sequences, comprising 12,940 contigs and 27,440 singlets with an N50 of 463 and 240 for contigs and singlets respectively. Newbler with optimized parameters gave the best results in terms of the numbers of assembled contigs and singlet, N50, mean contig/singlets length and total bases of contigs /singlets (bp) (Table 2). Additional file1 (A and B) shows the distribution of contig lengths generated by Newbler using default and optimized parameters respectively. Further analysis of the singlet generated by Newbler default assembly were excluded because their mean length was below 200 bp.
Functional annotation of assembled transcripts and determination of FPKM values
The annotation of transcripts was carried out using green plants of non-redundant (nr) protein database at NCBI by BLASTX. BLASTX resulted in the annotation of 3,249 contigs out of 3,372 assembled contigs obtained using Newbler default parameters (Additional file2) whereas 40,380 transcripts from among 12,940 contigs and 27,440 singlet generated using Newbler optimized parameters (Additional file3). Using default parameters, transcripts showed significant similarity with P.trichocarpa followed by Oryza sativa, Ricinus communis and so forth, (Figure 2A) while, using optimized parameters, significant similarity was achieved with M. truncatula, followed by Glycine Max, Sorghum bicolor, P. trichocarpa and others (Figure 2B). GO assignments were used to classify the functions of the predicted transcript contigs to determine the E-value distribution, sequence similarity distribution, evidence code distribution of sequences, evidence code distribution of BLAST hits, annotation score distribution, annotation distribution and GO-level distribution of transcripts generated by default (Additional file4) and optimized Newbler assembly ( Additional file5). The GO annotation of transcripts from Newbler default and optimized assemblies are represented in Additional file6 and Additional file7 respectively. GO-level sequence distribution for Biological processes, Molecular functions and Cellular components of the transcript contigs and singlet generated by Newbler default and optimized assembly are shown in Figure 3 (A, B respectively). Digital expression profiling by FPKM of each transcript generated from Newbler using default and optimized parameters assembly were also determined (Additional file8 and Additional file9).
To identify the biological pathways that are active in P. hexandrum cell culture, we mapped the annotated sequences to the reference canonical pathways in KEGG. Among the annotated sequences generated by Newbler using default and optimized parameters, 321 and 1069 unique non-redundant sequences were involved in a particular KEGG pathway of which 32 and 100 unique sequences could be assigned to secondary metabolism respectively (Additional file10 and Additional file11).
Protein domains encoded by the P. hexandrum transcriptome that may represent genes involved in podophyllotoxin biosynthesis
We were interested in probable podophyllotoxin pathway genes that could be identified from the transcriptome, therefore Conserved Domains Database (CDD), Pfam, and Tigr databases were searched for domains encoding CADs, monooxygenases, peroxidases (POD), pinoresinol reductases, DPOs, SDGs, and methyl transferases. Our search identified transcripts coding for domains of CAD, SDG, monooxygenase, POD, methyl transferase, NADB Rossmann superfamily, Flavin utilizing monooxygenases superfamily, Uroporphyrinogen decarboxylase methyltransferase (URO-D CIMS) like superfamily, Isoprene-C2-like reductase (ISOPREN C2) like superfamily, Cytochrome oxidase (CypX) superfamily and Oxidoreductase q1 (Oxidored q1) superfamily (Additional file12 and Additional file13). According to the hypothetical scheme of podophyllotoxin pathway, matairesinol is converted to podophyllotoxin by two consecutive methyl group additions forming a compound like yatein. We were also interested in finding methyl transferases that can transfer two methyl groups to the same substrate at the same time. A Uroporphyrinogen IIIC methyl transferase from P. hexandrum was identified in our previous studies, which is known to function in transferring two methyl groups from S-Adenosyl-L-methionine (SAM) to its substrate[28, 29]. Therefore, in addition to finding SAM dependent methyl transferases, we also identified transcripts encoding URO-D CIMS domains.
Combining KAAS-KEGG pathway analysis with domain searching for phenylpropanoid and probable downstream podophyllotoxin pathway genes
BLASTX analysis and KAAS-KEGG pathway mapping of transcripts from Newbler default and optimized parameter identified cDNAs encoding Phenylalanine ammonia lyase (PAL), hydroxyl cinnamoyl transferase (HCT), cinnamate-4-hydroxylase (C4H), 4-Coumarate Ligase (4CL), 4-Coumarate CoA Ligase (4CL), cinnamoyl reductase (CCR), CAD, sinapyl alcohol dehydrogenase (SAD), β-Glucosidase (β-GLUC) and POD as being involved in the phenylpropanoid pathway. Podophyllotoxin pathway is hypothesized to start with CAD converting coniferaldehyde to coniferyl alcohol, which then undergoes dirigent-mediated coupling to form pinoresinol. Specific reductases, dehydrogenases and methyl transferases are then believed to convert pinoresinol to podophyllotoxin. We surveyed the CDD results for cDNAs with domains that may represent genes of this pathway and identified transcripts containing Phenylcoumaran benzylic ether reductase (PCBER), SDGs, monooxygenases, SAM dependent methyl transferases and URO-D CIMS-like domains. A scheme has been presented combining the BLASTX annotation, KAAS-KEGG mapping and domain search for phenylpropanoid pathway transcripts and transcripts with domains that may be part of podophyllotoxin biosynthetic pathway ( Figure 4).
Transcription factors related to secondary metabolism
Controlled transcription of biosynthetic genes is an important mechanism for regulating secondary metabolite production in plant cells. Certain TFs are known to be involved in the regulation of secondary metabolism, such as R2R3-MYB, basic helix-loop-helix (bHLH) proteins like CrMYC2, AP2/ERF family proteins, WRKY, NAC, DOF, bZIP, HD-ZIP, and TFIIIA zinc finger TFs. We identified 96 transcripts from Newbler default assembly (Additional file14) and 961 transcripts from optimized parameter assembly (Additional file15) that may encode TFs (Figure 5A, B respectively). Amongst them, notable transcripts were AP2-EREBP, NAC, bHLH, MYB or MYB related, bZIP, mTERF, WRKY, C2C2-CO-like and C2C2-Dof. A number of plant MYB TFs regulating the phenylpropanoid biosynthetic pathway, identified from many species, including Arabidopsis, apple, grape, maize, petunia and snapdragon, most of which are R2R3-MYB TFs can be correlated with our study as 48 transcripts coding for MYB or MYB related TFs have been identified from the optimized Newbler assembly. Again, R2R3-MYB transcription factor MYB12 in A. thaliana has been shown to function as a flavonol specific activator of flavonoid biosynthesis. Transcriptional regulation of flavonoid biosynthesis, a major branch of phenylpropanoid pathway, controlled by a set of R2R3 MYB transcription factors, have been reported in several plants such as Prunus persica, Epimedium sagittatum as well[33, 34]. Other than this TF, 18 transcripts coded for bHLH TFs have been identified here. The bHLH-domain of the maize R-gene is reported to participate in anthocyanin formation and serve as a link between flavonoid formation and histone modification. Amongst the diverse functions, bHLH transcription factors also regulate the biosynthetic pathway of flavonoids, in several plant species. 1 DOF family TF has been identified in our analyses. AtDOF4 is known to influence phenylpropanoid metabolism in an environmental and tissue-specific manner by positively regulating the production of hydroxycinnamic acids in the hypocotyl and flower buds, and negatively regulating flavonoid biosynthesis in pollen grains. Together, TFs identified here and related to the phenylpropanoid pathway can be explored further in the regulation of podophyllotoxin biosynthesis.
In silico SSR marker identification
SSRs can be divided into genomic SSRs and EST-SSRs. EST-SSRs are more evolutionary conserved than noncoding sequences and therefore have a relatively high transferability[33, 37]. Next-generation sequencing has identified EST-SSRs in many plant species[38–41]. However, there have been no reports of EST-SSRs in P. hexandrum to date.
SSRs were identified with MISA search tool (http://pgrc.ipk-gatersleben.de/misa/), which is based on the criteria that a dinucleotide or a trinucleotide pattern should appear at least six times, and tetra, penta and hexa nucleotide patterns should appear five times each (Additional file16 for Newbler default, Additional file17 for Newbler optimized). SSR distribution and SSR mining of transcripts identified a total of 1,011 SSRs from 40,380 transcripts, with 94 transcripts containing more than one SSR. The most abundant repeat type was dinucleotides (68.6%) and the dominant tandem repeat motifs were (AT)6 and (AT)7 representing 19.4% and 25.7% respectively.
Transcriptome wide survey of miRNA targets in P. hexandrum cell cultures
MiRNAs are known to regulate many developmental and effector genes at the posttranscriptional level[38, 42]. Using oligonucleotide arrays, miRNAs have been shown to be differentially expressed between tissues and during the maturation of the fruit in the grapevine. Wong et al., predicted three wood related genes, flavonol synthase-like, xyloglucan fucosyltransferase and glucan synthase-like genes to be the targets of miR170, miR172 and miR319, respectively, and suggested that these miRNAs might be directly involved in regulation of the phenylpropanoid pathway and hemicellulose biosynthesis pathway. Downregulation of Flavonol synthase by miR170 would redirect the precursor 4-coumaroyl CoA to lignin biosynthesis.
We identified precursor and mature miRNAs in the P. hexandrum cell culture transcriptome, by searching the contigs and singlet in the public miRNA database (Additional file18 and Additional file19)[41, 45]. Transcripts targeted by miRNAs that are possibly related to phenylpropanoid and podophyllotoxin biosynthesis include cytochrome b6, cytochrome p450 like, cell wall associated hydrolase, cell wall associated protein, laccase, and cytochrome f. Deoxypodophyllotoxin 6-hydroxylase is a cytochrome p450 dependent monooxygenase, that catalyzes the introduction of a hydroxyl group in position 6 of deoxypodophyllotoxin. A cytochrome p450 protein is known to catalyze the biosynthesis of a lignan, (+)- sesamin, by forming two methylenedioxy bridges. Laccases have auxiliary roles in stereoselective coupling to 8-8′ linked lignans.
Comparative qRT-PCR of selected phenylpropanoid pathway genes in cell culture, callus and rhizome and podophyllotoxin accumulation in P. hexandrum cell culture
Podophyllotoxin content in P. hexandrum rhizome is known to vary from 4% to 10% in reference to age, altitude, net photosynthetic rate, stomatal conductance, carbon uptake and number of leaves[49–52]. Green calli were found to have 40–50 μg/g podophyllotoxin which were used to generate cell suspension culture. Enhanced accumulation of podophyllotoxin was observed from 3 day old cell culture and increased up to 12 day (Figure 6A) as noted by HPLC analysis (Figure 6B, C). However, no significantly increased accumulation was observed till 18 days. Hence, we chose to compare relative gene expression profile of selected phenylpropanoid pathway transcripts amongst the calli, 12 day old cell culture, and the rhizome. Transcripts of CAD identified here, CAD1, CAD5, and CAD8 share sequence similarity to Arabidopsis CAD1, CAD5, and CAD8 respectively as identified by BLASTX analysis. Primers were designed from transcript contigs for qRT-PCR analysis (Table 3). PAL in 12 days cell culture shows 12.12 fold upregulation with respect to the callus, while upregulation of PAL expression levels in rhizome is insignificant (Figure 6D). C4H is upregulated in the cell culture and the rhizome by 8 and 3.4 folds respectively in comparison to that of callus. The high FPKM value of C4H in the 12 day cell culture samples may correlate with this observation (Table 3). Furthermore, upregulation of CAD 1,5, and 8 can be correlated as well with the FPKM value on the higher side.
This report comprises a large, assembled and functionally annotated high throughput genomic resource for P. hexandrum. Our efforts to unravel the probable genes related to the podophyllotoxin biosynthetic pathway using the next-generation whole transcriptome sequencing of P. hexandrum identified almost all the known members of the phenylpropanoid pathway. The annotated transcripts represent a useful resource for subsequent isolation of podophyllotoxin pathway genes in P. hexandrum. In addition to pathway identification, the identification of EST-SSRs as molecular markers will be useful for conservation of P. hexandrum, which is an endangered species.
Sample preparation and 454 pyrosequencing
Calli were induced from mature leaves of P. hexandrum in MS medium supplemented with 2.68 μM Napthyl acetic acid (NAA) and 8.88 μM Benzylaminopurine (BAP). Cell suspension cultures were initiated from freshly subcultured green calli of P. hexandrum in modified liquid MS medium containing 60mM total N content, 1.25 mM potassium dihydrogen phosphate, 6% glucose and 11.41 μM Indole acetic acid (IAA). An inoculum of 5 g cells was used in 50 ml of cell suspension culture medium. Six flasks containing cell suspension cultures were shaken in the dark at 110 rpm for 12 days. The cells were collected by centrifugation at 1000 × g for 5 min, and immediately put in liquid nitrogen and used for RNA isolation. Total RNA was isolated from 12 days old cell suspension cultures using Purelink miRNA isolation kit (Invitrogen, CA, USA). Total RNA was quantified by NanoDrop technology, checked on a 1% denaturing agarose gel and on a bioanalyzer 2100 (Agilent Technologies, Palo Alto, CA). Removal of rRNA from total RNA was performed using RiboMinus Plant kit for RNA seq (Invitrogen cat. no. A10838-08) using the standard procedure and then concentrated by RiboMinus concentration module (cat no. K155005), according to the manufacturer’s instructions. Library preparation performed using a cDNA Rapid Library Preparation Method Manual-GS FLX Titanium Series, according to the manufacturer’s instructions. For transcriptome sequencing, 1 μg of Ribo-minus total RNA from each sample was used for fragmentation using ZnCl2 solution, followed by ds-cDNA synthesis using a standard cDNA synthesis kit (Roche). This ds-cDNA was then subjected to fragment end-repair followed by adaptor ligation using Rapid Library Prep kit (Roche). emPCR amplification of the cDNA library was performed according to the manufacturer’s instructions (Roche). Clonally amplified cDNA library beads obtained from the emPCR amplification reaction were deposited on a PTP for sequencing using pyrosequencing chemistry. The next-generation sequencing run for whole transcriptome analysis was performed on a Roche 454 GS FLX.
De novo assembly
Raw reads obtained from 454 pyrosequencing were preprocessed by removing low quality reads, and adapter/ primer sequences using PRINSEQ. The high quality reads were uniqed and mapped to non-coding RNA database Rfam (v11.0) using gsMapper. Reads that mapped to ncRNAs sequences were excluded and remaining reads were used for further analysis.The preprocessed reads were then assembled using Newbler with default parameters and optimized parameters. Optimized parameters were set by checking “Use duplicate reads”, “Extend low depth overlaps”, “Reads limited to one contig” and “Single Ace file options”. The sequence data generated in this study have been deposited at NCBI in the Short Read Archive database under the accession (SRX180870 and SRX180386).
Functional annotation, GO mapping, pathway analysis, FPKM value determination and EST-SSR identification
Annotation of the transcripts was carried out using green plants of non-redundant (nr) protein database NCBI using BLASTX. GO mapping was carried out with BLAST 2GO (GO; http://www.geneontology.org). KEGG maps and an enzyme classification number (EC number) were built for pathway analysis. FPKM values for the transcripts were determined using the formula, FPKM = (Number of reads Mapped × 109) / (Length of Transcript × Total Number of Reads). Here number of reads mapped were calculated by mapping reads on assembled transcripts using CLC Genomics Work bench with a mismatch, insertion, deletion cost of 2, 3 and 3 respectively. Potent EST-SSR markers were identified by MISA, a customized Perl script tool freely available for prediction of SSRs.
Protein domains and transcription factor identification in P. hexandrum
Transcripts were searched against a conserved domain database (CDD v3.07) with an E-value cut-off of 0.01 for different domains. For the identification of transcription factor families represented in the P. hexandrum cell culture transcriptome, the transcript contigs were searched against all the transcription factor protein sequences at the plant transcription factor database (http://plntfdb.bio.uni-potsdam.de) using BLASTX with an E-value cutoff 1E-06.
MiRNA target identification in P. hexandrum cell cultures
Conserved miRNAs and their target cDNAs, were found by aligning transcripts against the mature and precursor sequences of known plant miRNAs deposited in miRBase version 19 http://www.mirbase.org/ using CLC Genomic Work bench with a mismatch, insertion, deletion cost of 2, 3 and 3 respectively.
Lignan extraction and high performance liquid chromatography (HPLC) analysis
Lignans were extracted from P. hexandrum cells. In brief, 100 mg of cells were extracted with 2 ml ethanol for 20 min at 60°C in microtubes and sonicated for 15 min. The supernatant was collected after centrifugation and evaporated to dryness under a vacuum. Extracts were dissolved in methanol and used for HPLC analysis. Podophyllotoxin (Sigma-Aldrich, Bangalore, India) was used as a standard. Podophyllotoxin extractions were performed with three biological replicates.
For HPLC, a Waters 2998 photodiode-array detector was set at 290 nm, and separation was carried out using an XTerra RP18, 5 μm, (4.6 × 250 mm i.d.) column. Data analyses were performed with Empower 2 software. Chromatographic conditions were essentially as previously described and standardized in our laboratory.
Quantitative RT-PCR (qRT-PCR)
Total RNA was extracted with a Purelink miRNA isolation kit (Invitrogen) using three biological replicates. RNA (2 μg) was reverse transcribed with oligo(dT) primers using RevertAid H Minus First Strand cDNA Synthesis kit (Fermentas, USA). PCR amplification was performed with Power SYBR Green PCR Master Mix (Applied Biosystems, Japan) on an Applied Biosystems 7500 Real-Time PCR System (Applied Biosystems, USA). Relative expression levels were calculated using the ∆-∆Ct method. All primers for qRT-PCR of selected phenylpropanoid pathway genes have been designed from the sequences obtained from optimized Newbler assembly (Additional file20). Actin primers were designed as reported (Acc. No. FL640971.1), Forward primer: 5′-CTCGGGAGGTGCCACCACC-3′ and Reverse primer 5′-GATGGAAGCTGCTGGGTATTCA-3′.
Chatterji R: A note on germination of Podophyllum seeds. Econ Bot. 1952, 36: 3423-3426.
Giri A, Lakshmi Narasu ML: Production of podophyllotoxin from Podophyllum hexandrum: a potential natural product for clinically useful anticancer drugs. Cytotechnology. 2000, 34: 17-26. 10.1023/A:1008138230896.
Ayres DC, Loike JD: Lignans. Chemical, biological and clinical properties. Chemistry and Pharmacology of Natural Products. Edited by: Phillipson JD, Ayres DC, Baxter H. 1990, Cambridge: Cambridge University Press
Stahelin HF, Von Wartburg A: The chemical and biological route from podophyllotoxin-glucoside etoposide. Cancer Res. 1991, 51: 5-15.
Lerndal T, Svensson B: A clinical study of CPH 82 vs methotrexate in early rheumatoid arthritis. Rheumatology. 2000, 39: 316-320. 10.1093/rheumatology/39.3.316.
Liu YQ, Yang L, Tian X: Podophyllotoxin: current perspectives. Curr Bioact Compd. 2007, 3: 37-66. 10.2174/157340707780126499.
Kartal M, Konuklugil B, Indrayanto G, Alfermann AW: Comparison of different extraction methods for the determination of podophyllotoxin and 6-methoxypodophyllotoxin in Linum species. J Pharm Biomed Anal. 2004, 35: 441-447. 10.1016/j.jpba.2004.01.016.
Empt U, Alfermann AW, Pras N, Petersen M: The use of plant cell cultures for the production of podophyllotoxin and related lignans. J Appl Botany. 2000, 74: 145-150.
Sagar BPS, Zafar R: In vitro-enhanced production of podophyllotoxin in phytohormonal-induced and regenerated roots of Podophyllum hexandrum. Pharm Biol. 2005, 3: 404-410.
Smollny T, Wichers H, Kalenberg S, Shahsavari A, Petersen M, Alfermann AW: Accumulation of podophyllotoxin and related lignans in cell suspension cultures of Linum album. Plant Cell Tiss Organ Cult. 1995, 42: 73-79. 10.1007/BF00037684.
Nadeem M, Palni LMS, Purohit AN, Pandey H, Nandi SK: Propagation and conservation of Podophyllum hexandrum Royle: an important medicinal herb. Biol Conserv. 2000, 92: 121-129. 10.1016/S0006-3207(99)00059-2.
Pandey H, Nandi SK, Kumar A, Palni UT, Palni LMS: Podophyllotoxin content in Podophyllum hexandrum Royle plants of known age of seed origin and grown at a lower altitude. Acta Physiol Plant. 2007, 29: 121-126. 10.1007/s11738-006-0015-0.
Petersen M, Alfermann AW: Production of cytotoxic lignans by plant cell cultures. Appl Microbiol Biotechnol. 2001, 55: 135-142. 10.1007/s002530000510.
Fuss E: Lignans in plant cell and organ cultures: an overview. Phytochem Rev. 2003, 2: 307-320.
Davin LB, Wang HB, Crowell AL, Bedgar DL, Martin DM, Sarkanen S, Lewis NG: Stereoselective bimolecular phenoxy radical coupling by an auxiliary (dirigent) protein without an active center. Science. 1997, 275: 362-366. 10.1126/science.275.5298.362.
Katayama T, Davin LB, Chu A, Lewis NG: Novel benzylic ether reductions in lignan biogenesis in Forsythia intermedia. Phytochemistry. 1993, 33: 581-591. 10.1016/0031-9422(93)85452-W.
Umezawa T, Davin LB, Lewis NG: Formation of lignans (-)-secoisolariciresinol and (-)-matairesinol with Forsythia intermedia cell-free extracts. J Biol Chem. 1991, 266: 10210-10217.
Dewick PM: Biosynthesis of lignans. Studies in natural products chemistry, Structure elucidation (part B). Vol V. Edited by: Rahman A. 1989, Amsterdam: Elsevier, 459-503.
Sakakibara N, Suzuki S, Umezawa T, Shimada M: Biosynthesis of yatein in Anthriscus sylvestris. Organic Biomol Chem. 2003, 1: 2474-2485. 10.1039/b304411d.
Marques JV, Kim KW, Lee C, Costa MA, May GD, Crow JA, Davin LB, Lewis NG: Next generation sequencing in predicting gene function in podophyllotoxin biosynthesis. J Biol Chem. 2013, 288: 466-479. 10.1074/jbc.M112.400689.
Su Z, Ning B, Fang H, Hong H, Perkins R, Tong W, Shi L: Next-generation sequencing and its applications in molecular diagnostics. Expert Rev Mol Diagn. 2011, 11: 333-343.
Schmieder R, Edwards R: Quality control and preprocessing of metagenomic datasets. Bioinformatics. 2011, 27: 863-864. 10.1093/bioinformatics/btr026.
Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer ML, Jarvie TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J: Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005, 437: 376-380.
Knudsen B, Knudsen T, Flensborg M, Sandmann H, Heltzen M, Andersen A, Dickenson M, Bardram K, Steffensen PJ, Monsted S: CLC Genomics Workbench. Version 5.5. 2007, Denmark: CLC Bio
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.
Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M, Robles M: Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005, 21: 3674-3676. 10.1093/bioinformatics/bti610.
Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M: The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004, 32: D277-D280. 10.1093/nar/gkh063.
Vévodová J, Graham RM, Raux E, Schubert HL, Roper DI, Brindley AA, Ian Scott A, Roessner CA, Stamford NPJ, Elizabeth Stroupe M, Getzoff ED, Warren MJ, Wilson KS: Structure/function studies on a S-adenosyl-L-methionine dependent uroporphyrinogen–III C-methyltransferase (SUMT), a key regulatory enzyme of tetrapyrrole biosynthesis. J Mol Biol. 2004, 344: 419-433. 10.1016/j.jmb.2004.09.020.
Bhattacharyya D, Sinha R, Ghanta S, Chakraorty A, Hazra S, Chattopadhyay S: Proteins differentially expressed in elicited cell suspension culture of Podophyllum hexandrum with enhanced Podophyllotoxin content. BMC Proteome Scienc. 2012, 10: 34-10.1186/1477-5956-10-34.
Endt DV, Kijne JW, Memelink J: Transcription factors controlling plant secondary metabolism: what regulates the regulators?. Phytochemistry. 2002, 61: 107-114. 10.1016/S0031-9422(02)00185-1.
Hichri I, Barrieu F, Bogs J, Kappel C, Delrot S, Lauvergeat V: Recent advances in the transcriptional regulation of the flavonoid biosynthetic pathway. J Exp Bot. 2011, 62: 2465-2483. 10.1093/jxb/erq442.
Mehrtens F, Kranz H, Bednarek P, Weisshaar B: The Arabidopsis transcription factor MYB12 is a flavonol-specific regulator of phenylpropanoid biosynthesis. Plant Physiol. 2005, 138: 1083-1096. 10.1104/pp.104.058032.
Ravaglia D, Espley RV, Henry-Kirk RA, Andreotti C, Ziosi V, Hellens RP, Costa G, Allan AC: Transcriptional regulation of flavonoid biosynthesis in nectarine (Prunus persica) by a set of R2R3 MYB transcription factors. BMC Plant Biol. 2013, 13: 68-10.1186/1471-2229-13-68.
Huang W, Sun W, Haiyan LV, Luo M, Zheng S, Pattanaik S, Yuan L, Wang Y: A R2R3-MYB transcription factor from Epimedium sagittatum regulates the flavonoid biosynthetic pathway. Plos One. 2013, 8: e70778-10.1371/journal.pone.0070778.
Hernandez JM, Feller A, Morohashi K, Frame K, Grotewold E: The basic helix–loop– helix domain of maize R links transcriptional regulation and histone modifications by recruitment of an EMSY-related factor. Proc Natl Acad Sci USA. 2007, 104: 17222-17227. 10.1073/pnas.0705629104.
Skiry CZ, Jozefczuk S, Stobiecki M, Muth D, Zanor MI, Witt I, Mueller-Roeber B: Transcription factor AtDOF4;2 affects phenylpropanoid metabolism in Arabidopsis thaliana. New Phytol. 2007, 175: 425-438. 10.1111/j.1469-8137.2007.02129.x.
Peakall R, Gilmore S, Keys W, Morgante M, Rafalski A: Cross-species amplification of soybean (Glycine max) simple sequence repeats (SSRs) within the genus and other legume genera: implications for the transferability of SSRs in plants. Mol Biol Evol. 1998, 15: 1275-1287. 10.1093/oxfordjournals.molbev.a025856.
Cho YG, Ishii T, Temnykh S, Chen X, Lipovich L, McCouch SR, Park WD, Ayres N, Cartinhour S: Diversity of microsatellites derived from genomic libraries and GenBank sequences in rice (Oryza. sativa L.). Theor Appl Genet. 2000, 100: 713-722. 10.1007/s001220051343.
Poncet V, Rondeau M, Tranchant C, Cayrel A, Hamon S, De Kochko A, Hamon P: SSR mining in coffee tree EST databases: potential use of EST-SSRs as markers for the Coffea genus. Mol Genet Genomics. 2006, 276: 436-449. 10.1007/s00438-006-0153-5.
Luro FL, Costantino G, Terol J, Argout X, Allario T, Wincker P, Talon M, Ollitrault P, Morillon R: Transferability of the EST-SSRs developed on Nules clementine (Citrus clementina Hort ex Tan) to other Citrus species and their effectiveness for genetic mapping. BMC Genomics. 2008, 9: 287-10.1186/1471-2164-9-287.
Wang JY, Pan LJ, Yang QL, Yu SL: Development and Characterization of EST-SSR Markers from NCBI and cDNA Library in Cultivated Peanut (Arachis hypogaea L.). Mol Plant Breed. 2009, 7: 806-810.
Filipowicz W, Bhattacharyya SN, Sonenberg N: Mechanisms of posttranscriptional regulation by microRNAs: are the answers in sight?. Nat Rev Genet. 2008, 9: 102-114.
Mica E, Piccolo V, Delledonne M, Ferrarini A, Pezzotti M, Casati C, Del Fabbro C, Valle G, Policriti A, Morgante M, Pesole G, Pe ME, Horner DS, Correction: High throughput approaches reveal splicing of primary microRNA transcripts and tissue specific expression of mature microRNAs in Vitis vinifera. BMC Genomics. 2010, 11: 109-10.1186/1471-2164-11-109.
Wong MM, Cannon CH, Wickneswari R: Identification of lignin genes and regulatory sequences involved in secondary cell wall formation in Acacia auriculiformis and Acacia mangium via de novo transcriptome sequencing. BMC Genomics. 2011, 12: 342-10.1186/1471-2164-12-342.
Griffiths-Jones S, Saini HK, Van Dongen S, Enright AJ, miRBase: Tools for microRNA genomics. Nucleic Acids Res. 2008, 36: D154-D158. 10.1093/nar/gkn221.
Molog GA, Empt U, Kuhlmann S, Van Uden W, Pras N, Alfermann AW, Petersen M: Deoxypodophyllotoxin 6-hydroxylase, a cytochrome P450 monooxygenase from cell cultures of Linum flavum involved in the biosynthesis of cytotoxic lignans. Planta. 2011, 214: 288-294.
Ono E, Nakai M, Fukui Y, Tomimori N, Mizutani MF, Saito M, Satake H, Tanaka T, Katsuta M, Umezawa T, Tanaka Y: Formation of two methylene dioxy bridges by a Sesamum CYP81Q protein yielding a furofuran lignin, (+)- sesamin. Proc Natl Acad Sci USA. 2006, 103: 10116-10121. 10.1073/pnas.0603865103.
Turlapati PV, Kim KW, Davin LB, Lewis NG: The laccase multigene family in Arabidopsis thaliana: towards addressing the mystery of their gene functions. Planta. 2011, 233: 439-470. 10.1007/s00425-010-1298-3.
Nadeem M, Palni LMS, Kumar A, Nandi SK: Podophyllotoxin Content, above and belowground biomass in relation to altitude in Podophyllum hexandrum populations from kumaun region of the Indian Central Himalaya. Planta Med. 2007, 73: 388-391. 10.1055/s-2007-967154.
Vats SK, Kumar S: Photosynthetic response of Podophyllum hexandrum Royle from different altitudes in Himalayan ranges. Photosynthetica. 2006, 44: 136-139. 10.1007/s11099-005-0169-9.
Purohit MC, Bahuguna R, Maithani UC, Purohit AN, Rawat MSM: Variation in podophylloresin and podophyllotoxin contents in different populations of. Podophyllum hexandrum Curr Sci. 1999, 77: 1078-1079.
Kushwaha R, Bhattacharya A, Singh B, Singh RD: Factors affecting podophyllotoxin yield in the ex situ grown Podophyllum hexandrum, an endangered alpine native of the western Himalayas. J Nat Med. 2012, 66: 1-7. 10.1007/s11418-011-0541-y.
Murashige T, Skoog F: A revised medium for rapid growth and bioassays with tobacco tissue cultures. Plant Physiol. 1962, 15: 473-497. 10.1111/j.1399-3054.1962.tb08052.x.
Chakraborty A, Bhattacharya D, Ghanta S, Chattopadhyay S: An efficient protocol for in vitro regeneration of Podophyllum hexandrum, a critically endangered medicinal plant. Indian J Biotechnol. 2010, 9: 217-220.
Chattopadhyay S, Mehra RS, Srivastava AK, Bhojwani SS, Bisaria VS: Effect of major nutrients on podophyllotoxin production in Podophyllum hexandrum suspension cultures. Appl Microbiol Biotechnol. 2003, 60: 541-546.
Thiel T, Michalek W, Varshney RK, Graner A: Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor Appl Genet. 2003, 106: 411-422.
Heyenga AG, Lucas JA, Dewick PM: Production of tumor-inhibitory lignans in callus cultures of Podophyllum hexandrum. Plant Cell Rep. 1990, 9: 382-385.
This work was supported by the CSIR Net Work project (grant No. NWP 0008), project of Council of Scientific and Industrial Research, New Delhi, India. DB, RS acknowledge CSIR and SH to UGC, New Delhi, respectively for their fellowship. The authors acknowledged Director, CSIR-IHBT for providing plant material and Xcelris Labs Ltd, (Ahmedabad, India) for NGS data generation and informatics. We thank Edanz Group Global Ltd. for substantive manuscript copy editing.
The authors declare that they have no competing interests.
The study was conceived by SC and DB. SC collected fresh samples of P. hexandrum from CSIR-IHBT, Palampur, India. The plant material preparation and gene expression analyses were carried out by DB, RS and SH. RD is maintaining the un-treated and treated cell suspension cultures of P. hexandrum. SC and DB contributed to data analysis, bioinformatics analysis, and manuscript preparation. All authors had read and approved the final manuscript.
Electronic supplementary material
Additional file 1:Distribution of transcript contigs according to length assembled by Newbler using default parameters (A) and optimized parameters (B).(JPG 4 MB)
Additional file 4:E-value distribution, sequence similarity distribution, evidence code distribution for sequences, evidence code distribution for BLAST hits, annotation score distribution, annotation distribution, and GO-level distribution for transcripts generated by Newbler using default parameters.(PDF 1 MB)
Additional file 5:E-value distribution, sequence similarity distribution, evidence code distribution for sequences, evidence code distribution for BLAST hits, annotation score distribution, annotation distribution, and GO-level distribution for transcripts by generated by Newbler using optimized parameters.(PDF 79 KB)
Additional file 12:List of important domains that may be required for podophyllotoxin biosynthesis from sequences assembled using Newbler default parameters.(XLS 22 KB)
Additional file 13:List of important domains that may be required for podophyllotoxin biosynthesis from sequences assembled using Newbler optimized parameters.(XLS 34 KB)
Additional file 14:Complete list of transcription factors identified from transcripts generated by Newbler default parameter assembly.(XLS 33 KB)
Additional file 15:Complete list of transcription factors identified from transcripts generated by Newbler optimized parameter assembly.(XLS 156 KB)
Additional file 16:List of SSRs, SSR distribution and SSR mining of transcripts generated by Newbler default parameter assembly.(XLS 21 KB)
Additional file 17:List of SSRs, SSR distribution and SSR mining of transcripts generated by Newbler optimized parameter assembly.(XLS 144 KB)
Additional file 18:List of mature and precursor miRNAs with their respective transcript IDs for transcripts generated by Newbler default parameter assembly.(XLS 24 KB)
Additional file 19:List of mature and precursor miRNAs with their respective transcript IDs for transcripts generated by Newbler optimized parameter assembly.(XLS 36 KB)
Authors’ original submitted files for images
About this article
Cite this article
Bhattacharyya, D., Sinha, R., Hazra, S. et al. De novo transcriptome analysis using 454 pyrosequencing of the Himalayan Mayapple, Podophyllum hexandrum. BMC Genomics 14, 748 (2013). https://doi.org/10.1186/1471-2164-14-748
- 454 pyrosequencing
- Podophyllum hexandrum
- Podophyllotoxin biosynthesis