Gene discovery from Jatropha curcas by sequencing of ESTs from normalized and full-length enriched cDNA library from developing seeds
© Natarajan et al; licensee BioMed Central Ltd. 2010
Received: 19 June 2010
Accepted: 27 October 2010
Published: 27 October 2010
Jatropha curcas L. is promoted as an important non-edible biodiesel crop worldwide. Jatropha oil, which is a triacylglycerol, can be directly blended with petro-diesel or transesterified with methanol and used as biodiesel. Genetic improvement in jatropha is needed to increase the seed yield, oil content, drought and pest resistance, and to modify oil composition so that it becomes a technically and economically preferred source for biodiesel production. However, genetic improvement efforts in jatropha could not take advantage of genetic engineering methods due to lack of cloned genes from this species. To overcome this hurdle, the current gene discovery project was initiated with an objective of isolating as many functional genes as possible from J. curcas by large scale sequencing of expressed sequence tags (ESTs).
A normalized and full-length enriched cDNA library was constructed from developing seeds of J. curcas. The cDNA library contained about 1 × 106 clones and average insert size of the clones was 2.1 kb. Totally 12,084 ESTs were sequenced to average high quality read length of 576 bp. Contig analysis revealed 2258 contigs and 4751 singletons. Contig size ranged from 2-23 and there were 7333 ESTs in the contigs. This resulted in 7009 unigenes which were annotated by BLASTX. It showed 3982 unigenes with significant similarity to known genes and 2836 unigenes with significant similarity to genes of unknown, hypothetical and putative proteins. The remaining 191 unigenes which did not show similarity with any genes in the public database may encode for unique genes. Functional classification revealed unigenes related to broad range of cellular, molecular and biological functions. Among the 7009 unigenes, 6233 unigenes were identified to be potential full-length genes.
The high quality normalized cDNA library was constructed from developing seeds of J. curcas for the first time and 7009 unigenes coding for diverse biological functions including oil biosynthesis were identified. These genes will serve as invaluable genetic resource for crop improvement in jatropha to make it an ideal and profitable crop for biodiesel production.
Jatropha curcas L., one of the 175 species in genus Jatropha of the family Euphorbiaceae is a perennial small tree or large shrub native to tropical America and is distributed throughout the tropics and subtropics of Asia and Africa . Recently, jatropha oil is promoted as alternative transport fuel which can be directly blended with petro-diesel or transesterified with methanol and used as biodiesel. From a global food security point of view, jatropha being a non-edible crop which can be grown in areas not suitable for agriculture, is a preferred source for biodiesel feedstock as it does not compete with production of food crops. It reduces the dependence on fossil fuel which is often imported by using precious foreign currency. Its decentralized production will provide income for a large number of small and marginal farmers. Biodiesel is also less harmful to the environment in that its production and combustion reduces emission of greenhouse gases by 41% relative to fossil fuel . It is reported that biodiesel emits less particulate matter than diesel upon combustion . In fact, large scale jatropha cultivation will improve the environment by greening the area and transforming the wasteland to productive land by preventing soil erosion, causing accumulation of organic matter, increasing soil microbial activity, etc.
The demand for biodiesel production is very huge that it cannot be met from wild grown plants. Increasing the jatropha production requires both to bring more area under cultivation and to enhance productivity. Though jatropha can grow and survive in wasteland with less water, nutrient, and virtually no pest and disease management, productive growth and better yield under restrictive environmental conditions require the development of resilient genotypes. Genotypes with improved drought tolerance are preferred for plantations in marginal lands. Large scale plantations may bring in new challenges which need to be addressed. It was reported that J. curcas planted in continuous stretches as a monocrop were devastated by flower and seed feeding insects Scutellera nobilis and Pempelia morosalis. This indicates that plant breeding programs to develop pest and disease resistance are required when large scale cultivation of jatropha is planned.
In addition, jatropha oil composition itself may have to be modified to make it the best feedstock for biodiesel production. Oils with more of saturated fatty acids give higher cetane number, and oxidative stability which are desirable for combustion/ignition quality and shelf life of biodiesel, respectively. But jatropha oil contains less of saturated fatty acids (21.6%) and more of unsaturated fatty acids (78.4%) . While viscosity of petro-diesel is 2.6 mm2/s, it is 4.8 mm2/s for biodiesel derived from jatropha oil. Viscosity affects atomization of the fuel upon injection into the combustion chamber, and thereby, increases the formation of engine deposits . More the carbon number of saturated fatty acids higher will be the viscosity. Jatropha contains 84.5% 18-carbon fatty acids and only 14.9% 16-carbon fatty acids . It is possible to make significant changes in the jatropha oil composition by genetic engineering of the metabolic pathway of oil biosynthesis. Therefore, there is a need and scope for genetic improvement of jatropha by using plant breeding and genetic engineering methods.
Genetic intervention in jatropha requires understanding of the biosynthetic pathways, metabolic flux control points, cloning of the genes that code for the enzymes and proteins involved in the metabolic pathways and development of molecular markers. Molecular studies in jatropha are limited and only a few genes have been isolated from jatropha [7, 8]. Currently, the dbEST of NCBI contains only 250 annotated Expressed sequence tags (ESTs) from jatropha. We have initiated a gene discovery project by large scale sequencing of ESTs. ESTs are short, single pass sequence reads from 5'or 3' end of randomly selected cDNA clones. Sequencing of ESTs has been successfully employed in several plants including tomato , citrus , castor , arabidopsis , rice , water melon , radish  with the objective of gene discovery, metabolic pathway interpretation, gene cloning, molecular marker development, construction of genetic and physical map and comparative mapping and analysis. When sequencing of ESTs is employed for gene discovery purpose, normalization of the cDNA library will greatly increase the efficiency and economy of the process. Normalization reduces the frequency of abundant genes (hundreds of mRNA copies per cell) and enriches the library with rare genes (< 10 mRNA copies per cell) . Since sequencing of ESTs is carried out with the ultimate objective of cloning the genes, it would be highly desirable to combine the normalization with enrichment for full-length clones. This paper reports construction of a normalized and full-length enriched cDNA library from developing seeds of J. curcas and isolation of 7009 unigenes by sequencing of 12,084 ESTs.
Results and Discussion
Construction of cDNA Library
Construction of cDNA library and sequencing of ESTs helps in rapid gene discovery especially in non-model organisms where no prior sequencing data is available. Next generation sequencing technologies can circumvent the need for constructing cDNA libraries and generate extraordinarily huge amount of sequencing data to further speed up the gene discovery process. However, sequencing of cDNA clones has several advantages over the next generation sequencing methods such as higher average read length, virtually no assembly problem, ability to isolate full length genes without going for RACE PCR and availability of physical clones for further characterization and applications.
For the present study, a normalized and full-length enriched cDNA library was constructed from developing seeds of J. curcas. The normalization efficiency was monitored by using chloramphenicol reporter gene. Before normalization, the reporter gene was added to the cDNA population to a redundant rate of about 1% which was found to be reduced to less than 0.025% after normalization. This indicated 40 fold reduction in abundance due to normalization. This normalization will greatly help to enrich the library for rare genes. In addition, it will increase the rate recovery of unigenes and reduce the cost of sequencing by avoiding redundant clones. In fact, the rate of recovery of unigenes in this study was about 58% (7009 unigenes out of 12,084 ESTs) which is much higher than 30 to 40% reported from non-normalized cDNA libraries [11, 17, 18].
Sequencing of Expressed Sequence Tags (ESTs)
From the normalized and full-length enriched cDNA library, 13,220 clones were patched for plasmid DNA isolation. Clones which did not grow in the selection medium or did not yield sufficient quantity or quality of plasmid DNA were discarded. Plasmid DNA suitable for sequencing was isolated from 12,810 clones. These clones were sequenced using M13 reverse primer (CAGGAAACAGCTATGAC) which directly reads the cDNAs from the 5' end. Nucleotide bases having Phred value less than 20 were discarded. Vector backbone and additional sequences that were added during cDNA synthesis were removed. After the trimming exercise 144 empty clones (resulted in 0 bases after trimming) and 12,084 high quality ESTs were obtained. The read length of the high quality ESTs ranged between 105 bp and 874 bp with an average length of 576 bp which is comparable with other reports [19, 20, 18].
Contig Assembly of ESTs
Summary of Contig assembly
Total number of ESTs assembled
Number of contigs
Number of ESTs in contigs
No. of ETS as singletons
Number of unique ESTs (unigenes)
List of abundant ESTs from the normalized cDNA library of Jatropha curcas L
No. of ESTs
Conserved hypothetical protein
Phosphoric diester hydrolase
Disease resistance protein RPM1
Protein disulfide isomerase
Indole-3-acetic acid-amido synthetase GH3.3
Receptor protein kinase
Branched-chain amino acid aminotransferase
5-AMP-activated protein kinase
ATP binding protein
Vacuolar Ca2+/H+ exchanger
Conserved hypothetical protein
RNA recognition motif (RRM)-containing protein
Serine/threonine-protein kinase ASK1
SEC14 cytosolic factor family protein
Conserved hypothetical protein
ARMADILLO REPEAT ONLY 1(ARO1)
Bifunctional dihydrofolate reductase-thymidylate synthase
Ethylene-responsive transcription factor
Annotation of Unigenes
Genes for oil biosynthesis and β-oxidation
Recently jatropha seed oil is widely used for biodiesel production as an alternative renewable energy source. It is important to undertake genetic improvement of this crop to increase oil content, to modify the oil composition, to remove toxic compounds, to increase drought tolerance etc. Seed oil content in brassica and arabidopsis has been increased by the overexpression of diacyl glycerol acyl transferase, lysophosphatidic acid acyltransferase, and glycerol-3-phosphate acyltransferase [37–39]. Seed oil composition has been changed in soybean by using mutant 3-Keto-acyl-ACP synthase II gene which increases the 16-carbon fatty acids and decreases 18-carbon fatty acids . Silencing of stearoyl-ACP desaturase has dramatically increased the content of saturated fatty acid (stearic acid) from 1.2% to 32% in brassica . Therefore, it is possible that increased oil content and specialized seed oil composition can be achieved in jatropha also, provided the genes involved in rate limiting steps of oil biosynthesis pathway are cloned. Important oil biosynthesis genes identified in the present study include the genes involved in fatty acid biosynthesis in plastids (carboxyl transferase of ACCase β subunit, biotin carboxyl carrier protein of ACCase, malonyl-CoA:ACP transacylase, 3-keto acyl ACP reductase, beta-keto acyl ACP synthase I, beta-keto acyl ACP synthase II and acyl carrier protein), desaturation of fatty acids (ω-3-fatty acid desaturase and ω-6-fatty acid desaturase), hydrolysis of fatty acids from acyl-ACP (acyl ACP thioesterase A), activation and transport of free fatty acids to endoplasmic reticulum (long chain acyl-CoA synthetase, acyl -CoA binding protein) and serial incorporation of activated fatty acids to the glycerol backbone to form triacylglycerol or oil (glycerol-3-phosphate acyl transferase, lysophosphatidic acid acyl transferase). Some of these genes are being cloned in plant expression vector for functional evaluation in arabidopsis and tobacco. The unigene collection also contained genes for acyl-CoA oxidase, enoyl-CoA hydratase, β-hydroxyl acyl-CoA dehydrogenase and acyl-CoA acetyl transferase which are involved in β-oxidation of fatty acids and their derivatives. Many of the mutant studies revealed the importance of these enzymes in the breakdown of reserve triacylglycerol, seed development, seed germination, vegetative and reproductive growth phases . Hence the cloning of the genes for beta oxidation pathway will be a valuable source for the genetic manipulation of oil degradation and plant growth.
Genes for crop improvement
Jatropha is a non-edible plant proposed to be grown in areas not suitable for agriculture such as wasteland, sides of railway track, lands with severe water scarcity, saline areas etc. Hence jatropha should be able to withstand these stresses. Plants respond to these stresses by modulating gene expression, which restores the cellular homeostasis, detoxification and recovery of growth [43, 44]. For example, overexpression of betaine aldehyde dehydrogenase gene conferred salt stress tolerance in carrot, maize and tomato [45–47] and Zhang et al.  showed that glycine betaine level is increased under drought, heat and salt stresses in jatropha also. When enzymes for glycine betaine biosynthesis are expressed in plants that do not naturally produce glycine betaine, they accumulate little glycine betaine, because their endogenous choline supply is inadequate [49, 50]. These plants may require overexpression of phosphoethanolamine N-methyltransferase to overproduce choline. Overexpression of Na+/H+ antiporter from Pennisetum glaucum conferred high level of salinity tolerance in transgenic Brassica  and overexpression of E. coli trehalose-6-phosphate synthase gene conferred drought and salt tolerance in rice . The stress related unigenes identified in the current study include the genes coding for phosphoethanolamine N-methyltransferase, Na+/H+ antiporter, trehalose-6-phosphate synthase, glutathione peroxidase, glutathione s-transferase, spermidine synthase, ethylene-responsive transcription factors, ascorbate peroxidase, late embryogenesis abundant proteins, aquaporin, and salt tolerance protein.
The genes that code for the enzymes involved in different metabolic pathways are very important for genetic manipulation in jatropha. Several genes involved in diverse metabolic pathways such as phospholipids biosynthesis, flavanol synthesis, glycolysis, TCA cycle, HMP shunt, glycogenesis were identified from the current study. Other important genes identified from this study are ferritin, mevalonate kinase, lipoxygenase, glutamate decarboxylase, ent-kaurenoic acid oxidase, cinnamoyl-CoA reductase, zeta-carotene desaturase, gibberellin 2-oxidase, lipoic acid synthetase and beta-carotene hydroxylase. The unigenes also included 208 gene families with 3 or more genes. There were 15 families with more than 10 genes and a highest number of 46 members were identified in serine-threonine kinase gene family. Other gene families included glucan endo-1,3-beta-glucosidase precursor, aspartic proteinase nepenthesin-1 precursor, calmodulin binding protein, F-Box family protein, WD-repeat protein, pentatricopeptide repeat-containing protein, cytochrome P450 etc.
Genes with unknown functions
BLASTX results showed that 40.4% of J. curcas unigenes (2836) are having significant similarity with genes that code for unknown, hypothetical and putative proteins. This is significantly higher than 13 to 25% unknown genes reported in arabidopsis, citrus and oil palm [12, 53, 54]. It was found that 2.8% of J. curcas unigenes (191) did not have significant similarity with any genes available in the non-redundant database at NCBI. This is significantly lower than 8 - 24% of such genes reported in other plants [55, 12, 56]. These genes are very important as they may be specific to jatropha.
The unigenes were first classified according their function in 16 different cellular compartments and anatomical structures such as endoplasmic reticulum, plastid, mitochondria, nucleus, cell wall, golgi apparatus etc. which may have unique genes or specific expression profiles. The proportion of genes identified under each class was comparable with A. thaliana genome except those unigenes which were classified under 'unknown cellular components' (Figure 5A). Majority of the unigenes were grouped under 'other intra-cellular components', 'unknown cellular components', 'other cytoplasmic components' and 'chloroplast', which accounted for about 55% of the unigenes. Though, non-green tissue was used for the cDNA library construction, about 12% of the unigenes belonged to chloroplast cellular component.
The unigenes were then classified according to their involvement in 14 different biological processes such as protein metabolism, developmental processes, response to stress, transcription, signal transduction etc. These processes are very important for a cell to live and reproduce. Hence, the genes that cover the biological processes are of great importance for functional study. It was found that the larger part of the J. curcas unigenes were grouped under 'other cellular process', 'other metabolic process' and 'unknown biological processes' accounting for 23.4%, 20.6% and 13.3%, respectively (Figure 5B).
The unigenes were finally classified under 15 different molecular functions which mainly correspond to the activities performed by the gene products from individual gene or group of genes such as transferase activity, hydrolase activity, kinase activity, receptor binding, receptor activity etc. Greater part of the unigenes was classified under 'unknown molecular function', 'other enzyme activity', 'other binding', 'transferase activity' which accounted for 49.3% of the unigenes (Figure 5C).
Validation of ESTs
Normalized and full-length enriched cDNA library was constructed and 12,084 clones were sequenced from J. curcas for the first time. From these sequences, 7009 unigenes were identified which included 6233 potential full-length genes. These genes encoded for diverse biological functions in J. curcas including oil biosynthesis, stress response, flavanol biosynthesis etc. These genes will serve as invaluable resource for the genetic engineering to modify oil composition and to increase oil content, seed yield, pest and disease resistance to make jatropha more suitable for biodiesel production and profitable to farmers. Gene discovery from other tissues of J. curcas are being attempted by using next generation sequencing technology.
Collection of Seeds
Seeds in different developmental stages were collected from Jatropha curcas fruits which were between 0.5 and 2.5 cm in diameter and green in colour (Figure 1). The seeds were flash frozen in liquid nitrogen and stored at -86°C until further use.
Isolation of Total RNA
Total RNA from developing seeds was prepared using Trizol reagent following the manufacturer's protocol (Invitrogen, USA). One gram of the developing seeds were ground with liquid nitrogen to a fine powder and suspended in 10 ml Trizol reagent. The suspension was mixed well, and incubated on ice for 10 min. Subsequently, 2 ml chloroform was added to the suspension and incubated on ice for 10 min. The suspension was centrifuged at 10,000 × g for 15 min at 4°C. The aqueous phase was transferred to a fresh tube, and equal volume of isopropanol was added. It was incubated at -20°C for 1 h, and total RNA was recovered by centrifugation at 10,000 × g for 20 min at 4°C. The supernatant was discarded and the pellet was washed with 70% ethanol, dried and dissolved in 200 μl RNase free water. The total RNA was further purified by using RNeasy Mini Kit following the manufacturer's protocol (Qiagen, Hilden, Germany). Quality of total RNA was checked by agarose gel electrophoresis as well as OD260/OD280 ratio before using it for cDNA synthesis.
cDNA Synthesis and Normalization
Synthesis of full length cDNAs from total RNA and normalization were performed as described by Patanjali et al.  and Soares et al.  with slight modifications. Two cDNA preparations were made using the same total RNA. One was used as tester cDNA and the other was used as driver cDNA. First strand tester cDNA and driver cDNA were synthesized using tester3 primer (CAGTGGTATCAACGCAGAGT GGCCGAGGCGGCCT15) and driver3 primer (GGGATAACAGGGTAATGGCCGAGGCGGCCGACATGT15), respectively. Tester adaptor (GTAACTAGGCCGTAATGGCCACTCTGCGTTGATACCACTG) and driver adaptor (GGCCGTAATGGCCTCGCTACCTTAGGA) were ligated to the 3'end of the newly synthesised first strand tester cDNA and driver cDNA, respectively. These adaptors will ligate only to the 3'end because, they were 5'phosphorylated and 3'blocked. Double strand tester cDNA was prepared by low cycling PCR using tester3 primer and phosphorylated tester5 primer (CAGTGGTATCAACGCAGAGT GGCCATTACGGCCTAGTTACGGG) which is complementary to the 5'tester adaptor. The sense strand of the double strand tester cDNA which is phosphorylated at the 5'end was destroyed by treating it with lambda exonuclease. As a result only the anti-sense strands are retained. Double strand driver cDNA was prepared by low cycling PCR using phosphorylated driver3 primer and driver5 primer (TCCTAAGGTAGCGAGGCCATTACGGCCGGG) which is complementary to the 5'driver adaptor. The anti-sense strand of the double strand driver cDNA which is phosphorylated at the 5'end was destroyed by treating it with lambda exonuclease. As a result only the sense strands are retained.
Anti-sense strands from tester cDNA and sense strands from driver cDNA were mixed and hybridized in 1× hybridization buffer (50 mM Tris-HCl pH 8.0, 0.5 M NaCl, 0.2 mM EDTA) at 68°C for 6 h. The double strand hybrids formed during hybridization were removed by hydroxyapatite chromatography  for normalization of the single strand cDNA population. The normalized single strand cDNAs were converted to double strand cDNA and amplified by FailsafeTM PCR system (Epicentre Biotechnologies, USA) using amplification primer (CAGTGGTATCAACGCAGAGT) for which the binding sites are present both in the 5' and 3' end of the cDNA and it is specific to the tester cDNA only (underlined in tester3 and tester5 primers used for cDNA synthesis). The amplified cDNA was digested with Sfi I restriction enzyme and cloned in modified pBluescript II SK- in which SfiA (GGCCATTACGGCC) and SfiB (GGCCGAGGCGGCC) sites were introduced between Eco RI and Xho I. The ligated plasmids were transferred to E. coli DH10B-T1R (Invitrogen, USA) to construct the cDNA library.
Normalization efficiency was strictly monitored by performing parallel normalization experiment. Chloramphenicol resistance gene with same adaptor and primer sequences was used as reporter gene. In the parallel normalization, the reporter gene was added to the cDNA population as internal control at 1.0% redundant rate. Cloning of cDNA using modified pBluescript II SK- was performed. Normalization efficiency was determined by plating the cDNA library on LB plates containing chloramphenicol.
Quality Control of cDNA Library
Ninety six colonies from the cDNA library were randomly selected for quality control experiment. Cells from these 96 colonies were used for colony PCR as well as plasmid DNA isolation for sequencing. Colony PCR was performed using M13 forward (GTAAAACGACGGCCAGT) and M13 reverse primer (CAGGAAACAGCTATGAC). The PCR products were run on 1.0% agarose gel with 1.0 kb ladder as marker (Genie, Bangalore, India). Clones without cDNAs (empty clones) will give PCR product size of 226 bp and clones with cDNA will give variable PCR product size higher than 226 bp depending on the length of the insert. Based on this, approximate size of cDNA in each clone was determined. For plasmid DNA isolation, the cells were inoculated in 5 ml LB Broth supplemented with 50 mg/L ampicillin and incubated at 37°C at 200 rpm for 16 hours. Plasmid DNA was isolated from 1.5 ml of culture using Plasmid Miniprep Kit following manufacturer's protocol (Biobasic Inc, Canada). About 200 ng of plasmid DNA was used for sequencing using M13 reverse primer and BigDyeTM Terminator v3.1 Cycle Sequencing Kit in 3130xl Genetic Analyzer (Applied Biosystems, CA, USA). These sequences were annotated by using BLASTX algorithm and non-redundant database at NCBI .
Sequencing of Expressed Sequence Tags (ESTs)
The cDNA library was serially diluted and plated on Luria Bertani (LB) agar plates supplemented with 50 μg/ml ampicillin and incubated for 16 hours at 37°C. Well separated single colonies were randomly selected and patched on LB agar plates supplemented with 50 μg/ml ampicillin, and incubated overnight at 37°C. The cells from patched colonies were inoculated in 5 ml LB Broth supplemented with 50 mg/L ampicillin and incubated at 37°C at 200 rpm for 16 hours. Plasmid DNA was isolated from 1.5 ml of culture using Plasmid Miniprep Kit following manufacturer's protocol (Biobasic Inc, Canada). Plasmid DNA was isolated from 12,810 cDNA clones. Sequencing of ESTs was performed using M13 reverse primer and BigDyeTM Terminator v3.1 Cycle Sequencing Kit in 3130xl Genetic Analyzer (Applied Biosystems, CA, USA). The raw sequence data was base called by Phred program  using DNA Sequencing Analysis Software Version 5.1 (Applied Biosystems, CA, USA). Only the bases having Phred value above 20 were considered for further analysis. The vector sequences and cDNA anchor sequences were removed using Codon Code Aligner (Codon Code Corporation, MA, USA). The clones having less than 100 bp good quality sequences were also removed from the analysis.
Contig assembly of ESTs was done using CAP3 program  and Codon Code Aligner (Codon Code Corporation, MA, USA). The default parameters set by CAP3 was used to assemble the ESTs in to contigs and singletons. The number of ESTs in the contigs was used to find the redundancy rate of the ESTs in the library. After the assembly result, the outputs were manually checked and redundant clones were removed. The total numbers of unique ESTs (unigenes) were calculated by adding all singletons and one EST from each contig.
Annotation of Unigenes
The unigenes were annotated using BLASTX and non-redundant database at NCBI . All the sequences which showed significant similarity (e-value < e-10) were assigned putative functions. Others were classified as unigenes with no significant similarity.
Ten ESTs of J. curcas from the current study were selected and corresponding genes from five oilseed crops and Vitis vinifera were obtained from Genbank at NCBI. The Phylogenetic analysis was done using ClustalW2 program .
Identification of full-length unigenes
Full-length nature of the cDNAs at the 5'end was analyzed from BLASTX output. A unigene was considered as potential full-length gene if (1) there was a bonafide 5'UTR, (2) the first amino acid in one of the positive reading frames of query sequence matches with the first amino acid in subject sequence from the database and (3) the reading frame covers the entire length of the query sequence. The unigenes that did not show significant similarity were analyzed for Open Reading Frames (ORF) using ORF Finder at NCBI . The unigenes were considered as potential full-length gene if there was a bonafide 5' UTR and the longest ORF covered almost the entire length of the sequence.
Functional Classification of Unigenes
The unigenes were annotated again using BLASTN and NT database of arabidopsis at The Arabidopsis Information Resource (TAIR) . This resulted in locus identifier for each unigenes. These locus identifiers were submitted for functional classification using GO annotation tool (GOSlim) at TAIR. Functional classification was done under three Gene Ontology categories viz., cellular component, molecular function and biological process. These three broad categories were further classified with different GO Slim terms.
Primers and PCR product size of the genes used for semi-quantitative RT-PCR analysis
Product size (bp)
Carboxyl transferase of ACCase β subunit (CT-β)
Biotin carboxyl carrier protein I of ACCase (BCCPI)
Biotin carboxyl carrier protein II of ACCase (BCCPII)
Malonyl-CoA:ACP transacylase (MCAT)
3-keto acyl ACP Reductase (KAR)
Beta-keto acyl ACP synthase I (KASI)
Beta-keto acyl ACP synthase II (KASII)
Acyl carrier Protein (ACP)
Acyl ACP thioesterase (FATA)
ω-3-fatty acid Desaturase (ω-3-FAD)
Acyl -CoA binding protein (ACBP)
Long chain acyl-CoA synthetase (LCACS)
Acyl-CoA thioesterase (ACT)
Glycerol-3-phosphate acyl transferase (GPAT)
Lysophosphatidic acid acyl transferase (LPAT)
Diacyl glycerol kinase (DGK)
Triacyl glycerol lipase(TAGL)
J. curcas unigenes were submitted to the dbEST database of Genbank at NCBI with the accession numbers GW874611 to GW881590 and HO004465 to HO004493.
Financial support provided for this project by the Department of Biotechnology, Government of India (Sanction order BT/PR/8647/PBD/26/38/2007) and the 16-capillary automated DNA sequencing machine (Genetic Analyzer 3130xl, Applied Biosystems) provided by the SRM University are gratefully acknowledged.
- Mabberley DJ: The Plant Book, A portable dictionary of vascular plants. Cambridge. 2005, Cambridge University Press
- Hill J, Nelson E, Tilman D, Polasky S, Tiffany D: Environmental, economic, and energetic costs and benefits of biodiesel and ethanol biofuels. Proc Natl Acad Sci. 2006, 103: 11206-11210. 10.1073/pnas.0604600103.PubMed CentralPubMedView Article
- Yuan Y, Mei D, Wang Z, Zhang T: Combustion and emissions of the diesel engine using bio-diesel fuel. Front Mech Eng China. 2008, 3: 189-192. 10.1007/s11465-008-0021-6.View Article
- Shanker C, Dhyani SK: Insect pests of Jatropha curcas L. and the potential for their management. Current Science. 2006, 91: 162-163.
- Akbar E, Yaakob Z, Kamarudin SK, Ismail M, Salimon J: Characteristic and Composition of Jatropha Curcas Oil Seedfrom Malaysia and its Potential as Biodiesel Feedstock. European Journal of Scientific Research. 2009, 29: 396-403.
- Knothe G: Dependence of biodiesel fuel properties on the structure of fatty acid alkyl esters. Fuel Processing Technology. 2005, 86: 1059-1070. 10.1016/j.fuproc.2004.11.002.View Article
- Xie WW, Gao S, Wang SH, Zhu JQ, Xu Y, Tang L, Chen F: Cloning and expression analysis of carboxyl transferase of acetyl-coA carboxylase from Jatropha curcas. Z Naturforsch C. 2010, 65: 103-108.PubMedView Article
- Wu PZ, Li J, Wei Q, Zeng L, Chen YP, Li MR, Jiang HW, Wu GJ: Cloning and functional characterization of an acyl-acyl carrier protein thioesterase (JcFATB1) from Jatropha curcas. Tree Physiology. 2009, 29: 1299-1305. 10.1093/treephys/tpp054.PubMedView Article
- Aoki K, Yano K, Suzuki A, Kawamura S, Sakurai N, Suda K, Kurabayashi A, Suzuki T, Tsugane T, Watanabe M, Ooga K, Torii M, Narita T, Shin-I T, Kohara Y, Yamamoto N, Takahashi H, Watanabe Y, Egusa M, Kodama M, Ichinose Y, Kikuchi M, Fukushima S, Okabe A, Sato TAY, Yazawa K, Satoh S, Omura T, Ezura H, Shibata D: Large-scale analysis of full-length cDNAs from the tomato (Solanum lycopersicum) cultivar Micro-Tom, a reference system for the Solanaceae genomics. BMC Genomics. 2010, 11: 210-10.1186/1471-2164-11-210.PubMed CentralPubMedView Article
- Marques MC, Cantabrana HA, Forment J, Arribas R, Alamar S, Conejero V, Amador MAP: A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus. BMC Genomics. 2009, 10: 428-10.1186/1471-2164-10-428.PubMed CentralPubMedView Article
- Lu C, Wallis JG, Browse J: An analysis of expressed sequence tags of developing castor endosperm using a full-length cDNA library. BMC Plant Biology. 2007, 7: 42-10.1186/1471-2229-7-42.PubMed CentralPubMedView Article
- White JA, Todd J, Newman T, Focks N, Girke T, Ilarduya OMD, Jaworski JG, Ohlrogge JB, Benning C: A New Set of Arabidopsis Expressed Sequence Tags from Developing Seeds. The Metabolic Pathway from Carbohydrates to Seed Oil. Plant Physiology. 2000, 124: 1582-1594. 10.1104/pp.124.4.1582.PubMed CentralPubMedView Article
- Jantasuriyarat C, Gowda M, Haller K, Hatfield J, Lu G, Stahlberg E, Zhou B, Li H, Kim H, Yu Y, Dean RA, Wing RA, Soderlund C, Wang GL: Large-Scale Identification of Expressed Sequence Tags Involved in Rice and Rice Blast Fungus Interaction. Plant Physiology. 2005, 138: 105-115. 10.1104/pp.104.055624.PubMed CentralPubMedView Article
- Kim J, Lun SH, Lee L, Kang HG, Gynheung : An Expressed Sequence Tags and mRNA Expression Levels of Tagged cDNAs from Watermelon Anthers and Developing Seeds. Journal of Plant Biology. 2001, 44: 172-177. 10.1007/BF03030236.View Article
- Moon YH, Chae S, Jung JY, An G: Expressed sequence tags of radish flower buds and characterization of a CONSTANS LIKE 1 gene. Mot Cells. 1998, 4: 452-458.
- Carninci P, Shibata Y, Hayatsu N, Sugahara Y, Shibata K, Itoh M, Konno H, Okazaki Y, Muramatsu M, Hayashizaki Y: Normalization and subtraction of cap-trapper-selected cDNAs to prepare full-length cDNA libraries for rapid discovery of new genes. Genome Res. 2000, 10: 1617-1630. 10.1101/gr.145100.PubMed CentralPubMedView Article
- Sakurai T, Plata G, Zapata RF, Seki M, Salcedo A, Toyoda A, Ishiwata A, Tohme J, Sakaki Y, Shinozaki K, Ishitani M: Sequencing analysis of 20,000 full-length cDNA clones from cassava reveals lineage specific expansions in gene families related to stress response. BMC Plant Biology. 2007, 7: 66-10.1186/1471-2229-7-66.PubMed CentralPubMedView Article
- Crowhurst RN, Gleave AP, MacRae EA, Dwamena CA, Atkinson RG, Beuning LL, Bulley SM, Chagne D, Marsh KB, Matich AJ, Montefiori M, Newcomb RD, Schaffer RJ, Usadel B, Allan AC, Boldingh HL, Bowen JH, Davy MW, Eckloff R, Ferguson AR, Fraser LG, Gera E, Hellens RP, Janssen BJ, Klages K, Lo KR, MacDiarmid RM, Nain B, McNeilage MA, Rassam M, Richardson AC, Rikkerink EHA, Ross GS, Schröder R, Snowden KC, Souleyre EJF, Templeton MD, Walton EF, Wang D, Wang MY, Wang YY, Wood M, Wu R, Yauk YK, Laing WA: Analysis of expressed sequence tags from Actinidia :applications of a cross species EST database for gene discovery in the areas of flavor, health, color and ripening. BMC Genomics. 2008, 9: 351-10.1186/1471-2164-9-351.PubMed CentralPubMedView Article
- Futamura N, Totoki Y, Toyoda A, Igasaki T, Nanjo T, Seki M, Sakaki Y, Mari A, Shinozaki K, Shinohara K: Characterization of expressed sequence tags from a full-length enriched cDNA library of Cryptomeria japonica male strobili. BMC Genomics. 2008, 9: 383-10.1186/1471-2164-9-383.PubMed CentralPubMedView Article
- Umezawa T, Sakurai T, Totoki Y, Toyoda A, Seki M, Ishiwata A, Akiyama K, Kurotani A, Yoshida T, Mochida K, Kasuga M, Todaka D, Maruyama K, Nakashima K, Enju A, Mizukado S, Ahmed S, Yoshiwara K, Harada K, Tsubokura Y, Hayashi M, Sato S, Anai T, Ishimoto M, Funatsuki H, Teraishi M, Osaki M, Shinano T, Akashi R, Sakaki Y, Shinozaki KY, Shinozaki K: Sequencing and Analysis of Approximately 40 000 Soybean cDNA Clones from a Full-Length-Enriched cDNA Library. DNA Research. 2008, 15: 333-346. 10.1093/dnares/dsn024.PubMed CentralPubMedView Article
- Naoumkina M, Jerez IT, Allen S, He J, Zhao PX, Dixon RA, May GD: Analysis of cDNA libraries from developing seeds of guar (Cyamopsis tetragonoloba (L.) Taub). BMC Plant Biology. 2007, 7: 62-10.1186/1471-2229-7-62.PubMed CentralPubMedView Article
- Clarke BC, Hobbs M, Skylas D, Appels R: Genes active in developing wheat endosperm. Funct Integr Genomics. 2000, 1: 44-55. 10.1007/s101420000008.PubMedView Article
- Rao KS, Chakrabarti PP, Rao VSK, Prasad RBN: Phospholipid Composition of Jatropha curcas Seed Lipids. J Am Oil Chem Soc. 2009, 86: 197-200. 10.1007/s11746-008-1325-8.View Article
- Holmstrom KO, Somersalo S, Mandal A, Palva TE, Welin B: Improved tolerance to salinity and low temperature in transgenic tobacco producing glycine betaine. Journal of Experimental Botany. 2000, 51: 177-185. 10.1093/jexbot/51.343.177.PubMedView Article
- Liang C, Zhang XY, Luo Y, Wang GP, Zou Q, Wang W: Overaccumulation of Glycine Betaine Alleviates the Negative Effects of Salt Stress in Wheat. Russian Journal of Plant Physiology. 2009, 56: 370-376. 10.1134/S1021443709030108.View Article
- Sarowar S, Oh HW, Cho HS, Baek KH, Seong ES, Joung YH, Cho GJ, Lee S, Choi D: Capsicum annuum CCR4-associated factor CaCAF1 is necessary for plant development and defence response. The Plant Journal. 2007, 51: 792-802. 10.1111/j.1365-313X.2007.03174.x.PubMedView Article
- Ding X, Cao Y, Huang L, Zhao J, Xu C, Li X, Wang S: Activation of the Indole-3-Acetic Acid-Amido Synthetase GH3-8 Suppresses Expansin Expression and Promotes Salicylate- and Jasmonate-Independent Basal Immunity in Rice. The Plant Cell. 2008, 20: 228-240. 10.1105/tpc.107.055657.PubMed CentralPubMedView Article
- Lorenzo O, Piqueras R, Sánchez-Serrano JJ, Solano R: ETHYLENE RESPONSE FACTOR 1 Integrates Signals from Ethylene and Jasmonate Pathways in Plant Defense. The Plant Cell. 2002, 15: 165-178.View Article
- Gu Z, Ma B, Jiang Y, Chen Z, Su X, Zhang H: Expression analysis of the calcineurin B-like gene family in rice (Oryza sativa L.) under environmental stresses. Gene. 2008, 415: 1-12. 10.1016/j.gene.2008.02.011.PubMedView Article
- Ondzighi CA, Christopher DA, Cho EJ, Chang SC, Staehelin LA: Arabidopsis Protein Disulfide Isomerase-5 Inhibits Cysteine Proteases during Trafficking to Vacuoles before Programmed Cell Death of the Endothelium in Developing Seeds. The Plant Cell. 2008, 20: 2205-2220. 10.1105/tpc.108.058339.View Article
- Damer CK, Bayeva M, Hahn ES, Rivera J, Socec CI: Copine A, a calcium-dependent membrane-binding protein, transiently localizes to the plasma membrane and intracellular vacuoles in Dictyostelium. BMC Cell Biology. 2005, 6: 46-10.1186/1471-2121-6-46.PubMed CentralPubMedView Article
- Turner NC, Furbank RT, Berger JD, Gremigni P, Abbo S, Leport L: Seed Size Is Associated with Sucrose Synthase Activity in Developing Cotyledons of Chickpea Crop. Science. 2009, 49: 621-627.
- Bentham G, Hooker JD: Genera Plantarum. 1862, L. Reeve & Co, London
- APG II: An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG II. Botanical Journal of the Linnean Society. 2003, 141: 399-436. 10.1046/j.1095-8339.2003.t01-1-00158.x.View Article
- APG III: An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG III. Botanical Journal of the Linnean Society. 2009, 161: 105-121. 10.1111/j.1095-8339.2009.00996.x.View Article
- Angiosperm Phylogeny Website. [http://www.mobot.org/mobot/research/apweb]
- Sharma N, Anderson M, Kumar A, Zhang Y, Giblin EM, Abrams SR, Zaharia LI, Taylor DC, Fobert PR: Transgenic increases in seed oil content are associated with the differential expression of novel Brassica -specific transcripts. BMC Genomics. 2008, 9: 619-10.1186/1471-2164-9-619.PubMed CentralPubMedView Article
- Maisonneuve S, Bessoule JJ, Lessire R, Delseny M, Roscoe TJ: Expression of Rapeseed microsomal Lysophosphatidic Acid Acyltransferase Isozymes Enhances Seed Oil Content in Arabidopsis. Plant Physiology. 2010, 152: 670-684. 10.1104/pp.109.148247.PubMed CentralPubMedView Article
- Jain RK, Coffey M, Lai K, Kumar A, MacKenzie SL: Enhancement of seed oil content by expression of glycerol-3-phosphate acyltransferase genes. Biochemical Society Transactions. 2000, 28: 958-961. 10.1042/BST0280958.PubMedView Article
- Aghoram K, Wilson RF, Burton JW, Dewey RE: A mutation in a 3-Keto-Acyl-ACP Synthase II Gene is associated with elevated Palmitic acid levels in Soybean seeds. Crop Science. 2006, 46: 2453-2459. 10.2135/cropsci2006.04.0218.View Article
- Knutzon DS, Gregory A, Thompson , Sharon E, Radke , William B, Johnson , Knauf VC, kridlt JC: Modification of Brassica seed oil by antisense expression of a stearoyl-acyl carrier protein desaturase gene. Proc Natl Acad Sci. 1992, 89: 2624-2628. 10.1073/pnas.89.7.2624.PubMed CentralPubMedView Article
- Goepfert S, Poirier Y: β-Oxidation in fatty acid degradation and beyond. Current Opinion in Plant Biology. 2007, 10: 245-251. 10.1016/j.pbi.2007.04.007.PubMedView Article
- Gao S, Ouyang C, Wang S, Xu Y, Tang L, Chen F: Effects of salt stress on growth, antioxidant enzyme and phenylalanine ammonia-lyase activities in Jatropha curcas L. seedlings. Plant Soil Environ. 2008, 54: 374-381.
- Xiong L, Zhu JK: Molecular and genetic aspects of plant responses to osmotic stress. Plant Cell and Environment. 2002, 25: 131-139. 10.1046/j.1365-3040.2002.00782.x.View Article
- Kumar S, Dhingra A, Daniell H: Plastid- Expressed Betaine Aldehyde Dehydrogenase Gene in Carrot Cultured Cells, Roots, and Leaves Confers Enhanced Salt Tolerance. Plant Physiology. 2004, 136: 2843-2854. 10.1104/pp.104.045187.PubMed CentralPubMedView Article
- Wu W, Su Q, Xia XY, Wang Y, Luan YS, An LJ: The Suaeda liaotungensis kitag betaine aldehyde dehydrogenase gene improves salt tolerance of transgenic maize mediated with minimum linear length of DNA fragment. Euphytica. 2008, 159: 17-25. 10.1007/s10681-007-9451-1.View Article
- Jia GX, Zhu ZQ, Chang FQ, Li YX: Transformation of tomato with the BADH gene from Atriplex improves salt tolerance. Plant Cell Rep. 2002, 21: 141-146. 10.1007/s00299-002-0489-1.View Article
- Zhang FL, Niu B, Wang YC, Chen F, Wang SH, Xu Y, Jiang LD, Gao S, Wu J, Tang L, Jia YJ: A novel betaine aldehyde dehydrogenase gene from Jatropha curcas , encoding an enzyme implicated in adaptation to environmental stress. Plant Science. 2008, 174: 510-518.View Article
- Nuccio ML, Russell BL, Nolte KD, Rathinasabapathi B, Gage DA, Hanson AD: The endogenous choline supply limits glycine betaine synthesis in transgenic tobacco expressing choline monooxygenase. Plant Journal. 1998, 16: 487-496. 10.1046/j.1365-313x.1998.00316.x.PubMedView Article
- Huang J, Hirji R, Adam L, Rozwadowski KL, Hammerlindl JK, Keller WA, Selvaraj G: Genetic Engineering of Glycinebetaine Production toward Enhancing Stress Tolerance in Plants: Metabolic Limitations. Plant Physiology. 2000, 122: 747-756. 10.1104/pp.122.3.747.PubMed CentralPubMedView Article
- Rajagopal D, Agarwal P, Tyagi W, Singla-Pareek SL, Reddy MK, Sopory SK: Pennisetum glaucum Na+/H+ antiporter confers high level of salinity tolerance in transgenic Brassica juncea. Mol Breeding. 2007, 19: 137-151. 10.1007/s11032-006-9052-z.View Article
- Garg AK, Kim JK, Owens TG, Ranwala AP, Choi YD, Kochian LV, Wu RJ: Trehalose accumulation in rice plants confers high tolerance levels to different abiotic stresses. Proc Natl Acad Sci. 2002, 99: 15898-15903. 10.1073/pnas.252637799.PubMed CentralPubMedView Article
- Terol J, Conesa A, Colmenero JM, Cercos M, Tadeo T, Agustí J, Alós E, Andres F, Soler G, Brumos J, Iglesias DJ, Götz S, Legaz F, Argout X, Courtois B, Ollitrault P, Dossat C, Wincker P, Morillon R, Talon M: Analysis of 13000 unique Citrus clusters associated with fruit quality, production and salinity tolerance. BMC Genomics. 2007, 8: 31-10.1186/1471-2164-8-31.PubMed CentralPubMedView Article
- Ho CL, Kwan YY, Choi MC, Tee SS, Ng WH, Lim KA, Lee YP, Ooi SE, Lee WW, Tee JM, Tan SH, Kulaveerasingam H, Alwee SS, Abdullah MO: Analysis and functional annotation of expressed sequence tags(ESTs) from multiple tissues of oil palm (Elaeis guineensis Jacq). BMC Genomics. 2007, 8: 381-10.1186/1471-2164-8-381.PubMed CentralPubMedView Article
- Yoshida S, Ishida JK, Kamal NM, Ali AM, Namba S, Shirasu K: A full-length enriched cDNA library and expressed sequence tag analysis of the parasitic weed, Striga hermonthica. BMC Plant Biology. 2010, 10: 55-10.1186/1471-2229-10-55.PubMed CentralPubMedView Article
- Fukuoka H, Yamaguchi H, Nunome T, Negoro S, Miyatake K, Ohyama A: Accumulation, functional annotation, and comparative analysis of expressed sequence tags in eggplant (Solanum melongena L.), the third pole of the genus Solanum species after tomato and potato. Gene. 2010, 450: 76-84. 10.1016/j.gene.2009.10.006.PubMedView Article
- Nanjo T, Sakurai T, Totoki Y, Toyoda A, Nishiguchi M, Kado T, Igasaki T, Futamura N, Seki M, Sakaki Y, Shinozaki K, Shinohara K: Functional annotation of 19,841 Populus nigra full-length enriched cDNA clones. BMC Genomics. 2007, 8: 448-10.1186/1471-2164-8-448.PubMed CentralPubMedView Article
- O'Hara P, Slabas AR, Fawcett T: Fatty Acid and Lipid Biosynthetic Genes Are Expressed at Constant Molar Ratios But Different Absolute Levels during Embryogenesis. Plant Physiology. 2002, 129: 310-320. 10.1104/pp.010956.PubMed CentralPubMedView Article
- Patanjali SR, Parimoo S, Weissman SM: Construction of a uniform-abundance (normalized) cDNA library. Proc Natl Acad Sci. 1991, 88: 1943-1947. 10.1073/pnas.88.5.1943.PubMed CentralPubMedView Article
- Soares MB, Bonaldo MF, Jelene P, Su L, Lawton L, Efstratiadis A: Construction and characterization of a normalized cDNA library. Proc Natl Acad Sci. 1994, 91: 9228-9232. 10.1073/pnas.91.20.9228.PubMed CentralPubMedView Article
- National Centre for Biotechnology Information BLAST. [http://blast.ncbi.nlm.nih.gov/Blast.cgi]
- Ewing B, Hillier L, Wendl MC, Green P: Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 1998, 8: 175-185.PubMedView Article
- Huang X, Madan A: CAP3: A DNA sequence assembly program. Genome Res. 1999, 9: 868-877. 10.1101/gr.9.9.868.PubMed CentralPubMedView Article
- ClustalW2 Program. [http://www.ebi.ac.uk/Tools/clustalw2/index.html]
- Open Reading Frame (ORF) Finder. [http://www.ncbi.nlm.nih.gov/projects/gorf]
- The Arabidopsis Information Resource. [http://www.arabidopsis.org]
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.