ASAP: Amplification, sequencing & annotation of plastomes
© Dhingra and Folta; licensee BioMed Central Ltd. 2005
Received: 02 August 2005
Accepted: 07 December 2005
Published: 07 December 2005
Availability of DNA sequence information is vital for pursuing structural, functional and comparative genomics studies in plastids. Traditionally, the first step in mining the valuable information within a chloroplast genome requires sequencing a chloroplast plasmid library or BAC clones. These activities involve complicated preparatory procedures like chloroplast DNA isolation or identification of the appropriate BAC clones to be sequenced. Rolling circle amplification (RCA) is being used currently to amplify the chloroplast genome from purified chloroplast DNA and the resulting products are sheared and cloned prior to sequencing. Herein we present a universal high-throughput, rapid PCR-based technique to amplify, sequence and assemble plastid genome sequence from diverse species in a short time and at reasonable cost from total plant DNA, using the large inverted repeat region from strawberry and peach as proof of concept. The method exploits the highly conserved coding regions or intergenic regions of plastid genes. Using an informatics approach, chloroplast DNA sequence information from 5 available eudicot plastomes was aligned to identify the most conserved regions. Cognate primer pairs were then designed to generate ~1 – 1.2 kb overlapping amplicons from the inverted repeat region in 14 diverse genera.
100% coverage of the inverted repeat region was obtained from Arabidopsis, tobacco, orange, strawberry, peach, lettuce, tomato and Amaranthus. Over 80% coverage was obtained from distant species, including Ginkgo, loblolly pine and Equisetum. Sequence from the inverted repeat region of strawberry and peach plastome was obtained, annotated and analyzed. Additionally, a polymorphic region identified from gel electrophoresis was sequenced from tomato and Amaranthus. Sequence analysis revealed large deletions in these species relative to tobacco plastome thus exhibiting the utility of this method for structural and comparative genomics studies.
This simple, inexpensive method now allows immediate access to plastid sequence, increasing experimental throughput and serving generally as a universal platform for plastid genome characterization. The method applies well to whole genome studies and speeds assessment of variability across species, making it a useful tool in plastid structural genomics.
Chloroplast DNA (cpDNA) represents an extranuclear capsule of genetic information, encoding essential structural and enzymatic proteins of the organelle. This satellite genome contains a wealth of information that has been shaped by speciation, rendering it a rich resource to trace evolutionary relationships between photosynthetic taxa . Genetic manipulation of the chloroplast genome can transform the chloroplast into a bioreactor, allowing large-scale production of proteins vital to agriculture or pharmacology [2, 3]. Maternal inheritance of plastid genome in most species ensures gene containment in genetically modified plants making it an attractive alternative for the integration of foreign genes [4, 5].
Genomic and phylogenetic studies and efficient genetic modification begin with the base material of plastid DNA sequence. Despite their relatively small size, few plastid genomes have been fully sequenced, thus limiting comparative genomics studies across the species. Complete sequence coverage has been resolved for only 13 species representing model and crop plants, namely, Arabidopsis thaliana , Atropa belladonna , Medicago truncatula (GenBank: NC_003119), Oryza sativa , Spinacea oleracea , Nicotiana tabacum , Triticum aestivum , Lotus corniculatus , Zea mays , Panax ginseng , Cucumis sativus (GenBank: NC_007144), Glycine max  and Saccharum officinarum . Comparison of a small set of representative coding or intergenic sequences derived from a large number of species has been used to perform phylogenetic studies , but there are many unresolved phylogenetic questions . Use of complete genome data is another emerging approach in plant phylogenetics . There are at least three federally-sponsored, multi-institution endeavors underway to sequence about 200 plastid genomes from plants [20–22].
There are a number of challenges to rapid access to chloroplast DNA sequence from a given species. Typical sequencing efforts begin with construction of chloroplast plasmid or other genomic libraries. Construction of such resources requires isolation of pure plastid DNA, which may be troublesome in some species. Even shotgun sequencing to appreciable coverage is considerably expensive and time consuming. From the standpoint of method and cost, the generation of plastid sequence data is considerable and a potential hindrance to productive data mining and engineering efforts. One recent report describes a sophisticated methodology using FACS (fluorescence-assisted cell sorting) and RCA (rolling circle amplification) for sequencing a plastid genome . The degree of technological sophistication is inversely proportional to its wider applicability due to the prohibitive costs associated with it, slowing chloroplast genomics studies. These large-scale studies are expensive and prohibit testing of additional related species or ecotypes that may be informative. Additionally, smaller research programs do not necessarily have access to these tools and techniques. Most importantly some rare and/or difficult-to-obtain taxa that are not amenable to large-scale chloroplast DNA extraction can not be analyzed by the existing methodology. While downstream processing of sequence information is highly streamlined due to the presence of freely available tools on the World Wide Web, such as DOGMA [24, 25], the lack of cost-effective innovation in rapid sequence acquisition has restricted plastid informatics studies.
To address these issues, we exploited the fact that chloroplast genomes are extremely well conserved in size, gene arrangement, and coding sequence, at least within major subgroups of the plant kingdom [26, 27]. We formulated the hypothesis that conserved islands of cpDNA sequence could serve as universal anchors to generate overlapping PCR products comprised of conserved coding regions, and adjacent polymorphic intergenic regions. The resulting amplicons could then be sequenced, assembled, annotated and analyzed. This approach exploits the high degree of sequence conservation and general synteny within discrete portions of chloroplast genome. In this report this powerful technique has been applied to the large inverted repeat (IR) region of strawberry (Fragaria × ananassa) and peach (Prunus persica). The entire ~30 kb region was amplified from total DNA, sequenced, annotated and submitted to public databases in several days for a fraction of the cost of traditional or other recently published approaches . To further validate this application, corresponding regions were amplified from a series of other eudicots and a monocot of agricultural importance as well as two gymnosperms (Pinus and the distant vascular plant Gingko) and a pteridophyte (Equisetum). This universal method represents a rapid, inexpensive means to obtain complete coverage of many higher plant plastid genome regions, and even substantial coverage from distant genera. The sequence information generated form this method can hasten phylogenetic and genomics studies and also help in identification of regulatory elements necessary for design of transformation vectors for the manipulation of chloroplast genomes of new species.
ASAP PCR primers. Primer sequences, annealing site and the relative position in tobacco (Nt), Arabidopsis (At) and maize (Zm) are presented, along with the anticipated amplicon size. * represent the primers with low sequence similarity in maize.
Sequence 5' to 3'
Position in Nt
Position in At
Position in Zm
Upstream of JLB
ORF 87/YCF 15
ndhB 3' exon
ndhB 3' exon
ndhB 5' exon
ndhB 5' exon
rps 7 5'end
rps 7 5'end
rps12 exon 2
rps12 exon 2
trnV 3' end
ASAP PCR conditions. The thermalcycler parameters used to generate ASAP amplicons in successive rounds of PCR are presented.
ASAP PCR conditions
1. Touchdown PCR
55 – 0.5/cycle
2. Touchdown PCR
52 – 0.5/cycle
3. Touchdown – extension PCR
52 – 0.5/cycle
1:00 + .05/cycle
1:30 + 0.05/cycle
ASAP coverage. Percentage coverage of the IR B region using ASAP method in 10 different genera and unique features of 4 diverse genera used in the study.
Percentage coverage of the IR region using ASAP
Round 1 (%)
Round 2 (%)
Round 3 (%)
Fragaria × ananassa
Coleus × hybrida
No Inverted Repeat
Small inverted repeat of 17 kb
Fragaria and Prunus (Rosaceae)
Complete coverage of the plastid IR region from Fragaria was obtained after proceeding through three rounds of ASAP PCR (Table 2) with the 27 pairs of primers (Figure 2). These amplicons were generated using Pfu Turbo DNA polymerase (Stratagene Inc., Carlsbad, CA) in order to minimize potential errors generated during PCR reactions. These amplicons were directly sequenced in a 96-well format. The sequence was assembled and annotated as described in Methods.
Interestingly, in Prunus complete coverage was obtained with Round 2 PCR. Sequence comparison with Fragaria revealed that Prunus and Fragaria share considerable sequence similarity in the IR region as expected being from the same phylogenetic group. This is another demonstration of the utility of this technique where two members of the same taxonomic group were sequenced and compared in a very short time frame and in a cost-effective manner.
Other eudicots and identification of a variable region
Chloroplast sequencing efforts of model photosynthetic organisms have provided a wealth of information detailing structural features and plant phylogeny, as well as a basis for manipulation of the plastid genome in the interest of bioengineering. Current molecular phylogenetic studies are carried out using large or complete plastid genome sequences or small coding or intergenic sequences and the Amplification, Sequencing, and Annotation of Plastomes (ASAP) method caters to both approaches. Rapid generation of plastid sequence information is necessary as it can facilitate better design of plastid transformation vectors . One of the factors in successful engineering of cotton and carrot plastomes was the use of species-specific flanking sequences in the transformation vectors [31, 32].
The simple yet powerful ASAP technique described herein expands the capacity for any laboratory to dissect the chloroplast genome at the informatics level with a basic set of available resources. The obvious limitation of ASAP method is that some plastid genomes have undergone extensive rearrangements . Therefore, this method complements other strategies  for obtaining plastid genome sequences and provides a convenient platform for plastomes that share a considerable level of synteny. A long range PCR based approach was used earlier to sequence the Amborella trichopoda plastid genome , requiring nebulization of large amplicons and cloning prior to sequencing. In contrast, the ASAP method performs the dual task of generating small amplicons for direct sequencing and the electrophoresis profiles provide structural genomics information. Direct sequencing of the PCR products limits incorrect base calls resulting from amplification or sequencing errors, as large pools of products are sequenced and even early PCR errors will not be scored incorrectly. In a worst-case scenario a base call at a given position will be unclear, requiring re-amplification and re-sequencing of the amplicon. This approach allows the implementation of non-proofreading, highly processive polymerases that limit costs yet generate substantial quantities of template for downstream analyses.
ASAP also allows description and characterization of the frequent islands of high sequence identity present within the coding regions of sequenced genomes. Using local identities found within the cpDNA sequences of five sequenced eudicot plant species, primer pairs were designed to produce overlapping amplicons that bracket sequences of plastid genes and the rich sequence variability resident to their adjacent intergenic regions. The ASAP method was herein shown to generate representative regions of 10 diverse plant species to almost 100% coverage. Even the rrn region was also amplified from four divergent genomes studied, such as Pisum, Equisetum, Pinus and Ginkgo, with expected efficiency. The success of the method suggests it an excellent first step in the analysis of any novel plastid genome. The ASAP method may be the only practical approach for some rare and/or difficult-to-obtain taxa that are not amenable to chloroplast DNA extraction.
One caveat of this technique is that plastid sequences are not confined to the chloroplast. Plastid DNA sequences are represented in both mitochondrial and nuclear genomes, and may serve as templates for the amplicons generated with the 27 primer pairs. The high copy number of plastomes in green leaf derived total DNA and long primers (26 bp each) used in this approach should preferentially amplify chloroplast sequences. Additionally, nuclear integrated plastid sequences are continuously shuffled and eliminated , making them less likely to be incorrectly amplified via this approach. However, with the design of specific primers this technique could be extended to plastid genomes of distant phylogenetic groups or mitochondrial genomes in species where high level of synteny is present.
The ASAP method represents a rapid means to generate a large amount of plastid genome information from simple PCR steps to facilitate bioinformatic dissection and functional genomic studies. The products generated spotlight AFLPs that serve as low-resolution beacons to report regions of high diversity, such as the hypervariable Region 11 (Figure 5). These regions may be particularly meaningful for phylogenetic analyses. In this capacity the ASAP methods may facilitate studies of hypervariable specific regions of the plastid genome, helping to quickly identify areas of likely importance, bypassing the necessity to sequence an entire genome as a first step. The ASAP method eliminates cloning steps, thus negating the need to identify and trim plasmid sequences before assembly. Another powerful facet of the technique is the potential to generate plastid fragment fingerprint for any species. AFLPs produced may be immediately comparable to those produced by other genomes, revealing deviations in otherwise conserved sequence, thus informing of structural-genomic variation. Even the absence of products in related species may be extremely informative.
To summarize, the ASAP method has the following advantages: The method is ideal for laboratories or programs with limited resources interested in obtaining chloroplast DNA sequence information for their particular research interest. ASAP method may be the only practical approach to obtain chloroplast DNA sequence from rare or small plant samples. This method provides a fingerprint of a given chloroplast region, which can be readily compared amongst different genera and give information of structural variability even without the sequence information. Another practical application of this approach is the use of PCR amplicons generated by ASAP method for construction of chloroplast genome microarrays from a given plant species.
Specific to this report, the method was used to amplify and sequence the large IR region from octoploid strawberry and peach. This ~30 kbp region from strawberry was amplified in 33 PCR reactions and then sequenced. The PCR fragments were generated in three thermalcycler runs over two days, and the entire process, from leaf to data on the server, was performed in under a week for under $500. With this minimal time and capital investment roughly 25% of the chloroplast genome has been deciphered; bidirectionally and with complete coverage. The entire process is now being scaled up to sequence an entire plastid genome within the context of a 96-well plate. In this system 1.2 to 1.5 kb amplicons may be produced and sequenced from this standard format. The common format also lends itself to robotic manipulation, and it is exciting to speculate that a single set of 96 primer pairs may be matched to a high-throughput robotic amplification and sequencing system to generate plastid DNA sequence at the rate of a plastid genome per day. It is our hope that this methodology will hasten study of chloroplast sequences, especially those from unusual organisms or those not considered worthy of large investment.
The cpDNA sequences of five eudicot plant species namely Nicotiana tabacum (NC_001879), Arabidopsis thaliana (NC_000932), Atropa belladonna (NC_004561), Spinacea oleracea (NC_002202) and Panax ginseng (NC_006290) were aligned using ClustalW . Tobacco chloroplast genome sequence was used as a reference and its coding regions were delineated in the aligned sequences. Highly conserved, putative primer sites were derived by hand parsing the aligned sequences of the IR B region. Primer candidates satisfied several criteria. A candidate primer must be resident to the coding region or conserved intergenic region and primer pairs must be spaced by ~1.0 to 1.2 kb. The primers must share 95% sequence identity among the representative plastid genomes. For universal, large-format application in simultaneous PCR reactions the primers should maintain approximately 50% GC content and a Tm of approximately 50°C. Table 1 lists the primer sequences, annealing sites, respective position in tobacco, Arabidopsis and maize plastid genomes and the expected amplicon sizes. This set of primers, used to amplify the large IR region in this manuscript, will be supplied by the authors upon request.
DNA preparation and primary optimization
Total plant DNA was isolated from fresh leaf tissue using the Qiagen DNeasy Kit (Qiagen Inc., Valencia, CA) according to manufacturer's instructions, except that the homogenized plant material was mixed in Buffer AP1 for 10 min on a platform vortexer for 10 min and the sample was centrifuged for 10 min at 1000 rpm to remove any debris at this stage. The supernatant was then incubated at 65°C for 10 min and from this point on the manufacturer's protocol was followed. DNA was isolated from two plant species namely Arabidopsis thaliana (ecotype Col-0) and tobacco (Nicotiana tabacum). These two species served as a positive control and a basis for detection of computationally predicted AFLPs. Upon validation of the technique with known genomes, amplification was performed on several crops of agricultural importance namely, strawberry (Fragaria × ananassa; cv. Strawberry Festival), Sweet orange (Citrus sinensis), lettuce (Lactuca sativa), peach (Prunus persica; cv. UF Sun), Tomato (Lycopersicon esculentum), coleus (Coleus × hybrida), Amaranthus (Amaranthus hypochondricus). Maize (Zea mays; W22) was tested as a model monocot. Since it has been sequenced, amplification discrepancies related to insertion/deletion and/or rearrangement can be anticipated and circumvented. The technique was also performed on diverse plant species to test the range of the approach. These species included Pisum sativum L. (Little Marvel), which does not have an IR region and has only one copy of the rRNA genes, an ancient gymnosperm (Ginkgo biloba), a contemporary gymnosperm loblolly pine (Pinus taeda), along with horsetail, (Equisetum hyemale), a pteridophyte which represents an ancient plastid genome that would test and define the limit of the application of the eudicot species-based primer designs reported here.
Before performing reactions with the 27 primer pairs that define the large IR, it was important to establish the amount of total DNA to use in each reaction. Variability in amplification may be introduced from several sources, namely the relative amount of chloroplast DNA to total DNA ratio and the tendency for inhibitory compounds to co-purify with DNA templates in total DNA isolation . For instance, the relative cpDNA: total DNA ratio will be significantly different between Arabidopsis (haploid genome size ~140 Mbp) and Pinus (haploid genome size ~21658 Mbp), two organisms with massively different nuclear genome sizes. DNA preparation from some species, such as strawberry, may introduce phenolics or polysaccharides that could inhibit efficient amplification during PCR.
To accommodate both of these variables an initial reaction set is performed using the most conserved primers, those representing the 16 S rrn locus (primer set 19; Table 1). These primers predictably generate a fragment in all species tested, yet the yield varies considerably based on the amount of template used in the reaction. For instance, while coleus was best amplified with 1 ng per reaction, Equisetum required 10 ng per reaction (data not shown). The fidelity of the reaction is extremely dependent upon total isolated DNA concentration and the inherent cpDNA: nuclear DNA ratio in total DNA. A pilot experiment testing for the production of PCR product over 4–5 orders of magnitude must be performed to optimize conditions for subsequent reactions.
PCR conditions for plastome amplification
Touchdown PCR was utilized to generate PCR amplicons . The first set of reactions was performed using Round I conditions, conditions conducive to amplification in the species for which the primers were designed (Figure 1; Table 2). This reaction would routinely produce amplicons in 24 out of 27 reactions. Reactions that failed to produce a product were reconstituted from fresh reagents and template, and PCR is performed using Reaction II conditions (Table 2). Typically this amplification was sufficient to obtain complete coverage of the large IR in 7 of the 10 species studied (Table 3). Amplicons generated from this approach will be made feely available upon request.
If Round II conditions failed to produce a PCR product it could be assumed that sequence differences in the primer landing site are present or those sites are deleted or rearranged. In this case PCR is performed using each fragment-specific primer and primers from adjacent fragments that successfully amplified in Round I and/or Round II. The process is outlined in Figure 1.
Sequencing was performed directly from PCR products in a 96-well format at the University of Florida ICBR Core Facility using ET Terminator (Amersham Inc, Schaumburg, IL) as reported earlier . A 5 μl aliquot of PCR product from a 50 μl reaction was analyzed by gel electrophoresis to verify purity and concentration. PCR products were treated with ExoSAP to remove primers and nucleotides. Each amplicon was sequenced bidirectionally using the primers used in initial amplification. Sequencing primers were added at 10 pmol/μl in a 10 μl final reaction volume. Sequences with Phred score of >20 were used for assembly. Both strands were sequenced for each fragment thus providing a 2X coverage and, 4X coverage in the overlapping regions.
Assembly, annotation and dissemination of the sequences from Fragaria and Prunus
Sequences generated from a primer pair were first aligned using Blast 2 sequences (bl2seq) available at NCBI website [39, 40]. Individual amplicons derived from this process were further assembled using CAP3 [41, 42]. The sequence was then annotated using DOGMA [24, 25]. Chloroplast genome sequences from the IR B region of Fragaria and Prunus were submitted to the GenBank under accession numbers FACPINVREP (Fa) and PPCPINVREP (Pp).
List of abbreviations
Amplification, sequencing, annotation of plastomes
Open reading frame
Amplified fragment length polymorphism
Single nucleotide polymorphism
Polymerase chain reaction
Fluorescence-assisted cell sorting
Rolling circle amplification
Dual organeller genome annotator
The authors would like to thank Professor Pam Soltis and Professor Doug Soltis, University of Florida, for critical review of the work and for their helpful comments and suggestions. The authors would also like to thank Jeremy Ramdial and Gene Peir for their technical support, Dr. Angel Alpuche-Solis, IPICYT, Mexico for Amaranthus and Lactuca sativa DNA; Professor John Davis, University of Florida for Pinus taeda DNA; Professor Harry Klee, University of Florida for tomato DNA; Professor Karen Koch, University of Florida for maize DNA; Professor Ben Elmo Whitty, University of Florida for tobacco plants and Professor John Gray, John Innes Institute, UK for pea chloroplast genome sequence. The authors acknowledge Denise Tombolato and Philip Stewart for critical reading and evaluation of this manuscript. This work was performed with support from NSF grant #0416877 (KMF) and funding from IFAS at the University of Florida.
- Palmer JD: Chloroplast DNA and molecular phylogeny. Bioessays. 1985, 2: 263-267. 10.1002/bies.950020607.View ArticleGoogle Scholar
- Maliga P: Plastid transformation in higher plants. Annu Rev Plant Biol. 2004, 55: 289-313. 10.1146/annurev.arplant.55.031903.141633.PubMedView ArticleGoogle Scholar
- Grevich JJ, Daniell H: Chloroplast genetic engineering: Recent advances and future perspectives. Critical Reviews in Plant Sciences. 2005, 24: 83-107. 10.1080/07352680590935387.View ArticleGoogle Scholar
- Daniell H, Kumar S, Dufourmantel N: Breakthrough in chloroplast genetic engineering of agronomically important crops. Trends Biotechnol. 2005, 23: 238-245. 10.1016/j.tibtech.2005.03.008.PubMedPubMed CentralView ArticleGoogle Scholar
- Dhingra A, Daniell H: Arabidopsis Protocols. Edited by: Salinas J and Sanchez-Serrano JJ. 2005, Totowa, New Jersey, Humana Press, 323: 525- Chloroplast genetic engineering via organogenesis or somatic embryogenesis, Second, Methods in Molecular Biology.Google Scholar
- Sato S, Nakamura Y, Kaneko T, Asamizu E, Tabata S: Complete structure of the chloroplast genome of Arabidopsis thaliana. DNA Res. 1999, 6: 283-290. 10.1093/dnares/6.5.283.PubMedView ArticleGoogle Scholar
- Schmitz-Linneweber C, Regel R, Du TG, Hupfer H, Herrmann RG, Maier RM: The plastid chromosome of Atropa belladonna and its comparison with that of Nicotiana tabacum: the role of RNA editing in generating divergence in the process of plant speciation. Mol Biol Evol. 2002, 19: 1602-1612.PubMedView ArticleGoogle Scholar
- Hiratsuka J, Shimada H, Whittier R, Ishibashi T, Sakamoto M, Mori M, Kondo C, Honji Y, Sun CR, Meng BY: The complete sequence of the rice (Oryza sativa) chloroplast genome: intermolecular recombination between distinct tRNA genes accounts for a major plastid DNA inversion during the evolution of the cereals. Mol Gen Genet. 1989, 217: 185-194.PubMedView ArticleGoogle Scholar
- Schmitz-Linneweber C, Maier RM, Alcaraz JP, Cottet A, Herrmann RG, Mache R: The plastid chromosome of spinach (Spinacea oleracea): complete nucleotide sequence and gene organization. Plant Mol Biol. 2001, 45: 307-315. 10.1023/A:1006478403810.PubMedView ArticleGoogle Scholar
- Shinozaki K, Ohme M, Tanaka M, Wakasugi T, Hayashida N, Matsubayashi T, Zaita N, Chunwongse J, Obokata J, Yamaguchi-Shinozaki K, Ohto C, Torazawa K, Meng BY, Sugita M, Deno H, Kamogashira T, Yamada K, Kusuda J, Takaiwa F, Kato A, Tohdoh N, Shimada H, Sugiura M: The complete nucleotide sequence of tobacco chloroplast genome:its gene organization and expression. EMBO J. 1986, 5: 2043-2049.PubMedPubMed CentralGoogle Scholar
- Ogihara Y, Isono K, Kojima T, Endo A, Hanaoka M, Shiina T, Terachi T, Utsugi S, Murata M, Mori N, Takumi S, Ikeo K, Gojobori T, Murai R, Murai K, Matsuoka Y, Ohnishi Y, Tajiri H, Tsunewaki K: Structural features of a wheat plastome as revealed by complete sequencing of chloroplast DNA. Mol Genet Genomics. 2002, 266: 740-746. 10.1007/s00438-001-0606-9.PubMedView ArticleGoogle Scholar
- Kato T, Kaneko T, Sato S, Nakamura Y, Tabata S: Complete structure of the chloroplast genome of a legume, Lotus japonicus. 2000, 7: 323-330.Google Scholar
- Maier RM, Neckermann K, Igloi GL, Kossel H: Complete sequence of the maize chloroplast genome: gene content, hotspots of divergence and fine tuning of genetic information by transcript editing. J Mol Biol. 1995, 251: 614-628. 10.1006/jmbi.1995.0460.PubMedView ArticleGoogle Scholar
- Kim KJ, Lee HL: Complete chloroplast genome sequences from Korean ginseng (Panax schinseng Nees) and comparative analysis of sequence evolution among 17 vascular plants. DNA Res. 2004, 11: 247-261. 10.1093/dnares/11.4.247.PubMedView ArticleGoogle Scholar
- Saski C, Lee SB, Daniell H, Wood TC, Tomkins J, Kim HG, Jansen RK: Complete chloroplast genome sequence of Glycine max and comparative analyses with other legume genomes. 2005, 59: 309-322.Google Scholar
- Asano T, Tsudzuki T, Takahashi S, Shimada H, Kadowaki K: Complete nucleotide sequence of the sugarcane (Saccharum officinarum) chloroplast genome: a comparative analysis of four monocot chloroplast genomes. DNA Res. 2004, 11: 93-99. 10.1093/dnares/11.2.93.PubMedView ArticleGoogle Scholar
- Soltis DE, Albert VA, Savolainen V, Hilu K, Qiu YL, Chase MW, Farris JS, Stefanovic S, Rice DW, Palmer JD, Soltis PS: Genome-scale data, angiosperm relationships, and 'ending incongruence': a cautionary tale in phylogenetics. Trends in Plant Science. 2004, 9: 477-483. 10.1016/j.tplants.2004.08.008.PubMedView ArticleGoogle Scholar
- Wolf PG, Rowe CA, Sinclair RB, Hasebe M: Complete nucleotide sequence of the chloroplast genome from a leptosporangiate fern, Adiantum capillus-veneris L. DNA Research. 2003, 10: 59-65. 10.1093/dnares/10.2.59.PubMedView ArticleGoogle Scholar
- Martin W, Deusch O, Stawski N, Grunheit N, Goremykin V: Chloroplast genome phylogenetics: why we need independent approaches to plant molecular evolution. Trends Plant Sci. 2005, 10: 203-209. 10.1016/j.tplants.2005.03.007.PubMedView ArticleGoogle Scholar
- Project DT: . [http://www.flmnh.ufl.edu/deeptime/]
- Project CGS: . [http://www.genome.clemson.edu/projects/Chloroplast/]
- project C: . [http://evogen.jgi.doe.gov/second_levels/chloroplasts/jansen_project_home/chlorosite.html]
- Wolf PG, Karol KG, Mandoli DF, Kuehl J, Arumuganathan K, Ellis MW, Mishler BD, Kelch DG, Olmstead RG, Boore JL: The first complete chloroplast genome sequence of a lycophyte, Huperzia lucidula (Lycopodiaceae). Gene. 2005, 350: 117-128. 10.1016/j.gene.2005.01.018.PubMedView ArticleGoogle Scholar
- Website D: . [http://bugmaster.jgi-psf.org/dogma/]
- Wyman SK, Jansen RK, Boore JL: Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004, 20: 3252-3255. 10.1093/bioinformatics/bth352.PubMedView ArticleGoogle Scholar
- Palmer JD: Comparative organization of chloroplast genomes. Annu Rev Genet. 1985, 19: 325-354. 10.1146/annurev.ge.19.120185.001545.PubMedView ArticleGoogle Scholar
- Odintsova MS, Iurina NP: Plastidic genome of higher plants and algae: structure and function. Mol Biol (Mosk). 2003, 37: 768-783.View ArticleGoogle Scholar
- Jansen RK, Raubeson LA, Boore JL, dePamphilis CW, Chumley TW, Haberle RC, Wyman SK, Alverson AJ, Peery R, Herman SJ: Methods for obtaining and analyzing whole chloroplast genome sequences. Methods in Enzymology. 2005, , Academic Press, 348-384. 10.1016/S0076-6879(05)95020-9. Volume 395Google Scholar
- Folta KM, Staton M, Stewart PJ, Jung S, Bies DH, Jesudurai C, Main D: Expressed sequence tags (ESTs) and simple sequence repeat (SSR) markers from octoploid strawberry (Fragaria x ananassa). BMC Plant Biol. 2005, 5: 12-10.1186/1471-2229-5-12.PubMedPubMed CentralView ArticleGoogle Scholar
- Lin J, Davis TM: S1 analysis of long PCR heteroduplexes: detection of chloroplast indel polymorphisms in Fragaria. Theoretical and Applied Genetics. 2000, 101: 415-420. 10.1007/s001220051498.View ArticleGoogle Scholar
- Kumar S, Dhingra A, Daniell H: Plastid-expressed betaine aldehyde dehydrogenase gene in carrot cultured cells, roots, and leaves confers enhanced salt tolerance. Plant Physiology. 2004, 136: 2843-2854. 10.1104/pp.104.045187.PubMedPubMed CentralView ArticleGoogle Scholar
- Kumar S, Dhingra A, Daniell H: Stable transformation of the cotton plastid genome and maternal inheritance of transgenes. Plant Molecular Biology. 2004, 56: 203-216. 10.1007/s11103-004-2907-y.PubMedPubMed CentralView ArticleGoogle Scholar
- Cosner ME, Jansen RK, Palmer JD, Downie SR: The highly rearranged chloroplast genome of Trachelium caeruleum (Campanulaceae): multiple inversions, inverted repeat expansion and contraction, transposition, insertions/deletions, and several repeat families. Curr Genet. 1997, 31: 419-429. 10.1007/s002940050225.PubMedView ArticleGoogle Scholar
- Goremykin VV, Hirsch-Ernst KI, Wolfl S, Hellwig FH: Analysis of the Amborella trichopoda chloroplast genome sequence suggests that Amborella is not a basal angiosperm. Mol Biol Evol. 2003, 20: 1499-1505. 10.1093/molbev/msg159.PubMedView ArticleGoogle Scholar
- Matsuo M, Ito Y, Yamauchi R, Obokata J: The rice nuclear genome continuously integrates, shuffles, and eliminates the chloroplast genome to cause chloroplast-nuclear DNA flux. Plant Cell. 2005, 17: 665-675. 10.1105/tpc.104.027706.PubMedPubMed CentralView ArticleGoogle Scholar
- website CW: . [http://www.ebi.ac.uk/clustalw/]
- Demeke T, Adams RP: The effects of plant polysaccharides and buffer additives on PCR. Biotechniques. 1992, 12: 332-334.PubMedGoogle Scholar
- Don RH, Cox PT, Wainwright BJ, Baker K, Mattick JS: Touchdown PCR to circumvent spurious priming during gene amplification. Nucleic Acids Research. 1991, 19: 4008-4008.PubMedPubMed CentralView ArticleGoogle Scholar
- . [http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi]
- Tatusova TA, Madden TL: BLAST 2 SEQUENCES, a new tool for comparing protein and nucleotide sequences. Fems Microbiology Letters. 1999, 174: 247-250. 10.1016/S0378-1097(99)00149-4.PubMedView ArticleGoogle Scholar
- Huang XQ, Madan A: CAP3: A DNA sequence assembly program. Genome Research. 1999, 9: 868-877. 10.1101/gr.9.9.868.PubMedPubMed CentralView ArticleGoogle Scholar
- Huang XQ, Madan A: . [http://genome.cs.mtu.edu/cap/cap3.html]
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.