- Research article
- Open Access
A sweetpotato gene index established by de novo assembly of pyrosequencing and Sanger sequences and mining for gene-based microsatellite markers
© Schafleitner et al; licensee BioMed Central Ltd. 2010
- Received: 8 June 2010
- Accepted: 26 October 2010
- Published: 26 October 2010
Sweetpotato (Ipomoea batatas (L.) Lam.), a hexaploid outcrossing crop, is an important staple and food security crop in developing countries in Africa and Asia. The availability of genomic resources for sweetpotato is in striking contrast to its importance for human nutrition. Previously existing sequence data were restricted to around 22,000 expressed sequence tag (EST) sequences and ~ 1,500 GenBank sequences. We have used 454 pyrosequencing to augment the available gene sequence information to enhance functional genomics and marker design for this plant species.
Two quarter 454 pyrosequencing runs used two normalized cDNA collections from stems and leaves from drought-stressed sweetpotato clone Tanzania and yielded 524,209 reads, which were assembled together with 22,094 publically available expressed sequence tags into 31,685 sets of overlapping DNA segments and 34,733 unassembled sequences. Blastx comparisons with the UniRef100 database allowed annotation of 23,957 contigs and 15,342 singletons resulting in 24,657 putatively unique genes. Further, 27,119 sequences had no match to protein sequences of UniRef100database. On the basis of this gene index, we have identified 1,661 gene-based microsatellite sequences, of which 223 were selected for testing and 195 were successfully amplified in a test panel of 6 hexaploid (I. batatas) and 2 diploid (I. trifida) accessions.
The sweetpotato gene index is a useful source for functionally annotated sweetpotato gene sequences that contains three times more gene sequence information for sweetpotato than previous EST assemblies. A searchable version of the gene index, including a blastn function, is available at http://www.cipotato.org/sweetpotato_gene_index.
- Gene Index
- Paralogous Sequence
- Normalize cDNA Library
- Singleton Sequence
- Previous Assembly
Sweetpotato (Ipomoea batatas (L.) Lam.) belongs to the Convolvulaceae (morning glory family). With an annual global production of 110 million tons, it is an important crop in developing countries and has an abundance of uses, ranging from consumption of fresh roots or leaves to processing into animal feed, starch, flour, candy, and alcohol. Sweetpotato produces more food per hectare than wheat, rice, or cassava, which makes it an important food security crop. The high pro-vitamin A content of orange-fleshed sweetpotato varieties plays a crucial role in mitigating vitamin A deficiency that is still widespread among the poor in developing countries. Adding orange-fleshed sweetpotato to the daily diet could prevent vitamin A deficiency-related blindness and maternal mortality .
African farmers produce about 14 million tons of sweetpotato annually with yields of about 4.2 t/ha. Yields in Africa amount to about a fifth of those in China, indicating a huge potential for future growth . Main reasons for low yields in Africa are, besides lack of high-quality planting material, fertilizers and irrigation, prevailing insect pests, and viral diseases . Genomic tools such as molecular markers could support breeding efforts to produce varieties with high nutritional value and improved virus and insect resistance that would ensure food security of resource-poor farmers in developing countries.
Sweetpotato breeding is constrained by the complexity of the genetics of this out-crossing hexaploid crop and by the lack of genomic resources. Up to now, sweetpotato sequence information has been limited to some 22,000 expressed sequence tag (EST) sequences and ~ 1,500 GenBank sequences and one bacterial artificial chromosome sequence deposited in public databases. At the Plant Gene Database http://www.plantgdb.org/download/download.php?dir=/Sequence/ESTcontig/Ipomoea_batatas/current_version and at TIGR Plant Transcript Assemblies http://plantta.jcvi.org/index.shtml, assemblies of GenBank-deposited Ipomoea batatas ESTs are available. More assembled EST sequences are available from morning glory (Ipomoea nil), a wild relative of cultivated sweetpotato http://compbio.dfci.harvard.edu/tgi/cgi-bin/tgi/gimain.pl?gudb=morning_glory that might be of use as a model plant for some aspects of sweetpotato genomics. Sweetpotato genomics tools are currently restricted to a medium-density cDNA microarray http://www.picme.at/index.php/products/arrays/?task=details&species=Sweetpotato%20Microarray%20PICME_SWP_15K_1, linkage maps [4, 5], and some 130 published microsatellite (SSR) markers [6–10]. To prepare the path for using the large biodiversity of this crop in molecular breeding of improved cultivars, more genomic resources such as sweetpotato gene sequences and markers are urgently needed.
CIP holds in trust 7,783 sweetpotato accessions, including breeding lines, improved varieties, landraces, and wild accessions from 58 countries. A subset of this collection, consisting of 472 accessions that represent the diversity of this crop with respect to agronomical and resistance traits as well as nutritional quality--referred to as a "composite genotype set"--is available for international distribution in the form of disease-free in vitro plants to facilitate access of breeders to sweetpotato biodiversity http://gcpcr.grinfo.net/files/cr_files/gcpcr_file832.xls. The sweetpotato clone Tanzania, a member of the composite genotype set, is a high-yielding, stress-tolerant, and broadly cultivated African landrace that is used in various breeding programs due to its high dry matter content and drought tolerance. This clone has been submitted to transcriptome sequencing in the present work, with the aim to augment the available gene sequence and marker information for this crop.
454 pyrosequencing has become a popular method for high throughput sequencing. At low cost, it provides a huge amount of sequence information at relative low error frequency . It is widely used for genome resequencing , de novo sequencing of small genomes [e.g., ], SNP detection [14, 15], or transcriptome sequencing . Specialized software for de novo assembly of the relative short 454 sequences, ranging from around 100 bp with the 454 FLX system but now approaching 500 bp with the 454 FLX TITANIUM system, is available. However, only a few software packages claim to be suitable for hybrid assembly of the relatively short 454 reads with Sanger sequences [17–20]. The choice of optimal assembly parameters to significantly reduce redundancy of an EST collection and, at the same time, maintain paralogous sequences separated in different contigs remains a challenging task. Particularly in a highly heterozygote hexaploid plant such as sweetpotato, discrimination between paralogs and allelic variants on whole transcriptome level remains difficult. For an EST assembly, it might therefore be preferable to accept an increased level of redundancy instead of risking the merging of different members of a gene family into one contig by using too loose assembly parameters. The quality of an assembly generally augments with increasing sequence information. Therefore assemblies with a limited number of ESTs represent a work in progress rather than a finished transcriptome assembly. As more sequence information becomes available, reassembly can lead to splitting or rearrangement of contigs, thus leading to an optimized representation of the transcriptome.
Here we present the de novo assembly of 454 sequencing reads from two normalized sweetpotato libraries, together with publically available ESTs, which resulted in a gene index for hexaploid sweetpotato. This gene index facilitates the access to sweetpotato gene sequences and has allowed for the design of gene-based microsatellite markers.
A total of 87,307 raw reads comprising 21,292,096 bases were obtained with a 454 FLX quarter run of a normalized cDNA library of leaves from drought-stressed plants of the sweetpotato clone Tanzania. Another quarter run of 454 FLX TITANIUM resulted in 436,817 raw reads and 136,844,411 bases for a normalized cDNA library of stems of the same clone. The average length per read amounted to 243.9 bp for the 454 FLX run and to 313.3 for the 454 FLX TITANIUM run. cDNA synthesis primer sequences present in the reads were removed and very short reads with less than 100 bp and low-quality sequences were eliminated. The remaining 402,506 raw reads were blasted against vector-cleaned sweetpotato ESTs from the GenBank. Blast hits at > 80% coverage and > 80% identity were found in 31.6% of the 454 sequences, suggesting that the remaining two-thirds (68.4%) of the 454 reads represent new EST sequences.
Redundancy in the gene index was assessed by self-megablast and blastx with a A. thaliana protein database. At the nucleic acid level, we performed a self-megablast test to assess similarity between contig and singleton sequences produced at different MMPs. Sequences sharing 80% identity over 80% of their length with another sequence were considered as redundant. As expected, self-megablast hits augmented with increasing MMPs, indicating that sequences representing allelic variants increasingly remained unassembled or were resolved into different contigs. The increase in the number of self-megablast hits went in parallel with the number of sequences and became steeper at MMPs above 80% (Figure 1B). Additionally, we have assessed the redundancy of the assemblies produced at different MMP levels by counting the number of total and unique blastx hits of the contig and singleton sequences with a protein database. At this stage, in order to make redundancy analysis quicker and less demanding with respect to computer power, we used the A. thaliana protein database instead of the very large UniRef100 protein set for blastx analysis. Predictably, the total number of blastx hits increased with rising MMP and the increase became steeper above 80% MMP, whereas unique blastx hits increased linearly over the whole range of MMP and at a much lower path than the total hit number. The number of hits increased mainly for singletons, as more sequences remained unassembled at higher MMP values (Figure 1C).
Reciprocal blastn queries with previous Ipomoea EST assemblies
For I. nil, a near relative of sweetpotato, a gene index has been established based on 61,199 EST and 133 mature transcripts (ETs), resulting in 11,754 tentative consensus sequences (contigs), 9,721 EST-singletons, and 39 ET singletons , http://compbio.dfci.harvard.edu/cgi-bin/tgi/gimain.pl?gudb=morning_glory. The present gene index had 37,986 significant (E = 10-6) blastn hits with the I. nil database, while 28,432 sequences remained without a hit and can be considered as specific for sweetpotato. About 19% of the sequences of the I. nil gene index (4,146 contigs and singletons) had no significant hits with the present sweetpotato gene index. These reciprocal blastn tests indicate good coverage of the present gene index for sweetpotato, but also a relatively high redundancy compared to previous EST assemblies.
The lower hit frequency of singletons in blastx searches might be caused by the smaller size of these sequences compared to contigs (373 vs. 790 bp). Another possibility for decreased hit frequency of singleton sequences would be contamination of the cDNA preparation with genomic DNA, resulting in reads derived from genomic DNA that remained unassembled. Analogous to the contig sequences, we also have searched for the presence of poly-A tails and ORFs in singletons without significant blastx hits. From 19,391 singletons without significant similarity to any protein of the UniRef100 database, 6,775 had a putative poly-A tail. Only 635 singletons of the no-hit singletons, including 47 ESTs from the GenBank database, did not encode any ORF larger than 10 amino acids, while the remaining no-hit singletons had ORFs from 30 to 896 bp. The presence of ORFs in more than 90% of the un-annotated singletons suggests that these sequences also are derived from protein coding genes.
The sweetpotato gene index including the annotation of the contigs and singletons can be viewed, searched, and downloaded at http://www.cipotato.org/sweetpotato_gene_index.
Viral sequences contained in the sweetpotato gene index
Polyprotein (Fragment) n = 1 Tax = Sweet potato feathery mottle virus RepID = Q59A02_9POTV
Reverse transcriptase n = 1 Tax = Petunia vein clearing virus isolate Hohn RepID = POLG_PVCV2
Polyprotein n = 1 Tax = Sweet potato feathery mottle virus RepID = O39734_9POTV
P3 protein n = 1 Tax = Sweet potato feathery mottle virus RepID = UPI0000163D32
Polyprotein n = 1 Tax = Sweet potato feathery mottle virus RepID = O39734_9POTV
Coat protein AV1 n = 1 Tax = Sweet potato leaf curl virus RepID = Q9QS58_9GEMI
Polyprotein n = 1 Tax = Potato virus A RepID = Q9QBT4_PVMA
NIb protein n = 1 Tax = Sweet potato feathery mottle virus RepID = UPI0000156FE9
AC3 n = 1 Tax = Sweet potato leaf curl virus RepID = Q9QS57_9GEMI
coat protein n = 1 Tax = Sweet potato feathery mottle virus RepID = UPI0000163D38
AV2 protein n = 1 Tax = Sweet potato leaf curl virus RepID = B3Y549_9GEMI
Polyprotein (Fragment) n = 1 Tax = Sweet potato feathery mottle virus RepID = Q80MW4_9POTV
Polyprotein n = 1 Tax = Sweet potato chlorotic stunt virus RepID = Q8JJW9_9CLOS
HC-Pro protein n = 1 Tax = Sweet potato feathery mottle virus RepID = UPI0000163D31
Polyprotein (Fragment) n = 1 Tax = Sweet potato feathery mottle virus RepID = Q5GIU0_9POTV
CI protein n = 1 Tax = Sweet potato feathery mottle virus RepID = UPI0000163D34
Polyprotein n = 1 Tax = Sweet potato feathery mottle virus RepID = C4N332_9POTV
Polyprotein n = 1 Tax = Sweet potato feathery mottle virus RepID = O39734_9POTV
Microsatellite marker identification and testing
Sweetpotato accessions used for SSR testing
diploid mapping parent
diploid mapping parent
hexaploid, improved variety
Papua New Guinea
hexaploid mapping parent, improved variety
hexaploid mapping parent, landrace
The present gene index http://www.cipotato.org/sweetpotato_gene_index strongly augments the available sequence information on sweetpotato genes. Owing to the heterozygous and hexaploid nature of this plant, the assembly contains a relatively high level of redundancy compared to previous assemblies to prevent merging of paralogous sequences into contigs. Fifty-nine percent of the obtained sequences could be functionally annotated. The index has been successfully mined for microsatellite sequences and 195 out of 223 selected loci could be successfully amplified.
Plant material, cDNA synthesis, 454 sequencing
Shoots from field-grown plants of the sweetpotato I. batatas variety Tanzania (CIP accession number 440160) were acclimated to the greenhouse and cultivated in pots for one month. Drought was then imposed by not watering the plants for eight weeks. After that time, leaf and stem tissue were sampled separately and total RNA was produced using the Trizol reagent according to the instructions of the supplier (Invitrogen). Complementary DNA was synthesized from the two RNA batches by Evrogen http://www.evrogen.com using the SMART approach , normalized according to , and amplified. Seven μg DNA of each normalized cDNA library was submitted to a quarter 454 sequencing run at the School of Biological Sciences, University of Liverpool. The cDNA library of leaves was sequenced with the 454 FLX technology and the stem library was sequenced using 454 FLX TITANIUM.
Sequence cleaning and assembly
Sequence cleaning and assembly were performed on the CIP High Performance Computer http://hpc.cip.cgiar.org/. Adaptor primer and SMART oligonucleotide sequences as well as low-complexity regions present in the 454 raw reads were masked using the open source software RepeatMasker 3.2.7 http://www.repeatmasker.org/RMDownload.html. Short reads (< 100 bp) were eliminated from the sequence and quality files using custom scripts. A total of 20,094 publically available sweetpotato EST sequences http://www.ncbi.nlm.nih.gov/Genbank/ were downloaded from http://www.ncbi.nlm.nih.gov/sites/entrez and cleaned from vector sequences using SeqClean http://compbio.dfci.harvard.edu/tgi/cgi-bin/tgi/download.pl?ftp_dir=software&file_dir=seqclean/seqclean.tar.gz. The 454 reads were assembled together with the GenBank ESTs with the NGen software (DNASTAR, Madison, WI, USA). For optimization of the assembly, the assembly was tried at MMP in a range from 70 to 90% with 5% intervals. The final assembly was done using the following parameters: match size: 25, gap penalty: 7, mismatch penalty: 12, match score: 10, minimal match percentage: 80, and match spacing: 40. The quality of the assembly was assessed manually for 300 randomly selected contigs by checking the plausibility of the read alignments. Redundancy of the assembly was checked by selfblast using megablast (NCBI) on the CIP High Performance Computer http://hpc.cip.cgiar.org.
Contigs and singletons were annotated through blastx sequence comparison to the UniRef100 database http://www.uniprot.org and A. thaliana proteins (TAIR, http://www.arabidopsis.org) using BLAST 2.2.14 on the CIP High Performance Computer http://hpc.cip.cgiar.org. The best BLAST hits were filtered out with the custom R-script Chomper2.R. GO annotation was performed using Blast2GO software v2.4.2 [25, 26] with a blastx cutoff of 10-6.
Identification of open reading frames
Open reading frames in contig and singleton sequences without significant blastx hit were identified using Orf-Predictor http://proteomics.ysu.edu/tools/OrfPredictor.html.
Microsatellite marker identification and testing
Gene index sequences that had high similarity to GenBank-deposited EST sequences were filtered out to avoid identification of already known SSR sequences. SSR motifs were identified with the SSRlocator , http://minerva.ufpel.edu.br/~lmaia.faem/ssr1.html limiting the hits to motifs that consisted of at least 7 dimers, 5 trimers, or tetramers. Primers for SSR loci were designed with Primer3 with 100-200 bp amplicon size. The primer sequences of the identified SSR loci were compared via blastn with the gene index sequences. Only those primers that gave an unambiguous hit with one contig or singleton or "single locus sequence group" were recorded as candidate SSR. A single locus sequence group was defined as a group of sequences that, based on sequence similarity, was putatively derived from a single locus. We assumed sequences to represent a single locus if the distal sequences were identical. SSR loci were polymerase chain reaction (PCR) amplified in six biodiverse hexaploid sweetpotato and two diploid wild sweetpotato accessions. Four of these accessions represented clones from Asia, Latin America, and Africa, and four accessions were parents of mapping populations (Tab. 2). DNA extraction was done according to  for SSR analysis, and PCR reactions contained 1x PCR-buffer with 2 mM MgCl2, 0.6 mM dNTPs (New England Biolabs), 0.5 μM forward and reverse primer, and 0.5 U Taq polymerase (New England Biolabs) in a volume of 25 μl. Amplification conditions were 4 min. initial denaturation followed by 30 cycles of 1 min at 94°C, 1 min. at annealing temperature (given with the description of the SSR primers, 'Additional file 1), 1 min. at 72°C, and followed by terminal elongation at 72°C for 7 min. The obtained fragments were separated on a Li-Cor 4300 (LI-COR Biosciences, Lincoln, NE, USA) and analyzed essentially as described in .
We thank Raymundo Gutierrez for growing the experimental plants. The works were funded by the Generation Challenge Program http://www.generationcp.org/ GCP project number: G4008.09. We thank Gary Harrison for improving the manuscript.
- Low JW, Arimond M, Osman N, Cunguara B, Zano F, Tschirley D: A food-based approach introducing orange-fleshed sweet potatoes increased vitamin A intake and serum retinol concentrations in young children in rural Mozambique. J Nutr. 2007, 137: 1320-1327.PubMedGoogle Scholar
- FAOSTAT: (Accessed 5 October 2010), [http://faostat.fao.org/]
- Low J, Lynam J, Lemaga B, Crissman C, Barker I, Thiele G, Namanda S, Wheatley C, Andrade M: Sweetpotato in Sub-Saharan Africa. The sweetpotato. Edited by: Loebenstein G, Thottappilly G. 2009, Heidelberg: Springer, 359-390. full_text.View ArticleGoogle Scholar
- Kriegner A, Cervantes JC, Burg K, Mwanga ROM, Zhang D: A genetic linkage map of sweetpotato (Ipomoea batatas (L.) Lam.) based on AFLP markers. Mol Breeding. 2003, 11: 169-185. 10.1023/A:1022870917230.View ArticleGoogle Scholar
- Cervantes-Flores JC, Yencho GC, Kriegner A, Pecota KV, Faulk MA, Mwanga ROM, Sosinski BR: Development of a genetic linkage map and identification of homologous linkage groups in sweetpotato using multiple-dose AFLP markers. Mol Breeding. 2008, 21: 511-532. 10.1007/s11032-007-9150-6.View ArticleGoogle Scholar
- Buteler MI, Jarret RL, LaBonte DR: Sequence characterization of microsatellites in diploid and polyploid Ipomoea. TAG. 1999, 99: 123-132. 10.1007/s001220051216.View ArticleGoogle Scholar
- Jarret RL, Bowen N: Simple sequence repeats (SSRs) for sweet potato germplasm characterization. Plant Gen Res Newsletter. 1994, 100: 9-11.Google Scholar
- Tseng YT, Lo HF, Hwang SY: Genotyping and assessment of genetic relationships in elite polycross breeding cultivars of sweetpotato in Taiwan based on SAMPL polymorphisms. Bot Bull Acad Sinica. 2002, 43: 99-105.Google Scholar
- Zhang DP, Carbajulca D, Ojeda L, Rossel G, Milla S, Herrera C, Ghislain M: Microsatellite Analysis of Genetic Diversity in Sweetpotato Varieties from Latin America. CIP Program Report. 1999, -2000, International Potato Center, Lima, PeruGoogle Scholar
- Hu J, Nakatani M, Mizuno K, Fujimura T: Development and characterization of microsatellite markers in sweetpotato. Breeding Sci. 2004, 54: 177-188. 10.1270/jsbbs.54.177.View ArticleGoogle Scholar
- Harismendy O, Ng PC, Strausberg RL, Wang X, Stockwell TB, Beeson KY, Schork NJ, Murray SS, Topol EJ, Levy S, Frazer KA: Evaluation of next generation sequencing platforms for population targeted sequencing studies. Gen Biol. 2009, 10: R32-10.1186/gb-2009-10-3-r32.View ArticleGoogle Scholar
- Bentley DR: Whole-genome re-sequencing. Curr Opinion Gen & Development. 2006, 16: 545-552.View ArticleGoogle Scholar
- Thomson NR, Holden MTG, Carder C, Lennard N, Lockey SJ, Marsh P, Skipp P, O'Connor CD, Goodhead I, Norbertzcak H, Harris B, Ormond D, Rance1 R, Quail MA, Parkhill J, Stephens RS, Clarke IN: Chlamydia trachomatis: Genome sequence analysis of lymphogranuloma venereum isolates. Genome Res. 2008, 18: 161-171. 10.1101/gr.7020108.PubMed CentralPubMedView ArticleGoogle Scholar
- Barbazuk WB, Emrich SJ, Chen HD, Li L, Schnable PS: SNP discovery via 454 transcriptome sequencing. Plant J. 2007, 51: 910-918. 10.1111/j.1365-313X.2007.03193.x.PubMed CentralPubMedView ArticleGoogle Scholar
- Bundock PC, Eliott FG, Ablett G, Benson AD, Casu RE, Aitken KS, Henry RJ: Targeted single nucleotide polymorphism (SNP) discovery in a highly polyploid plant species using 454 sequencing. Plant Biotech J. 2009, 7: 347-354. 10.1111/j.1467-7652.2009.00401.x.View ArticleGoogle Scholar
- Vera JC, Wheat CW, Fescemyer HW, Frilander MJ, Crawford D, Hanski I, Marden JH: Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing. Mol Ecology. 2008, 17: 1636-1647. 10.1111/j.1365-294X.2008.03666.x.View ArticleGoogle Scholar
- Chevreux B, Pfisterer T, Drescher B, Driese AJ, Müller WEG, Wetter T, Suhai S: Using the miraEST assembler for reliable and automated mRNA transcript assembly and SNP detection in sequenced ESTs. Genome Res. 2004, 14: 1147-1159. 10.1101/gr.1917404.PubMed CentralPubMedView ArticleGoogle Scholar
- Miller JR, Delcher AL, Koren S, Venter E, Walenz BP, Brownley A, Johnson J, Li K, Mobarry C, Sutton G: Aggressive assembly of pyrosequencing reads with mates. Bioinformatics. 2008, 24: 2818-2824. 10.1093/bioinformatics/btn548.PubMed CentralPubMedView ArticleGoogle Scholar
- Papanicolaou A, Stierli R, Ffrench-Constant RH, Heckel DG: Next generation transcriptomes for next generation genomes using est2assembly. BMC Bioinformatics. 2009, 10: 447-10.1186/1471-2105-10-447.PubMed CentralPubMedView ArticleGoogle Scholar
- Soderlund C, Johnson E, Bomhoff M, Descour A: PAVE: Program for assembling and viewing ESTs. BMC Genomics. 2009, 10: 400-10.1186/1471-2164-10-400.PubMed CentralPubMedView ArticleGoogle Scholar
- Childs KL, Hamilton JP, Zhu W, Ly E, Cheung F, Wu H, Rabinowicz PD, Town CD, Buell CR, Chan AP: The TIGR Plant Transcript Assemblies database. Nuc Ac Res. 2007, 35: D846-51. 10.1093/nar/gkl785.View ArticleGoogle Scholar
- Quackenbush J, Cho J, Lee D, Liang F, Holt I, Karamycheva S, Parvizi B, Pertea G, Sultana R, White J: The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species. Nuc Ac Res. 2001, 29: 159-164. 10.1093/nar/29.1.159.View ArticleGoogle Scholar
- Zhu YY, Machleder EM, Chenchik A, Li R, Siebert PD: Reverse transcriptase template switching: a SMART approach for full-length cDNA library construction. Biotechniques. 2001, 30: 892-897.PubMedGoogle Scholar
- Shagin DA, Rebrikov DV, Kozhemyako VB, Altshuler IM, Shcheglov AS, Zhulidov PA, Bogdanova EA, Staroverov DB, Rasskazov VA, Lukyanov S: A novel method for SNP detection using a new duplex-specific nuclease from crab hepatopancreas. Genome Res. 2002, 12: 1935-1942. 10.1101/gr.547002.PubMed CentralPubMedView ArticleGoogle Scholar
- Conesa A, Götz S, Garcìa-Gòmez JM, Terol J, Talòn M, Robles M: Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005, 21: 3674-3676. 10.1093/bioinformatics/bti610.PubMedView ArticleGoogle Scholar
- Aparicio G, Gotz S, Conesa A, Segrelles D, Blanquer I, Garcia JM, Hernandez V, Robles M, Talon M: Blast2GO goes grid: developing a grid-enabled prototype for functional genomics analysis. Stud Health Technol Inform. 2006, 120: 194-204.PubMedGoogle Scholar
- Da Maia LC, Palmieri DA, de Souza VQ, Kopp MM, de Carvalho FI, Costa de Oliveira A: SSR Locator: Tool for simple sequence repeat discovery integrated with primer design and PCR simulation. Int J Plant Genomics. 2008, 41: 2696-Google Scholar
- Doyle JJ, Doyle JL: A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bulletin. 1987, 19: 11-15.Google Scholar
- Ghislain M, Nunez J, del Rosarion Herrera M, Pignataro J, Guzman F, Bonierbale M, Spooner DM: Robust and highly informative microsatellite-based genetic identity kit for potato. Mol Breeding. 2009, 23: 377-388. 10.1007/s11032-008-9240-0.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.