RAD tag sequencing as a source of SNP markers in Cynara cardunculus L
© Scaglione et al; licensee BioMed Central Ltd. 2012
Received: 6 August 2011
Accepted: 3 January 2012
Published: 3 January 2012
The globe artichoke (Cynara cardunculus L. var. scolymus) genome is relatively poorly explored, especially compared to those of the other major Asteraceae crops sunflower and lettuce. No SNP markers are in the public domain. We have combined the recently developed restriction-site associated DNA (RAD) approach with the Illumina DNA sequencing platform to effect the rapid and mass discovery of SNP markers for C. cardunculus.
RAD tags were sequenced from the genomic DNA of three C. cardunculus mapping population parents, generating 9.7 million reads, corresponding to ~1 Gbp of sequence. An assembly based on paired ends produced ~6.0 Mbp of genomic sequence, separated into ~19,000 contigs (mean length 312 bp), of which ~21% were fragments of putative coding sequence. The shared sequences allowed for the discovery of ~34,000 SNPs and nearly 800 indels, equivalent to a SNP frequency of 5.6 per 1,000 nt, and an indel frequency of 0.2 per 1,000 nt. A sample of heterozygous SNP loci was mapped by CAPS assays and this exercise provided validation of our mining criteria. The repetitive fraction of the genome had a high representation of retrotransposon sequence, followed by simple repeats, AT-low complexity regions and mobile DNA elements. The genomic k-mers distribution and CpG rate of C. cardunculus, compared with data derived from three whole genome-sequenced dicots species, provided a further evidence of the random representation of the C. cardunculus genome generated by RAD sampling.
The RAD tag sequencing approach is a cost-effective and rapid method to develop SNP markers in a highly heterozygous species. Our approach permitted to generate a large and robust SNP datasets by the adoption of optimized filtering criteria.
Cynara cardunculus (2n = 2x = 34, haploid genome size ~1.08 Gbp ) an allogamous, highly heterozygous Asteraceae species, includes three taxa: the globe artichoke (var. scolymus), the cultivated cardoon (var. altilis) and their common progenitor the wild cardoon (var. sylvestris) . Globe artichoke contributes significantly to the Mediterranean agricultural economy, and is also cultivated in South America, North Africa, China and USA. Over the past 30 years, a body of evidence has grown that plant-based foods can be effective for the alleviation of several chronic diseases, and globe artichoke in particular has been shown to produce a number of nutraceutically and pharmaceutically active compounds. Extracts from both globe artichoke and cultivated cardoon have exhibited hepatoprotective, anticarcinogenic, antioxidative and antibacterial qualities, and even an inhibition of cholesterol biosynthesis and LDL oxidation [3–6]. Finally, there is increasing interest in developing the species as an energy and oilseed crop [7–10].
Since the first linkage map produced for globe artichoke , a number of other segregating populations have been exploited for genetic mapping, including one generated from a hybrid between a globe artichoke and a cultivated cardoon genotype  and, more recently, one obtained by crossing globe artichoke with wild cardoon . The recent development of a set of gene-based microsatellites  has aided the construction of consensus genetic maps [13, 15, 16]. However, these maps remains insufficiently densely populated for trait mapping and marker assisted selection. Current high throughput sequencing technology, which produces DNA sequence at a rate several orders of magnitude faster than conventional methods, is effective as a platform for SNP (single nucleotide polymorphism) discovery. A particularly efficient protocol, termed "restriction-site associated DNA" (RAD) , in combination with the Illumina Genome Analyzer sequencing device , discovers SNPs by sequencing a large set of restriction fragments [19–21]. Here we report the generation of genomic RAD tags from the three C. cardunculus accessions used as the parents for two of our mapping populations. The RAD tags were used to derive SNP markers some of which were then validated by a Cleaved Amplified Polymorphic Sequence (CAPS) assay. The identified SNPs could be useful to produce denser C. cardunculus genetic maps via high-throughput genotyping technologies. The RAD sequence has also been informative for characterizing the repetitive DNA component of the C. cardunculus genome, in particular allowing some inferences to be made regarding the contribution of DNA methylation in inhibiting its expansion.
Results and Discussion
RAD tag sequencing and de novo contig assembly
Annotation and GO categorization of contigs
K-mer distribution analysis
Altogether, our data suggest that the RAD procedure, despite its use of GC-rich recognition sites, has produced a random representation of the C. cardunculus genome, and shows that it represents a reliable means of assessing genome complexity.
SNP calling and classification
SNP mining results.
Total SNPs mining (CcRAD1)
"Fully informative" RAD loci (CcRAD2)
Putative testcross markers (CcRAD2)
"Romanesco C3" testcross over "Altilis 41"
"Altilis 41" testcross over "Romanesco C3"
"Romanesco C3" testcross over "Creta 4"
"Creta 4" testcross over "Romanesco C3"
Common intercross markers (CcRAD2)
CAPS markers conversion and linkage analysis
CAPs markers conversion.
Product size (bp)
Restriction site (bp)
"RomanescoC3" restriction produts
"Altilis 41" restriction products
New LG Alt_22
In crop species where the number of markers available to date is limiting, the use of high throughput sequencing to generate large numbers of genetically informative assays can make a valuable and rapid contribution to linkage mapping, and its major downstream application, marker-assisted selection. RAD tag sequencing based on the Illumina platform has proven to be a highly reliable and cost-effective means of SNP discovery. We were able to identify thousands of putative SNP markers in this way, and the majority of a random sample of 24 was fully validated through conversion to CAPS assays and linkage analysis. Furthermore, the reduction in template complexity generated by the RAD approach greatly facilitates its implementation in mapping-by-sequencing approaches.
A large proportion of the methylation present in DNA occurs in the form of CpG dinucleotides, and there is little evidence for negative selection against these in the many genomes which have been analysed to date [32, 37]. Acquiring genome-wide sequence has given a glimpse of the genome complexity present in C. cardunculus. Even though the RAD tags represent only a sample of the genome as a whole, it was clear that there exists a relationship between the frequency of CpG dinucleotides and the level of sequence repetitiveness, consistent with the known role played by methylation in controlling genome expansion due to transposable element activity [30, 31].
Plant material and RAD tag sequencing
Genomic DNA was extracted from the leaf of the three C. cardunculus accessions, following the protocol described by Lanteri et al. . The three accessions have been used as parents of two F1 populations, made by crossing globe artichoke variety "Romanesco C3" as female with either the cultivated cardoon variety "Altilis 41" or the wild cardoon accession "Creta 4" as male . "Romanesco C3" is a late-maturing variety, which forms large purple-green capitula, each bearing violet coloured florets; "Altilis 41" was selected at the University of Catania  on the basis of its biomass yield potential; its foliage is grey and its florets white. "Creta 4" was collected from a wild population in Crete; it produces a large number of capitula, forms green-violet bracts and violet florets. Each DNA sample was processed into a separate RAD libraries as reported by Baird et al. . Briefly, 300 ng DNA were digested with 20 U of PstI (New England Biolabs, NEB) for 60 min at 37°C in a 50 μl reaction, after which the reactions were heat inactivated by holding at 65°C for 20 min. A 2.5 μL aliquot of 100 nM P1 adaptor (a modified Illumina adapter)  was added to each sample along with 1 μL 10 mM ATP (Promega), 1 μL 10x NEB Buffer4, 1,000 U T4 DNA ligase (Enzymatics, Inc) and 5 μL H2O, and the reaction was incubated at room temperature for 20 min, ending with a heat inactivation step (65°C/20 min). The reactions were then pooled and sheared to an average length of 500 bp using a Bioruptor (Diagenode). The sheared DNA was separated by electrophoresis through a 1.5% agarose gel, and fragments in the 300-800 bp range were isolated using a MinElute Gel Extraction kit (Qiagen). The End-Repair mix (Enzymatics, Inc.) was used to blunten the dsDNA ends, and the samples were re-purified using a MinElute column (Qiagen), following which 15 U Exo-Klenow (Enzymatics, Inc.) were added and the sample incubated at 37°C to generate 3'-adenine overhangs. After subsequent purification, 1 μL 10 μM P2 adapter (a second modified Illumina adapter)  was ligated and the sample purified as above. The concentration of DNA in the eluate was quantified using a Qubit fluorimeter, and a 20 ng aliquot was used for a 100 μL PCR comprising 20 μL Phusion Master Mix (NEB), 5 μL 10 μM P2 and H2O. The 18 cycle PCR amplification regime followed the recommendation of the manufacturer (NEB). After this PCR, the samples were separated by electrophoresis once again through a 1.5% agarose gel, and fragments in the 300-700 bp range were excised from the gel and diluted to 3 ng/μL. The material was analysed on an Illumina Genome Analyzer IIx following the paired ends (2x 54 bp) genomic DNA sequencing protocol suggested by the manufacturer.
RAD contig assembly
The sequences were sorted according to their multiplex identifier tag. A RAD LongRead® contig assembly was generated by a set of algorithms developed at Floragenex Inc. Sequences having more than 5 bases with poor Illumina quality scores (Phred10 or lower) were discarded. Paired reads were collapsed into sequence "clusters" on the basis of single ends (SE) sharing 100% sequence identity. To maximize assembly efficiency, a minimum of 25x and maximum 400x sequence coverage at RAD SE reads were imposed. The variable paired end sequences for each common SE were extracted using the filtered sequence set and compiled for the LongRead® contig construction, using a modified version of the Velvet sequence assembler (v. 1.0.04)  and testing several k-mers in graph construction for each RAD contig. After analysis of the first-pass assembly from each template, "Creta 4" was selected as the reference sequence set. Additional filters were then applied to remove short contigs (< 100 bp in length), low paired end coverage (< 4.0x) or ambiguous contigs (containing N's homopolymers). If more than a single contig (NODE1) was assembled for a given RAD locus, alternative ones were retained in the dataset and labelled accordingly (NODE2, NODE3).
Contig annotation and categorization
RAD contigs were annotated using Blast2GO software , and were submitted to the NCBI nr protein database where an E-value of 10e-3 or lower were retrieved (20 best hits recorded). Gene names and GIs (gene identifiers) were assigned according to NCBI guidelines, and PIR (Protein Information Resource) identifiers in reference to UniProt, SwissProt, TrEMBL, RefSeq, GenPept and PDB. The annotation was obtained by applying the formula embedded in Blast2GO , setting a threshold score of 55. In the Blast2Go pipeline, GO terms are "transferred" to query sequences only whether a score threshold is reached. This score is calculated basing on both sequence similarity and presence of children node in the directed acyclic graph (DAG). Therefore, in this scenario the first e-value cut-off is used only for the purpose of "collecting" GO-terms, while other more stringent criteria are ruling whether transfer these terms to our sequences. Enzyme codes were retrieved from GO tables and mapped onto KEGG pathways. Transposable elements were detected using RepeatMasker v3.2.9 software http://www.repeatmasker.org, based on the RMBlast algorithm. Default parameters (except for -s flag) were used to search against Viridiplantae repeats.
K-mer distribution and CpG suppression
where CpG represents the observed frequency of CpG dinucleotides and p(C) and p(G) the respective frequencies of each single nucleotide.
MAQ software (v. 0.5.0)  was used to align the paired end reads in the "Creta 4" reference contig set. The alignment threshold was set to a maximum of three nucleotides mismatch between Illumina reads and the reference. Gaps in the alignment of up to 2 nt allowed. Two levels of stringency were applied. In the first (CcRAD1), a comprehensive list of putative SNPs and 1-2 bp indels was populated with a minimum coverage of 6x as threshold prior to uploading to a Microsoft Access relational database; and for the second (CcRAD2), "fully informative" SNPs were defined when a minimum of 1-read allele calling was achieved for each of the three samples. In the latter set, heterozygous SNPs were assessed where the within sample allele frequency ranged from 0.25 to 0.75, together with a minimum coverage of 4x and allele calling for two reads. Sites were assigned as homozygous when the minor allele frequency fell below 0.10.
Candidate SNP markers were categorized as testcross in pair-wise comparisons of genotypes, whether a heterozygous imputation was present for one parent only (testcross) and a homozygous site was predicted for the other. Common intercross markers were defined for loci showing heterozygous states across all the three samples.
CAPS assay design and application
A subset of heterozygous SNPs was selected from the "Altilis 41" sequence, and a search carried out for Bam HI, Eco RI, Eco RV, Nde I, Xba I, Bcc I, Fok I, Xmn I and Dra I (6 bp cutters), or Taq I and Mse I (4 bp cutters) recognition sites using SNP2CAPS script (v. 0.6) . A predicted fragment size difference of at least 20 bp was imposed to allow detection on standard agarose gels. Locus-specific primers were designed from the BatchPrimer3 web interface , using default parameters but for product size (100-400 bp) and annealing sites (within a 50 bp window at either end of the RAD contig). The resulting assays were applied to a set of 94 F1 segregants from the cross "Romanesco C3" × "Altilis 41" . PCRs were carried out in a 20 μl volume containing 12.5 ng genomic DNA, 1x GoTaq Buffer (Promega), 1.5 mM MgCl2, 0.2 mM dNTPs, 1 U GoTaq (Promega) and 0.5 μM of each primer. The cycling regime was 95°C/5 min, followed by 35 cycles of 95°C/30 s, 55°C/30 s, 72°C/45 s and a final incubation of 72°C/5 min. Amplification was checked by electrophoresis through a 1.5% agarose gel and quantified using a Beckman Coulter spectrophotometer. Restriction reactions (20 μl) comprised 800 ng amplified DNA, 0.3 U restriction enzyme (New England Biolabs), reaction buffer and BSA according to the manufacturers' specifications, incubated for 4 h at 37°C (except for Taq I, where the incubation temperature was 65°C), after which the reactions were heat inactivated (80°C/10 min). The resulting products were electrophoresed through 2% agarose gels.
The CAPS derived genotypic data were incorporated into a pre-existing data set of 273 molecular loci, mainly AFLP and EST-SSRs, already used to generate the cultivated cardoon genetic map [11, 14, 15] including five SNP from genes underlying caffeoylquinic acids synthesis reported by Comino et al.  and Menin et al. ; all maps data are available on request by the authors. Goodness-of-fit between observed and expected segregation ratios was tested by χ2 and only markers fitting or deviating only marginally from expectation (χ2α = 1 < χ2 ≤ χ2α = 0.01) were included for mapping. Linkage groups (LGs) were established by JoinMap v4.0 software , on the basis of a LOD threshold of 6.0, using as parameter settings Rec = 0.40, LOD = 1.0, Jump = 5. Map distances were converted to centiMorgans (cM) using the Kosambi mapping function. LGs were drawn and aligned using MapChart v2.1 .
This research was supported by grants from: (i) the National Science Foundation Plant Genome Research Program (No. 0421630), (ii) the Georgia Research Alliance, (iii) the University of Georgia Research Foundation, and (iv) by MIPAAF (Ministero delle Politiche Agricole, Alimentari e Forestali - Italy) through the CYNERGIA ("Costituzione e valutazione dell'adattabilita' di genotipi di Cynara cardunculus per la produzione di biomassa e biodiesel in ambiente mediterraneo") project and CARVARVI ("Valorizzazione di germoplasma di carciofo attraverso la costituzione varietale ed il risanamento da virus") project.
- Marie D, Brown S: A cytometric exercise in plant DNA histograms, with 2C values for 70 species. Biol Cell. 1993, 78: 41-51. 10.1016/0248-4900(93)90113-S.View ArticlePubMedGoogle Scholar
- Rottenberg A, Zohary D: The wild ancestry of the cultivated artichoke. Genet Resour Crop Ev. 1996, 43 (1): 53-58. 10.1007/BF00126940.View ArticleGoogle Scholar
- Shimoda H, Ninomiya K, Nishida N, Yoshino T, Morikawa T, Matsuda H, Yoshikawa M: Anti-hyperlipidemic Sesquiterpenes and new sesquiterpene glycosides from the leaves of artichoke (Cynara scolymus L.): Structure requirement and mode of action. Bioorg Med Chem Lett. 2003, 13 (2): 223-228. 10.1016/S0960-894X(02)00889-2.View ArticlePubMedGoogle Scholar
- Gebhardt R: Antioxidative and protective properties of extracts from leaves of the artichoke (Cynara scolymus L) against hydroperoxide-induced oxidative stress in cultured rat hepatocytes. Toxicol Appl Pharm. 1997, 144 (2): 279-286. 10.1006/taap.1997.8130.View ArticleGoogle Scholar
- Gebhardt R: Inhibition of cholesterol biosynthesis in HepG2 cells by artichoke extracts is reinforced by glucosidase pretreatment. Phytother Res. 2002, 16 (4): 368-372. 10.1002/ptr.960.View ArticlePubMedGoogle Scholar
- Schutz K, Kammerer D, Carle R, Schieber A: Identification and quantification of caffeoylquinic acids and flavonolds from artichoke (Cynara scolymus L.) heads, juice, and pomace by HPLC-DAD-ESI/MSn. J Agr Food Chem. 2004, 52 (13): 4090-4096. 10.1021/jf049625x.View ArticleGoogle Scholar
- Ierna A, Mauromicale G: Cynara cardunculus L. genotypes as a crop for energy purposes in a Mediterranean environment. Biomass Bioenerg. 2010, 34 (5): 754-760. 10.1016/j.biombioe.2010.01.018.View ArticleGoogle Scholar
- Raccuia S, Melilli M: Biomass and grain oil yields in Cynara cardunculus L. genotypes grown in a Mediterranean environment. Field Crop Res. 2007, 101 (2): 187-197. 10.1016/j.fcr.2006.11.006.View ArticleGoogle Scholar
- Encinar J, Gonzalez J, Rodriguez J, Tejedor A: Biodiesel fuels from vegetable oils: Transesterification of Cynara cardunculus L. oils with ethanol. Energ Fuel. 2002, 16 (2): 443-450. 10.1021/ef010174h.View ArticleGoogle Scholar
- Encinar J, Gonzalez J, Gonzalez J: Steam gasification of Cynara cardunculus L.: influence of variables. Fuel Proces Technol. 2002, 75 (1): 27-43. 10.1016/S0378-3820(01)00247-8.View ArticleGoogle Scholar
- Lanteri S, Acquadro A, Comino C, Mauro R, Mauromicale G, Portis E: A first linkage map of globe artichoke (Cynara cardunculus var. scolymus L.) based on AFLP, S-SAP, M-AFLP and microsatellite markers. Theor Appl Genet. 2006, 112 (8): 1532-1542. 10.1007/s00122-006-0256-8.View ArticlePubMedGoogle Scholar
- Portis E, Mauromicale G, Mauro R, Acquadro A, Scaglione D, Lanteri S: Construction of a reference molecular linkage map of globe artichoke (Cynara cardunculus var. scolymus). Theor Appl Genet. 2009, 120 (1): 59-70. 10.1007/s00122-009-1159-2.View ArticlePubMedGoogle Scholar
- Sonnante G, Gatto A, Morgese A, Montemurro F, Sarli G, Blanco E, Pignone D: Genetic map of artichoke × wild cardoon: toward a consensus map for Cynara cardunculus. Theor Appl Genet. 2011, 123 (7): 1215-1229. 10.1007/s00122-011-1661-1.View ArticlePubMedGoogle Scholar
- Scaglione D, Acquadro A, Portis E, Taylor C, Lanteri S, Knapp S: Ontology and diversity of transcript-associated microsatellites mined from a globe artichoke EST database. BMC Genomics. 2009, 10: 454-10.1186/1471-2164-10-454.PubMed CentralView ArticlePubMedGoogle Scholar
- Portis E, Acquadro A, Scaglione D, Comino C, Mauromicale G, Mauro R, Taylor C, Knapp S, Lanteri S: Construction of a reference linkage map for globe artichoke. Proceedings of the 53rd Italian Society of Agricultural Genetics Annual Congress Torino, Italy - 16/19 September. 2009, ISBN 978-88-900622-9-2, [http://www.siga.unina.it/SIGA2009/SIGA_2009/1_04.pdf]Google Scholar
- Portis E, Acquadro A, Scaglione D, Mauromicale G, Mauro R, Taylor CA, Knapp SJ, Lanteri S: Construction of an SSR-based linkage map for Cynara cardunculus. 2009, 8th Plant Genomics European MeetingGoogle Scholar
- Miller M, Dunham J, Amores A, Cresko W, Johnson E: Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers. Genome Res. 2007, 17: 240-248. 10.1101/gr.5681207.PubMed CentralView ArticlePubMedGoogle Scholar
- Baird N, Etter P, Atwood T, Currey M, Shiver A, Lewis Z, Selker E, Cresko W, Johnson E: Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers. Plos One. 2008, 3 (10): e3376-10.1371/journal.pone.0003376.PubMed CentralView ArticlePubMedGoogle Scholar
- Pfender WF, Saha MC, Johnson EA, Slabaugh EA: Mapping with RAD (restriction-site associated DNA) markers to rapidly identify QTL for stem rust resistance in Lolium perenne. Theor Appl Genet. 2011, 122 (8): 1467-1480. 10.1007/s00122-011-1546-3.View ArticlePubMedGoogle Scholar
- Chutimanitsakun Y, Nipper R, Cuesta-Marcos A, Cistue L, Corey A, Filichkina T, Johnson E, Hayes P: Construction and application for QTL analysis of a Restriction Site Associated DNA (RAD) linkage map in barley. BMC Genomics. 2011, 12: 4-10.1186/1471-2164-12-4.PubMed CentralView ArticlePubMedGoogle Scholar
- Barchi L, Lanteri S, Portis E, Acquadro A, Vale G, Toppino L, Rotino GL: Identification of SNP and SSR markers in eggplant using RAD tag sequencing. BMC Genomics. 2011, 12: 304-10.1186/1471-2164-12-304.PubMed CentralView ArticlePubMedGoogle Scholar
- Hohenlohe P, Amish S, Catchen J, Allendorf F, Luikart G: Next-generation RAD sequencing identifies thousands of SNPs for assessing hybridization between rainbow and westslope cutthroat trout. Mol Ecol Res. 2011, 11: 117-122.View ArticleGoogle Scholar
- Kaul S, Koo H, Jenkins J, Rizzo M, Rooney T, Tallon L, Feldblyum T, Nierman W, Benito M, Lin X, et al: Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000, 408: 796-815. 10.1038/35048692.View ArticleGoogle Scholar
- Jaillon O, Aury J, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, et al: The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature. 2007, 449: 463-467. 10.1038/nature06148.View ArticlePubMedGoogle Scholar
- Etter PD, Preston JL, Bassham S, Cresko WA, Johnson EA: Local De Novo Assembly of RAD Paired-End Contigs Using Short Sequencing Reads. Plos One. 2011, 6 (4): e18561-10.1371/journal.pone.0018561.PubMed CentralView ArticlePubMedGoogle Scholar
- Baxter SW, Davey JW, Johnston JS, Shelton AM, Heckel DG, et al: Linkage Mapping and Comparative Genomics Using Next-Generation RAD Sequencing of a Non-Model Organism. PLoS One. 2011, 6 (4): e19315-10.1371/journal.pone.0019315.PubMed CentralView ArticlePubMedGoogle Scholar
- Willing EM, Hoffmann M, Klein JD, Weigel D, Dreyer C: Paired-end RAD-seq for de-novo assembly and marker design without available reference. Bioinformatics. 2011, 27 (16): 2187-2193. 10.1093/bioinformatics/btr346.View ArticlePubMedGoogle Scholar
- Palmer LE, Rabinowicz PD, O'Shaughnessy AL, Balija VS, Nascimento LU, Dike S, de la Bastide M, Martienssen RA, McCombie WR: Maize Genome Sequencing by Methylation Filtration. Science. 2003, 302 (5653): 2115-2117. 10.1126/science.1091265.View ArticlePubMedGoogle Scholar
- Chor B, Horn D, Goldman N, Levy Y, Massingham T: Genomic DNA k-mer spectra: models and modalities. Genome Biol. 2009, 10: 108-10.1186/gb-2009-10-6-108.View ArticleGoogle Scholar
- Slotkin R, Vaughn M, Borges F, Tanurdzic M, Becker J, Feijo J, Martienssen R: Epigenetic reprogramming and small RNA silencing of transposable elements in pollen. Cell. 2009, 136 (3): 461-472. 10.1016/j.cell.2008.12.038.PubMed CentralView ArticlePubMedGoogle Scholar
- Lisch D: Epigenetic regulation of transposable elements in plants. Annual Review of Plant Biology. 2009, 60: 43-66. 10.1146/annurev.arplant.59.032607.092744.View ArticlePubMedGoogle Scholar
- Karlin S, Mrazek J: Compositional differences within and between eukaryotic genomes. P Natl Acad Sci USA. 1997, 94 (19): 10227-10232. 10.1073/pnas.94.19.10227.View ArticleGoogle Scholar
- Shulaev V, Sargent D, Crowhurst R, Mockler T, Folkerts O, Delcher A, Jaiswal P, Mockaitis K, Liston A, Mane S, et al: The genome of woodland strawberry (Fragaria vesca). Nat Genet. 2011, 43: 109-116. 10.1038/ng.740.PubMed CentralView ArticlePubMedGoogle Scholar
- Velasco R, Zharkikh A, Troggio M, Cartwright D, Cestaro A, Pruss D, Pindo M, FitzGerald L, Vezzulli S, Reid J, et al: A high quality draft consensus sequence of the genome of a heterozygous grapevine variety. Plos One. 2007, 2 (12): e1326-10.1371/journal.pone.0001326.PubMed CentralView ArticlePubMedGoogle Scholar
- Jiang D, Ye Q, Wang F, Cao L: The mining of citrus EST-SNP and its application in cultivar discrimination. Agricultural Sciences in China. 2010, 9 (2): 179-190. 10.1016/S1671-2927(09)60082-1.View ArticleGoogle Scholar
- Portis E, Mauromicale G, Barchi L, Mauro R, Lanteri S: Population structure and genetic variation in autochthonous globe artichoke germplasm from Sicily Island. Plant Sci. 2005, 168 (6): 1591-1598. 10.1016/j.plantsci.2005.02.009.View ArticleGoogle Scholar
- Stacey K, Young G, Clark F, Sester D, Roberts T, Naik S, Sweet M, Hume D: The molecular basis for the lack of immunostimulatory activity of vertebrate DNA. J Immunol. 2003, 170: 3614-3620.View ArticlePubMedGoogle Scholar
- Lanteri S, Di Leo I, Ledda L, Mameli M, Portis E: RAPD variation within and among populations of globe artichoke cultivar 'Spinoso sardo'. Plant Breeding. 2001, 120 (3): 243-246. 10.1046/j.1439-0523.2001.00605.x.View ArticleGoogle Scholar
- Zerbino D, Birney E: Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008, 18: 821-829. 10.1101/gr.074492.107.PubMed CentralView ArticlePubMedGoogle Scholar
- Conesa A, Gotz S, Garcia-Gomez J, Terol J, Talon M, Robles M: Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005, 21 (18): 3674-3676. 10.1093/bioinformatics/bti610.View ArticlePubMedGoogle Scholar
- Marcais G, Kingsford C: A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics. 2011, 27 (6): 764-770. 10.1093/bioinformatics/btr011.PubMed CentralView ArticlePubMedGoogle Scholar
- Li H, Ruan J, Durbin R: Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 2008, 18 (11): 1851-1858. 10.1101/gr.078212.108.PubMed CentralView ArticlePubMedGoogle Scholar
- Thiel T, Kota R, Grosse I, Stein N, Graner A: SNP2CAPS: a SNP and INDEL analysis tool for CAPS marker development. Nucleic Acids Res. 2004, 32 (1): e5-10.1093/nar/gnh006.PubMed CentralView ArticlePubMedGoogle Scholar
- You F, Huo N, Gu Y, Luo M, Ma Y, Hane D, Lazo G, Dvorak J, Anderson O: BatchPrimer3: A high throughput web application for PCR and sequencing primer design. Bmc Bioinformatics. 2008, 9: 253-10.1186/1471-2105-9-253.PubMed CentralView ArticlePubMedGoogle Scholar
- Comino C, Hehn A, Moglia A, Menin B, Bourgaud F, Lanteri S, Portis E: The isolation and mapping of a novel hydroxycinnamoyltransferase in the globe artichoke chlorogenic acid pathway. BMC Plant Biol. 2009, 9: 30-10.1186/1471-2229-9-30.PubMed CentralView ArticlePubMedGoogle Scholar
- Menin B, Comino C, Moglia A, Dolzhenko Y, Portis E, Lanteri S: Identification and mapping of genes related to caffeoylquinic acid synthesis in Cynara cardunculus L. Plant Sci. 2010, 179 (4): 338-347. 10.1016/j.plantsci.2010.06.010.View ArticleGoogle Scholar
- Van O: JoinMap® 4: software for the calculation of genetic linkage maps in experimental populations. 2006, Wageningen: Kyazma B.VGoogle Scholar
- Voorrips R: MapChart: Software for the graphical presentation of linkage maps and QTLs. J Hered. 2002, 93 (1): 77-78. 10.1093/jhered/93.1.77.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.