- Research article
- Open Access
High-throughput SNP genotyping in Cucurbita pepo for map construction and quantitative trait loci mapping
BMC Genomics volume 13, Article number: 80 (2012)
Cucurbita pepo is a member of the Cucurbitaceae family, the second- most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species.
The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL).
We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations.
Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research, especially considering that most of these markers are located in the coding regions of genes involved in different physiological processes. The platform will also be useful for future mapping and diversity studies, and will be essential in order to accelerate the process of breeding new and better-adapted squash varieties.
The Cucurbita genus, of American origin, is one of the most variable genera within the Cucurbitaceae family (reviewed by Esteras et al. ). C. pepo L. (2 n = 40), the most economically important crop of this genus , displays eight commercial morphotypes grouped into two subspecies (subsp. pepo L.: Pumpkin, Vegetable Marrow, Cocozelle and Zucchini; subsp. ovifera (L.) Decker (syn subsp. texana (Scheele) Filov): Scallop, Acorn, Crookneck and Straightneck). The main economic value of the species resides in the consumption of its immature fruits as vegetables, commonly known as summer squashes. Summer squashes of the Zucchini type rank among the highest-valued vegetables worldwide, whereas the "winter squash" types (fruits consumed when mature) of C. pepo and related Cucurbita spp. are food staples and rich sources of fat and vitamins in developing countries .
Despite its economic importance, there are few genomic tools available for Cucurbita, unlike other cucurbits, such as watermelon (Citrullus lanatus (Thunb.) Matsum & Nakai), cucumbers (Cucumis sativus L.) and melons (Cucumis melo L.), for which new mapping populations, dense genetic maps [4–7], microarrays [8, 9], reverse genetics platforms [10, 11], transcriptomes [12, 13] and even whole genome sequences have already been generated [14–16]. Many of these resources are available at the database maintained by the International Cucurbit Genomics Initiative (ICuGI, ) and are being successfully employed by cucurbit researchers to study gene functions and their related polymorphisms.
High-throughput sequencing technologies, mainly Roche 454 and Illumina GA , are contributing to filling this gap for non-model crops, thereby allowing the rapid generation of sequence information, even in species about which there is little prior knowledge. One of the most interesting applications of massive sequencing is the large-scale discovery of genetic variants that can be converted into genetic markers, mainly microsatellites or Simple Sequence Repeats (SSR) and Single Nucleotide Polymorphisms (SNP) . SSR and SNP are now the predominant markers in plant genetic analysis. The first transcriptome of C. pepo was recently generated using 454 GS FLX Titanium technology. A total of 49,610 unigenes were assembled from 512,751 new EST (Expressed Sequence Tags) and used to generate the first large collection of EST-derived SSR and SNP in this species . SNP are abundant in the genomes, and are stable, amenable to automation and increasingly cost-effective, and are therefore fast becoming the marker system of choice in modern genomics research. SSR, however, continue to be widely used in studies with no need for automation due to their co-dominant and multiallelic nature.
A practical way of optimizing large SNP collections is that of using them with cost-effective platforms for medium- to high-density genotyping. A large number of commercial platforms for SNP genotyping are currently available (reviewed by Gupta et al. ). The Illumina GoldenGate assays that genotype 384, 768 or 1,536 SNP in parallel have been the most widely used for mid-throughput applications . This genotyping technique has been used extensively in humans  and several animal species [24–26]. SNP platforms are also available for several plant species, made up mostly of cereals, legumes and conifers [27–35]. One of their main applications is the rapid development and saturation of genetic maps [36, 37].
Dense genetic maps are necessary tools for efficient molecular breeding. They are particularly useful for quantitative trait loci (QTL) mapping and for the development of new high-quality mapping populations, such as introgression line libraries [38, 39]. Four genetic maps have been reported in the Cucurbita genus to date. The first two maps were constructed using a population derived from an inter-specific cross between C. pepo x C. moschata Duchesne, a closely related species, with Random Amplified Polymorphic DNA (RAPD) markers [40, 41]. Two maps were subsequently produced from two intra-specific crosses, one using a cross between the oil-seed Pumpkin × Zucchini "True French" varieties (both of which belong to C. pepo subsp. pepo), and the other using a C. pepo subsp. pepo x C. pepo subsp. ovifera cross (oil-seed Pumpkin × Italian Crookneck, respectively). These maps consisted mainly of RAPD and Amplified Fragment Length Polymorphisms (AFLP) [42, 43]. These markers are dominant and cannot be transferred readily to other populations. The first collection of SSR markers was recently produced from genomic libraries in Cucurbita by Gong et al. . Part of this collection, consisting of 178 SSR, was used to increase the density of the Pumpkin × Crookneck map and also to study macrosynteny with C. moschata. Before the study by Blanca et al. , no SNP were available for the species, which is why these markers have not previously been used for mapping purposes.
Even though nearly one hundred major genes controlling different aspects of Cucurbita biology have been described , most have not been mapped. The available maps only include a few monogenic traits and have not yet been efficiently used for QTL mapping. There is a growing need for generating new maps with more informative and transferable markers that are amenable to large-scale genotyping. Markers linked to traits of interest are necessary for molecular breeding in these species, mainly in the Zucchini type, which by far dominates the squash market and the breeding efforts of seed companies. The current availability of a collection of 19,980 EST-SNP, located mostly in gene-coding regions , will facilitate map development with functional markers.
In this study, we used a set of 9,043 EST-SNP that were detected in silico by Blanca et al. , and which are suitable for detecting polymorphism between two main commercial types of C. pepo (Zucchini and Scallop) that have contrasting vine, flowering and fruit phenotypes, in order to develop an Illumina GoldenGate 384-SNP platform. This platform was employed to build the first SNP-based genetic map with an F2 population (Zucchini × Scallop) and to detect QTL for the very first time. The genotyping platform and the genetic map are invaluable new tools for molecular breeding in Cucurbita.
An F2 population of 146 plants derived from the C. pepo subsp. pepo var. Zucchini MU-CU-16 × C. pepo subsp. ovifera var. Scallop UPV-196 cross was used to generate the linkage map. These are the same parental genotypes that were previously employed to generate the first C. pepo transcriptome . Both represent the main summer squash cultivar groups of each subspecies, and have contrasting phenotypes for vine, flowering and fruit traits (Figure 1). Four F1 plants and several individuals of each backcross generation to MU-CU-16 (BCZ, 30) and to UPV-196 (BCS, 30) were also included in the assay. In order to check if the selected SNP might also be useful for genetic diversity studies and genotyping in other mapping populations, a panel of seven accessions of C. pepo, including representatives of the four morphotypes of the subspecies pepo (two Zucchini landraces from southern Spain, MU-20 and E-27; one Vegetable Marrow from Morocco, AFR-12; one Spanish Cocozelle landrace, V112; and two Pumpkin accessions, Styrian Pumpkin and the Mexican landrace, CATIE 18887) and one morphotype of the subspecies ovifera (the cultivar Early Summer Crookneck) were included in the genotyping assay. One accession of the related species, C. moschata, was also genotyped (the Spanish landrace AN-45). All these accessions belong to the Cucurbita core collection of the Cucurbits Breeding Group of the Institute for the Conservation and Breeding of Agricultural Biodiversity (COMAV) [47, 48] except for CATIE 18887, which was kindly provided by the Genebank of the Centro Agronómico Tropical de Investigación y Enseñanza (CATIE) in Costa Rica.
Total DNA was extracted from young leaves using the CTAB method , with minor modifications. To improve the quality of the obtained DNA, 70% ethanol containing 15 mM ammonium acetate was used in the last wash, and the DNA was treated with RNase. DNA concentrations in TE buffer were adjusted to 50 ng/μl, with the PicoGreen fluorescence being measured on an ABI7900 apparatus (Applied Biosystems). Samples were sent for genotyping to the Centro Nacional de Genotipado (CEGEN-ISCIII, CRG-Node, Barcelona), a high-throughput genotyping service. A minimum of 200 ng of DNA were used for SNP genotyping.
SNP selection for the GoldenGate platform
Using the first C. pepo transcriptome  as a reference, a collection of 512,751 C. pepo EST, generated using 454 pyrosequencing, from the two genotypes used as the parentals for the mapping population (Zucchini MU-CU-16 and Scallop UPV-196), was mined for SNP. This screening yielded a total of 19,980 putative SNP and 1,174 INDEL, distributed in 8,147 unigenes. Using the different filters established in , we selected a set of markers that, in silico, were monomorphic within and polymorphic between the two sequenced genotypes and suitable for genotyping with the Illumina GoldenGate system. Only SNP were selected, as the INDEL were discarded . Sequences with more than 4 SNP or INDEL per 100 bp were discarded (using filter HVR4) to avoid SNP located in hypervariable regions. This selection was intended to reduce false polymorphisms caused by the alignment of paralogs, a potentially significant problem when aligning short sequence reads. To facilitate their use in a GoldenGate genotyping assay, we also discarded those SNP that were closer than 60 bp to another SNP or INDEL, to an intron or to the unigene edge (filtering them out with CS60, I60 and CL60, respectively). Only SNP with two or more reads per allele were selected, since our previous experience with in silico-detected SNP in melon  indicated that putative SNP with only one read in one allele have a low percentage of validation (even when the quality of the sequenced nucleotide is high). Blanca et al.  annotated the unigene collection using the Blast2GO package , which assigns Gene Ontology (GO) terms based on the BLAST definition. We used this annotation to prioritize the selection of SNP located in the Open Reading Frame regions (ORF) of annotated unigenes (with GO terms and significant BLAST in the Swiss-Prot, Arabidopsis org or Uniref90 databases [51–53] and with orthologs of Arabidopsis and/or melon). A set of SNP that generate allele-specific restriction targets, with the possibility of being detected via Cleaved Amplified Polymorphic Sequences (CAPS), was also included, even though they did not meet some of the aforementioned requirements.
The GO terms were reclassified into different functional groups based on a set of GO slims in the Molecular Function and Biological Process categories in order to provide a broad overview of the ontology content of the final platform.
The sequence of each selected locus, including the polymorphic nucleotide and a 60 bp flanking sequence, was submitted to the Illumina Assay Design Tool (ADT) (Illumina, San Diego, CA), and designability scores were used for final marker selection. These scores ranged from 0 to 1.0, where a score of > 0.6 means a high success rate for the conversion of an SNP into a successful GoldenGate assay. On the basis of these scores, a final set of 384 SNP was selected, which was predicted to have a high likelihood of success. The GoldenGate genotyping assay was conducted as described elsewhere [22, 28, 33].
To summarize, three primers were designed for each locus. Two were allele-specific oligos (ASOs), complementary to the sequence directly adjacent to the SNP, only differing at the 3' base complementary to each allele. The third primer was a locus-specific oligo (LSO), which hybridizes to the complementary sequence located downstream of the target SNP. The three oligos had three universal primers attached at the 5' end. Each locus-specific oligo also had an "IllumiCode" sequence complementary to the array. The sequence of each locus and the 1,152 custom oligos, three at each of the 384 different SNP loci, are listed in Additional File 1: " Sequence and primers for genotyping the 384 SNP included in the GoldenGate platform".
After DNA hybridization, an extension and ligation step was performed connecting each allele-specific oligo with the locus-specific oligo. A PCR step was then conducted for all 384 loci using common universal primers. The GoldenGate assay was deployed on the BeadXpressR platform using VeracodeR technology (Illumina, San Diego, CA) . The PCR products, labeled Cy3 or Cy5 depending on the allele, were hybridized to glass Veracode micro-beads, each bearing a locus-specific barcode via the corresponding Illumicode sequence. Then, each SNP was identified by its IllumiCode and alleles were discriminated by their fluorescent signals on a Veracode BeadXpress Reader .
The automatic allele calling for each locus was accomplished using the GenomeStudio software (Illumina, San Diego, CA). The clusters were manually edited when necessary.
SSR selection and amplification
A set of 25 genomic SSR (gSSR), evenly distributed in the previously published map constructed by Gong et al.  using the F2C. pepo subsp. pepo oil-Pumpkin variety "Lady Godiva" × C. pepo subsp. ovifera Crookneck variety "Bianco Friulano", were selected to be used as anchors between both maps. Information about the selected SSR is included in Additional File 2: " Primers for genotyping the SSR included in the map".
PCR reactions were carried out in a total volume of 15 μl in PCR buffer 1× (75 mM Tris-HCl pH 9, 20 mM (NH4)2SO4, 50 mM KCl), 3 mM MgCl2, 200 μM each dNTP, 0.15 μM each primer, 0.2 μM M13 IRDye700/800 (LI-COR. Lincoln, Nebraska) tagged tail, 0.35 U Taq DNA Polymerase (Biotools B&M Labs, S.A., Madrid, Spain) and 10-15 ng DNA. Forward primers were designed with an added M13 tail sequence at their 5' end. The thermal profile was the following for all the loci: 3 min denaturation at 95°C, 10 cycles of 30 s at 95°C, 30 s at 65°C (decreasing 1°C every cycle) and 30 s at 72°C, 20 cycles of 30 s at 95°C, 30 s at 55°C and 30 s at 72°C with a final extension of 5 min at 72°C. A LICOR 4300 analyzer was employed to visualize SSR-allele size differences on a denaturing polyacrylamide gel, loading a 1/10 or 1/20 dilution in formamide.
Linkage analysis and map construction
The genetic map was constructed using the genotyping results for the F2 Zucchini × Scallop mapping population, obtained with the new 384-SNP GoldenGate platform and the anchor SSR. Segregation distortion at each marker locus was tested against the expected ratio for F2 (1:2:1) using a χ2 test. The linkage map was generated with MAPMAKER/EXP version 3.0b . Markers were associated with the "group" command with LOD> 4. Markers within groups were ordered using the "order" command. Distances in centiMorgans (cM) were calculated from the recombination frequencies using the Kosambi mapping function . The remaining markers were then located with the "try" command. The map was drawn with MapChart version 2.1 .
Synteny with cucumber
The colinearity of the C. pepo genetic map with the cucumber genome was evaluated by doing a BLAST search of the unigenes corresponding to every C. pepo SNP against the cucumber genome. The FASTA sequence of this genome was downloaded from the ICuGI database . The hits obtained in the tBLASTx search of the C. pepo unigenes against the cucumber genome were considered to be significant if they had an e-value above 10-6. The locations of these significant hits were plotted in a scatter plot in which one axis represented the cucumber genome and the other the C. pepo map. The processing of the BLAST results was carried out with a custom Python script that is available upon request.
All F2, BCZ and BCS plants were cultivated in a greenhouse with a fully randomized experimental design (February to July, 2010) and extensively phenotyped. Five plants of each parental and the F1 generation were also included in the assay. Fifty traits were measured for each single plant, and twelve were scored visually (Table 1). Vine traits were related to plant color, length and branching intensity, and flowering traits were related to the flowering time and male/femaleness tendency. Each plant was selfed and two fruits per plant were analyzed. One fruit per plant was analyzed when immature, 7 days after pollination, which corresponds to the commercial state of summer squashes. The second fruit per plant was analyzed at physiological maturity (ranging from 20 to 60 days after pollination). Traits measuring fruit size, shape, texture, firmness, rind and flesh color, sugar content and acidity were analyzed. More details about quantitative and qualitative traits are included in Table 1. Correlations between pairs of traits were estimated by using the Pearson correlation coefficient.
QTL for quantitative traits were analyzed by composite interval mapping with Windows QTL Cartographer 2.5  using the developed genetic map and the stepwise forward regression procedure with a walking speed of 1 cM, a window size of 15 cM and the inclusion of up to 5 maximum background marker loci as QTL cofactors. The LOD threshold for a Type I error P < 0.05 value was calculated by a permutation test  implemented in Windows QTL Cartographer with 1,000 permutations independently for each trait. Additive and dominant QTL effects (a and d, respectively), the degree of dominance (d/[a]) and the proportion of phenotypic variance explained by QTL (R2) were estimated at the highest peaks depicted by the QTL analysis.
A positive value of additive effects (positive a) indicates that the Zucchini allele increases the trait, and, conversely, a negative value indicates the Scallop allele increases the trait. For positive a values, positive values of d indicate that the Zucchini allele is dominant, whereas negative values indicate that the Scallop allele is dominant. Conversely, for negative a values, positive and negative d values indicate dominance of the Scallop and Zucchini alleles, respectively.
In order to validate the QTL effects and the utility of the linked markers for breeding purposes (Marker-Assisted Selection, MAS), genotypic and phenotypic data of the two backcross populations, BCZ and BCS, were analyzed for the detected QTL. ANOVA analysis conducted using the SPSS v. 16.0 software was employed to detect significant differences in the average value of homozygous backcross individuals (Zucchini, a, or Scallop, b) versus heterozygous individuals (h) for the markers located within or near the 1-LOD interval for the QTL. In those traits displaying QTL confirmed in the backcrosses, broad-sense heritabilities were estimated as described by Wright : H2 = [VF2 - (0.25VZ + 0.25VS + 0.5VF1)]/VF2, where V is the variance of F2, Zucchini (Z), Scallop (S) and F1 populations, respectively.
For the QTL analysis, the qualitative traits were coded as dummy variables, absent (0) or present (1), and analyzed with the Qgene v. 4.3.9 software  using composite interval mapping analysis and conducting 1,000 permutations to calculate the LOD threshold value for P < 0.05 using a resampling test. In order to confirm the observed linkage with the flanking markers, a contingency χ2 test was conducted in those cases in which significant LOD values were found. A null hypothesis (H0) of independence of frequency between a trait (scored as 0-1) and the marker (genotyped as homozygous or heterozygous) was checked for the F2, with an error type I rate of α = 0.05 and 2 degrees of freedom (df). BCZ and BCS populations were also used for validating QTL effects in qualitative traits. We checked the frequency of the corresponding category in each group of individuals classified according to their genotype for the corresponding linked markers. The association between trait categories and linked markers was also checked using the Fisher exact probability test, as the number of individuals was too low [63, 64]. P was calculated as the probability of the observed array of cell frequencies plus the sum of the probabilities of all other cell-frequency arrays that are smaller than the probability of the observed array. H0 of independence was rejected when P < 0.05.
Information on those QTL for quantitative and qualitative traits that were validated in the backcrosses was also included in the MapChart file to obtain a more complete map of the species.
Results and discussion
Design of the 384-SNP GoldenGate genotyping platform
Of the 19,980 SNP identified in silico, 9,043 were monomorphic within and polymorphic between the two parents of the map (Zucchini MU-CU-16 and Scallop UPV-196) (Figure 1) and were not located in highly variable regions (filtered out with HVR4). A total of 3,538 of these high-confidence SNP met the criteria for high-throughput genotyping platforms, i.e., being absent of any other known SNP in their vicinity and having enough sequence information up- and downstream of the SNP (filtered out with CS60, CL60 and I60). A preliminary set of 713 SNP, located in different unigenes, was selected, prioritizing SNP in long unigenes with well-defined functions. Designability scores were then given to each locus using the Illumina ADT. Only SNP with scores of > 0.6 were selected. Sequences and primers of the finally selected SNP collection are included in Additional File 1. The Illumina scores and annotation details of the corresponding unigenes are also described in Additional File 3: "Annotation data and map position of the 384 loci included in the GoldenGate platform".
The final set of the 384 SNP included in the GoldenGate platform had a mean designability score of 0.89. The average length of the selected unigenes was 1,057 bp (ranging from 398 to 2,336). These unigenes were previously annotated . Most SNP (367, 95.6%) were located in the ORF of the corresponding unigene, with only 17 in the untranslated regions (UTR).
Blanca et al.  functionally classified the unigenes following the Gene Ontology (GO) scheme. Only 24 of the 384 selected unigenes (6.25%) with SNP could not be assigned to any GO term. We used the GO annotations to assign most unigenes (360, 93.8%) to a set of GO slims in the Biological Process and Molecular Function categories (Additional File 4: "Number of unigenes in each functional category"). The GO annotations for the unigenes showed a fairly consistent sampling of functional classes, indicating that these SNP markers represent genes with different molecular functions and that they are involved in various different biological processes. Cellular, metabolic, biosynthetic and developmental processes were among the most highly represented groups under the Biological Process category (Additional File 4). Other abundant assignments were transcriptional regulation, translation, signal transduction, transport and oxidation-reduction functions. Stimulus, stress- and defense-responsive genes were also well represented. Genes involved in other important biological processes, such as growth, ripening and hormone-signaling processes were included. Some of these genes might play a role in the response to diseases, floral sex determination and fruit development and quality. Under the Molecular Function GO hierarchy (Additional File 4), assignments were mainly to catalytic and binding activities. A large number of hydrolases, kinases and transferases, representing genes involved in the secondary metabolite synthesis pathways, were also included. Transcription and translation factors were also well-represented.
The putative orthologs of all the unigenes were identified  by doing a reciprocal BLAST search of the Arabidopsis and melon databases [52, 17]. Most unigenes selected for the GoldenGate platform had an Arabidopsis ortholog (236, 61.4%) and/or a melon ortholog (228, 59.4%). Only 22.4% had no orthologs. GO terms, gene description and a list of the identified orthologs are included in Additional File 3.
Genotyping results: allele call and polymorphism
The GoldenGate genotyping assay was carried out successfully, with 90.1% of the SNP successfully genotyped taking into account both monomorphic and polymorphic SNP. Only 38 of the 384 SNP included in the platform failed to give a clear genotype. Fifteen and sixteen SNP could not be analyzed due to the absence of or low cluster separation, respectively; five displayed more than three clusters and two had low intensities, according to quality Veracode genotyping (Additional File 3). The absence of cluster separation might be the result of a non-allele-specific match of the primers. Likewise, the existence of more than two alleles and/or the amplification of a non-unique genomic region might be the cause of the existence of more than three clusters.
The average designability score values for failed markers was significantly lower than that of the successful markers (0.86 versus 0.89, P < 0.05), but all scores were > 0.6, which is considered to be the optimal threshold for a GoldenGate assay. The percentage of failed markers with only 2 reads in one or both alleles (according to the sequencing results, ) was higher compared to that of the successful SNP (65.8% versus 43.6%).
All in all, a total of 346 SNP were classified as successful assays. Similar success rates have been reported in soybean , barley , maize  and pea . All of these markers amplified in nearly all the accessions of C. pepo. Knowing their polymorphism in diverse germplasm could help to determine their usefulness in future genetic diversity studies or mapping efforts.
One hundred and ninety-six SNP detected variation among the morphotypes of C. pepo subsp. pepo (Zucchini, Vegetable Marrow, Cocozelle and Pumpkin), making them useful for genetic diversity studies or for mapping purposes using intra-subspecific crosses. Eighty-two detected variability among the assayed Zucchini types. Zucchini is by far the most important commercial type of summer squash and at the same time the most recently developed and the least variable. Therefore, markers detecting variability within this culti-group could be of interest for cultivar fingerprinting. Fifty-nine SNP detected variation between the two accessions of C. pepo subsp. ovifera (Crookneck and Scallop) and seventy-eight markers yielded alleles exclusive to only one subspecies. The latter could be of interest for mapping purposes using inter-subspecific crosses. We included two accessions in our assay that belong to the morphotypes of the map parentals of Gong et al.  (Styrian Pumpkin and Early Summer Crookneck). Two hundred and fourteen markers were polymorphic between them, and may be used to increase the density of that map, connecting both maps with common markers.
In addition, 305 SNP (79%) were amplified in the C. moschata accession. This is an interesting result as most of the platform's markers could potentially be used in introgression programs aimed at transferring traits from C. moschata into C.pepo. The only previous set of markers that proved to be transferable between C. pepo and C. moschata was a set of 76 genomic SSR used to perform the first macrosynteny studies between the two species . Our set of functional markers will be useful for further macrosynteny studies with this species and for the marker-assisted selection of traits introgressed from C. moschata into C. pepo.
Details about the polymorphism detected by each SNP are included in Additional File 3. In order to facilitate the future application of this marker set, information about possible detection via CAPS is also provided. Sixty-two of the 384 SNP affected restriction targets and could be easily assayed as CAPS.
The 384-SNP set was selected in silico for being polymorphic between Zucchini MU-CU-16 and Scallop UPV-196 . Of the 346 successfully called SNP, 330 were polymorphic between these genotypes, yielding 3 clear clusters representing the two homozygous plus the heterozygous genotypes. Sixteen did not show polymorphism between the parentals. This could be explained by artifacts generated during the sequencing process. However, the lack of polymorphism could also be explained by the incapacity of this technique to discriminate an SNP at this locus, for example, because of the lack of amplification of one allele due to polymorphism in the priming site. In order to reduce false SNP, we only selected SNP with two or more reads per allele. Most of the monomorphic markers have only 2 reads in one or both alleles (81.3%). These results suggest that a higher number of reads per allele is a good criterion for selecting true SNP from in silico-mined collections.
The microsatellite transferability rate from the previous Cucurbita map  to our mapping population proved to be low, as only 17 out of 25 SSR (68%) amplified, and only 11 (44%) could be mapped. SSR that displayed several amplification problems, such as nonspecific amplification (CMTp235 and CMTp245) and preferential amplification of the MU-CU-16 allele (CMTp86, CMTp188, CMTp47, CMTp256, CMTp33 and CMTp208), were discarded. The preferential amplification of the MU-CU-16 parental is consistent with the origin of these genomic SSR, which were developed from a genomic library derived from a Pumpkin genotype (subsp. pepo). Four of the 17 amplified markers were monomorphic in our parentals and two resulted in a distorted F2 segregation. Details of the SSR results are included in Additional File 2. SSR are multiallelic markers, easily used by single PCR. However, SSR genotyping cannot be automated and the analysis of large populations is still time-consuming. This makes SNP the preferred markers for different high-throughput genotyping purposes.
Genetic map of the Zucchini × Scallop population
We were able to successfully map 304 of the 330 SNP that were polymorphic between the parentals. A set of 26 validated markers was discarded for mapping, either because the SNP did not show the three genotypic classes in the F2 or because one of the parents was heterozygous. The genetic map was constructed using a set of 315 high-quality markers (304 SNP and 11 SSR) using MAPMAKER at a LOD score of 4 (Figures 2, 3, 4, 5 and 6). The MU-CU-16 × UPV-196 genetic map covered 1,740.8 cM and was divided into 22 major linkage groups (LGs) and a minor group (LG23, with only 2 markers, 1.1 cM), with an average of 6.02 ± 6.65 cM between markers. The maximum gap between markers was 30.3 cM in LG13. Two SNP, C007167 and C008395, remained unlinked.
The total number of markers included in major LGs varied from 5 in LG17 to 31 in LG2. Apart from LG23, only three groups contained less than eight markers (LG17, LG19 and LG22), with the markers being more or less evenly distributed among and within each LG group. LG length ranged from 12.2 cM in LG22 to 173.8 cM in LG2. On average, a linkage group covered 79.1 ± 34.7 cM and contained 14.1 ± 5.8 markers, resulting in an average map density of 5.56 ± 1.70 cM/marker. Less coverage was presented herein in comparison to the previous map for the species (1,936 cM and a density of 2.9 cM/marker) . However, the previous map was mainly constructed with dominant, non-transferable RAPD or AFLP. Of the 659 loci mapped, only 178 correspond to co-dominant SSR, which appeared unevenly distributed across the genome. Our results with the transferability of these markers have also been very low. The SNP-based map presented here is the first to include high-quality markers amenable to automation in the genus Cucurbita, many of which are putatively transferable to other populations and even to other species, and most of which are in fully annotated genes involved in diverse biological processes. In addition, distances have been reported not to be comparable between different software, as Joinmap lengths of the individual linkage groups are usually shorter than those obtained with MAPMAKER [66, 67, 44].
Distorted segregation was observed in 30 SNP and 2 SSR, a larger number than in the Pumpkin × Crookneck cross [43, 44], but lower than that reported in maps constructed from interspecific crosses . Grouped markers were especially observed in LG2 and LG5 (Figures 2 and 3).
Using the microsatellites as anchors to the previous Cucurbita map, it was possible to associate the linkage groups of both maps: LG2, LG5, LG7, LG8, LG9, LG12, LG14, LG16, LG18, LG21 and LG23 correspond to groups LGp2, 6, 9, 14, 3a, 18, 4, 10a, 8, 10a and 15 from Gong et al. , respectively. In the previous map, CMTp145 and CMTp66 mapped in the same group (10a) at LOD 3, but in this study, they appear associated with different groups (LG16 and LG21). In the future, newly developed SNP will have to be mapped to improve the map saturation and obtain the 20 expected linkage groups, merging some of those that are less represented in the current map.
The distorted segregation found in LG2 was not reported in the corresponding LGp2 , even though only three markers were mapped in this linkage group and the anchor SSR mapped in LG2 is out of this area. Scallop alleles were over-represented, suggesting that the alleles in this region may be subject to gametic or zygotic selection and/or related to preferential germination or better seedling viability. Different functions were associated with the distorted markers (Additional File 3). Some of these unigenes may be the cause of the segregation distortion, but it could also be the result of linkage to other genes.
Synteny with cucumber
Three hundred of the 304 mapped unigenes, yielded significant tBLASTx hits (threshold e-value of 10-6) and were assigned to the cucumber chromosomes. Figure 7 shows the colinearity between the genomes of the two species, C. sativus and C. pepo; details about the position of the unigenes in the cucumber genome are also included in Additional File 3. We found syntenic blocks between most of the C. pepo linkage groups and C. sativus chromosomes.
Syntenic studies in the family Cucurbitaceae have been conducted with the two main cultivated species of the Cucumis genus: cucumber (2n = 14) and melon (2n = 24). Recent studies, using common markers and the whole genome sequence of cucumber, have shown that colinearity exists between cucumber and melon, indicating that chromosome fusions and other complex structural changes have generated cucumber chromosomes from a progenitor species with 2n = 24 . We also found a high level of colinearity between C. pepo and the cucumber genome. Some Cucurbita linkage groups (LG) can be considered homoeologous to cucumber chromosomes (Chr). For example, Cucurbita LGs 3, 5 and 18 showed syntenic blocks with cucumber Chromosome 1, LG9 and 17 with Chr2, LG6 and 10 with Chr3, LG21 with Chr4, LG1, 2 and 14 with Chr5, LG7 with Chr6, and LG11 and 15 with Chr7. Some of the remaining LGs (4, 8, and 20) were syntenic to genetic blocks from two cucumber chromosomes (Chr 2-6, and Chr 4-6).
Most cucumber chromosomes contained two to three partially overlapping syntenic blocks with different LGs of C. pepo, which may suggest a certain level of duplication in this species. The higher chromosome number (2n = 40) of Cucurbita suggests that this genus may be of polyploid origin. In fact, previous cytogenetic and isozyme studies indicate that this genus may be an ancient tetraploid [69, 70]. Our results agree with a certain degree of duplication in this species.
A recent study on the level of macrosynteny between two species of the genus, C. pepo and C. moschata, through a comparative alignment of SSR markers, did not provide any indication of a possible ancient polyploid origin of the species . In that paper, the authors studied the segregation of SSR loci, previously selected to be uniquely located in the genome. However, in our study, synteny has been analyzed by blasting whole unigene sequences, which is more likely to yield significant matches in diverse genome sites than the uniquely located SSR primers. Differences in the approaches and the higher number of markers used in our study may explain the differences between the BLAST-based and SSR-based results.
QTL identification and QTL effect validation for Marker-Assisted Selection
Additional File 5: "Quantitative and qualitative traits" shows the values found for each attribute in the parents, F2 and the backcross populations, clearly demonstrating phenotypic variability for most attributes. Forty-eight QTL were detected for 31 quantitative traits and 11 QTL were detected for 11 qualitative traits. These QTL were distributed in 24 independent positions in 13 linkage groups. The proportion of the phenotypic variance explained by a single QTL (R2) varied from 7% to 81%. Fifteen major QTL (R2> 25%) were detected for flowering traits (associated with late flowering and maleness tendency) and for immature and mature fruit traits (associated with fruit length and rind and flesh color). Detailed information about all these QTL (explained variance, LOD peaks, flanking markers, additive and dominance effects and heritabilities) are shown in Additional Files 6 and 7: "QTL analysis for quantitative and qualitative traits 1 and 2".
The genetic inheritance of important agronomic traits is largely unknown in Cucurbita. This QTL analysis provides the first results of the genetic control of most of these plant, flowering and fruit traits. Our preliminary results should be further confirmed using additional populations and phenotypic replications. In this paper, we confirmed the utility of some of these QTL for Marker-Assisted Selection by validating their effects on the backcross populations. Despite the limited number of plants, the effects of eleven of the 15 major QTL (NoMaF_3, NoFeF_3, DMaF_3, DFeF_3, N°MaF_3, MaF/FeF_3, IFLe_6, MFLe_6, MLRCo_14, MaRCo_14 and MbFCo_16) detected in the F2 were verified in one or both backcross populations (Table 2). In addition, six minor QTL (all with R2> 10%) (IFLe_18, MFWi_6, MRib_11, MCaTh_6, MFFi_2 and MLoN°_5) and eight QTL involved in qualitative traits (SC_14, LIns_14, T_1, IFCo_20, MFSh_6, MPriRCo_14, MPSecRCo_14 and MFCo_16) were also verified in the backcrosses (Table 2). The verified QTL segregated differently between the backcross populations, segregating only in one or in both of them. This differential segregation is in general compatible with the direction of additive effects and dominance deviation estimated in the F2. Information about the QTL set validated in the backcross populations is detailed in Additional File 7. The most likely positions on the linkage map for these validated QTL are shown in Figures 2, 3, 4, 5 and 6. The most important QTL displaying real effects in backcrosses related to flowering, fruit shape and color are described below in greater detail.
A cluster of QTL controlling several flowering traits (all with medium-high broad-sense heritabilities,0.71 - 0.85) was detected in LG3, most of which had major effects (R2> 25%) and partial or complete dominance of the Zucchini alleles (d/[a] from -0.78 to -1.05), associated with the early appearance of male and female flowers as well as an enhanced femaleness tendency of the plant (NoMaF_3, DMaF_3, NoFeF_3, DFeF_3, N°MaF_3, MaF/FeF_3) (Additional File 7, Figure 2). In agreement with the a and d values estimated in the F2, the backcrosses show how the Scallop alleles delayed flowering and increased maleness with a recessive gene action (Table 2). Consequently, no differences between plants homozygous for the Zucchini alleles versus heterozygous were found in the BCZ population, whereas the mean of the plants homozygous for the Scallop alleles was significantly higher than those of the heterozygous genotypes in the BCS population. The sex expression in Cucurbitaceae is known to be controlled by various genetic, environmental and hormonal factors, with ethylene being the main hormone involved in this trait. In C. sativus and C. melo, it is controlled by several major independent genes, some of which have been cloned [71–73]. Our results also suggest the existence of a major gene controlling flowering time and the enhanced female/maleness phenotype in summer squash. Further research is necessary to determine whether the co-segregation of the flowering time traits and female/male tendency is due to pleiotropy at a single locus or linkage between loci.
Two major QTL (R2> 25%) involved in fruit shape, controlling the length of immature and mature fruits (IFLe_6 and MFLe_6), co-segregate in LG6, along with various minor QTL that control mature-fruit width and cavity thickness (MFWi_6, MCaTh_6) and also with a QTL controlling fruit shape (MFSh_6) (Additional File 7, Figure 3). The Zucchini type contributed alleles producing elongated fruits, while the Scallop alleles produced wider fruits with wider cavities. Most of these traits presented moderate heritabilities. The two major QTL (IFLe_6 and MFLe_6), with additive gene action estimated in the F2 (d/[a] 0.24 and -0.09 respectively), were verified in both the BCS and BCZ populations with the expected direction of allelic effects (Table 2). These results suggest that these QTL can be exploited in both genetic backgrounds for hybrid or pure line development. MFWi_6 and MCaTh_6 were also additive in the F2, but they were verified only in one of the backcross populations, which may be due to the low capacity for QTL detection in the backcross populations due to their modest sample size or to genetic background effects. An independent QTL for fruit length was detected in LG18 (IFLe_18). Also, Scallop alleles of MLoN°_5 and MRib_11 modified fruit shape by increasing the number of locules and the ribbing intensity.
Several genes have been reported to be related to fruit shape. A dominant gene (Di) seems to control the discoid fruit shape of scallop squash . This gene was reported to be dominant over spherical or pyriform shapes. A digenic epistatic control has also been reported for summer squash fruit shape. Our results are consistent with the existence of a major gene that is, however, not dominant, and several minor modifiers.
Major QTL for the rind color of mature fruits mapped in LG14 (MLRCo_14 and MaRCo_14), with lightness (L Hunter parameter, white color) increasing with Scallop alleles and greenness increasing with Zucchini alleles (Additional File 7, Figure 5). High heritabilities were found (0.95 and 0.97) for these rind color parameters. Also, the visual scores of primary rind color and the pattern of secondary color mapped in the same region (MPriRCo_14 and MPSecRCo_14), with Zucchini alleles leading to dark rind colors and Scallop alleles to the absence of a secondary color (Table 2).
The genetic control of flesh color seems to be independent. A major QTL for immature flesh color is located in LG20 (IFCo_20), whereas a major QTL was found for mature fruit flesh color in LG16 (MbFCo_16), which is consistent with the location of the qualitative trait color MFCo_16 (Additional File 7, Figures 5 and 6).
Squash fruit color has been studied intensively, and a complex genetic control has been proposed for rind color, with major genes (one dominant, derived from Scallop W (weak rind coloration)) , complemented by modifiers, whereas less complexity is reported for flesh color. The QTL that control rind color in mature fruits were validated in one or both backcrosses (Table 2). Plants homozygous for the Zucchini allele are dark green or black, whereas individuals that are heterozygous or homozygous for the Scallop allele are white or cream-colored in any genetic background, consistently with the major gene W, which confers a white or cream color independently of genetic background . This gene has been reported to be complementary to the major gene, Wf, also from Scallop, which is dominant over colored flesh , as most white-rinded squashes are also white-fleshed. Accordingly, mature homozygous Zucchinis for the SNP marker C017913, which is linked to MFCo-16, were mostly orange/yellow-fleshed, whereas homozygous Scallops were mostly white-fleshed. However, heterozygous individuals were all white-fleshed in the Scallop background, while some yellow-fleshed fruits appeared in the Zucchini background. Consistently, a significant effect of Scallop alleles of C030754 (linked to MbFCo_16), reducing flesh yellowness, was only detected in the Scallop background. Therefore, it seems to be a major gene with dominance of white flesh (Wf), although other minor genes also seem to contribute to the control of flesh color.
Most of the QTL reported in this paper had not been located previously. The maps that have been developed to date include six monogenic traits (precocious yellow fruit, B; bush growth habit, Bu; leaf mottling, M; hull-less seed coat, n; and mature fruit color) [41, 42, 44], most of which did not segregate in our population. QTL for fruit length, width and number of fruit locules were located on a Zucchini × Crookneck map constructed using RAPD markers . Other QTL were also reportedly associated with RAPD markers for fruit shape and leaf indentation using an interspecific C. pepo x C. moschata map . However, the comparison of the results is not possible due to the lack of common markers with the current map. These previously detected QTL have not been used to date for MAS selection in Cucurbita.
Our results demonstrate the utility of the 384-SNP GoldenGate genotyping array in Cucurbita pepo. Next-generation sequencing, together with this cost-effective genotyping technique, have been successfully applied to constructing the first SNP-based genetic map reported in the genus. This Zucchini × Scallop map is not only an important resource for high-quality markers that are polymorphic between two highly contrasting squash types, but is also an invaluable tool for breeding purposes, since these markers are developed in coding regions involved in different physiological processes. Several preliminary QTL related to vine, flowering and fruit traits in the mature and immature stages have been reported and mapped for the first time. QTL effects have been validated as has been the utility of various markers for marker-assisted selection, which demonstrates the suitability of the current population and genetic map for dissecting genetically complex fruit traits in Cucurbita ssp. This information will be essential for future breeding programs focused on obtaining better-adapted varieties. The SNP platform has been successfully assayed to detect variability between/within both C. pepo subspecies and different squash morphotypes, and has also revealed a great number of loci transferable to C. moschata. This will facilitate synteny studies with other cucurbits and subsequent diversity and mapping studies that will contribute to increasing the genomic resources for these crops.
Esteras C, Nuez F, Picó B: Genetic diversity studies in Cucurbits using molecular tools. Genetics, Genomics and Breeding of Cucurbits. Edited by: Behera TK, Wang Y, Kole C. 2012, New Hampshire: Science Publishers Inc, Enfield, 140-198.
Paris H: Summer Squash. Handbook of Plant Breeding Vegetables I Part 4. Edited by: Prohens J, Nuez, F. 2008, Springer, 1: 351-381.
Ferriol M, Picó B: Pumpkin and Winter Squash. Handbook of Plant Breeding Vegetables I Part 4. Edited by: Prohens J, Nuez, F. 2008, Springer, 1: 317-349.
Ren Y, Zhang Z, Liu J, Staub JE, Han Y, Cheng Z, Li X, Lu J, Miao H, Kang H, Xie B, Gu X, Wang X, Du Y, Jin W, Huang S: An Integrated Genetic and Cytogenetic Map of the Cucumber Genome. PLoS ONE. 2009, 4 (6): e5795-10.1371/journal.pone.0005795.
Deleu W, Esteras C, Roig C, González-To M, Fernández-Silva I, Blanca J, Aranda MA, Arús P, Nuez F, Monforte AJ, Picó MB, Garcia-Mas J: A set of EST-SNP for map saturation and cultivar identification in melon. BMC Plant Biol. 2009, 9: 90-10.1186/1471-2229-9-90.
Diaz A, Fergany M, Formisano G, Ziarsolo P, Blanca J, Fei Z, Staub JE, Zalapa JE, Cuevas HE, Dace G, Oliver M, Boissot N, Dogimont C, Pitrat M, Hofstede R, Koert P, Harel-Beja R, Tzuri G, Portnoy V, Cohen S, Schaffer A, Katzir N, Xu Y, Zhang H, Fukino N, Matsumoto S, Garcia-Mas J, Monforte AJ: A consensus linkage map for molecular markers and Quantitative Trait Loci associated with economically important traits in melon (Cucumis melo L.). BMC Plant Biol. 2011, 11: 111-10.1186/1471-2229-11-111.
Levi A, Wechter P, Massey L, Carter L, Hopkins D: An Extended Genetic Linkage Map for Watermelon Based on a Testcross and a BC2F2 Population. American Journal of Plant Sciences. 2011, 2: 93-110. 10.4236/ajps.2011.22012.
Wechter WP, Levi A, Harris KR, Davis AR, Fei Z, Katzir N, Giovannoni JJ, Salman-Minkov A, Hernandez A, Thimmapuram J, Tadmor Y, Portnoy V, Trebitsh T: Gene expression in developing watermelon fruit. BMC Genomics. 2008, 9: 275-10.1186/1471-2164-9-275.
Mascarell-Creus A, Cañizares J, Vilarrasa J, Mora-García S, Blanca J, González-Ibeas D, Saladié M, Roig C, Deleu W, Picó B, López-Bigas N, Aranda MA, Garcia-Mas J, Nuez F, Puigdomènech P, Caño-Delgado A: An oligo-based microarray offers novel transcriptomic approaches for the analysis of pathogen resistance and fruit quality traits in melon (Cucumis melo L.). BMC Genomics. 2009, 10: 467-10.1186/1471-2164-10-467.
Dahmani-Mardas F, Troadec C, Boualem A, Léveˆque S, Alsadon AA, Aldoss AA, Dogimont C, Bendahmane A: Engineering Melon Plants with Improved Fruit Shelf Life Using the TILLING Approach. PLoS ONE. 2010, 5 (12): e15776-10.1371/journal.pone.0015776.
González M, Xu M, Esteras C, Roig C, Monforte1 AJ, Troadec C, Pujol M, Nuez F, Bendahmane A, Garcia-Mas J, Picó B: Towards a TILLING platform for functional genomics in Piel de Sapo melons. BMC Research Notes. 2011, 4: 289-10.1186/1756-0500-4-289.
Guo S, Zheng Y, Joung JG, Liu S, Zhang Z, Crasta OR, Sobral BW, Xu Y, Huang S, Fei Z: Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types. BMC Genomics. 2010, 11: 384-10.1186/1471-2164-11-384.
Blanca J, Cañizares J, Ziarsolo P, Esteras C, Mir G, Nuez F, Garcia-Mas J, Picó B: Melon transcriptome characterization. SSRs and SNPs discovery for high throughput genotyping across the species. The Plant Genome. 2011, 4 (2): 118-131. 10.3835/plantgenome2011.01.0003.
Huang S, Li R, Zhang Z, Li L, Gu X, Fan W, Lucas WJ, Wang X, Xie B, Ni P, Ren Y, Zhu H, Li J, Lin K, Jin W, Fei Z, Li G, Staub J, Kilian A, van der Vossen EA, Wu Y, Guo J, He J, Jia Z, Ren Y, Tian G, Lu Y, Ruan J, Qian W, Wang M, Huang Q, Li B, Xuan Z, Cao J, Wu Z, Zhang J, Cai Q, Bai Y, Zhao B, Han Y, Li Y, Li X, Wang S, Shi Q, Liu S, Cho WK, Kim JY, Xu Y, Heller-Uszynska K, Miao H, Cheng Z, Zhang S, Wu J, Yang Y, Kang H, Li M, Liang H, Ren X, Shi Z, Wen M, Jian M, Yang H, Zhang G, Yang Z, Chen R, Liu S, Li J, Ma L, Liu H, Zhou Y, Zhao J, Fang X, Li G, Fang L, Li Y, Liu D, Zheng H, Zhang Y, Qin N, Li Z, Yang G, Yang S, Bolund L, Kristiansen K, Zheng H, Li S, Zhang X, Yang H, Wang J, Sun R, Zhang B, Jiang S, Wang J, Du Y, Li S: The genome of the cucumber, Cucumis sativus. L Nat Genet. 2009, 41: 1275-1281. 10.1038/ng.475.
González VM, Rodríguez-Moreno L, Centeno E, Benjak A, Garcia-Mas J, Puigdomènech P, Aranda MA: Genome-wide BAC-end sequencing of Cucumis melo using two BAC libraries. BMC Genomics. 2010, 11: 618-10.1186/1471-2164-11-618.
Fei , Guo S, Zhang H, Ren Y, Zheng Y, Xu Y: Genome and transcriptome sequencing of watermelon. 2011, Town & Country Convention Center. San Diego. CA, Plant & Animal Genomes XIX Conference: 15-19 January 2011
Cucurbit Genomics Database of the International Cucurbit Genomics Initiative (ICuGI). [http://www.icugi.org]
Metzker ML: Sequencing technologies-the next generation. Nat Rev Genet. 2010, 11: 31-46. 10.1038/nrg2626.
Deschamps S, Campbell MA: Utilization of next-generation sequencing platforms in plant genomics and genetic variant discovery. Mol Breed. 2010, 25: 553-570. 10.1007/s11032-009-9357-9.
Blanca J, Cañizares J, Roig C, Ziarsolo P, Nuez F, Picó B: Transcriptome characterization and high throughput SSRs and SNPs discovery in Cucurbita pepo (Cucurbitaceae). BMC Genomics. 2011, 12: 104-10.1186/1471-2164-12-104.
Gupta PK, Rustgi S, Mirr RR: Array-based high-throughput DNA markers for crop improvement. Heredity. 2008, 101 (1): 5-18. 10.1038/hdy.2008.35.
Fan JB, Oliphant A, Shen R, Kermani BG, Garcia F, Gunderson KL, Hansen M, Steemers F, Butler SL, Deloukas P, Galver L, Hunt S, McBride C, Bibikova M, Rubano T, Chen J, Wickham E, Doucet D, Chang W, Campbell D, Zhang B, Kruglyak S, Bentley D, Haas J, Rigault P, Zhou L, Stuelpnagel J, Chee MS: Highly parallel SNP genotyping. Cold Spring Harb Symp Quant Biol. 2003, 68: 69-78. 10.1101/sqb.2003.68.69.
International HapMap Consortium I: The International HapMap Project. Nature. 2003, 426 (6968): 789-96. 10.1038/nature02168.
McKay SD, Schnabel RD, Murdoch BM, Matukumalli LK, Aerts J, Coppieters W, Crews D, Neto ED, Gill CA, Gao C, Mannen H, Wang Z, Van Tassell CP, Williams JL, Taylor JF, Moore SS: An assessment of population structure in eight breeds of cattle using a whole genome SNP panel. BMC Genet. 2008, 9: 37-
Kijas JW, Townley D, Dalrymple BP, Heaton MP, Maddox JF, McGrath A, Wilson P, Ingersoll RG, McCulloch R, McWilliam S, Tang D, McEwan J, Cockett N, Oddy VH, Nicholas FW, Raadsma H: A genome wide survey of SNP variation reveals the genetic structure of sheep breeds. PLoS ONE. 2009, 4 (3): e4668-10.1371/journal.pone.0004668.
Malhi RS, Trask JS, Shattuck M, Johnson J, Chakraborty D, Kanthaswamy S, Ramakrishnan U, Smith DG: Genotyping single nucleotide polymorphisms (SNPs) across species in Old World Monkeys. Am J Primatol. 2011, 73: 1031-1040.
Pavy N, Pelgas B, Beauseigle S, Blais S, Gagnon F, Gosselin I, Lamothe M, Isabel N, Bousquet J: Enhancing genetic mapping of complex genomes through the design of highly-multiplexed SNP arrays: application to the large and unsequenced genomes of white spruce and black spruce. BMC Genomics. 2008, 9: 21-10.1186/1471-2164-9-21.
Hyten DL, Song Q, Choi IY, Yoon MS, Specht JE, Matukumalli LK, Nelson RL, Shoemaker RC, Young ND, Cregan PB: High-throughput genotyping with the GoldenGate assay in the complex genome of soybean. Theor Appl Genet. 2008, 116 (7): 945-52. 10.1007/s00122-008-0726-2.
Akhunov E, Nicolet C, Dvorak J: Single nucleotide polymorphism genotyping in polyploid wheat with the Illumina GoldenGate assay. Theor Appl Genet. 2009, 119 (3): 507-17. 10.1007/s00122-009-1059-5.
Close TJ, Bhat PR, Lonardi S, Wu Y, Rostoks N, Ramsay L, Druka A, Stein N, Svensson JT, Wanamaker S, Bozdag S, Roose ML, Moscou MJ, Chao S, Varshney RK, Szucs P, Sato K, Hayes PM, Matthews DE, Kleinhofs A, Muehlbauer GJ, DeYoung J, Marshall DF, Madishetty K, Fenton RD, Condamine P, Graner A, Waugh R: Development and implementation of high-throughput SNP genotyping in barley. BMC Genomics. 2009, 10: 582-10.1186/1471-2164-10-582.
Muchero W, Diop NN, Bhat PR, Fenton RD, Wanamaker S, Pottorff M, Hearne S, Cisse N, Fatokun C, Ehlers JD, Roberts PA, Close TJ: A consensus genetic map of cowpea [Vigna unguiculata (L) Walp.] and synteny based on EST-derived SNPs. Proc Natl Acad Sci USA. 2009, 106 (43): 18159-64. 10.1073/pnas.0905886106.
Yan J, Shah T, Warburton ML, Buckler ES, McMullen MD, Crouch J: Genetic characterization and linkage disequilibrium estimation of a global maize collection using SNP markers. PLoS ONE. 2009, 4 (12): e8451-10.1371/journal.pone.0008451.
Deulvot C, Charrel H, Marty A, Jacquin F, Donnadieu C, Lejeune-Hénaut I, Burstin J, Aubert G: Highly-multiplexed SNP genotyping for genetic mapping and germplasm diversity studies in pea. Genomics. 2010, 11: 468-10.1186/1471-2164-11-468.
Lepoittevin C, Frigerio J-M, Garnier-Géré P, Salin F, Cervera M-T, Vornam B, Harvengt L, Plomion C: In Vitro vs In Silico Detected SNPs for the Development of a Genotyping Array: What Can We Learn from a Non-Model Species?. PLoS ONE. 2010, 5 (6): e11034-10.1371/journal.pone.0011034.
Massman J, Cooper B, Horsley R, Neate S, Dill-Macky R, Chao S, Dong Y, Schwarz P, Muehlbauer GJ, Smith KP: Genome-wide association mapping of Fusarium head blight resistance in contemporary barley breeding germplasm. Mol Breed. 2011, 27: 439-454. 10.1007/s11032-010-9442-0.
Eckert AJ, Pande B, Ersoz ES, Wright MH, Rashbrook VK, Nicolet CM, Neale DB: High-throughput genotyping and mapping of single nucleotide polymorphisms in loblolly pine (Pinus taeda L.). Tree Genetics & Genomes. 2009, 5: 225-234. 10.1007/s11295-008-0183-8.
Yan J, Yang X, Shah T, Sánchez-Villeda H, Li J, Warburton M, Zhou Y, Crouc JH, Xu Y: High-throughput SNP genotyping with the GoldenGate assay in maize. Mol Breed. 2010, 25: 441-451. 10.1007/s11032-009-9343-2.
Zamir D: Improving plant breeding with exotic genetic libraries. Nature Review Genetics. 2001, 2: 983-989.
Collard BCY, Jahufer MZZ, Brouwer JB, Pang ECK: An introduction to markers, quantitative trait loci (QTL) mapping and marker-assisted selection for crop improvement: The basic concepts. Euphytica. 2005, 142: 169-196. 10.1007/s10681-005-1681-5.
Lee YH, Jeon HJ, Hong KH, Kim BD: Use of random amplified polymorphic DNA for linkage group analysis in an interspecific cross hybrid F2 generation of Cucurbita. J Kor Soc Hortic Sci. 1995, 36: 323-330.
Brown RN, Myers JR: A genetic map of squash (Cucurbita ssp.) with randomly amplified polymorphic DNA markers and morphological markers. J Am Soc Hortic Sci. 2002, 127: 568-575.
Zraidi A, Lelley T: Genetic map for pumpkin Cucurbita pepo L. using Random Amplified Polymorphic DNA markers. Progress in Cucurbit Genetics and Breeding Research. Edited by: Lebeda A, Paris H. 2004, Palacky University in Olomouc, 507-514. 8th International EUCARPIA Meeting on Cucurbit Genetics and Breeding
Zraidi A, Stift G, Pachner M, Shojaeiyan A, Gong L, Lelley T: A consensus map for Cucurbita pepo. Mol Breed. 2007, 20: 375-388. 10.1007/s11032-007-9098-6.
Gong L, Stift G, Kofler R, Pachner M, Lelley T: Microsatellites for the genus Cucurbita and an SSR-based genetic linkage map of Cucurbita pepo L. Theor Appl Genet. 2008, 117: 37-48. 10.1007/s00122-008-0750-2.
Gong L, Pachner M, Kalai K, Lelley T: SSR-based genetic linkage map of Cucurbita moschata and its synteny with Cucurbita pepo. Genome. 2008, 51: 878-887. 10.1139/G08-072.
Paris H, Kabelka E: Gene List for Cucurbita species. Cucurbit Genet Coop Rep. 2009, 31-32: 44-69.
Ferriol M, Pico B, Nuez F: Genetic diversity of a germplasm collection of Cucurbita pepo using SRAP and AFLP markers. Theor Appl Genet. 2003, 107: 271-282. 10.1007/s00122-003-1242-z.
Ferriol M, Picó B, Fernández de Córdova P, Nuez F: Molecular diversity of a germplasm collection of squash (Cucurbita moschata) determined by SRAP and AFLP markers. Crop Sci. 2004, 44: 653-664.
Doyle JJ, Doyle JL: Isolation of plant DNA from fresh tissue. Focus. 1990, 12: 13-15.
Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M, Robles M: Blast2GO: a universal tool for anotation, visualization and analysis in functional genomics research. Bioinformatics. 2005, 21 (18): 3674-3676. 10.1093/bioinformatics/bti610.
The Swiss-Prot Database. [http://www.uniprot.org/downloads], uniprot_sprot_ release of 2010 04 23
The TAIR Database: The Arabidopsis Information Resource. http://www.arabidopsis.org/, Tair_9_pep_ release 2009 06 19
Uniref90. [http://www.ebi.ac.uk/uniref], uniref90_release 2010 04 23
Tindall EA, Petersen DC, Nikolaysen S, Miller W, Schuster SC, Hayes VM: Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation. BMC Research Notes. 2010, 3: 39-10.1186/1756-0500-3-39.
Lin CH, Yeakley JM, McDaniel TK, Shen R: Medium-to high-throughput SNP genotyping using VeraCode microbeads. Methods Mol Biol. 2009, 496: 129-142. 10.1007/978-1-59745-553-4_10.
Lincoln S, Daly M, Lander ES: Constructing genetic maps with MAPMAKER/EXP 3.0: a tutorial and reference manual. Whitehead Inst Biomed Res Tech Rpt. 1993, Whitehead Institute for Biomedical Research, Cambridge, 97-3
Kosambi DD: The estimation of map distances from recombination values. Ann Eugen. 1944, 12: 172-175.
Voorrips RE: MapChart: Software for the graphical presentation of linkage maps and QTL. J Hered. 2002, 93 (1): 77-78. 10.1093/jhered/93.1.77.
Basten CJ, Weir BS, Zeng ZB: QTL Cartographer, Version 2.5. 2005, Department of Statistics, North Carolina State University, Raleigh, NC
Churchill GA, Doerge RW: Empirical threshold values for quantitative trait mapping. Genetics. 1994, 138: 963-971.
Wright S: Evolution and Genetics of Populations. Vol I, Genetics and Biometric Foundations. 1968, The University of Chicago Press, Chicago
Joehanes R, Nelson JC: QGene 4.0, an extensible Java QTL-analysis platform. Bioinformatics. 2008, 24: 2788-2789. 10.1093/bioinformatics/btn523.
VassarStats: Website for statistical computation. [http://faculty.vassar.edu/lowry/fisher3x3.html]
SISA: Simple Interactive Statistical Analysis. [http://www.quantitativeskills.com/sisa/statistics/ord2.htm]
Rostoks N, Ramsay L, Mackenzie K, Cardle L, Bhat PR, Roose ML, Svensson JT, Stein N, Varshney RK, Marshall DF, Graner A, Close TJ, Waugh R: Recent history of artificial outcrossing facilitates whole-genome association mapping in elite inbred crop varieties. Proc Natl Acad Sci USA. 2006, 103: 18656-18661. 10.1073/pnas.0606133103.
Bradeen JM, Staub JE, Wye C, Antonise R, Peleman J: Towards an expanded and integrated linkage map of cucumber (Cucumis sativus L.). Genome. 2001, 44: 111-119. 10.1139/gen-44-1-111.
Tani N, Takahashi T, Iwata H, Mukai Y, Ujino-Ihara T, Matsumoto A, Yoshimura K, Yoshimaru H, Murai M, Nagasaka K, Tsumura Y: A consensus linkage map for sugi (Cryptomeria japonica) from two pedigrees, based on microsatellites and expressed sequence tags. Genetics. 2003, 165: 1551-1568.
Li D, Cuevas HE, Yang L, Li Y, Garcia-Mas J, Zalapa J, Staub JE, Luan F, Reddy U, He X, Gong Z, Weng Y: Syntenic relationships between cucumber (Cucumis sativus L.) and melon (C. melo L.) chromosomes as revealed by comparative genetic mapping. BMC Genomics. 2011, 12: 396-10.1186/1471-2164-12-396.
Weeden NF: Isozyme studies indicate that the genus Cucurbita is an ancient tetraploid. Cucurbit Genet Coop Rep. 1984, 7: 84-85.
Lebeda A, Widrlechner MP, Staub JE, Ezura H, Zalapa JE, Krístková H: Cucurbits (Cucurbitaceae; Cucumis spp., Cucurbita spp., Citrullus spp.). Genetic Resources, Chromosome Engineering and Crop Improvement. Vegetable Crops. Edited by: Singh RJ. 2007, Taylor & Francis Group, Boca Raton, FL, USA, 3: 271-376.
Boualem A, Fergany M, Fernandez R, Troadec C, Martin A, Morin H, Sari MA, Collin F, Flowers JM, Pitrat M, Purugganan MD, Dogimont C, Bendahmane A: A Conserved Mutation in an Ethylene Biosynthesis Enzyme Leads to Andromonoecy in Melons. Science. 2008, 321 (5890): 836-838. 10.1126/science.1159023.
Martin A, Troadec C, Boualem A, Rajab M, Fernandez R, Morin H, Pitrat M, Dogimont C, Bendahmane A: A transposon-induced epigenetic change leads to sex determination in melon. Nature. 2009, 461: 1135-1138. 10.1038/nature08498.
Li Z, Huang S, Liu S, Pan J, Zhang Z, Tao Q, Shi Q, Jia Z, Zhang W, Chen H, Si L, Zhu L, Cai R: Molecular Isolation of the M Gene Suggests That a Conserved-Residue Conversion Induces the Formation of Bisexual Flowers in Cucumber Plants. Genetics. 2009, 182: 1381-1385. 10.1534/genetics.109.104737.
This research was funded by the INIA projects RTA2008-00035-C02-01/02 and RTA2011-00044-C02-1/2 of the Spanish Instituto Nacional de Investigación y Tecnología Agraria and FEDER funds (EU). The NVD grant was supported by the Programa de Formación del Personal Técnico e Investigador from IFAPA, co-financed with European Social Funds. The authors wish to thank P. Salas and E. Martínez Pérez for their technical assistance in the fruit characterization. We are thankful for the kindly suggestions of Dr. Harry Paris for the F2C. pepo mapping population.
BP designed and coordinated the study, developed the F2 population, selected the markers for the GoldenGate assay, provided the phenotypic data and contributed to the mapping, QTL and synteny analysis. CE participated in the design of the study, and in the development of the F2 population, and contributed to the phenotyping process. CE prepared the DNA of the populations, contributed to the SNP selection process, and performed the SSR analysis. AJM and CE performed the mapping and QTL analysis. JB assisted in the marker selection and in the synteny study. CR assisted in the annotation process. PG participated in the conception of the study. PG and NVD also participated in the design and development of the F2 population, and contributed to the phenotyping and genotyping processes. FN is the director of the COMAV and participated in the conception of the study. BP and CE integrated all the information and drafted the manuscript, with contributions from all authors. All authors have read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Sequence and primers for genotyping the 384 SNP included in the GoldenGate platform. The sequences of each of the 384 SNP included in the GoldenGate platform are indicated, including the polymorphic nucleotide and a 60 bp flanking sequence, along with the locus-specific and two allele-specific primers designed for detecting each locus with the GoldenGate assay, the three universal primers and the Illumicode. (XLSX 71 KB)
Additional file 2: Primers for genotyping the SSR included in the map. The primers for genotyping the 25 SSR selected from the previous C. pepo map  are listed, along with the genotyping results and map position for each locus. (XLSX 12 KB)
Additional file 3: Annotation data and map position of the 384 loci included in the GoldenGate platform. Annotation data of the 384 unigenes included in the GoldenGate platform are described. Previous annotation data provided by Blanca et al. , after in silico detection of the unigenes, consist of unigene length, position of the SNP, in ORF or UTR, number of reads in each parental, GO terms, gene description after sequential BLAST of Swissprot, Arabidopsis org and Uniref90 [51–53], orthologs detected with Arabidopsis and C. melo by reciprocal BLAST of Arabidopsis _pep and ICUGI databases [52, 17] and putative SNP-CAPS. Data generated in this paper are final CRG/CEGEN scores for GoldenGate genotyping reactions, GoldenGate genotyping results, distribution of the unigenes in the C. sativus genome (scaffolds or chromosomes) after BLAST against cucumber genome available at ICUGI , linkage group according to the Zucchini × Scallop map obtained, variability of SNP in the germplasm panel of C. pepo subsp. pepo and subsp. ovifera accessions and amplification in C. moschata. (XLSX 95 KB)
Additional file 4: Number of unigenes in each functional category. Number of unigenes, of the 384 included in the GoldenGate platform, assigned to each GO Slim in the Biological Process category (A) and the Molecular Function category (B). (PPT 231 KB)
Additional file 5: Quantitative and qualitative traits. A. Scored quantitative traits are described. Mean values, ranges and standard deviation for parental, F2 and backcross populations are indicated. B. Pearson correlations between pairs of quantitative traits. C. Visually scored qualitative traits are described. Relative frequency of each phenotype is shown for both parental, F2 and backcross population. (XLSX 53 KB)
Additional file 6: QTL analysis for quantitative and qualitative traits 1. QTL whose effects have not been validated in the backcross populations are included. A. Linkage group positions and flanking markers of 31 QTL, along with their associated logarithms of odds (LOD) for 20 vine, flowering and fruit quantitative traits analyzed in the F2 population derived from the Zucchini × Scallop cross. Major QTL (R2> 25%) are indicated in bold. B. Linkage group positions and flanking markers of 3 QTL, along with their associated logarithms of odds (LOD) and contingency χ2 results for fruit qualitative traits analyzed in the F2 population derived from the cross Zucchini × Scallop. Major QTL (R2> 25%) are indicated in bold. (XLSX 19 KB)
Additional file 7: QTL analysis for quantitative and qualitative traits 2. QTL whose effects have been validated in the backcross populations are included. A. Linkage group positions and flanking markers of 17 QTL, along with their associated logarithms of odds (LOD) for 16 flowering and fruit quantitative traits analyzed in the F2 population derived from the Zucchini × Scallop cross. Major QTL (R2> 25%) are indicated in bold. Heritabilities have been calculatedfor these traits B. Linkage group positions and flanking markers of 8 QTL, along with their associated logarithms of odds (LOD) and contingency χ2 results for vine and fruit qualitative traits analyzed in the F2 population derived from the cross Zucchini × Scallop. Major QTL (R2> 25%) are indicated in bold. (XLSX 20 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Esteras, C., Gómez, P., Monforte, A.J. et al. High-throughput SNP genotyping in Cucurbita pepo for map construction and quantitative trait loci mapping. BMC Genomics 13, 80 (2012). https://doi.org/10.1186/1471-2164-13-80
- Quantitative Trait Locus
- Single Nucleotide Polymorphism
- Major Quantitative Trait Locus
- Cleave Amplify Polymorphic Sequence
- Quantitative Trait Locus Effect