- Research Article
- Open Access
An SNP-based saturated genetic map and QTL analysis of fruit-related traits in Zucchini using Genotyping-by-sequencing
BMC Genomicsvolume 18, Article number: 94 (2017)
Cucurbita pepo is a cucurbit with growing economic importance worldwide. Zucchini morphotype is the most important within this highly variable species. Recently, transcriptome and Simple Sequence Repeat (SSR)- and Single Nucleotide Polymorphism (SNP)-based medium density maps have been reported, however further genomic tools are needed for efficient molecular breeding in the species. Our objective is to combine currently available complete transcriptomes and the Zucchini genome sequence with high throughput genotyping methods, mapping population development and extensive phenotyping to facilitate the advance of genomic research in this species.
We report the Genotyping-by-sequencing analysis of a RIL population developed from the inter subspecific cross Zucchini x Scallop (ssp. pepo x ssp. ovifera). Several thousands of SNP markers were identified and genotyped, followed by the construction of a high-density linkage map based on 7,718 SNPs (average of 386 markers/linkage group) covering 2,817.6 cM of the whole genome, which is a great improvement with respect to previous maps. A QTL analysis was performed using phenotypic data obtained from the RIL population from three environments. In total, 48 consistent QTLs for vine, flowering and fruit quality traits were detected on the basis of a multiple-environment analysis, distributed in 33 independent positions in 15 LGs, and each QTL explained 1.5–62.9% of the phenotypic variance. Eight major QTLs, which could explain greater than 20% of the phenotypic variation were detected and the underlying candidate genes identified.
Here we report the first SNP saturated map in the species, anchored to the physical map. Additionally, several consistent QTLs related to early flowering, fruit shape and length, and rind and flesh color are reported as well as candidate genes for them. This information will enhance molecular breeding in C. pepo and will assist the gene cloning underlying the studied QTLs, helping to reveal the genetic basis of the studied processes in squash.
Cucurbita pepo L. is an economically important species of the Cucurbitaceae family cultivated worldwide, with more than 24 million tons produced in 2013 and nearly 1.8 million ha cultivated . It is particularly important in Asian, American and Mediterranean countries, being Mexico and Spain the main worldwide exporters. Like other cucurbits, C. pepo has become a model to study sex expression, fruit set and development, and parthenocarphy [2, 3].
Taxonomically, the species is divided into three subspecies, C. pepo ssp. pepo L., C. pepo ssp. ovifera (L.) Decker (also known as ssp. texana (Scheele) Filov), both of them including cultivated varieties, and C. pepo ssp. fraterna (L.H. Bailey) Lira, Andres & Nee that is considered as a true wild subspecies [4, 5]. The domestication occurred at least twice in Southern USA and Northern Mexico, where the cultivar diversification was initiated. An additional diversification process occurred after the European contact with the New World and the subsequent migration mainly to Mediterranean countries. Nowadays, this species displays a high variability for many agronomic traits, such as fruit shape and color, flowering habit, leaf morphology, etc. [5, 6].
Cultivars are classified in eight horticultural groups (ssp. pepo: Pumpkin, Vegetable Marrow, Cocozelle and Zucchini, and ssp. ovifera: Scallop, Acorn, Crookneck, and Straightneck). The “Zucchini” group rank among the highest-valued vegetables worldwide . Cultivars from Zucchini, Vegetable Marrow and Cocozelle groups, all of them producing elongated fruits, are considered as modern cultivars being newly developed in Europe, whereas the round and flattened Pumpkins and Scallops are ancient groups developed after domestication in North America .
Despite the agricultural and biological interest of this species, knowledge of its genetics and genome has been very limited in comparison with other cucurbits [9–11]. Recently, we produced the first transcriptome of the species, from root, leaves, and flower tissues, by using the 454 sequencing technology  that was later significantly improved with Illumina technology by sequencing cDNA from additional tissues (shoot apices, floral buds, and pre-harvest and postharvest fruit subjected to ethylene, methylcyclopropene and cold treatments). The C. pepo transcriptome v3 is available at the Cucurbigene website . Genes were annotated and classified in different biological functions. Some of them have been recently used to study biological processes in Cucurbita [2, 14].
The transcriptome sequencing was focused on two varieties with contrasting phenotypes, representing the two C. pepo main subspecies, the inbreeding line ‘MU-CU-16’, belonging to the Zucchini morphotype (C. pepo ssp. pepo), the main summer squash sold in European markets, and the inbreeding line ‘UPV-196’ of the Scallop morphotype (ssp. ovifera). These genotypes also represent different domestication and diversification steps, Scallop types were already selected by Native Americans, whereas elongated Zucchini were selected in Italy after the introduction of this species in Europe . Recently, other transcriptomic studies have been carried out with specific purposes, for instance, to identify genes involved in aphid infestation , or to develop markers in Pumpkin types .
Dense genetic maps are necessary tools for efficient molecular breeding. In the past decades, the linkage maps of the Cucurbita genus were constructed using populations derived from both inter (C. pepo x C. moschata) and intra-specific crosses (C. pepo ssp. pepo x ssp. pepo, and C. pepo ssp. pepo x ssp. ovifera). These maps were first composed of dominant markers (Random Amplified Polymorphic DNA (RAPD) and Amplified Fragment Length Polymorphisms (AFLP)) [18–20], and later completed with SSRs [21, 22]. The two genotypes, Zucchini and Scallop, used to generate the C. pepo transcriptome were also selected previously as parents of an F2 mapping population that was employed to construct the first SNP-based genetic map of the species and to map several QTLs (Quantitative Trait Loci) involved in plant, flowering and fruit traits .
The SNPs located in the map by Esteras et al.  were selected among those identified in silico by mining the transcriptomic sequences of the F2 parents (‘MU-CU-16’ and ‘UPV-196’), and were validated using a Golden-Gate genotyping platform. Despite this map produced valuable information, its marker density was only moderate (6.02 cM/marker). New methods integrating simultaneously SNP discovery and genotyping such as Genotyping-by-sequencing (GBS) [24, 25] can be applied to rapidly develop high-density genetic maps.
The aim of this work is to generate a high-density genetic map using several thousands of SNPs obtained by the GBS analysis of a new Recombinant Inbred Line population (RIL), developed through single seed descent from the previous Zucchini x Scallop F2 population . RIL populations are more adequate for genetic mapping as a higher number of recombinations are produced, improving the mapping resolution, and they can be replicated by seed, making easy replicated trials that facilitates a better estimation of QTL effects . Consequently, we also investigated the genetic control of economically important traits as vine, flowering and fruit quality traits, by QTL analysis, taking advantage of the new high-density map and of the first draft of the C. pepo genome available at the Cucurbigene website . The results presented herein will help to establish a molecular breeding system in this species.
Generation of the RIL population
Two genotypes of C. pepo were used as parentals to produce the intra-specific RIL mapping population used in this study, belonging to different subspecies, ‘MU-CU-16’ is a Zucchini type of the subspecies pepo and ‘UPV-196’ is a Scallop type of the subspecies ovifera. The two parents present contrasting phenotypes for vine, flowering and fruit traits detailed in . F1 plants produced from the Zucchini x Scallop cross were selfed to generate the F2 population used to construct the first SNP-based map in C. pepo . F2 individual plants were selfed until the F8 generation by single seed descent. A final set of 120 F8 RILs was obtained.
DNA was isolated from young leaves of plants of each of the 120 RILs. Additionally, three replicated samples of parentals and the F1 generation and two independent DNA extractions of two randomly selected RIL families were included as controls. DNA extraction was carried out using DNeasy Plant Mini Kit (Qiagen, Hilden, Germany). DNA concentrations were measured with Nanodrop ND-1000 Spectrophotometer v.3.5 to select samples with at least 50 ng/μl. Finally, 131 samples with high-quality DNA were selected for sequencing. Samples were sent for genotyping to the Genomic Diversity Facility at Cornell University (Ithaca, New York, USA). A minimum of 500 ng of DNA were used for SNP genotyping.
A Genotyping-by-sequencing approach was used for SNP discovery between the parents and among the RILs as described by Elshire et al. . The GBS libraries from all samples were prepared using ApeKI endonuclease (recognition site: G/CWGC) and sequenced using the Illumina HiSeq 2000 platform (Illumina Inc, San Diego, CA, USA). GBS libraries were constructed including the parents and F1 (three replicates each) and 122 RIL samples. GBS sequencing reads were de-multiplexed according to the sample barcodes and adapter sequences were removed using GBS barcode splitter . Reads were trimmed by base Phred quality (Q score < 25) and reads shorter than 30 base pairs were discarded before mapping and SNP calling.
The filtered, high-quality sequences from each sample were aligned to the last version of the C. pepo genome v.3.2  using Bowtie 2 with default parameters . SNP calling was performed using Freebayes . A minimum mapping quality of 30, minimum base quality of 20 and minimum coverage of 5 was required. Resulting SNPs were additionally filtered discarding those with more than 70% of missing data, no biallelic, with a Minimum Allele Frequency < 10%, or with heterozygosity >70%. Genotypes with a quality lower than 10 were also discarded.
Phenotyping of the RIL population
The seeds of 120 F8 RILs were germinated in Petri dishes and transplanted to pots (in glasshouses) and to soil (under tunnel) at 2 weeks after germination in three independent assays. The first assay was performed from February to July 2014 (assay Paip2014) at the tunnel facilities of CAJAMAR (Paiporta, Valencia, Spain). Three plants per RIL were phenotyped. DNA for GBS analysis was obtained from this trial. The other two assays were conducted in 2015, at the same tunnel facilities (Paip2015, from February to July) and in the glasshouse facilities at the Polytechnic University of Valencia (UPV) (Valencia, Spain, from May to October, UPV2015). In these two assays, also three sister plants were phenotyped per RIL with a fully randomized experimental design.
In the three assays we measured traits related to vine growth, plant morphology, flowering and fruit traits. Forty-three traits were measured for each single plant (Additional file 1). Vine traits were related to plant length (growth habit, plant length, and number of nodes of the plants at the end of the assay), branching intensity, and leaf traits (spines in the leaf petiole, leaf incision, and occurrence of silver leaf); and flowering traits were related to the flowering time (first node with male/female flower, and days from transplanting to the development of the first male/female flower).
Each plant was selfed and two fruits per plant were analyzed. One of them at immature stage, 7 days after pollination, which corresponds to the commercial stage of summer squashes. The second fruit was analyzed at physiological maturity (ranging from 30 to 60 days after pollination). Traits measured at both physiological states were immature/mature peduncle length, fruit size and shape (fruit length, width, shape, weight, number of locules and ribbing), flesh firmness, rind and flesh color, and sugar content and acidity. More details about all measured traits are included in Additional file 1.
Data from the assays performed in the three locations were used to calculate pairwise genetic correlations between locations as r x,y = covariance x,y/√(variance x * variance y), where x and y represented two different locations .
Genetic map construction and QTL detection
A genetic map was constructed using R packages R/qtl  and ASMap . ASMap package implements the MSTmap algorithm , which is an extremely fast algorithm for linkage map clustering and ordering of thousands of markers. SNPs were coded as homozygotes similar to one parent (A), to the other (B) or heterozygotes (H) using custom scripts. Polymorphic loci that were heterozygous in any of the parents, markers with more than 10% of missing data or duplicated markers (markers with the same genotype for all individuals) were discarded. To remove possible genotyping errors, we used our own implementation of the SMOOTH algorithm . This method corrects the genotype of an individual based on neighboring markers. mstmap function from ASMap package was used to cluster and order markers and quickEst to estimate genetic map distances using Kosambi map function . To detect segregation distortion, Chi-square (χ2) tests were computed for each SNP using R/qtl function geno.table and p-values were corrected for multiple testing using Benjamini and Yekutieli correction . Highly distorted and unlinked markers were excluded from analysis. Mapchart 2.2  was used to visualize a constructed map for each linkage group (LG). The new map was compared with the previous C. pepo 384-SNP map developed with the F2 population derived from the same Zucchini x Scallop cross . This set of SNPs were located in the C. pepo genome v.3.2 using BLAST with their flanking sequences in order to obtain an equivalence between Esteras’ genetic map and current genome version. Linkage groups were named according to the RIL map generated in the current study.
Once the genetic map was established, SNPs showing segregation distortion were located in the physical map together with the mapped SNPs to identify genomic regions with genetic distortion.
QTL analysis for each trait was performed by Composite Interval Mapping (CIM) using the R/qtl function cim with a scan window size of 20 cM and 20 background marker loci as QTL cofactors. A multi environment search of QTLs was performed using the data from the three assays. Environmental effects and Genotype x Environment (G x E) interactions were estimated for each trait using a two factor Analysis of Variance (ANOVA), with all the phenotypic data. Also an additional QTL analysis was performed per environment, using data of each assay separately. Logarithm of odds (LOD) threshold for a Type I error P < 0.05 and P < 0.01 value was obtained based on a permutation test (1000 permutations were run per trait). LOD support interval was calculated using R/qtl lodint function using a 1.5 LOD units drop. The additive QTL effect (a) and the proportion of phenotypic variance explained by QTL (R2) were estimated at the highest QTL peaks using R/qtl function fitqtl. QTLs exceeding the threshold value (p < 0.01) in this analysis were considered significant.
Results and discussion
Sequence data and SNP discovery
A total of 242.4 million cleaned reads with a total of 21.7 Gb were generated for the parents, F1 and the 122 RIL samples. The number of reads obtained varied from 0.9 million to 4.6 million with an average of 1.85 million reads per line. The genome of C. pepo v3.2 has a total assembled size of 263,500,453 bp (with 909 large scaffolds > = 10 Kb). All the cleaned reads represent an average percentage of covered genome with a read depth > 1 and > 20 of 3.0% and 0.5%, respectively (Additional file 2). Sequences have been submitted to the Sequence Read Archive (SRA) of the NCBI (SRR4299463-SRR4299615, BioPoject PRJNA344022).
The sequences obtained were filtered and used for SNP identification. A 93.9% of the cleaned reads were mapped to the C. pepo genome and 62,617 SNPs were identified after the SNP calling with Freebayes. After excluding SNPs that were monomorphic in the RIL population, those non biallelic, with more than 70% of missing data, with heterozygosity >70% or with Minimum Allele Frequency (MAF) < 10%, 26,430 SNPs remained. The SNPs identified were again filtered to remove heterozygous SNPs in parents.
The number of SNPs identified in this population is higher than the number identified in a recent GBS analysis performed using an F2 population derived from the cross between two accessions of the closely related species Cucurbita maxima . Using the same filters employed in the study by Zhang et al. , that is, selecting SNPs with less than 20% missing data and MAF ≥ 0.2, we identified about eight times more SNPs in our RIL population, 16,222 SNPs in C. pepo RIL population versus 1,881 in C. maxima population. Differences can be explained by the higher variability of the C. pepo species and by the fact that we have crossed two accessions belonging to two different subspecies generated in two independent domestication events .
Construction of genetic map
A genetic map was constructed using genotypic data of the RIL population. After discarding those SNPs with more than 10% of missing data, and those showing a statistically significant deviation from Mendelian segregation, 10,166 SNPs distributed in 178 scaffolds (representing 212,381,440 bp, 80.6% of the genome) remained. We found 3,676 SNPs forming groups of SNPs with the same genotype for all samples. Only one SNP per group was retained. We also removed unlinked markers and SNPs that had different genotypes in the two DNA replicates used as controls. The average degree of heterozygosity existing in this F8 RIL population was 1.47% (ranging from 0.013 to 3.72%).
The map consisted of 7,718 SNPs distributed across 21 linkage groups (Table 1, Fig. 1). The individual LGs had between 770 and 101 markers each, with a mean of 367.5 markers per LG. The LG size ranged from 51.4 cM (LG 21) to 303.4 cM (LG 1), giving a total genetic length of 2,817.6 cM. Average genetic distance between successive markers was 0.4 cM, and maximum spacing between markers ranged from 16.8 cM in LG 1 to 4.6 cM in LG 4. A total of 145 scaffolds (from 1 to 16 scaffolds per LG) of the current version of the C. pepo genome (version 3.2) could be anchored to the genetic map (Table 1). Additional file 3 (a) includes detailed information about the genetic map with the genetic and the physical position of each SNP marker in the C. pepo genome (version 3.2), along with the flanking sequences of all SNPs. All SNPs have unique physical locations in the C. pepo genome, and are potentially transferable among species allowing comparative studies within this genus.
This new map significantly improves the previously reported C. pepo map constructed with the F2 population in the earlier study by Esteras et al. , which included 315 markers covering 1,740.8 cM, with an average distance between markers of 6.02 cM and a maximum gap of 30.3 cM. The sequences of these markers were mapped to the C. pepo genome to compare both maps. It was possible to associate most of the linkage groups established in the F2 map, with unique LG of the new high density genetic map (Fig. 1). The RIL map enabled merging former LGs 19 and 21, and 22 and 12 (corresponding to new LG 2 and 7 respectively). Also current LG 18 and 20 corresponded with the former LG 3. Therefore, the comparison of both maps revealed 20 linkage groups that corresponded to the 20 chromosomes of C. pepo.
The map length is similar to that reported for C. maxima, 2,566.8 cM , with a much higher marker density, 385.9 versus 22.9 marker per LG, an average of 0.4 versus 5.6 cM between successive markers, and fewer genetic gaps, making the current map the most saturated genetic map of the Cucurbita genus to date.
SNPs with statistically significant distorted segregation that were discarded for map construction, were located in the genome (Additional file 3b). In some regions, distorted SNPs were interspersed with non-distorted SNPs, so the observed distortion was not likely due to genetic reasons, but to artefacts of the GBS analysis or complex genomic causes (i.e., gene duplication or deletion). However, in other regions, blocks of continuous distorted SNPs were observed. In such cases, a real genetic distortion was assumed (Additional file 3c). These regions were mainly located in the distal region of LG 19 (CP32_scaffold000005_127543-2532610 Scallop alleles overrepresented) and of LG 21 and LG 20 (CP32_scaffold000007_32640-4220403 and CP32_scaffold000027 11033-2401349, Zucchini alleles overrepresented). This distortion in the distal region of LG 20 could explain why this LG could not be merged with LG 18 in the RILs map (Fig. 1). Other regions with distorted segregation were for example in LG 16 (28.2–51.4 cM; CP32_scaffold000028_577-1064181), LG 12 (91.9–111.1 cM, scaffold000045_87006-1620317), LG 1 (127.3–129.5 cM, scaffold000084_267107-735527) and LG 4 (24.5 cM, scaffold000086_313524-580307), with Scallop alleles overrepresented, and in LG 15 (9.2 cM, scaffold000060_271409-477442), with Zucchini alleles overrepresented. Also some scaffolds that could not be anchored to the genetic map showed distorted segregation skewed towards Zucchini and towards Scallop alleles, respectively. Some of these regions were also observed in the former F2 map, mainly in the previous LG 2 and LG 5 (current LG 1 and 21).
Different functions were associated to the genes located in the distorted regions. Some were directly related to the flowering process (Protein UNUSUAL FLORAL ORGANS: CP32_scaffold000007_1554474, squamosa promoter binding protein-like: CP32_scaffold000045_1030767 and CP32_scaffold000084_557489, Flowering time control protein FPA: CP32_scaffold000045_1196335) [39–41]. Others are transcription factors involved in plant growth and development (Scarecrow-like transcription factor PAT1: CP32_scaffold000050_1278397, TCP family transcription factor: CP32_scaffold000084_653160), in the embryogenesis process (embryo defective 2170: CP32_scaffold000028_353221, Homeobox protein knotted-1-like 7: CP32_scaffold000086_568011) [42, 43] or genes related to hormone metabolism (ethylene or auxin related) (Additional file 3c). Scallop and Zucchini alleles were overrepresented in different regions, suggesting that the alleles in this region may be subjected to gametic or zygotic selection and/or related to preferential germination or better seedling viability. Some of these unigenes may be the cause of the segregation distortion, but it could also be the result of linkage to other genes.
Given the low level of heterozygosity observed in the RILs grown in Paip2014 (see above), the GBS genotype for the RILs is adequate for QTL analysis in the three environments after removing the heterozygous loci. We performed composite interval mapping with window size of 20 cM. QTL analysis based on genotypic data and phenotypic data for 43 traits, identified a total of 48 QTLs (Table 2) for vine, flowering and fruit traits that were detected with the full set of data and with the data from at least two environments separately. These QTLs were distributed in 33 independent positions in 15 linkage groups. The proportion of the phenotypic variance explained by a single QTL (R2) varied from 1.5 to 62.9%. Detailed information about all these QTLs (explained variance, LOD peaks, flanking markers, additive effects) is shown in Table 2. Additional files 4 and 5 include additional data of the analyzed traits and QTLs. Identified QTLs are discussed below grouped by traits.
No significant QTLs were found for vine size and architecture despite RIL parents, Zucchini and Scallop, differed in growth habit (bushy versus intermediate), branching intensity (non-branched versus branched), and plant length and number of nodes (an average from 78 to 120 versus 210 to 277 cm, and from 5 to 70 versus 73 to 90 nodes, in the three environments for Zucchini and Scallop respectively).
Major QTLs were identified for leaf traits, such as leaf blade incision and the occurrence of silver leaf (Li_10 and Sl_12) (Table 2 and Additional file 4), both traits related to photosynthesis rate. Zucchini plants developed deeply incised leaves whereas Scallop plants had weak incisions (Li scored as 4 versus 1 respectively in all environments). Large genetic correlations between locations where found for these two traits, r x, y ranging from 0.66 to 0.77 (Additional file 4) indicating that the norm of reaction for each genotype was similar in the three environments.
Li_10 (located at 33.9 cM, CP32_scaffold000009_2374010-CP32_scaffold000009_2556718) explained most of the variation found in this trait (R2 = 50.1%). ANOVA results show a lack of environment effect and of G x E interaction (Table 2), with RILs with Zucchini alleles having deeper incisions than RILs with Scallop alleles (Fig. 2) in all environments (average incision 2.9 versus 1.8) (Additional file 4 and 5). Two genes belonging to the homeobox-leucine zipper protein family were annotated in the Li_10 region (CP32_scaffold000009-2401206-2403737 and 2409170-2410649) (Additional file 6). The best hit of these Cucurbita genes (against the non-redundant protein sequence database) was with homeobox-leucine zipper protein ATHB-22-like of Cucumis melo. This family of homeobox genes has roles in meristem identity and in the regulation of leaf development in several plant species [44, 45]. These could be good candidates to explain differences in leaf morphology found in this population.
The Zucchini and Scallop parents also differed in the occurrence leaf silvering. This is a typical feature of the Zucchini parental, whereas Scallop develops uniformly green leaves (Sl scored as 1 versus 0 respectively). Silver mottling of leaves is caused by the development of large air spaces between palisade cells and the epidermis. This trait has been reported to be controlled by a dominant single gene (m for non-mottled or non-silvery leaves and M for mottled or silvery leaves), with modifiers [46–48]. Also non-genetic factors, such as variation in light and temperature and drought stress, affect the expression of the silvery-leaf trait. Squash leaf silvering can also appear as a systemic response to whitefly feeding . One major QTL, explaining 23.3% of the observed variation, was identified in our population in LG 12, Sl_12 (59.3 cM, CP32_scaffold000031_1726079-CP32_scaffold000031_2012785) (Table 2 and Additional file 4). RILs with the Zucchini alleles in that region show different degrees of silvering, whereas those with the Scallop genotype develop green leaves in all the environments (Additional file 5, Fig. 3) (mean scores 0.49 versus 0.11). In fact no significant environment or G x E effect was found for this trait (Table 2, Additional file 4). Other two QTLs had LOD values above the threshold in the analysis with all the data (p = 0.01), Sl_1 (132.5 cM, CP32_scaffold000084_728121-CP32_scaffold000059_548298) and Sl_16 (66.1 cM, CP32_scaffold000034_1033951-CP32_scaffold000028_1777958) (Table 2, Additional file 4). However, these explained a low percentage of the observed variation (3.8 and 8.0%, respectively), and were not significant in all the assayed environments, showing a significant G x E interaction (Additional files 4 and 5).
Silvered leaves exhibit reduced photosynthetic ability, but this trait also seems to have a favorable effect on protection against aphids or plant desiccation, as silvery leaves reflect more light than non- silvery leaves . A further study of the genes annotated in the corresponding QTL intervals (Additional file 6) is of interest to identify candidates involved in the variation of this trait and to manage it in breeding programs.
Flowering time related traits
Cucurbits have become model species for the study of plant sex determination and some genes involved in sex expression (dioecy, monoecy, andromonoecy and gynoecy) have been characterized in melon, cucumber and squash [14, 50–55]. Most of them are enzymes involved in ethylene biosynthesis, signaling and regulation. The Cucurbita orthologs of some of the cloned genes (CmACS11, CmACS7, CsACS2, CSACS1g and CmWIP1 of melon and cucumber) are located in LG 13, LG 14, LG 18, LG 10, and LG 17 (CP32_ scaffold000017_63557, CP32_scaffold000011_561409; CP32_scaffold00025_1079779; CP32_ scaffold000009_993755; CP32_ scaffold000004_4527055). Other genes, such as CTR1 and CTR2, that confer reduced ethylene sensitivity and have been reported to be involved in the male/female ratio in C. pepo  are located in LG 3 and LG 6 (CP32_ scaffold000038_689227 and CP32_ scaffold000008_2476160). No significant QTLs involved in the variation of the flowering traits analyzed in this study colocalize with any of these regions (Additional file 4). This result is not unexpected as the Zucchini and Scallop accessions do not differ in sex expression, being both monoecious. In fact, sex expression is less variable in Cucurbita than in Cucumis crops, cucumber and melon.
Early flowering, however, is a highly variable and economically important agronomic trait related to early yield in C. pepo. It is affected by genetic, environmental and hormonal factors [57, 58]. Zucchini and Scallop do differ in flowering time, mainly in the days to the development of the first male and female flower, being the Scallop parental more late-flowering than Zucchini squash (average DMaF 18 to 23.5 versus 21 to 24.5 days and DFeF 18 to 30 versus 31.5 to 42 for Zucchini and Scallop in the three environments, respectively). Genetic correlations between locations for this trait were positive and significant, although moderate (r x, y > 0.28) (Additional file 4). The analysis of the RIL population suggests the existence of at least two genomic regions controlling flowering time. We found a QTL (R2 = 7.9%) involved in the earliness of female flowers in LG 12 (DFeF_12, 33.1 cM, CP32_scaffold000074_621895-CP32_scaffold000096_181501) (Table 2, Additional file 4). ANOVA results also indicate a significant effect of the environment and a lack of G X E interaction in this locus (Table 2). The RILs homozygous for the Zucchini alleles developed the first female flower in all environments between 3 and 8 days before than those homozygous for the Scallop allele (average DFeF 34.2 versus 39.7 days) (Additional file 5). One additional minor QTL involved in the variation of the same trait was found in LG 9 (R2 < 5%), DFeF_9 (116 cM, CP32_scaffold000013_960688-CP32_scaffold000013_1308585) (Table 2 and Additional file 4), with significant E and G x E effects. Significant differences between the RILs with the Zucchini versus Scallop genotypes in this region were found in two environments, with an advance of the female flowering from 2 to 6 days (average DFeF 34.9 versus 37.4 days) (Additional file 5).
Some candidate genes are found in these regions (Additional file 6), opening new possibilities for the study of the genetics of this poorly studied trait. For example, two MADS-box genes are annotated in the DFeF_12 region (CP32_ scaffold000074_189252-192320 and 193342-199503) that are most similar to the Momordica charantia AGAMOUS LIKE6-like protein (AG6) and to the MADS-box protein SOC1-like from C. sativus. The AGL6 gene acts as a floral promoter in Arabidopsis through the control of the transcription of key regulators of flowering time (the FLOWERING LOCUS C (FLC), the main flowering switch gene in Arabidopsis, and the FLOWERING LOCUS T (FT)). The overexpression of this gene results in a late-flowering phenotype . Also the overexpression of SOC1-like stimulate flowering in different species . In this region is also located a TCP transcription factor (CP32_ scaffold000074_159157-160182, best nr hit transcription factor TCP9 of C. melo). The TCP gene family plays important roles in regulating diverse processes, including phytohormone biosynthesis and signal transduction, branching and flower development . A gene of this family was also annotated in the distorted region of LG 1 described above (Additional file 3c).
Variation in flowering time has been less studied than sex expression in cucurbits and little is known about the underlying genetic mechanism. In a recent study conducted in cucumber, a candidate gene for early flowering was identified (Csa1G651710), which is a homolog of the Arabidopsis LOCUS T . Our results suggest that the regulation of the FT gene is also a main mechanism underlying the variation of the flowering time in C. pepo. The C. pepo ortholog of the Csa1G651710 is located in the scaffold150 (CP32_scaffold000150_132214-134774) that could not be mapped to any of the C. pepo linkage groups. This scaffold showed a high degree of distorted segregation towards the Zucchini alleles (Additional file 3c), which is consistent with the idea that Scallop alleles in this locus have resulted in late flowering affecting the reproduction of the carrier RILs during the selfing process. It remains to be studied if the presence of Zucchini regions in DFeF_12 results in a change of FT expression associated to the early flowering genotypes.
Additionally, three genes are located in the region of the DFeF_9, that can be associated to the flowering process, one annotated as WUSCHEL-related homeobox (WOX) (CP32_ scaffold000013_1016547-1018778, Best hit WUSCHEL-related homeobox of Cucumis melo) and a second gene as an auxin response factor 4 (CP32_ scaffold000013_1041004-1045906). WOX genes are a large group of transcription factors essential in maintaining shoot apical meristem, some of which play important roles in the regulation of floral patterning. Some of these processes are conducted through the regulation of auxin transport [63, 64]. The third candidate in this region is an ethylene-responsive transcription factor (ERF4) (CP32_scaffold000013_1092413-1092817) as flowering is also associated to ethylene metabolism. Further research is necessary to determine whether flowering time traits co-segregate with variation in these genes.
None of these QTLs colocalize with the major QTL in LG 3 controlling several flowering traits (days to flowering, node to the first flower, etc.) detected using the F2 population of the same cross and the previous map by Esteras et al. . Former LG 3 corresponds to LG 18 and LG 20 in the current map (Fig. 1). The SNPs flanking the flowering QTL in the F2 map (C006328, C001057 and C003831) were located in CP32_scaffold000027 (507871 and 2031514) and CP32_scaffold000025 (1811879), the first and second ones anchored in one end of the LG 20 and the last in another end of the LG 18. The CP32_scaffold000027 presents a high percentage of distorted markers (Additional file 3c), thus as described before we were not able to merge this two LG into one. Therefore, likely the lack of markers in this region is the reason why this main QTL detected in the F2 was not detected in the RILs population. In this region are two ETHYLENE INSENSITIVE3 (EIN3) genes and one ethylene-responsive transcription factor (ERF) annotated (CP32_scaffold000025-1811503 and 2037348; CP32_scaffold000027-1697493-1702248) encoding transcription factors that represent downstream components of ethylene signaling, also reported to be involved in flowering .
Squash fruit morphology is related to different traits: fruit weight, fruit shape, fruit length and width, number of locules, and ribbing intensity. Zucchini and Scallop immature and mature fruits did not significantly differ in fruit weight, but they did differ in fruit shape (IFSh and MFSh scored as 6, elongated, versus 1, discoidal, respectively). Fruit shape differences were mainly due to differences in fruit length (IFLe from 16.7 to 23.7 cm versus 4.7 to 7.5 cm, and MFLe from 37.8 to 38.7 cm versus 6.9 to 9.8 cm, respectively for Zucchini and Scallop in the three environments).
Genetic correlations between locations were very high for some traits related to immature and mature fruit shape (r x,y ranging from 0.65 to 0.89 for IFSh, IFLe, MFSh, MFLe), and moderate for some others (r x, y ranging from 0.33 to 0.65 for IFWi, IFRib, MFWi, and MFRib) (Additional file 4).
A major QTL affecting immature fruit shape, IFSh_3 (R2 = 17.8%, 173.2 cM, CP32_scaffold000038_1385297-CP32_scaffold000038_1853138) colocalized in LG 3 with major QTLs affecting immature fruit length and width (IFLe_3 and IFWi_3) (R2 = 31.8 and 18.7%, 173.2 and 171.1 cM, respectively) (Table 2, Additional file 4). Zucchini alleles in this region resulted in more elongated immature fruits, significantly longer and narrower than fruits with Scallop alleles (average IFLe 14.5 versus 10.3 cm, IFWi 5.8 versus 7.1 cm for homozygous Zucchini and Scallop respectively) (Additional file 5, Fig. 4a). Differences were significant in the three environments. The effect of the environment was significant for IFLe and IFLWi, whereas no significant G x E interaction was found in any of the traits (Table 2, Additional file 4). These shape differences were also appreciated in mature fruits, affecting more to fruit length than to width (MFLe 24.9 versus 16.0 cm and MFWi 10.1 versus 11.6 cm) (Additional file 5, Fig. 4b). In fact, QTLs involved in the mature fruits shape, length and width colocalized in the same region, MFSh_3, MFLe_3 and MFWi_3 (R2 = 11.0, 38.7 and 15.1%, 171.0, 169.1, and 171.0 cM) (Table 2, Additional file 4). No effect was observed in this region for fruit weight, number of locules, and other traits associated to fruit morphology, such as ribbing intensity. This region is the same found in the previous map (developed with the F2 population of the same cross) located in LG 6 , controlling the length of immature and mature fruits, and associated fruit shape traits (mature fruit width and cavity thickness). The comparison between the F2 and the RIL genetic map indicates the correspondence between LG 6 (F2) and LG 3 (RIL) (Fig. 1). The genetic basis of variation in fruit shape has been studied most extensively in tomato and in other cucurbits such as melon. For example in melon, some QTLs associated to fruit shape colocalize with members of the OVATE family proteins (OFP). Genes of this family are also involved in tomato fruit morphology [66–68]. The anchorage of the RIL map to the genome sequence provide the list of genes annotated in the IFSh_3, IFLe_3, IFWi_3, MFSh_3, MFLe_3 and MFWi_3 region (Additional file 6). Interestingly, this list includes an OFP gene (CP32_scaffold000038-1785881-1786918), the ortholog of the OFP2-like gene of Cucumis sativus, which likely contributes to the observed variation in C. pepo fruit shape.
Two additional minor QTLs affecting immature fruit shape and length, IFSh_12 (R2 = 6.7%, 25.0 cM, CP32_scaffold000065_760668-CP32_scaffold000074_729362) that colocalized in LG 12 with minor QTLs affecting immature and mature fruit length and width (IFLe_12, MFLe_12 and MFWi_12) (R2 = 7.6, 6.6 and 7.4%, 24.2, 19.3 and 24.9 cM), and IFLe_15 (R2 = 6.7%, 34.7 cM, CP32_scaffold000056_543412-CP32_scaffold000056_920404) were identified (Table 2, Additional file 4). In all cases Zucchini alleles increased fruit length, but the effects of these QTLs were much lower than that of QTLs in LG 3 and significant differences were found only in some environments (Additional file 5).
Also four additional regions in LG 4 (39.4 cM, CP32_scaffold000159_92504-CP32_scaffold000022_375710), LG 5 (27.9 cM, CP32_scaffold000018_362002-CP32_scaffold000018_549964), LG 6 (19.7 cM, CP32_scaffold000003_3022364-CP32_scaffold000003_3087204), and LG 9 (119.3 cM, CP32_scaffold000013_890110-CP32_scaffold000013_959962) affected mature fruit shape and length, MFSh_4, MFSh_5, MFLe_6 and MFLe_9 (R2 = 3.4, 3.0, 3.6 and 6.2%) (Table 2, Additional file 4). MFLe_9 was the only QTL in which the Scallop alleles increased fruit length (average MFLe 19.1 versus 22.7 cm for Zucchini and Scallop respectively) (Additional file 5). In these four QTLs the effect was only observed in mature fruits, suggesting that different regions affect fruit shape in the early and late steps of fruit development. Several genes had been previously reported to be related to fruit shape in C. pepo . A dominant gene (Di) was reported associated to the discoidal fruit shape of scallop squash. This gene was suggested to be dominant over spherical or pyriform shapes. A digenic epistatic control has also been reported for summer squash fruit shape. Our results are consistent with the existence of a major gene, that could be the ovate underlying the IFLe_3 QTL, and several minor modifiers.
Apart from fruit length and width, there are other traits associated to fruit morphology. The Scallop fruit is strongly scalloped with deeper ribs than Zucchini (IFRib 0 to 1 versus 2 to 3 and MFRi 0 to 0.5 versus 3, for Zucchini and Scallop respectively). The ribbing intensity was variable in the RIL population. Three QTLs involved in this trait were detected, one controlling ribbing intensity in immature fruits, IFRib_3 (R2 = 8.7%, 69.4 cM, CP32_scaffold000006_3196015-CP32_scaffold000006_3616125) and two in mature fruits, MFRib_12 (R2 = 10.7% 110.2 cM, CP32_scaffold000031_115435-CP32_scaffold000045_1342921), and MFRib_21 (R2 = 6.2% 19.2 cM, CP32_scaffold000007_4220403-CP32_scaffold000110_136233) (Table 2, Additional file 4). Zucchini alleles in the IFRib_3 region resulted in more ribbed fruits, although not in all environments (1.5 versus 0.85 in Zucchini and Scallop genotype, respectively). In fact, both environment and G x E effects were significant for IFRib_3, whereas in MFRib_12 and MFRib_21 resulted in less ribbed fruits (0.34 versus 0.79 and 0.46 versus 0.83) in all environments, with no significant G x E effects. MFRib_12, that explained the highest percentage of the variation found in this trait (Fig. 5), had been previously detected in the map of the F2 population (MRib_11 in LG 11 of the F2 map that correspond to LG 12 of the RIL map) (Fig. 1). Transcription factors belonging to the WOX family have been reported to control the carpel number . We did not find genes belonging to this family in the reported regions, so other genes must be underlying these QTLs (Additional file 6).
Peduncle length is also involved in fruit typology. Significant differences were found among parents in the three environments, with shorter peduncles in the Zucchini fruits (IPeLe 2–3 versus 4.3–5.3 cm and MPeLe 1.4–3.6 versus 5.2–9.5 cm). Genetic correlations between assays were moderate for peduncle traits (Additional file 4). Three QTLs were identified controlling immature and mature fruit peduncle length (Table 2 and 3, Additional file 4): IPeLe_10 (R2 = 7.6%, 71.3 cM, CP32_scaffold000009_272548-CP32_scaffold000009_704495), IPeLe_16 (R2 = 3.7% 14.9 cM, CP32_scaffold000030_1135277-CP32_scaffold000030_1337312) and MPeLe_14 (R2 = 3.9%, 29.3 cM, CP32_scaffold000039_1662180-CP32_scaffold000011_1038061). These QTLs have opposite effects of the Zucchini versus Scallop alleles (IPeLe 4.2 versus 5.5 and 5.3 versus 4.4 for IPeLe_10 and 16, respectively and 4.4 versus 5.3 for MFLe_14) (Additional file 5).
Apart from fruit shape, other factors involved in fruit quality have been studied: organoleptic (flesh sugar content, Brix, and acidity, pH), nutritional (several measures of rind and flesh color), and physical (flesh firmness). Zucchini and Scallop parents did not differ in flesh firmness, Brix degree and pH, but clearly differed in fruit color, both of rind and flesh. Color was measured with colorimeter (taking the three parameters of the Hunter scale, L, Lightness, from white, L = 100, to black, L = 0; a, from redness for positive values to greenness for negative values; b, from yellowness for positive values to blueness for negative values). The Zucchini parent develops immature dark green fruits (characterized by low ILRCo values, 24.4 to 28.3, negative IaRCo, −2.6 to −6.8, and, positive IbRCo, 4.3 to 4.6, scores) with light green flesh (characterized by high ILFCo values, 52.9 to 81.3, negative IaFCo, −4.3 to −6.8, and high IbFCo values, 11.1 to 23.8) in the three environments. Immature Scallop fruits have light green rinds (with significantly higher values of ILRCo, 64.7 to 77.3 and IbRCo, 17.7 to 19.1, but similar values of the a parameter) and light green flesh, similar to that of the Zucchini fruits (ILFCo 79.5 to 82.4, IaFCo −4.1 –6.6, and IbFCo 13.9–15.5). Color differences were more significant in mature fruits. Zucchini fruits were dark green (MLRCo 21.2 to 25.6, MaRCo −1.6 to −0.48 and MbRCo 1.2 to 6.0), and the flesh varied from light green to light yellow or orange at physiological maturity (MLFCo 49.8 to 76.6, MaFCo −1.6 to 0.8 and MbFCo 12.9 to 24.4). However, the Scallop fruits remained white, both rind and flesh, at full maturity (with significantly higher rind and flesh lightness, MLRCo 78.8 to 81.9 and MFLCo 78.7 to 80.4, and rind yellowness values, MbRCo 11.4 to 12.5, and significantly lower flesh redness and yellowness, MaFCo −1.3 to −0.62 and MbFCo 9.7 to 13.3). Genetic correlations between environment were very high for the L trait (r x, y >0.75) and from moderate to high for the a and b color parameters (r x, y > 0.45) measured both in fruit rinds and flesh (Additional file 4).
We found a major region controlling the rind color of the immature and mature fruit in LG 4. The major QTLs ILRCo_4 and IbRCo_4 (R2 = 40.6 and 15.4%, 14.9 and 15.3 cM, CP32_scaffold000066_82542-CP32_scaffold000143_130468), explaining most of the variability found in immature rind color, colocalized with two major QTLs explaining most of the variation found in mature rind color, MLRCo_4 and MbRCo_4 (R2 = 40.3 and 12.8%, 14.9 and 19.6 cM, CP32_scaffold000066_82542 -CP32_scaffold000143_294806) (Table 2, Additional file 4). These QTLs control the occurrence of rind dark green color. In fact, the fruits from RILs with Zucchini alleles develop immature/mature fruits with darker green primary color, with or without stripped or mottle secondary color pattern, and with low values of L and b parameters (average values in the three environments ILRCo 52.6 versus 72.4, IbRCo 16.1 versus 20.5 and MLRCo 47.8 versus 73.1, MbRCo 14.9 versus 21.2 for RILs with Zucchini versus Scallop alleles) (Additional file 5) (Fig. 6a and b). These major QTLs colocalize with the major QTL for the rind color of mature fruits mapped previously in LG 14 with the F2 population (MLRCo_14 and MaRCo_14)  that correspond to LG 4 in the RIL map (Fig. 1).
In the LOD peak region of these QTLs, two genes annotated as related to an Arabidopsis APRR2-Like (ARABIDOPSIS PSEUDO RESPONSE REGULATOR2-LIKE gene) (CP32_ scaffold000180_115300-127146 and 130920-135450) (Additional file 6) were found. Genes of this family have been demonstrated to act as fruit-related regulators of pigment accumulation in tomato and pepper . In fact the presence of a stop codon mutation explain color differences between the wild-type and a white-fruited pepper cultivar associated to differences in chlorophyll content. Our results also suggest a similar function of these genes in the Cucurbitaceae family.
Apart from the main effect of ILRCo_4-IbRCo_4/MLRCo-4MbRCo_4, immature rind color seems to be controlled by additional genomic regions. The main ones were in LG 10 and LG 3. The first region affected also flesh color, ILRCo_10, IaRCo_10, IaFCo_10 and IbFCo_10 (R2 = 6.0, 9.7, 28.3 and 15.0%, 99.5, 104.9, 100.4 and 102.9 cM, CP32_scaffold000029_1132025-CP32_scaffold000029_1802506). RILs with Scallop/Zucchini genotype had immature fruits with different greenish and yellowish tones in rind and flesh (ILRCo 67.9 versus 60.5 and IaRCo −8.3 versus −10.4, IaFCo −6.2 versus −4.1 and IbFCo 16.6 versus 13.9 for Zucchini and Scallop, respectively) (Additional file 5). A similar effect was found in LG 3 (IaRCo_3, R2 = 12.1%, 86.1 cM CP32_scaffold000187_88187-CP32_scaffold000187_88366), but significant environment and G x E effect were found in this region (Table 2, Additional file 4). Other less important regions (R2 < 5%) involved in the variation of immature rind color were ILRCo_1, IbRCo_3 and IbRCo_12. These results are consistent with the traditionally proposed genetic control for rind color in squash, with a major gene derived from the Scallop genotype, W (weak rind coloration), complemented by modifiers, which confers a white or cream solid color independently of the genetic background. This gene has been reported to be epistatic to D (Dark stem) derived from Zucchini squash that result in dark stem and dark intermediate-age fruit . The APRR2-like genes underlying ILRCo_4-IbRCo_4/MLRCo-4MbRCo_4 are good candidates to be the W previously described squash gene.
The rind color of mature fruits was more variable than the immature fruit color, as during the ripening process yellow and orange colors develop in some fruits (Fig. 6b). Two QTLs were found involved in the redness (a parameter) variation (MaRCo_4 50.4 cM, CP32_scaffold000022_ 598182-CP32_scaffold000022_1004798) and in the yellowness (b parameter) variation (MbRCo_19, 52.1 cM, CP32_scaffold000005_1580923-CP32_scaffold000005_2422557). The major QTL MaRCo_4 explained most of the variation found in MaRCo (R2 = 18.6%), with the Zucchini alleles resulting in fruits with more orange color in rinds in all environments (MaRCo 0,88 versus −2,6 for Zucchini and Scallop genotypes) (Additional file 5). The QTL MbRCo_19 (R2 = 8.6%) resulted in fruits with rind color variable for the yellowness trait (MbRCo 22.2 versus 16.9 for Zucchini and Scallop genotypes). MbRCo_19 colocalized with the major QTL explaining variation in mature flesh color discussed below. Two additional minor (R2 < 5%) QTLs affecting rind lightness were detected in LG 1 and 2, MLRCo_1 and MLRCo_2 with Zucchini genotypes having darker fruits than Scallops, but with a significant environment and G x E interaction (Additional file 5).
The genetic control of external fruit color has been investigated in melons. Recently, Feder et al.  identified a Kelch domain-containing F-box protein regulating naringenin chalcone accumulation in melon rind producing the change from white to yellow rind. This gene (MELO3C011980, annotated as similar to F-box/kelch-repeat protein At1g23390) colocalizes with QTLs involved in the variation of external color in melons [72, 73]. In C. pepo, we found two genes annotated as F-box/kelch-repeat protein (Additional file 6) in the LOD peak regions of the MaRCo_4 QTL (CP32_ scaffold000022_600093-603383 and 624476-611084, best nr hit F-box/kelch-repeat protein At1g55270-like and At2g44130-like respectively). Also a Zeaxanthin epoxidase (ZEP) gene (CP32_ scaffold000022_750677-751766), an enzyme known to be involved in carotenoid metabolism , was found in that region. Also in the region of the QTL MbRCo_19 is annotated a Phytoene synthase (PSY) enzyme (CP32_scaffold000005_2373613-2377294), known to be involved in the carotenoids biosynthesis (Additional file 6).
Our data suggest that flesh color in mature fruits is controlled by two major regions in LG 19. MbFCo_19 (R2 = 62.9%) (52.1 cM, CP32_scaffold000005_1580923-CP32_scaffold000005_2422557) and MaFCo_19 (R2 = 18.9%) (38.8 cM, CP32_scaffold000005_2532610-CP32_scaffold000005_2907973). The major QTL controlling flesh color, MbFCo_19, corresponds to the QTL for mature fruit flesh color found previously in the F2 map in LG 16 (MbFCo_16) (Fig. 1). Zucchini alleles in this regions, MbFCo_19, resulted in light orange fleshed fruits (Fig. 7), with higher b values than those found in white-fleshed fruits from RILs with Scallop alleles (MbFCo 24.1 versus 14.8). The effect of MaFCo_19 (MaFCo −0.53 versus −2.77) was similar although less pronounced (Additional files 4 and 5), and significant E and G x E effects (Additional file 5). In melon, flesh color is controlled by two major genes: green flesh (gf)  and white flesh (wf) . The gen wf has been recently isolated, corresponding to a close homolog of the Cauliflower OR (Orange) protein with the capacity of inducing β-carotene accumulation . The C. pepo ortholog of this gene is located in CP32_scaffold00085-628461. The function of OR is to induce the differentiation of plastids into chromoplasts for carotenoid accumulation. This protein contains a Cysteine-rich zinc finger domain that is highly specific to DnaJ-like molecular chaperons. It is possible that OR works in association with a DnaJ-like protein to bind to proteins specific for the plastid differentiation/division. Underlying the QTLs for flesh color MbFCo_19 and MaFCo_19 there are several DnaJ-like proteins, along with several enzymes of the carotenoid biosynthesis pathway, such as one Phytoene synthase (PSY) (CP32_scaffold000005-2373613) and one Carotenoid cleavage dioxygenase (CCD) (CP32_scaffold000005-2201145) (Additional file 6). These genes are good candidates to be the previously described major gen Wf (white flesh), from Scallop, which is dominant over colored flesh .
Other two minor QTLs (R2 < 5%) involved in the variation of flesh color were also detected, although with significant E and G x E effect (Additional file 5) (MaFCo_10, 37.9 cM, CP32_scaffold000009_1923620-CP32_scaffold000009_2317405, and MaFCo_13, 53.1 cM, CP32_scaffold000017_2743863-CP32_scaffold000108_169208) that can act modulating the effect of the major gene, as it has been also reported in melons [72, 73, 78–80].
A high-quality SNP marker collection has been developed for mapping and construction of the first saturated map in the species, with more than 7,000 markers and anchored to the current version of the C. pepo genome by 145 scaffolds. The improvement in the number of markers per LG and the extensive phenotyping of the RIL population, have enabled the detection of 48 QTLs, most of them stable across three environments.
The availability of the C. pepo genome annotation https://cucurbigene.upv.es  has facilitated the identification of candidate genes underlying most of these QTLs, which will allow the knowledge of the underlying processes that give rise to these phenotypic traits. We can highlight the identification of candidate genes underlying the variation of QTLs that explain more than 30% of the variation found in leaf incision, fruit shape, rind and flesh color, traits of evident economic importance, which can be exploited for searching new attractive market products and may also imply and increase of nutritional value.
Amplified fragment length polymorphisms
Analysis of variance
Carotenoid cleavage dioxygenase
Composite interval mapping
- D :
Dark stem (gene)
- Di :
Discoidal fruit (gene)
Ethylene-responsive transcription factor
Flowering locus C protein
Flowering locus T protein
- gf :
Green flesh (gene)
logarithm of odds
- M :
Mottled or silvery leaves (gene)
Minimum allele frequency
OVATE family protein
Quantitative trait loci
Random amplified polymorphic DNA
Recombinant inbreed line
Singl nucleotide polymorphism
Simple sequence repeat
- W :
Weak rind coloration (gene)
- wf :
White flesh (gene)
FAOSTAT 2015. http://www.fao.org/faostat/en/#data/QC. Accessed 1 Apr 2016.
Manzano S, Martínez C, Megias Z, Garrido D, Jamilena M. Involvement of ethylene biosynthesis and signalling in the transition from male to female flowering in the monoecious Cucurbita pepo. J Plant Growth Regul. 2013;32(4):789–98.
Martínez C, Manzano S, Megias Z, Garrido D, Pico B, Jamilena M. Sources of parthenocarpy for Zucchini breeding: relationship with ethylene production and sensitivity. Euphytica. 2014;200(3):349–62.
Decker-Walters DS, Walters TW, Posluszny U, Kevan PG. Genealogy and gene flow among annual domesticated species of Cucurbita. Can J Bot. 1990;68:782–9.
Paris HS, Doron-Faigenboim A, Reddy UK, Donahoo R, Levi A. Genetic relationships in Cucurbita pepo (pumpkin, squash, gourd) as viewed with high frequency oligonucleotide-targeting active gene (HFO-TAG) markers. Genet Resour Crop Evol. 2015;62:1095–111.
Ferriol M, Picó B, Nuez F. Genetic diversity of a germplasm collection of Cucurbita pepo using SRAP and AFLP markers. Theor Appl Genet. 2003;107:271–82.
Formisano G, Roig C, Esteras C, Ercolano MR, Nuez F, Monforte AJ, et al. Genetic diversity of Spanish Cucurbita pepo landraces: an unexploited resource for summer squash breeding. Genet Resour Crop Evol. 2012;59(6):1169–84.
Paris HS. Historical records, origins, and development of the edible cultivar groups of Cucurbita pepo (Cucurbitaceae). Econ Bot. 1989;43:423–43.
Huang S, Li R, Zhang Z, Li L, Gu X, Fan W, et al. The genome of the cucumber, Cucumis sativus L. Nat Genet. 2009;41:1275–81.
Garcia-Mas J, Benjak A, Sanseverino W, Bourgeois M, Mir G, González VM, et al. The genome of melon (Cucumis melo L.). Proc Natl Acad Sci. 2012;109:11872–7.
Guo SG, Zhang JG, Sun HH, Salse J, Lucas WJ, Zhang HY, et al. The draft genome of watermelon (Citrullus lanatus) and resequencing of 20 diverse accessions. Nat Genet. 2013;45(1):51–U82.
Blanca J, Cañizares J, Roig C, Ziarsolo P, Nuez F, Picó B. Transcriptome characterization and high throughput SSRs and SNPs discovery in Cucurbita pepo (Cucurbitaceae). BMC Genomics. 2011;12:104.
Cucurbigene. https://cucurbigene.upv.es. Accessed 1 Apr 2016.
Martínez C, Manzano S, Megías Z, Barrera A, Boualem A, Garrido D, et al. Molecular and functional characterization of CpACS27A gene reveals its involvement in monoecy instability and other associated traits in squash (Cucurbita pepo L.). Planta. 2014;239:1201–15.
Paris HS. History of the cultivar-groups of Cucurbita pepo. In: Janick J, Wiley J, editors. Horticulture Review. New York, USA: John Wiley & Sons; 2000;25:71–170.
Vitiello A, Scarano D, D’Agostino N, Digilio MC, Pennacchio F, Corrado G, et al. Unraveling zucchini transcriptome response to aphids. PeerJ PrePrints. 2016; https://peerj.com/preprints/1635.pdf
Xanthopoulou A, Psomopoulos F, Ganopoulos I, Manioudaki M, Tsaftaris A, Nianiou-Obeidat I, et al. De novo transcriptome assembly of two contrasting pumpkin cultivars. Genomics Data. 2016;7:200–1.
Lee YH, Jeon HJ, Hong KH, Kim BD. Use of random amplified polymorphic DNA for linkage group analysis in an interspecific cross hybrid F2 generation of Cucurbita. J Kor Soc Hortic Sci. 1995;36:323–30.
Brown RN, Myers JR. A genetic map of squash (Cucurbita ssp.) with randomly amplified polymorphic DNA markers and morphological markers. J Am Soc Hortic Sci. 2002;127:568–75.
Zraidi A, Stift G, Pachner M, Shojaeiyan A, Gong L, Lelley T. A consensus map for Cucurbita pepo. Mol Breed. 2007;20:375–88.
Gong L, Pachner M, Kalai K, Lelley T. SSR-based genetic linkage map of Cucurbita moschata and its synteny with Cucurbita pepo. Genome. 2008;51:878–87.
Gong L, Stift G, Kofler R, Pachner M, Lelley T. Microsatellites for the genus Cucurbita and an SSR-based genetic linkage map of Cucurbita pepo L. Theor Appl Genet. 2008;117:37–48.
Esteras C, Gómez P, Monforte AJ, Blanca J, Vicente-Dólera N, Roig C, et al. High-throughput SNP genotyping in Cucurbita pepo for map construction and quantitative trait loci mapping. BMC Genomics. 2012;13:80.
Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, et al. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE. 2011;6(5):e19379.
Poland JA, Rife TW. Genotyping-by-sequencing for plant breeding and genetics. Plant Genome J. 2012;5(3):92–102.
Takuno S, Terauchi R, Innan H. The power of QTL mapping with RILs. PLoS ONE. 2012;7(10):e46545.
GBS barcode splitter. https://sourceforge.net/projects/gbsbarcode/. Accessed 1 Apr 2016.
Langmead B, Salzberg S. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.
Garrison E, Marth G. Haplotype-based variant detection from short-read sequencing. ArXiv preprint. 2012; arXiv:1207.3907.
da Silva Pereira G, Di Cassia Laperuta L, Nunes ES, Chavarría L, Pastina MM, Gazaffi R, et al. The sweet passion fruit (Passiflora alata) crop: genetic and ghenotypic parameter estimates and QTL mapping for fruit traits. Tropical Plant Biol. 2016; DOI 10.1007/s12042-016-9181-4.
Broman KW, Wu H, Sen S, Churchill GA. R/qtl: QTL mapping in experimental crosses. Bioinformatics. 2003;19(7):889–90.
Taylor J, Butler D. ASMap: Linkage Map Construction using the MSTmap Algorithm. R package version 0.4-5. 2015.
Wu Y, Bhat PR, Close TJ, Lonardi S. Efficient and accurate construction of genetic linkage maps from the minimum spanning tree of a graph. PLoS Genet. 2008;4(10):e1000212.
van Os H, Stam P, Visser RGF, van Eck HJ. SMOOTH: a statistical method for successful removal of genotyping errors from high-density genetic linkage data. Theor Appl Genet. 2005;112:187–94.
Kosambi DD. The estimation of map distances from recombination values. Ann Eugenics. 1943;12:172–5.
Benjamini Y, Yekutieli D. The control of the false discovery rate in multiple testing under dependency. Ann Stat. 2001;29:1165–88.
Voorrips RE. MapChart: Software for the graphical presentation of linkage maps and QTL. J Hered. 2002;93(1):77–8.
Zhang G, Ren Y, Sun H, Guo S, Zhang F, Zhang J, et al. A high-density genetic map for anchoring genome sequences and identifying QTLs associated with dwarf vine in pumpkin (Cucurbita maxima Duch.). BMC Genomics. 2015;16(1):1101.
Hepworth SR, Klenz JE, Haughn GW. UFO in the Arabidopsis inflorescence apex is required for floral-meristem identity and bract suppression. Planta. 2006;223(4):769–78.
Baurle I, Smith L, Baulcombe DC, Dean C. Widespread role for the flowering-time regulators FCA and FPA in RNA-mediated chromatin silencing. Science. 2007;318:109–12.
Zhang W, Fan S, Pang C, Wei H, Ma J, Song M, et al. Molecular cloning and function analysis of two SQUAMOSA-Like MADS-box genes from Gossypium hirsutum L. J Integr Plant Biol. 2013;55(7):597–607.
Tsuda K, Ito Y, Sato Y, Kurata N. Positive autoregulation of a KNOX gene is essential for shoot apical meristem maintenance in rice. Plant Cell. 2011;23(12):4368–81.
Li S. The Arabidopsis thaliana TCP transcription factors: A broadening horizon beyond development. Plant Signal Behav. 2015;10(7):e1044192.
Wanga Y, Henrikssona E, Södermana E, Henrikssona KN, Sundberga E, Engströma P. The Arabidopsis homeobox gene, ATHB16, regulates leaf development and the sensitivity to photoperiod in Arabidopsis. Dev Biol. 2003;264(1):228–39.
Bou-Torrenta J, Salla-Martreta M, Brandtb R, Musielakb T, Palauquic JC, Martínez-García JF, et al. ATHB4 and HAT3, two class II HD-ZIP transcription factors, control leaf development in Arabidopsis. Plant Signal Behav. 2012;7(11):1382–7.
Paris HS, Nerson H, Burger Y. Leaf silvering of Cucurbita. Can J Plant Sci. 1987;67:593–8.
Shifriss O. Further notes on the silvery-leaf trait in Cucurbita. Cucurbit Genet Coop Rep. 1984;7:81–3.
Paris HS, Padley Jr LD. Gene List for Cucurbita species. Cucurbit Genet Coop Rep. 2014;37:1–7.
Young K, Kabelka EA. Characterization of resistance to squash silverleaf disorder in summer squash. Hortscience. 2009;44(5):1213–4.
Knopf RR, Trebitsh T. The female-specific Cs-ACS1G gene of cucumber. A case of gene duplication and recombination between the non-sex-specific 1-aminocyclopropane-1-carboxylate synthase gene and a branched-chain amino acid transaminase gene. Plant Cell Physiol. 2006;47(9):1217–28.
Boualem A, Fergany M, Fernandez R, Troadec C, Martin A, Morin H, et al. A conserved mutation in an ethylene biosynthesis enzyme leads to andromonoecy in melons. Science. 2008;321:836–8.
Boualem A, Troadec C, Kovalski I, Sari MA, Perl-Treves R, Bendahmane A. A conserved ethylene biosynthesis enzyme leads to andromonoecy in two Cucumis species. PLoS One. 2009;4:e6144.
Li Z, Huang S, Liu S, Pan J, Zhang Z, Tao Q, et al. Molecular isolation of the m gene suggests that a conserved-residue conversion induces the formation of bisexual flowers in cucumber plants. Genetics. 2009;182(4):1381–5.
Martin A, Troadec C, Boualem A, Rajab M, Fernandez R, Morin H, et al. A transposon induced epigenetic change leads to sex determination in melon. Nature. 2009;461:1135–8.
Boualem A, Troadec C, Camps C, Lemhemdi A, Morin H, Sari MA, et al. A cucurbit androecy gene reveals how unisexual flowers develop and dioecy emerges. Science. 2015;350(6261):688–91.
Manzano S, Martínez C, Domínguez V, Avalos E, Garrido D, Gómez P, et al. Major gene conferring reduced ethylene sensitivity and maleness in Cucurbita pepo. J Plant Growth Regul. 2010;29:73–80.
Wien HC, Stapleton SC, Maynard DN, McClurg C, Riggs D. Flowering, sex expression, and fruiting of pumpkin (Cucurbita sp.) cultivars under various temperatures in greenhouse and distant field trials. Hortscience. 2004;39(2):239–42.
Peñaranda A, Payan MC, Garrido D, Gómez P, Jamilena M. Production of fruits with attached flowers in zucchini squash is correlated with the arrest of maturation of female flowers. J Hortic Sci Biotech. 2007;82(4):579–84.
Yoo SK, Wu X, Lee JS, Ahn JH. AGAMOUS-LIKE 6 is a floral promoter that negatively regulates the FLC/MAF clade genes and positively regulates FT in Arabidopsis. Plant J. 2011;65(1):62–76.
Na X, Jian B, Yao W, Wu C, Hou W, Jiang B, et al. Cloning and functional analysis of the flowering gene GmSOC1-like, a putative SUPPRESSOR OF OVEREXPRESSION CO1/AGAMOUS-LIKE 20 (SOC1/AGL20) ortholog in soybean. Plant Cell Rep. 2013;32:1219–29.
Shi P, Guy KM, Wu W, Fang B, Yang J, Zhang M, et al. Genome-wide identification and expression analysis of the ClTCP transcription factors in Citrullus lanatus. BMC Plant Biol. 2016;16:85.
Lu H, Lin T, Klein J, Huang S. QTL-seq identifies an early flowering QTL located near flowering locus T in cucumber. Theor Appl Genet. 2014;127(7):1491–9.
Lian G, Ding Z, Wang Q, Zhang D, Xu J. Origins and evolution of WUSCHEL-related homeobox protein family in plant kingdom. Sci World J. 2014;2014:534140.
Costanzo E, Trehin C, Vandenbussche M. Review: Part of a special issue on flower development. The role of WOX genes in flower development. Ann Bot. 2014;114:1545–53.
Chao QM, Rothenberg M, Solano R, Roman G, Terzaghi W, Ecker JR. Activation of the ethylene gas response pathway in Arabidopsis by the nuclear protein ETHYLENE-INSENSITIVE3 and related proteins. Cell. 1997;89(7):1133–44.
Liu J, Van Eck J, Cong B, Tanksley SD. A new class of regulatory genes underlying the cause of pear-shaped tomato fruit. Proc Natl Acad Sci. 2002;99:13302–6.
Monforte AJ, Diaz A, Caño-Delgado A, van der Knaap E. The genetic basis of fruit morphology in horticultural crops: lessons from tomato and melon. J Exp Bot. 2014;65(16):4625–37.
Wang S, Chang Y, Ellis B. Overview of OVATE FAMILY PROTEINS, a novel class of plant-specific growth regulators. Front Plant Sci. 2016;7:417.
Muños S, Ranc N, Botton E, Bérard A, Rolland S, Duffé P, et al. Increase in tomato locule number is controlled by two SNPs located near WUSCHEL. Plant Physiol. 2011;156(4):2244–54.
Pan Y, Bradley G, Pyke K, Ball G, Lu C, Fray R, et al. Network inference analysis identifies an APRR2-like gene linked to pigment accumulation in tomato and pepper fruits. Plant Physiol. 2013;161(3):1476–85.
Feder A, Burger J, Gao S, Lewinsohn E, Katzir N, Schaffer AA, et al. A Kelch domain-containing F-Box coding gene negatively regulates flavonoid accumulation in muskmelon. Plant Physiol. 2015;169(3):1714–26.
Monforte AJ, Oliver M, Gonzalo MJ, Alvarez JM, Dolcet-Sanjuan R, Arús P. Identification of quantitative trait loci involved in fruit quality traits in melon (Cucumis melo L.). Theor Appl Genet. 2004;108:750–8.
Eduardo I, Arús P, Monforte AJ, Obando J, Fernández-Trujillo JP, Martínez JA, et al. Estimating the genetic architecture of fruit quality traits in melon using a genomic library of near isogenic lines. J Amer Soc Hort Sci. 2007;132:80–9.
Nakkanong K, Yang JH, Zhang MF. Carotenoid accumulation and carotenogenic gene expression during fruit development in novel interspecific inbred squash lines and their parents. J Agric Food Chem. 2012;60(23):5936–44.
Hughes MB. The inheritance of two characters of Cucumis melo and their interrelationship. Proc American Soc Horticultural Sci. 1948;52:399–402.
Iman MK, Abo-Bakr MA, Hanna HY. Inheritance of some economic characters in crosses between sweet melon and snake cucumber. I. Inheritance of qualitative characters. Assiut J Ag Sco. 1972;3:363–80.
Tzuri G, Zhou X, Chayut N, et al. A ‘golden’ SNP in CmOr governs the fruit flesh color of melon (Cucumis melo). Plant J. 2015;82:267–79.
Harel-Beja R, Tzuri G, Portnoy V, Lotan-Pompan M, Lev S, Cohen S, et al. A genetic map of melon highly enriched with fruit quality QTLs and EST markers, including sugar and carotenoid metabolism genes. Theor Appl Genet. 2010;121:511–33.
Ramamurthy RK, Waters BM. Identification of fruit quality and morphology QTLs in melon (Cucumis melo) using a population derived from flexuosus and cantalupensis botanical groups. Euphytica. 2015;204:163–77.
Leida C, Moser C, Esteras C, Sulpice R, Lunn JE, de Langen F, et al. Variability of candidate genes, genetic structure and association with sugar accumulation and climacteric behavior in a broad germplasm collection of melon (Cucumis melo L.). BMC Genet. 2015;16:28.
This research was funded by the INIA project RTA2011-00044-C02-1/2 and E-RTA2013-00020-C04-03 of the Spanish Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA) cofunded with FEDER funds (EU).
This work has been carried out in the framework of the INIA projects RTA2011-00044-C02-1/2 and E-RTA2013-00020-C04-03 of the Spanish Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA) cofunded with FEDER funds (EU).
Availability of data and material
Most of the data generated or analyzed during this study are included in this published article and its supplementary information files. The remaining datasets generated are available in https://cucurbigene.upv.es/ and in the corresponding SRA archives of the NCBI indicated in the Results section.
BP, JC and JB designed the study. JMP, JB, and JC performed the bioinformatic analysis. BP, CE, EM and PG developed the RILs population. BP and EM phenotyped the RIL population. AJM contributed to the genetic analysis. BP, JMP, AJM, CE and JC drafted the manuscript. All authors read and approved the final version of the manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Phenotyped traits in the RIL population. (DOCX 13 kb)
Average percentage of genome covered with the GBS reads with a read depth from > 1 to > 20. (PPTX 713 kb)
a) Genetic map developed using the GBS approach with the RIL population derived from the cross Zucchini x Scallop. The genetic and physical position of each of the the 7,718 SNP markers (in the C. pepo genome version 3.2) is indicated and the flanking sequenced of each SNPs is also included. b) Physical position and annotation (in the C. pepo genome version 3.2) of the SNPs that showed statistically significant distorted segregation. Chi-square p-values were corrected for multiple testing using Benjamini and Yekutieli correction . Distorted SNPs are integrated with the 7,718 non-distorted SNPs used for the map construction for which the genetic position is also indicated. c) Main regions with distorted segregation (main genomic blocks with most of the SNPs with distorted segregation). (XLS 4656 kb)
a) LOD peaks of the QTLs identified in the RIL population for vine, flowering and fruit traits. Lines represent the thresholds p 0.01 and 0.05. Results of CIM analysis with a 20 cM windows are shown for each trait. First box show QTL results with data from all the environments and the other three boxes show the QTL results for single environments Paip2014, Paip2015 and UPV2015. b) Means and errors of the phenotype of alternative allelic classes, homozygous Zucchini (AA) and homozygous Scallop (BB), are shown for each QTL, calculated with the full set of data and with the data form each environment separately. c) Genetic correlations (r x,y) between locations for each trait (*p < 0.05, **p < 0.01, ***p < 0.001, ns non-significant). (PPTX 4972 kb)
Phenotypic values of RILs carrying Zucchini and Scallop alleles in the detected QTL regions. Means and standard errors of the phenotypic value of RILs belonging to the alternative allelic classes (Zuchini versus Scallop) for the markers in the LOD peak regions of each significant QTL are shown. Data of the three environments, combined and separate, are shown to validate the effects of the different QTLs (Paip2014, Paip2015, and UPV2015). (DOCX 39 kb)