- Research article
- Open Access
Chromosome level high-density integrated genetic maps improve the Pyrus bretschneideri ‘DangshanSuli’ v1.0 genome
BMC Genomics volume 19, Article number: 833 (2018)
Chromosomal level reference genomes provide a crucial foundation for genomics research such as genome-wide association studies (GWAS) and whole genome selection. The chromosomal-level sequences of both the European (Pyrus communis) and Chinese (P. bretschneideri) pear genomes have not been published in public databases so far.
To anchor the scaffolds of P. bretschneideri ‘DangshanSuli’ (DS) v1.0 genome into pseudo-chromosomes, two genetic maps (MH and YM maps) were constructed using half sibling populations of Chinese pear crosses, ‘Mantianhong’ (MTH) × ‘Hongxiangsu’ (HXS) and ‘Yuluxiang’ (YLX) × MTH, from 345 and 162 seedlings, respectively, which were prepared for SNP discovery using genotyping-by-sequencing (GBS) technology. The MH and YM maps, each with 17 linkage groups (LGs), were constructed from 2606 and 2489 SNP markers and spanned 1847 and 1668 cM, respectively, with average marker intervals of 0.7. The two maps were further merged with a previously published genetic map (BD) based on the cross ‘Bayuehong’ (BYH) × ‘Dangshansuli’ (DS) to build a new integrated MH-YM-BD map. By using 7757 markers located on the integrated MH-YM-BD map, 898 scaffolds (400.57 Mb) of the DS v1.0 assembly were successfully anchored into 17 pseudo-chromosomes, accounting for 78.8% of the assembled genome size. About 88.31% of them (793 scaffolds) were directionally anchored with two or more markers on the pseudo-chromosomes. Furthermore, the errors in each pseudo-chromosome (especially 1, 5, 7 and 11) were manually corrected and pseudo-chromosomes 1, 5 and 7 were extended by adding 19, 12 and 14 scaffolds respectively in the newly constructed DS v1.1 genome. Synteny analyses revealed that the DS v1.1 genome had high collinearity with the apple genome, and the homologous fragments between pseudo-chromosomes were similar to those found in previous studies. Moreover, the red-skin trait of Asian pear was mapped to an identical locus as identified previously.
The accuracy of DS v1.1 genome was improved by using larger mapping populations and merged genetic map. With more than 400 MB anchored to 17 pseudo-chromosomes, the new DS v1.1 genome provides a critical tool that is essential for studies of pear genetics, genomics and molecular breeding.
Pear (Pyrus spp.) is an important temperate fruit crop that is popular worldwide because of its sweet and juicy flesh, excellent eating quality, and nutritional, medicinal and economic value [1, 2]. In the ancient Chinese pharmacopoeia Compendium of Materia Medica (1596), pears were thought to resolve phlegm and relieve cough and asthma. In 2016, the area used for pear cultivation in China exceeded 1.113 million ha and yielded a harvest of 18.70 million tons of fruit (National Bureau of Statistics of the PRC, <http://data.stats.gov.cn/search.htm?s=%E6%A2%A8>, September 26, 2018, data last accessed), which accounted for more than 70% of global pear production (FAOSTAT, <http://www.fao.org/faostat/en/#data/QC>, September 26, 2018, data last accessed).
At present, conventional hybrid breeding is still the main method for generating new pear cultivars. However, the efficiency of this method is low in pear, because of its large tree size, long juvenile phase, and high genome heterozygosity [3,4,5]. The tens of thousands of hybrid trees required for breeding occupy a large area of land for many years and consume significant manpower and financial resources. Molecular marker-assisted selection (MAS) based on Quantitative Trait Locus (QTL) mapping of important agronomic traits allows breeders to screen hybrids at the seedling stage and eliminate those lacking potential [4, 6]. Thus, this practice can reduce the land, labor and financial investment required for crop breeding, especially for perennial fruit trees . Developing molecular markers linked to important agronomic traits is an important part of molecular breeding.
Significant progress has been made in developing and applying genetic markers in pears over the last two decades. Iketani et al. constructed two genetic maps from the segregation data of 82 F1 individuals and mapped the resistance allele of pear scab and the susceptibility allele of black spot in different linkage groups of ‘Kinchaku’ in 2001 . Since then, researchers have used RAPD, SRAP, AFLP, SSR and other molecular markers to construct pear genetic maps and identified QTLs for agronomic traits [8,9,10,11,12]. In recent years, a hybrid population of ‘Bayuehong (BYH)’ and ‘Dangshansuli (DS)’, which was established by the Zhengzhou Fruit Research Institute of the Chinese Academy of Agricultural Sciences, has been used by several research teams to construct genetic maps and map QTLs for dozens of fruit traits, including fruit weight, fruit shape index, soluble solid content, and fruit maturity [3, 5, 6, 13,14,15,16].
Genetic maps constructed using RAPD, AFLP and SSR markers are not ideal for map-based cloning or identification of genes responsible for important traits because of the low density of the mapped markers. Following the application of next generation sequencing (NGS) technology, high density pear genetic maps constructed with SNP markers were used for QTL mapping and map-based gene cloning [6, 17, 18]. For example, Yao et al.  identified the MYB114 gene responsible for the red skin trait using a high-density BYH × DS (BD) map . Furthermore, high-density genetic maps can guide the anchoring of scaffolds to pseudo-chromosomes. Wu et al. anchored 796 scaffolds of 386.7 Mb (or 75.5%) of DS v1.0 genome sequences to 17 pseudo-chromosomes according to a genetic map consisting of 2005 SNP markers . Chagné et al. anchored 868 scaffolds of 171.3 Mb (29.7% of the genome) to the 17 pseudo-chromosomes of Pyrus communis ‘Bartlett’ according to an integrated map consisting of 2279 markers (1391 and 888 apple and pear SNPs, respectively, genetically mapped using seven apple full-sib families and five pear segregating populations) [2, 19, 20]. Recently, Li et al.  anchored another 291.5 Mb of the ‘Bartlett’ genome scaffolds in 17 pseudo-chromosomes using three high-density SNP genetic maps (OH × LBJ, BYH × DS-JXB and PH-CG) with European pear genetic backgrounds, which dramatically increased the anchored portion of the ‘Bartlett’ genome from 29.7% in the original assembly to 50.5%. The assembled genomes of DS and Bartlett provide useful tools for fine mapping of genes/QTLs and map-based gene cloning, and they have laid a solid foundation for future pear genetics and genomics research. However, pseudo-chromosome genome assemblies do not yet exist for ‘Bartlett’ or DS.
Scaffold anchoring of the two published pear genomes was performed using genetic maps constructed with F1 populations that were relatively small in size, which may have resulted in errors and conflicts, some of which were discovered in subsequent analysis [6, 21]. High-density consensus genetic maps constructed by integrating high-quality genetic maps have been used to improve the quality of pear genome assemblies . In this study, we used two relatively large F1 populations of Asian pear to construct genetic maps. By integrating these two new maps with the BD map , we were able to improve the scaffolding of the DS reference genome v1.1 to the chromosome level <http://genedenovoweb.ticp.net:81/pear/>.
Construction of new genetic maps for MTH × HXS and YLX × MTH
A total of 778 and 338 million clean reads were obtained from the MTH × HXS and YLX × MTH populations, yielding more than 92.9 Gb and 46.3 Gb of clean data, respectively. The number of reads per individual plant varied from 0.88 to 3.56 million reads in MTH × HXS and from 1.63 to 3.45 million reads in YLX × MTH. The reads obtained from the parents were as follows: 10.06 million for MTH, 7.13 million for HXS, and 2.96 million for YLX (Additional file 1: Table S1). From these data, 31,423 putative SNP markers were obtained from MTH × HXS, consisting of 13,259 lmxll (42.2%), 12,345 nnxnp (39.3%) and 5819 hkxhk markers (18.5%), whereas 28,491 SNP markers were obtained from YLX × MTH, consisting of 9552 lmxll (33.5%), 13,388 nnxnp (47.0%), and 5551 hkxhk markers (19.5%). After removing unqualified and redundant SNPs, qualified SNPs for MTH × HXS and YLX × MTH were loaded into JoinMap 4.1. Finally, 2606 and 2489 SNP markers were used for final map construction for the MTH × HXS and YLX × MTH populations, respectively (Additional file 1: Table S2).
The statistical data for the genetic maps of MTH × HXS and YLX × MTH are shown in Table 1. The total length of the MTH × HXS map was 1846.6 cM, with an average distance between markers of 0.7 cM and a LG length ranging from 83.0 to 162.9 cM. The total genetic length of the YLX × MTH map was 1667.9 cM, with an average distance between markers of 0.7 cM and a LG length ranging from 74.7 cM to 141.2 cM. The largest gaps of the MH and YM maps were 15.5 cM in LG16 and 10.1 cM in LG17, respectively. Although the average genetic distance between markers is not the smallest, the numbers of LGs with gaps > 10 cM and > 5 cM in the MH and YM maps were three and ten and one and eleven, respectively, which were far fewer than the corresponding numbers in the BD (> 10 cM: 10, > 5 cM: 17) and RM (> 10 cM: 8, > 5 cM: 16) maps with the highest density of markers based on different pear reference genomes. This finding indicates that the markers on the MH and YM maps were more evenly distributed on the LGs.
Integration of three genetic maps and anchoring scaffolds into the integrated map
The DS reference genome scaffolds were anchored to the three genetic maps (Table 2). 2606, 2489 and 3143 unique markers from the MH, YM, and BD maps, corresponding to an average physical marker density of 7.4 markers/Mb, 7 markers/Mb, and 9.3 markers/Mb, respectively, were loaded into ALLMAPS. Although the BD map had the highest marker density, the difference in the number of scaffolds anchored among the three maps was very small, and the proportions of the scaffolds of the MH and YM maps that covered the total genome length were even larger than that of the BD map. The percentage of scaffolds anchored with only one marker was 2.09, 4.77 and 31.22% for MH, YM and BD, respectively, while 97.91, 95.23 and 68.78% of the scaffolds anchored in the MH, YM and BD maps had two or more markers, respectively. This finding confirms that the markers in the MH and YM maps are more evenly distributed across the genome in comparison with the markers in the BD map (Fig. 1).
The MH, YM, and BD genetic maps were integrated to form a consensus MH-YM-BD map using ALLMAPS (Table 3 & Table 4 for a summary, Additional file 1: Table S1 for a detailed list). The consensus map contained a total of 7863 unique markers. With 7757 markers, 898 scaffolds were anchored into 17 pseudo-chromosomes. Scaffolds containing the rest 106 markers, were not anchored into pear pseudo-chromosomes due to a lack of markers or positional conflicts. The anchored scaffolds accounted for 78.8% of the assembled genome size. Of these, 793 scaffolds (88.3% of the anchored scaffolds), making up 74.5% of the total genome length, were anchored with two or more markers, so that it was also possible to determine their orientation on the chromosomes. The unanchored 1284 short scaffolds accounted for only 21.20% of the assembled genome length. The DS genome v1.1 chromosome level sequences in FASTA format are available at our Pear Genomics & Breeding website <http://genedenovoweb.ticp.net:81/pear/>.
Comparison of the consensus map and the three individual genetic maps
ALLMAPS can compute a scaffold ordering that maximizes collinearity across a collection of maps; if there is a conflict within the entered map, ALLMAPS will output the most probable ordering to generate a consistent map. Using the consensus map produced by ALLMAPS as a reference, we compared differences in scaffold order among the three maps and manually checked the errors in each map. For example, LG1 and LG7 (Fig. 2) were not completely separated from each other in the BD map, but they were completely separated in the integrated map, the MH map, and the YM map. LG1 in the BD map was split in two: the smaller part showed collinearity with LG1 of the consensus map, while the larger part, together with LG 7 and a small piece of LG 17, showed collinearity with LG 7 of the consensus map. We therefore integrated the three pieces and re-assigned them to LG7. Furthermore, LGs 1 and 7 in the BD map lacked a long fragment with respect to the consensus map; LG5 in the BD map also looked incomplete, and several parts of LG5 were re-assigned to LG11. According to the MH-YM-BD map, 19, 12 and 14 newly anchored scaffolds were added to Chr1, Chr5 and Chr7, respectively, of the DS v1.1 genome to extend them. Overall, the BD map had a weak correlation with the consensus map, especially on LGs 2, 8, 10, 13, 14 and 16, for which the Spearman rank correlation coefficients to the integrated map LGs were less than 0.9. On the contrary, the MH and YM maps were highly consistent, which helped us to correct the potential error in the BD map and construct a more reliable set of pseudo-chromosomes for the DS v1.1 genome.
Synteny analyses between the DS genome v1.1 and the GDDH13 apple genome
Previous studies have shown that apple and pear have collinear genomes [2, 6]. To further confirm the quality of our newly assembled pear genome, we performed a collinear comparison between the DS genome v1.1 and a previously published GDDH13 apple genome . We used homologous proteins to determine the collinearity of the pear and apple genomes. After Blastp alignment and filtration, 44,840 and 45,110 proteins from pear and apple, respectively, corresponding to 19,454 homologous protein pairs between the two species, were selected and used for the collinearity comparison. There was good one-to-one collinearity between the pseudo-chromosomes of DS genome v1.1 and the GDDH13 apple genome, with high correlation coefficients from 0.9354 to 0.9902 except for pseudo-chromosome 3 (0.8945) (Fig. 3 and Table 5). The overall collinearity between the DS genome v1.1 and the GDDH13 genome was superior to that between the 1st ‘Golden Delicious’ genome and the ‘Bartlett’ genome v1.1 , which indicated that the high-quality genetic maps generated in this study improved the accuracy of the pear genome assembly. In addition to the one-to-one correspondence between the pear-apple homologous chromosome pairs, we also observed collinear blocks among non-homologous pear-apple chromosomes: 1–7, 2–7, 2–15, 3–11, 4–12, 5–10, 6–14, 8–15, 9–17, 12–14 and 13–16. This non-homologous inter-chromosomal synteny between species was consistent with the inter-chromosomal synteny within pear and apple [1, 2, 6, 23], which indicated that these genome-wide duplication events occurred in the common ancestor of pear and apple. This finding was consistent with a previous report that whole-genome duplication events in pear and apple occurred earlier than their species differentiation .
The relocation of Asian pear red/green locus confirmed the accuracy and convenience of the DS v1.1 genome
78,220,374 reads from the red-skinned DNA pool (19.65× depth coverage or 94.74% coverage) and 93,410,888 reads from the green-skinned DNA pool (22.98× depth coverage or 95.15% coverage)  were aligned to the DS v1.1 reference genome and SNPs were identified. After removing low quality SNPs, a total of 4,284,472 SNPs were retained and used to perform the sliding window analysis. The candidate interval of the red/green skin locus was positioned on the fifth chromosome according to the |∆(SNP-index)| plot (Additional file 2: Figure S1), which is consistent with previous results . The candidate interval derived from the threshold line for the top 0.5% in the |∆(SNP-index)| plot was 2.1 Mb, which was separated into two subintervals of 0.9-Mb and 1.2 Mb by a 3.26 Mb gap. The 2.1 Mb interval was comprised of parts of five scaffolds: NW_008988126.1 (779.4 kb, 188,765–968,163 bp of the scaffold), NW_008988141.1 (120.5 kb, 1–120,502 bp of the scaffold), NW_008988091.1 (37.7 kb, 1,156,306–1,193,985 bp of the scaffold), NW_008988243.1 (620.2 kb, 1–620,203 bp of the scaffold) and NW_008988130.1 (542.0 kb, 1–542,018 bp of the scaffold). Four of these five scaffolds were involved in the candidate scaffolds located previously , with the exception of scaffold NW_008988243.1.
With the development of high-throughput sequencing technology and the popularity of simplified genomic sequencing technologies such as RAD , GBS  and SLAF , genetic maps with high (or even ultra-high) marker density have laid a foundation for downstream applications such as QTL mapping and chromosome assembly [27,28,29,30]. Wu et al. constructed a genetic map of 2005 SNP markers in pear using a BYH × DS cross and anchored scaffolds of the DS genome . Subsequently, they improved the genetic map and identified QTLs by increasing the number of offspring from 56 to 102 and increasing the number of markers to 3241 (the BD map) . Montanari et al. successfully mapped 829 polymorphic pear markers and 569 polymorphic apple markers from their newly developed apple and pear Infinium® II 9 K SNP array to the parental genetic maps for five pear segregating populations, and they found that the number of markers per map varied from 318 to 645 . These parental genetic maps were then used, along with maps for seven apple full-sib families , to anchor 171.3 Mb of the assembled ‘Bartlett’ genome into 17 LGs . Recently, Wang et al. built a high-density linkage map with 4865 markers (4664 SNP markers and 201 SSR markers) of the segregating population of ‘Red Clapp’s Favorite’ × ‘Mansoo’ (RM map), spanning 2703.61 cM, with an average distance of 0.56 cM between adjacent markers . To date, the BD and RM maps are the densest pear genetic maps. In this study, we selected only five of the polymorphic parental SNP markers equally distributed on each scaffold to reduce marker redundancy for construction of the MH map and the YM map. This practice did not maximize the number of markers or marker density of these genetic maps, but it led to the generation of maps on which the markers were relatively evenly distributed.
During comparative mapping of multiple genetic maps, some markers may be revealed to be located at different positions of a LG or may even be assigned to different LGs. This phenomenon is observed even between male and female maps from a single outcross population , and it may be caused by segregation distortion of markers. Segregation distortion of markers affects the recombination distance between markers and the order of the markers on LGs [33, 34]. Wang et al. found that two distorted markers (NAUpy59t and ZFRIt051) were able to change the order of other markers on a LG . Differences between the genetic backgrounds of parents in different mapping populations can also lead to differences in marker order [31, 32]. In this study, the parent material for the MH map consisted of ‘Mantianhong’ (P. pyrifolia) and ‘Hongxiangsu’ (P. bretschneideri), whereas the parent material for the YM map consisted of ‘Yuluxiang’ (P. bretschneideri) and ‘Mantianhong’, and the genetic backgrounds of the two populations were relatively similar. However, the parents of the BD map were ‘Bayuehong’ (P. communis) and DS (P. bretschneideri), so their backgrounds were quite different from those of the MH and YM populations [35,36,37,38]. Therefore, the markers of the MH and YM maps have relatively good collinearity between each other compared to that of the markers of the BD map. In addition, the presence of incorrect insertions of contigs in some scaffolds  and large differences in population size may also be responsible for differences in the order of the markers among the different maps analyzed in this study.
The quality of the genome assembly has an important and direct effect on the anchoring of scaffolds into pseudo-chromosomes [6, 39, 40]. Pootakham et al. constructed an integrated linkage map and anchored 28,965 contigs, covering only 12% of the published rubber tree genome, of which 78% of sequences were identified as repetitive DNA, and the average scaffold length was 1.84 kb [41, 42]. Westbrook et al. mapped 3305 loblolly pine scaffolds onto 12 linkage groups using 3762 markers and found that most scaffolds were too short to span two or more markers . In pear, there were 2103 scaffolds (with an N50 scaffold length of 540.8 kb) and 142,083 scaffolds (with an N50 scaffold length of 88.114 kb) in the DS and ‘Bartlett’ genome assemblies, respectively, however, in comparison with the ‘Bartlett’ genome assembly, more sequences of the DS assembly had been anchored into pseudo-chromosomes with fewer markers [1, 2, 6]. Genetic maps are useful tools for guiding scaffold anchoring into pseudo-chromosome assembly, and the quality and density of genetic maps are important factors [6, 39]. In this study, the polymorphic markers of the parents of two populations were first filtered according to their positions before the genetic map was constructed, so the number of mapped markers in the two genetic maps was not the largest among published genetic maps, but the number of anchored scaffolds of these two maps were both close to that of the previously published BD map, and the total length of the anchored scaffolds in each of these two maps exceeded the total length of the scaffold anchored by the BD map (excluding the SSR markers in the BD map). Moreover, in comparison with the BD map, the direction of more than 95% of anchored scaffolds (anchored with two or more markers) can be confirmed by both newly constructed maps, while the direction of only 68.78% of anchored scaffolds can be determined according to the BD map. This finding indicates that the MH and YM maps had fewer redundant markers and higher quality in comparison with the BD map, and thus they are more useful as guides for pseudo-chromosome assembly.
The new DS v1.1 genome is a greatly improved version in comparison with DS v1.0, with particular improvement to pseudo-chromosomes 1 and 7. In this study, seven scaffolds previously anchored to pseudo-chromosome 1 of DS v1.0 were transferred to pseudo-chromosome 7 of DS v1.1. Of the 38 scaffolds anchored to pseudo-chromosome 1 of DS v1.1, 19 were newly anchored in this study. Only 12 of the other 19 scaffolds were also anchored to pseudo-chromosome 1 according to the BD map, while the other seven were scattered among the other pseudo-chromosomes of DS v1.0. Li et al. reported similar findings in a recent study, in which they integrated existing genetic maps and performed scaffold anchoring. They anchored nine new scaffolds on DS pseudo-chromosome 1 and re-anchored a large number of markers located on LG1 of the BD map (BYH × DS-JXB) to pseudo-chromosome 7 of the DS genome . Five of the nine scaffolds (scaffold 341.0, scaffold 467.0, scaffold 638.0, scaffold 797.0 and scaffold 872.0) newly anchored by Li et al. also appeared in the list of 19 newly anchored scaffolds in this study, which demonstrated the reliability of the DS v1.1 assembly. The partially homologous blocks on pseudo-chromosomes 1 and 7 may be the reason that a large number of markers that should have been mapped onto LG7 in the BD map were instead mapped onto LG1 [1, 6, 23]. However, the failure to anchor more scaffolds onto pseudo-chromosome 1 may also be related to the small size of the population that was used to construct the BD map, as well as lethal genes present in chromosome 1, which can cause interspecific hybrid necrosis .
In our previous study, a modified QTL-seq analysis was performed on the red/green fruit skin trait locus of Asian pear at the scaffold level, and the scaffolds linked to the red/green locus were then mapped to LG5 according to the high density BD map [3, 21]. In this study, we directly performed chromosome-level QTL-seq analysis using the new DS v1.1 genome assembly and located the red/green locus in an interval consistent with that reported in previous studies [18, 21], which greatly reduced the workload required to map scaffolds to linkage groups and supported the accuracy of DS v1.1. The peak of the candidate interval in the |∆(SNP-index)| plot (Additional file 2: Figure S1) was split into two parts by a 3.26 Mb gap because most part of each of the three scaffolds (NW_008988076.1, NW_008988141.1 and NW_008988091.1) within the gap were unlinked to the R/G locus . As the three individual genetic maps are highly consistent in the entire region of LG5 (Fig. 2), we were able to rule out the possibility of assembly error regarding the genetic maps. In short, although there were small errors in some scaffolds, the DS v1.1 genome assembly provides a foundation for studies aimed at locating important agronomic traits at the chromosome level.
In this study, two sets of maps, MH and YM, were constructed and further merged with a previously published BD map to build a new integrative MH-YM-BD map. Eight hundred ninety-eight scaffolds, of which 88.31% were directionally anchored with two or more markers, were then successfully anchored into 17 pseudo-chromosomes of the DS v1.1 genome according to the MH-YM-BD map, accounting for 78.8% (400.57 Mb) of the assembled genome size. Errors in each pseudo-chromosome were corrected. Pseudo-chromosomes 1, 5 and 7 were extended by 50, 20 and 27.5%, respectively. Seven scaffolds from pseudo-chromosome 1 were transferred into pseudo-chromosome 7 in the newly constructed DS v1.1 genome. The DS v1.1 genome, which has high collinearity with the apple genome and was used to accurately locate the red/green locus of Asian pear, provides a critical tool that is essential for studies of pear genetics, genomics and molecular breeding.
Construction of new genetic maps for MTH × HXS and YLX × MTH
Asian pear cultivars ‘Mantianhong’ (MTH) (P. pyrifolia), ‘Hongxiangsu’ (HXS) (P. bretschneideri) and ‘Yuluxiang’ (YLX) (P. bretschneideri) were grown as parental lines in the orchard of the Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences (ZFRI, CAAS) in Zhengzhou (Henan Province, China). The 345 and 162 hybrid plants resulting from the MTH/HXS cross and YLX/MTH cross, respectively, were used in this study. The hybrids were produced in 2009 and grown at the affiliated experimental orchard of the ZFRI, CAAS in Xinxiang (Henan Province, China). MTH × HXS (MH) and YLX × MTH (YM) genetic maps were constructed using SNP markers derived via genotyping by sequencing (GBS) . Young leaves of each plant were collected in mid-April at the beginning of vegetative growth. The leaves were first frozen in liquid nitrogen and then transferred to a − 80 °C freezer. Total DNA was extracted using the cetyltrimethylammonium bromide method , and GBS libraries were constructed using the two-enzyme modification of the original GBS protocol [25, 46]. One hundred ng of DNA for each plant sample was digested with restriction enzymes EcoRI and NIaIII (New England Biolabs, Ipswich, MA, USA). The digested products were then ligated to 25 pmol of A1 and A2 adapters. The libraries were pooled, size-selected (400–600 bp) on a 1% agarose gel, column-cleaned using a PCR purification kit (NEB), and amplified for 12 cycles using Phusion DNA polymerase (NEB). Average fragment size was estimated on a Bioanalyzer 2100 (Agilent, Santa Clara, CA) using a DNA1000 chip following a second column-cleaning. Library quantification was performed using PicoGreen (Invitrogen, Carlsbad, CA, USA). Pooled libraries were adjusted to 10 nmol and sequenced with PE125 on a HiSeq4000 instrument (Illumina, San Diego, CA). High-quality clean reads were obtained by (1) trimming raw reads, (2) removing reads with > 10% unidentified nucleotides and (3) removing reads with > 50% bases having a low Phred quality score (< 5). The Burrows-Wheeler Aligner  was used to align the clean reads against the scaffolds of the P. bretschneideri DS genome <ftp://ftp.ncbi.nlm.nih.gov/genomes/Pyrus_x_bretschneideri/CHR_Un/>  using ‘mem −k 32−M’, where k is the minimum seed length and M is an option used to mark shorter split alignment hits as secondary alignments. SNP calling was performed on all samples using GATK’s Unified Genotyper 3.3 , and SNPs were filtered using GATK’s Variant Filtration with appropriate parameter settings (-Window 4, −filter ‘QD < 4.0||FS > 60.0||MQ < 40.0, −G_filter ‘GQ < 20′). Variants exhibiting segregation distortion or sequencing errors were discarded. The ANNOVAR Software Tool (Philadelphia, PA, USA)  was used to annotate SNPs in the genome. Polymorphic parental SNP markers were classified into ‘CP’ population segregation patterns, such as lm × ll, nn × np, and hk × hk, in a Mendelian manner . SNP variants outside the sequencing depth range of 5–1500 were considered missing data. All SNPs with missing data for > 10 individuals were removed from the analysis. Markers showing significantly distorted segregation (Chi-square test, P < 0.05), having low integrity (< 95%), or containing abnormal bases were filtered by JoinMap 4.1 . To reduce the number of redundant markers, only five of the SNPs equally distributed on a scaffold were retained for the scaffolds with more than five qualified SNP markers. Next, the qualified SNP markers were used to construct the genetic linkage maps of MH and YM using the Kosambi mapping function in JoinMap 4.1. The LOD value was set to separate most markers into 17 linkage groups (LG), which was consistent with the BYH × DS (BD) map published by Wu et al. .
Construction of high quality pear consensus genetic maps and anchoring of the DS scaffolds
A integrated genetic map for MH, YM and BD was constructed using ALLMAPS software . In the pilot test, we compared the consistency between pairs of maps and found the most consistency between the MH map and YM map. This finding indicated that these two maps had higher reliability in comparison with the other maps. Therefore, for construction of the final integrated map, the weight factor in ALLMAPS was set to 2 for the MH map and YM map, but it was set to 1 for the BD map. The scaffolds were sorted based on the positions of the SNP markers on the three individual maps, after which the scaffolds were integrated to obtain a consensus map. The scaffolds with two or more markers were further defined with regard to their direction in the consensus map, and un-anchored scaffolds were assigned into pseudo-chromosome 0. Consistencies and differences among the three maps (MH, YM and BD) were confirmed by visual evaluation of collinearity.
Synteny of the new version of the pear genome with the GDDH13 apple genome
We downloaded the GDDH13 doubled-haploid apple genome  sequence from NCBI <ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/002/114/115/GCA_002114115.1_ASM211411v1/> and obtained the sequences of protein-coding genes that were homologous to pear genes. These homologous genes were used to search all of the protein sequences of pear and apple using the best reciprocal hit BLAST strategy of the Blastp software (ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+), setting the lowest threshold for e < 1 × 10E-7. The one-to-one correspondence between homologous pear and apple chromosomes was determined using the homologous information of each pair of genes. The collinearity diagram for apple and pear was plotted using in house R script based on the physical positions of the homologous genes on homologous chromosomes.
QTL-seq analysis for the red/green locus of Asian pear to verify the accuracy of the DS v1.1 genome
We directly used previous resequencing data of the red- and green-skinned pools for the QTL-seq analysis. The procedures used for clean read alignment, variant calling, and annotation were the same as those described in a previously published study . Seventeen pseudo-chromosome sequences of the newly assembled DS genome were used as the reference sequences to calculate the SNP-index, Δ(SNP-index) and |Δ(SNP-index)| between the red- and green-skinned pools [21, 52]. Sliding window analysis was performed on the 17 newly assembled DS pseudo-chromosome sequences and applied to the SNP-index and |∆(SNP-index)| plots with 1-Mb windows and 20-kb increments. To avoid the ‘pseudoexchange effect’ in heterozygous crops, |Δ(SNP-index)| was used instead of Δ(SNP-index) as the main parameter to identify the target phenotype . The top 0.5% of the highest |Δ(SNP-index)| intervals were selected as candidate gene intervals .
Amplified fragment length polymorphism
BYH × DS
MTH × HXS
Next generation sequencing
Quantitative trait locus
Random amplified polymorphic DNA
Red Clapp’s Favorite × Mansoo
Single nucleotide polymorphism
Sequence-related amplified polymorphism
Simple sequence repeat
YLX × MTH
Wu J, Wang Z, Shi Z, Zhang S, Ming R, Zhu S, Khan MA, Tao S, Korban SS, Wang H. The genome of the pear (Pyrus bretschneideri Rehd.). Genome Res. 2013;23(2):396–408.
Chagne D, Crowhurst RN, Pindo M, Thrimawithana A, Deng C, Ireland H, Fiers M, Dzierzon H, Cestaro A, Fontana P. The draft genome sequence of European pear (Pyrus communis L. 'Bartlett'). PLoS One. 2014;9(4):e92644.
Wu J, Li LT, Li M, Khan MA, Li XG, Chen H, Yin H, Zhang SL. High-density genetic linkage map construction and identification of fruit-related QTLs in pear using SNP and SSR markers. J Exp Bot. 2014;65(20):5771–81.
Xue H, Zhang P, Shi T, Yang J, Wang L, Wang S, Su Y, Zhang H, Qiao Y, Li X. Genome-wide characterization of simple sequence repeats in Pyrus bretschneideri and their application in an analysis of genetic diversity in pear. BMC Genomics. 2018;19(1):473.
Chen H, Song Y, Li L-T, Khan MA, Li X-G, Korban SS, Wu J, Zhang S-L. Construction of a high-density simple sequence repeat consensus genetic map for pear (Pyrus spp.). Plant Mol Biol Report. 2014;33(2):316–25.
Li L, Deng CH, Knabel M, Chagne D, Kumar S, Sun J, Zhang S, Wu J. Integrated high-density consensus genetic map of Pyrus and anchoring of the 'Bartlett' v1.0 (Pyrus communis) genome. DNA Res. 2017;24(3):289–301.
Iketani H, Abe K, Yamamoto T, Kotobuki K, Sato Y, Saito T, Terai O, Matsuta N, Hayashi T. Mapping of disease-related genes in Japanese pear using a molecular linkage map with RAPD markers. Breed Sci. 2001;51(3):179–84.
Dondini L, Pierantoni L, Gaiotti F, Chiodini R, Tartarini S, Bazzi C, Sansavini S. Identifying QTLs for fire-blight resistance via a European pear (Pyrus communis L.) genetic linkage map. Mol Breed. 2004;14(4):407–18.
Le Roux P-MF, Christen D, Duffy B, Tartarini S, Dondini L, Yamamoto T, Nishitani C, Terakami S, Lespinasse Y, Kellerhals M. Redefinition of the map position and validation of a major quantitative trait locus for fire blight resistance of the pear cultivar ‘harrow sweet’ (Pyrus communis L.). Plant Breed 2012;131(5):656–664.
Sun W, Zhang Y, Zhang X, Le W, Zhang H. Construction of a genetic linkage map and QTL analysis for some growth traits in pear. J Plant Genet Resour. 2009;10(2):182–9.
Dondini L, Pierantoni L, Ancarani V, D’Angelo M, Cho KH, Shin IS, Musacchi S, Kang SJ, Sansavini S. The inheritance of the red colour character in European pear (Pyrus communis) and its map position in the mutated cultivar ‘max red Bartlett. Plant Breed. 2008;127(5):524–6.
Yamamoto T, Terakami S, Takada N, Nishio S, Onoue N, Nishitani C, Kunihisa M, Inoue E, Iwata H, Hayashi T. Identification of QTLs controlling harvest time and fruit skin color in Japanese pear (Pyrus pyrifolia Nakai). Breed Sci. 2014;64(4):351–61.
Han M, LIu Y, Zheng X, Yang J, Wang L, Wang S, Li X, Teng Y. Construction of a genetic linkage map and QTL analysis for some fruit traits in pear [Chinese]. J Fruit Sci. 2010;27(4):496–503.
Liu J, Cui H, Wang L, Wang X, Yang J, Zhang Z, Li X, Qiao Y. Analysis of pear fruit acid /low-acid trait by SSR marker [Chinese]. J Fruit Sci. 2011;28(3):389–93.
Zhang R, Wu J, Li X, Yang J, Wang L, Wang S, Zhang S. Construction of AFLP genetic linkage map and analysis of QTLs related to fruit traits in pear [Chinese]. Acta Horticulturae Sinica. 2011;38(10):1991–8.
Zhang R, Wu J, Li X, Khan MA, Chen H, Korban SS, Zhang S, An AFLP. SRAP, and SSR genetic linkage map and identification of QTLs for fruit traits in pear (Pyrus L.). Plant Mol Biol Report. 2012;31(3):678–87.
Kumar S, Kirk C, Deng C, Wiedow C, Knaebel M, Brewer L. Genotyping-by-sequencing of pear (Pyrus spp.) accessions unravels novel patterns of genetic diversity and selection footprints. Horticulture research. 2017;4:17015.
Yao G, Ming M, Allan AC, Gu C, Li L, Wu X, Wang R, Chang Y, Qi K, Zhang S. Map-based cloning of the pear gene MYB114 identifies an interaction with other transcription factors to coordinately regulate fruit anthocyanin biosynthesis. Plant J. 2017;92(3):437–51.
Kumar S, Chagne D, Bink MC, Volz RK, Whitworth C, Carlisle C. Genomic selection for fruit quality traits in apple (Malus x domestica Borkh.). PLoS One. 2012;7(5):e36674.
Montanari S, Saeed M, Knabel M, Kim Y, Troggio M, Malnoy M, Velasco R, Fontana P, Won K, Durel CE. Identification of Pyrus single nucleotide polymorphisms (SNPs) and evaluation for genetic mapping in European pear and interspecific Pyrus hybrids. PLoS One. 2013;8(10):e77022.
Xue H, Shi T, Wang F, Zhou H, Yang J, Wang L, Wang S, Su Y, Zhang Z, Qiao Y. Interval mapping for red/green skin color in Asian pears using a modified QTL-seq method. Hortic Res. 2017;4:17053.
Daccord N, Celton JM, Linsmith G, Becker C, Choisne N, Schijlen E, van de Geest H, Bianco L, Micheletti D, Velasco R. High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development. Nat Genet. 2017;49(7):1099–106.
Velasco R, Zharkikh A, Affourtit J, Dhingra A, Cestaro A, Kalyanaraman A, Fontana P, Bhatnagar SK, Troggio M, Pruss D. The genome of the domesticated apple (Malus x domestica Borkh.). Nat Genet. 2010;42(10):833–9.
Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, Selker EU, Cresko WA, Johnson EA. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS One. 2008;3(10):e3376.
Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES. Mitchell SE. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One. 2011;6(5):e19379.
Sun X, Liu D, Zhang X, Li W, Liu H, Hong W, Jiang C, Guan N, Ma C, Zeng H. SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing. PLoS One. 2013;8(3):e58700.
Liu N, Li M, Hu X, Ma Q, Mu Y, Tan Z, Xia Q, Zhang G, Nian H. Construction of high-density genetic map and QTL mapping of yield-related and two quality traits in soybean RILs population by RAD-sequencing. BMC Genomics. 2017;18(1):466.
Yang Z, Chen Z, Peng Z, Yu Y, Liao M, Wei S. Development of a high-density linkage map and mapping of the three-pistil gene (Pis1) in wheat using GBS markers. BMC Genomics. 2017;18(1):567.
Zhang J, Zhang Q, Cheng T, Yang W, Pan H, Zhong J, Huang L, Liu E. High-density genetic map construction and identification of a locus controlling weeping trait in an ornamental woody plant (Prunus mume Sieb. Et Zucc). DNA Res. 2015;22(3):183–91.
Zhou G, Jian J, Wang P, Li C, Tao Y, Li X, Renshaw D, Clements J, Sweetingham M, Yang H. Construction of an ultra-high density consensus genetic map, and enhancement of the physical map from genome sequencing in Lupinus angustifolius. Theor Appl Genet. 2018;131(1):209–23.
Wang L, Li X, Wang L, Xue H, Wu J, Yin H, Zhang S. Construction of a high-density genetic linkage map in pear ( Pyrus communis × Pyrus pyrifolia Nakai) using SSRs and SNPs developed by SLAF-seq. Sci Hortic. 2017;218:198–204.
Wang L, Wang L, Xue H, Li X, Li J. Construction of SSR genetic linkage map and comparison on pears [Chinese]. Sci Agric Sin. 2016;49(12):2353–67.
Lorieux M, Goffinet B, Perrier X, De Leon DG, Lanaud C. Maximum-likelihood models for mapping genetic markers showing segregation distortion. 1. Backcross populations. Theoret Appl Genet. 1995;90(1):73–80.
Lorieux M, Perrier X, Goffinet B, Lanaud C, De León DG. Maximum-likelihood models for mapping genetic markers showing segregation distortion. 2. F 2 populations. Theor Appl Genet. 1995;90(1):81–9.
Teng Y, Tanabe K, Tamura F, Itai A. Genetic relationships of Pyrus species and cultivars native to East Asia revealed by randomly amplified polymorphic DNA markers. J Am Soc Hortic Sci. 2002;127(2):262–70.
Jiang S, Zong Y, Yue X, Postman J, Teng Y, Cai D. Prediction of retrotransposons and assessment of genetic variability based on developed retrotransposon-based insertion polymorphism (RBIP) markers in Pyrus L. Mol Genet Genomics. 2015;290(1):225–37.
Kimura T, Shi YZ, Shoda M, Kotobuki K, Matsuta N, Hayashi T, Ban Y, Yamamoto T. Identification of Asian pear varieties by SSR analysis. Breed Sci. 2002;52(2):115–21.
Bao L, Chen K, Zhang D, Li X, Teng Y. An assessment of genetic variability and relationships within Asian pears based on AFLP (amplified fragment length polymorphism) markers. Sci Hortic. 2008;116(4):374–80.
Tang H, Zhang X, Miao C, Zhang J, Ming R, Schnable JC, Schnable PS, Lyons E, Lu J. ALLMAPS: robust scaffold ordering based on multiple maps. Genome Biol. 2015;16:3.
Pop M, Kosack DS, Salzberg SL. Hierarchical scaffolding with Bambus. Genome Res. 2004;14(1):149–59.
Rahman AYA, Usharraj AO, Misra BB, Thottathil GP, Jayasekaran K, Feng Y, Hou S, Ong SY, Ng FL, Lee LS. Draft genome sequence of the rubber tree Hevea brasiliensis. BMC Genomics. 2013;14(1):75.
Pootakham W, Ruang-Areerate P, Jomchai N, Sonthirod C, Sangsrakru D, Yoocha T, Theerawattanasuk K, Nirapathpongporn K, Romruensukharom P, Tragoonrung S. Construction of a high-density integrated genetic linkage map of rubber tree (Hevea brasiliensis) using genotyping-by-sequencing (GBS). Front Plant Sci. 2015;6:367.
Westbrook JW, Chhatre VE, Wu LS, Chamala S, Neves LG, Munoz P, Martinez-Garcia PJ, Neale DB, Kirst M, Mockaitis KA. Consensus genetic map for Pinus taeda and Pinus elliottii and extent of linkage disequilibrium in two genotype-phenotype discovery populations of Pinus taeda. G3 (Bethesda, Md). 2015;5(8):1685–94.
Montanari S, Brewer L, Lamberts R, Velasco R, Malnoy M, Perchepied L, Guerif P, Durel CE, Bus VG, Gardiner SE. Genome mapping of postzygotic hybrid necrosis in an interspecific pear population. Hortic Res. 2016;3:15064.
Doyle JJ. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull. 1987;19:11–5.
Poland JA, Brown PJ, Sorrells ME, Jannink JL. Development of high-density genetic maps for barley and wheat using a novel two-enzyme genotyping-by-sequencing approach. PLoS One. 2012;7(2):e32253.
Li H, Durbin R. Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics. 2009;25(14):1754–60.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–303.
Wang K, Li M, Hakonarson HANNOVAR. Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164.
Wu R, Ma CX, Painter I, Zeng ZB. Simultaneous maximum likelihood estimation of linkage and linkage phases in outcrossing species. Theor Popul Biol. 2002;61(3):349–63.
Van Ooijen JW. Multipoint maximum likelihood mapping in a full-sib family of an outbreeding species. Genet Res. 2011;93(5):343–9.
Takagi H, Abe A, Yoshida K, Kosugi S, Natsume S, Mitsuoka C, Uemura A, Utsushi H, Tamiru M, Takuno S. QTL-seq: rapid mapping of quantitative trait loci in rice by whole genome resequencing of DNA from two bulked populations. Plant J. 2013;74(1):174–83.
We thank Dr. Richard Volz (Hawke’s Bay Research Centre, The New Zealand Institute for Plant and Food Research Limited) for comments on this manuscript. We thank the research sharing service platform of Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences for providing the instruments and technologies.
This work was funded by the Agricultural Science and Technology Innovation Program (CAAS-ASTIP, CAAS-XTCX2016017005), the Earmarked Fund for China Agriculture Research System (CARS-28), the Central Public-interest Scientific Institution Basal Research Fund (1610192017709) and the Science-Technology Project of Henan Province (172102110244).
Availability of data and materials
All data generated or analyzed during this study are included in this published article and its additional files.
Ethics approval and consent to participate
The pear plant samples were obtained from the orchard of the Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences (ZFRI, CAAS) in Zhengzhou (Henan Province, China), and the affiliated experimental orchard of the ZFRI, CAAS in Xinxiang (Henan Province, China). Since these studies did not involve endangered or protected species, no specific permissions were required for this material. The authors declare that the experimental research on plants described in this paper complied with institutional and national guidelines.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. Statistics of sequencing data for the crosses MTH × HXS and YLX × MTH. Table S2. Details of markers localized in the integrated pear consensus MH-YM-BD map. (XLS 1695 kb)
Figure S1. |Δ(SNP-index)| graph derived from the modified QTL-seq analysis of mapping for the red skin trait of Asian pear. The X-axis indicates the position of the 17 pseudo-chromosomes and the Y-axis indicates the |Δ(SNP-index)|. (PPTX 278 kb)
About this article
Cite this article
Xue, H., Wang, S., Yao, J. et al. Chromosome level high-density integrated genetic maps improve the Pyrus bretschneideri ‘DangshanSuli’ v1.0 genome. BMC Genomics 19, 833 (2018). https://doi.org/10.1186/s12864-018-5224-6
- Genetic map