- Research article
- Open Access
Construction of high-resolution recombination maps in Asian seabass
BMC Genomics volume 18, Article number: 63 (2017)
A high-density genetic map is essential for de novo genome assembly, fine mapping QTL for important complex traits, comparative genomic studies and understanding the mechanisms of genome evolution. Although a number of genomic resources are available in Asian seabass (Lates calcarifer), a high-density linkage map is still lacking. To facilitate QTL mapping for marker-assisted selection and genome assembly, and to understand the genome-wide recombination rates, we constructed high density linkage maps using three families and genotyping by sequencing.
A high-density consensus linkage map consisting of 8, 274 markers was constructed based on sex-averaged genetic maps. The genetic maps were then aligned and integrated with the current genome assembly of Asian seabass. More than 90% of the genome contig sequences were anchored onto the consensus genetic map. Evidence of assembly errors in the current genome assembly was identified. A fragment of up to 2.5 Mb belonging to LG14 was assembled into Chr15. The length of family-specific sex-averaged maps ranged from 1348.96 to 1624.65 cM. Female maps were slightly longer than male maps using common markers. Female-to-male ratios were highly variable both across chromosomes within each family and throughout three families for each chromosome. However, the distribution patterns of recombination along chromosomes were similar between sexes across the whole genome. The overall recombination rates were significantly correlated with genome-wide GC content and the correlations were revealed to be stronger in females than in males.
These high-density genetic maps provide not only essential tools for facilitating de novo genome assembly and comparative genomic studies in teleosts, but also critical resources for fine mapping QTL and genome-wide association mapping for economically important traits in Asian seabass.
A high-density genetic map is essential for facilitating genome assembly and examining the accuracy of de novo genome assembly . De novo genome assembly achieved using massively parallel short read sequencing is not perfectly precise due to genome complexity resulting from ancestral vertebrate genome duplications (2R), gene duplications, and occurrences of transposable elements [2, 3]. Particularly, de novo genome assembly in teleosts is much more challenging, because of the additional fish-specific genome duplication event (FSGD or 3R) . A high-density genetic map is particularly useful for determining the genetic basis of complex phenotypic traits, and studying the chromosomal structure variation and genome evolution by comparative genomic approaches . Linkage maps can help identify genomic regions responsible for both economical and adaptive traits in evolutionary biology . With the development of high-throughput SNP genotyping technologies [7, 8], high-density genetic maps become indispensable resources for investigation on genome-wide heterogeneity through the analysis of recombination rates because genomic regions showing higher levels of linkage are commonly exhibiting lower recombination rates [9, 10]. Although genetic maps with low marker density can also be useful in mapping QTL and aiding genome assembly, they are not sufficient to accurately detect the genetic loci and aid high-quality genome assembly. In particular, the architecture of genomic heterogeneity can only be detected at high resolution . Thus, a robust high-density genetic map is considered as indispensable for studies in the genomic era.
Recombination is the most common genetic phenomenon and the basis for multiple biological contexts [12, 13]. It is considered to be associated with most genomic features, such as nucleotide diversity, GC content, gene expression regulations and epigenetic modifications, although the mechanism is still need to be elucidated . Recombination interrupts linkage and thus allows more effective selection on multiple loci so as to generate complex phenotypic traits [9, 15]. However, elevated recombination would produce adverse effects from a perspective of animal breeding. Genomic regions with low recombination, e.g., holding together epistatically interacting alleles and inversions reducing recombination at specific areas, are also of great importance, as these regions are likely associated with selection . Therefore, screening genome-wide recombination signatures and determining recombination hotspots and deserts are critically important issues for genomic studies. Besides in model species, investigations of recombination in non-model species are also needed in order to clarify both the general and species-specific patterns of recombination . Due to the importance in genetic studies, high-density genetic maps have been constructed in a few economically important teleosts , e.g., Atlantic salmon [19–21], Coho salmon , channel catfish  and Japanese flounder , European sea bass , Nile tilapia , rainbow trout  and lake whitefish .
Asian seabass (Lates calcarifer) is one of the most important foodfish species in Southeast Asia and Australia . The total production in these regions reached up to 75,000 t in 2012 . This fish firstly matures as a male at 1–2 years of age and later reverses to a female at approximately 4 years old . Sex reversal is a feature of species without specialized sex chromosomes . Nevertheless, the genetic architecture underlying this interesting biological phenomenon remains to be elucidated. For the past decades, a lot of genomic resources have been developed in this species to facilitate genetic improvement, e.g., hundreds of thousands of genome-wide genetic markers derived from both coding and genomic sequences [31–33], high-density genetic linkage maps [34–36], a BAC-based physical map  and also a chromosomal-level genome assembly . Several QTL responsible for growth-related traits [34, 38, 39] and viral disease resistance [36, 40] have also been identified and used for marker-assisted selection. Nevertheless, most of the linkage maps were constructed using microsatellites and they are of a resolution of less than 4000 markers. Thus, previous linkage maps are likely not enough to obtain confident examination of the accuracy of genome assembly and robust information on genomic heterogeneity of recombination [33, 35], and also to achieve accurate genomic prediction in the selective breeding programs of this species.
In this study, we constructed both family-specific and sex-specific genetic maps of Asian seabass, and generated a high-density consensus map by integration of these maps using high-throughput genotyping-by-sequencing (GBS) technology. These high-density genetic maps were firstly used to examine the accuracy of the de novo genome assembly of this species. In addition, the genome-wide recombination landscapes were investigated between sexes so as to understand the roles of genomic architecture in recombination, particularly for species without specialized sex chromosomes and with a sex reversal life history. Finally, the correlations between genome-wide recombination rates and nucleotide composition were examined to obtain the insights of the roles of genome composition on biological phenomenon. In total, these high density maps, provide essential tools for genomic studies and investigation of the genomic landscapes of teleost in correlation but not limited to sex reversal life history.
Genotyping of genetic markers
After a series of filtering steps for removing samples with low sequence depth, 112, 118 and 118 progeny were retained for families Fam1, Fam2 and Fam3, respectively (Table 1). The average number of QC filtered reads for each progeny was more than 3.79 M. After excluding the markers showing significant segregation distortion, 2259, 3241 and 4025 SNPs were retained for genetic map construction for the three families, respectively. For microsatellites, 127 and 64 were used for linkage mapping for families Fam1 and Fam3, respectively, while for Fam2 no microsatellites were genotyped (Table 1). The number of common markers was relatively small with a proportion from 21.1 to 26.7% throughout families, which was significantly lower than that of the common catalogue loci between families, ranging from 52.9 to 79.0% (Table 1).
Construction of family- and sex-specific genetic maps
Three sex-averaged genetic maps were constructed using 2328, 3175 and 3836 markers for families Fam1, Fam2 and Fam3, with total lengths of 1348.96 cM, 1624.65 cM and 1412.91 cM, respectively (Table 2). The total number of markers with unique positions incorporated into the three family-specific maps was 2, 271 (97.6%), 3, 136 (98.8%) and 3, 686 (96.1%), respectively. The length of linkage group (LG), the number of markers with unique positions and the average marker interval based on the markers with unique positions for each individual LG showed significant variations throughout families. For example, LG1 of Fam1 was 68.67 cM in length with a marker interval of 0.82 cM, while LG1 of Fam2 and Fam3 were 31.40 cM and 23.03 cM in length, with marker intervals of 1.85 cM and 0.21 cM, respectively (Table 2). Detailed information for each sex-averaged genetic map is listed in Table 2. Using the common markers between families, a consensus genetic map consisting of 8, 274 markers was constructed by integration of the family-specific genetic maps, which included 7, 426 markers with unique positions (89.8%). The total length, average marker interval and resolution based on the markers with unique positions were 2546.86 cM, 0.37 cM and 4.52 cM/Mb, respectively. The number of markers in each LG ranged from 172 for LG24 to 499 for LG4 with an average of 345, while the marker interval was from 0.23 cM for both LG13 and LG15 to 0.71 cM for LG20 with an average of 0.31 cM, based on the markers with unique positions. In correlation to the physical map, the resolution of this consensus map ranged from 2.49 cM/Mb for LG1 to 7.65 cM/Mb for LG14, with an average of 4.52 cM/Mb (Table 3). The distribution of markers across LGs is shown in Fig. 1, where the largest marker interval is 11.07 cM in LG19. Estimated with the markers with unique positions, over 92.9% of the marker intervals were less than 1.0 cM, corresponding to 0.28 Mb of genome fragment.
A total of six sex-specific genetic maps were constructed for three families (Additional file 1: Table S1). The total number of markers mapped into these sex-specific maps ranged from 1373 for the Fam1 female map to 2637 for the Fam3 male map, while the total length was from 1246.96 cM for the Fam1 male map to 1422.83 cM for the Fam2 female map. The average marker interval based on markers of unique positions ranged from 0.52 cM for the Fam3 male map to 0.96 cM for the Fam1 female map. Two integrated sex-specific genetic maps were separately constructed based on the common markers of the independent sex-specific maps (Additional file 2: Table S2). The male and female integrated maps consisted of 5, 204 and 4, 771 markers, and had total lengths of 1955.90 cM and 2016.92 cM, respectively. Based on the markers with unique positions, the integrated male and female maps incorporated 4, 698 (90.3%) and 4372 (91.6%) markers, with an average interval of 0.42 cM and 0.46 cM, respectively.
Integration of genetic maps with genome assembly
Integration of the integrated consensus genetic map with the contigs assembly (genome assembly version 2) and scaffold assembly (genome assembly version 3) , revealed that 814 out of 3807 (21.4%) contigs and 259 out of 2964 (8.7%) scaffolds and/or contigs were anchored onto the map, accounting for 560.9 Mb (88.0% of 637.5 Mb, version 2) and 579.6 Mb (90.5% of 640.2 Mb, version 3) of the total length of reference genome, respectively. Integration of genetic map with genome assembly anchored additional 275 contigs with a total length of 19.9 Mb onto all 24 LGs. 7810 (94.4%) genetic loci were consistently identified in expected corresponding linkage groups and chromosomes, while 113 (1.4%) loci showed mismatches between specific LGs and the corresponding chromosomes of the reference genome (Fig. 2).
Genome synteny between chromosomes and LGs for each family-specific genetic map was further used to examine the mismatches. With the exception of the fragments that had no segregated markers for alignment between LGs and chromosomes, the mismatches were verified consistently by all three family-specific genetic maps, and distributed across five chromosomes of Asian seabass : Chr2, Chr8, Chr9, Chr15 and Chr20 (Fig. 3). The largest fragment mismatch was observed for Chr15, where a genome fragment with a length of 2.5 Mb was anchored onto LG14. The most complicated genome syntenic relationships were observed for Chr8 and Chr9. Two fragments of Chr8 were mapped to LG12 and LG23, respectively, while two fragments of Chr9 corresponded to LG12 and LG14, respectively.
Recombination heterogeneity between sexes
The average recombination rate throughout families at the whole genome level showed slight difference, ranging from 2.30 cM/Mb for Fam1 to 2.77 cM/Mb for Fam2 (Table 2). At the chromosome level, the overall recombination rate showed significant variations both across 24 linkage groups within each family (ranging from 1.36 to 3.91 cM/Mb, from 0.97 to 7.57 cM/Mb and from 0.90 to 4.37 cM/Mb for Fam1, Fam2 and Fam3, respectively) and throughout three families for each linkage group (e.g., ranging from 0.90 to 2.67 cM/Mb and from 1.36 to 3.87 cM/Mb for LG1 and LG6, respectively) (Table 2). For sex-specific recombination, we found the overall female-to-male ratios (F: M) throughout three families were not significantly deviated from 1, ranging from 0.94 for Fam3 to 1.09 for Fam2. The female genetic maps were slightly longer than the male ones for both Fam1 (F: 1303.98 cM vs M: 1246.96 cM) and Fam2 (F: 1422.83 cM vs M: 1308.23 cM), whereas the male map was slightly longer than the female map for Fam3 (M: 1332.97 cM vs F: 1249.26 cM). For every linkage group, we observed significant differences in female-to-male (F: M) ratios throughout families and also significant differences in the number of markers between sexes within each family, e.g., LG11 in Fam1 and LG19 in Fam2 (Additional file 1: Table S1). However, due to the markedly different number and sets of genetic markers used for individual map construction, we could not directly compare the recombination rates between sexes. For this reason, all pairs of the common markers shared between sexes for every family were used to estimate and compare the recombination ratios. We observed the overall recombination rates were slightly higher in females than in males throughout all three families (overall F: M, 1.08; P < 0.001, paired t-test; Fig. 4). However, using common markers, we found 18 out of 24 linkage groups did not consistently deviate from F: M = 1 throughout the three families (Additional file 3: Figure S1). The distribution patterns of recombination along chromosomes between sexes were in good agreement for most of the chromosomes within each family (Fig. 5). Although significant differences in the distribution patterns of recombination can be detected in some LGs, e.g., LG2, LG5, LG6, LG10 and LG19 (P < 0.001; Fig. 5), such differences are most likely caused by the variations of the number of markers within these linkage groups (Additional file 1: Table S1).
Correlations between recombination and GC content
In comparison to sex-averaged genotype segregation that includes influences from both sexes, genomic sequence features are more directly associated with sex-specific segregation events. Therefore, correlations between genome-wide recombination rates and GC content were firstly evaluated using sex-specific genetic maps. We observed a significantly positive association between recombination rates and GC content for each sex-specific genetic map throughout all families (Table 4). Interestingly, we detected that the correlations were significantly stronger in females than in males (P = 0.02, Chi-squared test). The locus intervals with GC content of < 40% showed higher recombination rates in males than in females, while locus intervals showing GC content of > 40% had higher recombination rates in females than in males (Additional file 4: Figure S2). However, we did not find any difference in GC content between the chromosomes that showed consistent F: M ratios (>1 or < 1), e.g., LG5, LG6, LG15, LG16_22, LG18 and LG24, and the other chromosomes throughout the three families (Additional file 3: Figure S1).
In this work, genetic maps were computed for multiple families using both GBS based SNPs and microsatellites. A consensus genetic map of the highest resolution to date in Asian seabass was constructed. This strategy for construction of high-density integrated genetic map has been applied in some fish species, e.g., common carp  and channel catfish . The genetic map will be useful for a wide range of genomic studies.
With the availability of different types of high-throughput SNP genotyping platforms, construction of a reliable high-density genetic map is of great importance for downstream studies, such as genome-wide association mapping for complex traits and accurate comparative genomics [7, 35]. Nevertheless, issues from missing genotypes and genotyping errors are still considerations for all the current available SNP genotyping approaches, as they could lead to incorrect ordering of genetic markers in linkage mapping and also inaccurate estimation of recombination in genetic maps . Compared to the other platforms, e.g., SNP arrays, GBS particularly suffers more from the above disadvantages [7, 43]. Thus, it is critical to obtain accurate and effective data sets in this study to construct confident high-density genetic maps.
As revealed in our previous studies using GBS, construction of genetic maps was achievable [32, 35]. Here, we adapted not only much stricter filtering criteria for SNP genotyping, but also more data control processes so as to generate high confidence genetic maps. Firstly, reference based sequence alignment and higher sequence coverage were applied to increase the accuracy of SNP recovery . We only retained the samples of high genotyping quality in each family. Moreover, loci with any evidence of Mendelian distortion, low call rates and being from paralogs that tend to cause errors were removed . In addition, we applied a strategy of using multiple families for cross validation of the mapping results. As revealed by our data, 8274 (94.6%) among all the 8746 loci were successfully mapped into the integrated consensus genetic map, suggesting high confidence of the results of linkage mapping throughout all families. Last but not least, microsatellites with relatively few missing genotypes (< 5%) were also used for linkage mapping. Throughout all constructed genetic maps, these loci showed little deviation between relative genetic positions and physical positions, also indicating that linkage mapping based on GBS in this study was robust. Nevertheless, it was also evident that the number of common markers shared among families was relatively smaller in GBS studies, compared to both microsatellites  and SNP chip based studies [23, 46]. The low number of common markers was caused by both the non-informative markers between families and the RAD loci none identified in all families. The proportion of common markers, to some extent, could be a challenge for integrating genetic maps, although the influence remains to be elucidated.
The high-density consensus genetic map had a total length of 2546.86 cM, showing non-significant difference with the multi-family based genetic map (2411.5 cM) of Asian seabass. As the densest map ever constructed for Asian seabass, it incorporated 8274 markers with an average marker interval of 0.31 cM and covered more than 90.5% of the reference genome. More than 92.9% of pairwise locus distances of unique positions within this map were < 1.0 cM (~ 0.28 Mb). In comparison, the genetic map of the highest resolution (3321 SNPs and 0.52 cM) constructed in previous studies covered 84.6% of the reference genome, with 83.8% of pairwise locus distances within 1.0 cM (~ 0.41 Mb) . Thus, the current genetic map is a substantial improvement to previously published maps in Asian seabass [34, 35] and provides an excellent resource for fine mapping QTL in Asian seabass. For individual family-specific maps, the total length varied from 1348.96 cM to 1624.65 cM, which also showed no difference from a previous map based on GBS with a total length of 1577.67 cM . Similarly, there was also no difference for the length of sex-specific maps between this study and previous studies [34, 35]. All these data suggest robustness of the genetic maps in this study. However, the order of markers in the genetic maps should be treated with caution because the meiosis or the population size in this study might be not enough to accurately order all the available markers. Therefore, the ordering of contigs from the reference genome should be examined in future studies. In QTL studies, a high density genetic map is indispensable. Nevertheless, it also should be noted that the number of crossovers or individuals used for mapping is another critical issue influencing the QTL mapping resolution, which is particularly important in mapping complex traits .
Applications of the recombination map in genome assembly
All SNPs used for mapping were generated by alignment against the reference genome , thus the high-density genetic maps showed particular importance in facilitating and verifying the assembly of the whole genome and in comparative genomic studies of Asian seabass . The genome assembly was integrated with the consensus linkage map using both SNP and microsatellite loci information. More than 90% of contig sequences (579.6 Mb) were mapped onto the genetic map, which was slightly higher than that in the aquaculture species, European seabass (575 Mb, 86%)  and channel catfish (867.4 Mb, ~ 90%) . The integration assigned additional 275 contigs into 24 LGs. Although we observed a rather high level of consistency or synteny between chromosome assembly and the genetic map, 1.4% of loci in the genetic map were identified matching to five non-corresponding chromosomes. These mismatches were further verified and were confirmed by the consistent results of integrations between chromosome assembly and each of the three family-specific maps. All these data possibly suggest errors of assembly at least for five chromosomes (Chr2, Chr8, Chr9, Chr15 and Chr20) in the current genome assembly of Asian seabass . Interestingly, all genetic maps in this study consistently suggest that a fragment of up to 2.5 Mb that was possibly from Chr14 was mistakenly assembled into Chr15. All these possible assembly errors should be noted in the future fine-scale mapping of QTL and high-resolution comparative genomic studies. Actually, it is almost impossible to avoid assembly errors in genome sequencing projects, particularly using massively parallel short read sequencing, due to the complexity of genomes . In teleosts, the genome has experienced duplication two to three times [4, 50]. The occurrences of repetitive fragments, resulting from retention of duplicate genes and transposons, significantly increase the difficulties of assembling an accurate genome . The accuracy of de novo genome assembly is still a major issue that needs to be improved in the genomic era . Here, the integration of genetic map with reference genome provides excellent resources for comprehensive comparative genomic analyses, fine mapping QTL, and even positional cloning of candidate genes.
Patterns of sex-specific recombination
Overall, the sex-specific recombination rates were revealed to be highly variable both across 24 chromosomes within each family and throughout three families for each chromosome, although the female maps presented slightly higher recombination rates than the male maps as revealed using common markers, in Asian seabass, a species without heteromorphic sex chromosomes . The average recombination rates ranged from 2.4 cM/Mb to 2.8 cM/Mb throughout three families, comparable to some representative aquaculture species: Atlantic salmon (~ 2.6 cM/Mb) , channel catfish (~ 3.5 cM/Mb)  and bighead carp (~ 1.8 cM/Mb) . Slight differences in recombination rates within species can be partially explained by the genome sequence variations (e.g., structural and copy number variations) and composition heterogeneity (e.g., GC content) [14, 17, 20].
Sex-specific recombination is a common biological phenomenon and is widely studied in different biology systems [52–55]. For species with chromosomal mechanisms of sex determination, the homogametic sex typically has relatively higher level of recombination than the heterogametic sex [14, 23, 53, 54]. Although sex-specific recombination is also common in species without chromosomal mechanisms of sex determination [17, 19, 56, 57], the comprehensive patterns for specific species are so far still not clear. Here, Asian seabass, lacking specialised heteromorphic sex chromosomes, was also observed having a differential pattern of sex-specific recombination. The overall pattern also showed a slightly female biased recombination (F: M, 1.08), which was similar to species both with specialised heteromorphic sex chromosomes, such as channel catfish , chicken , pig  and mouse  (F: M, ~ 1.26–1.73), and without heteromorphic sex chromosomes, such as Atlantic salmon [19, 20], grouper , zebrafish  and oyster  (F: M, ~ 1.03–2.74). This result was consistent with our previous study in Asian seabass using GBS , but different from another study using microsatellites , which showed a slightly male biased recombination. As revealed in this study, the overall female- and male-specific recombination rates were only slightly different. Thus, differences in marker density for specific LGs are likely to have significant influence on the overall recombination rates when using low-density genetic maps . Besides, different markers used in comparisons can also lead to biased results, particularly in species with slightly different sex-specific recombination rates . For example, Fam3 showed slightly higher recombination in males using the whole data set. However, when using only common markers, we observed a significant female biased recombination (F: M, 1.06). Here, our data also suggest that adequate marker coverage and common markers between sexes are indispensable to evaluating sex-specific recombination in future studies .
Study on the sex-specific recombination across chromosomes for species without differentiated sex chromosomes and with a sex reversal life history is of particular interest for understanding the mechanisms for biological features associated with meiosis and sex determination systems of these species [17, 61]. Here, we found the distribution patterns of recombination along chromosomes between sexes were in high agreement for most of the chromosomes within each family, e.g., LG1, LG4 and LG7_1. Such sex-specific distribution pattern of recombination was similar to that found in a previous study in pig , but extensively different from that in oyster, a species also without differentiated sex chromosomes . Nevertheless, the chromosomes both showing good and none agreement in the distribution patterns of recombination along chromosomes between sexes, did not consistently and significantly deviate from F: M = 1 throughout three families. This was different from some species with specialised heteromorphic sex chromosomes, e.g., pig . Thus, the level and the distribution patterns of recombination might be highly diverse among organisms [14, 17, 23]. It likely suggests that sex-specific recombination is associated with both universal and species-specific mechanisms, which need to be studied using ultra-resolution genetic maps.
Sex-specific recombination is suggested to be associated with genome-wide signature of GC content [61, 62]. Consistent with the other studies in mammals, birds, invertebrates, yeast and plants [9, 63–65], we also observed a significant correlation between GC content and recombination rates for both sexes. Interestingly, the correlation was significantly stronger in females than in males, which was consistent with the pattern that females have a slightly higher overall combination rates than males. Taking into consideration that the studied organism is a species with the feature of sex reversal, these results mean that the recombination rates would change across the genome for one individual at different life history stages. Thus, it is much likely that the pattern of recombination is not determined directly by genomic factors, but is due to the imprints resulting from sex determination and differentiation as discussed above [62, 66]. Studies on genome-wide recombination still have great importance in comparative genomics for various species .
We constructed a high-density genetic map with 8274 markers using multiple families and a GBS approach. Some misassembles in the current genome assembly of Asian seabass were identified using high-density linkage maps. The recombination rates were highly variable both across chromosomes within each family and for each chromosome throughout families. The overall recombination rates were slightly higher in females than in males. Most of the chromosomes showed a good agreement in the distribution pattern of recombination along chromosomes between sexes. In addition, the recombination rates were significantly correlated to the genome-wide GC content for both sexes with an evidence of female bias. These data provide critical genomic resources for genome assembly, mapping QTL for economically important traits to accelerate genetic improvement in Asian seabass and comparative genomic analysis.
Mapping populations and genotyping-by-sequencing
Three full-sib families were used for construction of genetic maps. In detail, Fam1 was a backcross generated by a F2 male offspring crossing its F1 female parent , while both Fam2 and Fam3 were independent F2 populations from two different parents set up as described in our previous study . For each family, two parents and 192 progeny randomly selected and genotyped with microsatellites and genotyping-by-sequencing using the ddRADseq approach .
Genomic DNA was isolated from fin tissue using the salt precipitation method . 149 microsatellites, almost evenly covering the genome, were genotyped according to a previous study . GBS libraries were constructed according to our previous method . In brief, 300 ng genomic DNA was digested with PstI-HF and MspI restriction enzymes (New England Biolabs, USA) and was then ligated with adaptors using T4 ligase (New England Biolabs, USA). The ligation products were pooled for size selection of 300–500 bp by running gels, after clean up with QIAquick PCR Purification Kit (Qiagen, Germany). The libraries were enriched using PCR with Phusion® High-Fidelity DNA Polymerase (New England Biolabs, USA). After a final clean up using QIAquick PCR Purification Kit (Qiagen, Germany), ddRADSeq libraries were sequenced using a NextSeq 500 platform (Illumina, USA) for either paired-end (2 × 150 bp) or single-end (1 × 150 bp).
Raw reads processing and SNP genotyping were conducted using the software package Stacks v1.34 . Reads were trimmed to 120 bp and those with any uncalled base were removed. QC filtered reads were aligned against the reference genome of Asian seabass  using the program BWA with a maximum of two mismatches . Alignments with multiple genome targets were excluded from further analysis. Reference aligned reads were used for stacks assembly using pstacks implemented in the package Stacks v1.34 . The stacks assembled for the parents of the three families were used to construct a catalogue with the program cstacks Stacks v1.34 . The catalogue of loci was then used as reference for SNP discovery and genotyping for each family with the program populations implemented in Stacks v1.34 . SNP filtrations were conducted similarly to our previous study with some modifications . In brief, RAD tags with any SNP of > 2 alleles and showing heterozygosity of > 0.5 were removed . A minimum of 10 × sequence depth and a genotyping success of > 85% of the individuals in each family were used for SNP genotyping. In order to reduce the number of missing genotypes, samples with low sequencing depth were ruled out for each family. Only one SNP per tag was retained for map construction.
Genetic map construction
All genotypes were used for goodness-of-fit tests for Mendelian segregation distortion using χ2-analysis. Loci that showed any signal of segregation distortion at the significance level of 0.05 were removed. The program JoinMap 4.1  was used for map construction, where an LOD of 10 was used for grouping of nodes. The marker distances in each linkage group were determined using the regression mapping algorithm and the Kosambi mapping function. The pseudotestcross strategy was used for construction of sex-specific genetic maps . It assumes that markers with 1:1 ratio segregation imply one parent is heterozygous while the other is homozygous, thus the male and female genotypes can be obtained. Markers which were heterozygous in the male parent but homozygous in the female parent were used to construct the male genetic map, while markers which were heterozygous in the female parent but homozygous in the male parent were used to construct the female genetic map. Consensus genetic maps were constructed by integrating individual genetic maps according to the genetic positions of the common markers among individual maps using the program MergeMap .
Integration of genetic maps with genome assembly
All mapped microsatellites with flanking sequences were aligned against the genome assembly, including all contigs and scaffolds, of Asian seabass  using BLAST with a cutoff of 1E−10 and a minimal sequence identity of 95%. The syntenic relationships between genetic maps and the genome assembly were constructed and the integration was achieved by anchoring the contigs and scaffolds onto the genetic maps according to marker positions. The program Circos  was used to visualize the genomic synteny and integration between genetic maps and genome assembly. The accuracy of genome assembly was examined by integration of both consensus and individual genetic maps with the genome assembly.
Analysis of recombination heterogeneity
In order to investigate recombination heterogeneity across chromosomes between sexes, recombination rates were estimated for both sex-specific and family-specific genetic maps. Recombination fractions for locus intervals between marker pairs were calculated using the program JoinMap 4.1  and were further mapped along the physical map in cM/Mb. Sex-specific recombination heterogeneity throughout independent chromosomes was estimated based on the locus intervals of common markers between sexes for each individual family. The significance was examined using the goodness of fit tests according to Ott’s method , followed by Bonferroni corrections for multiple comparisons. Due to limited number of common markers, recombination heterogeneity among families was examined by comparing the trends of variations for recombination rates along physical distance. For visual comparisons between sexes and among families, recombination rates for continuous locus intervals were plotted against physical distance along each individual chromosome.
Analysis of genomic features of recombination
The possible correlations between average recombination rates and genome-wide nucleotide compositions, i.e., GC content, were estimated. Calculation of GC content was based on the full reference genome sequences within locus intervals. The correlations were assessed using both the Pearson and Spearman correlation tests. Compared to family-specific maps that involve segregations from both sexes, sex-specific maps are more directly associated with genome sequence features. Therefore, only sex-specific maps were used for these studies.
Quantitative trait locus
Sachidanandam R, Weissman D, Schmidt SC, Kakol JM, Stein LD, Marth G, Sherry S, Mullikin JC, Mortimore BJ, Willey DL. A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature. 2001;409(6822):928–33.
Alkan C, Sajjadian S, Eichler EE. Limitations of next-generation genome sequence assembly. Nat Methods. 2011;8(1):61–5.
Lynch M, Conery JS. The origins of genome complexity. Science. 2003;302(5649):1401–4.
Meyer A, Van de Peer Y. From 2R to 3R: evidence for a fish‐specific genome duplication (FSGD). Bioessays. 2005;27(9):937–45.
Bowers JE, Abbey C, Anderson S, Chang C, Draye X, Hoppe AH, Jessup R, Lemke C, Lennington J, Li Z. A high-density genetic recombination map of sequence-tagged sites for sorghum, as a framework for comparative structural and evolutionary genomics of tropical grains and grasses. Genetics. 2003;165(1):367–86.
Tanksley S, Ganal M, Prince J, De Vicente M, Bonierbale M, Broun P, Fulton T, Giovannoni J, Grandillo S, Martin G. High density molecular linkage maps of the tomato and potato genomes. Genetics. 1992;132(4):1141–60.
Davey JW, Hohenlohe PA, Etter PD, Boone JQ, Catchen JM, Blaxter ML. Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nat Rev Genet. 2011;12(7):499–510.
Duarte J, Rivière N, Baranger A, Aubert G, Burstin J, Cornet L, Lavaud C, Lejeune-Hénaut I, Martinant J-P, Pichon J-P. Transcriptome sequencing for high throughput SNP development and genetic mapping in Pea. BMC Genomics. 2014;15(1):126.
Roesti M, Moser D, Berner D. Recombination in the threespine stickleback genome—patterns and consequences. Mol Ecol. 2013;22(11):3014–27.
Coop G, Wen X, Ober C, Pritchard JK, Przeworski M. High-resolution mapping of crossovers reveals extensive variation in fine-scale recombination patterns among humans. Science. 2008;319(5868):1395–8.
Kong A, Gudbjartsson DF, Sainz J, Jonsdottir GM, Gudjonsson SA, Richardsson B, Sigurdardottir S, Barnard J, Hallbeck B, Masson G. A high-resolution recombination map of the human genome. Nat Genet. 2002;31(3):241–7.
Spencer CC, Deloukas P, Hunt S, Mullikin J, Myers S, Silverman B, Donnelly P, Bentley D, McVean G. The influence of recombination on human genetic diversity. PLoS Genet. 2006;2(9):e148.
Jolley K, Wilson D, Kriz P, McVean G, Maiden M. The influence of mutation, recombination, population history, and selection on patterns of genetic diversity in Neisseria meningitidis. Mol Biol Evol. 2005;22(3):562–9.
Tortereau F, Servin B, Frantz L, Megens H-J, Milan D, Rohrer G, Wiedmann R, Beever J, Archibald AL, Schook LB. A high density recombination map of the pig reveals a correlation between sex-specific recombination and GC content. BMC Genomics. 2012;13(1):586.
Nachman MW. Variation in recombination rate across the genome: evidence and implications. Curr Opin Genet Dev. 2002;12(6):657–63.
Li E. Chromatin modification and epigenetic reprogramming in mammalian development. Nat Rev Genet. 2002;3(9):662–73.
Jones DB, Jerry DR, Khatkar MS, Raadsma HW, Zenger KR. A high-density SNP genetic linkage map for the silver-lipped pearl oyster, Pinctada maxima: a valuable resource for gene localisation and marker-assisted selection. BMC Genomics. 2013;14(1):810.
Yue GH. Recent advances of genome mapping and marker‐assisted selection in aquaculture. Fish Fish. 2014;15(3):376–96.
Lien S, Gidskehaug L, Moen T, Hayes BJ, Berg PR, Davidson WS, Omholt SW, Kent MP. A dense SNP-based linkage map for Atlantic salmon (Salmo salar) reveals extended chromosome homeologies and striking differences in sex-specific recombination patterns. BMC Genomics. 2011;12(1):615.
Gonen S, Lowe NR, Cezard T, Gharbi K, Bishop SC, Houston RD. Linkage maps of the Atlantic salmon (Salmo salar) genome derived from RAD sequencing. BMC Genomics. 2014;15(1):166.
Tsai HY, Robledo D, Lowe NR, Bekaert M, Taggart JB, Bron JE, Houston RD. Construction and annotation of a high density SNP linkage Map of the Atlantic salmon (Salmo salar) genome. G3. 2016. doi:10.1534/g1533.1116.029009.
Kodama M, Brieuc MS, Devlin RH, Hard JJ, Naish KA. Comparative mapping between Coho Salmon (Oncorhynchus kisutch) and three other salmonids suggests a role for chromosomal rearrangements in the retention of duplicated regions following a whole genome duplication event. G3. 2014;4(9):1717–30.
Li Y, Liu S, Qin Z, Waldbieser G, Wang R, Sun L, Bao L, Danzmann RG, Dunham R, Liu Z. Construction of a high-density, high-resolution genetic map and its integration with BAC-based physical map in channel catfish. DNA Res. 2015;22(1):39–52.
Shao C, Niu Y, Rastas P, Liu Y, Xie Z, Li H, Wang L, Jiang Y, Tai S, Tian Y. Genome-wide SNP identification for the construction of a high-resolution genetic map of Japanese flounder (Paralichthys olivaceus): applications to QTL mapping of Vibrio anguillarum disease resistance and comparative genomic analysis. DNA Res. 2015;22(2):161–70.
Palaiokostas C, Bekaert M, Taggart JB, Gharbi K, McAndrew BJ, Chatain B, Penman DJ, Vandeputte M. A new SNP-based vision of the genetics of sex determination in European sea bass (Dicentrarchus labrax). Genet Sel Evol. 2015;47(1):68.
Palaiokostas C, Bekaert M, Khan MG, Taggart JB, Gharbi K, McAndrew BJ, Penman DJ. Mapping and validation of the major sex-determining region in Nile tilapia (Oreochromis niloticus L.) using RAD sequencing. PLoS ONE. 2013;8(7):e68389.
Palti Y, Vallejo RL, Gao G, Liu S, Hernandez AG, Rexroad III CE, Wiens GD. Detection and validation of QTL affecting bacterial cold water disease resistance in rainbow trout using restriction-site associated DNA sequencing. PLoS ONE. 2015;10(9):e0138435.
Gagnaire PA, Normandeau E, Pavey SA, Bernatchez L. Mapping phenotypic, expression and transmission ratio distortion QTL using RAD markers in the Lake Whitefish (Coregonus clupeaformis). Mol Ecol. 2013;22(11):3036–48.
Jerry DR. Biology and culture of Asian seabass Lates calcarifer. New York: CRC Press; 2013.
Devlin RH, Nagahama Y. Sex determination and sex differentiation in fish: an overview of genetic, physiological, and environmental influences. Aquaculture. 2002;208(3):191–364.
Wang L, Huang SQ, Xia JH, Liu P, Wan ZY, Yue GH. Genome-wide discovery of gene-related SNPs in Barramundi Lates calcarifer. Conserv Genet Resour. 2015;7(3):605–8.
Wang L, Wan ZY, Lim HS, Yue GH. Genetic variability, local selection and demographic history: genomic evidence of evolving towards allopatric speciation in Asian seabass. Mol Ecol. 2016;25(15):3605–21.
Vij S, Kuhl H, Kuznetsova IS, Komissarov A, Yurchenko AA, Van Heusden P, Singh S, Thevasagayam NM, Prakki SRS, Purushothaman K. Chromosomal-level assembly of the Asian seabass genome using long sequence reads and multi-layered scaffolding. PLoS Genet. 2016;12(4):e1005954.
Wang CM, Bai ZY, He XP, Lin G, Xia JH, Sun F, Lo LC, Feng F, Zhu ZY, Yue GH. A high-resolution linkage map for comparative genome analysis and QTL fine mapping in Asian seabass, Lates calcarifer. BMC Genomics. 2011;12(1):174.
Wang L, Wan ZY, Bai B, Huang SQ, Chua E, Lee M, Pang HY, Wen YF, Liu P, Liu F. Construction of a high-density linkage map and fine mapping of QTL for growth in Asian seabass. Scientific Reports. 2015;5:16358.
Liu P, Wang L, Wong S-M, Yue GH. Fine mapping QTL for resistance to VNN disease using a high-density linkage map in Asian seabass. Scientific Reports. 2016;6:32122.
Xia JH, Feng F, Lin G, Wang CM, Yue GH. A first generation BAC-based physical map of the Asian seabass (Lates calcarifer). PLoS ONE. 2010;5(8), e11974.
Wang CM, Lo LC, Zhu ZY, Yue GH. A genome scan for quantitative trait loci affecting growth-related traits in an F1 family of Asian seabass (Lates calcarifer). BMC Genomics. 2006;7(1):274.
Xia JH, Lin G, He X, Liu P, Liu F, Sun F, Tu R, Yue GH. Whole genome scanning and association mapping identified a significant association between growth and a SNP in the IFABP-a gene of the Asian seabass. BMC Genomics. 2013;14(1):295.
Liu P, Wang L, Wan ZY, Ye BQ, Huang S, Wong S-M, Yue GH. Mapping QTL for resistance against viral nervous necrosis disease in Asian seabass. Mar Biotechnol. 2016;18(1):107–16.
Peng W, Xu J, Zhang Y, Feng J, Dong C, Jiang L, Feng J, Chen B, Gong Y, Chen L. An ultra-high density linkage map and QTL mapping for sex and growth-related traits of common carp (Cyprinus carpio). Scientific Reports. 2016;6:26693.
Lander ES, Green P, Abrahamson J, Barlow A, Daly MJ, Lincoln SE, Newburg L. MAPMAKER: an interactive computer package for constructing primary genetic linkage maps of experimental and natural populations. Genomics. 1987;1(2):174–81.
De Donato M, Peters SO, Mitchell SE, Hussain T, Imumorin IG. Genotyping-by-sequencing (GBS): a novel, efficient and cost-effective genotyping method for cattle using next-generation sequencing. PLoS ONE. 2013;8(5), e62137.
Catchen JM, Amores A, Hohenlohe P, Cresko W, Postlethwait JH. Stacks: building and genotyping loci de novo from short-read sequences. G3. 2011;1(3):171–82.
Van Ooijen J. JoinMap® 4, Software for the calculation of genetic linkage maps in experimental populations. Wageningen: Kyazma BV; 2006.
Liu S, Li Y, Qin Z, Geng X, Bao L, Kaltenboeck L, Kucuktas H, Dunham R, Liu Z. High‐density interspecific genetic linkage mapping provides insights into genomic incompatibility between channel catfish and blue catfish. Anim Genet. 2016;47(1):81–90.
Henning F, Lee HJ, Franchini P, Meyer A. Genetic mapping of horizontal stripes in Lake Victoria cichlid fishes: benefits and pitfalls of using RAD markers for dense linkage mapping. Mol Ecol. 2014;23(21):5224–40.
Tine M, Kuhl H, Gagnaire P-A, Louro B, Desmarais E, Martins RS, Hecht J, Knaust F, Belkhir K, Klages S. European sea bass genome and its variation provide insights into adaptation to euryhalinity and speciation. Nat Commun. 2014;5:5770.
Li R, Zhu H, Ruan J, Qian W, Fang X, Shi Z, Li Y, Li S, Shan G, Kristiansen K. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 2010;20(2):265–72.
Taylor JS, Braasch I, Frickey T, Meyer A, Van de Peer Y. Genome duplication, a trait shared by 22,000 species of ray-finned fish. Genome Res. 2003;13(3):382–90.
Fu B, Liu H, Yu X, Tong J. A high-density genetic map and growth related QTL mapping in bighead carp (Hypophthalmichthys nobilis). Scientific Reports. 2016;6:28679.
Ma L, O’Connell JR, VanRaden PM, Shen B, Padhi A, Sun C, Bickhart DM, Cole JB, Null DJ, Liu GE. Cattle sex-specific recombination and genetic control from a large pedigree analysis. PLoS Genet. 2015;11(11):e1005387.
Broman KW, Murray JC, Sheffield VC, White RL, Weber JL. Comprehensive human genetic maps: individual and sex-specific variation in recombination. Am J Hum Genet. 1998;63(3):861–9.
Groenen MA, Wahlberg P, Foglio M, Cheng HH, Megens H-J, Crooijmans RP, Besnier F, Lathrop M, Muir WM, Wong GK-S. A high-density SNP-based linkage map of the chicken genome reveals sequence features correlated with recombination rate. Genome Res. 2009;19(3):510–9.
Ihara N, Takasuga A, Mizoshita K, Takeda H, Sugimoto M, Mizoguchi Y, Hirano T, Itoh T, Watanabe T, Reed KM. A comprehensive genetic map of the cattle genome based on 3802 microsatellites. Genome Res. 2004;14(10a):1987–98.
Miles LG, Isberg SR, Glenn TC, Lance SL, Dalzell P, Thomson PC, Moran C. A genetic linkage map for the saltwater crocodile (Crocodylus porosus). BMC Genomics. 2009;10(1):339.
Singer A, Perlman H, Yan Y, Walker C, Corley-Smith G, Brandhorst B, Postlethwait J. Sex-specific recombination rates in zebrafish (Danio rerio). Genetics. 2002;160(2):649–57.
De Vos S, Bossier P, Van Stappen G, Vercauteren I, Sorgeloos P, Vuylsteke M. A first AFLP-based genetic linkage map for brine shrimp Artemia franciscana and its application in mapping the sex locus. PLoS ONE. 2013;8(3), e57585.
Shifman S, Bell JT, Copley RR, Taylor MS, Williams RW, Mott R, Flint J. A high-resolution single nucleotide polymorphism genetic map of the mouse genome. PLoS Biol. 2006;4(12):e395.
You X, Shu L, Li S, Chen J, Luo J, Lu J, Mu Q, Bai J, Xia Q, Chen Q. Construction of high-density genetic linkage maps for orange-spotted grouper Epinephelus coioides using multiplexed shotgun genotyping. BMC Genet. 2013;14(1):113.
Franch R, Louro B, Tsalavouta M, Chatziplis D, Tsigenopoulos CS, Sarropoulou E, Antonello J, Magoulas A, Mylonas CC, Babbucci M. A genetic linkage map of the hermaphrodite teleost fish Sparus aurata L. Genetics. 2006;174(2):851–61.
Smalley S. Sex-specific recombination frequencies: a consequence of imprinting? Am J Hum Genet. 1993;52(1):210.
Fullerton SM, Carvalho AB, Clark AG. Local rates of recombination are positively correlated with GC content in the human genome. Mol Biol Evol. 2001;18(6):1139–42.
Marsolier-Kergoat M-C, Yeramian E. GC content and recombination: reassessing the causal effects for the Saccharomyces cerevisiae genome. Genetics. 2009;183(1):31–8.
Marais G, Charlesworth B, Wright S. Recombination and base composition: the case of the highly self-fertilizing plant Arabidopsis thaliana. Genome Biol. 2004;5(7):R45.
Pàldi A, Gyapay G, Jami J. Imprinted chromosomal regions of the human genome display sex-specific meiotic recombination frequencies. Curr Biol. 1995;5(9):1030–5.
Jensen-Seaman MI, Furey TS, Payseur BA, Lu Y, Roskin KM, Chen C-F, Thomas MA, Haussler D, Jacob HJ. Comparative recombination rates in the rat, mouse, and human genomes. Genome Res. 2004;14(4):528–38.
Peterson BK, Weber JN, Kay EH, Fisher HS, Hoekstra HE. Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PLoS ONE. 2012;7(5):e37135.
Aljanabi SM, Martinez I. Universal and rapid salt-extraction of high quality genomic DNA for PCR-based techniques. Nucleic Acids Res. 1997;25(22):4692–3.
Catchen J, Hohenlohe PA, Bassham S, Amores A, Cresko WA. Stacks: an analysis tool set for population genomics. Mol Ecol. 2013;22(11):3124–40.
Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25(14):1754–60.
Hohenlohe PA, Bassham S, Etter PD, Stiffler N, Johnson EA, Cresko WA. Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genet. 2010;6(2), e1000862.
Grattapaglia D, Sederoff R. Genetic linkage maps of Eucalyptus grandis and Eucalyptus urophylla using a pseudo-testcross: mapping strategy and RAPD markers. Genetics. 1994;137(4):1121–37.
Wu Y, Close TJ, Lonardi S. Accurate construction of consensus genetic maps via integer linear programming. IEEE/ACM Trans Comput Biol Bioinform. 2011;8(2):381–94.
Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19(9):1639–45.
Ott J. Analysis of human genetic linkage. Baltimore: Johns Hopkins University Press; 1999.
This research is supported by the National Research Foundation, Prime Minister’s Office, Singapore under its Competitive Research Program (CRP Award No. NRF-CRP7-2010-01) and TLL’s Innovation Support Fund.
Availability of data and materials
Raw sequence data for the study has been deposited into the NCBI Sequence Read Archive database (accession no. SRP073060) and DDBJ Sequence Read Archive database (accession no. PRJDB5145 and PRJDB5148).
LW, BB and GHY designed the experiments and analysed the data. LW, PL, SQH, ZYW, EC and BY conducted the lab experiments. LW, BB and GHY drafted the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Summary statistics of 24 linkage groups of sex-specific genetic maps for three families of Asian seabass. (XLSX 21 kb)
Summary statistics of the integrated male and female genetic maps of Asian seabass. (XLSX 11 kb)
Distribution of F: M ratios for each linkage group throughout three families of Asian seabass. (JPG 318 kb)
Distribution of the average recombination rates within each category of GC content for male-specific and female-specific maps across families. (JPG 184 kb)
About this article
Cite this article
Wang, L., Bai, B., Liu, P. et al. Construction of high-resolution recombination maps in Asian seabass. BMC Genomics 18, 63 (2017). https://doi.org/10.1186/s12864-016-3462-z
- Linkage map
- Genome assembly
- Recombination rates
- GC content
- Selective breeding