Skip to main content

Advertisement

Identification of genomic regions regulating sex determination in Atlantic salmon using high density SNP data

Article metrics

Abstract

Background

A complete understanding of the genetic basis for sexual determination and differentiation is necessary in order to implement efficient breeding schemes at early stages of development. Atlantic salmon belongs to the family Salmonidae of fishes and represents a species of great commercial value. Although the species is assumed to be male heterogametic with XY sex determination, the precise genetic basis of sexual development remains unclear. The complexity is likely associated to the relatively recent salmonid specific whole genome duplication that may be responsible for certain genome instability. This instability together with the capacity of the sex-determining gene to move across the genome as reported by previous studies, may explain that sexual development genes are not circumscribed to the same chromosomes in all members of the species. In this study, we have used a 220 K SNP panel developed for Atlantic salmon to identify the chromosomes explaining the highest proportion of the genetic variance for sex as well as candidate regions and genes associated to sexual development in this species.

Results

Results from regional heritability analysis showed that the chromosomes explaining the highest proportion of variance in these populations were Ssa02 (heritability = 0.42, SE = 0.12) and Ssa21 (heritability = 0.26, SE = 0.11). After pruning by linkage disequilibrium, genome-wide association analyses revealed 114 SNPs that were significantly associated with sex, being Ssa02 the chromosome containing a greatest number of regions. Close examination of the candidate regions evidenced important genes related to sex in other species of Class Actinopterygii, including SDY, genes from family SOX, RSPO1, ESR1, U2AF2A, LMO7, GNRH-R, DND and FIGLA.

Conclusions

The combined results from regional heritability analysis and genome-wide association have provided new advances in the knowledge of the genetic regulation of sex determination in Atlantic salmon, supporting that Ssa02 is the candidate chromosome for sex in this species and suggesting an alternative population lineage in Spanish wild populations according to the results from Ssa21.

Background

Sexual development is a multistep process involving sex determination (SD), initiation, gonadal differentiation and maintenance. This process comprises a great diversity of strategies that can be controlled by a variety of genetic and/or environmental mechanisms [1,2,3].

Characterising the genetic basis of SD in fish is fundamental for broodstock management in breeding programmes, which require controlling the entire life cycle of the animal, particularly reproduction. Gaining control over SD enables to develop appropriate breeding schemes in early stages of development when sex external secondary characteristics are not yet differentiated [4, 5]. In this regard, sex control is needed to prevent precocious maturation and reduce the impact of phenotypic sex on product quality; to produce monosex populations when there are differences in growth rate between sexes; to favor the stability of mating systems or to protect wild populations through supportive breeding practices, as having control on the sex ratio if fundamental to maintain the effective population size and therefore to avoid inbreeding depression [6].

The family Salmonidae (Salmonids) comprises fish species with a great economic and societal importance. Within them, Atlantic salmon (Salmo salar) represents one of the most important farmed fish species in the world, with a global annual production achieving 2.5 million tonnes that entails an economic value of $15.4 billion (USD) [7].

Based on available evidence, it is assumed that salmonids are male heterogametic and that sex determination is genetically controlled by the master-sex SDY gene (sexually dimorphic on the Y-chromosome), a gene from the interferon-response factor transcription family [8, 9], which is involved in the immune response of vertebrates. In many non-salmonid species, master-sex determining genes are located on differentiated sex chromosomes that have undergone reduced recombination around the areas of the determining gene, leading to heteromorphic sex chromosomes [10, 11]. However, in Atlantic salmon, as in other salmonid species, sex chromosomes are not morphologically distinguishable [12]. Despite the fact that SDY is associated to maleness in most salmonids [9], its location is not syntenically conserved among species. Although previous literature supports that in Atlantic salmon, SDY maps to chromosome Ssa02 [13, 14], in some individuals of the species it has been found mapping to different chromosomes [15,16,17,18]. These findings have been associated to a transposition ability of SDY between chromosomes [14, 19], but the underlying mechanisms for this mobility are still unclear. A recent study by Kijas et al. [17] found evidence of one single ancestral location for Atlantic salmon SDY, thus discarding the hypothesis of multiple genomic locations predating Atlantic salmon speciation. Other studies have suggested that salmonids’ genome is at an early stage of sex chromosome evolution, given its residually tetraploid state resulting from the salmonids extra whole genome duplication (WGD) [12, 14]. The phenomenon, residual tetrasomy [20, 21], explains how some telomeric regions continue recombining between homeologous chromosomes while others have rediploidized [20,21,22], which may facilitate the transposition of genes across the genome and delaying Y degeneration [9, 16, 18]. This delay in Y degeneration has proposed to be mediated by sex reversal events in fish [23], as a consequence of the formidable plasticity in SD mechanisms in this group. In this way, sex reversal might play an important role in the evolution of SD, facilitating the purge of deleterious mutations on the heterogametic sex chromosome through recombination. This has been proposed as a “fountain of youth” [24] that may explain the high incidence of homomorphic sex chromosomes in fish and amphibians.

The advent of next-generation sequencing technologies has facilitated the development of a high quality reference genome [25] and multiple high-density single nucleotide polymorphism (SNP) arrays [26,27,28] for Atlantic salmon. The advance of molecular tools has also entailed the development of new statistical approaches that open new opportunities for the investigation of complex traits in this species. In this sense, the regional heritability analysis (RHA) approach, recently proposed by Nagamine et al. [29] represents an appropriate methodology to obtain a first approximation of the role of the different chromosomes of Atlantic salmon in sex determination.

In this study, we have used a 220 K high-density SNP panel [27] to investigate the genetic regulation of sex determination in wild Spanish populations of Atlantic salmon, which inhabit the South distribution limit of the species in Europe. The combination of RHA and genome-wide association studies (GWAS) allowed us to determine which chromosomes explain the highest proportion of the genetic variance for sex as well as identify candidate regions and genes associated to sexual development in this species.

Results

Regional heritability analysis

Genomic heritability for sex in the group of individuals analysed was significant and high (h2 = 0.70, SE = 0.26). Estimates of chromosomal heritability ranged from 0.00 to 0.42 and are summarised in Table 1. Only estimates from chromosomes Ssa02 (h2 = 0.42, SE = 0.12) and Ssa21 (h2 = 0.26, SE = 0.11) were significant at the chromosomal (suggestive) level, as revealed by Likelihood Ratio Tests (LRT) (Table 1) and 95% confidence intervals (Fig. 1). After applying the strict Bonferroni correction for multiple test (5% level), the estimate of heritability for Ssa02 was still significant. This chromosome explained 60% of the total additive genetic variance for sex in these populations.

Table 1 Whole-genome (genomic) and chromosomal heritability estimates for sex
Fig. 1
figure1

Heritability estimates for each chromosome. Estimates of heritability (h2) for sex and corresponding confidence intervals at 95% level (in red those significantly different from zero)

Genome-wide association analysis

Results derived from GWAS after pruning the SNPs for linkage disequilibrium (LD) and after applying false discovery rate (FDR) multitest correction at 5% level (Log10 (P) > 3.8), revealed 114 SNPs significantly associated with sex (Fig. 2, Table 2). Ssa02 was the chromosome where the highest number of significant SNPs were mapped (15%). Allele frequencies for the significant SNPs were in general intermediate, as expected given the balanced numbers of males and females in the data set analysed, with SNP effects ranging between ―0.29 and 0.33 (Additional file 1A). The distribution of allele frequencies for significant SNPs revealed a generalized differential trend for males and females (Additional file 1B). Further information about SNP location on chromosomes and corresponding regions is available in Additional file 1.

Fig. 2
figure2

Manhattan plot resulting from the GWAS for sex at 5% false discovery rate (FDR) multitest correction threshold. Log transformed p-values are represented in the Y-axis, against the position of the SNP in the chromosome (X-axis). Red discontinuous line represents FDR multitest correction threshold at 5% level

Table 2 Information on the number of significant SNPs identified in the GWAS for each chromosome

Functional analysis

We first used the information available in the SalmoBase database for Atlantic salmon to explore the gene content of the 17 candidate regions identified on Ssa02 (10 regions, including five overlapping regions) and Ssa21 (four regions), the chromosomes showing significant (and highest) estimates of heritability. SalmoBase contains genome annotation information obtained from RNAseq data, including 48,775 protein coding genes identified to date [25]. Our search revealed 543 and 62 genes contained within candidate regions for chromosomes Ssa02 and Ssa21, respectively (Table 3, Additional file 2). However, the still early stages of annotation of the Atlantic salmon genome, hampered gene enrichment analysis for identifying metabolic routes associated to sex determination.

Table 3 Information of the candidate regions identified in the GWA analysis in chromosomes Ssa02 and Ssa21 and corresponding number of genes in Salmo salar annotation

For this reason, an alternative strategy was performed, consisting on BLAST search of sex related genes previously identified in Atlantic salmon [30] and in other species of the Class Actinopterygii. From the 74 genes initially explored, 56 genes from 12 species aligned against the Salmo salar genome with a query cover > 50% (Additional file 3). Those with lower query cover were genes from non-salmonid species. Imperfect query cover was also allowed for sex-related genes from Atlantic salmon, in order to detect mapping of these genes to alternative regions due to replication or recombination in our population. Thirteen of these 74 genes were located within or close to candidate regions previously identified in the GWAS (Table 4), including important genes related to sex in other species, such as: SDY, genes from the SOX family (SOX1, SOX1b, SOX8, SOX9, SOX21, SOX21a), RSPO1, ESR1, U2AF2A, LMO7, GNRH-R, DND and FIGLA (Table 5).

Table 4 Summary of sex related genes from other species of Class Actynopterigii located in candidate regions in the Salmo salar genome
Table 5 Sex-related genes previously described in Actinopterygii that matched with GWAS candidate regions identified in this study

Discussion

In this study, we used two complementary approaches to improve our understanding of the genetic mechanisms responsible for sex determination in Atlantic salmon. The information obtained from a high-density SNP array was used to perform a RHA that allowed to obtain, for the first time in this species, chromosomal estimates of heritability for sex. Furthermore, this information was used to identify putative regions responsible for sex determination using GWAS. By combining both approaches, our results support that Ssa02 is the predominant sex-determining chromosome in native Spanish populations, in agreement with much of the existing literature in other Atlantic salmon populations.

Previous studies seeking to identify sex related QTLs in Atlantic salmon, based their work on linkage map regression methods using a lower number of markers and cytogenetic techniques [13, 44, 45]. These studies suggested that Ssa02 contained the SEX locus, a finding that was later confirmed by the discovery of the SDY master-sex determining gene in rainbow trout [8, 9]. Despite today is generally accepted that SDY is responsible of maleness in most salmonids, some exceptions have been described. Hence, analysing families of a commercial Tasmanian Atlantic salmon population of North American origin, Eisbrenner et al. [15] mapped the SEX loci on three different chromosomal locations (Ssa02, Ssa03 and Ssa06). A recent study in the same population by Kijas et al. [17] confirmed, through whole-genome sequencing, that Atlantic salmon males carry a single copy of the male-sex determining region containing SDY. Although it was identified on chromosome Ssa02 in most of individuals, some animals presented this region on Ssa03 and Ssa06, supporting the results by Eisbrenner et al. [15]. Also through genome-wide association, the authors identified candidate regions for four additional chromosomes.

Although our GWAS pattern was more sparse than that from Kijas et al. [17], we identified the same candidate regions in five chromosomes, including Ssa02 (regions 13–17 in Additional file 1), Ssa03 (22–23), Ssa05 (30–32), Ssa06 (37–39) and Ssa12 (53). The different association patterns observed in both studies can however be explained by different technical and biological reasons. First, the 220 K SNP used in our study was designed with samples from Norwegian origin, thus SNPs segregating in both populations are expected to be different. Although the SNP content from the 50 K SNP chip used by Kijas et al. [17] largely derived from the 220 K SNP chip, it was designed for identifying segregating polymorphic loci in the Tasmanian population, thus maximising the amount of information. Second, we pruned our data set to avoid overweighting the contribution of groups of correlated SNPs due to linkage disequilibrium that might contribute to lower accuracy [46]. For that, we imposed a strong filter based on half the maximum value for r2 observed in this population (Additional file 4), which can affect the pattern observed. Indeed, linkage disequilibrium patterns might be considerably different in both populations, since they have different geographic origin (South European vs North American) and management (wild vs farmed), and therefore are subjected to completely different selection pressures. Our wild populations inhabit the South distribution limit of the species in Europe, where strong local adaptations are expected. Conversely, the samples from Kijas et al. [17] come from a Tasmanian breeding programme of Nova Scotia origin, that has been maintained isolated for generations, where the impact of artificial selection and genetic drift is expected to have affected the distribution of the genetic variability across the genome in a different manner than in wild populations. In addition, due to its strong homing behaviour, the Atlantic salmon is naturally substructured into genetically differentiated and reproductively isolated populations [47]. Previous studies investigating the population structure of this species have found evidence of substantial genetic differentiation between North American and European populations (with 22% of the variation attributable to continents), with a clear pattern of isolation by distance [48]. Within continents, salmon European populations are more divergent than North American ones and are differentiated in Eastern Atlantic and Baltic clusters [49,50,51,52]. Another factor that can affect the association pattern, also related with the origin of populations, is the karyotypic number. While in European populations the karyotype typically consists on 29 (diploid) chromosomes, in North America populations this is usually 28 [53]. Here, Spanish samples presented 29 chromosomes, whereas Tasmanian samples presented 27. Different population origin and karyotype may also explain the signal found in Ssa21 in the RHA. According to this result, Ssa21 may play a noticeably role in sex determination in Spanish wild populations, which is compatible with the existence of an alternative population lineage in these populations. Although incorrect anchoring of markers to chromosomes is not disposable, if that was the case, this signal should also have appeared in the work by Kijas et al. [17]. In addition, residual tetraploidy resulting from the salmonid specific WGD may have influenced the evolution of their genomes leading to genomic instability [18]. An unstable state involves massive reorganizations of the chromosomes, including inversions, duplications and deletions, and may result in different sex chromosomes in different lineages [13, 14, 16, 18, 54].

Our findings also revealed sex-related genes in our candidate regions that had been previously identified in other species of the class Actinopterygii, including important maleness related genes such as SDY and genes from the SOX family. The sexually dimorphic on the Y chromosome is the master sex-determining gene in rainbow trout (Oncorhynchus mykiss) [8], and a male-specific Y-chromosome gene in the majority of salmonids [9]. Previous studies have determined that the expression of SDY is specific to males, being restricted to epithelial cells of the dorsal side of the testis and to some somatic cells adjacent to the germ cells [9, 16]. Indeed, overexpression of SDY in transgenic females of rainbow trout induces testicular differentiation [9]. Although the role of SDY in salmonids sex determination is still unclear, some authors have proposed that it is involved in the up-regulation of inhibitors of CYP19A (cytochrome P450 aromatase), preventing the accumulation of estrogens required for female development [30], in a similar way than SRY (sex-determining region Y) acts in mammals. Although it is not expected that the reference genome assembly carries the male-sex determining region of the SDY gene, since it is female-derived (assembly ICSASG_v2, [25]) the signals identified could be part of the male-female common region, described in Kijas et al. [17] or correspond to partial duplications, given the highly repetitive elements present in the Atlantic salmon genome. Using a comparative genomic approach, Voldoire et al. [31] demonstrated that the expansion of the SOX family after the teleost-specific WGD resulted in a high retention rate of paralogs, which followed lineage-specific evolutionary trajectories in teleost genomes. This is in agreement with the finding of several SOX genes in our regions.

In contrast, R-spondin genes present a conserved profile of enhanced expression in female vertebrates, and they are involved in ovarian determination and differentiation through the WNT/β-catenin signaling pathway [33]. This pathway is inhibited by the emerin gene, regulated by the LMO7 gene [35, 36]. A recent study in medaka [34] demonstrated increased expression of genes of the RSPO family in the female gonad, suggesting a similar role in fish. Expression analysis in Nile tilapia, revealed that estrogen receptors mediate the development of undifferentiated XX gonads thorough estradiol activation [37] and in zebrafish, mutants with ESR1 disrupted exhibited all-male phenotypes [38]. Also involved in the female reproductive process, GNRH-R activates the synthesis and secretion of gonadotropins in vertebrates through neuroendocrine control [40]. A previous study in seabream suggested that this gene has a role in meiosis-stimulating factor in the oocyte [41]. Finally, FIGLA is highly expressed in primary oocytes and has been localized to the ooplasm in medaka fish and coho salmon [43].

Today, it is generally accepted that most salmonids share the same master sex-determining gene, which has been moved through the action of transposable elements into different ancestral autosomes during the evolution of salmonids, resulting in alterative Y sex chromosomes [9].

Indeed, the first step in the evolution of a Y chromosome is the acquisition of a sex-determining locus on one of proto-sex chromosomes and, after that, the suppression of recombination between X and Y may favor the consolidation of the position of this gene. The role of recombination suppression between the X and Y chromosomes to resolve sexual conflict has been recently addressed by Wright et al. [55]. The authors analysed whole genome and transcriptome data in the guppy, a model for sexual selection with many Y-linked colour traits. Their results were consistent with a step-wise pattern of sex chromosome formation, suggesting that different regions of chromosome divergence can form independently within species. However, it is still debated why the sex chromosomes stopped recombining and how this process spread out over most part of the chromosomes. A recent in silico study by Mackiewicz et al. [56] revealed an association between the suppression of recombination and chromosome Y degeneration with the reproductive tactic, suggesting an enhancement of this effect in polygamous populations.

High-density SNP arrays are powerful tools to dissect QTLs and can highlight functional mechanisms underlying traits. However, appropriate analysis of dense marker information requires experimental designs with large sample sizes. In this sense, an important concern of our study was the reduced sample size available. Notwithstanding, our design allowed us to obtain significant estimates of whole-genome and chromosomal heritability, that remained significant even after applying strict Bonferroni correction. Thus, although our GWAS results may be interpreted with some caution, and validation of candidate SNPs in other genetic backgrounds is needed, the combination of both approaches (i.e. RHA and GWAS) provide valuable information for understanding the genetic basis of sex determination in Atlantic salmon.

Conclusions

In summary, our results are compatible with previous studies that suggest a SD mechanism operating in Atlantic salmon where the SDY gene is the master sex determining gene, being Ssa02 the chromosome candidate for sex in this species. Interestingly, our findings regarding Ssa21 point towards an alternative population lineage in Spanish wild salmon, which inhabits the South distribution limit of the species in Europe. This study contributes to improve our understanding of an important trait in Atlantic salmon such as sex determination that has important implications both in terms of ecology and aquaculture production, and highlights the utility of the development and application of genomic tools in fish.

Methods

Samples and genotyping

A total of 203 sexually mature wild adult salmon from recreational fishery or recorded in trapping facilities (94 males and 109 females) were sampled between 2008 and 2013 from six Spanish rivers (Miño, Ulla, Eo, Sella, Urumea, Bidasoa) covering the distribution range of the species in Spain (from 41° 51′ 55.08″ N, 8° 52′ 10.99″ W to 43° 22′ 22″ N, 1° 47′ 31″ W). Since 1950, regulations have required that scale samples are collected from all salmon caught in the recreational fishery to determine fish age and growth and tissue samples (adipose fins) for DNA profiling analysis. In addition, routine sampling in trapping facilities allows for continuous monitoring and sampling of all sea returns and involves recording the length, weight and sex and taking scale samples for aging and a small portion of adipose fin with anesthesia for DNA profiling analysis.

Genomic DNA was purified from ethanol preserved adipose fins using an NZY Tissue gDNA Isolation kit (NZYtech), and quantity and purity assessed with a Nanodrop-1000 spectrophotometer. DNA samples were adjusted to a final concentration of 100 ng/μL and frozen until use. Morphological sex was confirmed by the successful amplification of the SDY intron gene (~ 200 bp) in all males and absence in all females using the primers SDY E1S1 and SDY E2AS4 [9, 57]. Samples were genotyped using an Affymetrix 220 K SNP array (ThermoScientific) for Atlantic salmon [27] according to manufactures recommendations. Genotypes from samples showing a dish quality control (DQC) < 0.82 or call rate < 0.97 were discarded. Only those data from SNPs classified as Poly High Resolution, with a call rate > 0.97 were used in our analysis. Unmapped SNPs and those with a minor allele frequency (MAF) < 0.01 were also removed. After applying these filters, data from 164,883 SNPs and 199 individuals (92 males and 107 females) remained available for analysis.

Estimation of heritability

Genomic heritability analysis

Genomic heritability was estimated assuming a linear model of the form:

$$ \mathbf{y}=\boldsymbol{\upmu} +\mathbf{Zu}+\mathbf{e} $$
(1)

where y is the vector of phenotypic records (male, female), μ is the population mean of the trait u and e are vectors of random animal genetic and residual effects respectively, and Z is a design matrix allocating phenotypes to animals. Animal genetic effects were assumed to be distributed as N (0, G \( {\sigma}_u^2 \)) where G is the genomic relationship matrix (GRM) for all fish and \( {\sigma}_u^2 \) is the additive genetic variance.

Regional heritability analysis

Regional heritability analysis was implemented to assess the contribution of different regions (in this case chromosomes) to the total additive genetic variance following Nagamine et al. [29]. For that, the random additive genetic effects were divided in two components: regional (chromosomal) genomic and residual whole-genomic additive genetic effects. For estimating the whole-genome component, all SNPs were used to build the GRM. For estimating the chromosomal component, 29 GRMs were built for each independent chromosome. The same whole-genome GRM was used for all analyses as in Nagamine et al. [29]. To test for significant chromosomal variance, likelihood ratio tests were performed by comparing the full model (including chromosomal and whole-genome additive genetic effects), with the reduced model (including only the whole-genome additive genetic effect, as in Eq. (1)). The full model was then of the form:

$$ \mathbf{y}=\boldsymbol{\upmu} +\mathbf{Zu}+\mathbf{Zc}+\mathbf{e} $$

where y is the vector of phenotypic records (male, female), μ is the vector of the population mean of the trait, u and c are vectors of whole-genome and chromosomal additive genetic effects, respectively, e is the vector of random residual effects, and Z is a design matrix allocating phenotypes to animals. Again, animal genetic effects were assumed to be distributed as N (0, G \( {\sigma}_u^2 \)) where G is the genomic relationship matrix for all fish and \( {\sigma}_u^2 \) (\( {\sigma}_c^2 \)) is the additive genetic variance (computed from the whole-genome or from each chromosome, respectively).

Genome-wide association studies

In order to avoid over-weighting the contribution of groups of correlated SNPs in the GWAS, we applied an additional filter based on linkage disequilibrium pruning. For that, we used a threshold value of 0.26, as this is the value (measured as the squared correlation coefficient, r2) at which linkage disequilibrium decreased half in the populations analysed (Additional file 4). Linkage disequilibrium pruning was performed using the software PLINK v1.9 [58] with the instruction --indep-pairwise using 0.26 as threshold, resulting in 36,625 SNPs.

GWAS was performed by regressing the standardised SNP genotypes on the trait using the following mixed model:

$$ \mathbf{y}=\boldsymbol{\upmu} +\mathbf{Xb}+\mathbf{Zu}+\mathbf{e} $$

which is basically the same model than in Eq. (1) but including a vector of fixed effects b that contains the substitution effect of the SNP, and where X is the design matrix for the fixed effect. The inclusion of the GRM in the model prevented potential bias for population structure [59]. A false discovery rate (FDR) multitest correction threshold at 5% level was employed to identify significant associations with the software Myriads [60].

Genetic values for RHA and GWAS were in all cases estimated using a relationship matrix as:

$$ {a}_{ij}=\frac{1}{M}\sum \limits_{k=1}^M{z}_{ik}{z}_{jk}, $$

where aij is the genetic relationship between individuals i and j, M is the number of markers and zij is the standardised genotype of individual i at marker k, defined as zij = (sij – μj)/ σj, with sij being the number of reference alleles at locus j of individual i, μj = 2pj and \( {\sigma}_j=\sqrt{2{p}_j\ \left(1-{p}_j\right)} \) . μj and σj are the mean and the standard deviation of the reference allele at marker j among the individuals genotyped, defined as a function of the frequency of the reference allele (pj).

All models were solved assuming a dicotomic trait distribution (male / female) using restricted maximum likelihood (REML) with the software Dissect [61].

Identification of candidate sex-related genes

The gene content within the ±0.5 Mb flanking regions from significant SNPs identified in the GWAS was interrogated to identify and characterise potential causative genes and variants using two approaches. We chose this distance because linkage disequilibrium at 0.5 Mb was still high in these populations (almost half its maximum value, Additional file 4). In a first step, we identified the genes in the Atlantic salmon genome annotation [25] that were located within candidate regions identified in this study. Secondly, we explored whether sex-associated genes previously described in Salmo salar [30] and other species of the Class Actinopterygii were located within (< 0.5 Mb from the candidate SNP) or in the vicinity (< 2.5 Mb, according to the extension of linkage disequilibrium, Additional file 4) of our candidate regions. The identification of sex-related genes in Actinopterygii was performed using the advanced search tool in NCBI database. Specific sex-determining pathway genes in Atlantic salmon were explored in the basis of the expression study by Lubieniecki et al. [30]. For the second approach, a total of 1050 genes were identified in 70 species. We restricted the search to 74 sex-related genes after removing duplicated gene names and genes in non-nuclear locations. In order to map the location of these 74 genes in the Atlantic salmon reference genome (assembly ICSASG_v2, [25]) we used the NCBI BLAST tool (blastn). Default search options were changed to discontiguous megablast, match/mismatch scores of 1,―1 and minimum gap cost (0 existence and 2 extension). Gene locations were inspected to overlap with candidate regions previously identified in the GWAS. Results were ordered by query cover and only alignments with a query cover > 50% or in chromosomes showing significant heritability for sex detected previously in RHA were considered.

Availability of data and materials

The datasets analysed during the current study can be found in Additional file 5 and Additional file 6 of Supplementary Material.

Abbreviations

BLAST:

basic local alignment search tool

CYP19A:

cytochrome P450 aromatase

DND:

dead-end

DQC:

dish quality control

ESR1:

estrogen receptor 1

FDR:

false discovery rate

FIGLA:

folliculogenesis specific basic helix-loop-helix

GNRH-R:

gonadotropin-releasing hormone receptor

GRM:

genomic relationship matrix

GWAS:

genome-wide association studies

LMO7:

LIM domain only 7

MAF:

minor allele frequency

QTL:

quantitative trait loci

REML:

restricted maximum likelihood

RHA:

regional heritability analysis

RSPO1:

R-spondin 1

SD:

sex determination

SDY:

sexually dimorphic on the Y-chromosome

SNP:

single nucleotide polymorphism

SOX:

SRY-type high mobility group box

SRY:

sex-determining region Y

U2AF2A:

U2 small nuclear RNA auxiliary factor 2a

WGD:

whole genome duplication

WNT:

wingless-related MMTV integration site 4

References

  1. 1.

    Devlin RH, Nagahama Y. Sex determination and sex differentiation in fish: an overview of genetic, physiological, and environmental influences. Aquaculture. 2002;208:191–364.

  2. 2.

    Bachtrog D, Mank JE, Peichel CL, Kirkpatrick M, Otto SP, Ashman T-L, et al. Sex determination: why so many ways of doing it? PLoS Biol. 2014;12:e1001899.

  3. 3.

    Herpin A, Schartl M. Plasticity of gene-regulatory networks controlling sex determination: of masters, slaves, usual suspects, newcomers, and usurpators. EMBO Rep. 2015;16:260–1274.

  4. 4.

    Cnaani A, Levavi-Sivan B. Sexual development in fish, practical applications for aquaculture. Sex Dev. 2009;3:164–75.

  5. 5.

    Martínez P, Viñas AM, Sánchez L, Díaz N, Ribas L, Piferrer F. Genetic architecture of sex determination in fish: applications to sex ratio control in aquaculture. Front Genet. 2014;5:340.

  6. 6.

    Budd AM, Bahn QQ, Domingos JA, Jerry DR. Sex control in fish: approaches, challenges and opportunities for aquaculture. J Mar Sci Eng. 2015;3:329–55.

  7. 7.

    International Salmon Farmers Association (ISFA). Salmon farming sustaining communities and feeding the world, 2018. http://www.salmonfarming.org/cms/wp-content/uploads/2018/06/ISFA-Report-2018-web.pdf

  8. 8.

    Yano A, Guyomard R, Nicol B, Jouanno E, Quillet E, Klopp C, et al. An immune-related gene evolved into the master sex-determining gene in rainbow trout, Oncorhynchus mykiss. Curr Biol. 2012;22:1423–8.

  9. 9.

    Yano A, Nicol B, Jouanno E, Quillet E, Fostier A, Guyomard R, et al. The sexually dimorphic on the Y-chromosome gene (sdY) is a conserved male-specific Y-chromosome sequence in many salmonids. Evol Appl. 2013;6:486–96.

  10. 10.

    Graves JAM. Sex chromosome specialization and degeneration in mammals. Cell. 2006;124:901–14.

  11. 11.

    Natri HM, Shikano T, Merilä J. Progressive recombination suppression and differentiation in recently evolved neo-sex chromosomes. Mol Biol Evol. 2013;30:1131–44.

  12. 12.

    Phillips RB. Evolution of the sex chromosomes in salmonid fishes. Cytogenet Genome Res. 2013;141:177–85.

  13. 13.

    Woram RA, Gharbi K, Sakamoto T, Hoyheim B, Holm L-E, Naish K, et al. Comparative genome analysis of the primary sex-determining locus in salmonid fishes. Genome Res. 2003;13:272–80.

  14. 14.

    Davidson WS, Huang T-K, Fujiki K, von Schalburg KR, Koop BF. The sex determining loci and sex chromosomes in the family Salmonidae. Sex Dev. 2009;3:78–87.

  15. 15.

    Eisbrenner WD, Botwright N, Cook M, Davidson EA, Dominik S, Elliott NG, et al. Evidence for multiple sex-determining loci in Tasmanian Atlantic salmon (Salmo salar). Heredity (Edinb). 2014;113:86–92.

  16. 16.

    Lubieniecki KP, Lins S, Cabana EI, Li J, Lay YY, Davidson WS. Genomic instability of the sex-determining locus in Atlantic salmon (Salmo salar). G3. 2015;22:2513–22.

  17. 17.

    Kijas J, McWilliam S, Naval-Sánchez M, Kube P, King H, Evans B, et al. Evolution of sex determination loci in Atlantic salmon. Sci Rep. 2018;8:5664.

  18. 18.

    Sutherland BJG, Rico C, Audet C, Bernatchez L. Sex chromosome evolution, heterochiasmy and physiological QTL in the salmonid brook charr Salvelinus fontinalis. G3 genes, genomes, Genet. 2017;7:2749–62.

  19. 19.

    Faber-Hammond JJ, Phillips RB, Brown KH. Comparative a of the shared sex-determination region (SDR) among salmonid fishes. Genome Biol Evol. 2015;7:1972–87.

  20. 20.

    Allendorf FW, Bassham S, Cresko WA, Limborg MT, Seeb LW, Seeb JE. Effects of crossovers between homeologs on inheritance and population genomics in olyploid-derived salmonid fishes. J Hered. 2015;106:217–27.

  21. 21.

    May B, Delany ME. Meiotic models to explain classical linkage, pseudolinkage, and chromosomal pairing in tetraploid derivative salmonid genomes: II. Wright is still right. J Hered. 2015;106:762–6.

  22. 22.

    Allendorf FW, Thorgaard GH. Tetraploidy and the evolution of salmonid fishes. In: Evolutionary genetics of fishes. Boston, MA: Springer US; 1984.

  23. 23.

    Baroiller J-F, D’Cotta H. The reversible sex of gonochoristic fish. Sex Dev. 2016:242–6266.

  24. 24.

    Perrin N. Sex reversal: a fountain of youth for sex chromosomes? Evolution. 2009;63–12:3043–9.

  25. 25.

    Lien S, Koop BF, Sandve SR, Miller JR, Kent MP, Nome T, et al. The Atlantic salmon genome provides insights into rediploidization. Nature. 2016;533:200–5.

  26. 26.

    Houston RD, Taggart JB, Cézard T, Bekaert M, Lowe NR, Downing A, et al. Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar). BMC Genomics. 2014;15:90.

  27. 27.

    Barson NJ, Aykanat T, Hindar K, Baranski M, Bolstad GH, Fiske P, et al. Sex-dependent dominance at a single locus maintains variation in age at maturity in salmon. Nature. 2015;528:405–8.

  28. 28.

    Yáñez JM, Naswa S, López ME, Bassini L, Correa K, Gilbey J, et al. Genomewide single nucleotide polymorphism discovery in Atlantic salmon (Salmo salar): validation in wild and farmed American and European populations. Mol Ecol Resour. 2016;16:1002–11.

  29. 29.

    Nagamine Y, Pong-Wong R, Navarro P, Vitart V, Hayward C, Rudan I, et al. Localising loci underlying complex trait variation using regional genomic relationship mapping. PLoS One. 2012;7:e46501.

  30. 30.

    Lubieniecki KP, Botwright NA, Taylor RS, Evans BS, Cook MT, Davidson WS. Expression analysis of sex-determining pathway genes during development in male and female Atlantic salmon (Salmo salar). Physiol Genomics. 2015;47.

  31. 31.

    Voldoire E, Brunet F, Naville M, Volff J-N, Galiana D. Expansion by whole genome duplication and evolution of the sox gene family in teleost fish. PLoS One. 2017;12:010936.

  32. 32.

    Jiang T, Hou CC, She Z-Y, Yang W-X. The SOX gene family: function and regulation in testis determination and male fertility maintenance. Mol Biol Rep. 2013;40:2187–94.

  33. 33.

    Kim KA, Wagle M, Tran K, Zhan X, Dixon MA, Liu S, et al. R-Spondin family members regulate the Wnt pathway by a common mechanism. Mol Biol Cell. 2008;19:2588–96.

  34. 34.

    Zhou L, Charkraborty T, Yu X, Wu L, Liu G, Mohapatra S, et al. R-spondins are involved in the ovarian differentiation in a teleost, medaka (Oryzias latipes). BMC Dev Biol. 2012;12:36.

  35. 35.

    Holaska JM, Rais-Bahrami S, Wilson KL. Lmo7 is an emerin-binding protein that regulates the transcription of emerin and many other muscle-relevant genes. Hum Mol Genet. 2006;15:3459–72.

  36. 36.

    Bouma GJ, Hudson QJ, Washburn LL, Eicher EM. New candidate genes identified for controlling mouse gonadal sex determination and the early stages of granulosa and Sertoli cell differentiation. Biol Reprod. 2010;82:380–9.

  37. 37.

    Ijiri S, Kaneko H, Kobayashi T, Wang D-S, Sakai F, Paul-Prasant B. Sexual dimorphic expression of genes in gonads during early differentiation of a teleost fish, the Nile tilapia Oreochromis niloticus. Biol Reprod. 2008;78:333–41.

  38. 38.

    Lu H, Cui Y, Jiang L, Ge W. Functional analysis of nuclear estrogen receptors in zebrafish reproduction by genome editing approach. Endocrinology. 2017;158:2292–308.

  39. 39.

    Ito T, Muto Y, Green MR, Yokoyama S. Solution structures of the first and second RNA-binding domains of human U2 small nuclear ribonucleoprotein particle auxiliary factor (U2AF65). EMBO J. 1999;18:4523–34.

  40. 40.

    Madigou T, Mañanos-Sánchez E, Husholf S, Anglade I, Zanuy S, Kah O. Cloning, tissue distribution, and central expression of the gonadotropin-releasing hormone receptor in the rainbow trout (Oncorhynchus mykiss). Biol Reprod. 2000;63:1857–66.

  41. 41.

    Nabissi M, Pati D, Polzonetti-Magni AM, Habibi HR. Presence and activity of compounds with GnRH-like activity in the ovary of seabream Sparus aurata. Am J Phys. 1997;272:R111–7.

  42. 42.

    Nagasawa K, Fernandes JMO, Yoshizaki G, Miwa M, Babiak I. Identification and migration of primordial germ cells in Atlantic Salmon, Salmo salar: characterization of vasa, dead end, and lymphocyte antigen 75 genes. Mol Reprod Dev. 2013;80:118–31.

  43. 43.

    Luckenbach JA, Iliev DB, Goetz FW, Swanson P. Identification of differentially expressed ovarian genes during primary and early secondary oocyte growth in coho salmon, Oncorhynchus kisutch. Reprod Biol Endocrinol. 2008;6:2.

  44. 44.

    Artieri CG, Mitchell LA, Ng SHS, Parisotto SE, Danzmann RG, Hoyheim B, et al. Identification of the sex-determining locus of Atlantic salmon (Salmo salar) on chromosome 2. Cytogenet Genome Res. 2006;112:152–9.

  45. 45.

    Lien S, Gidskehaug L, Moen T, Hayes BJ, Berg PR, Davidson WS, et al. A dense SNP-based linkage map for Atlantic salmon (Salmo salar) reveals extended chromosome homeologies and striking differences in sex-specific recombination patterns. BMC Genomics. 2011;12:615.

  46. 46.

    Bush WS, Moore JH. Genome-wide association studies. PLoS Comput Biol. 2012;8:e1002822.

  47. 47.

    Stahl G. Genetic population structure of Atlantic salmon. In: Population Genetics and Fishery Management. Ryman N. and Utter F. (eds). Washington Sea Grant Program/University of Washington Press, Seattle and London. 1987. pp. 121–140.

  48. 48.

    King TL, Kalinowski ST, Schill WB, Spidle AP, Lubinski BA. Population structure of Atlantic salmon (Salmo salar L.): a range-wide perspective from microsatellite DNA variation. Mol Ecol. 2001;10:807–21.

  49. 49.

    Bourke EA, Coughlan J, Jansson H, Galvin P, Cross TF. Allozyme variation in populations of Atlantic salmon located throughout Europe: diversity that could be compromised by introductions of reared fish. ICES J Mar Sci. 1997;54:974–85.

  50. 50.

    Verspoor E, McCarthy EM, Knox D, Bourke EA, Cross TF. The phylogeography of European Atlantic salmon (Salmo salar L.) based on RFLP analysis of the ND1/16sRNA region of the mtDNA. Biol J Linn Soc. 1999;68:129–46.

  51. 51.

    Nilsson J, Gross R, Asplind T, Dove O, Jansson H, Kelloniemi J, et al. Matrilinear phylogeography of Atlantic salmon (Salmo salar L) in Europe and postglacial colonization of the Baltic Sea area. Mol Ecol. 2001;10:89–102.

  52. 52.

    Consuegra S, García de Leániz C, Serdio A, González Morales M, Straus LG, Knox D, Verspoor E. Mitochondrial variation in Pleistocene and modern Atlantic salmon from the Iberian glacial refugium. Mol Ecol. 2002;11:2037–48.

  53. 53.

    Brenna-Hansen S, Li J, Kent MP, Boulding EG, Dominik S, Davidson W, Lien S. Chromosomal differences between European and north American Atlantic salmon discovered by linkage mapping and supported by fluorescence in situ hybridization. BMC Genomics. 2012:13–432.

  54. 54.

    Cavileer TD, Hunter SS, Olsen J, Wenburg J, Nagler JJ. A sex-determining gene (sdY) assay shows discordance between phenotypic and genotypic sex in wild populations of Chinook salmon. Trans Am Fish Soc. 2015;144:423–30.

  55. 55.

    Wright AE, Darolti I, Bloch NI, Oostra V, Sandkam B, Buechel SD, et al. Convergent recombination suppression suggests role of sexual selection in guppy sex chromosome formation. Nat Commun. 2016;8:14251.

  56. 56.

    Mackiewicz D, Posacki P, Burdukiewicz M, Błażej P. Role of recombination and faithfulness to partner in sex chromosome degeneration. Sci Rep. 2018;8:8978.

  57. 57.

    Quéméré E, Perrier C, Besnard A-L, Evanno G, Baglinière J-L, Guiguen Y, et al. An improved PCR-based method for faster sex determination in brown trout (Salmo trutta) and Atlantic salmon (Salmo salar). Conserv Genet Resour. 2014;6:825–7.

  58. 58.

    Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a toolset for whole-genome association and population-based linkage analysis. Am J Hum Genet. 2007;81.

  59. 59.

    Kang HM, Sul JH, Service SK, Zaitlen NA, Kong SY, Freimer NB, et al. Variance component model to account for sample structure in genome-wide association studies. Nat Genet. 2010;42:348–54.

  60. 60.

    Carvajal-Rodriguez A. Myriads: p-value-based multiple testing correction. Bioinformatics. 2018;34:1043–104.

  61. 61.

    Canela-Xandri O, Law A, Gray A, Woolliams JA, Tenesa A. A new tool called DISSECT for analysing large genomic data sets using a big data approach. Nat Commun. 2015;6:10162.

Download references

Acknowledgements

We would like to thank Pablo Caballero (Xunta de Galicia), Gerónimo de la Hoz (Principado de Asturias), Iñigo Mendiola (Diputación Foral de Gipuzkoa), Javier Álvarez and Pedro Luenda (GANASA) for providing samples. We also acknowledge Silje Karoliussen (NMBU – CIGENE) for support with processing data quality.

Funding

This study was funded by the Ministerio de Economía y Competitividad of Spain (grant RZ2012–00011-C02–00) and Fondos FEDER. MG was supported by a fellowship from Fundación UCM-Manuel Álvarez López, Universidad Complutense de Madrid. MS was supported by the Juan de la Cierva Programme (Ministerio de Ciencia e Innovación, grant JCI-2011-10896). The funding bodies played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

MS and PM acquired funding, planned the study design, coordinated the study and performed manuscript corrections. BV assisted with coordination and discussion. MG, AIF, and MS performed the data analysis. AC, LCS and AF assisted with the data analysis. MPK performed the array hybridization and data processing. MG wrote the first draft of the manuscript. All authors discussed the results, made suggestions and corrections. All authors read and approved the final manuscript.

Correspondence to María Saura.

Ethics declarations

Ethics approval and consent to participate

The current study was carried out under a Project License from the INIA Scientific Ethic Committee. Collection of samples and animal manipulations were performed according to the Spanish Policy for Animal Protection RD1201/05, which meets the European Union Directive 86/609 about the protection of animals used in experimentation. We hereby confirm that the INIA Scientific Ethic Committee, which is the named IACUC for the INIA, specifically approved this study.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1. Information relative to SNPs significantly associated with sex identified in the GWAS

Additional file 2. Information relative to candidate regions identified in the GWAS for chromosomes Ssa02 and Ssa21contained in the Salmo salar annotation (Salmo_salar-annotation.gff3)

Additional file 3. Results of the BLAST analysis of sex-related genes previously identified in other species of the class Actinopterygii against the Salmo salar reference genome (assembly ICSASG_v2), indicating the corresponding chromosome match and position in Salmo salar and BLAST parameters

Additional file 4. Plot representing the decrease of linkage disequilibrium (measured as the squared correlation between alleles at different loci, r2) across physical distance in the chromosome

Additional file 5. Genotypic and phenotypic information used for the GWAS. Genotypes for the 36,625 (pruned) SNPs are coded as allele dosage (0, 1, 2) for the minor allele (A1) for each individual (first row). Phenotypic information for sex (1: male, 2: female) for each individual is also included (second row). Missing values are coded as NA

Additional file 6. Map information for the 36,625 (pruned) SNPs used for the GWAS, containing the corresponding chromosome, SNP name, physical position (in bp) and alleles (A1: minor allele, A2: major allele)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Gabián, M., Morán, P., Fernández, A.I. et al. Identification of genomic regions regulating sex determination in Atlantic salmon using high density SNP data. BMC Genomics 20, 764 (2019) doi:10.1186/s12864-019-6104-4

Download citation

Keywords

  • Atlantic salmon
  • GWAS
  • High-dense SNP chip
  • Regional heritability analysis
  • Sex determination