Skip to main content

Genome-wide association analysis of adaptation to oxygen stress in Nile tilapia (Oreochromis niloticus)



Tilapia is one of the most abundant species in aquaculture. Hypoxia is known to depress growth rate, but the genetic mechanism by which this occurs is unknown. In this study, two groups consisting of 3140 fish that were raised in either aerated (normoxia) or non-aerated pond (nocturnal hypoxia). During grow out, fish were sampled five times to determine individual body weight (BW) gains. We applied a genome-wide association study to identify SNPs and genes associated with the hypoxic and normoxic environments in the 16th generation of a Genetically Improved Farmed Tilapia population.


In the hypoxic environment, 36 SNPs associated with at least one of the five body weight measurements (BW1 till BW5), of which six, located between 19.48 Mb and 21.04 Mb on Linkage group (LG) 8, were significant for body weight in the early growth stage (BW1 to BW2). Further significant associations were found for BW in the later growth stage (BW3 to BW5), located on LG1 and LG8. Analysis of genes within the candidate genomic region suggested that MAPK and VEGF signalling were significantly involved in the later growth stage under the hypoxic environment. Well-known hypoxia-regulated genes such as igf1rb, rora, efna3 and aurk were also associated with growth in the later stage in the hypoxic environment. Conversely, 13 linkage groups containing 29 unique significant and suggestive SNPs were found across the whole growth period under the normoxic environment. A meta-analysis showed that 33 SNPs were significantly associated with BW across the two environments, indicating a shared effect independent of hypoxic or normoxic environment. Functional pathways were involved in nervous system development and organ growth in the early stage, and oocyte maturation in the later stage.


There are clear genotype-growth associations in both normoxic and hypoxic environments, although genome architecture involved changed over the growing period, indicating a transition in metabolism along the way. The involvement of pathways important in hypoxia especially at the later growth stage indicates a genotype-by-environment interaction, in which MAPK and VEGF signalling are important components.


Tilapia is one of the most important species in aquaculture noted for their relative ease of culture and rapid growth. Tilapia is currently cultured in over 120 countries, mainly in the tropics and sub-tropics, with a production from 0.3 million tonnes in 1987 to closely 7 million tonnes in 2018, which makes it the second largest aquaculture species in the world [1]. Tilapia is a valuable protein source in developing and emerging economies. Due to its wide range of culturing conditions, tilapia is also an excellent model to study adaptive responses to environmental stresses [2]. One of the most important non-commercial breeding programs is the Genetically Improved Farmed Tilapia (GIFT), executed by WorldFish in Malaysia. It has sustained genetic gains for growth and body trait more than 10% per generation for more than six generations [3]. However, rapid growth potentially exacerbates existing limitations in the production environment. In non-aerated ponds, high stocking density can lead to an extreme hypoxic environment, especially at the end of the night (nocturnal hypoxia), when algae have higher rate of oxygen consumption than oxygen production. The extreme hypoxic environment can lead to lower feed intake, stagnated growth, and susceptibility to disease [4, 5]. The result is a higher mortality and lower yield than what could potentially be achieved [6]. The effects can be mitigated through mechanical aeration of ponds, but a daily fluctuation in oxygen availability is nevertheless inevitable.

Response to hypoxia is a highly complicated biological process that has received considerable scientific attention, both in fishes and in land vertebrates (e.g. high-altitude adaptation studies). Most of these response processes happen very early at the onset of hypoxia through the activation of pathways depending on proteins that are already present [7]. But in the longer term, adaptive responses to hypoxia are leading to different expression of genes. In mammals, studies in the past decades pointed to an essential role of the Hypoxia-inducible factors (HIF) for gene expression regulation during hypoxia [8]. Other genes such as tyrosine hydroxylase (TH), phosphoglycerate kinase 1 (PGK1) and vascular endothelial growth factor (VEGF) are also important key actors [9]. Recent studies have described that fish have homologs of HIF-α and -β, which may show similar function to those in mammals in the hypoxic environment [9, 10]. Several other hypoxia-related proteins and signal pathways have been reported, such as AMP-activated protein kinase (AMPK), reactive oxygen species (ROS), mitogen-activated protein kinase (MAPK) and IGF-1/PI3K/AKT signalling, which have been reported to to be involved in hypoxia adaptation of some fish species [11, 12].

Genetic adaptation to hypoxia is important for survival in many aquatic species, since variation in oxygen availability in water can vary far more, and far more rapidly, than in terrestrial ecosystems. Hypoxia is an important cause of economic losses in aquaculture. Understanding the genomic architecture of hypoxia adaptation could help to improve resilience through breeding programs for economically important species. So far, hypoxia tolerance has been studied in a limited number of fish species, including catfish [13, 14], Atlantic salmon [15], and tilapia [16], with the aim to identify QTLs for hypoxia-tolerant traits. Genome-wide association study (GWAS) has been regarded as a powerful tool to identify genetic markers associated with target traits, and a more complete gene network will provide the knowledge bases required for the aquaculture industry to make improvements [17]. In hybrid catfish, Zhong et al. [13] revealed in total nine SNPs associated with dissolved oxygen (DO) level using a 250 K SNP array. Analysis of the genes overlapping or close to those SNPs suggested that many of those genes were involved in the PI3K/AKT and VEGF pathways. In another study, Brennan et al. [18] aimed to identify population differences in hypoxia tolerance by calculating the amount of time for Killifish to lose equilibrium using GWAS. They found that variation in Hyaluronan synthase 1 (has1) influenced the production of hyaluronan, which can directly effect on hypoxia tolerance.

There are only a few studies that focused on genetic bases of either hypoxia tolerance or growth in Nile tilapia [16, 19], however, none of these investigated how hypoxia influences growth in Nile tilapia. The main objective of this study was to unravel the genomic architecture associated with phenotypic variation during adaptation to hypoxia or normoxia, and to elucidate the effect of hypoxia on the genetic regulation of growth.


Phenotype statistics

Fish fry was produced from generation 15 of the GIFT breeding program. The experiment was carried out in an aerated (normoxic) and non-aerated (nocturnal hypoxic) ponds, each producing 1026 and 1037 fish that were involved in the analysis. Body weight of growing fish was measured at five time points (Table 1). The data show that the number of tilapia in both environments gradually decreased. This effect was more pronounced in the hypoxic environment, with a total loss from stocking to harvest of 23% of the initial number of individuals, compared to 14% in the normoxic environment. The average body weight at five time points in the normoxic environment was significantly higher than those in the hypoxic environment, with the exception of the first time point (BW1). Interestingly, the coefficient of variation in body weight (CV) at each time point in the two separate environments decreased.

Table 1 Summary statistics of body weight across the whole growth period in Nile tilapia

The estimated phenotypic correlations for body weight between different time points in the two environments are shown in Table 2. Results show that phenotypic correlation between time points in the hypoxic and nomorxic environments was initially high (0.80 and 0.81 separately), but decreased with increasing time between measurements.

Table 2 Phenotypic correlations of body weight across the whole growth period in different environments

SNP statistic and population structure

In total 27,090 SNPs that passed SNP minor allele frequency, genotype and individuate call rate criteria, were used for subsequent analysis. Those SNPs were found to be randomly distributed across the genome with a density of approximately 28 SNP per Mb. The highest number of SNPs (4344) on LG3 while LG11 had the lowest number of SNPs (630) (Fig. 1a). A few windows on LG3 show a higher density of SNPs (Fig. 1b). Besides this exception, the distribution of SNPs is uniform with the linkage group physical length of the Oreochromis niloticus genome (GenBank accession GCF_001858045).

Fig. 1

SNP statistics with all individuals. a Histogram of SNPs distribution across all linkage groups. b SNP density plots across all linkage groups. c and d 3D PC plot for origin of tilapia at BW1 in the hypoxic (c) and normoxic (d) environments using all SNPs that passed filtering, where each dot represents one individual

The PCA represents the genetic structure for individuals from the hypoxic and normoxic environments, respectively (Fig. 1c, d and Supplementary Figures 3 and 4). In the hypoxic environment, the first three principal components (PCs) explain 47.0% of the total genotype-based variation and separate samples according to their family differences. PC1 accounts for 15.2% of the total genotype variation and separates families in hapa3 with other families. In the normoxic environment, the first three components explain 39.8% of the total genotype variation, while the first component accounts for 15.3%. Moreover, the largest PC (PC1) of all samples separates disperse cluster from families in hapa3 again.

These results indicated that there was clear genetic variation caused by family differences in both environments. This was partially caused by the different distribution of the number of fish from four rearing hapas under the normoxic and hypoxic environments. Additionally, the average body weight of fish in hapa3 was larger than that of other hapas, especially the mean body weight of male fish at the first time point was much higher in the normoxic environment than the hypoxic environment (Supplementary Figure 2), indicating that a few families with high body weight dominated in one environment but not the other.

Single environmental GWAS at five different time points

Significant SNPs were detected with a univariate GWAS by implementing a linear mixed model. We observed that sex and hapa effects can explain part of the difference in body weight. Thus, these were treated as fixed factors in our analysis. Overall, five association analyses, one for each time point where body weight was measured, were performed for each environment. The Manhattan plots for each of the five time points in the hypoxic and normoxic environments are shown in Fig. 2a and b respectively. In addition, Quantile-Quantile plots with genomic inflation factors were created to aid in estimating the influence of population structure on single environmental GWAS (shown in Supplementary Figures 5 and 6). The P values of corrected thresholds for suggestive and genome-wide significant levels were 4.22 (−log10(1/16504)) and 5.52 (−log10(0.05/16504)), respectively.

Fig. 2

Manhattan plots across the whole growth period in the hypoxic environment. a and normoxic environment (b). Each dot on this figure corresponds to a SNP within the dataset, while the orange and blue horizontal line represent the genome-wide significance (5.52) and suggestive significance threshold value (4.22), respectively. The Manhattan plots contain –log10 observed P-values for genome-wide SNPs (y-axis) plotted against their corresponding position on each chromosome (x-axis)

In the hypoxic environment, the analyses showed 10 significant and 26 suggestive SNPs associated with BW1 to BW5 (Supplementary Table 2). Among those, six SNPs between 19.48 Mb and 21.04 Mb on LG8 attained genome-wide significance for BW1 to BW3. However, those SNPs were not significant for BW4 and BW5. Two SNPs (LG1: 30766342 and LG1:30766336) were significantly associated with BW3 to BW5. Additionally, 16 SNPs above the suggestive level as defined above for BW1 to BW2 were found on LG8, LG18 and LG19, while 18 SNPs mostly located on LG1 and LG8, were found for BW4 to BW5. Interestingly, at BW3, SNPs on LG8 overlapped with BW1 and BW2, while SNPs on LG1 overlapped with BW4 and BW5, further confirming that there is a transition in genomic architecture associated with growth over time.

We also detected 2 significant and 27 suggestive SNPs across different growth stages in the normoxic environment (Supplementary Table 3). The suggestive peak at BW1 covered the same genomic region as that found for the hypoxic environment between 19.48 to 21.03 Mb on LG8. However, similar to the hypoxic environment, the significance of those SNPs declined from BW1 to BW3, a pattern also seen for the SNPs located on LG18 and LG22. A few SNPs on LG7 and LG15 also showed a signal near the suggestive level from BW3 to BW5, which could be potentially interesting, although they did not attain statistical significance.

Meta-analysis GWAS across two environments

A meta-analysis GWAS that considered the effects of 27,090 SNPs in common in the hypoxic and normoxic environments was performed, and the results are shown in Fig. 3. In total 33 SNPs were detected to be significant with five measurements of body weight during the whole growth stage. Clusters of significant SNPs were mostly found on LG8, LG18 and LG22 (Supplementary Table 4). Interestingly, six SNPs located between 19.48 and 21.03 Mb on LG8, three SNPs between 12.44 and 27.32 Mb on LG18 and three SNPs within 1 kb at 35.25 Mb on LG22, were all significantly associated with body weight at time points BW1 and BW2. However, the P-values of those SNPs decreased in subsequent growth periods. Five SNPs between 30.54 and 31.19 Mb on LG1, and one SNP on LG15 (LG15:23051993), were associated with body weight from BW3 to BW5. Moreover, two SNPs on LG8 (LG8:4319661, LG8: 11800435) were significant at BW4 and BW5. Notably, those SNPs located on LG8 were found at a different region compared to SNPs on the same LG in hypoxic GWAS. Hence, associations for BW1 to BW2 were different from BW4 to BW5, although BW3 shows both overlap to early and late growth stages, which could indicate that a transition in the pathways involved occurred around this stage.

Fig. 3

Manhattan plots of Meta-analysis GWAS across two environments. The orange and blue horizontal line represent the genome-wide significance (3.03E-06) and suggestive significance threshold value (6.06E-05) respectively

Functional annotation analysis

Based on the SNP association pattern for five measurements across the whole growth stage, we defined the early stage as BW1 and BW2, while the later stage is BW3 to BW5. Through gene identification within the associated genomic regions, the functional processes and pathways were subsequently enriched for single environmental and across environmental GWAS, respectively. Considering that BW3 is the transition point, SNPs that overlapped with the early stage were excluded in the functional annotation for the later stage. The candidate genes derived from single environment and across environment GWAS are shown in Fig. 4a and b, where 15 and 25 genes from the BW1 to BW2  and BW3 to BW5 respectively, were uniquely associated with body weight in the hypoxic environment while another 12 genes were unique to growth in the normoxic environment. It is also noteworthy that three genes (raraa, rarab, bahcc1) were significant for BW1 and BW2 for both single and across environmental GWAS.

Fig. 4

Functional annotation based on candidate genomic region associated with growth. a Venn diagram summarising the gene count of the early stage (BW1 to BW2) from hypoxia, normoxia and meta-analysis (cross normoxia and hypoxia). b Venn diagram summarising gene count of later stage (BW3 to BW5) from hypoxia, normoxia and meta-analysis. c KEGG enrichment of candidate genes in later stage of hypoxia environment (d) protein association network among candidate genes in later stage of the hypoxia environment

During the early growth stage in the hypoxic environment, 14 GO (Gene ontology) terms were found to be significantly overrepresented (Supplementary Table 5), including central nervous system development and steroid hormone mediated signalling pathways. Six KEGG pathways were found at later growth stage (Fig. 4c), including MAPK and VEGF signalling pathways. Protein interaction network analysis showed dock5, dock10, dock11, baiap2a, baiap2b, aurka and aurkb strongly interacting with rac1b and ppp3ca, which all are proteins participating in MAPK and VEGF signalling (Fig. 4d).

For the early growth stage of the normoxic environment, retinoic acid receptor signalling pathway, apoptotic signalling pathway, liver development, signal transduction, steroid hormone mediated signalling pathway and brain development biological processes (Supplementary Table 6), were significantly enriched, while two (retinoic acid receptor and steroid hormone mediated signalling pathways) overlapped with the same growth period in the hypoxia environment. However, in contrast to the hypoxic environment, we did not find significant terms during the later growth stage in the normoxic environment.

In the meta-analysis GWAS across the normoxic and hypoxic environments, nine GO terms, including retinoic acid receptor signalling pathway and steroid hormone mediated signalling pathway, were mostly enriched in the early growth stage. During the later growth stage, two pathways involved in oocyte meiosis and progesterone-mediated oocyte maturation process. Interestingly, none of hypoxia-related pathway was enriched (Supplementary Table 7).


Hypoxia is one of the major environmental factors in fish. Hypoxia tolerance represents the ability of fish species to tolerate low oxygen level and to maintain a sustainable metabolic rate at lower dissolved oxygen levels [20]. Growth is a key trait for aquaculture and can be assessed by weight gain in order to examine the impact of hypoxic condition on fish production. For more than a half century, various and divergent claims have been made regarding the interaction between body size and hypoxia in teleost fish. Recent studies showed that small individuals have the least hypoxia tolerance within some fish species, such as Oscar cichlid [21, 22] and Red seabream [23]. In contrast, small fish chose lower oxygen levels more than large fish in Largemouth bass [24] and Yellow perch [25], however, this behaviour was suggested that the smaller fish utilize the hypoxic zone as refuge protected from the bigger predators [26]. From these studies it is clear that selection for low oxygen is difficult to ascertain, indicating a clear added value of investigations into genetic consequences of selection, such as the present study.

In general, metabolic rate is highly affected by dissolved oxygen in the rearing environment. Faster growing animals have a higher metabolic rate and therefore require more oxygen. As a consequence, hypoxia is expected to adversely affect fish growth and feed utilization [6]. On the other hand, large individuals have an obvious advantage over small ones in severe hypoxic environments because small fish will use up their glycogen reserves and reach mortality levels much faster with a higher metabolic rate [27]. Overall fish production declines, and disease resistance decreases as a consequence of hypoxia [28]. It has been observed that larger Nile tilapia tolerated low DO levels better than small ones, thought partially due to the fact that Nile tilapia immunity was stronger in larger than smaller [29]. Regardless of the complexity of the relationship between hypoxia and growth, studies focused on the genomic basis of hypoxia-growth interactions in Nile tilapia are sparse.

Our results suggest a number of genes and metabolic pathways involved in the adaptation to differences in dissolved oxygen in Nile tilapia. In the hypoxic environment, 14 significantly enriched processes were associated with the early growth stage, including nervous system development and animal organ development. Rara gene codes for the retinoic acid receptor alpha, a transcription factor which regulates genes involved in cellular growth and differentiation [30]. In addition, raraa and rarab play a key role during development in zebrafish [31]. Mediator of RNA polymerase II transcription subunit 24 (med24), an orthologue also found in human, mouse and zebrafish, participates in nervous system development [32]. However, these genes and associated molecular pathways do not indicate a clear link with hypoxia when comparing to other fish studies, and rather might reflect a relation to general growth and developmental pathways.

During the later growth stage, the results of pathway enrichment suggest that candidate regions are significantly enriched for adherens junctions, oocyte meiosis, MAPK signalling pathway, VEGF signalling pathway, regulation of actin cytoskeleton and progesterone-mediated oocyte maturation. Among these six pathways, various studies in zebrafish, channel catfish, and sea bass have shown MAPK to be involved in low oxygen tolerance in fish [14, 33, 34]. VEGF signalling was shown to be essential for maintaining the vascular density and oxygen supply in tissues [35]. Additionally, the VEGF pathway is also one of the targets of HIF-1α, which rapidly accumulates to activate genes involved in a series of responses to hypoxia [8, 36]. The candidate gene igf1ra, identified in this study, codes for IGF-1 receptor-a, a receptor of insulin-like growth factor that was reported to be a primary mediator of growth hormones [37]. The ephrin-A3 gene (efna3) is shown as a key functional mediator of hypoxic microenvironment and is regarded as a therapeutic target for hypoxia-specific disease [38]. Retinoic acid receptor-related orphan receptor alpha (rora) was demonstrated to be a key regulator of HIF-1α activities in human [39]. Finally, the aurora kinase A (aurka) gene, a serine kinase in neuroblastoma related to cell growth and migration, can up-regulate expression in human BE (2)-C cells under hypoxia [40]. Recently, Li et al. [16] also found that several regions were significantly related with hypoxia tolerance, including LG3, 4, 11, 14 and 22, especially two regions (LG4:15080000, LG11:24255000) are found to be adjacent with the peak in the hypoxic environment (BW5) of our study. Nevertheless, our results suggest that hypoxia has a non-significant effect on growth during the early growth stage, while, conversely, faster growing tilapia have higher tolerance to hypoxia in the later growing stage, reflected by survival probability. Interestingly, it has been shown that tilapia exposure to a nocturnal hypoxia for 9 weeks led to a better growth performance than normoxia, which is related with a compensatory appetite later in the day [41]. Additionally, Roze et al. [42] has reported that fast growing fish display a better ability to maintain balance to acute hypoxia exposure than slow growing fish, by comparing two genetically different growth strains of Rainbow trout, suggesting a better hypoxia tolerance similar to the findings presented in our study.

In the normoxic environment, six biological processes were significantly enriched for BW1 and BW2, including retinoic acid receptor signalling pathway, apoptotic signalling pathway, liver development, signal transduction, steroid hormone mediated signalling pathway and brain development. Steroid hormone mediated and retinoic acid receptor signalling pathway overlapped with the same stage in the hypoxia environment, which seems mostly involved in general growth and development processes. The overlap in the early growth stage between normoxic and hypoxic environments may result from shared conditions until the first time point. Another possibility is that hypoxia affected small fish less, and there still was sufficient dissolved oxygen as a result of lower overall demand. As fish grew bigger, the metabolic impact of high growth on oxygen consumption and availability may have become more pronounced [43].

For the later growth stage, 12 suggestive SNPs tagging regions containing 22 candidate genes were identified. These included the gene coding for mitochondrial calcium uniporter (mcu) that was reported to be a regulator in skeletal muscle growth and homeostasis [44]. The genes coding for oncoprotein-induced transcript 3 (oit3) and MAP 6 domain containing 1 (map 3d1) were both reported to be related with calcium ion binding activity [45]. Yoshida et al. [19] performed the first genome-wide association study to unravel the genetic architecture of harvest weight in a Nile tilapia population derived from a mixture of the 8th generation GIFT and the wild strains from Egypt and Kenya. In that study, four regions were identified that were significantly associated with harvest weight in LG12, 15, 18 and 22, respectively. However, the genes lying in these regions were not significant in our study. One of the reasons could be that the GIFT population has been selected on growth for many generations and those regions have become fixed. This could also explain the limited number of significant SNPs and candidate genes for growth observed in our study. However, it is also likely the specific variants found by Yoshida et al. were never present in our population to begin with.

The results from the meta-analysis show that five genes play a major role in growth and development during the early growth stage, namely raraa, rarab, med24, brms11a and prpf38b. Two of them (raraa, rarab) also showed significance for single GWAS in the normoxic and hypoxic environments, respectively. Prpf38b only showed a major effect in the hypoxic environment. The orthologues of this gene in human, zebrafish and mouse have been shown to have a function in the central nerve system [46]. Development related genes found in single GWAS, such as raraa, rarab, and med24 were significantly associated in the meta-analysis during the later stages. Nucleotide-binding protein 2 (nump2) was reported to be associated with both IGF1 and IGFP3 in a human GWA study [47]. Those results suggest that a few major QTLs determine much of the growth rate. Even though growth rate is known to be determined by many genes [48], similarly in human [49] and cattle [50], it was found that a few genes were exceptionally important in explaining genetic variance.

Moreover, no pathway related to hypoxia tolerance was found in meta-analysis GWAS, which indicates some genes affect body weight in the hypoxic environment while different subset of genes are important for body weight under the normoxic environment (see in Fig. 4a and b). This indicates genotype-by-environment interaction (GxE). However, a GxE analysis for growth rate in the normoxic versus hypoxic environment, based on a quantitative genetic analysis using a genomic relationship matrix derived from the genotyping dataset, showed that the genetic correlation was close to 0.8 [51]. This value suggests some degree of GxE and some reranking of genotypes. Furthermore, there was a large difference in body weight and its variance between environments, which suggests scaling GxE. The genetic correlation of 0.8 suggests that most fish that grow well in a normoxic environment, are also able to grow well in an environment where they experience nocturnal hypoxia. After all, Nile tilapia is a fresh water fish species that has evolved in environments where hypoxia (e.g. as a result of high temperatures, algal blooms or drought) are nocturnal events. Natural selection would favour animals that would be able to cope with these environments if larger fish would have higher reproductive success.


Clear associations between genotype and growth were found for both hypoxic and normoxic environments. The associated SNPs, and hence the underlying genomic architecture, however, changed over the growing period. Furthermore, the meta-analysis GWAS across two environments suggested that growth was not under the control by the same genes compared to single environmental GWAS, which we interpret as a genotype-by-environment interaction. The functional annotation confirms that hypoxic stress pathways such as MAPK signalling pathway and VEGF signalling pathway play an important role during the later growth stage in the hypoxic environment. Our findings reveal the genetic complexity of body weight gain under a variety of dissolved oxygen conditions in Nile tilapia, and provide an essential insight into how hypoxia affects body weight gain during the growth stage, which will benefit future tilapia breeding programmes in the context of genomic architecture.


Animal resource

The fish were derived from the Aquaculture Extension Centre of the Malaysian Department of Fisheries at Jitra, Kedah State, Malaysia (6°15′32°N; 100°25′47°E). Genetically Improved Farmed Tilapia (GIFT) strain was used in this experiment, and it had been selected for growth based on estimating breeding value (EBV) of harvest weight, with the genetic gain ranged from 5 to 15% per generation. The mate allocation strategy has controlled inbreeding and maintained effective population size [52]. The experimental fish were produced using 72 males from 56 families and 200 females from 73 families (total 81 unique families) of selection line of GIFT generation 15. From each family, fish with EBV for growth that were close to the family mean EBV were selected as a breeder. The experimental fish were mass produced in four hapas (net-enclosures, each 30m2) installed in a 500m2 earthen pond, aerated by a paddlewheel. For each hapa, 18 male and 50 female breeders were distributed for stocking, and they were removed from mating hapa after 15 days. Fry were reared in the same hapas for 60 days until they reached a taggable size. The fingerlings from each rearing hapa were tagged and then transferred into two earthen ponds with an equal number. Overall 1570 fish were reared in each pond with stocking density of 3 fish/m2. We managed two ponds with the same feeding management (i.e. feeding frequency twice per day, feeding rate was adjusted with fish number), while aeration was the only different treatment between two ponds.

We measured DO every 2 h for 24 h with a total of 7 days during the different grow-out periods using EcoSense® DO200A. The average DO measurements for aerated pond (normoxia) and non-aerated pond (nocturnal hypoxia) are shown in Supplementary Figure 1 and Supplementary Table 1. Both ponds were normoxic (5 mg/L) from 13:00 to 19:00. Non-aerated pond became hypoxic (under 3 mg/L) between 21:00 to next day 9:00. Body weight was measured at five time points (stocking, 55/56 days, 104/105 days, 167/168 days and 217/218 days) growing out in the hypoxic and normoxic environments, respectively. Fish were euthanized using clove oil at a dose of 400 ppm after the experiment. Fin clips were preserved in 95% ethanol and stored at − 20 °C until DNA extraction. More details about this experiment can be found in Mengistu et al. [51].

Genotyping, variant calling and quality control

DNA extraction and genotyping procedures were described in previous study by Mengistu et al. [51]. In short, we isolated genomic DNA from tilapia fin clips using the DNeasy Blood and Tissue kit. DNA quality was assessed by 260/280 and 260/230 ratios on NanoDrop 2000 spectrophotometer. DNA concentration was measured with Qubit 2.0 Fluorometer. DNA samples were digested with ApeKI, and polymerase chain reaction (PCR) was used to amplify fragments varied from 170 to 350 bp. The prepared libraries were sequenced on the Illumina HiSeq 2000 platform.

Raw sequence reads were trimmed for adaptors and low quality bases with Sickle ( The quality of each individual was evaluated by FastQC (version 1.6) [53]. Sequence mapping for 2171 individuals was performed using bwa -mem algorithm [54] aligning to the tilapia reference genome (GenBank accession GCF_001858045.1). Variant calling was analysed with FreeBayes (version 1.0.2) [55] in a default setting excepted these parameters: --min-base-quality 10, −-haplotype-length 0 and --ploidy 2. The SNP data was further filtered by Plink (version1.9) [56] with the following exclusion criteria: Minor Allele Frequency < 2%, genotyping call-rate for SNPs < 80% and individual rate < 70%. Finally, a total of 2063 individuals and 27,090 SNPs were used for subsequent analyses.

Statistic description, population structure and association analysis

Basic statistics of phenotype data was analysed in R (version 3.5.3). Body weight in our study is not completely following a normal distribution as estimated by Shapiro-Wilk test [57]. Therefore, we compare two paired groups at five time point using the Wilcoxon test. The phenotypic correlation was calculated by spearman’s rank correlation coefficient method. Then, body weight was transformed to better fit the normal distribution by square root method [58]. To estimate the influence of factors such as hapa (early rearing environment) and sex in our experiment, they were tested in a linear model using Stepwise Algorithm [59] with the formula: yij = u + αi + βj + αiβjεij, while y is the body weight; u is the population mean; αi is the effect of the ith level of hapa; βj is the effect of the jth level of sex; ε is the random error effect. It suggested that hapa, sex and their interaction were significant with body weight. Therefore, residuals from the fixed effects model were used for the subsequent association analysis [60].

A principal component analysis (PCA) was performed to estimate population structure before GWAS in Plink (version 1.9) [56]. The top five principal components were added as covariates and included in the subsequent GWAS model as fixed effect to account for the sample structure in this association analysis. Considering the Bonferroni method being overly conservative, we defined the genome-wide significant using the SimpleM method [61]. In total 16,504 independent tests were calculated based on LD (linkage disequilibrium) characteristics. The significant and suggestive lines are 1 and 5% genome-wide significant divided by the SNPs number of independent SNPs in the association. Given the number of effective independent tests, the thresholds for genome-wide and suggestive significance P-value were evaluated as 3.03E-06 (0.05/16504) and 6.06E-05 (1/16504), respectively.

A univariate GWAS was performed by implementing a linear mixed model in GEMMA [62]:

$$ \boldsymbol{y}=\boldsymbol{W}\boldsymbol{\alpha } +\boldsymbol{x}\boldsymbol{\beta } +\boldsymbol{\mu} +\boldsymbol{\varepsilon} $$

In this equation, y is the a vector of observation on body weight; W is a covariate matrix of fixed effects (including top five PCs) used to adjust population structure; α is a vector of the corresponding coefficient including the intercept; x is a vector of the marker genotypes and β is the corresponding vector of marker effects for the phenotypes; μ is a vector of random effects and ε is the random residuals. We performed the Wald statistic for each SNP which means we tested the alternative hypothesis H1β ≠ 0 compared to null hypothesis H0: β= 0 for each SNP, which is one of common methods in GWAS studies of quantitative traits [63].

Meta-analysis is powerful to detect shared genetic architecture across traits and populations [64]. Thus, we applied an inverse-variance weighted (IVW) method to estimate the SNP effect and significance combined normoxic and hypoxic environments through Meta (Version 1.7) [65, 66]. The weight (wi) for ith environment was calculated by the following equation:

$$ {\mathrm{w}}_{\mathrm{i}}=\frac{1}{{\mathrm{s}}_{\mathrm{i}}^2} $$

Here si is the standard error of the SNP effect in ith environment GWAS. Then, the effect size and standard error for ith environment GWAS were estimated by the following:

$$ \upbeta =\frac{\sum_{\mathrm{i}=1}^2{\mathrm{w}}_{\mathrm{i}}{\beta}_{\mathrm{i}}}{\sum_{\mathrm{i}=1}^2{\mathrm{w}}_{\mathrm{i}}} $$
$$ {\mathrm{s}}^2=\frac{1}{\sum_{\mathrm{i}=1}^2{\mathrm{w}}_{\mathrm{i}}} $$

The statistical significance was estimated by a z-score of IVW as bellow:

$$ \mathrm{z}=\frac{\upbeta}{\mathrm{s}}=\frac{\sum_{\mathrm{i}=1}^2{\mathrm{w}}_{\mathrm{i}}{\beta}_{\mathrm{i}}}{\sqrt{\sum_{\mathrm{i}=1}^2{\mathrm{w}}_{\mathrm{i}}}} $$

Post-GWAS analysis

Manhattan and quantile-quantile (Q-Q) plots were generated through the “qqman” package ( The inflation factor λ was calculated to indicate the influence of population structure in the association analyses. Candidate regions were defined as the genomic regions that located 20 kb upstream and downstream of the genome-wide significant SNPs. In order to identify candidate genes nearby the significant SNPs, we used the Custom Annotations function to create an annotation set with parameters (−-distance 20,000 --gene_phenotype --symbol) in Ensembl Variant Effect Predictor (VEP) [67]. All protein sequences of candidate genes were extracted through reference protein sequence with an inhouse python script, and were further used for functional enrichment analysis in STRING V11.0 [68]. The false discovery rate (FDR) adjusted p-value of 0.05 was used to define significant enrichment.

Availability of data and materials

The genotype and phenotype data generated or analysed during this study are in the Harvard Dataverse repository with accession number KCBEON, which can be accessed at The tilapia reference genome (GCF_001858045.1_ASM185804v2_genomic.fna.gz) and annotation file (GCF_001858045.1_ASM185804v2_genomic.gff.gz) were downloaded from the NCBI genome assembly website ( The authors declare that all data supporting the findings are available within this article and its supplementary files.



AMP-activated protein kinase


Aurora kinase A


Body weight


Coefficient of variation


Dissolved oxygen


Estimated breeding value




False discovery rate


Genetically Improved Farmed Tilapia


Gene ontology


Genome Wide Association Study


Genotype-by-environment interaction


Hyaluronan synthase


Hypoxia-Inducible factors


Inverse-variance weighted


Kyoto Encyclopedia of Genes and Genomes


Linkage disequilibrium


Lineage group


Mitogen-activated protein kinase

MAP 3D1:

MAP 6 domain containing 1


Mitochondrial calcium uniporter


Mediator of RNA polymerase II transcription subunit 24


Nucleotide-binding protein 2


Oncoprotein-induced transcript 3


Principal component analysis


Principal component


Phosphoglycerate kinase 1




Retinoic acid receptor-related orphan receptor alpha


Reactive oxygen species


Standard deviation


Single nucleotide polymorphism


Tyrosine hydroxylase


Vascular endothelial growth factor


Variant Effect Predictor


  1. 1.

    FAO. The State of World Fisheries and Aquaculture 2020. Rome: Sustainability in action; 2020.

  2. 2.

    Guyon R, Rakotomanga M, Azzouzi N, Coutanceau JP, Bonillo C, D'Cotta H, Pepey E, Soler L, Rodier-Goud M, D'Hont A, et al. A high-resolution map of the Nile tilapia genome: a resource for studying cichlids and other percomorphs. BMC Genomics. 2012;13:222.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  3. 3.

    Ponzoni RW, Nguyen NH, Khaw HL, Hamzah A, Bakar KRA, Yee HY. Genetic improvement of Nile tilapia (Oreochromis niloticus) with special reference to the work conducted by the WorldFish Center with the GIFT strain. Rev Aquac. 2011;3(1):27–41.

    Article  Google Scholar 

  4. 4.

    Portner HO. Oxygen- and capacity-limitation of thermal tolerance: a matrix for integrating climate-related stressor effects in marine ecosystems. J Exp Biol. 2010;213(6):881–93.

    PubMed  Article  Google Scholar 

  5. 5.

    Fitzgibbon QP, Strawbridge A, Seymour RS. Metabolic scope, swimming performance and the effects of hypoxia in the mulloway, Argyrosomus japonicus (Pisces : Sciaenidae). Aquaculture. 2007;270(1–4):358–68.

    Article  Google Scholar 

  6. 6.

    Mengistu SB, Mulder HA, Benzie JAH, Komen H. A systematic literature review of the major factors causing yield gap by affecting growth, feed conversion ratio and survival in Nile tilapia (Oreochromis niloticus). Rev Aquac. 2020;12(2):524–41.

    Article  Google Scholar 

  7. 7.

    Michiels C. Physiological and pathological responses to hypoxia. Am J Pathol. 2004;164(6):1875–82.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  8. 8.

    Majmundar AJ, Wong WHJ, Simon MC. Hypoxia-inducible factors and the response to hypoxic stress. Mol Cell. 2010;40(2):294–309.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  9. 9.

    Nikinmaa M, Rees BB. Oxygen-dependent gene expression in fishes. Am J Physiol Regul Integr Comp Physiol. 2005;288(5):R1079–90.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  10. 10.

    Xiao W. The hypoxia signaling pathway and hypoxic adaptation in fishes. Sci China Life Sci. 2015;58(2):148–55.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  11. 11.

    Zhu CD, Wang ZH, Yan BA. Strategies for hypoxia adaptation in fish species: a review. J Comp Physiol B. 2013;183(8):1005–13.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  12. 12.

    Du SNN, Mahalingam S, Borowiec BG, Scott GR. Mitochondrial physiology and reactive oxygen species production are altered by hypoxia acclimation in killifish (Fundulus heteroclitus). J Exp Biol. 2016;219(8):1130–8.

    PubMed  PubMed Central  Google Scholar 

  13. 13.

    Zhong X, Wang X, Zhou T, Jin Y, Tan S, Jiang C, Geng X, Li N, Shi H, Zeng Q, et al. Genome-wide association study reveals multiple novel QTL associated with low oxygen tolerance in hybrid catfish. Mar Biotechnol. 2017;19(4):379–90.

    CAS  Article  Google Scholar 

  14. 14.

    Wang XZ, Liu SK, Jiang C, Geng X, Zhou T, Li N, Bao LS, Li Y, Yao J, Yang YJ, et al. Multiple across-strain and within-strain QTLs suggest highly complex genetic architecture for hypoxia tolerance in channel catfish. Mol Gen Genomics. 2017;292(1):63–76.

    CAS  Article  Google Scholar 

  15. 15.

    Anttila K, Dhillon RS, Boulding EG, Farrell AP, Glebe BD, Elliott JAK, Wolters WR, Schulte PM. Variation in temperature tolerance among families of Atlantic salmon (Salmo salar) is associated with hypoxia tolerance, ventricle size and myoglobin level. J Exp Biol. 2013;216(7):1183–90.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  16. 16.

    Li HL, Gu XH, Li BJ, Chen CH, Lin HR, Xia JH. Genome-wide QTL analysis identified significant associations between hypoxia tolerance and mutations in the GPR132 and ABCG4 genes in Nile Tilapia. Mar Biotechnol. 2017;19(5):441–53.

    CAS  Article  Google Scholar 

  17. 17.

    Aquaculture Genomics G, Breeding W, Abdelrahman H, ElHady M, Alcivar-Warren A, Allen S, Al-Tobasei R, Bao L, Beck B, Blackburn H, et al. Aquaculture genomics, genetics and breeding in the United States: current status, challenges, and priorities for future research. BMC Genomics. 2017;18(1):191.

    Article  Google Scholar 

  18. 18.

    Brennan RS, Healy TM, Bryant HJ, Van La M, Schulte PM, Whitehead A. Integrative population and physiological genomics reveals mechanisms of adaptation in killifish. Mol Biol Evol. 2018;35(11):2639–53.

    CAS  PubMed  PubMed Central  Google Scholar 

  19. 19.

    Yoshida GM, Lhorente JP, Correa K, Soto J, Salas D, Yanez JM. Genome-wide association study and cost-efficient genomic predictions for growth and fillet yield in Nile Tilapia (Oreochromis niloticus). G3. 2019;9(8):2597–607.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  20. 20.

    Mandic M, Todgham AE, Richards JG. Mechanisms and evolution of hypoxia tolerance in fish. Proc Biol Sci. 2009;276(1657):735–44.

    CAS  PubMed  PubMed Central  Google Scholar 

  21. 21.

    Almeida-Val VM, Val AL, Duncan WP, Souza FC, Paula-Silva MN, Land S. Scaling effects on hypoxia tolerance in the Amazon fish Astronotus ocellatus (Perciformes: Cichlidae): contribution of tissue enzyme levels. Comp Biochem Physiol B Biochem Mol Biol. 2000;125(2):219–26.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  22. 22.

    Sloman KA, Wood CM, Scott GR, Wood S, Kajimura M, Johannsson OE, Almeida-Val VM, Val AL. Tribute to R. G. Boutilier: the effect of size on the physiological and behavioural responses of oscar, Astronotus ocellatus, to hypoxia. J Exp Biol. 2006;209(Pt 7):1197–205.

    PubMed  Article  PubMed Central  Google Scholar 

  23. 23.

    Ishibashi Y, Inoue K, Nakatsukasa H, Ishitani Y, Miyashita S, Murata O. Ontogeny of tolerance to hypoxia and oxygen consumption of larval and juvenile red sea bream, Pagrus major. Aquaculture. 2005;244(1–4):331–40.

    Article  Google Scholar 

  24. 24.

    Burleson ML, Wilhelm DR, Smatresk NJ. The influence of fish size on the avoidance of hypoxia and oxygen selection by largemouth bass. J Fish Biol. 2001;59(5):1336–49.

    Google Scholar 

  25. 25.

    Robb T, Abrahams MV. Variation in tolerance to hypoxia in a predator and prey species: an ecological advantage of being small? J Fish Biol. 2003;62(5):1067–81.

    Article  Google Scholar 

  26. 26.

    Vejrik L, Matejickova I, Juza T, Frouzova J, Seda J, Blabolil P, Ricard D, Vasek M, Kubecka J, Riha M, et al. Small fish use the hypoxic pelagic zone as a refuge from predators. Freshw Biol. 2016;61(6):899–913.

    CAS  Article  Google Scholar 

  27. 27.

    Nilsson GE, Ostlund-Nilsson S. Does size matter for hypoxia tolerance in fish? Biol Rev. 2008;83(2):173–89.

    PubMed  Article  PubMed Central  Google Scholar 

  28. 28.

    Abdel-Tawwab M, Monier MN, Hoseinifar SH, Faggio C. Fish response to hypoxia stress: growth, physiological, and immunological biomarkers. Fish Physiol Biochem. 2019;45(3):997–1013.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  29. 29.

    Abdel-Tawwab M, Hagras AE, Elbaghdady HAM, Monier MN. Effects of dissolved oxygen and fish size on Nile tilapia, Oreochromis niloticus (L.): growth performance, whole-body composition, and innate immunity. Aquacult Int. 2015;23(5):1261–74.

    CAS  Article  Google Scholar 

  30. 30.

    Wang XJ, Dasari S, Nowakowski GS, Lazaridis KN, Wieben ED, Kadin ME, Feldman AL, Boddicker RL. Retinoic acid receptor alpha drives cell cycle progression and is associated with increased sensitivity to retinoids in T-cell lymphoma. Oncotarget. 2017;8(16):26245–55.

    PubMed  PubMed Central  Article  Google Scholar 

  31. 31.

    Hale LA, Tallafuss A, Yan YL, Dudley L, Eisen JS, Postlethwait JH. Characterization of the retinoic acid receptor genes raraa, rarab and rarg during zebrafish development. Gene Expr Patterns. 2006;6(5):546–55.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  32. 32.

    Harrison C, Wabbersen T, Shepherd IT. In vivo visualization of the development of the enteric nervous system using a Tg(−8.3bphox2b:Kaede) transgenic Zebrafish. Genesis. 2014;52(12):985–90.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  33. 33.

    Tian Y, Wen HS, Qi X, Zhang XY, Li Y. Identification of mapk gene family in Lateolabrax maculatus and their expression profiles in response to hypoxia and salinity challenges. Gene. 2019;684:20–9.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  34. 34.

    Marques IJ, Leito JTD, Spaink HP, Testerink J, Jaspers RT, Witte F, van Den Berg S, Bagowski CP. Transcriptome analysis of the response to chronic constant hypoxia in zebrafish hearts. J Comp Physiol B. 2008;178(1):77–92.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  35. 35.

    Maitland ML, Lou XJ, Ramirez J, Desai AA, Berlin DS, McLeod HL, Weichselbaum RR, Ratain MJ, Altman RB, Klein TE. Vascular endothelial growth factor pathway. Pharmacogenet Genomics. 2010;20(5):346–9.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  36. 36.

    Kurihara T, Westenskow PD, Friedlander M. Hypoxia-inducible factor (HIF)/vascular endothelial growth factor (VEGF) signaling in the retina. Adv Exp Med Biol. 2014;801:275–81.

    PubMed  Article  PubMed Central  Google Scholar 

  37. 37.

    Hakuno F, Takahashi SI. IGF1 receptor signaling pathways. J Mol Endocrinol. 2018;61(1):T69–86.

    CAS  PubMed  Article  Google Scholar 

  38. 38.

    Husain A, Chiu YT, Ho DW, Sze KM, Chan LK, Tsui YM, Wong CC, Ng IO. EFNA3, a key functional mediator of hypoxic microenvironment in hepatocellular carcinoma. Cancer Res. 2018;78(13 Suppl):Abstract nr 2431.

  39. 39.

    Li H, Zhou L, Dai J. Retinoic acid receptor-related orphan receptor RORalpha regulates differentiation and survival of keratinocytes during hypoxia. J Cell Physiol. 2018;233(1):641–50.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  40. 40.

    Romain CV, Paul P, Lee S, Qiao J, Chung DH. Targeting aurora kinase A inhibits hypoxia-mediated neuroblastoma cell tumorigenesis. Anticancer Res. 2014;34(5):2269–74.

    CAS  PubMed  PubMed Central  Google Scholar 

  41. 41.

    Obirikorang KA, Acheampong JN, Duodu CP, Skov PV. Growth, metabolism and respiration in Nile tilapia (Oreochromis niloticus) exposed to chronic or periodic hypoxia. Comp Biochem Physiol A Mol Integr Physiol. 2020;248:110768.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  42. 42.

    Roze T, Christen F, Amerand A, Claireaux G. Trade-off between thermal sensitivity, hypoxia tolerance and growth in fish. J Therm Biol. 2013;38(2):98–106.

    Article  Google Scholar 

  43. 43.

    Mandic M, Regan MD. Can variation among hypoxic environments explain why different fish species use different hypoxic survival strategies? J Exp Biol. 2018;221(21):jeb161349.

    PubMed  Article  PubMed Central  Google Scholar 

  44. 44.

    Kwong JQ, Huo JZ, Bround MJ, Boyer JG, Schwanekamp JA, Ghazal N, Maxwell JT, Jang YC, Khuchua Z, Shi K, et al. The mitochondrial calcium uniporter underlies metabolic fuel preference in skeletal muscle. Jci Insight. 2018;3(22):e121689.

    PubMed Central  Article  Google Scholar 

  45. 45.

    Sprague J, Bayraktaroglu L, Clements D, Conlin T, Fashena D, Frazer K, Haendel M, Howe DG, Mani P, Ramachandran S, et al. The Zebrafish Information Network: the zebrafish model organism database. Nucleic Acids Res. 2006;34(Database issue):D581–5.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  46. 46.

    Koscielny G, Yaikhom G, Iyer V, Meehan TF, Morgan H, Atienza-Herrero J, Blake A, Chen CK, Easty R, Di Fenza A, et al. The International Mouse Phenotyping Consortium Web Portal, a unified point of access for knockout mice and related phenotyping data. Nucleic Acids Res. 2014;42(Database issue):D802–9.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  47. 47.

    Teumer A, Qi Q, Nethander M, Aschard H, Bandinelli S, Beekman M, Berndt SI, Bidlingmaier M, Broer L, Group CLW, et al. Genomewide meta-analysis identifies loci associated with IGF-I and IGFBP-3 levels with impact on age-related traits. Aging Cell. 2016;15(5):811–24.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  48. 48.

    Wood AR, Esko T, Yang J, Vedantam S, Pers TH, Gustafsson S, Chun AY, Estrada K, Luan J, Kutalik Z, et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet. 2014;46(11):1173–86.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  49. 49.

    Couto Alves A, De Silva NMG, Karhunen V, Sovio U, Das S, Taal HR, Warrington NM, Lewin AM, Kaakinen M, Cousminer DL, et al. GWAS on longitudinal growth traits reveals different genetic factors influencing infant, child, and adult BMI. Sci Adv. 2019;5(9):eaaw3095.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  50. 50.

    Seabury CM, Oldeschulte DL, Saatchi M, Beever JE, Decker JE, Halley YA, Bhattarai EK, Molaei M, Freetly HC, Hansen SL, et al. Genome-wide association study for feed efficiency and growth traits in U.S. beef cattle. BMC Genomics. 2017;18(1):386.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  51. 51.

    Mengistu SB, Mulder HA, Benzie JA, Khaw HL, Megens H-J, Trinh TQ, Komen H. Genotype by environment interaction between aerated and non-aerated ponds and the impact of aeration on genetic parameters in Nile tilapia (Oreochromis niloticus). Aquaculture. 2020;529:735704.

    CAS  Article  Google Scholar 

  52. 52.

    Ponzoni RW, Khaw HL, Nguyen NH, Hamzah A. Inbreeding and effective population size in the Malaysian nucleus of the GIFT strain of Nile tilapia (Oreochromis niloticus). Aquaculture. 2010;302(1–2):42–8.

    Article  Google Scholar 

  53. 53.

    Andrews S. FastQC: a quality control tool for high throughput sequence data. In: Babraham bioinformatics. Cambridge: Babraham Institute; 2010.

    Google Scholar 

  54. 54.

    Li H, Durbin R. Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics. 2009;25(14):1754–60.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  55. 55.

    Garrison E, Marth G: Haplotype-based variant detection from short-read sequencing. arXiv preprint arXiv:12073907. 2012.

    Google Scholar 

  56. 56.

    Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  57. 57.

    Villasenor Alva JA, Estrada EG. A generalization of Shapiro–Wilk’s test for multivariate normality. Commun Stat Theory Methods. 2009;38(11):1870–83.

    Article  Google Scholar 

  58. 58.

    McDonald JH. Handbook of biological statistics, vol. 2. Baltimore: Sparky House Publishing; 2009.

    Google Scholar 

  59. 59.

    Neerchal NK, Morel J, Huang X, Moluh A. A stepwise algorithm for generalized linear mixed models, vol. 2014. Washington, DC: SAS Global Forum; 2014. p. 1822–2014.

    Google Scholar 

  60. 60.

    Gondro C, van der Werf J, Hayes B. Genome-wide association studies and genomic prediction. Volume 1019. Springer: Humana Press; 2013.

  61. 61.

    Gao XY, Becker LC, Becker DM, Starmer JD, Province MA. Avoiding the high Bonferroni penalty in genome-wide association studies. Genet Epidemiol. 2010;34(1):100–5.

    PubMed  PubMed Central  Google Scholar 

  62. 62.

    Zhou X, Stephens M. Genome-wide efficient mixed-model analysis for association studies. Nat Genet. 2012;44(7):821–U136.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  63. 63.

    Wu B, Pankow JS. Fast and accurate genome-wide association test of multiple quantitative traits. Comput Math Methods Med. 2018;2018:2564531.

    PubMed  PubMed Central  Google Scholar 

  64. 64.

    Li YR, Li J, Zhao SD, Bradfield JP, Mentch FD, Maggadottir SM, Hou C, Abrams DJ, Chang D, Gao F, et al. Meta-analysis of shared genetic architecture across ten pediatric autoimmune diseases. Nat Med. 2015;21(9):1018–27.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  65. 65.

    Liu JZ, Tozzi F, Waterworth DM, Pillai SG, Muglia P, Middleton L, Berrettini W, Knouff CW, Yuan X, Waeber G, et al. Meta-analysis and imputation refines the association of 15q25 with smoking quantity. Nat Genet. 2010;42(5):436–U475.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  66. 66.

    Lee CH, Cook S, Lee JS, Han B. Comparison of two meta-analysis methods: inverse-variance-weighted average and weighted sum of Z-scores. Genomics Inform. 2016;14(4):173–80.

    PubMed  PubMed Central  Article  Google Scholar 

  67. 67.

    McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GRS, Thormann A, Flicek P, Cunningham F. The Ensembl variant effect predictor. Genome Biol. 2016;17(1):122.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  68. 68.

    Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, Simonovic M, Doncheva NT, Morris JH, Bork P, et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47(D1):D607–13.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

Download references


The authors gratefully acknowledge WorldFish breeding team for their field and technical assistance. We also thank all colleagues who provided help during analysis procedures from Animal Breeding and Genomics group.


This work was funded by the CGIAR Research Program on Fish Agri-Food Systems (FISH) led by WorldFish. The original experiment was funded by the Koepon Foundation with additional support to WorldFish by the International Fund for Agricultural Development (IFAD). The authors also thank to the financial support from China Scholarship Council (CSC, File No.201803250069), which has sponsored Xiaofei Yu’s PhD study at Wageningen University and Research. The funding bodies played no role in design of the study and data collection, analysis, and result interpretation in the manuscript.

Author information




JAHB, HAM, HK and MAMG conceived and designed the experiments. SBM and HAM contributed to animal experiment and collected biological samples, and they also supported data analysis and result interpretation. XFY, HJM wrote the manuscript. XFY, JWMB and HJM contributed to data analysis. XFY, HJM, MAMG and HK contributed to result interpretation. SBM, JWMB, HAM, JAHB, MAMG and HK contributed to manuscript revision. All authors reviewed and approved the final manuscript.

Corresponding author

Correspondence to Xiaofei Yu.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the internal WorldFish ethics committee. All the parties agreed for this experiment.

Consent for publication

Not applicable.

Competing interests

MAMG is a member of the editorial board of BMC Genomics journal. The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Supplementary Figure 1.

Variation of dissolved oxygen in the normoxic and hypoxic environments during the 24-h cycle.

Additional file 2: Supplementary Figure 2.

Body weight comparison amongst four hapas in the normoxic and hypoxic environments.

Additional file 3: Supplementary Figure 3.

Two-dimensional plots of all individuals using SNP markers in the hypoxic environment.

Additional file 4: Supplementary Figure 4.

Two-dimensional plots of all individuals using SNP markers in the normoxic environment.

Additional file 5: Supplementary Figure 5.

Quantile-quantile plots through the whole growth stage in the hypoxic environment.

Additional file 6: Supplementary Figure 6.

Quantile-quantile plots through the whole growth stage in the normoxic environment.

Additional file 7: Supplementary Table 1.

Measurements of dissolved oxygen in the normoxic and hypoxic environments.

Additional file 8: Supplementary Table 2.

Information of genome-wide significant and suggestive SNPs in the hypoxic environment.

Additional file 9: Supplementary Table 3.

Information of genome-wide significant and suggestive SNPs in the normoxic environment.

Additional file 10: Supplementary Table 4.

Information of genome-wide significant SNPs across the two environments by meta-analysis.

Additional file 11: Supplementary Table 5.

Functional enrichment for the early stage in the hypoxic environment.

Additional file 12: Supplementary Table 6.

Functional enrichment for the early stage in the normoxic environment.

Additional file 13: Supplementary Table 7.

Functional enrichment for the whole growth stage across the two environments by meta-analysis.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Yu, X., Megens, HJ., Mengistu, S.B. et al. Genome-wide association analysis of adaptation to oxygen stress in Nile tilapia (Oreochromis niloticus). BMC Genomics 22, 426 (2021).

Download citation


  • Nile tilapia
  • Growth
  • Hypoxia
  • Oxygen stress
  • GWAS
  • Meta-analysis