Skip to main content

Genome-wide identification of genes probably relevant to the adaptation of schizothoracins (Teleostei: Cypriniformes) to the uplift of the Qinghai-Tibet Plateau



Molecular adaptation to the severe environments present during the uplift of the Qinghai-Tibet Plateau has attracted the attention of researchers. The divergence of the three specialization groups of schizothoracins (Primitive, Specialized and Highly Specialized) may correspond to the three phases of plateau uplift. Based on the transcripts of representative species of the three specialized groups and an outgroup, genes in schizothoracins that may have played important roles during the adaptation to new environments were investigated.


The contigs of Gymnodiptychus dybowskii and Schizothorax pseudaksaiensis were compared with those of Gymnocypris przewalskii ganzihonensis and the outgroup Sinocyclocheilus angustiporus, and 5,894 ortholog groups with an alignment length longer than 90 nt after deleting gaps were retained. Evolutionary analyses indicated that the average evolutionary rate of the branch leading to the Specialized group was faster than that of the branch leading to the Highly Specialized group. Moreover, the numbers of gene categories in which more than half of the genes evolved faster than the average values of the genome were 117 and 15 along the branches leading to the Specialized and Highly Specialized groups, respectively. A total of 40, 36, and 55 genes were likely subject to positive selection along the branches leading to the Primitive, Specialized and Highly Specialized groups, respectively, and many of these genes are likely relevant to adaptation to the cold temperatures, low oxygen concentrations, and strong ultraviolet radiation that result from elevation.


By selecting representative species of the three groups of schizothoracins and applying next-generation sequencing technology, several candidate genes corresponding to adaptation to the three phases of plateau uplift were identified. Some of the genes identified in this report that were likely subject to positive selection are good candidates for subsequent evolutionary and functional analyses of adaptation to high altitude.


Understanding the molecular mechanisms of adaptation is one of the central goals of evolutionary biology. In particular, molecular events underlying adaptation to extreme environments, such as high altitude, have attracted widespread attention. Along with the uplift of the land, native species must cope with harsh conditions, such as cold temperatures, low oxygen concentrations, and strong ultraviolet radiation [1, 2]. Many new tribes have emerged due to this adaptation, and they are valuable models for understanding the molecular mechanisms of high-altitude adaptation.

The genetic basis of high-altitude adaptations in humans has been the most studied. Genomic studies targeting Tibetan populations have shown that the genes EGLN1 and PPARA, which are significantly associated with the decreased hemoglobin phenotype, were positively selected in the highland population [3]. Further investigation revealed that genes associated with human reproductive disorders and the biological process categories “response to DNA damage stimulus” and “DNA repair” showed a distinct allele frequency pattern of copy number variable region (CNVR) distribution in Tibetans [4]. In Andeans, EGLN1 also exhibited evidence of positive selection [5]. In Ethiopian highlanders, genomic analysis revealed that a number of candidate loci were associated with hemoglobin levels relating to high-altitude adaptation [6].

The deer mouse (Peromyscus maniculatus), which is native to the Andean highlands, is the most studied high-altitude species other than humans. Deer mice have a relatively low hemoglobin content [7], but variations in the globin genes seem to be the basis for the increased oxygen affinity of the hemoglobin and faster transport of oxygen [8]. Moreover, deer mice use fats as a high percentage of their metabolic fuel in order to retain carbohydrates for small bursts of energy [9]. The yak (Bos grunniens) is the most important domesticated animal of Tibetan highlanders, and it thrives only at high altitudes. Comparison of the genome sequences of yaks with those of cattle indicated an expansion of gene families related to sensory perception and energy metabolism in yaks and an enrichment of protein domains involved in sensing the extracellular environment and hypoxic stress [10]. Researchers also found that genes related to hypoxia and nutrition metabolism showed positive selection [10]. The ground tit (Pseudopodoces humilis) is endemic to the northern Himalayas, and genome sequencing revealed positive selection of genes related to cardiac function and the expansion of the corresponding gene families in this species [11].

However, molecular events underlying high-altitude adaptations of organisms living in water, such as fish, are rarely reported. Water can weaken the intensity of ultraviolet radiation, and the oxygen availability in water differs from that in air; therefore, we assumed that the adapted fish genes may be somewhat different from those of land animals. Yang et al. compared orthologous genes among a schizothoracine fish, Gymnodiptychus pachycheilus, and several model fish with available genome sequences and suggested that many of the genes that are associated with energy metabolism and hypoxia were subjected to positive selection during its plateau adaptation [12]. This finding was supported by a functional verification study [13]. However, given that schizothoracins are polyploid and other model fishes are diploid [14], together with the fact that the divergence times among these fishes are large, comparing orthologs among schizothoracins and closely related polyploid cyprinids should be more effective to gain insights into the molecular mechanisms of plateau adaptation in schizothoracins.

The Qinghai-Tibet Plateau is sometimes called “the Third Pole” and “the Roof of the World” because it is the highest and largest plateau in the world. The uplift of the Plateau can be divided into phases, although there are many debates regarding this process [15, 16]. Schizothoracins are endemic to the Qinghai-Tibet Plateau, and they are assumed to originate from the original barbins. By comparing morphological traits, such as scales, pharyngeal teeth and barbels, Cao et al. divided schizothoracins into three specialized groups: Primitive, Specialized and Highly Specialized (Fig. 1) [17]. They concluded that the variation among these traits corresponded to the continuous uplift of the plateau, which has been supported by molecular phylogenetic analysis [18].

Fig. 1

Phylogenetic relationships of the species used in this report. G. dybowskii and S. pseudoaksaiensis were collected from the Ili River, which is located in Sinkiang, northwestern China; G. p. ganzihonensis and the outgroup species S. angustiporus were respectively collected from the Ganzi River in Qinghai province and from the Huangnihe River in Yunnan province, previously [32, 33]. I, II and III represent branches leading to the Primitive, Specialized and Highly Specialized groups, respectively. Numbers below branches denote the number of fast-evolving gene categories with the same GO term or KEGG pathway assigned along branches II and III, respectively

To understand the molecular events underlying the adaptations of schizothoracins to different phases of plateau uplift, representative fish from the three specialized groups of schizothoracins and an outgroup were selected (Table 1); by comparing orthologs among these species, genes that were potentially relevant to the adaptation to the uplift of the Qinghai-Tibet Plateau during the emergence of Primitive, Specialized and Highly Specialized fish (branches I, II and III, respectively, in Fig. 1) were identified. Our analyses may provide a deeper understanding of the process of high-altitude adaptation of schizothoracins.

Table 1 Summaries of the short reads of the transcriptome sequencing projects for the polyploid cyprinids used in this report


Data acquisition

Transcripts of G. dybowskii and S. pseudoaksaiensis were generated in this report. Using gill nets, four live samples of each species were collected from the Ili River, which is located in Sinkiang, northwestern China (Additional file 1: Figure S1). For each individual, total RNA from the brains and livers was extracted using TRIzol reagent (Transgene Company, Illkirch-Graffenstaden, France) according to the manufacturer’s protocol. All of the specimens were euthanized with 300 mg/L of tricaine methanesulfonate (MS 222) before tissue collection. RNA degradation and contamination were preliminarily monitored using 1% agarose gels. A NanoPhotometer spectrophotometer (IMPLEN, CA, USA) was then used to confirm the purity of the RNA, and a Qubit RNA Assay Kit and a Qubit 2.0 Flurometer (Life Technologies, CA, USA) were employed to measure the concentration of RNA. The RNA integrity number (RIN) of each sample was assessed using the RNA Nano 6000 Assay Kit of the Agilent Bioanalyzer 2100 system (Agilent Technologies, CA, USA), and these were all greater than 8.0. For each species, total RNA from the brain and liver was mixed and used for subsequent library construction with a TruSeq Stranded mRNA Library Prep Kit as described elsewhere, except that the preferentially selected cDNA fragments were 500 bp in length [19]. Sequencing was performed using an Illumina HiSeq 2000 platform, and 125-bp paired-end reads were generated. Given that these two fish are polyploid and that longer short reads may result in more accurate assembly, the same libraries were also sequenced using an Illumina MiSeq platform, and 250-bp paired-end reads for each species were obtained. For the purpose of evolutionary analyses, short reads of the transcriptome sequencing of another polyploid schizothoracins, G. p. ganzihonensis, and S. angustiporus as the outgroup, were downloaded from the NCBI SRA (, Table 1 and Additional file 2: Table S1).

Quality control, sequence assembly and annotation

For each species, each run of short reads was subject to quality control using the NGS QC Toolkit v2.3.3 [20] with the default settings. This toolkit can perform a quality check and filter high-quality next-generation sequencing data automatically. Next, all of the processed data were combined and assembled using Trinity r20131110 [21] with the default settings for each species. For the de novo assembly of the transcriptome for each species, TransDecoder v2.0 ( was used to predict the probable open reading frames. For G. dybowskii, contigs were compared to the proteins of Danio rerio (zebrafish) deposited in the KEGG database with BLASTX 2.3.0+ [22], and corresponding KEGG pathways and gene ontology (GO) terms were retrieved using the online tool KOBAS2.0 with the default settings [23]. For simplicity, all of the subsequent functional analyses were based on the annotations for G. dybowskii.

Ortholog assignments and sequence alignments

Based on the predicted protein sequences, orthologous groups among S. angustiporus, S. pseudaksaiensis, G. dybowskii and G. p. ganzihonensis were identified using Inparanoid 4.1 [24] and Multiparanoid [25] with default settings. Inparanoid can detect in-paralogs with a confidence value, the results were fed into Multiparanoid, and the in-paralogs with the best confidence values were selected as orthologous. The nucleotide sequences of each orthologous group were aligned to the predicted amino acid sequence of G. dybowskii using GeneWise 2-4-1 [26], followed by a series of customized Perl scripts to extract the matched coding regions and to generate the proper alignment format for subsequent analyses. Then, SWAMP 31-03-14 was used to mask the unreasonably high rates of nonsynonymous substitutions, which were likely caused by sequencing, assembly, or alignment errors [27]. Finally, alignments longer than 90 nt after deleting the gaps were retained.

Gene categories with accelerated evolutionary rates

Analyses of gene categories with accelerated evolutionary rates along each lineage were performed following the protocol of Yang et al. [28]. For each ortholog group, the branch-specific values of Ka, Ks and Ka/Ks were estimated using codeml, which is included in the PAML4 software package [29]. The genome-wide average values of Ka, Ks and Ka/Ks along each branch were estimated using 10,000 concatenated alignments that were constructed from 150 randomly chosen ortholog groups. Next, the GO categories with more than 20 assigned orthologous groups were selected, and pairwise comparisons of the Ka/Ks values with the average values for the branches leading to Specialized and Highly Specialized groups (branches II and III shown in Fig. 1) were implemented using a binomial test. For corrections of multiple comparisons, the method of Benjamini and Hochberg was performed to reduce false positives [30].

Detecting candidate genes subject to positive selection

Positive selection on a few codons along branches I, II and III was detected using the optimized branch-site model [31]. By setting each of the three branches as the foreground branch, the likelihood ratio values for the model, which allows sites to be under positive selection on the foreground branch, and the null model, in which sites may evolve neutrally and under purifying selection, were calculated. Based on the likelihood ratio values and a Chi-square distribution, the significance of the differences between the two nested models for all ortholog groups was obtained and adjusted using the method of Benjamini and Hochberg for multiple tests [30]. To confirm the convergence of the Markov process, the aforementioned tests were performed twice with different starting values.

Results and discussion

Sequencing, assembly and alignment of orthologs

For G. dybowskii, the Illumina MiSeq and HiSeq2000 sequencing platforms respectively generated 11,550,142 and 75,226,074 paired-end reads (Table 1 and Additional file 2: Table S1). For S. pseudaksaiensis, the corresponding procedures generated 11,769,716 and 76,383,862 reads. After quality control, more than 74 M paired-end reads totaling more than 10 G nt for both of the species were retained and subjected to subsequent de novo assembly. Based on the combined data generated from different sequencing platforms, a total of 139,382 and 114,997 contigs with N50 lengths of 2407 nt and 2283 nt were obtained for G. dybowskii and S. pseudaksaiensis, respectively. In previous projects sequencing the transcriptomes of polyploid cyprinids, the amounts of data generated for the two species were larger than that for G. pachycheilus [12] but were smaller than those for S. angustiporus and G. przewalskii [32, 33]. However, we hypothesized that the longer length of short reads, especially of those generated by the MiSeq platforms, may lead to some compensation.

For G. p. ganzihonensis, short reads from the gills and kidneys were combined, producing a total of 210,085 contigs with an N50 length of 1447 nt. As expected, when these short reads with different origins were assembled separately, a much larger number of contigs with a shorter N50 length was generated [32]. For S. angustiporus, short reads from the whole eye and brain were combined and assembled, which generated 100,762 contigs with an N50 length of 621 nt. However, based on the same data, the number of contigs generated by Meng et al. was 156,118, with an N50 length of 534 nt [33]. The different numbers of contigs and different N50 lengths may be due to our use of different quality control tools and different versions of de novo assembly software.

A total of 5894 orthologous groups with an alignment length longer than 90 nt after removing gaps was obtained and subject to subsequent evolutionary analyses. Most of the alignment lengths were shorter than 400 nt, and the average and median lengths of the alignments were 219 and 192 nt, respectively (Additional file 3: Figure S2). The relatively short alignment lengths may have resulted from the incomplete sequencing of transcripts. We expect that a much greater number of alignments could be obtained if more tissues were sampled and longer reads were generated, given that all fish species that were subjected to this analysis were polyploid.

Fast-evolving gene categories

The average Ka/Ks value of the branch leading to the Highly Specialized group was significantly lower than that of the branch leading to the Specialized group (p-value < 2.2e-16, Binomial test). By contrast, the average values of Ka and Ks were significantly larger in branches leading to the Specialized group (p-value < 2.2e-16, Binomial test). The same pattern was found when comparing the Ka, Ks and Ka/Ks values between terminal branches leading to G. dybowskii and G. p. ganzihonensis (p-value < 2.2e-16, Binomial test). The somewhat unexpected larger Ks and smaller Ka/Ks values for G. p. ganzihonensis may be due to the following three reasons: the intensive ultraviolet radiation may accelerate the mutation rates of the entire genome; divergence of morphological traits may lag behind sequence divergences; and shorter read lengths generated by Illumina HiSeq sequencing platforms may result in more paralog assemblies.

Several gene categories with the same GO term or KEGG pathway assigned were identified as fast-evolving gene categories along branches II and III, as shown in Fig. 1. There were 117 and 15 fast-evolving gene categories for branches II and III, respectively (fdr < 0.05, Binomial test, Additional file 4: Table S2). Unexpectedly, the evolutionary rates of the GO categories, which were probably relevant to adaptation to high altitudes, such as “response to hypoxia”, “ATP metabolic process” and “response to UV” [28], were not significantly accelerated in our analyses. This phenomenon likely results from at least three causes: in the waters of the Tibetan Plateau, the ultraviolet intensity may be lower than that of air, and the water may contain higher levels of oxygen; the evolutionary rates of these genes probably accelerated during the adaptation to the first phase of plateau uplift; and many of these genes may also be expressed in other organs, such as the skin, or may have been lost when performing ortholog assignments. Moreover, we considered that more fast-evolving gene categories were represented in branch II and may be the result of relaxed selection, given that branch III is believed to undergo stronger selective pressure. Further investigation of recruiting transcripts from various tissues of more representative species of the three main groups of schizothoracins and outgroups should be used to test these hypotheses.

Candidate genes probably subject to positive selection

Likelihood ratio tests indicated that 40, 36, and 55 genes were probably subject to positive selection along branches I, II, and III, respectively (fdr < 0.05, Additional file 5: Table S3). Interestingly, several genes were subject to continuous positive selection during plateau uplift (Fig. 2). Specifically, two of these genes were subject to positive selection along the branches leading to all of the Primitive, Specialized, and Highly Specialized groups (hereafter termed PSH for convenience), and the numbers of genes subject to positive selection along the branches leading to the Primitive and Specialized (PS), Specialized and Highly Specialized (SH), and Primitive and Highly Specialized (PH) groups were 9, 9, and 4, respectively. The numbers of lineage-specific genes subject to positive selection along the branches leading to the Primitive (P), Specialized (S) and Highly Specialized (H) groups were 25, 16, and 40, respectively.

Fig. 2

Venn diagram of the overlap between the candidate genes subject to positive selection along the branches leading to the Primitive, Specialized and Highly Specialized groups of Schizothoracins (branchesI, II and III in Fig. 1)

One of the two PSH genes was highly similar to “ecotropic viral integration site 5b” (evi5b) in zebrafish, which harbors a coiled-coil region that resembles that of structural maintenance of chromosomes (SMC) proteins. These proteins are core components of the condensing and cohesin complexes responsible for regulating chromosomal condensation, pairing and segregation in eukaryotes [34]. Given that the fish that were subjected to analyses in this report were all polyploid, we assumed that evi5b probably played an important role in their diploidization. Another gene, annotated as “supervillin b,” seems to be involved in cytokinesis during cell division [35]. However, why it was subject to positive selection throughout the course of plateau-adaptation in schizothoracins is presently unknown.

Two of the PS genes, “protein jagunal homolog 1-A” (jagn1) and “polypyrimidine tract-binding protein 2” (ptbp2), were subject to positive selection, which is not unexpected, given that the former is an immune gene that can regulate neutrophil function in microbial pathogenesis [36] and the latter is a reproductive protein that may function specifically in the male germline [37]; these two kinds of genes usually undergo more adaptive evolution than other genes [38, 39]. Notably, two PS genes, “mycophenolic acid acyl-glucuronide” and “solute carrier organic anion transporter family member 3a1” (slco3a1), which are related to the transport of organic anions [40, 41], were also subject to positive selection, but the reasons for that selection remain unknown.

The light intensity increases with the elevation of the plateau, and one of the SH genes, “guanine nucleotide-binding protein beta-1” (gnb1), which is involved in the pathway “Phototransduction”, was subject to positive selection, likely resulting from adaptation to the greater light intensity. Moreover, intense ultraviolet light can cause DNA damage and even worse, skin cancer [42]. Thus, the “dab2 interacting protein” (dab2ip) gene, which suppresses the development of many cancer types [43], likely appeared among the SH genes for this reason. Meanwhile, some genes relevant to energy metabolism, for example, the “2-oxoglutarate dehydrogenase” (ogdh) [44] and atp6v0a1 genes, participate in the “oxidative phosphorylation” pathway and probably contributed to adaptation to the reduced water temperature.

P genes may have been important for adaptation to the first phase of plateau uplift. These genes include several relevant to adaptation to hypoxia, low temperature, and high-intensity ultraviolet light. First, hypoxia can reduce the activity of “arylsulfatase b” (arsb) [45], but it is a tumor suppressor and is presumably associated with carcinogenesis at low levels [4648]; therefore, arsb has evolved to adapt to the low-oxygen environment. The formation and maintenance of the cristae structure of the inner mitochondrial membrane is largely based on the mitochondrial contact site and cristae-organizing system (MICOS), and “MICOS complex subunit MIC19” (mic19) may have adapted to maintain the normal mitochondrial structure and function during hypoxia [49]. Mic19, together with “cytochrome c-type heme lyase” (cchl), which is involved in electron transfer processes [50], and “creg1”, which promotes cardiomyogenesis [51], may be indirectly relevant to hypoxia. Second, two other P genes, bag1 and a member of the immunoglobulin superfamily that is homologous to several cell adhesion molecules, “cell surface glycoprotein MUC18” (mcam), may have evolved to adapt to high-intensity ultraviolet light, which can cause DNA damage and induce cancer. The reason for this adaptation may be that the products of bag1 can form complexes with the anti-apoptotic protein bcl-2 and can thereby contribute to rendering cells more resistant to apoptosis, and altered bag1 expression levels can be detected in some cancerous cells [52], whereas mcam cannot be detected in normal melanocytes but can be detected in primary melanomas and is highly expressed in metastatic melanoma cells [53]. Moreover, “bromodomain-containing 2” (brd2) plays a critical role in adipogenesis [54] and may be adapted to low temperatures.

The third phase of uplift may have been the fastest, and the height has increased by 3000 m since approximately 2.6 Myr ago [55]. During this process, a number of genes were supposed to be under positive selection. The “endoplasmic reticulum transmembrane prolyl 4-hydroxylase” (p4h-tm) gene can hydroxylate the subunit of hypoxia-inducible factor (hif) [56]. In normoxia, hydroxylation of 2 critical proline residues can rapidly degrade hif because the hif-subunit isoforms hif-1 and hif-2 are synthesized constitutively. However, in hypoxia, hydroxylation is inhibited, allowing hif to escape degradation and regulate hypoxia-related genes [57]. Moreover, p4h-tm can regulate erythropoietin production, hepcidin expression, and erythropoiesis [57]. Therefore, we suggest that the modification of p4h-tm may contribute to the adaptation to hypoxic waters in the Qinghai-Tibet Plateau. The protein “nibrin” plays an important role in the initial recognition of DNA damage during the recruitment of proteins responsible for DNA double-strand breaks repair [58]. “Tumor protein d54” (tpd52l2) is a member of the D52-like family, and these proteins have been predicted to interact with each other to regulate cell proliferation [59]. “Tumor protein D52” (tpd52) is frequently overexpressed in several carcinomas and may be the co-expressed tpd52l2 [60]. The two aforementioned cancer-related genes were positively selected, perhaps as the result of adaptation to the increasingly high-intensity ultraviolet light associated with the uplift of the plateau. Notably, the H genes included several genes related to the physiological function of the nervous system, such as “neurexin-1β”, which binds with the postsynaptic membrane protein neuroligin-1 and plays a central role in the formation of synapses in the central nervous system [61]; “kinesin-like protein kif1a” (kif1a), a unique monomeric motor for the anterograde axonal transport of synaptic vesicle precursors [62]; “wnt-7a”, which preferentially stimulates excitatory synapse formation and function [63]; “rest corepressor 2” (rcor1), which is co-expressed with zmynd8 and interacts to form a complex that might be involved in the regulation of neural differentiation in the nervous system [64]; and “alsin”, which is particularly abundant in motor neurons and is involved in the development of axons and dendrites [65]. Potassium channels are found in most cell types and control a wide variety of cell functions, including modulating neuronal signaling in the brain and the peripheral nervous system, as well as regulating cell volume and the flow of salts across epithelia [66]. “Voltage-gated potassium channel subunit beta-1” (kcnab1) may markedly influence the gating mode of potassium channels [67], along with “ammonium transporter rh type b” (rhbg), which may be a major ammonium transporter in vertebrates [68], were also included as H genes. The modification of these genes relevant to ion transport and physiological function of the nervous system may have resulted from adaptation to the changed environment.


Using the assembled transcripts, which were based on short reads generated in this report and others downloaded from a public database, this study identified several genes that likely corresponded to the adaptation of three specialized groups of schizothoracins to the three phases of plateau uplift. The short reads generated for G. dybowskii and S. pseudaksaiensis, especially the longer short reads generated using the Illumina MiSeq sequencing platform, should be valuable resources for subsequent analyses of polyploid cyprinids. However, to more thoroughly understand the molecular events of adaptation, additional representative species should be evaluated, and further comprehensive sequencing of the transcripts should be performed.



Gene ontology


Highly Specialized group of schizothoracins


Kyoto encyclopedia of genes and genomes


Primitive group of schizothoracins


Specialized group of schizothoracins


Sequence read archive of NCBI


  1. 1.

    Thompson LG, Yao T, Mosley-Thompson E, Davis ME, Henderson KA, Lin P-N. A High-Resolution Millennial Record of the South Asian Monsoon from Himalayan Ice Cores. Science. 2000;289(5486):1916–9.

    CAS  Article  PubMed  Google Scholar 

  2. 2.

    Bickler PE, Buck LT. Hypoxia tolerance in reptiles, amphibians, and fishes: Life with variable oxygen availability. Annu Rev Physiol. 2007;69:145–70.

    CAS  Article  PubMed  Google Scholar 

  3. 3.

    Simonson TS, Yang Y, Huff CD, Yun H, Qin G, Witherspoon DJ, Bai Z, Lorenzo FR, Xing J, Jorde LB, et al. Genetic Evidence for High-Altitude Adaptation in Tibet. Science. 2010;329(5987):72–5.

    CAS  Article  PubMed  Google Scholar 

  4. 4.

    Zhang YB, Li X, Zhang F, Wang DM, Yu J. A preliminary study of copy number variation in Tibetans. PLoS ONE. 2012;7(7):e41768.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  5. 5.

    Bigham A, Bauchet M, Pinto D, Mao X, Akey JM, Mei R, Scherer SW, Julian CG, Wilson MJ, López Herráez D, et al. Identifying signatures of natural selection in Tibetan and Andean populations using dense Genome Scan Data. PLoS Genet. 2010;6(9):e1001116.

    Article  PubMed  PubMed Central  Google Scholar 

  6. 6.

    Scheinfeldt LB, Soi S, Thompson S, Ranciaro A, Woldemeskel D, Beggs W, Lambert C, Jarvis JP, Abate D, Belay G, et al. Genetic adaptation to high altitude in the Ethiopian highlands. Genome Biol. 2012;13(1):R1.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  7. 7.

    Snyder LR. Low P50 in deer mice native to high altitude. J Appl Physiol (1985). 1985;58(1):193–9.

    CAS  Google Scholar 

  8. 8.

    Storz JF, Runck AM, Moriyama H, Weber RE, Fago A. Genetic differences in hemoglobin function between highland and lowland deer mice. J Exp Biol. 2010;213(15):2565–74.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  9. 9.

    Cheviron ZA, Bachman GC, Connaty AD, McClelland GB, Storz JF. Regulatory changes contribute to the adaptive enhancement of thermogenic capacity in high-altitude deer mice. Proc Natl Acad Sci U S A. 2012;109(22):8635–40.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  10. 10.

    Qiu Q, Zhang G, Ma T, Qian W, Wang J, Ye Z, Cao C, Hu Q, Kim J, Larkin DM, et al. The yak genome and adaptation to life at high altitude. Nat Genet. 2012;44(8):946–9.

    CAS  Article  PubMed  Google Scholar 

  11. 11.

    Cai Q, Qian X, Lang Y, Luo Y, Xu J, Pan S, Hui Y, Gou C, Cai Y, Hao M, et al. Genome sequence of ground tit Pseudopodoces humilis and its adaptation to high altitude. Genome Biol. 2013;14(3):R29.

    Article  PubMed  PubMed Central  Google Scholar 

  12. 12.

    Yang L, Wang Y, Zhang Z, He S. Comprehensive transcriptome analysis reveals accelerated genic evolution in a Tibet fish, Gymnodiptychus pachycheilus. Genome Biol Evol. 2015;7(1):251–61.

    Article  Google Scholar 

  13. 13.

    Guan L, Chi W, Xiao W, Chen L, He S. Analysis of hypoxia-inducible factor alpha polyploidization reveals adaptation to Tibetan Plateau in the evolution of schizothoracine fish. BMC Evol Biol. 2014;14:192.

    Article  PubMed  PubMed Central  Google Scholar 

  14. 14.

    Leggatt RA, Iwama GK. Occurrence of polyploidy in the fishes. Rev Fish Biol Fish. 2003;13(3):237–46.

    Article  Google Scholar 

  15. 15.

    Wang E, Kirby E, Furlong KP, van Soest M, Xu G, Shi X, Kamp PJJ, Hodges KV. Two-phase growth of high topography in eastern Tibet during the Cenozoic. Nat Geosci. 2012;5(9):640–5.

    Article  Google Scholar 

  16. 16.

    Lai S. Three-phase uplift of the Qinghai-Tibet plateau during the Cenozoic period: Igneous petrology constraints. Chin J Geochem. 2000;19(2):152–60.

    CAS  Article  Google Scholar 

  17. 17.

    Cao WX, Chen YY, Wu YF, Zhu SQ. Origin and evolution of schizothoracine fishes in relation to the upheaval of the Xizang Plateau. In: Tibetan Expedition Team of the Chinese Academy of Science, editor. Studies on the period, amplitude and type of the uplift of the Qinghai-Xizang Plateau. Beijing: Science Press; 1981. p. 118–30.

    Google Scholar 

  18. 18.

    Chen Z, Chen Y. Genetic relationships of the specialized schizothoracine fishes inferred from random amplified polymorphic dna analysis. Zool Res. 2000;21(4):7.

    Google Scholar 

  19. 19.

    Zou M, Guo B, Ma X. Characterizing the transcriptome of yellow-cheek carp (Elopichthys bambusa) enables evolutionary analyses within endemic East Asian Cyprinidae. Gene. 2014;547(2):267–72.

    CAS  Article  PubMed  Google Scholar 

  20. 20.

    Patel RK, Jain M. NGS QC Toolkit: A Toolkit for Quality Control of Next Generation Sequencing Data. PLoS ONE. 2012;7(2):e30619.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  21. 21.

    Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng QD, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011;29(7):644–52.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  22. 22.

    Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215(3):403–10.

    CAS  Article  PubMed  Google Scholar 

  23. 23.

    Xie C, Mao XZ, Huang JJ, Ding Y, Wu JM, Dong S, Kong L, Gao G, Li CY, Wei LP. KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Res. 2011;39:W316–22.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  24. 24.

    Remm M, Storm CE, Sonnhammer EL. Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. J Mol Biol. 2001;314(5):1041–52.

    CAS  Article  PubMed  Google Scholar 

  25. 25.

    Alexeyenko A, Tamas I, Liu G, Sonnhammer ELL. Automatic clustering of orthologs and inparalogs shared by multiple proteomes. Bioinformatics. 2006;22(14):E9–E15.

    CAS  Article  PubMed  Google Scholar 

  26. 26.

    Birney E, Clamp M, Durbin R. GeneWise and genomewise. Genome Res. 2004;14(5):988–95.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  27. 27.

    Harrison PW, Jordan GE, Montgomery SH. SWAMP: sliding window alignment masker for PAML. Evol Bioinform. 2014;10.

  28. 28.

    Yang YZ, Wang LZ, Han J, Tang XL, Ma M, Wang K, Zhang X, Ren Q, Chen Q, Qiu Q. Comparative transcriptomic analysis revealed adaptation mechanism of Phrynocephalus erythrurus, the highest altitude Lizard living in the Qinghai-Tibet Plateau. BMC Evol Biol. 2015;15.

  29. 29.

    Yang ZH. PAML 4: Phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24(8):1586–91.

    CAS  Article  PubMed  Google Scholar 

  30. 30.

    Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc Series B. 1995;57(1):12.

    Google Scholar 

  31. 31.

    Zhang JZ, Nielsen R, Yang ZH. Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. Mol Biol Evol. 2005;22(12):2472–9.

    CAS  Article  PubMed  Google Scholar 

  32. 32.

    Zhang RY, Ludwig A, Zhang CF, Tong C, Li GG, Tang YT, Peng ZG, Zhao K. Local adaptation of Gymnocypris przewalskii (Cyprinidae) on the Tibetan Plateau. Sci Rep-Uk. 2015;5.

  33. 33.

    Meng F, Braasch I, Phillips JB, Lin X, Titus T, Zhang C, Postlethwait JH. Evolution of the Eye Transcriptome under Constant Darkness in Sinocyclocheilus Cavefish. Mol Biol Evol. 2013;30(7):1527–43.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  34. 34.

    Wood AJ, Severson AF, Meyer BJ. Condensin and cohesin complexity: the expanding repertoire of functions. Nat Rev Genet. 2010;11(6):391–404.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  35. 35.

    Smith TC, Fridy PC, Li YY, Basil S, Arjun S, Friesen RM, Leszyk J, Chait BT, Rout MP, Luna EJ. Supervillin binding to myosin II and synergism with anillin are required for cytokinesis. Mol Biol Cell. 2013;24(23):3603–19.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  36. 36.

    Wirnsberger G, Zwolanek F, Stadlmann J, Tortola L, Liu SW, Perlot T, Jarvinen P, Durnberger G, Kozieradzki I, Sarao R, et al. Jagunal homolog 1 is a critical regulator of neutrophil function in fungal host defense. Nat Genet. 2014;46(9):1028–33.

    CAS  Article  PubMed  Google Scholar 

  37. 37.

    Robida MD, Singh R. Drosophila polypyrimidine-tract binding protein (PTB) functions specifically in the male germline. EMBO J. 2003;22(12):2924–33.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  38. 38.

    McTaggart SJ, Obbard DJ, Conlon C, Little TJ. Immune genes undergo more adaptive evolution than non-immune system genes in Daphnia pulex. BMC Evol Biol. 2012;12.

  39. 39.

    Clark NL, Aagaard JE, Swanson WJ. Evolution of reproductive proteins from animals and plants. Reproduction. 2006;131(1):11–22.

    CAS  Article  PubMed  Google Scholar 

  40. 40.

    Hagenbuch B, Meier PJ. Organic anion transporting polypeptides of the OATP/SLC21 family: phylogenetic classification as OATP/SLCO superfamily, new nomenclature and molecular/functional properties. Pflug Arch Eur J Phy. 2004;447(5):653–65.

    CAS  Article  Google Scholar 

  41. 41.

    Wolff NA, Burckhardt BC, Burckhardt G, Oellerich M, Armstrong VW. Mycophenolic acid (MPA) and its glucuronide metabolites interact with transport systems responsible for excretion of organic anions in the basolateral membrane of the human kidney. Nephrol Dial Transpl. 2007;22(9):2497–503.

    CAS  Article  Google Scholar 

  42. 42.

    Beani JC. Ultraviolet A-induced DNA damage: role in skin cancer. Bull Acad Natl Med. 2014;198(2):273–95.

    PubMed  Google Scholar 

  43. 43.

    Chen H, Pong RC, Wang Z, Hsieh JT. Differential regulation of the human gene DAB2IP in normal and malignant prostatic epithelia: Cloning and characterization. Genomics. 2002;79(4):573–81.

    CAS  Article  PubMed  Google Scholar 

  44. 44.

    Araújo WL, Trofimova L, Mkrtchyan G, Steinhauser D, Krall L, Graf A, Fernie AR, Bunik VI. On the role of the mitochondrial 2-oxoglutarate dehydrogenase complex in amino acid metabolism. Amino Acids. 2012;44(2):683–700.

    Article  PubMed  Google Scholar 

  45. 45.

    Bhattacharyya S, Tobacman JK. Hypoxia reduces arylsulfatase B activity and silencing arylsulfatase B replicates and mediates the effects of hypoxia. PLoS ONE. 2012;7(3):e33250.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  46. 46.

    Bhattacharyya S, Feferman L, Tobacman JK. Increased Expression of Colonic Wnt9A through Sp1-mediated Transcriptional Effects involving Arylsulfatase B, Chondroitin 4-Sulfate, and Galectin-3. J Biol Chem. 2014;289(25):17564–75.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  47. 47.

    Bhattacharyya S, Feferman L, Tobacman JK. Arylsulfatase B Regulates Versican Expression by Galectin-3 and AP-1 Mediated Transcriptional Effects. Oncogene. 2014;33(47):5467–76.

    CAS  Article  PubMed  Google Scholar 

  48. 48.

    Bhattacharyya S, Feferman L, Tobacman JK. Inhibition of phosphatase activity follows decline in sulfatase activity and leads to transcriptional effects through sustained phosphorylation of transcription factor MITF. PLoS ONE. 2016;11(4):e0153463.

    Article  PubMed  PubMed Central  Google Scholar 

  49. 49.

    Kozjak-Pavlovic V. The MICOS complex of human mitochondria. Cell Tissue Res. 2017;367(1):83–93.

    CAS  Article  PubMed  Google Scholar 

  50. 50.

    Travaglini-Allocatelli C. Protein Machineries Involved in the Attachment of Heme to Cytochrome c: Protein Structures and Molecular Mechanisms. Scientifica (Cairo). 2013;2013:505714.

    Google Scholar 

  51. 51.

    Liu J, Qi Y, Saadat S, Han Y, Hsu S-C, Rahimi S, Lee L, Graham A, Li S. Abstract 12203: CREG1 Interacts With Sec8 to Promote Cardiomyogenic Differentiation From Embryonic Stem Cells. Circulation. 2014;130 Suppl 2, A12203.

    Google Scholar 

  52. 52.

    Takayama S, Reed JC. Molecular chaperone targeting and regulation by BAG family proteins. Nat Cell Biol. 2001;3(10):E237–41.

    CAS  Article  PubMed  Google Scholar 

  53. 53.

    Johnson JP, Bar-Eli M, Jansen B, Markhof E. Melanoma progression–associated glycoprotein MUC18/MCAM mediates homotypic cell adhesion through interaction with a heterophilic ligand. Int J Cancer. 1997;73(5):769–74.

    CAS  Article  PubMed  Google Scholar 

  54. 54.

    Denis GV, Nikolajczyk BS, Schnitzler GR. An emerging role for bromodomain-containing proteins in chromatin regulation and transcriptional control of adipogenesis. Febs Letters. 2010;584(15):3260–8.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  55. 55.

    Zhisheng A, Kutzbach JE, Prell WL, Porter SC. Evolution of Asian monsoons and phased uplift of the Himalaya-Tibetan plateau since Late Miocene times. Nature. 2001;411(6833):62–6.

    CAS  Article  PubMed  Google Scholar 

  56. 56.

    Jaakkola P, Mole DR, Tian Y-M, Wilson MI, Gielbert J, Gaskell SJ, Kriegsheim A, Hebestreit HF, Mukherji M, Schofield CJ, et al. Targeting of HIF-α to the von Hippel-Lindau Ubiquitylation Complex by O < sub > 2</sub > -Regulated Prolyl Hydroxylation. Science. 2001;292(5516):468–72.

    CAS  Article  PubMed  Google Scholar 

  57. 57.

    Laitala A, Aro E, Walkinshaw G, Mäki JM, Rossi M, Heikkilä M, Savolainen E-R, Arend M, Kivirikko KI, Koivunen P, et al. Transmembrane prolyl 4-hydroxylase is a fourth prolyl 4-hydroxylase regulating EPO production and erythropoiesis. Blood. 2012;120(16):3336–44.

    CAS  Article  PubMed  Google Scholar 

  58. 58.

    Kobayashi J. Molecular mechanism of the recruitment of NBS1/hMRE11/hRAD50 complex to DNA double-strand breaks: NBS1 binds to gamma-H2AX through FHA/BRCT domain. J Radiat Res. 2004;45(4):473–8.

    CAS  Article  PubMed  Google Scholar 

  59. 59.

    Byrne JA, Nourse CR, Basset P, Gunning P. Identification of homo- and heteromeric interactions between members of the breast carcinoma-associated D52 protein family using the yeast two-hybrid system. Oncogene. 1998;16(7):873–81.

    CAS  Article  PubMed  Google Scholar 

  60. 60.

    Barbaric D, Byth K, Dalla-Pozza L, Byrne JA. Expression of tumor protein D52-like genes in childhood leukemia at diagnosis: Clinical and sample considerations. Leuk Res. 2006;30(11):1355–63.

    CAS  Article  PubMed  Google Scholar 

  61. 61.

    Nam CI, Chen L. Postsynaptic assembly induced by neurexin-neuroligin interaction and neurotransmitter. Proc Natl Acad Sci. 2005;102(17):6137–42.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  62. 62.

    Okada Y, Yamazaki H, Sekine-Aizawa Y, Hirokawa N. The neuron-specific kinesin superfamily protein KIF1A is a uniqye monomeric motor for anterograde axonal transport of synaptic vesicle precursors. Cell. 1995;81(5):769–80.

    CAS  Article  PubMed  Google Scholar 

  63. 63.

    Ciani L, Boyle KA, Dickins E, Sahores M, Anane D, Lopes DM, Gibb AJ, Salinas PC. Wnt7a signaling promotes dendritic spine growth and synaptic strength through Ca2+/Calmodulin-dependent protein kinase II. Proc Natl Acad Sci. 2011;108(26):10732–7.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  64. 64.

    Zeng W, Kong Q, Li C, Mao B. Xenopus RCOR2 (REST corepressor 2) interacts with ZMYND8, which is involved in neural differentiation. Biochem Biophys Res Commun. 2010;394(4):1024–9.

    CAS  Article  PubMed  Google Scholar 

  65. 65.

    Hadano S, Kunita R, Otomo A, Suzuki-Utsunomiya K, Ikeda JE. Molecular and cellular function of ALS2/alsin: Implication of membrane dynamics in neuronal development and degeneration. Neurochem Int. 2007;51(2-4):74–84.

    CAS  Article  PubMed  Google Scholar 

  66. 66.

    Hille B. “Chapter 5: Potassium Channels and Chloride Channels”. Ion channels of excitable membranes. Sinauer Associates. 2001;131–168. ISBN 0-87893-321-2.

  67. 67.

    Bahring R, Milligan CJ, Vardanyan V, Engeland B, Young BA, Dannenberg J, Waldschutz R, Edwards JP, Wray D, Pongs O. Coupling of voltage-dependent potassium channel inactivation and oxidoreductase active site of Kvbeta subunits. J Biol Chem. 2001;276(25):22923–9.

    CAS  Article  PubMed  Google Scholar 

  68. 68.

    Liu Z, Peng J, Mo R, Hui C-C, Huang C-H. Rh Type B Glycoprotein Is a New Member of the Rh Superfamily and a Putative Ammonia Transporter in Mammals. J Biol Chem. 2001;276(2):1424–33.

    CAS  Article  PubMed  Google Scholar 

Download references


We thanked the three anonymous reviewers for their constructive criticisms, which greatly improved the MS.


This work was funded by the National Natural Science Foundation of China (no. 31401977) and a National Science and Technology support program (no. 2012BAD25B06). The funding bodies did not have a role in the design of the study, data collection, analysis, interpretation of data, writing the manuscript, nor the decision to publish.

Availability of data and materials

Sequencing reads of transcriptomes have been deposited in the NCBI SRA database under the accession numbers shown in Table 1.

Authors’ contributions

MZ conceived and planned the project. XFM performed the practical work including molecular biology. WC performed the bioinformatics analyses. JGN collected the samples. MZ and WC drafted the manuscript and all authors revised it. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

This study was approved by the Institutional Animal Care and Use Committees (IACUC) of Huazhong Agricultural University. No specific permits were required for the field studies described here. The study area is not privately owned or protected in any way, and the field studies did not involve endangered or protected species.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information



Corresponding author

Correspondence to Ming Zou.

Additional files

Additional file 1: Figure S1.

Sampling sites of the studied species. The triangle represents the sampling site of G. dybowskii and S. pseudaksaiensis collected in this study. The circle and diamond represent the sampling sites of G. p. ganzihonensis and the outgroup species S. angustiporus collected previously [32, 33]. The map was adopted from Sogou Map ( (PPT 137 kb)

Additional file 2: Table S1.

Specifics of the short reads of the transcriptome sequencing projects for the polyploid cyprinids used in this report. (XLS 16 kb)

Additional file 3: Figure S2.

Distribution of alignment lengths after deleting gaps. The solid line indicates the average length, and the dotted line indicates the median length. (PPT 34 kb)

Additional file 4: Table S2.

Gene categories showing the accelerated evolutionary rates along the branches leading to the Specialized and Highly Specialized groups of schizothoracins (branches II and III in Fig. 1). (XLS 40 kb)

Additional file 5: Table S3.

Profiles of candidate genes subject to positive selection along branches leading to the Primitive, Specialized and Highly Specialized groups of schizothoracins (branchesI, II and III in Fig. 1). (XLS 39 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Chi, W., Ma, X., Niu, J. et al. Genome-wide identification of genes probably relevant to the adaptation of schizothoracins (Teleostei: Cypriniformes) to the uplift of the Qinghai-Tibet Plateau. BMC Genomics 18, 310 (2017).

Download citation


  • Schizothoracins
  • Polyploid
  • High-altitude adaptation
  • Positive selection