Population history and genomic signatures for high-altitude adaptation in Tibetan pigs
BMC Genomics volume 15, Article number: 834 (2014)
The Tibetan pig is one of domestic animals indigenous to the Qinghai-Tibet Plateau. Several geographically isolated pig populations are distributed throughout the Plateau. It remained an open question if these populations have experienced different demographic histories and have evolved independent adaptive loci for the harsh environment of the Plateau. To address these questions, we herein investigated ~ 40,000 genetic variants across the pig genome in a broad panel of 678 individuals from 5 Tibetan geographic populations and 34 lowland breeds.
Using a series of population genetic analyses, we show that Tibetan pig populations have marked genetic differentiations. Tibetan pigs appear to be 3 independent populations corresponding to the Tibetan, Gansu and Sichuan & Yunnan locations. Each population is more genetically similar to its geographic neighbors than to any of the other Tibetan populations. By applying a locus-specific branch length test, we identified both population-specific and -shared candidate genes under selection in Tibetan pigs. These genes, such as PLA2G12A, RGCC, C9ORF3, GRIN2B, GRID1 and EPAS1, are involved in high-altitude physiology including angiogenesis, pulmonary hypertension, oxygen intake, defense response and erythropoiesis. A majority of these genes have not been implicated in previous studies of highlanders and high-altitude animals.
Tibetan pig populations have experienced substantial genetic differentiation. Historically, Tibetan pigs likely had admixture with neighboring lowland breeds. During the long history of colonization in the Plateau, Tibetan pigs have developed a complex biological adaptation mechanism that could be different from that of Tibetans and other animals. Different Tibetan pig populations appear to have both distinct and convergent adaptive loci for the harsh environment of the Plateau.
The Qinghai-Tibet Plateau, known as the “roof of the world”, is the highest (mostly at 3,500 - 4,500 m) and largest (~2,500,000 km2) highland on the earth. The environmental condition of the plateau is characterized by the reduced oxygen availability, low ambient temperature, high ultraviolent radiation and amid climate. The unique ecological condition imposes severe physiological challenges on inhabits in the high-altitude region. Native inhabitants like Tibetans have evolved the adaptive mechanism to address the harsh environment during the long history of colonization. Today, a number of researchers have identified particular physiological traits favoring local adaption in Tibetans. The adaptive traits include increased nitric oxide level, elevated resting ventilation, reduced pulmonary vasoconstrictor response and low hemoglobin concentration compared with acclimated lowlanders [1–3].
Although the physiological traits in response to high-altitude environments are relatively well characterized, an understanding of the molecular basis underlying these traits has significantly lagged behind. Recently, population genomics offers an effective approach to characterize the adaptive mechanism of Tibetans and other colonists in the Qinghai-Tibet Plateau. One common approach is involved in genotyping a large number of loci across the genome on divergently differentiated populations, and searching “outliers” (candidate targets of selection) in the extreme tail of the empirical distribution of statistics like FST[4, 5] and locus-specific branch length (LSBL) values [6–8]. To date, a series of genomic scan researches have highlighted more than a dozen of candidate genes subject to natural selection in Tibetans [4, 5, 8–11]. Two major candidate genes (EGLN1 and EPAS1) in the hypoxia-inducible factor (HIF) pathway have been concordantly shown to carry the adaptive mutations . Both EPAS1 and EGLN1[10, 11] variants are associated with the adaptively low hemoglobin level of Tibetans relative to acclimatized lowlanders. Recently, a non-synonymous mutation in the EGLN1 gene is suggested to be the causal variant for local adaption in Tibetans . Intriguingly, Tibetans show distinct adaptive mechanism as compared to other highland populations including Ethiopians [7, 13] and Andeans , in which different variants in EGLN1 and different selection-nominated candidate genes like BHLHE41 have been identified. In non-human species indigenous to the Qinghai-Tibet plateau, the draft genomes of the Tibetan antelope  and Yak  have been recently generated. Adaptive signals have been detected in genes associated with oxygen transmission, energy metabolism and DNA repair in the two species, improving our understanding of the genetic mechanisms of high-altitude adaptation in highland animals.
The Tibetan pig, one of Tibetans’ domestic animals, is originally distributed in farming and farming-pastoral regions at altitudes of 2,900 - 4,300 m in the Qinghai-Tibetan Plateau . Like Tibetans and other colonists in the Plateau, Tibetan pigs have several key adaptive features for coping with the harsh environment at high altitude . First, they have black skin and hair with long and dense bristle that protect them from high solar radiation and cold ambient temperature in winter (Figure 1). Second, their well-developed heart and lung may be required for the increased rate of blood flow and oxygen transportation to tissues in response to hypoxia. Third, they have developed a blunted erythropoietic response to high-altitude hypoxia by exhibiting lower than expected hemoglobin concentrations relative to their lowland counterparts and acclimatized lowland pigs . This feature, in contrast to Tibetan yak, goat and sheep that show elevated hemoglobin levels at high altitude, is a crucial protection mechanism for excessive erythrocytosis (a classic feature of chronic mountain sickness ) in Tibetans [1–3]. Forth, their alertness and agility enable them to survive in the grazing condition.
To elucidate the genetic basis of the altitude phenotypes in Tibetan pigs, we have previously searched for population-differentiated SNPs across the genome between two Tibetan pig populations and 16 lowland breeds using the Illumina 60 K SNP data. We identified several candidate genes that might play a role in high-altitude adaptation in Tibetan pigs . More recently, Li et al.  characterize fast evolved genes in Tibetan pigs and highlight a set of genes that may contribute to high-altitude adaptation using whole-genome sequence data. The two studies treated all Tibetan pigs as a single population. However, the Qinghai-Tibet Plateau stretches across a vast region. Several geographically isolated populations of the Tibetan pig are currently living in the region, including Sichuan, Yunnan, Gansu and Tibet populations . Environmental factors including temperature, humidity, precipitation and vegetation are variable across the hypoxic and high-ultraviolet plateau . Therefore, different Tibetan geographical populations may have experienced independent adaptive mechanism for the harsh environment. Besides, it remains an open question whether Tibetan pig populations have experienced differentiation and subdivisions during the long period of their colonization in the Plateau.
Here we genotyped a broad panel of 678 pigs from 5 Tibetan regional populations and 34 lowland breeds using high-density SNP genotyping arrays. By applying population genetic and genomic approaches, we characterize the genetic landscape for Tibetan pigs and reveal both population-shared and -specific candidate genes contributing to local adaptations in the 5 Tibetan populations. Our findings provide novel insights into evolutionary history of Tibetan pigs and the genetic architecture of high-altitude adaptation in highland animals and their owners.
Population structure and evolution history of Tibetan pigs
To investigate population structure of Tibetan pigs, we first constructed a neighbor-joining (NJ) tree based on genome-wide allele sharing of the 678 pigs from 5 Tibetan populations, 28 Chinese lowland breeds (Figure 1, Additional file 1: Table S1) and 6 Western breeds (Additional file 1: Table S1). A notable feature of the NJ tree is the high concordance with which individuals cluster to their population origin. The topological tree (Additional file 2: Figure S1) clearly illustrates that all individuals from the same population or breed gather together, and Western pigs form a cluster separating from Chinese indigenous pigs. The genetic relationships between Chinese breeds are strikingly concordant with their geographic locations. For instance, Erhualian, Meishan, Jiangquhai, Jinhua, Tongcheng, Ganxi and Shaziling from the middle-lower belt of Yangtze River defined a separate grouping, while breeds from South China and Southwest China including Luchuan, Wuzhishan, Bamaxiang, Dahuabai, Congjiang Xiang and Diannan pigs appeared as a closely related cluster. Intriguingly, five Tibetan regional populations did not form a separate cluster. Instead, Tibetan pigs from Gansu province were more closely related to North China breeds like Bamei and Hetao than to any of the other Tibetan populations. Moreover, Tibetan pigs from Sichuan and Yunnan provinces were grouped together with their geographic neighbors, Rongchang and Neijiang. The two populations (Gongbujiangda and Milin) in the Tibet Autonomous Region clustered in an independent clade.
To better understand the population structure of Tibetan pigs, we further performed a principal component analysis (PCA) using a subset of 14,202 SNPs with low linkage disequilibrium (LD) extents (r2 < 0.3) in all tested pigs. The PCA plot (Additional file 3: Figures S2) shows the differentiation pattern between Chinese and Western pigs that is highly consistent the NJ tree results. The differentiation patterns of the five Tibetan regional populations resemble the NJ results. The Tibetan pigs from Yunnan and Sichuan provinces exhibit strong genetic affinity to their geographic neighbors including Neijing and Rongchang, whereas the Gansu Tibetan population cluster near their neighbors, Bamei and Hetao pigs. The Gongbujiang and Milin populations are more genetically similar to each other than to any of the other Tibetan populations.
To assess evolutionary origin and historical admixture patterns of Tibetan pigs in a context of worldwide populations, we further conducted a Bayesian ancestry inference analysis using the program ADMIXTURE (Additional file 4: Figure S3). Within Chinese pigs, from K = 3 to K = 7, variable fractions of Chinese wild ancestry were evidenced. At K = 7 and 8, five ancestry fractions were detected in Chinese breeds, including the ancestors of Gongbujiangda Tibetan, Congjiang Xiang, Luchan, Jinhua and Erhualian pigs. When we focused on Tibetan pigs, the Gansu Tibetan population was derived from two ancestry fractions of Gongbujiangda Tibetan (70%) and Erhualian (30%) pigs, and its ancestry structure was similar to their geographic neighbors: Bamei and Hetao pigs. The ancestry structures of Tibetan pigs from Sichuan and Yunnan were nearly identical, which consist of 75% of Gongbujiangda Tibetan, 10% of Luchuan and 15% of Erhualian ancestry fractions and resemble the structures of their lowland neighbors including Neijing, Rongchang and Mingguang pigs.
To further infer population splits and mixtures of Tibetan pigs, we used a recently developed approach, Treemix , to construct a maximum-likelihood tree of the 5 Tibetan populations, 21 Chinese lowland breeds and 1 Chinese wild boar population. A close examination of residuals from the inferred tree without migration events (Additional file 5: Figure S4) revealed that 6 pairs of populations were apparently not compatible with the best-fit tree, suggestive of gene flow events. Indeed, the tree model without migration events only explained 89.9% of the variance in the relatedness between populations. We sequentially added migration events to the maximum-likelihood tree (Figure 2). The new tree model allowing 6 major migration events explained higher percentage (96.0%) of the variance in the relatedness between populations. In the inferred graph (Figure 2), the regional populations from Tibet, Yunnan and Sichuan provinces were grouped into one of the two major groupings, whereas the Gansu Tibetan population clusters with the other major grouping consisting of Bamei and Hetao pigs. An ancestral edge further divided Tibet and Yunnan/Sichuan populations into different subgroups. Again, Yunan and Sichuan Tibetan populations clustered with their geographic neighbors Neijiang and Rongchang. Of the six migration events, the strongest signal suggests a genetic contribution from the ancestry of Tongcheng, Shaziling and Ganxi breeds to Dongshan pigs (migration weight (w) = 49.9%). Another visually apparent event is a gene flow from the ancestry of Wuzhishan and Luchuan pigs into Diannan pigs (w = 49.2%). We also inferred a significant admixture event from Meishan to Hetao individuals (w = 33.8%). For Tibetan pigs, we found a gene flow event from the early Gansu Tibetan population into the ancestry of the other four Tibetan populations with a migration weight of 38.8%.
Taken together, we conclude that Tibetan pig geographic populations have experienced substantial genetic differentiation and population admixture. During the formation of current populations, Tibetan pigs were likely influenced by their geographic neighbors. Tibetan pigs are thus not a single breed, which appear to be 3 independent populations corresponding to the Tibetan, Gansu, and Sichuan & Yunnan locations (thereafter namely SCYN).
Population-specific genomic signatures of selection in Tibetan pigs
According to the above-mentioned population genetic analyses, we divided Tibetan pigs into three independent populations: Tibet, Gansu and SCYN. To identify population-specific loci under positive selection in Tibetan pigs, we calculated the LSBL value for each of the 41,495 informative SNPs along the genome using a three-group contrasting model: one Tibetan population, one Chinese lowland group and the other two Tibetan populations (See Methods). The three-group test identified SNP outliers that had highly differentiated allele frequencies in one Tibetan population relative to the other two groups. These outlier SNPs are candidate loci (or to be in linkage disequilibrium with selected variants) under selection for adaptation to high-altitude hypoxia.
We identified a total of 207 SNP outliers (0.5% of empirical LSBL distribution), corresponding to 140, 129 and 124 candidate genes (50 kb up- and downstream of each SNP outliers) in Gansu, Tibet and SCYN populations, respectively (Additional file 6: Table S2). Few common signals were found between the three Tibetan populations (Figure 3, Additional file 7: Figure S5). C9ORF3, GRIN2B and GRID1, three functionally plausible genes (See Discussion), exhibited the most significant signals of selection in Gansu, Tibet and SCYN populations, respectively (Figure 4A). These SNPs showed a marked allele frequency difference between one Tibetan population and the other Tibetan and lowland populations (Figure 4B).
We sequentially performed Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses on highlighted candidate genes in each Tibetan pig population. GO analysis identified 27, 15 and 17 overrepresented genes in Gansu, Tibet and SCYN populations, respectively (Table 1). These genes are involved in several biological processes that likely play a role in local adaptation of Tibetan pigs, such as neuron-neuron synaptic transmission (GRIA1, GRIA4, GRM4, GRM5, HTR2A and VDAC1, Pcor = 0.018), detection of mechanical stimulus involved in sensory perception (GRIN2B, ITGA2 and PCDH15, Pcor = 0.029), artery development (CHD7, FOXS1, GLI3, and MYLK2, Pcor = 0.041), positive regulation of smooth muscle cell proliferation (ID2 and ITGA2, Pcor = 0.041) and glutamate receptor signaling pathway (GRIK2, HOMER2, NRXN1 and PLCB1, Pcor = 0.042). We also identified 4 and 3 KEGG pathways in Gansu and Tibet populations (Table 1) that were overrepresented after correction for multiple testing: basal cell carcinoma (Pcor = 0.031), circadian entrainment (Pcor = 0.033), nicotine addiction (Pcor = 0.043) and adherens junction (Pcor = 0.045) pathways in the Gansu population, and propanoate metabolism (Pcor = 0.004), sulfur metabolism (Pcor = 0.004) and DNA replication (Pcor = 0.024) in the Tibet population.
Population-shared genomic signatures in all Tibetan pigs
To test if Tibetan pig populations have common adaptive loci for the Plateau, we also looked across the genome to identity signals of selection using the three-group test model with all Tibetan pigs as one group and two Chinese lowland groups (See Methods). A total of 207 SNP outliers were visualized on the top 0.5% of the empirical distribution, corresponding to 122 genes (Additional file 6: Table S2). The LSBL value plots and allele distribution patterns of the top three loci are depicted in Figure 4. Each top locus shows highly differentiation pattern between Tibetan pigs and Chinese lowland animals. For instance, at the top of the list is an intergenic SNP between HFM1 and ZNF644 (rs80868124, LSBL value = 0.535). Allele C at the SNP is predominantly presented in Tibetan pigs with average frequency of 0.816 whereas is much rarer in Chinese lowland breeds at an average frequency of 0.207 except for Neijiang (0.750), a breed known for its well adaptability to diverse environmental conditions . The biological role of HFM1 and ZNF644 in response to hypoxia has not been established and warrants further investigations. The second strongest LSBL SNP (rs81403518) is located at 12 kb upstream of the PLA2G12A gene, a biologically plausible gene (see Discussion). The third strongest outlier is located at ~17 kb downstream of a hypoxia-inducible gene: RGCC.
GO analysis identified 16 overrepresented genes that are involved in the regulation of collagen metabolic process (CST3, ITGA2, PDGFRB and RGCC, Pcor = 0.002), histone H3-K9 methylation (DNMT3B, KDM1A and PRDM5, Pcor = 0.018), positive regulation of positive chemotaxis (CDH13, CNTN1 and ITGA2, Pcor = 0.033), regulation of cytokinesis (PDZD2 and TEX14, Pcor = 0.038), fatty acid catabolism (MUT and PCCA, Pcor = 0.043), detection of mechanical stimulus involved insensory perception of pain (GRIN2B and ITGA2, Pcor = 0.043), and behavioral defense response (GRIN2B, NR2E1 and VDAC1, Pcor = 0.045) (Table 1). Meanwhile, we found 3 KEGG pathwaysthat were overrepresented after correction for multiple testing: valine, leucine and isoleucine degradation (Pcor = 0.036), fat digestion and absorption (Pcor = 0.042) and amyotrophic lateral sclerosis (Pcor = 0.046) (Table 1). KEGG:04975 (fat digestion and absorption) is particular interesting as it may reflect gene selection for fully utilizing fat as an energy source in case of food shortage, a critical ecological factor restricting the viability of highland animals.
Signatures of selection in three well-characterized hypoxia genes
EGLN1 and EPAS1, two critical regulators in the HIF pathway, have been repeatedly identified as targets of selection for high-altitude adaptation in Tibetans (reviewed in ). ADAM17 is the most prominent locus showing signal of positive selection in the Tibetan yak . We, therefore, are interested in whether any of the three genes has experienced convergent selection in Tibetan pigs. As no SNP on the current porcine 60 K chip is located around the three genomic regions, we sequentially genotyped 56 SNPs at an average density of 1 SNP/5.4 kb covering the three genes on a panel of 324 individuals including 84 Tibetan pigs and 240 lowland animals. The resulting genotype data were merged into the 60 K SNP dataset. We then performed the LSBL analysis on the 324 individuals using a common subset of informative SNPs. No significant selection signal was detected at both EGLN1 and ADAM17 loci (Additional file 8: Figure S6). Two of 22 SNPs around the EPAS1 gene appeared to be population-shared outliers surpassing the significance threshold (LSBL value = 0.301, corresponding to the top 0.5% of empirical distribution). The two SNPs showed apparently different variation patterns between Tibetan pigs and low-altitude individuals (Additional file 8: Figure S6).
Comparison of our findings with previous reports
As shown in Figure 5, a majority of candidate genes implicated in the current study are distinct from those hypoxia genes identified in Tibetans [4–9], other highlanders [11–13] and Tibetan antelope  and yak . We argue that Tibetan pigs may have evolved a different biological adaptation mechanism. Even compared with the recent findings of hypoxia-related genes in Tibetan pigs based on the whole-genome sequence data , most of candidate genes including those ranking in the top list of the current study (such as PLA2G12A, RGCC, C9ORF3, GRIN2B and GRID1) are reported for the first time.
Population history of Tibetan pigs
The formation of Tibetan present-day pig populations was likely influenced by the migration of their owners: Tibetans. Today, both archaeological and genetic data support an ancient initial colonization of modern humans in the early Upper Paleolithic before the last glacial maximum (22,000 – 18,000 years ago) and a recent population expansion and gene flow from outside the highland region in the early Neolithic (10,000 – 7,000 years ago) . The recent migration events are believed to bring agriculture into the Himalayan region, leading to the establishment of farming and yak pastoralism on the Plateau . Since then, the trade between the Tibet and Southeastern/Northeastern China lasted for thousands of years through three major routines: the Tangbo Ancient Road and the Tea-horse Ancient Road including the Yunnan-Tibet and the Sichuan-Tibet routes. The Tangbo Ancient Road extends from Chang’an (the ancient capital of the Chinese Empire) to Lhasa. Historically, it was a very important trade route linking the Upper Yellow River region and the Tibet especially during the Dang Dynasty (618 – 907 A.D.). The Tea-horse Ancient Road played a crucial role in communication and exchange between the residents of Yunnan, Sichuan and Tibet. It had been flourishing for over a century until the end of World War II. The long-standing trade likely brought along human-mediated dispersal of lowland pigs in Southwestern and Northwestern China into the Tibetan Plateau. Genetic data from mitochondria DNA support a local domestication origin of Tibetan pigs . However, there were multiple migrations of Tibetan ancestors at different times from different places and ample trade exchanges between the Tibet and the outside regions. This leads us to assume that the genetic makeup of current Tibetan pig populations may be influenced by their lowland neighbors, leading to population subdivisions of Tibetan pigs. The hypothesis is supported by our following observations. (1) In the PCA results, PC2 axis separated Tibetan pig populations into different groupings corresponding to their geographic locations. (2) In the NJ phylogenetic tree, the Gansu Tibetan population defined an independent branch with its neighbors including the Bamei pig in Qinghai and the Hetao pig in Inner Mongolia. The Sichuan and Yunnan Tibetan populations formed another cluster together with two lowland neighbors (Neijiang and Rongchang) in the Sichuan Basin. The two local populations from the Tibet appeared to be a distinct clade. (3) The Treemix analysis showed the evolutionary splits between the Tibetan populations. The Gansu Tibetan population was assigned to one ancient major group separating from the other Tibetan pig populations. Again, Sichuan and Yunnan Tibetan populations together with Neijiang and Rongchang defined a subgroup separating from the Tibet populations. These findings collectively support the contribution of geographically neighboring lowland populations to the today’s Tibetan populations. The historical admixture events, at least to a certain extent, are responsible for the substantial inter-population differentiation in Tibetan pigs. Altogether, we believe that Tibetan pig populations have experienced distinct demographic histories and have various degree of admixture with different neighboring populations. In general, Tibetan pigs can be divided into 3 independent populations: Tibet, Gansu and Sichuan & Yunnan.
Distinct selection-nominated loci in different Tibetan pig populations
In Tibetans, a list of genes has been highlighted as potential targets of natural selection (reviewed in ). While EPAS1 and EGLN1 at the top list are consistently identified in multiple Tibetan studies, many of these selection candidate genes are unique to each study. Such differences may be at least partly attributed to different demographic histories and genetic differentiation among Tibetan populations that likely have various degree of admixture with different neighboring populations [23, 26, 27]. Tibetan pigs show similar population history patterns to their owners. As mentioned above, our genetic data suggest long-term population isolation and genetic differentiation among Tibetan pig populations with potential for admixture with geographic neighbors. It is therefore reasonable to speculate that each of Tibetan pig population may have unique adaptive variants. Our genome scans on individual population confirm the speculation. The SNP outliers in one population are largely different from the others (Figure 4). Especially, the top SNPs corresponding to C9ORF3, GRIN2B and GRID1 genes exhibit population specific signals in the Gansu, Tibet and Sichuan & Yunan populations, respectively (Figure 4). These candidate genes are worthy mentioning because of their statistical significance and function implications. Further investigations of their potential role in the high-altitude adaptation of Tibetan pigs are worthwhile.
C9ORF3, also known as Aminopeptidase O, encodes a member of the M1 zinc aminopeptidase family that catalyzes the hydrolysis of amino acid residues from the N-terminus of peptide. C9ORF3 is involved in the renin-angiotensin pathway, in which C9ORF3 cleaves angiotensin III to generate angiotensin IV . It is known that angiotensin IV regulates the vasoconstriction [29, 30] and the hydromineral balance as well as arterial blood pressure . Moreover, high expression level of C9ORF3 is positively correlated with maximal oxygen uptake and the percentage of type 1 fibers in humans . As predicted by MalaCards , C9ORF3 is associated with newborn respiratory distress syndrome. Therefore, preferential selection on the C9ORF3 gene is likely required for Tibetan (Gansu) pigs to increase oxygen uptake and avoid pulmonary or cerebral vascular hypertension and edema during the long-standing living in the hypoxic highland.
GRIN2B encodes a subunit of N-methyl-D-apartate receptor that is the predominant excitatory neurotransmitter receptor in the mammalian brain. The receptor has a central role in memory and cognitive function. Defects in the GRIN2B gene have been found in human patients with mental retardation . GRID1 encodes glutamate receptor delta 1, a subunit of glutamate receptor channels that mediate most of the fast excitatory synaptic transmission in the central nervous system and play key roles in synaptic plasticity . Variations in the promoter region of the GRID1 gene have been associated with human schizophrenia [36, 37]. Positive selection of these genes may suggest the importance of neural regulation in the establishment of quickly response to attack under grazing conditions in Tibetan pigs.
Common selection targets in Tibetan pigs
In addition to population-specific candidate loci, we highlighted a list of selection-nominated candidate genes shared by all Tibetan pigs. Many of these genes are functionally related to energy metabolism, angiogenesis, melanin synthesis and behavior defense response. Of note, PLA2G12A and RGCC in our top-ranking list stand out as strong candidate genes that likely play a role in high-altitude adaptation in Tibetan pigs.
PLA2G12A encodes secreted phospholipase A2, group XIIA enzyme. The enzyme hydrolyzes phospholipids into arachidonic acid and other lipophilic molecules that exert a variety of biological effects. Of note, arachidonic acid plays an important role in regulation of pulmonary vascular tone in lungs of both newborns and adults. In pigs and other species, arachidonic acid can induce dilation in pulmonary arteries. Altered arachidonic acid metabolites have been implicated in the development of pulmonary hypertension in chronically hypoxic piglets . Therefore, PLA2G12A is a strong candidate gene contributing to high-altitude adaptation in Tibetan pigs. High frequencies of PLA2G12A favorable mutations in Tibetan pigs could be beneficial for increasing blood flow at low degree of hypoxic pulmonary hypertension.
RGCC plays a critical role in hypoxia-induced antiangiogenesis. It has a physical interaction with HIF1α and vascular endothelial growth factor (VEGF) that are key mediators in cellular response to hypoxia and ischemia . In response to acute hypoxia, HIF1α activates the expression of VEGF that is critically important for skeletal muscle angiogenesis. The increased growth of blood vessels has been associated with capillary leak . Therefore, the preferentially selected variants in the RGCC gene may inhibit the capillary growth and consequently avoid capillary leak in Tibetan pigs under long-term exposure to the hypoxic environment.
EPAS1, the “master regulator” of erythropoiesis in the HIF pathway (reviewed in ), also appear to be a target of selection in this study. EPAS1 allelic variants that are common in Tibetan pigs may be loss of function and be negative associated with hemoglobin concentrations as a blunted erythropoietic response observed in Tibetans and Tibetan pigs [1–3]. It should be noted that the other well-characterized hypoxia gene EGLN1 is not a target of selection in Tibetan pigs.
A complex picture of the genetic mechanism underlying high-altitude adaptation in Tibetan pigs
In this study, we identified a total of 489 genes as potential selection targets in all Tibetan pig populations (Additional file 6: Table S2). A majority of these candidate genes have not been implicated in previous genome-wide studies on highlanders and high-altitude animals. Many genes are functionally related to high-altitude physiology, such as angiogenesis (RGCC), pulmonary hypertension remodeling (PLA2G12A), oxygen intake (C9ORF3), neural response (GRID1 and GRIN2B), melanin synthesis (MITF, PDGFRB and PIK3R3) and heart development (FOXS1, GAA and MYLK2). Further characterization of these genes will be necessary to identify variants responsible for the observed signals. Both distinct and shared genomic signatures of selection were illustrated in Tibetan regional pig populations. We argue that both independent and shared adaptive variants could be responsible for local adaptation in each Tibetan pig population. Our findings reflect a complex picture of the genetic mechanism underlying high-altitude adaptation in Tibetan pigs. The adaptive mechanism seems to involve a range of genes regulating diverse biological processes. Distinct set of genes including some broadly shared loci act in a highly coordinated manner to offset severe physiological challenges imposed by the harsh hypoxic environment in different Tibetan pig populations.
Our analysis provides a list of potential genes involved in high-altitude adaptation in Tibetan pigs, which will be the ground for future investigations. It is worthwhile to conduct resequencing of the most promising genomic regions. Haplotype similarity analysis based on the resequencing-called variants would allow us to identify Tibetan specific and minimal-shared haplotypes. The functional variants residing in these haplotypes, such as protein-altering mutations and regulatory mutations at the evolutionary conserved sites, would be strong candidate causative mutations.
Although we identified a list of candidate genes for high-altitude adaptation in Tibetan pigs, we are limited by the ascertainment bias and low genomic coverage of our SNP dataset. The 60 K SNPs on the Illumina porcine DNA chip were primarily characterized from Western pigs that have relatively higher LD extents compared to Tibetan pigs . A mass of Tibetan specific variants are not included in the chip. Therefore, we must not have the power to identify all variants under positive selection for local adaptation of Tibetan pigs. Own to the substantially decreasing cost, whole-genome sequencing is now affordable in domestic animals. Such population-scale genome analyses on a broad sample of representative individuals would identify additional adaptive variants and further improve our understanding of the genetic basis of high-altitude adaptation in Tibetan pigs.
Tibetan pig populations have experienced substantial genetic differentiation. Geographically neighboring lowland breeds likely contributed to Tibetan pigs by historical admixture events. The present-day Tibetan pigs can be divided into 3 independent populations: Tibet, Gansu and Sichuan & Yunnan. After a long period of colonization in the Qinghai-Tibetan Plateau, Tibetan pigs have developed a complex biological adaptation mechanism that could be different from that of Tibetans and other highland animals. Different Tibetan pig populations appear to have both distinct and convergent adaptive loci for the harsh environment of the Plateau.
All animal work was conducted according to the guidelines for the care and use of experimental animals established by the Ministry of Agriculture of China. The Ethics Committee of Jiangxi Agricultural University specifically approved this study.
A total of 678 pigs from 5 Tibetan geographic populations and 34 Chinese and Western lowland breeds were investigated in this study. These pigs are unrelated animals with no common ancestry for 3 generations. Boars were preferentially collected to cover consanguinity as broadly as possible. Of the 678 pigs, 304 individuals from 18 breeds have been tested in our previous study , and SNP data of 85 pigs from 6 breeds were retrieved from the Dryad Digital Repository: http://dx.doi.org/10 5061/dryad.t1r3d. Sample size and origin of each Tibetan population and lowland breeds are given in Additional file 1: Table S1, and the geographic locations of 33 Chinese breeds including the 5 Tibetan populations are shown in Figure 1. Genomic DNA was extracted from ear tissues using a routine phenol/chloroform protocol, and was diluted to a final concentration of 20 ng/ml.
All animals were genotyped for ~62,000 SNPs on Porcine SNP60 BeadChips  (Illumina, USA) according to the manufacturer protocol. SNP genotypes were recorded using BEADSTUDIO version 3.2 (Illumina, USA). SNPs were filtered with the criterion of call rate > 95% and minor allele frequency (MAF) > 0.05. A total of 41,495 informative SNPs were obtained, and different subsets of SNP data were chose from the 41,495 SNPs for further statistical analyses (see below). SNP genomic positions correspond to the current pig genome assembly (Sscrofa10.2).
A panel of 56 SNP markers at an average interval of 5.4 kb (Additional file 9: Table S3) covering 3 well-characterized hypoxia genes (EGLN, EPAS1 and ADAM1) was genotyped on 324 Chinese indigenous pigs by the iPLEX MassARRAY platform (Sequenom, USA) according to the supplier’s protocol. SNP genotype calls were filtered with MAF > 0.05 and genotype call rate > 95%, resulting in 49 informative SNPs. The 49 SNPs were merged into the illumina SNPs to form a common subset of 41,544 SNP data in Chinese indigenous pigs, which were then used to detect signals of selection on the 3 hypoxia genes in Tibetan pigs.
Population genetic and structure analyses
A common subset of 25,340 SNPs with MAF ≥ 0.2 in all tested pigs were explored to calculate three measures of genetic variability of each population: allelic richness (AR), the proportion of polymorphic markers (PN) and expected heterozygosity (HE) using ADZE  and PLINK  as described previously . All 41,495 informative SNPs were used to analyze the pairwise genetic distance between populations as shown in our previous study . Neighbor joining relationship trees between individuals were constructed using Neighbor in the PHYLIP version 3.69 package  and visualized by Figtree v1.4.0 (BEAST Software, http://beast.bio.ed.ac.uk/FigTree). A subset of 14,202 SNPs with low linkage disequilibrium (r2 < 0.3) filtered by PLINK  was employed to perform principal component analysis using the Smartpca program from EIGENSOFT .
The genetic differentiation between populations was assessed by the FST fixation index. Unbiased genetic differentiation estimates of FST were calculated as described in Reich’s paper  using the whole SNP dataset. Briefly, FST was estimated as follows:
In the above formulae, ni denotes the sample size in the ith population, and ai is the counts of SNP allele A in the ith population (i = 1, 2). Because the range of FST is originally defined between 0 and 1 , negative FST values that do not have a biological interpretation were set to 0.
Population structure was analyzed using the Maximum Likelihood approach implemented in ADMIXTURE v 1.20  and the above- mentioned 14,202 SNPs of low LD extents. The ADMIXTURE program was run in an unsupervised manner with a variable number of clusters (K = 2 to 8). The lowest 10-fold cross-validation values were used to choose an optimum K-value according to the default termination setting.
TreeMix  was employed to infer the patterns of population historical splits and mixtures for Tibetan populations in the context of diverse Chinese breeds. TreeMix estimates the historical relationships of sampled populations with a particular focus on topology rather than on the timing of demographic events. In the maximum likelihood trees, nodes represented inferred population splits, edges indicated the populations having ancestry from multiple parental populations, and branch lengths were proportional to the amount of genetic drift that populations have undergone. Migration events were modeled for populations that did not fit well the bifurcating tree model. These populations having ancestry from multiple parental populations were indicated as arrows. The color shades of the arrows reflected the relative weight of migration. Here, maximum-likelihood trees of Chinese pig populations without migration events and with 6 migration events were tested by using the Chinese wild boars as the outgroup population.
Genome-wide tests of signatures of positive selection
LSBL statistics, a robust indicative of selection-nominated loci , were calculated for each polymorphic site of 41,495 informative SNPs under a three-group contrasting model. To identify population-shared signatures of selection, we treated Tibetan pigs as the highland group, and divided 20 Chinese lowland breeds into two lowland groups that were splited by TreeMix. Group 1 comprises 10 breeds including Hetao, Min, Laiwu, Jinhua, Jiangquhai, Erhualian, Meishan, Tongcheng, Ganxi and Shaziling pigs. Group 2 also consists of 10 breeds including Neijiang, Rongchang, Minguang, Diannan, Congjiang Xiang, Dahuabai, Bamaxiang, Dongshan, Luchuan and Wuzhishan pigs. To detect population-specific loci under selection, we treated one Tibetan population as the highland group, the remaining Tibetan populations as one contrasting group and the 20 Chinese lowland breeds as another contrasting group. To calculate LSBL values, we computed pairwise FST using the above Reich’s calculation at every SNP position for each two-way group comparison (i.e., Tibetan to Lowland group 1, Tibetan to Lowland group 2, and Lowland group 1 to Lowland group 2). Next, the pairwise FST values were used to calculate the LSBL at each SNP as described previously . With three contrasting groups, SNPs showing Tibetan specific FST were identified as candidate selection loci. LSBL outliers were defined as sites with LSBL statistics surpassing 0.5% of the empirical distributions.
Characterization of candidate genes under selection
The 50 kb upstream and downstream of significant LSBL loci were operationally defined as candidate regions under selection. Pig annotated genes within candidate regions were first searched against the pig genome assembly 10.2  via the Ensembl Genome Browser (http://ensembl.org/index.html). Furthermore, human orthologous genes were identified by aligning pig gene-associated regions against the human genome using the BLAST-like Alignment Tool . To perform functional enrichment of the candidate genes, the human Gene Ontology database (http://www.geneontology.org) was then queried by the ClueGO plugin of Cytoscape  using Symbol ID as input parameters. The enriched GO terms and KEGG pathways were characterized according to the default setting.
The genotyping data set supporting the results of this article is available in the Dryad database (doi:10.5061/dryad.53j31).
Kyoto encyclopedia of genes and genomes
Locus-specific branch length
Minor allele frequency
Principal component analysis
Sichuan & Yunnan
Vascular endothelial growth factor
Beall CM: Two routes to functional adaptation: Tibetan and Andean high-altitude natives. Proc Natl Acad Sci U S A. 2007, 104 (Suppl 1): 8655-8660.
Beall CM, Brittenham GM, Strohl KP, Blangero J, Williams-Blangero S, Goldstein MC, Decker MJ, Vargas E, Villena M, Soria R, Alarcon AM, Gonzales C: Hemoglobin concentration of high-altitude Tibetans and Bolivian Aymara. Am J Phys Anthropol. 1998, 106 (3): 385-400. 10.1002/(SICI)1096-8644(199807)106:3<385::AID-AJPA10>3.0.CO;2-X.
Wu T, Wang X, Wei C, Cheng H, Wang X, Li Y, Ge D, Zhao H, Young P, Li G, Wang Z: Hemoglobin levels in Qinghai-Tibet: different effects of gender for Tibetans vs. Han. J Appl Physiol. 2005, 98 (2): 598-604.
Xu S, Li S, Yang Y, Tan J, Lou H, Jin W, Yang L, Pan X, Wang J, Shen Y, Wu B, Wang H, Jin L: A genome-wide search for signals of high-altitude adaptation in Tibetans. Mol Biol Evol. 2011, 28 (2): 1003-1011. 10.1093/molbev/msq277.
Peng Y, Yang Z, Zhang H, Cui C, Qi X, Luo X, Tao X, Wu T, Ouzhuluobu , Basang , Ciwangsangbu , Danzengduojie , Chen H, Shi H, Su B: Genetic variations in Tibetan populations and high-altitude adaptation at the Himalayas. Mol Biol Evol. 2011, 28 (2): 1075-1081. 10.1093/molbev/msq290.
Bigham A, Bauchet M, Pinto D, Mao X, Akey JM, Mei R, Scherer SW, Julian CG, Wilson MJ, Lopez Herraez D, Brutsaert T, Parra EJ, Moore LG, Shriver MD: Identifying signatures of natural selection in Tibetan and Andean populations using dense genome scan data. PLoS Genet. 2010, 6 (9): e1001116-10.1371/journal.pgen.1001116.
Scheinfeldt LB, Soi S, Thompson S, Ranciaro A, Woldemeskel D, Beggs W, Lambert C, Jarvis JP, Abate D, Belay G, Tishkoff SA: Genetic adaptation to high altitude in the Ethiopian highlands. Genome Biol. 2012, 13 (1): R1-10.1186/gb-2012-13-1-r1.
Yi X, Liang Y, Huerta-Sanchez E, Jin X, Cuo ZX, Pool JE, Xu X, Jiang H, Vinckenbosch N, Korneliussen TS, Zheng H, Liu T, He W, Li K, Luo R, Nie X, Wu H, Zhao M, Cao H, Zou J, Shan Y, Li S, Yang Q, Asan , Ni P, Tian G, Xu J, Liu X, Jiang T, Wu R, et al: Sequencing of 50 human exomes reveals adaptation to high altitude. Science. 2010, 329 (5987): 75-78. 10.1126/science.1190371.
Beall CM, Cavalleri GL, Deng L, Elston RC, Gao Y, Knight J, Li C, Li JC, Liang Y, McCormack M, Montgomery HE, Pan H, Robbins PA, Shianna KV, Tam SC, Tsering N, Veeramah KR, Wang W, Wangdui P, Weale ME, Xu Y, Xu Z, Yang L, Zaman MJ, Zeng C, Zhang L, Zhang X, Zhaxi P, Zheng YT: Natural selection on EPAS1 (HIF2alpha) associated with low hemoglobin concentration in Tibetan highlanders. Proc Natl Acad Sci U S A. 2010, 107 (25): 11459-11464. 10.1073/pnas.1002443107.
Simonson TS, Yang Y, Huff CD, Yun H, Qin G, Witherspoon DJ, Bai Z, Lorenzo FR, Xing J, Jorde LB, Prchal JT, Ge R: Genetic evidence for high-altitude adaptation in Tibet. Science. 2010, 329 (5987): 72-75. 10.1126/science.1189406.
Xiang K, Ouzhuluobu , Peng Y, Yang Z, Zhang X, Cui C, Zhang H, Li M, Zhang Y, Bianba G, Basang C, Wu T, Chen H, Shi H, Qi X, Su B: Identification of a Tibetan-specific mutation in the hypoxic gene EGLN1 and its contribution to high-altitude adaptation. Mol Biol Evol. 2013, 30 (8): 1889-1898. 10.1093/molbev/mst090.
Simonson TS, McClain DA, Jorde LB, Prchal JT: Genetic determinants of Tibetan high-altitude adaptation. Hum Genet. 2012, 131 (4): 527-533. 10.1007/s00439-011-1109-3.
Huerta-Sanchez E, Degiorgio M, Pagani L, Tarekegn A, Ekong R, Antao T, Cardona A, Montgomery HE, Cavalleri GL, Robbins PA, Weale ME, Bradman N, Bekele E, Kivisild T, Tyler-Smith C, Nielsen R: Genetic signatures reveal high-altitude adaptation in a set of ethiopian populations. Mol Biol Evol. 2013, 30 (8): 1877-1888. 10.1093/molbev/mst089.
Ge RL, Cai Q, Shen YY, San A, Ma L, Zhang Y, Yi X, Chen Y, Yang L, Huang Y, He R, Hui Y, Hao M, Li Y, Wang B, Ou X, Xu J, Zhang Y, Wu K, Geng C, Zhou W, Zhou T, Irwin DM, Yang Y, Ying L, Bao H, Kim J, Larkin DM, Ma J, Lewin HA, et al: Draft genome sequence of the Tibetan antelope. Nat Commun. 2013, 4: 1858-
Qiu Q, Zhang G, Ma T, Qian W, Wang J, Ye Z, Cao C, Hu Q, Kim J, Larkin DM, Auvil L, Capitanu B, Ma J, Lewin HA, Qian X, Lang Y, Zhou R, Wang L, Wang K, Xia J, Liao S, Pan S, Lu X, Hou H, Wang Y, Zang X, Yin Y, Ma H, Zhang J, Wang Z, et al: The yak genome and adaptation to life at high altitude. Nat Genet. 2012, 44 (8): 946-949. 10.1038/ng.2343.
Wang L, Wang A, Wang L, Li K, Yang G, He R, Qian L, Xu N, Huang R, Peng Z, Zeng Y, Pang Y: Animal Genetic Resourses in China: Pigs. 2011, Beijing: China Agricultural Press, 361-374. in Chinese
Chama Y, Zhang H, Baina Y, Liu J, Shang P, Danzeng W: Determination of blood physiological parameters in Tibet pig at high altitude. Southwest China J Agric Sci. 2011, 24 (6): 2382-2384. (in Chinese)
Pei SX, Chen XJ, Si Ren BZ, Liu YH, Cheng XS, Harris EM, Anand IS, Harris PC: Chronic mountain sickness in Tibet. Q J Med. 1989, 71 (266): 555-574.
Ai H, Huang L, Ren J: Genetic diversity, linkage disequilibrium and selection signatures in Chinese and Western pigs revealed by genome-wide SNP markers. PLoS One. 2013, 8 (2): e56001-10.1371/journal.pone.0056001.
Li M, Tian S, Jin L, Zhou G, Li Y, Zhang Y, Wang T, Yeung CK, Chen L, Ma J, Zhang J, Jiang A, Li J, Zhou C, Zhang J, Liu Y, Sun X, Zhao H, Niu Z, Lou P, Xian L, Shen X, Liu S, Zhang S, Zhang M, Zhu L, Shuai S, Bai L, Tang G, Liu H, et al: Genomic analyses identify distinct patterns of selection in domesticated pigs and Tibetan wild boars. Nat Genet. 2013, 45 (12): 1431-1438. 10.1038/ng.2811.
Pickrell JK, Pritchard JK: Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 2012, 8 (11): e1002967-10.1371/journal.pgen.1002967.
Zhang Z, Li B, Chen X: Pig Breeds in China. 1986, Shanghai: Shanghai Scientific and Technical Publisher
Qi X, Cui C, Peng Y, Zhang X, Yang Z, Zhong H, Zhang H, Xiang K, Cao X, Wang Y, Ouzhuluobu , Basang , Ciwangsangbu , Bianba , Gonggalanzi , Wu T, Chen H, Shi H, Su B: Genetic evidence of Paleolithic colonization and Neolithic expansion of modern humans on the Tibetan plateau. Mol Biol Evol. 2013, 30 (8): 1761-1778. 10.1093/molbev/mst093.
Wang Z: History of Nationalities in China. 1994, Beijing: China Social Science Press, in Chinese
Yang S, Zhang H, Mao H, Yan D, Lu S, Lian L, Zhao G, Yan Y, Deng W, Shi X, Han S, Li S, Wang X, Gou X: The local origin of the Tibetan pig and additional insights into the origin of Asian pigs. PLoS One. 2011, 6 (12): e28215-10.1371/journal.pone.0028215.
Aldenderfer M: Peopling the Tibetan plateau: insights from archaeology. High Alt Med Biol. 2011, 12 (2): 141-147. 10.1089/ham.2010.1094.
Zhao M, Kong QP, Wang HW, Peng MS, Xie XD, Wang WZ, Jiayang , Duan JG, Cai MC, Zhao SN, Cidanpingcuo , Tu YQ, Wu SF, Yao YG, Bandelt HJ, Zhang YP: Mitochondrial genome evidence reveals successful Late Paleolithic settlement on the Tibetan Plateau. Proc Natl Acad Sci U S A. 2009, 106 (50): 21230-21235. 10.1073/pnas.0907844106.
Diaz-Perales A, Quesada V, Sanchez LM, Ugalde AP, Suarez MF, Fueyo A, Lopez-Otin C: Identification of human aminopeptidase O, a novel metalloprotease with structural similarity to aminopeptidase B and leukotriene A4 hydrolase. J Biol Chem. 2005, 280 (14): 14310-14317. 10.1074/jbc.M413222200.
Li XC, Campbell DJ, Ohishi M, Yuan S, Zhuo JL: AT1 receptor-activated signaling mediates angiotensin IV-induced renal cortical vasoconstriction in rats. Am J Physiol Renal Physiol. 2006, 290 (5): F1024-F1033.
Lochard N, Thibault G, Silversides DW, Touyz RM, Reudelhuber TL: Chronic production of angiotensin IV in the brain leads to hypertension that is reversible with an angiotensin II AT1 receptor antagonist. Circ Res. 2004, 94 (11): 1451-1457. 10.1161/01.RES.0000130654.56599.40.
Le MT, Vanderheyden PM, Szaszak M, Hunyady L, Vauquelin G: Angiotensin IV is a potent agonist for constitutive active human AT1 receptors. Distinct roles of the N-and C-terminal residues of angiotensin II during AT1 receptor activation. J Biol Chem. 2002, 277 (26): 23107-23110. 10.1074/jbc.C200201200.
Parikh H, Nilsson E, Ling C, Poulsen P, Almgren P, Nittby H, Eriksson KF, Vaag A, Groop LC: Molecular correlates for maximal oxygen uptake and type 1 fibers. Am J Physiol Endocrinol Metab. 2008, 294 (6): E1152-E1159. 10.1152/ajpendo.90255.2008.
Rappaport N, Nativ N, Stelzer G, Twik M, Guan-Golan Y, Stein TI, Bahir I, Belinky F, Morrey CP, Safran M, Lancet D: MalaCards: an integrated compendium for diseases and their annotation. Database (Oxford). 2013, 2013: bat018-
Endele S, Rosenberger G, Geider K, Popp B, Tamer C, Stefanova I, Milh M, Kortum F, Fritsch A, Pientka FK, Hellenbroich Y, Kalscheuer VM, Kohlhase J, Moog U, Rappold G, Rauch A, Ropers HH, von Spiczak S, Tonnies H, Villeneuve N, Villard L, Zabel B, Zenker M, Laube B, Reis A, Wieczorek D, Van Maldergem L, Kutsche K: Mutations in GRIN2A and GRIN2B encoding regulatory subunits of NMDA receptors cause variable neurodevelopmental phenotypes. Nat Genet. 2010, 42 (11): 1021-1026. 10.1038/ng.677.
Yamazaki M, Araki K, Shibata A, Mishina M: Molecular cloning of a cDNA encoding a novel member of the mouse glutamate receptor channel family. Biochem Biophys Res Commun. 1992, 183 (2): 886-892. 10.1016/0006-291X(92)90566-4.
Nenadic I, Maitra R, Scherpiet S, Gaser C, Schultz CC, Schachtzabel C, Smesny S, Reichenbach JR, Treutlein J, Muhleisen TW, Deufel T, Cichon S, Rietschel M, Nothen MM, Sauer H, Schlosser RG: Glutamate receptor delta 1 (GRID1) genetic variation and brain structure in schizophrenia. J Psychiatr Res. 2012, 46 (12): 1531-1539. 10.1016/j.jpsychires.2012.08.026.
Treutlein J, Muhleisen TW, Frank J, Mattheisen M, Herms S, Ludwig KU, Treutlein T, Schmael C, Strohmaier J, Bosshenz KV, Breuer R, Paul T, Witt SH, Schulze TG, Schlosser RG, Nenadic I, Sauer H, Becker T, Maier W, Cichon S, Nothen MM, Rietschel M: Dissection of phenotype reveals possible association between schizophrenia and Glutamate Receptor Delta 1 (GRID1) gene promoter. Schizophr Res. 2009, 111 (1–3): 123-130.
Fike CD, Kaplowitz MR, Pfister SL: Arachidonic acid metabolites and an early stage of pulmonary hypertension in chronically hypoxic newborn pigs. Am J Physiol Lung Cell Mol Physiol. 2003, 284 (2): L316-L323. 10.1152/ajpcell.00125.2002.
An X, Jin Y, Guo H, Foo SY, Cully BL, Wu J, Zeng H, Rosenzweig A, Li J: Response gene to complement 32, a novel hypoxia-regulated angiogenic inhibitor. Circulation. 2009, 120 (7): 617-627. 10.1161/CIRCULATIONAHA.108.841502.
Breen E, Tang K, Olfert M, Knapp A, Wagner P: Skeletal muscle capillarity during hypoxia: VEGF and its activation. High Alt Med Biol. 2008, 9 (2): 158-166. 10.1089/ham.2008.1010.
van Patot MC, Gassmann M: Hypoxia: adapting to high altitude by mutating EPAS-1, the gene encoding HIF-2alpha. High Alt Med Biol. 2011, 12 (2): 157-167. 10.1089/ham.2010.1099.
Pérez-Enciso M, Burgos-Paz W, Souza CA, Megens HJ, Ramayo-Caldas Y, Melo M, Lemús-Flores C, Caal E, Soto HW, Martínez R, Alvarez LA, Aguirre L, Iñiguez V, Revidatti MA, Martínez-López OR, Llambi S, Esteve-Codina A, Rodríguez MC, Crooijmans R, Paiva SR, Schook LB, Groenen MAM: Data from: porcine colonization of the Americas: a 60k SNP story. Dryad Data Repository. 2012
Ramos AM, Crooijmans RP, Affara NA, Amaral AJ, Archibald AL, Beever JE, Bendixen C, Churcher C, Clark R, Dehais P, Hansen MS, Hedegaard J, Hu ZL, Kerstens HH, Law AS, Megens HJ, Milan D, Nonneman DJ, Rohrer GA, Rothschild MF, Smith TP, Schnabel RD, Van Tassell CP, Taylor JF, Wiedmann RT, Schook LB, Groenen MA: Design of a high density SNP genotyping assay in the pig using SNPs identified and characterized by next generation sequencing technology. PLoS One. 2009, 4 (8): e6524-10.1371/journal.pone.0006524.
Szpiech ZA, Jakobsson M, Rosenberg NA: ADZE: a rarefaction approach for counting alleles private to combinations of populations. Bioinformatics. 2008, 24 (21): 2498-2504. 10.1093/bioinformatics/btn478.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007, 81 (3): 559-575. 10.1086/519795.
Felsenstein J: PHYLIP-phylogeny inference package (version 3.2). Cladistics. 1989, 5: 164-166.
Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D: Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006, 38 (8): 904-909. 10.1038/ng1847.
Reich D, Thangaraj K, Patterson N, Price AL, Singh L: Reconstructing Indian population history. Nature. 2009, 461 (7263): 489-494. 10.1038/nature08365.
Wright S: The Genetical structure of populations. Ann Eugen. 1951, 15: 323-354.
Alexander DH, Novembre J, Lange K: Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009, 19 (9): 1655-1664. 10.1101/gr.094052.109.
Shriver MD, Kennedy GC, Parra EJ, Lawson HA, Sonpar V, Huang J, Akey JM, Jones KW: The genomic distribution of population substructure in four populations using 8,525 autosomal SNPs. Hum Genomics. 2004, 1 (4): 274-286. 10.1186/1479-7364-1-4-274.
Groenen MA, Archibald AL, Uenishi H, Tuggle CK, Takeuchi Y, Rothschild MF, Rogel-Gaillard C, Park C, Milan D, Megens HJ, Li S, Larkin DM, Kim H, Frantz LA, Caccamo M, Ahn H, Aken BL, Anselmo A, Anthon C, Auvil L, Badaoui B, Beattie CW, Bendixen C, Berman D, Blecha F, Blomberg J, Bolund L, Bosse M, Botti S, Bujie Z, et al: Analyses of pig genomes provide insight into porcine demography and evolution. Nature. 2012, 491 (7424): 393-398. 10.1038/nature11622.
Kent WJ: BLAT–the BLAST-like alignment tool. Genome Res. 2002, 12 (4): 656-664. 10.1101/gr.229202. Article published online before March 2002.
Bindea G, Mlecnik B, Hackl H, Charoentong P, Tosolini M, Kirilovsky A, Fridman WH, Pages F, Trajanoski Z, Galon J: ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics. 2009, 25 (8): 1091-1093. 10.1093/bioinformatics/btp101.
This work was supported by Program for Changjiang Scholars and Innovative Research Team in University (IRT1136) and National Key Research Project of China (2013ZX08006-05) to JR.
The authors declare that they have no competing interests.
JR conceived and designed the experiment. JL and XX performed the experiment. HA and JR analyzed the data. BY and HC contributed reagents/materials/analysis tools. JR and HA wrote the paper. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 2: Figure S1: The neighbor-joining tree of all tested breeds and Tibetan pig populations based on genome-wide allele sharing. The tree illustrates a clear evolution split between Chinese and Western pigs. It also shows obvious genetic differentiation among Tibetan pig populations that usually group together with their geographic neighbors. Two Chinese synthetic breeds including Sutai and Lulai define intermediate branches between Chinese and Western groups. Such intermediate branches were also observed for Chinese Licha and Kele pigs, corresponding to our previous findings of the historical introgression of Western pigs into the two Chinese populations . Tibet 1, the Tibetan pig from Gongbujiangda in the Tibet Autonomous Region. Tibet 2, the Tibetan pig from Milin in the Tibet Autonomous Region. (TIFF 5 MB)
Additional file 3: Figure S2: Principle components analysis of all tested breeds and Tibetan pig populations in the present study. Principal component (PC) 1 (x-axis) verus PC2 (y-axis). PC1 clearly discriminates Chinese and Western pigs. PC2 separates Chinese pigs including Tibetan populations in a manner corresponding to their geographic locations. The 5 Tibetan pig populations highlighted in shade exhibit genetic similarity to their geographic neighbors. The two synthetic breeds (Sutai and Lulai) and two admixed breeds (Licha and Kele) show consistent signals of admixture with Western pigs. (TIFF 339 KB)
Additional file 4: Figure S3: Population structure of each population revealed by the ADMIXTURE software. At K = 8, the ancestry structures of Tibetan pigs from Sichuan and Yunnan were nearly identical, differing from those of Tibetan pigs from Tibet and Gansu. From K = 2 to K = 5, ~80% of the Western pig genomes were assigned to Western wild boars. Of note, about 20% of Chinese ancestry was consistently observed in Landrace and Large White, suggesting a historical admixture between Chinese and Western pigs. The observation is in agreement with the previous report of a ~35% Asian fraction in Western pigs according to the whole genome sequence data . The abbreviation of each breed is identical to that given in the legend of Figure 1. (TIFF 12 MB)
Additional file 5: Figure S4: TreeMix plot for residual fit from the maximum likelihood tree without migration events. (TIFF 738 KB)
Additional file 6: Table S2: SNP outliers and candidate genes for high-altitude adaptation in each and all geographic populations of Tibetan pigs. (XLSX 132 KB)
Additional file 7: Figure S5: A venn diagram showing shared and distinct candidate genes in each and all geographic populations of Tibetan pigs. Numbers indicating how many genes belong to each of Tibetan populations are shown in the graph. (TIFF 488 KB)
Additional file 8: Figure S6: Patterns of selection signatures within three well-characterized hypoxia genes in Tibetan pigs. (A) Distribution of LSBL values within the target regions. LSBL values are plotted along the y-axis, and the threshold indicating signature of selection is denoted with a dashed grey line. The candidate gene (EPAS1, EGLN1 and ADAM17) names and their corresponding regions are indicated below each panel. (B) Allele frequencies of the two outlier SNPs at the EPSA1 regions in a panel of Chinese indigenous pig populations. The breed codes are identical to those given in the legend of Figure 1. (TIFF 747 KB)
Additional file 9: Table S3: Genotypes of 56 SNP markers within 3 well- characterized hypoxia genes (EGLN, EPAS1 and ADAM1) in 324 Chinese indigenous pigs. (XLSX 122 KB)
About this article
Cite this article
Ai, H., Yang, B., Li, J. et al. Population history and genomic signatures for high-altitude adaptation in Tibetan pigs. BMC Genomics 15, 834 (2014). https://doi.org/10.1186/1471-2164-15-834