Genome-wide investigations reveal the population structure and selection signatures of Nigerian cattle adaptation in the sub-Saharan tropics
BMC Genomics volume 23, Article number: 306 (2022)
Cattle are considered to be the most desirable livestock by small scale farmers. In Africa, although comprehensive genomic studies have been carried out on cattle, the genetic variations in indigenous cattle from Nigeria have not been fully explored. In this study, genome-wide analysis based on genotyping-by-sequencing (GBS) of 193 Nigerian cattle was used to reveal new insights on the history of West African cattle and their adaptation to the tropical African environment, particularly in sub-Saharan region.
The GBS data were evaluated against whole-genome sequencing (WGS) data and high rate of variant concordance between the two platforms was evident with high correlated genetic distance matrices genotyped by both methods suggestive of the reliability of GBS applicability in population genetics. The genetic structure of Nigerian cattle was observed to be homogenous and unique from other African cattle populations. Selection analysis for the genomic regions harboring imprints of adaptation revealed genes associated with immune responses, growth and reproduction, efficiency of feeds utilization, and heat tolerance. Our findings depict potential convergent adaptation between African cattle, dogs and humans with adaptive genes SPRY2 and ITGB1BP1 possibly involved in common physiological activities.
The study presents unique genetic patterns of Nigerian cattle which provide new insights on the history of cattle in West Africa based on their population structure and the possibility of parallel adaptation between African cattle, dogs and humans in Africa which require further investigations.
African indigenous cattle are considered to be the most desirable livestock by small scale farmers in the continent due to their vast economic benefits. These ranges from meat, milk, drought power, source of leather, manure and bride price. The origin of the domesticated cattle can be traced to around 10,000 years before present (YBP) in Southwest/South Asia, and in West Asia for indicine and taurine cattle, respectively [1, 2], before their migration to other parts of the globe. In Africa, the earliest group of cattle known to have migrated into the continent were the Bos taurus taurus circa 7,000 – 4,000 YBP and later the Bos taurus indicus circa 4,000 – 2,000 YBP from their domestication centers [3, 4].
A large number of African taurine are found in West Africa . However, few taurine cattle are also present in other parts of Africa, for instance the taurine Sheko from Ethiopia in East Africa  with majority of them currently considered to be crossbreds  as compared to other types of African taurine particularly the Muturu breed of West Africa . Several cattle population in Nigeria are a mixture of both taurine and zebu ecotypes [5, 8]. They are suggested to have been introduced circa 1,400 YBP [3, 9] from eastern Africa which is hypothesized to be the entry point of zebus in Africa from Asia during the Indian-ocean maritime trade .
In the past decade, the advancement of genomic technologies has made it possible to analyze the genomic DNA of individuals using WGS and GBS [11, 12]. The GBS technology involves the use of restriction enzymes (REs) to select targeted polymorphic genomic regions for reduced representation , lowering genome associated data complexes, and sequencing costs , thereby enabling large numbers of individuals to be sequenced . A comparative performance ratio of GBS relative to WGS in mammals such as cattle has not yet been fully investigated, although a similar approach has been applied in pigeon birds . However, the assessments of GBS and SNP chip panels have been reported in plants and animals [16,17,18,19].
In this study, we evaluate the efficacy of datasets generated by GBS versus WGS and the subsequent application in downstream population genetics analyses. To the best of our knowledge, we applied GBS approach for the first time in assessing the genetic diversity and adaptation mechanism of cattle samples from Nigeria. Several studies have been carried out to analyze the population structure and selection signatures of modern African cattle [7, 20], however, the genomes of cattle samples from Nigeria has not been intensively explored. Our findings present the unique genetic patterns of Nigerian indigenous cattle in sub-Saharan West Africa.
Comparative analysis based on newly generated data from five selected Nigerian samples
The 5-sample dataset generated a total of 924,152 and 23,955,622 GBS and WGS SNPs, respectively, of high-quality filtered genotypes (more than 95% genotype call rate) for comparisons (Additional file 1: Figures S1 and S2). However, 462,823 variants equivalent to 49.4% of all the total loci called by GBS were reliable for evaluation. Out of these, 93.05% equivalent to 430,635 variants were found in common with WGS dataset while 430,574 of them were concordant at a 99.99% concordance rate i.e., similar alternative alleles at the same loci with WGS dataset (Additional file 1: Figure S1; Additional file 1: Figure S3). Aside from this, GBS dataset showed elements of novelty such that a proportion of 6.95% (32,188 sites) of all the variant loci had partial novel sites since 3% of them (987 sites) still overlapped with WGS dataset leaving 97% (32,201 sites) completely novel i.e., do not overlap with WGS dataset. On the other hand, we have unfortunately observed a major shortcoming in GBS technology. Despite the fact that the detected variants (single nucleotide polymorphisms, SNPs) were highly concordant with WGS, genotype matches between GBS and WGS were unsatisfactorily low and not consistent, ranging between 92.8% and 52.1% in the calling of heterozygous (RA) and homozygous non-reference (AA) genotypes, respectively(see details in in Additional file 1 containing the Supplementary Notes). Such a relatively low rate of genotype matches is not an unexpected scenario for low sequencing coverage data.
The merged GBS and published WGS datasets for population genetics analyses
The genomes of 193 cattle sampled from Nigeria (Fig. 1) were sequenced to generate ~ 1.1 billion clean reads with an average of ~ 4.87 × depth coverage that ranged between 4.33 to 6.28 reads per individual (Additional file 2: Table S1). The reads were then aligned to the taurine reference genome (B. taurus UMD 3.1) at an average mapping rate of 99.13% and jointly merged and genotyped with 75 publicly available genomes [20,21,22,23] (Additional file 1: Table S2). After merging our dataset of the 193 Nigerian cattle genomes containing 3,282,427 SNPs with the additional 75 genomes containing 22,197,616 SNPs, a total of 268 cattle samples with 649,577 common biallelic SNPs of high genotyping rate (0.93), was retained for downstream analyses. The published cattle samples used in this study were classified according to their ecotypes and geographical locations as follows: African taurine (including 8 Muturu and 7 N’Dama), African humped cattle (included 10 Boran, 8 Kenana, and 9 Ogaden), African Sanga (12 Ankole), European taurine (7 sampled Holstein breed individuals) and Asian cattle samples (5 pure Asian zebu and 3 Asian B. indicus X B. taurus crossbreeds) and the outgroup samples (represented by 5 B. javanicus and one Bubalus bubalis) (Additional file 1: Figure S4).
Population structure and genetic diversity of Nigerian cattle
The structure of the cattle populations was extrapolated by principal component analysis (PCA), admixture and phylogenetic tree. PCA results depict three major clusters of cattle (Fig. 2a) and showed agreement of the three major lineages of cattle (Fig. 2b) that include the European B. taurus (blue), the African B. taurus (represented by Muturu in green and N’Dama in brown) and Asian B. indicus (orange) [24, 25]. The clustering of Nigerian cattle together with East African zebu and at intermediate position between pure B. taurus and Asian B. indicus possibly suggest the admixed (B. taurus X B. indicus) background of the majority of African zebu cattle  and that Nigerian cattle could be more of the zebu background well supported by paternal marker studies [5, 26]. Principal Component (PC)1 explained 9.169% of the total variation, separates all zebu cattle populations from taurine; European (Holstein) and African (N’Dama and Muturu) taurine in Fig. 2a while PC2 which explained 4.955% of the total variation depicts geographical partitioning between the Nigerian and East African cattle populations. Noticeably, while the distinction between Nigerian cattle and other populations captured in PC2 on Fig. 2a may be affected by the genotyping effect of GBS, it is consistent with previous maternal studies . PC1 and PC2 extrapolate those three clusters of cattle observed in this study as follows: first, all zebu together with their crossbred clustered together, the second cluster contained only the taurine cattle, and third is Nigerian cattle appearing as a distinct cluster. The concatenated neighbour joining (NJ) phylogenetic tree plotted using the 193 GBS samples and all the reference populations has given a consistent story that Nigerian cattle are of crossbred background having clustered in the same clade with Ankole (Fig. 2c).
Generally, PCA depicts the unique genetic constitution of Nigerian zebu cattle from other African zebu or African hybrids (Additional file 1: Figure S4) clearly supported by the admixture plot (Fig. 2d), reflecting an important genetic resource reservoir . Furthermore, the unsupervised clustering approach results (Fig. 2d) indicate K = 3 as the optimum ancestral populations for inferring the genetic structure and admixture (Fig. 2e) which corresponds to the three cattle clusters. Both admixture and PCA analyses inferred the population of Nigerian zebu cattle is homogenous lacking clear genetic structure albeit with minimal levels of admixture.
The genetic diversity results are shown in Fig. 3a. Nigerian zebu cattle together with all zebu cattle from Africa depict high genetic diversity compared to taurine cattle populations which is not unexpected [7, 20]. Muturu, indigenous Nigerian taurine cattle among other taurine showed the lowest level of genetic diversity.
Maximum likelihood (ML) tree was constructed by TreeMix software in order to infer evidence of gene flow between cattle populations from Europe, Asia and Africa using sequentially migration edges (1 to 4) (Fig. 3b; Additional file 1: Figures S5-6). Our findings showed the possibility of gene flow from N’Dama, the African taurine cattle breed towards Nigerian zebu cattle and from European taurine to Asian cattle in two steps migration event. Nonetheless, at the migration edge 4, we also observe a possible gene flow between Nigerian and European cattle. Furthermore, we also assessed the introgression between Nigerian cattle and other cattle populations from different geographical settings complementing TreeMix results as displayed in Additional file 1: Table S3. We used the nonparametric ‘ABBA-BABA’ test to examine the exchange of genetic variation between two divergent populations from their individual genomes in particularly to study the admixture in Nigerian zebu cattle genomes. Our findings showed significant evidence of admixture for three population pairs involving the Nigerian zebu cattle exclusively: European taurine and Nigerian zebu cattle, African taurine and Nigerian zebu cattle, and East African zebu and Nigerian zebu cattle. These findings are coherent with TreeMix analysis and also complemented by population structure which showed admixture in Nigerian zebu cattle at rather low degree (Fig. 2d). Due to the geographical proximity between Nigerian cattle and the African taurine, hybridization between the two cattle populations is definitely eminent [27, 28].
Signatures of selection
Signatures of selection was computed by two different approaches: the composite likelihood ratio (CLR) approach implemented in SweeD and PBS (Additional file 2: Tables S4-8) using UMD3.1 reference genome and the concordance with ARS-UCD1.2 genome assembly was assessed afterwards (Table 1). For CLR test, focusing on only Nigerian zebu cattle, we obtained 240 positively selected genes (PSGs) in the 1% threshold level (Additional file 2: Table S4). We also used PBS to compute for genes that are under selection in Nigerian zebu cattle by testing different computational scenarios (Additional file 2: Tables S5-8). In all scenarios, Nigerian zebu cattle were considered as the target population while B. bubalis and B. javanicus were the outgroups.
Using PBS approach in scenario one (PBS 1), 2027 PSGs were detected in Nigerian zebu cattle following their separation from the common ancestor with European cattle (Additional file 2: Table S5). In the second category (PBS 2), 2029 PSGs were identified in Nigerian zebu cattle following the divergence from their Asian zebu counterpart (Additional file 2: Table S6). We then carried out a third scenario (PBS 3) by comparing Nigerian zebu cattle against cattle from both Europe and Asia (Euro-Asian), where 2021 PSGs were detected (Additional file 2: Table S7). The final scenario (PBS 4) was carried out to determine evidence of domestication signatures in Nigerian zebu cattle in relation to other African cattle populations of zebu lineage, as such, 2031 PSGs were contemporarily unveiled (Additional file 2: Table S8). In all these four scenarios, PBS generated a total of 2674 PSGs in Nigerian cattle (Fig. 4a and Additional file 2: Tables S5-8). Since the contrasted groups (Nigerian cattle exclusively as the target population against other populations) are genotyped by different methods (GBS and WGS), the estimation of allele frequencies between populations through PBS procedures was importantly addressed by using the intersected genomic regions that would limit false positives. The merging of the two datasets was done at the very beginning of the data analysis prior to downstream analyses as described in the methodology section “Data merge”. Nonetheless, using the same five (5) samples, the estimated alleles genotyped by both methods either by GBS or WGS depicted high correlation (r = 0.9999355, p-value = 6.22e-07, Pearson's product-moment correlation test) supporting the viability of GBS data. Basically, the windows of the 1% threshold level from PBS and CLR test included in total 2674 and 240 PSGs, respectively, indicating that 2613 and 179 (totaling 2792 PSGs) were unique to PBS and CLR, respectively, whereas 61 of all the detected PSGs overlapped in both analyses (Fig. 4a). Moreover, these private genes (2792 PSGs) obtained between CLR and PBS in Nigerian cattle genomes were further utilized to understand the possibility of convergent adaptation between African cattle, dogs and humans (Fig. 4b) co-existing in similar West African environment.
Functional enrichment analysis was then conducted to determine the biologically enriched pathways in Nigerian zebu cattle (Table 2 and Additional file 2: Tables S9-13). Among the 61 overlapping PSGs detected by both PBS and CLR, the gene IL12A was highly overrepresented or expressed more in biological pathways which could be associated with physiological roles related to the immune system of the host (Table 2). The latter gene and LAMA4 are possibly associated with the immunity against African trypanosomiasis, the endemic tropical cattle disease in tsetse infested areas. Furthermore, the functional enrichment analysis using the 240 selective genes detected by CLR test analysis alone showed evidence of 17 annotated PSGs which could be involved in biological processes, KEGG and cellular components’ pathways (Additional file 2: Table S9). Among these genes are three protein-coding genes IL12A , DLGI  and PIK3CB  related to the host immune and ATOH8 which could be linked to reproduction  (Table 1 and Fig. 4c). Two of the other protein-coding genes could be linked to more than one trait, which include ATP1A1 known to play a role in efficiency of feeds utilization and tolerance to environmental thermal stresses [33, 34] and SERPINE2  associated with growth and development of skeletal muscles  and also possibly involved in immune regulation of the host through release of immunoglobulins . Notably, PBS functional enrichment analyses using the gene list in Additional file 2: Tables S5-8 revealed several signals of positive selection (Additional file 2: Tables S10-13) likely associated with important local environmental adaptations (Table 1 and Additional file 1: Figures S7, S8, S9 and S10). These included genes that offer the host’s immune against tropical parasitic diseases such as the African trypanosomiasis, a common disease in cattle known to infect many of the zebu cattle breeds (trypano-susceptible) than the taurine cattle (trypano-tolerant) which possess the disease’s resistance mechanism. An example of this gene is MYD88  (Table 1 and Additional file 1: Figure S7 and S8). In this category, we also found ROBO2 gene which has been previously reported to provide immune response against Newcastle disease in chicken . Other important genes detected includes those related to the regulation of developmental processes (GO:0,050,793) such as coat colour phenotypic traits, for instance KITLG and KIT [20, 40]. Some of the genes present in 1% windows threshold were also linked to resistance to tick infestation (SPAST  and BOLA ) and control of Plasmodium falciparum (IL4 ) (Additional file 1: Figure S8).
Moreover, we unraveled the occurrence of candidate genes putatively involved in convergent selection in African human and domestic animals such as cattle and dogs by comparing the enrichment output of the overlapping genes between our 2792 unique PSGs (Fig. 4a) and published gene lists for African humans , N’Dama, the African taurine  and African dogs  (Fig. 4b). Notably, in computing for the evidence of convergent selection, both the shared (61) and the unique (2792) gene lists were employed. However, only the unique genes could unveil the common genes under selection among species with 97 PSGs common to at least 2 species (Fig. 4b).
GO enrichment analysis using the 41 overlapping annotated genes between African cattle (Nigerian zebu cattle) and humans among which 7 overlapped also with dogs (Fig. 4b, revealed significant enriched biological pathways (Fig. 4d). Following this procedure, one of the enriched PSGs ITGB1BP1 identified, when blasted on BGVD (Bovine Genome Variation Database)  revealed a GO term’s description: regulation of GTPase activity which confers to a common biological pathway between African cattle, humans and dogs as previously speculated [42, 44]. SPRY2 PSG was also found as an evidence of common ways of adaptation between African N’Dama (B. taurus lineage)  and Nigerian zebu cattle (B. indicus lineage) when enrichment analysis was performed using their 29 overlapping PSGs (Fig. 4b) depicting a similar GO description: regulation of GTPase activity and also a similar GO:0,005,829 term (Fig. 4d) as reported previously in African humans .
Evaluation of Genotyping-by-sequencing
To explore the potential of GBS approach, we applied it to understand the population genomics of Nigerian zebu cattle through the analysis of genetic structure, and signatures of selection. We firstly generated optimum evidence of its applicability on cattle genomic studies by comparing it with WGS. We obtained high rates of concordance for the variants calls (SNPs) between GBS and WGS datasets (Additional file 1: Figure S11), which exhibited similar patterns of genetic structure (Additional file 1: Figure S12) even though they differed in their allelic distribution (Additional file 1: Figure S13) and had low genotypes matching rate (putatively associated with the low sequencing depth of GBS, Additional file 2: Table S1). Based on the concordance of the variant calls, GBS could be reliable for its applicability in cattle genomics studies , even if its use without imputation may compromise the estimation of some genetic parameters [47, 48]. Consequently, we noticed that GBS in some instances failed to yield accurate genetic variation estimate as observed in PC 1 (Additional file 1: Figure S12a), where the five Nigerian cattle individuals typed by GBS cluster closer to Ankole than to WGS Nigerian genotypes. This genotyping effect is coherent with the low proportion of genotype match between GBS and WGS.
Genetic variation and evidence of introgression in Nigerian zebu cattle
Our study also revealed that the genetic diversity of Nigerian zebu cattle similarly to other zebus in Africa is higher compared to taurine as previously speculated . Both versions of the references genomes depicted similar pattern of the genetic diversity (Additional File 1: Figure S14). The slight degree of admixture observed by the admixture analysis may have been mediated by the gene flow from other cattle populations such as the taurine N’Dama, as evidenced by TreeMix and introgression analyses. These two cattle populations (Nigerian cattle and N’Dama) occupy the same geographical region of West Africa hence hybridization between the two is eminent. Despite the low degree of admixture detected in cattle from Nigeria, they still clustered as a homogenous population with lack of genetic structuring as recently observed in studies based on matrilineal genetic markers  and bovine high density SNP data . The lack of genetic structuring observed in Nigerian cattle could be possibly due to productivity and fitness selective pressures  a similar scenario previously observed in Borgou breed, a bovine hybrid population from Benin . Figure 1 shows some morphological differences in Nigerian cattle. However, despite these morphological disparities many of them display no structure probably due to lack of genetic differentiation. In some instances, some of the Nigerian cattle individuals show more of the similarities with taurine (indicated in blue) in the admixture plot (Fig. 2d) at optimum K = 3. Taken altogether our findings suggest evidence of introgressed taurine alleles into the gene pool of Nigerian zebu cattle.
Farmers in West Africa prefer crossing of African taurine and zebu in order to formulate a crossbreed cattle popularly known as Méré that possess combined genetic attributes of both disease tolerance and production traits [20, 27]. However, introgression from African taurine to Nigerian zebu cattle has not been fully established based on our D-statistics analysis (Additional file 1: Table S3). Zebu cattle have long been considered the African dairy and/or beef cattle  due to their high levels of milk production and large body size adopted for meat production and draught adaptive traits [6, 28, 50]. The African taurine for instance the Muturu and N’Dama cattle are known for their small size [23, 45] a feature that confers their low body size as compared to zebu cattle albeit they possess high tolerance to enzootic diseases such as trypanosomiasis and dermatophilosis prevalent in the Sub-humid region of West Africa and also less susceptible to tick‐borne diseases compared to zebu . Furthermore, a phylogenetic concatenated NJ tree (Additional file 1: Figure S15) using the five WGS samples (Additional file 2: Table S14), supports closer relationship with zebus (by clustering closer to Boran), which could result from crossbreeding, as suggested by the proximity with Ankole, the well-known crossbred population (Fig. 2c). On the other hand, the high genetic diversity or variation scenario observed in African cattle populations including the Nigerian cattle has been reported consistently throughout the African continent based on matrilineal and autosomal genetic analyses [26, 27, 52]. Notably, the lack of genetic structure or low genetic differentiation observed in the current study, as is the majority of African cattle populations , reflects the random mating of cattle populations in Africa with low practice of artificial selection as compared to breeding practices in other regions such as Europe .
Domestication impacts and adaptation in sub-Saharan tropics
The investigation of signatures of selection for the Nigerian zebu cattle was to elucidate and update information on the adaptive traits of cattle in the tropics particularly in the sub-Saharan region of West Africa. In most cases tropical environments are usually characterized by diseases, poor forage, high temperatures, exposure to ultraviolet and inappropriate management policies which are mostly observed in developing countries [44, 53]. Previous studies have shown that modern humans and domesticated animals share imprints of evolution in their genomes acquired during domestication especially when occupying sympatric geographical regions [44, 54]. Some important genes such as ADGRE1  and ASIP  have been identified to be involved in the evolution of both human and dogs or water buffalo and domestic cattle, respectively. In this study, we also identified other genes commonly involved in the evolution of African cattle, dogs and human.
Our study observed that Nigerian zebu cattle facing similar environmental challenges common in West Africa such as trypanosomiasis have also contemporarily developed similar immune response like the African taurine against these prevailing challenges [7, 56]. PBS and CLR test approaches have both indicated PSGs including IL12A, LAMA4, MYD88, SPAST and BOLA to confer resistance mechanisms towards the African trypanosomiasis and tick infestation [7, 20, 29, 38]. Nonetheless, several other PSGs such as IL4 associated with the control of malaria, one of the most prevalent tropical diseases in Africa  was also observed in this study conferring its role of malaria resistance in Nigerian zebu cattle genomes presently in West Africa. The African continent is characterized with its unique adverse conditions such as high temperatures. Nigerian zebu cattle in particular may have also developed adaptability mechanism towards such conditions such as KITLG and KIT [20, 40] which control coat colour phenotypes and the regulation of physiological temperature in the tropics possibly in a similar mutual fashion with the hair cell differentiation and blood circulation observed in Chinese zebu cattle . Nonetheless, it is worth to mention some of the genes such as ECI1 and RNPS1 present in the highest 1% PBS value of the outlier windows (Additional file 2: Tables S5-8). Based on their physiological function information retrieved from BGVD (Bovine Genome Variation Database) , these genes are related to metabolism of both catabolic and anabolic processes.
Notably, in the attempt to detect signatures of selection using GBS approach, it has further extended the evidence of concordance between WGS and GBS. Our study observed that GBS detected similar imprints of adaptation such as KIT on BTA 6 , SPAST on BTA 11  and BOLA on BTA 23  as previously unveiled by WGS data.
To disclose the possibility of shared aspects of domestication or convergent adaptation between African human, cattle and dogs present in West African part of the continent, regions of PSGs from humans , and dogs  were compared with our Nigerian zebu cattle dataset and N’Dama, the African taurine cattle  for their possible common physiological functions. When comparing PBS to CLR (SweeD), PBS computation was conducted with less stringent threshold yielding a huge evidence of PSGs imprinted in Nigerian cattle genomes, even if we cannot exclude some degree of false positives. Some of the identified PSGs conferring to convergent aspects of adaptation include ITGB1BP1 (Integrin Subunit Beta 1 Binding Protein 1) and SPRY2 (Sprouty RTK Signaling Antagonist 2) found to overlap between Nigerian zebu cattle, humans and N’Dama, the African taurine (Fig. 4d). The SPRY2 gene is important in embryo development in African taurine , and it is also involved in the regulation of GTPase activity together with the ITGB1BP1. We speculate that these two genes may have similar biological function with the ADGRE1 gene which is as well involved in GTPase regulator activity in African dogs  as it is in African humans . The shared physiological function of GTPase in cattle, dogs and human confers to the probable shared evolutionary aspects with African humans in playing defensive mechanism towards Malaria as previously unveiled . Notably, this hypothetical narrative may hold true since these three species co-exist in tropical environments hinting at their possible shared evolutionary aspects. We therefore, suggest further investigation of the disease immune mechanism associated with the SPRY2 and ITGB1BP1 PSGs towards Malaria.
This study reports the current genetic status and new insights on the adaptation of zebu cattle in sub-Saharan region of West Africa using GBS approach. The unique population structure of Nigerian zebu cattle observed serves as an important genetic resource in West Africa. We discovered the possibility of parallel or convergent adaptation among African human and domestic animals and that Nigerian zebu cattle might have acquired disease tolerance traits endemic in West Africa like their African taurine counterpart. Our study tried to investigate whether the identified PSGs in Nigerian cattle could result from convergent selection with other species occupying the same environmental conditions with regard to tropical diseases, high temperatures, scarcity of water and even of forages to mention a few. However, our finding may be speculative due to a number of reasons for instance the low sequencing coverage by GBS, or the integration of two datasets from two different genotyping platforms (only Nigerian cattle data was generated through GBS). Therefore, more efforts are still needed to determine and characterize the mechanisms of convergent adaptation in particular those conferring to resistance of diseases such as those endemics in West Africa in order to inform appropriate strategies befitting conservation, survivability of livestock, production improvement and applicability in biomedical research models for human related diseases.
Materials and Methods
Whole-blood samples (10 ml) were collected from 193 cattle coming from eight different States in Nigeria as follows (Fig. 1): Kaduna (n = 36), Kano (n = 6), Katsina (n = 7), Oyo (n = 2), Plateau (n = 50), Sokoto (n = 37), Taraba (n = 44), and Zamfara (n = 11). Genomic DNA extractions were performed following phenol–chloroform method  at Kunming Institute of Zoology, Chinese Academy of Sciences (CAS). The extracts were quantified using the Thermo Scientific™ NanoDrop 2000 spectrophotometer in order to assess purity of the extracted DNA. Furthermore, the DNA extracts were checked for molecular quality by running them through a 2% agarose gel against a 2 kilobase (kb) DNA ladder marker. The 193 cattle samples were sequenced using GBS platform. We further selected five samples for WGS for the evaluation of GBS and WGS platforms (Please, refer to Supplementary Notes in Additional file 1 for more details).
Next-generation sequencing of the GBS data
Briefly, the DNA PCR extracts were then sent to Bejing Novogene (https://en.novogene.com/), where the GBS approach was carried out following the GBS protocol . The GBS DNA library was prepared using 500 ng of DNA from each individual in 96-well plates before applying REs for genome reduced representation. Genomic DNA was then incubated at 37℃ with MseI (New England Biolabs, NEB), T4 DNA ligase (NEB), ATP (NEB), and MseI Y adapter N containing barcode. Fragment read length of 150 bases (PE150) were then sequenced using the Illumina HiSeq2500 platform with TruSeq SBS Kit v3-HS (Illumina). The whole genome-resequencing for five samples were also conducted at Beijing Novogene.
Sequence data analysis of GBS data
Illumina sequencing GBS data for 193 cattle genomes representing a wide diversity of cattle from Nigeria in West Africa were aligned to the cattle reference UMD 3.1 assembly  using BWA mem  with default parameters. Picard-tools -1.119 were used to sort the reads and to remove duplicates. The Genome Analysis Toolkit (GATK v3.8) [60, 61] was used to realign indels. Subsequently, SNPs were then detected by using UnifiedGenotyper  integrated in GATK.
The following hard filtration criteria were carried out using GATK v3.8 for the parameters: mapping quality rank sum test (MQRankSum), Fisher strand bias (FS), quality by depth (QD), the read position rank sum test (ReadPosRankSum) and phred score (GQ). The values for each parameter were QD > 2.0, FS < 60.0, MQ > 40.0, MQRankSum > -12.5, GQ > 20, QUAL > 50.0, ReadPosRankSum > -8.0, and ((MQ0 / (1.0 * DP)) < 0.1)” > ”. After filtration, only the high quality biallelic SNPs with genotyping call rate > 90% were retained for downstream analyses. The density of SNPs in each chromosome and the allelic distribution of minor alleles are provided in Additional file 1: Figures S16 and S17, respectively.
Concordance analysis between GBS and WGS datasets
We randomly selected five of the 193 cattle samples and re-sequenced them using WGS method in order to assess the accuracy of GBS. Common number of SNPs, and genotypes as generated by GATK were used for concordance evaluation. Notably, a Pearson correlation (r) method was used to determine the correlation between the computed distance matrices by GBS and WGS data using cor.test() test R function. More details of the evaluation assessment can be obtained in Supplementary Notes.
To perform population genetic analyses of the 193 genomes of Nigerian cattle GBS data, we also integrated 75 WGS genomes datasets publicly available from previous studies [20,21,22,23] representing B. taurus and B. indicus cattle of both African, European, and Asian lineages as well as B. javanicus and B. bubalis which define the outgroup. Only the overlapping genomic regions between GBS and WGS were considered using the merge parameter flag -intersection in GATK. Detailed information on the newly generated GBS data can be accessed in Additional file 2: Table S1. Geographical origin and other detailed information for each published cattle sample can be obtained in Additional file 1: Table S2.
Population genetic structure , admixture, and Genetic diversity
For PCA , EIGENSOFT software  was used to generate the principal components (PCs) from the filtered autosomal biallelic SNPs which were then plotted using R software. For the main figures of both PCA and admixture we used a dataset that excluded the outgroup, only cattle populations of zebu origin, African taurines and European taurines were used. Admixture analysis was performed using the unsupervised clustering method implemented in ADMIXTURE v1.3.0 software  and the resulting admixture proportions were plotted in Genesis software. We also computed for the genetic distances to construct a NJ population-level phylogenetic tree using autosomal genome data constructed by PLINK v1.9 software  and multiple sequence alignments were performed using Clustal W v2.1 Linux version . The resulting tree file was plotted by using MEGA X ver 10.1.7 software to surmise the evolutionary relationship between populations. Genetic diversity was also inferred from non-overlapping windows of 100 kb window size across the genome using VCFtools v0.1.12b software .
Inference of genetic admixture
The ML tree was computed following the proposed protocol in TreeMix v1.13 software  in order to determine admixture events and population splits. We were only interested in understanding how the Nigerian cattle gene pool has been influenced by other cattle populations. The algorithm was run for 1 to 4 migration edges and setting the B. bubalis as outgroup. The outputs were plotted using the R v4.0.2 software. We furthermore tested for the introgressions between populations by performing D-statistics analysis for all possible combinations .
Filtered biallelic SNPs from 193 samples were processed through VCFtools and GATK software bioinformatics tools with a genotyping rate of at least 90%, and minimum phred scaled genotype quality of 20 to produce a VCF file consisting of high-quality set of 3,282,427 SNPs. To avoid any significant false positives, the computation for the signatures of selection was done using the estimated allele frequencies from the common genomic regions between Nigerian cattle (GBS data) and other cattle populations (genotyped by WGS) merged together during PBS analyses. We used Sweep Detector (SweeD) tool that implements a composite likelihood ratio (CLR) test and the population branch statistics (PBS) [71, 72] to identify PSGs in Nigerian cattle. The CLR computation was carried out at 1000 grids (-grid 1000) to identify selective sweeps in Nigerian cattle genomes. PBS was estimated by using Wright’s FST statistics  in non-overlapping windows of 50 kb starting at the first variant, and in each consecutive 2 kb interval step until the last variant on each autosomal chromosome in four different approaches. In these PBS approaches, the first approach considered the European population to be the control group and the second one the cattle breeds from Asia to be the control group. Aside to this, we also combined European and Asian cattle populations as a single group called Euro-Asian to act as a control group in the third approach and the fourth approach considered the other African cattle populations of zebu descent excluding Nigerian cattle. In all scenarios Nigerian cattle were considered the target population and water buffalo and banteng were used as outgroups. We estimated changes in the allele of Nigerian cattle using PBS as follows:
where; PBS estimates the pairwise allele frequency (FST) between Nigerian cattle (N) and other cattle populations (OT) recorded from each of the four scenarios stated above and between these populations (N and OT) and the distantly related species (D) represented herein by the outgroup samples, the water buffalo and banteng. Sequentially, the divergence time (t) of Nigerian cattle from the other populations is also determined.
Annotation and functional enrichment
The annotation of the candidate regions was based on the B. taurus UMD 3.1 Gene Transfer Format file expressed by an extension (.gtf) from Ensembl release 90 . Functional enrichment analysis of the annotated PSGs was conducted using a statistical overrepresentation test in g: Profiler  based on the Gene Ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways  and the candidate gene information was also confirmed on the Bovine Genome Variation Database (BGVD) . A Bonferroni-corrected–adjusted P value of 0.05 was used as threshold level for statistical significance. The same protocol for accessing the adaptation information imprinted in the Nigerian cattle genomes was also implemented using ARS-UCD1.2 reference genome in order to assess the reliability of our findings.
Availability of data and materials
Bovine Genome Variation Database
Bos taurus Assembly
Chinese Academy of Sciences
Composite Likelihood Ratio
Genome Analysis Toolkit
Gene Transfer Format
Genome Sequence Archive
Kyoto Encyclopedia of Genes and Genomes
Population branch statistics
Principal component analysis
Positively selected genes
Whole genome sequencing
Years before present
Loftus RT, MacHugh DE, Ngere LO, Balain DS, Badi AM, Bradley DG, Cunningham EP. Mitochondrial genetic variation in European, African and Indian cattle populations. Anim Genet. 1994;25:265–71.
Bruford MW, Bradley DG, Luikart G. DNA markers reveal the complexity of livestock domestication. Nat Rev Genet. 2003;4:900–10.
Meghen C, MacHugh DE, Bradley DG. Genetic characteristics of West African cattle. World Anim Rev. 1994;78:59–66.
Freeman AR, Meghen CM, MacHugh DE, Loftus RT, Achukwi MD, Bado A, Sauveroche B, Bradley DG. Admixture and diversity in West African cattle populations. Mol Ecol. 2004;13:3477–87.
Hanotte O, Tawah CL, Bradley DG, Okomo M, Verjee Y, Ochieng J, Rege JE. Geographic distribution and frequency of a taurine Bos taurus and an indicine Bos indicus Y specific allele amongst sub-Saharan African cattle breeds. Mol Ecol. 2000;9:387–96.
Rege JEO. The state of African cattle genetic resources I. Classification framework and identification of threatened and extinct breeds. Anim Genet Resour Inf. 1999;251:1–25.
Kim K, Kwon T, Dessie T, Yoo D, Mwai OA, Jang J, Sung S, Lee S, Salim B, Jung J, et al. The mosaic genome of indigenous African cattle as a unique genetic resource for African pastoralism. Nat Genet. 2020;52:1099–110.
Perez-Pardal L, Sanchez-Gracia A, Alvarez I, Traore A, Ferraz JBS, Fernandez I, Costa V, Chen S, Tapio M, Cantet RJC, et al. Legacies of domestication, trade and herder mobility shape extant male zebu cattle diversity in South Asia and Africa. Sci Rep. 2018;8:18027.
Hanotte O, Bradley DG, Ochieng JW, Verjee Y, Hill EW, Rege JEO. African pastoralism: genetic imprints of origins and migrations. Science. 2002;296:336–9.
Gifford-Gonzalez D, Hanotte O. Domesticating animals in Africa. Implications of genetic and archaeological findings. J World Prehist. 2011;24:1–23.
Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One. 2011;6:e19379.
Rui Y, Chee KK, Jie Z. Whole genome sequencing analysis. In: Ranganathan S, Nakai K, Schönbach C, Gribskov M, editors. Encyclopedia of Bioinformatics and Computational Biology. Oxford: Elsevier; 2019. p. 176–83.
Davey JW, Hohenlohe PA, Etter PD, Boone JQ, Catchen JM, Blaxter ML. Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nat Rev Genet. 2011;12:499–510.
Clark MJ, Chen R, Lam HY, Karczewski KJ, Chen R, Euskirchen G, Butte AJ, Snyder M. Performance comparison of exome DNA sequencing technologies. Nat Biotechnol. 2011;29:908–14.
Pacheco G, van Grouw H, Shapiro MD, Gilbert MTP, Vieira FG. Darwin’s fancy revised: an updated understanding of the genomic constitution of pigeon breeds. Genome Biol Evol. 2020;12(3):136–50.
Bajgain P, Rouse MN, Anderson JA. Comparing genotyping-by-sequencing and single nucleotide polymorphism chip genotyping for quantitative trait loci mapping in wheat. Crop Sci. 2016;56:232–48.
Darrier B, Russell J, Milner SG, Hedley PE, Shaw PD, Macaulay M, Ramsay LD, Halpin C, Mascher M, Fleury DL, Langridge P, Stein N, Waugh R. A comparison of mainstream genotyping platforms for the evaluation and use of barley genetic resources. Front Plant Sci. 2019;10:544.
De Donato M, Peters SO, Mitchell SE, Hussain T, Imumorin IG. Genotyping-by- sequencing (GBS): a novel, efficient and cost-effective genotyping method for cattle using next-generation sequencing. PLoS One. 2013;8:e62137.
Elbasyoni IS, Lorenz AJ, Guttieri M, Frels K, Baenziger PS, Poland J, Akhunov E. A comparison between genotyping-by-sequencing and array-based scoring of SNPs for genomic prediction accuracy in winter wheat. Plant Sci. 2018;270:123–30.
Kim J, Hanotte O, Mwai OA, Dessie T, Bashir S, Diallo B, Agaba M, Kim K, Kwak W, Sung S, et al. The genome landscape of indigenous African cattle. Genome Biol. 2017;18:34.
Lee HJ, Kim J, Lee T, Son JK, Yoon HB, Baek KS, Jeong JY, Cho YM, Lee KT, Yang BC, et al. Deciphering the genetic blueprint behind Holstein milk proteins and production. Genome Biol Evol. 2014;6:1366–74.
Chen N, Cai Y, Chen Q, Li R, Wang K, Huang Y, Hu S, Huang S, Zhang H, Zheng Z, et al. Whole-genome resequencing reveals world-wide ancestry and adaptive introgression events of domesticated cattle in East Asia. Nat Commun. 2018;9:2337.
Tijjani A, Utsunomiya YT, Ezekwe AG, Nashiru O, et al. Genome sequence analysis reveals selection signatures in endangered trypanotolerant West African Muturu cattle. Front Genet Front Genet. 2019;10:442.
Bahbahani H, Salim B, Almathen F, Al Enezi F, Mwacharo JM, Hanotte O. Signatures of positive selection in African Butana and Kenana dairy zebu cattle. PLoS One. 2018;13:e0190446.
Verdugo MP, Mullin VE, Scheu A, Mattiangeli V, Daly KG, Maisano Delser P, Hare AJ, Burger J, Collins MJ, Kehati R, et al. Ancient cattle genomics, origins, and rapid turnover in the Fertile Crescent. Science. 2019;365:173–6.
Mauki DH, Adeola AC, Ng’ang’a SI, Tijjani A, Mark AI, Sanke OJ, Abdussamad AM, Olaogun SC, Ibrahim J, Dawuda PM et al. Genetic variation of Nigerian cattle inferred from maternal and paternal genetic markers. PeerJ. 9:e10607 https://doi.org/10.7717/peerj.10607.
Flori L, Thevenon S, Dayo GK, Senou M, Sylla S, Berthier D, Moazami-Goudarzi K, Gautier M. Adaptive admixture in the West African bovine hybrid zone: insight from the Borgou population. Mol Ecol. 2014;23:3241–57.
Tano K, Kamuanga M, Faminow MD, Swallow B. Using conjoint analysis to estimate farmers’ preferences for cattle traits in West Africa. J Ecol Econ. 2003;45:393–407.
Watford WT, Moriguchi M, Morinobu A, O’Shea JJ. The biology of IL-12: coordinating innate and adaptive immune responses. Cytokine Growth Factor Rev. 2003;14:361–8.
Nicolaou SA, Neumeier L, Steckly A, Kucher V, Takimoto K, Conforti L. Localization of Kv1.3 channels in the immunological synapse modulates the calcium response to antigen stimulation in T lymphocytes. J Immuno. 2009;183:6296–302.
Wojciechowska-Durczynska K, Krawczyk-Rusiecka K, Cyniak-Magierska A, et al. The role of phosphoinositide 3-kinase subunits in chronic thyroiditis. Thyroid Res. 2012;5:22.
Fang F, Wasserman SM, Torres-Vazquez J, Weinstein B, Cao F, Li Z, Wilson KD, Yue W, Wu JC, Xie X, Pei X. The role of Hath6, a newly identified shear-stress-responsive transcription factor, in endothelial cell differentiation and function. J Cell Sci. 2014;127(Pt 7):1428–40.
Barendse W, Reverter A, Bunch RJ, Harrison BE, Barris W, Thomas MB. A validated whole-genome association study of efficient food conversion in cattle. Genetics. 2007;176:1893–905.
Liu Y, Li D, Li H, Zhou X, Wang G. A novel SNP of the ATP1A1 gene is associated with heat tolerance traits in dairy cows. Mol Biol Rep. 2011;38:83–8.
Carter RE, Cerosaletti KM, Burkin DJ, Fournier RE, Jones C, Greenberg BD, Citron BA, Festoff BW. The gene for the serpin thrombin inhibitor (PI7), protease nexin I, is located on human chromosome 2q33-q35 and on syntenic regions in the mouse and sheep genomes. Genomics. 1995;27:196–9.
Raymond F, Metairon S, Kussmann M, Colomer J, Nascimento A, Mormeneo E, et al. Comparative gene expression profiling between human cultured myotubes and skeletal muscle tissue. BMC Genomics. 2010;11:125.
Bedard J, Brule S, Price CA, Silversides DW, Lussier JG. Serine protease inhibitor-E2 (SERPINE2) is differentially expressed in granulosa cells of dominant follicle in cattle. Mol Reprod Dev. 2003;64:152–65.
Pathcards. MYD88. https://pathcards.genecards.org/. Accessed 07 Jan 2021.
Wang Y, Wang J, Li BH, Qu H, Luo CL, Shu DM. An association between genetic variation in the roundabout, axon guidance receptor, homolog 2 gene and immunity traits in chickens. Poult Sci. 2014;93:31–8.
Brenig B, Beck J, Floren C, Bornemann-Kolatzki K, Wiedemann I, Hennecke S, Swalve H, Schütz E. Molecular genetics of coat colour variations in White Galloway and White Park cattle. Anim Genet. 2013;44:450–3.
Tangteerawatana P, Perlmann H, Hayano M, Kalambaheti T, Troye-Blomberg M, et al. IL4 gene polymorphism and previous malaria experiences manipulate anti-Plasmodium falciparum antibody isotype profiles in complicated and uncomplicated malaria. Malar J. 2009;8:286.
Barreiro LB, Laval G, Quach H, Patin E, Quintana-Murci L. Natural selection has driven population differentiation in modern humans. Nat Genet. 2008;40:340–5.
Xu L, Bickhart DM, Cole JB, Schroeder SG, Song J, Tassell CP, Sonstegard TS, Liu GE. Genomic signatures reveal new evidences for selection of important traits in domestic cattle. Mol Biol Evol. 2015;32:711–25.
Liu YH, Wang L, Xu T, Guo X, Li Y, Yin TT, Yang HC, Hu Y, Adeola AC, et al. Whole-genome sequencing of African dogs provides insights into adaptations against tropical parasites. Mol Biol Evol. 2018;35:287–98.
Chen N, Fu W, Zhao J, Shen J, Chen Q, Zheng Z, Chen H, Sonstegard TS, Lei C, Jiang Y. BGVD: an integrated database for bovine sequencing variations and selective signatures. Genomics Proteomics Bioinformatics. 2020;18:186–93.
Ibeagha-Awemu EM, Peters SO, Akwanji KA, Imumorin IG, Zhao X. High density genome wide genotyping-by-sequencing and association identifies common and low frequency SNPs, and novel candidate genes influencing cow milk traits. Sci Rep. 2016;6:31109.
Wang N, Yuan Y, Wang H, Yu D, Liu Y, Zhang A, Gowda M, Nair SK, Hao Z, Lu Y, San Vicente F, Prasanna BM, Li X, Zhang X. Applications of genotyping-by-sequencing (GBS) in maize genetics and breeding. Sci Rep. 2020;10(1):16308.
Benjelloun B, Boyer F, Streeter I, Zamani W, Engelen S, Alberti A, Alberto FJ, BenBati M, Ibnelbachyr M, Chentouf M, Bechchari A, Rezaei HR, Naderi S, Stella A, Chikhi A, Clarke L, Kijas J, Flicek P, Taberlet P, Pompanon F. An evaluation of sequencing coverage and genotyping strategies to assess neutral and adaptive diversity. Mol Ecol Resour. 2019;19(6):1497–515.
Schneider HK. The subsistence role of cattle among the Pakot and in East Africa. Am Anthropol. 1957;59:278–300.
Mwai O, Hanotte O, Kwon YJ, Cho S. African indigenous cattle: unique genetic resources in a rapidly changing world. Asian Australas J Anim Sc. 2015;28(7):911–21.
Mattioli RC, Pandey VS, Murray M, Fitzpatrick JL. Immunogenetic influences on tick resistance in African cattle with particular reference to trypanotolerant N’Dama (Bos taurus) and trypanosusceptible Gobra zebu (Bos indicus) cattle. Acta Trop. 2000;75:263–77.
Álvarez I, Pérez-Pardal L, Traoré A, Koudandé DO, Fernández I, Soudré A, Goyache F. Differences in genetic structure assessed using Y-chromosome and mitochondrial DNA markers do not shape the contributions to diversity in African sires. J Anim Breed Genet. 2017;134:393–404.
Cheruiyot EK, Bett RC, Amimo JO, Zhang Y, Mrode R, et al. Signatures of selection in admixed dairy cattle in Tanzania. Front Genet Front Genet. 2018;9:607.
Storb R, Thomas ED. Graft-versus-host disease in dog and man: the Seattle experience. Immunol Rev. 1985;88:215–38.
Dutta P, Talenti A, Young R, Jayaraman S, Callaby R, Jadhav SK, Dhanikachalam V, Manikandan M, Biswa BB, Low WY, et al. Whole genome analysis of water buffalo and global cattle breeds highlights convergent signatures of domestication. Nat Commun. 2020;11:4739.
Rege JEO, Aboagye GS, Tawah CL. Shorthorn cattle of West and Central Africa. I. Origin, distribution, classification and population statistics. World Anim Rev. 1994;78(1):2–13.
Sambrook J, Russell DW. Molecular cloning: a laboratory manual, 3rd ed. New York: Cold Spring Harbor Laboratory Press, Cold Spring Harbor; 2001.
Zimin AV, Delcher AL, Florea L, Kelley DR, Schatz MC, Puiu D, Hanrahan F, Pertea G, Van Tassell CP, Sonstegard TS, et al. A whole-genome assembly of the domestic cow. Bos taurus Genome Biol. 2009;10:R42.
Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. 2013.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303.
Van der Auwera GA, Carneiro MO, Hartl C, et al. From FastQ data to high confidence variant calls: The Genome Analysis Toolkit best practices pipeline. Curr Protoc Bioinformatics. 2013;43:11.10.1-11.10.33.
DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, Del Angel G, Rivas MA, Hanna M, McKenna A. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43:491–8.
Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006;38:904–9.
Patterson N, Price AL, Reich D. Population structure and eigenanalysis. PLoS Genet. 2006;2:e190.
Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19:1655–64.
Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7.
Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22:4673–80.
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G. The variant call format and VCFtools. Bioinformatics. 2011;27:2156–8.
Pickrell JK, Pritchard JK. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 2012;8:e1002967.
Patterson N, Moorjani P, Luo Y, Mallick S, Rohland N, Zhan Y, Genschoreck T, Webster T, Reich D. Ancient admixture in human history. Genetics. 2012;192:1065–93.
Pavlidis P, Zivkovic D, Stamatakis A, Alachiotis N. SweeD: likelihood-based detection of selective sweeps in thousands of genomes. Mol Biol Evol. 2013;30:2224–34.
Yi X, Liang Y, Huerta-Sanchez E, Jin X, Cuo ZX, Pool JE, Xu X, Jiang H, Vinckenbosch N, Korneliussen TS, et al. Sequencing of 50 human exomes reveals adaptation to high altitude. Science. 2010;329:75–8.
Weir BS, Cockerham CC. Estimating f-statistics for the analysis of population structure. Evolution. 1984;38:1358–70.
Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J, Billis K, Cummins C, Gall A, Giron CG, et al. Ensembl 2018. Nucleic Acids Res. 2018;46(D1):D754–61.
Reimand J, Arak T, Vilo J. g:Profiler--a web server for functional interpretation of gene lists (2011 update). Nucleic Acids Res. 2011;39(Web Server issue):W307-315.
Kanehisa M, Goto S, Furumichi M, Tanabe M, Hirakawa M. KEGG for representation and analysis of molecular networks involving diseases and drugs. Nucleic Acids Res. 2010;38:D355–60.
We are humbled and honored to recognize the important contributions of the Central Abattoir, Ibadan and Ministry of Agriculture and Rural Development, Oyo State, in Nigeria who made possible the collection of cattle samples for this study. D.H.M acknowledges the support of the Chinese Academy of Sciences -The World Academy of Sciences (CAS-TWAS) President’s Fellowship Program for Doctoral Candidates. We appreciate all of those who assisted in the success of this study.
This work was supported by the Sino-Africa Joint Research Center, Chinese Academy of Sciences (SAJC201611) and the Animal Branch of the Germplasm Bank of Wild Species, Chinese Academy of Sciences (the Large Research Infrastructure Funding). Also, this work has been a success through the Chinese Academy of Sciences President’s International Fellowship Initiative (CAS-PIFI) who provided grant support to A.C.A. (2018FYB0003 and 2021FYB0006). D.H.M. thanks the CAS-TWAS President’s Fellowship Program for Doctoral Candidates for support.
Ethics approval and consent to participate
All experimental procedures in the present study were performed in accordance to Research Guidelines for the Institutional Review Board of Kunming Institute of Zoology, Chinese Academy of Sciences (SMKX2017009) and current study is approved by the Institutional Review Board of Kunming Institute of Zoology, Chinese Academy of Sciences (SMKX2017009). We have complied with ARRIVE at submission.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Mauki, D.H., Tijjani, A., Ma, C. et al. Genome-wide investigations reveal the population structure and selection signatures of Nigerian cattle adaptation in the sub-Saharan tropics. BMC Genomics 23, 306 (2022). https://doi.org/10.1186/s12864-022-08512-w