Runs of homozygosity analysis of South African sheep breeds from various production systems investigated using OvineSNP50k data
BMC Genomics volume 22, Article number: 7 (2021)
Population history, production system and within-breed selection pressure impacts the genome architecture resulting in reduced genetic diversity and increased frequency of runs of homozygosity islands. This study tested the hypothesis that production systems geared towards specific traits of importance or natural or artificial selection pressures influenced the occurrence and distribution of runs of homozygosity (ROH) in the South African sheep population. The Illumina OvineSNP50 BeadChip was used to genotype 400 sheep belonging to 13 breeds from South Africa representing mutton, pelt and mutton and wool dual-purpose breeds, including indigenous non-descript breeds that are reared by smallholder farmers. To get more insight into the autozygosity and distribution of ROH islands of South African breeds relative to global populations, 623 genotypes of sheep from worldwide populations were included in the analysis. Runs of homozygosity were computed at cut-offs of 1–6 Mb, 6–12 Mb, 12–24 Mb, 24–48 Mb and > 48 Mb, using the R package detectRUNS. The Golden Helix SVS program was used to investigate the ROH islands.
A total of 121,399 ROH with mean number of ROH per animal per breed ranging from 800 (African White Dorper) to 15,097 (Australian Poll Dorset) were obtained. Analysis of the distribution of ROH according to their size showed that, for all breeds, the majority of the detected ROH were in the short (1–6 Mb) category (88.2%). Most animals had no ROH > 48 Mb. Of the South African breeds, the Nguni and the Blackhead Persian displayed high ROH based inbreeding (FROH) of 0.31 ± 0.05 and 0.31 ± 0.04, respectively. Highest incidence of common runs per SNP across breeds was observed on chromosome 10 with over 250 incidences of common ROHs. Mean proportion of SNPs per breed per ROH island ranged from 0.02 ± 0.15 (island ROH224 on chromosome 23) to 0.13 ± 0.29 (island ROH175 on chromosome 15). Seventeen (17) of the islands had SNPs observed in single populations (unique ROH islands). The MacArthur Merino (MCM) population had five unique ROH islands followed by Blackhead Persian and Nguni with three each whilst the South African Mutton Merino, SA Merino, White Vital Swakara, Karakul, Dorset Horn and Chinese Merino each had one unique ROH island. Genes within ROH islands were associated with predominantly metabolic and immune response traits and predomestic selection for traits such as presence or absence of horns.
Overall, the frequency and patterns of distribution of ROH observed in this study corresponds to the breed history and implied selection pressures exposed to the sheep populations under study.
The genetic diversity of South African sheep populations is considered complex having been shaped by multifaceted production systems [1, 2] resulting from a combination of indigenous, commercial and synthetic/composite breeds raised to suit, various and often, extreme production conditions where natural selection forces are at play. Coupled with this have been farmer driven initiatives to crossbreed as an effort to develop breeds that are better suited to produce optimally under the harsh production conditions of the country. Whilst South African sheep genetic resources have been imported and introduced in other countries globally, there has also been movement of breeds into South Africa . The country has a combination of both large- and small-framed breeds where both inbreeding and outbreeding are considered dominant forces moulding their phenotypic appearance. Both natural and artificial selection of sheep, as well as regional variations due to drift, have resulted in sheep breeds that differ extensively in phenotypes.
Production system and within-breed selection pressure have pronounced effects on the genome architecture and may cause reduced genetic diversity and frequency of runs of homozygosity islands . Runs of homozygosity (ROH) are contiguous segments of homozygous genotypes that are present in an individual due to parents transmitting identical haplotypes to their offspring . The extent and frequency of ROHs are useful in providing information about the ancestry of an individual and its population [5, 6] with longer ROHs associated with more recent inbreeding within a pedigree while short ROHs are associated with ancient common ancestors [7, 8]. Shorter ROH can also be used to infer ancient relationships, information which in livestock is often missing due to limited recording. Long runs of homozygosity have been observed to be persistent in inbred individuals, suggestive of unusually low mutation rates, high linkage disequilibrium (LD), and low recombination rates at certain genomic regions . ROH accumulation in certain genomic positions has been used to analyze the demographic history in humans [10, 11] and livestock populations [12, 13]. A study also used ROH to compare and characterize beef and dairy cattle breeds . ROHs are also common in regions under positive selection and as such studies have associated accumulation of ROHs at specific loci to directional selection [13, 15]. In a number of studies, ROH have been used to estimate inbreeding levels and infer on signatures of selection and genetic adaptation to production conditions [16,17,18].
The Ovine SNP50 BeadChip array is a genome-wide genotyping array for sheep and was developed by Illumina in collaboration with the International Sheep Genomics Consortium (ISGC). This BeadChip contains 54,241 SNPs that were chosen to be uniformly distributed across the ovine genome with an average gap size and distance of 50.9 Kb and 46 Kb, respectively, and were validated in more than 75 economically important sheep breeds (OvineSNP50 Datasheet, https://www.illumina.com/documents/products/datasheets/datasheet_ovinesnp50.pdf). This study used the Ovine SNP50 BeadChip array to investigate the distribution of ROH in South African sheep breeds sampled from different breeding goals and production systems of mutton, wool, pelt and commercial versus smallholder sectors as well as various other sheep breeds obtained globally. The objectives of the study were to investigate the occurrence and distribution of ROH; characterize autozygosity and identify genomic regions with high ROH islands with the aim to draw insights into how the South African sheep populations were in the past, as well as how their structure and demography have evolved over time. The study presumed that the founder population establishing genetic processes and the extent of breeding control have differed greatly among the different sheep breeds of South Africa and globally. This study therefore hypothesised that production systems geared towards specific traits of importance such as mutton, wool, pelt or multiple traits (as with some dual-purpose breeds) or absence of selection programs e.g. in non-descript breeds kept by smallholder farmers influences the occurrence and distribution of ROH. In a previous study , the South African sheep breeds clustered according to breed and production system as illustrated in Fig. 1. Using ROHs, the current study was therefore used to infer the impact of breed history, inbreeding levels and selection on the accumulation of homozygous mutations in the diverse sheep populations. Global sheep populations accessed from the ISGC (http://www.sheephapmap.org) were used to further analyse the development and separation of populations from their presumed founder populations.
Four hundred animals belonging to 14 South African breeds/populations consisting of mutton (South African Mutton Merino (n = 10), Dohne Merino (n = 50), Meatmaster (n = 48), Blackhead Persian (n = 14) and Namaqua Afrikaner (n = 12), pelt (Swakara subpopulations of Grey (n = 22); Black (n = 16); White-vital (n = 41) and White-subvital (n = 17) and Karakul (n = 10)); wool (SA Merino (n = 56), dual purpose breeds (Dorper (n = 23); Afrino (n = 51) and non-descript Nguni sheep (n = 30) were used in the study.
The South African Mutton Merino was developed from German Merinos and kept as a dual purpose breed for meat and wool. Dohne Merino were developed through intensive selection of merino sheep and are robust animals that are resistant and tolerant to diseases and parasites. The Dohne Merino together with the Afrino and Meatmaster are South African breeds that were developed from Merino breeds either through intensive selection as in the case of the Dohne Merino or through crossbreeding with indigenous sheep breeds of Ronderib Afrikaner for Afrino and Damara for Meatmaster . The Swakara subpopulations were derived from Karakul sheep and bred and developed for pelt production, for which they are predominantly farmed in the Southern parts of Africa . The Blackhead Persian are fat tailed sheep that were imported into South Africa from Somalia in 1870 and are currently farmed by smallholder farmers primarily for meat. Namaqua Afrikaner sheep are indigenous to South Africa and, like the Blackhead Persian, are also farmed by smallholder farmers. The detailed list of breeds and sample sizes are outlined in Table 1. The commercial meat and wool breeds were sampled from Grootfontein Agricultural Development Institute (GADI) biobank and other commercial farms in the Eastern Cape and Northern Cape Provinces of the country . The Swakara sheep were sampled from Swakara pelt farming farms in Namibia and from the Northern Cape province of South Africa. The Nguni is a non-descript indigenous sheep of South Africa raised by communal farmers in the KwaZulu-Natal region of South Africa from where it was sampled.
Genotyping & SNP quality control
The 400 sheep were genotyped using the Illumina Ovine SNP50 BeadChip on the Infinium assay platform at the Agricultural Research Council-Biotechnology Platform in South Africa. SNP genotypes were called using genotyping module integrated in GenomeStudio™ V2010.1 (Illumina Inc.).
Global sheep populations
Additional 623 genotypes from a global set of sheep breeds representing worldwide populations were included in the analysis. These populations included breeds of African (6), Asian (2) and European (9) origin. The African breeds comprised African Dorper (n = 21), African White Dorper (n = 6), Ethiopian Menz (n = 34), Namaqua Afrikaner (n = 10), Red Maasai (n = 45) and Ronderib Afrikaner (n = 19). Asian populations includedBangladesh Garole (n = 24) and Karakas (n = 18). Finally, the breeds of European origin included Australian Poll Dorset (n = 108), Australian Industry Merino (n = 88), Australian Merino (n = 50), Australian Poll Merino (n = 98), Chinese Merino (n = 23), MacArthur Merino (n = 12), Dorset Horn (n = 21), Merinolandschaf (n = 22) and Black-headed Mountain (n = 24). This data set was accessed with permission from the ISGC (http://www.sheephapmap.org).
The two data sets were merged into a dataset that consisted of 1019 animals from 31 sheep breeds/populations and 43,556 SNPs that were retained for analyses after global quality control of both the South African and ISGC sheep breeds (Table 1). Chromosomal coordinates for each SNP were obtained from ovine genome assembly 4.1 (OAR4.1). Markers were filtered to exclude loci assigned to unmapped contigs. Only SNPs located on autosomes were considered for further analyses. Moreover, the following filtering parameters were adopted to exclude certain loci and animals and to generate the pruned input file: (i) SNPs with a call rate < 95% and (ii) minor allele frequency < 1% and (iii) animals with more than 2% of missing genotypes were removed. File editing was carried out using Plink .
Runs of homozygosity definition
Runs of homozygosity were computed using the R package detectRUNS and the consecutive runs method . No pruning was performed based on LD, but the minimum length that constituted the ROH was set to 1 Mb to exclude short ROH deriving from LD. The following criteria were used to define the ROH: (i) one missing SNP and up to one possible heterozygous genotype was allowed in the ROH, (ii) the minimum number of SNPs that constituted the ROH was set to 30 (iii) the minimum SNP density per ROH was set to one SNP every 100 Kb and (iv) the maximum gap between consecutive homozygous SNPs was 250 Kb. The computed ROHs were then categorised into bins based on lengths of 1–6 Mb, 6–12 Mb, 12–24 Mb, 24–48 Mb and > 48 Mb.
The mean number (MNROH) and average length (ALROH) of ROH per breed as well as the average sum of ROH segments per breed were estimated. The inbreeding coefficient (FROH) was estimated based on the ROH for each animal and averaged per breed. FROH was calculated within detectRUNS using the following formula:
LROH is the total length of ROH on autosomes and;
LAUTO is the total length of the autosomes covered by SNPs, which was 2453 Mb.
For comparison, inbreeding coefficients were also estimated using variance between observed and expected heterozygosity (FHOM). This was done using Golden Helix SVS software.
Detection of common runs of homozygosity
To identify the genomic regions most commonly associated with ROH for the meta-population and for groups on the basis of production purposes (mutton, wool and pelt and dual purpose breeds), Golden Helix SVS was used to analyse the incidence of common runs per SNP, which was then plotted against the position of the SNP along the chromosome (OAR).
ROH islands were defined as clusters of runs that were > 1000 Kb with a minimum of 30 SNPs and found in more than 20 samples and analysed using Golden Helix SVS. For each sample, the proportion of SNPs in the ROH island was estimated. The mean proportion of SNPs per sample per ROH islands was determined using Proc MEANS procedure in SAS v9.4 . The variance in mean proportion of SNPs in ROH islands amongst breeds was analysed using the Proc GLM in SAS v9.4  using the following model:
Proportion of SNPs per ROH island = μ + Bi + e.
μ = overall mean;
B i = Breed effect and;
e = random residual error
Functional annotation of ROH islands
ROH islands that were constituted by SNPs from 1, 2 or 3 populations (considered as unique islands) and those common islands with SNPs from three quarters of the populations (> 23 populations) were used for functional annotation. The genomic region associated with each of these island was annotated using the Sheep Quantitative Trait Loci (QTL) database (https://www.animalgenome.org/cgi-bin/QTLdb/OA/summary) and the University of Carlifonia Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu/). The genomic coordinates for these ROH islands were used for the annotation of genes that were fully or partially contained within each selected region using the UCSC Genome Browser (http://genome.ucsc.edu/) and submitted to the Database for Annotation, Visualization and Integrated Discovery (DAVID) database (http://david.abcc.ncifcrf.gov/) for gene ontology (GO). Finally, the Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis was used to investigate pathways associated with each annotated gene within ROH islands. Significant enrichment in the candidate genes was indicated by a p-value of < 0.05.
Runs of homozygosity counts
The study identified 121,399 ROH in total with mean number of ROH per animal per breed ranging from 800 (African White Dorper) to 15,097 (Australian Poll Dorset) as illustrated in Fig. 2 and Supplementary Table S1. Analysis of the distribution of ROH according to their size showed that, for all breeds, the majority of the detected ROH were in the smallest 1–6 Mb in length category (88.2%) ranging from 684 in African White Dorper (n = 684) to Australian Poll Dorset (n = 13,677). The longest ROHs (> 48 MB) were the least (n = 108) with most animals detecting no ROH in this category. The Black Head Mountain had largest number of long (> 48 Mb) of 30 followed by Dorset Horn with 16 ROH > 48 Mb as illustrated in Fig. 2. The average length of ROH across breeds was 5.88 Mb and ranged from 2.60 Mb (Afrino) to 6.90 Mb (Nguni).
McArthur Merino showed the highest value of inbreeding on the basis of ROH (FROH = 0.45 ± 0.03), whereas Australian Poll Merino (FROH = 0.08 ± 0.03) showed the lowest (Table 1). Of the South African breeds, the Nguni and the Blackhead Persian displayed high FROH of 0.31 ± 0.05 and 0.31 ± 0.04, respectively. Other breeds with high FROH included the Black Vital Swakara and the White Subvital and Vital Swakara with FROH > 0.28 (Table 1). South African breeds with low FROH included Dohne Merino (FROH = 0.10 ± 0.02), the Meatmaster with FROH of 0.13 ± 0.02 and South African Merino with FROH = 0.14 ± 0.05. Inbreeding coefficient based on variance FHOM are presented in Table 1. A correlation between FROH and FHOM was observed, with breeds such as Blackhead Persian, Nguni displaying high FROH and FHOM, respectively.
ROHs per chromosome per breed
The distribution of ROHs per chromosome per breed are illustrated in Fig. 3. Runs were evenly distributed amongst chromosomes within breeds.
Incidences of common runs per SNP
Using Golden Helix SVS, an analysis was conducted to investigate the incidence of common runs per SNP and results are illustrated in Fig. 4 and Supplementary Table S2. Highest incidence of common runs per SNP across breeds was observed on chromosome 10 with over 250 incidences of common ROHs at some of the SNPs (Fig. 4; Supplementary Table S2). Other chromosomes such as 2, 6, 13, 15 and 19 were found to have moderate incidences of common SNPs averaging 150–160 (Fig. 4). Across breeds, certain regions were observed to be absent of ROHs notable of which were chromosomes 10 (±7Mbs region; 21 (±40Mbs region); 22 (±18Mbs region) and 26 (±8Mbs region) as illustrated in Supplementary File S3).
A total of 244 ROH islands distributed across all 26 autosomes were observed. Mean proportion of SNPs in ROH island ranged from 0.02 ± 0.15 (island ROH224 on chromosome 23) to 0.13 ± 0.29 (island ROH175 on chromosome 15) as illustrated in Supplementary Table S4. Number of islands ranged from a minimum of 2 clusters per chromosome (on chromosome 22) to 32 clusters per chromosome (on chromosome 1). Seventeen (17) of the islands were observed in single populations and considered unique ROH islands. Thirty-nine of the reported ROHs were each observed in 3 populations whilst the remaining 188 were each observed in more than 3 populations and considered common islands. Detailed distribution of ROH islands are presented in Supplementary Table S5 and Supplementary File S6a-d. The MacArthur Merino population had five unique ROH islands (Supplementary File S6b) followed by Nguni (Supplementary File S6b) and Blackhead Persian (Supplementary File S6c) with three each whilst the South African Merino, South African Mutton Merino, White Vital Swakara, Karakul, Dorset Horn and Chinese Merino each had one unique ROH island (Supplementary File S6d). For those islands shared between two populations, the Blackhead Persian shared with South African Mutton Merino, SA Merino, Black Vital Swakara, Ronderib Afrikaner and MacArthur Merino; the MacArthur Merino shared with Nguni, Namaqua Afrikaner, Blackhead Persian and Bangladesh Galore; the Nguni shared with MacArthur Merino, White Vital Swakara and Karakul, the Dorper with NQA and RDA while the AWD shared with South African Mutton Merino. For ROH islands shared amongst 3 populations the Nguni shared one with White Vital Swakara and Karakul; the African White Dorper one with Namaqua Afrikaner and White Subvital Swakara; the Bangaldesh Galore with Blackhead Persian and South African Mutton Merino; the Dorset Horn with Nguni and White Vital Swakara while the BVS with RDA and South African Mutton Merino.
SNPs and gene annotation
Based on the Sheep QTL database, ROH7 on chromosome one, and unique to Blackhead Persian lies within a genomic region previously found to be harboring QTLs for selection for presence or absence of horns in Soay sheep and average daily gain in Awassi and Merino sheep. ROH island ROH15 on chromosome one and unique to MacArthur Merino lies in a genomic region harboring QTLs for Feacal Egg Count and Susceptibility to facial eczema. QTLs associated with carcass fat percentage (FATP) in Awassi and Merino sheep were also observed within island (ROH15).
A list of genes and associated KEGG pathways found with population restricted ROH islands are reported in Table 2a and b and Supplementary Table S5. Using the KEGG pathway analysis, genes associated with metabolic pathways such as ANPEP, HDDC3 and ST3GAL3 where observed with ROH islands unique to the Blackhead Persian (Table 2a). Two ROH islands (ROH100 and ROH101) unique to Blackhead Persian and Nguni respectively were observed in chemokine pathways (GRK4 gene) and Natural killer cell mediated cytotoxicity pathways (SH3BP2 gene). Other pathways observed included thermogenesis (ROH199), interleukin signalling pathways (ROH11) and pathways associated with bacterial infections such as Escherichia Coli, Salmonella, Mycobacterium tuberculosis (ROH149) as illustrated in Table 2.
The domestication of sheep was a complex process that allowed both natural and artificial selection of breeds. Regional variations due to genetic drift in breeds became small and geographically restricted resulting in extensive and diverse phenotypes. Such processes, while well documented in other breeds, are unknown in most of the small and geographically restricted local populations. ROH have been extensively studied in humans and livestock populations and are an established method of inferring population history. ROH are continuous homozygous segments that are common in individuals and populations. The ability of these homozygous segments to give insight into a population’s genetic events makes them a useful tool that can provide information about the demographic evolution of a population over time. Furthermore, ROH provide useful information about the genetic relatedness among individuals, helping to manage inbreeding rate, thereby exposing possible deleterious variants in the genome. ROH are widely used as predictors of inbreeding levels in populations. Calculating the inbreeding coefficient from ROH (FROH) is more accurate for estimating autozygosity and for detecting both past and more recent inbreeding effects than estimating inbreeding from pedigree data .
South African sheep populations are a result of complex and multifaceted production systems. Natural and artificial selection forces play vital roles through mixtures of indigenous, commercial and synthetic/composite breeds raised in extreme production conditions. ROH were used in this study to investigate this population history with emphasis on breed relatedness. Illumina ovine SNP50 genotypes of South African mutton, wool, pelt and dual purpose and non-descript breeds was analysed together with that of global populations of similar geographic background.
Breeds such as the Australian Poll Dorset followed by the South African Nguni, Australian Poll Merino and Australian Industry Merino and the White Vital Swakara sheep had the most ROHs ranging from 6155 in White Vital Swakara to 15,097 in Australian Poll Dorset. Frequency of ROHs in different breeds reflect on the size of the breeds often positively associated with inbreeding levels . South African breeds of the Nguni, Blackhead Persian, Namaqua Afrikaner and Swakara are small breeds restricted to specific production systems and geographic locations [28, 29], which explains the high frequency of ROH in these populations. Similarly-raised worldwide populations include the Ethiopian Menzi [30, 31], Bangladesh Galore , Black-headed Mutton  that also had high numbers of ROH in this study. ROH due to recent inbreeding tends to be longer, due to little opportunity for recombination to break up the segments that are identical-by-descent . The Nguni (725) Afrino (845), White Vital Swakara (1204) and Australian Poll Dorset (1359) had the most moderately sized (6–24 Mb) ROH while the Australian Poll Dorset (61), DSH (75), BHM (90), Nguni (90) and White Vital Swakara (94) has the most of the large (> 24 Mb) ROH (Supplementary Table 1) presenting more ancient inbreeding. Ancient ROH are generally much shorter because the chromosomal segments have been broken down by repeated meiosis [27, 34]. In this study, the Nguni (5855), Australian Industry Merino (6040), Australian Poll Merino (6188) and Australian Poll Dorset (13,677) had the highest number of short ROH (Supplementary Table 1) implying more recent inbreeding events. The Nguni breed of South Africa is a non-descript breed kept by smallholder farmers under low-input communal farming systems [29, 35]. Small flock sizes, sharing and retaining of bucks for multiple breeding cycles characterises the smallholder livestock production systems of South Africa inclusive of sheep. It has been suggested that such production factors lead to inbreeding in these populations .
Notable inbreeding has been observed in Zulu sheep [29, 36]. The spread of Zulu sheep into different areas of KwaZulu-Natal has fractured the sheep into isolated subpopulations occupying different ecological, social-cultural and management environments . The high frequency of both short and long ROH in the Nguni relative to other breeds is supported by the breed history and suggestive of a sustained high level of inbreeding due to both founder effects and the practice by smallholder farmers of raising the sheep as small fragmented populations. Contrary to the Nguni are the composite commercial breeds of South Africa, e.g. Afrino, Meatmaster, that, although they experienced population bottlenecks during their formation which is now reflected by the high frequency of short ROH, the breeds are now well managed commercially and as such present minimum long ROH > 24 Mb. The Blackhead Persian were initially introduced to South Africa by chance in 1869. A vessel damaged by a storm at sea carried a number of slaughter sheep. These sheep, one ram and three ewes, were taken to Wellington where the breed was further developed . In 1948, the present Blackhead Persian Sheep Breeders’ Society of South Africa was formed in De Aar and to date the Black-headed Persian is represented by a well-established breed society with breed standards and management practise. Such history of the Blackhead Persian explain the predominating short ROHs reflective of ancient founder effects during its establishment in South Africa and the few long ROHs since it is now well managed with a representing breed society.
Overall, the patterns of distribution of ROH revealed in this study showed peculiar patterns of inbreeding of sheep breeds that corresponded with levels of selection pressure typical of trait of economic importance as well as the production system typical of their rearing. South African indigenous and local breeds of Nguni, Blackhead Persian, Namaqua Afrikaner and the pelt based subpopulations of the Swakara Sheep had high inbreeding estimates reflective of the small and fragmented populations. The Nguni and Namaqua Afrikaner for example are raised as small household flocks in geographically marginalised regions of the country  whereas the pelt based Swakara subpopulations are small populations raised for a unique production system of pelt . Challenges of small effective population size and inbreeding have been suggested in these populations [29, 36, 38]. Such high inbreeding levels imply that the breeds are of low genetic diversity and at risk of extinction. Conservation efforts are therefore required to minimise further loss of genetic diversity and extinction of some of these breeds. Phenotypic and genetic characterisation as well as sustainable utilisation of the genetic resources and implementation of structured and tailor-made breeding programs have been suggested as alternative approaches to conservation of threatened genetic diversity . The global populations, the MCM from south west Europe  had a high FROH similar to that of the Swakara subpopulations and the Namaqua Afrikaner. The Dorper, Dorset Horn and South African Mutton Merino also shared a high FROH as the Swakara breeds, MCM and Namaqua Afrikaner.
ROH frequencies vary widely within and across chromosomes . Chromosome 10 had the highest incidence of common runs per SNP across breeds with over 250 incidences of common ROHs at some of the SNPs. Chromosome 10 harbours a genomic region associated with horns in sheep . One of the regions on chromosome 10 harbours the RXFP2 gene which has been reported as a main candidate for horns in Soay sheep . Other studies have identified RXFP2 within a quantitative trait locus for horn size and highly heritable in sheep . Recently, it was reported a 1.8 Kb insertion in the 3′-UTR of RXFP2 to be associated with polledness in sheep . Presence and absence of horns and subsequently horn size have been the main parameters under selection in sheep pre-and post-domestication. The RXFP2 region has been observed to be under selection in some African sheep populations . Other regions on chromosome 10 were SNPs within genes such as Crystallin lambda 1(CRYL1) which is associated with metabolic pathways particularly pentose and glucuronate interconversions and has been observed to be under trans-specific signatures of domestication in sheep and goats .
Chromosome 6 also harboured SNPs with high incidence of common ROH across breeds. Some of the associated genes included the Secreted phosphoprotein 1 (SPP1) and ligand dependent nuclear receptor corepressor-like (LCORL) which are on a domain of 36.15–38.56 Mb and play an essential role in tissue and embryonic growth [45, 46]. LCORL was observed to be associated with height in cattle [47, 48] and observed to be under selection in different sheep breeds [45, 49] and in other species [50,51,52,53,54]. La et al.,  observed significantly high expression of SPP1 in the kidney of Hu sheep. The Prostaglandin f2-alpha synthase (PGFS) gene on chromosome 13 is associated with the Arachidonic acid metabolism and other metabolic pathways and were observed to be associated with wool growth regulation in Aohan fine wool sheep .
The failure to observe ROHs in certain genomic regions (i.e Chromosomes 10, 21, 22 and 26) could be attributed to gaps in marker coverage. According to Nandolo et al. , artefacts due to structural variants and gaps in marker coverage could influence the screening of ROHs. However, whilst this will be considered a SNP chip effect, affecting the screening of ROH similarly across breeds, the impact of such an artefacts might have minimal effects on the breed comparisons undertaken in this study.
In this study, ROH islands were defined as clusters of runs that were > 1000 Kb with a minimum of 30 SNPs and found in more than 20 samples which represented 1.95% of the total population in this study. In other studies, a threshold of 1% was used . According to Zhang et al., , ROH patterns are not randomly distributed across the genomes, and are seen to be distributed and shared among individuals as a result of selection events. The 244 ROH islands observed in this study varied from those with SNPs unique to one population (17 ROH islands), two-three populations (39 islands) and those distributed in more than 3 populations (188 islands). As expected, the highly inbred populations of the MCM, Blackhead Persian and Nguni had the highest frequency of unique ROHs suggestive of small, fragmented populations with small effective population sizes and evolving independently from other populations. The Nguni and Blackhead Persian for example are small breeds kept by smallholder farmers in unique productionn systems . The MCM population is also a highly inbred population from west Europe . The other breeds such as White Vital Swakara and Chinese Merino are also equally small breeds highly selected for specific production purposes, e.g. pelt production in the case of White Vital Swakara sheep . Previous studies suggested that ROH islands are a result of intensive selection often found in populations of finite size [57, 60].
Based on the Sheep QTL database, both productive traits, i.e. average daily gain and carcass fat percentage, and adaptive traits, e.g. absence of horns, feacal egg count and susceptibility to facial eczema, were observed within reported ROH islands. The Blackhead Persian is a polled breed with both sexes lacking horns. The observed unique island, ROH7, associated with selection of absence of horns  will therefore be in line with ancient selection for horns in this breed. The identified regions under ROH islands associated with metabolic pathways (e.g ST3GAL3 on island ROH2; FES gene on island ROH199), adaptive and innate immunity (ADRA2C, GRK4 and SH3BH2 genes on island ROH100), thermogenesis (PLIN1 gene on ROH199) relate to key traits relevant to the Blackhead Persian’s and other livestock’s survival in harsh compromised environments smallholder populations of South Africa . Similarly the Nguni sheep shared genomic regions (i.e ADRA2C, GRK4 and SH3BH2 genes on island ROH101) as well as traits affected by different genomic regions such as island ROH11 on chromosome 1 that harboured the S100A9 gene associated with the IL-17 signalling pathway and CRTC2 gene associated with the Glucagon signalling pathway. ROH islands that were shared between populations implied common selection pressures between/amongst affected breeds for example the Blackhead Persian and the South African Mutton Merino and South African Merino that shared ROH islands associated with metabolic and immune response pathways.
The study reported frequency and distribution of ROHs in South African sheep breeds, relative to global populations. The pattern of distribution of ROH corresponded to breed history and production system under which they are raised. Similarities in frequency and patterns of ROHs between South African breeds and other global breeds was observed especially when comparing the Merino-type breeds. The study showed peculiar patterns of inbreeding of sheep breeds that corresponded with levels of selection pressure typical of traits of economic importance as well as the production system aligned to their rearing.
Availability of data and materials
The SNP genotypes of the South African sheep breeds generated for this project are available on https://osf.io/ceup6/?view_only=a1959659de5f4d5d9bbb1c607b2d83b6 and can be downloaded upon request.
Australian Industry Merino
- ALROH :
Average length of ROH
Australian Poll Dorset
Australian Poll Merino
African White Dorper
Black Vital Swakara
- FHOM :
Inbreeding coefficient based on variance in expected heterozygosity
- FROH :
Runs of homozygosity based Inbreeding coefficient
Grootfontein Agricultural Development Institute
Grey Vital Swakara
International Sheep Genomics Consortium
Kyoto encyclopaedia of genes and genomes
- L ROH :
Total length of ROH on autosomes
- L AUTO :
Total length of the autosomes
- MNROH :
Mean number of ROH
- Proc GLM:
Procedure Generalised Linear Model
Quantitative Trait Loci
Runs of Homozygosity
SA Mutton Merino
Single Nucleotide Polymorphisms
Statistical Analysis System
White Subvital Swakara
White Vital Swakara
Cloete SWP, Olivier JJ, Sandenbergh L, Snyman MA. The adaption of the South Africa sheep industry to new trends in animal breeding and genetics: a review. S Afr J Anim Sci. 2014;44(4):307–21.
Molotsi AH, Taylor JF, Cloete SWP, Muchadeyi F, Decker JE, Sandenbergh L, Dzama K. Preliminary genome-wide association study for wet-dry phenotype in smallholder ovine populations in South Africa. South Afr J Anim Sci. 2017. https://doi.org/10.4314/sajas.v47i3.9.
Cloete SWP, Olivier JJ. South African sheep industry. In: Cottle DJ, editor. The International Sheep and Wool Handbook. England: Nottingham University Press; 2010. p. 95–112.
Szmatoła T, Gurgul A, Jasielczuk I, Ząbek T, Ropka-Molik K, Litwińczuk Z, Bugno-Poniewierska M. A comprehensive analysis of runs of Homozygosity of eleven cattle breeds representing different production types. Animals. 2019;9:1024. https://doi.org/10.3390/ani9121024.
Purfield DC, Berry DP, McParland S, Bradley DG. Runs of homozygosity and population history in cattle. BMC Genet. 2012;13:70 https://www.biomedcentral.com/1471-2156/13/70.
Joaquim LB, Chud TCS, Marchesi JAP, Savegnago RP, Buzanskas ME, Zanella R, et al. Genomic structure of a crossbred landrace pig population. PLoSONE. 2019;14(2):e0212266. https://doi.org/10.1371/journal.pone.0212266.
Szpiech ZA, Xu J, Pemberton TJ, Peng W, Zöllner S, Rosenberg NA, Li JZ. Long runs of homozygosity are enriched for deleterious variation. Am J Hum Genet. 2019;93(1):90–102. https://doi.org/10.1016/j.ajhg.2013.05.003.
Sams AJ, Boyko AR. Fine-Scale Resolution of Runs of Homozygosity Reveal Patterns of Inbreeding and Substantial Overlap with Recessive Disease Genotypes in Domestic Dogs. G3 (Bethesda, Md.). 2019;9(1):117–23. https://doi.org/10.1534/g3.118.200836.
Jemaa SB, Thamri N, Mnara S, Rebours E, Rocha D, Boussaha M. Linkage disequilibrium and past effective population size in native Tunisian cattle. Genet Mol Biol. 2019;42(1):52–61. https://doi.org/10.1590/1678-4685-GMB-2017-0342.
Halim BN, Nagara M, Regnault B, Hsouna S, Lasram K, Kefi R, Azaiez H, Khemira L, Saidane R, Ammar SB, Besbes G, Weil D, Petit C, Abdelhak S, Romdhane L. Estimation of recent and ancient inbreeding in a small endogamous Tunisian community through genomic runs of Homozygosity. Ann Hum Genet. 2015;79:402–17. https://doi.org/10.1111/ahg.12131.
Mooney JA, Huber CD, Service S, Sul JH, Marsden CD, Zhang Z, Sabatti C, Ruiz-Linares A, Bedoya G. Costa Rica/Colombia consortium for genetic investigation of bipolar Endophenotypes, Freimer N, Lohmueller KE. Understanding the hidden complexity of Latin American population isolates. Am J Hum Genet. 2018;103(5):707–26. https://doi.org/10.1016/j.ajhg.2018.09.013.
Upadhyay M, Chen W, Lenstra J, et al. Genetic origin, admixture and population history of aurochs (Bos primigenius) and primitive European cattle. Heredity. 2017;118:169–76. https://doi.org/10.1038/hdy.2016.79.
Islam R, Li Y, Liu X, Berihulay H, Abied A, Gebreselassie G, Ma Q, Ma Y. Genome-wide runs of Homozygosity, effective population size, and detection of positive selection signatures in six Chinese goat breeds. Genes. 2019;10(11):938. https://doi.org/10.3390/genes10110938.
Xu L, Zhao G, Yang L, et al. Genomic patterns of Homozygosity in Chinese local cattle. Sci Rep. 2019;9:16977. https://doi.org/10.1038/s41598-019-53274-3.
Chen Z, Zhang M, Lv F, Ren X, Li W, Liu M, Nam K, Bruford MW, Li M. Contrasting patterns of genomic diversity reveal accelerated genetic drift but reduced directional selection on X-chromosome in wild and domestic sheep species. Genome Biol Evol. 2018;10(5):1282–97. https://doi.org/10.1093/gbe/evy085.
Sandenbergh L. Identification of SNPs associated with robustness and greater reproductive success in the South African Merino sheep using SNP chip technology. In: Doctoral dissertation. Stellenbosch: Stellenbosch University; 2015.
D'Ambrosio J, Phocas F, Haffray P, et al. Genome-wide estimates of genetic diversity, inbreeding and effective size of experimental and commercial rainbow trout lines undergoing selective breeding. Genetics Sel Evol. 2019;51(1):26. https://doi.org/10.1186/s12711-019-0468-4.
Aramburu O, Ceballos F, Casanova A, Le Moan A, Hemmer-Hansen J, Bekkevold D, Bouza C, Martínez P. Genomic signatures after five generations of intensive selective breeding: runs of Homozygosity and genetic diversity in representative domestic and wild populations of turbot (Scophthalmus maximus). Front Genet. 2020;11:296. https://doi.org/10.3389/fgene.2020.00296.
Dzomba EF, Chimonyo M, Snyman MA, Muchadeyi FC. The genomic architecture of South African mutton, pelt, dual-purpose and nondescript sheep breeds relative to global sheep populations. Anim Genetics. 2020. https://doi.org/10.1111/age.12991.
Snyman MA. South African sheep breeds: Persian sheep. Grootfontein Agricultural Development Institute. 2014; Info-pack ref. 2014/026. http://gadi.agric.za/InfoPacks/infopacks.php.
Malesa MT. Population genetics of Swakara sheep inferred using genome-wide SNP genotyping. MSc Thesis: University of KwaZulu-Natal; 2015.
Soma P, Kotze A, Grobler JP, Van Wyk JB. South African sheep breeds: population genetic structure and conservation implications. Small Rumin Res. 2012;103:112–9.
Purcell SN, Todd-Brown K, Thomas L, Ferreira M, Bender D, Maller J, Sklar P, De Bakker P, Pc DMS. PLINK: a toolset for whole-genome association and population-basedlinkage analysis [Online]. American J Hum Gen. 2007;81:559–75.
Biscarini F, Cozzi P, Gaspa G, Marras G. detectRUNS: Detect runs of homozygosity and runs of heterozygosity in diploid genomes. CRAN (The Comprehensive R Archive Network). 2018. https://cran.r-project.org/web/packages/detectRUNS/index.html.
SAS Institute Inc. SAS® 9.4 Statements: Reference. Cary: SAS Institute Inc.; 2013.
Kim ES, Sonstegard TS, Van Tassell CP, Wiggans G, Rothschild MF. The relationship between runs of Homozygosity and inbreeding in Jersey cattle under selection. PLoS One. 2015;10(7):e0129967. https://doi.org/10.1371/journal.pone.0129967.
Kirin M, McQuillan R, Franklin CS, Campbell H, McKeigue PM, Wilson JF. Genomic runs of Homozygosity record population history and consanguinity. PLoS One. 2010;5(11):e13996. https://doi.org/10.1371/journal.pone.0013996.
Selepe MM, Ceccobelli S, Lasagna E, Kunene NW. Genetic structure of south African Nguni (Zulu) sheep populations reveals admixture with exotic breeds. PLoS One. 2018;13(4):e0196276. https://doi.org/10.1371/journal.pone.0196276.
Kunene NW, Ceccobelli S, Di Lorenzo P, Hlophe SR, Bezuidenhout CC, Lasagna E. Genetic diversity in four populations of Nguni (Zulu) sheep assessed by microsatellite analysis. Ital J Anim Sci. 2014;13:3083.
Edea Z, Dessie T, Dadi H, Do KT, Kim KS. Genetic diversity and population structure of Ethiopian sheep populations revealed by high-density SNP markers. Front Genet. 2017;8:218. https://doi.org/10.3389/fgene.2017.00218.
Gizaw S, Abegaz S, Rischkowsky B, Haile A, Mwai AO, Dessie T. Review of sheep Research and Development projects in Ethiopia. Nairobi: International Livestock Research Institute (ILRI); 2013. https://cgspace.cgiar.org/handle/10568/35077.
Deb GK, Choudhury MP, Kabir MA, Khan MYA, Ershaduzzaman M, Nahar TN, Hossain SMJ, Alam MS, Alim MA. Genetic relationship among indigenous sheep population of Bangladesh. Bang J Anim Sci. 2019;48(1):17–22.
Schönherz AA, Szekeres BD, Nielsen VH, Guldbrandtsen B. Population structure and genetic characterization of two native Danish sheep breeds. Acta Agriculture Scandinavica. 2020;69:1–2, 53-67. https://doi.org/10.1080/09064702.2019.1639804.
Curik I, Ferenčaković M, Sölkner J. Inbreeding and runs of homozygosity: a possible solution to an old problem. Livest Sci. 2014;166:26–34. https://doi.org/10.1016/j.livsci.2014.05.034.
Qwabe SO, van Marle-Köster E, Visser C. Genetic diversity and population structure of the endangered Namaqua Afrikaner sheep. Trop Anim Health Prod. 2013;45:511–6. https://doi.org/10.1007/s11250-012-0250-x.
Hlophe SR. Genetic variation between and within six selected south African sheep breeds using random amplified polymorphic DNA and protein markers (doctoral dissertation); 2011.
Mavule BS, Sarti FM, Lasagna E, Kunene NW. Morphological differentiation amongst Zulu sheep populations in KwaZulu-Natal, South Africa, as revealed by multivariate analysis. Small Rumin Res. 2016;140:50–6.
Muchadeyi FC, Malesa MT, Soma P, Dzomba EF. Runs of Homozygosity in Swakara pelt producing sheep: implications on sub-vital performance. In: Proceedings for Association for the Advancement of Animal Breeding and Genetics, vol. 21; 2015. p. 310–3. http://www.aaabg.org/aaabghome/AAABG21papers/Muchadeyi21310.pdf.
Kijas JW, Lenstra JA, Hayes B, Boitard S, Porto Neto LR, San Cristobal M, Servin B, McCulloch R, Whan V, Gietzen K, Paiva S, Barendse W, Ciani E, Raadsma H, McEwan J, Dalrymple B, other members of the International Sheep Genomics Consortium. Genome-Wide Analysis of the World's Sheep Breeds Reveals High Levels of Historic Mixture and Strong Recent Selection. PLoS Biology. 2012;10. https://doi.org/10.1371/journal.pbio.1001258.
Johnston SE, McEwan JC, Pickering NK, Kijas JW, Beraldi D, Pilkington JG, Pemberton JM, Slate J. Genome-wide association mapping identifies the genetic basis of discrete and quantitative variation in sexual weaponry in a wild sheep population. Mol Ecol. 2011;20(12):2555–66. https://doi.org/10.1111/j.1365-294X.2011.05076.x.
Sim Z, Coltman DW. Heritability of horn size in Thinhorn sheep. Front Genet. 2019;10:959. https://doi.org/10.3389/fgene.2019.00959.
Wiedemar N, Drögemüller CA. 1.8-kb insertion in the 3′-UTR of RXFP2 is associated with polledness in sheep. Anim Genet. 2015;46(4):457–61. https://doi.org/10.1111/age.12309.
Ahbara A, Bahbahani H, Almathen F, Al Abri M, Agoub MO, Abeba A, Kebede A, Musa HH, Mastrangelo S, Pilla F, Ciani E, Hanotte O, Mwacharo JM. Genome-Wide Variation, Candidate Regions and Genes Associated With Fat Deposition and Tail Morphology in Ethiopian Indigenous Sheep. Front Genet. 2019;9:699. https://doi.org/10.3389/fgene.2018.00699.
Alberto FJ, Boyer F, Orozco-ter Wengel P, Streeter I, Servin B, de Villemereuil P, Benjelloun B, et al. Nat Commun. 2018;9:813. https://doi.org/10.1038/s41467-018-03206-y.
Al-Mamun HA, Kwan P, Clark SA, et al. Genome-wide association study of body weight in Australian merino sheep reveals an orthologous region on OAR6 to human and bovine genomic regions affecting height and weight. Genet Sel Evol. 2015;47:66. https://doi.org/10.1186/s12711-015-0142-4.
La Y, Zhang X, Li F, Zhang D, Li C, Mo F, Wang W. Molecular Characterization and Expression of SPP1, LAP3 and LCORL and Their Association with Growth Traits in Sheep. Genes (Basel). 2019;10(8):616. https://doi.org/10.3390/genes10080616.
Gudbjartsson D, Walters G, Thorleifsson G, et al. Many sequence variants affecting diversity of adult human height. Nat Genet. 2008;40:609–15. https://doi.org/10.1038/ng.122.
Weedon MN, Lango H, Lindgren CM, Wallace C, Evans DM, Mangino M, et al. Genome-wide association analysis identifies 20 loci that influence adult height. Nat Genet. 2008;40:575–83.
Signer-Hasler H, Burren A, Ammann P, Drögemüller C, Flury C. Runs of homozygosity and signatures of selection: a comparison among eight local Swiss sheep breeds. Anim Genet. 2019;50:512–25. https://doi.org/10.1111/age.12828.
Bolormaa S, Hayes BJ, van der Werf JH, et al. Detailed phenotyping identifies genes with pleiotropic effects on body composition. BMC Genomics. 2016;17:224. https://doi.org/10.1186/s12864-016-2538-0.
Wood AR, Esko T, Yang J, Vedantam S, Pers TH, Gustafsson S, et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet. 2014;46:1173–86.
Pryce JE, Hayes BJ, Bolormaa S, Goddard ME. Polymorphic regions affecting human height also control stature in cattle. Genetics. 2011;3:981–4.
Metzger J, Schrimpf R, Philipp U, Distl O. Expression levels of LCORL are associated with body size in horses. PLoS One. 2013;8. https://doi.org/10.1371/journal.pone.0056497.
Saatchi M, Schnabel RD, Taylor JF, Garrick DJ. Large-effect pleiotropic or closely linked QTL segregate within and across ten US cattle breeds. BMC Genomics. 2014;15:442.
Liu N, Li H, Liu K, et al. Identification of skin-expressed genes possibly associated with wool growth regulation of Aohan fine wool sheep. BMC Genet. 2014;15:144. https://doi.org/10.1186/s12863-014-0144-1.
Nandolo W, Utsunomiya YT, Mészáros G, Wurzinger M, Khayadzadeh N, Torrecilha RBP, Mulindwa HA, Gondwe TN, Waldmann P, Ferencakovic M, Garcia JF, Rosen BD, Bickhart D, Tassell CP, Curik I, Solkner J. Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances. Genetic Sel Evol. 2018;50:43. https://doi.org/10.1186/s12711-018-0414-x.
Peripolli E, Stafuzza NB, Munari DP, et al. Assessment of runs of homozygosity islands and estimates of genomic inbreeding in Gyr (Bos indicus) dairy cattle. BMC Genomics. 2018;19(1):34. https://doi.org/10.1186/s12864-017-4365-3.
Zhang Q, Guldbrandtsen B, Bosse M, Lund MS, Sahana G. Runs of homozygosity and distribution of functional variants in the cattle genome. BMC Genomics. 2015;16:542.
Ciani E, Lasagna E, D’Andrea M, et al. Merino and merino-derived sheep breeds: a genome-wide intercontinental study. Genet Sel Evol. 2015;47:64. https://doi.org/10.1186/s12711-015-0139-z.
Purfield DC, McParland S, Wall E, Berry DP. The distribution of runs of homozygosity and selection signatures in six commercial meat sheep breeds. PLoS One. 2017;12(5):e0176780. https://doi.org/10.1371/journal.pone.0176780.
Mdladla K, Dzomba E, Muchadeyi F. Landscape genomics and pathway analysis to understand genetic adaptation of south African indigenous goat populations. Heredity. 2018;120:369–78. https://doi.org/10.1038/s41437-017-0044-z.
The Grootfontein Agricultural Development Institute (GADI), South Africa provided some DNA samples for commercial breeds from their Biobank. We are grateful to Dr. Pranisha Soma who provided DNA samples from her MSc studies. SNP genotyping was done at the Agricultural Research Council-Biotechnology Platform. We acknowledge the International Sheep Genome Consortium for the Ovine SNP50K genotypes from global populations used in this study.
The work was funded by The National Research Foundation of South Africa and the University of KwaZulu-Natal Competitive Grant. The National Research Foundation of South Africa and the University of KwaZulu-Natal Competitive Grant Program had no role in the study design, collection, analysis, and interpretation of data, neither in the writing of the manuscript.
Ethics approval and consent to participate
The project was granted access and used DNA material from the Grootfontein Agricultural Development Institute (GADI), South Africa’s Biobank and from a previous study by Dr. Pranisha Soma. The SNP50K genotypes for the global populations were accessed with permission from the International Sheep Genome Consortium database.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Dzomba, E.F., Chimonyo, M., Pierneef, R. et al. Runs of homozygosity analysis of South African sheep breeds from various production systems investigated using OvineSNP50k data. BMC Genomics 22, 7 (2021). https://doi.org/10.1186/s12864-020-07314-2
- Production system
- SNP genotypes
- Runs of Homozygosity
- ROH island