Genome-wide detection of genetic structure and runs of homozygosity analysis in Anhui indigenous and Western commercial pig breeds using PorcineSNP80k data

Background Runs of homozygosity (ROH) are continuous homozygous regions typically located in the DNA sequence of diploid organisms. Identifications of ROH that lead to reduced performance can provide valuable insight into the genetic architecture of complex traits. Here, we systematically investigated the population genetic structure of five Anhui indigenous pig breeds (AHIPs), and compared them to those of five Western commercial pig breeds (WECPs). Furthermore, we examined the occurrence and distribution of ROHs in the five AHIPs and estimated the inbreeding coefficients based on the ROHs (FROH) and homozygosity (FHOM). Finally, we identified genomic regions with high frequencies of ROHs and annotated candidate genes contained therein. Results The WECPs and AHIPs were clearly differentiated into two separate clades consistent with their geographical origins, as revealed by the population structure and principal component analysis. We identified 13,530 ROHs across all individuals, of which 4,555 and 8,975 ROHs were unique to AHIPs and WECPs, respectively. Most ROHs identified in our study were short (< 10 Mb) or medium (10–20 Mb) in length. WECPs had significantly higher numbers of short ROHs, and AHIPs generally had longer ROHs. FROH values were significantly lower in AHIPs than in WECPs, indicating that breed improvement and conservation programmes were successful in AHIPs. On average, FROH and FHOM values were highly correlated (0.952–0.991) in AHIPs and WECPs. A total of 27 regions had a high frequency of ROHs and contained 17 key candidate genes associated with economically important traits in pigs. Among these, nine candidate genes (CCNT2, EGR2, MYL3, CDH13, PROX1, FLVCR1, SETD2, FGF18, and FGF20) found in WECPs were related to muscular and skeletal development, whereas eight candidate genes (CSN1S1, SULT1E1, TJP1, ZNF366, LIPC, MCEE, STAP1, and DUSP) found in AHIPs were associated with health, reproduction, and fatness traits. Conclusion Our findings provide a useful reference for the selection and assortative mating of pig breeds, laying the groundwork for future research on the population genetic structures of AHIPs, ultimately helping protect these local varieties. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08583-9.


Introduction
Runs of homozygosity (ROH) are defined as contiguous homozygous genotype segments present in an individual due to the parents transmitting identical haplotypes to their offspring [1]. Long ROHs are associated with more recent inbreeding within a pedigree, whereas short ROHs are associated with ancient common ancestors [2]. Bosse et al. [3] and Herrero et al. [4] used ROHs to investigate the population relationships, evolutionary history, and inbreeding effects in pigs. Several factors can influence the generation of ROHs, such as inbreeding, natural and artificial selection, genetic drift, and population bottlenecks. Of these, inbreeding is considered the most important factor [5]. Inbreeding leads to an increased risk of homozygosity for deleterious alleles throughout the genome, largely in the form of ROHs causing inbreeding depression, eventually leading to decreased fertility, viability, and phenotypic variation in the offspring [6]. Therefore, to avoid inbreeding depression in animal breeding programmes, a highly sensitive and accurate estimation of the inbreeding coefficient is of utmost importance [7].
Traditionally, the inbreeding coefficient has been estimated based on pedigree information (F PED ), whose accurate estimation relies heavily on the accuracy, completeness, and depth of pedigree information. However, pedigree errors are common in many livestock populations [8]. Several alternative methods have been proposed to estimate the genomic inbreeding coefficient (genomic F) based on the development of genotype-based microarrays using single nucleotide polymorphisms (SNPs). These include the genomic relationship matrix (F GRM ), homozygosity (F HOM ), and ROH (F ROH ). The genomic coefficients derived from animals/populations can be calculated without pedigree records or incomplete pedigree information. In addition, genomic F may provide a more accurate measure of inbreeding levels, even with missing pedigree information [9,10]. Furthermore, compared with other genomic F indices, F ROH is the most powerful and accurate method for detecting inbreeding effects and is closest to the true inbreeding coefficient [11,12] . Thus, F ROH has been widely used to estimate genomic inbreeding in livestock in recent years [13].
In pigs, ROH can also be used to estimate the inbreeding coefficient in the absence of pedigree records. To date, ROH has been used to estimate inbreeding in several Western commercial pig breeds (WECPs), including Landrace (LAN) [14], Large White (LWY) [11], Piétrain (PIE) [15], and Duroc (DUC) breeds [16], as well as Chinese indigenous pig breeds, such as the Laiwu [17], Songliao black [18], Jinhua [19], Diannan small-ear [20], and Liangshan [21] breeds. Genomic regions with a high frequency of ROH (ROH islands) can also be used to detect associations between genes and economically important porcine traits. Previous reports have identified many genes associated with pig reproduction, meat quality, fat deposition, and disease resistance traits in ROH islands [17,20,22]. The presence of ROH islands in the porcine genome suggests the occurrence of selection for economically important traits and environmental adaptation.
Although ROH has been used for breeding estimates in many Western commercial and Chinese indigenous pig breeds, it has been used less frequently in Anhui indigenous pig breeds (AHIPs), including the Wei (YZ), Wannan black (WNHZ), Huai (HZ), Wannanhua (WNHUAZ), and Six White (LB). These breeds have improved meat quality [23], disease resistance [24], and high fertility [25] compared with major commercial lean pig breeds. Nevertheless, the number of AHIPs has declined sharply in the past 20 years due to the large number of Western pig breeds that have been imported to improve leanness in pork (China National Commission of Animal Genetic Resources, 2011). The African swine fever disease outbreak also caused problems for the breeding programmes. Thus, this study had the following aims: (1) to detect the differences in genetic structure between AHIPs and WECPs, including 150 AHIPs (YZ, WNHZ, HZ, WNHUAZ, and LB) and 170 WECPs (LAN, DUR, PIE, LWY, and BER (Berkshire)) using the Illumina porcine 80 K SNP BeadChip; (2) to identify the occurrence and distribution of ROHs in WECPs and AHIPs; (3) to calculate and compare the genomic inbreeding coefficients (F ROH ) between WECPs and AHIPs using ROHs; (4) to identify and compare potential ROH regions associated with economically important traits in AHIPs and WECPs. Our results could help preserve the genetic diversity of AHIPs, promoting sustainable breeding programmes for genetic improvement in these breeds.

Conclusion:
Our findings provide a useful reference for the selection and assortative mating of pig breeds, laying the groundwork for future research on the population genetic structures of AHIPs, ultimately helping protect these local varieties. Keywords: Anhui indigenous pig breeds, genetic structure, runs of homozygosity, Inbreeding coefficient, ROH island

Analysis of population genetic structure of ten pig populations
Using the genetic background information of the ten pig breeds, we examined the relatedness among populations of indigenous breeds (YZ, WNHZ, HZ, WNHUAZ, and LB) collected from Anhui Province, China. In addition, samples were collected from the five WECPs (LAN, LWY, BER, PIE, and DUR) and comparatively analysed (Fig. 1A). Principal component analysis (PCA) results and phylogenetic trees were used to visualise the genetic relationships among the ten breeds (Fig. 1B, C). The PCA results showed that the AHIPs and WECPs were clearly segregated along the PC1 axis. Furthermore, the five AHIP breeds were separated into four clusters, with WNHZ and WNHUAZ populations being classified together. Among the WECPs, the BER, DUR, and LWY pigs clustered separately, whereas the LAN and PIE pigs clustered together. The phylogenetic tree had patterns similar to those of the PCA results, showing that overall, the AHIPs and WECPs were distinguishable at the genomic level (Fig. 1C). The population genetic structure of the ten pig breeds (K = 2-10 clusters) is illustrated in Fig. 1D. Based on the cross-validation (CV = minimal) error, we identified an optimal value of K = 10 clusters, using which all ten pig breeds were clustered separately from each other. Using a K = 2, all pig breeds were collectively separated into two distinct clusters-AHIPs and WECPs. Taken together, the analysis results showed that the five AHIPs were closely related but had different genetic backgrounds, whereas the AHIPs and WECPs significantly differed.

Distribution of runs of homozygosity
A descriptive summary of the ROH numbers and length categories (1-5 Mb, 5-10 Mb, 10-20 Mb, 20-40 Mb, and > 40 Mb) in each pig breed is listed in Table 1 and illustrated in Fig. 2 Fig. 1A). Moreover, the average ROH length was highest in HZ pigs (7.51 ± 0.28 Mb) and lowest in LWY pigs (4.86 ± 0.11 Mb). The average number of ROHs per pig was highest in BER pigs (70.20 ± 1.36; range, 54-88) and lowest in YZ pigs (15.00 ± 1.54; range, 4-40). The number of ROHs per chromosome tended to increase with chromosome length and was lowest on SSC11 and highest on SSC1 (Fig. 2B). Some BER, LAN, PIE, and WNHUAZ individuals had extremely long ROHs (> 500 Mb) (Fig. 2C, D); in particular, one WHHUAZ individual had an ROH covering a total length of > 600 Mb. Compared to the WECPs, the AHIPs exhibited fewer total ROHs per individual (Fig. 1A). We also examined the total ROH numbers in each chromosome for all ten pig breeds (Fig. 2E). Compared to the AHIPs, the WECPs contained more ROH fragments in all 18 chromosomes. Furthermore, the AHIPs had a lower proportion of short ROH fragments in the length categories of 1-5 Mb (29.79%), 5-10 Mb (34.94%), and 10-20 Mb (45.88%), while a higher proportion of length categories of 20-40 Mb (56.12%) and > 40 Mb (81.82%), suggesting recent inbreeding events (Table 1). Additionally, the percentage of chromosome coverage by ROH in each breed is summarised in Table S1 and illustrated in Figure S1. Among the WECPs, the highest chromosome coverage by ROH was found in PIE (SSC18: 31.3%) and the lowest in SSC13 of LWY (SSC13: 5.4%). As for AHIPs, the highest was on chromosome 17 in WNHUAZ (29.6%), while the lowest was on chromosome 1 in LB (3.5%).
The descriptive statistics for ROH-based (F ROH ) and homozygous-based (F HOM ) inbreeding coefficients in different length categories are listed in Table 2 and illustrated in Fig. 3. The inbreeding coefficient of F HOM varied from 0.0971 ± 0.0531 (LB) to 0.3079 ± 0.0492 (LAN), and the values of F ROH(ALL) varied from 0.064 ± 0.007 (YZ) to 0.289 ± 0.008 (LAN). We also found a high correlation between F HOM and F ROH in all ten breeds (range, 0.947-0.991), and the average correlation between F ROH and F HOM in the ten breeds was 0.967. The genomic inbreeding coefficients (F ROH and F HOM ) were highest in the LAN, BER, and PIE breeds of WECPs, and lowest in the HZ and YZ breeds of AHIPs (Fig. 3A, B). Similar conclusions drawn from F ROH and F HOM estimates indicated a considerable difference in genomic inbreeding coefficients among the different pig breeds. Of note, WECPs had significantly higher genomic inbreeding coefficients than the AHIPs. These results showed that the F ROH values differed significantly between the WECP and AHIP pig breeds, indicating differences in directional selection and breeding goals.
We identified the genomic regions most commonly associated with ROHs in the ten pig breeds and plotted the percentages of SNPs in ROHs against the positions of the SNPs along the chromosomes (Supplementary Fig. 2). No ROH islands were found in the LB and YZ breeds. High percentages of SNPs in ROHs were found in the BER (SSC1, SSC3, SSC6, SSC9, SSC12, SSC15), PIE (SSC4), and LAN (SSC7, SSC10, SSC14, SSC15) breeds. The longest ROH island (6.27 Mb) was found in the WNHUAZ breed on SSC1, whereas the shortest (0.05 Mb) was found in the HZ breed on SSC8. The SNPs in ROH islands were compared between WECPs and AHIPs, and 220 and 748 unique SNPs were found in AHIPs and WECPs, respectively (Fig. 4A). A total of 27 genomic regions had a high frequency of ROHs (Table S2) and were found to contain 202 genes. Among these, 48 candidate genes were found only in AHIPs, and 146 were found only in WECPs (Fig. 4A). In addition, we aligned all of these ROH islands to the pig quantitative trait loci (QTL) database, revealing that meat-, carcass-, and production-related QTLs were enriched in 20 WECP genomic regions, while reproduction-, fatness-, and health-related QTLs were enriched in 7 AHIP genomic regions (Table S3).

GO enrichment analysis of candidate genes in WECPs and AHIPs
Gene Ontology (GO) enrichment analysis was performed separately for WECPs and AHIPs (Fig. 4B). Genes enriched in AHIPs were mainly involved in blastocyst development, response to progesterone/oestrogen, positive regulation of B cell receptor signalling pathway, and triglyceride catabolic process, whereas those in WECPs were involved in skeletal muscle tissue development, embryonic skeletal system morphogenesis, and cellular response to growth factor stimulus. Furthermore, nine candidate genes (CCNT2, EGR2, MYL3, CDH13, PROX1, FLVCR1, SETD2, FGF18, and FGF20) in WECPs were found to be closely associated (P adj < 0.05) with skeletal muscle tissue development, embryonic skeletal morphogenesis, and cellular response to growth factor stimulus. Eight candidate genes (CSN1S1, SULT1E1, TJP1, ZNF366, LIPC, MCEE, STAP1, and DUSP) related to sex hormones and reproductive development, fatty acid biosynthesis metabolism, and immune response regulation were selected for subsequent analyses (Fig. 4C). Similarly, QTL enrichment results also revealed that ROH islands in two AHIP breeds (WHHUAZ and HZ) were associated with QTLs of economically important traits such as health, reproduction, and fatness, whereas those in the WECP breeds were related to meat-and production-related traits (Table 3).

Discussion
The Anhui Province is one of the top ten provinces that traditionally produce pigs in China, and it has abundant genetic resources of indigenous pig breeds (YZ, WNHZ, HZ, WNHUAZ, and LB). Due to long-term natural adaptation and artificial selection, the AHIPs have gradually evolved high fertility, high fat content, excellent meat quality [23], disease resistance [24], good maternal stability [25], and crude feed tolerance [22]. In this study, WECPs and AHIPs significantly differed in terms of genetic backgrounds, consistent with previous studies showing that pigs were domesticated in at least two separate domestication centres, Europe and Asia [26][27][28]. Noteworthily, artificial selection has also played a vital role in AHIPs, especially for LB/YZ breeds, as genomic information from Western breeds flowed into LB/YZ breeds. This could be because, in the past 20 years, WECPs were selected and admixed with AHIPs to increase the allelic richness and improve the breeding stock of AHIPs (China National Commission of Animal Genetic Resources, 2011). Besides, the WNHUAZ and WNHZ breeds exhibited a close genetic relationship, suggesting that both breeds may have descended from the same ancestor [29], and gradually formed two different breeds due to geographical isolation and the long-term domestication process [30]. Furthermore, the similar number of ROHs and F ROH values in WNHUAZ and WNHZ also supported the notion that these breeds originated from a common population. However, due to the small sample size and marker density, the results of population genetic structure in the WECP and AHIP breeds are insufficient and need further investigation. The abundance, length, and genomic distribution of ROHs provide valuable information about the demographic history of livestock species [3]. In this study, the occurrence and distribution of ROHs were compared between five AHIPs and five WECPs. Most of the ROHs identified in our study belonged to the short (< 10 Mb) and medium (10-20 Mb) length categories, consistent with those reported in chickens [31], sheep [32], pigs [18], and cattle [33]. The short ROHs indicate ancient inbreeding, whereas long ROHs suggest recent inbreeding [34]. Compared with WECPs, AHIPs had more ROHs in 20-40 Mb and > 40 Mb categories, fewer ROHs in 1-5 Mb and 5-10 Mb. These results are consistent with those of previous studies [18,35]. The different distribution patterns of ROH numbers and lengths between the WECPs and AHIPs may be due to the selection of different traits in these breeds; WECP management primarily focuses on the production traits of pigs [36], whereas AHIPs are selected for meat quality and disease resistance [35].
With the development of high-throughput genotyping technologies, genetic markers can provide a more accurate estimate of population relationships in pigs than pedigree data, which may have missing or incorrect parent information [13,22]. In recent years, ROHs have been widely used to predict inbreeding levels in livestock [13]. F ROH estimates are more accurate for estimating autozygosity and detecting inbreeding effects than pedigree data [11], providing useful information about interindividual genetic relatedness. In this study, we used two indices, F ROH and F HOM , to estimate inbreeding coefficients in AHIPs and WECPs. Previous studies have reported that F ROH generally highly correlates with F HOM (r FROH,FHOM = 0.78-0.85) consistent with our results (r FROH,FHOM = 0.952-0.991) and previous studies [18,37]. Moreover, we found that F HOM values were higher than F ROH values in all ten pig breeds because the F HOM estimate cannot distinguish identity by descent alleles from identity by state alleles, inevitably overestimating inbreeding levels [38]. Although using F HOM to estimate the inbreeding coefficient is not sufficiently accurate, F ROH can alleviate the issues mentioned above. Thus, theoretically, F ROH may be a more effective and accurate alternative for quantifying relatedness and inbreeding levels [39]. Further, the F ROH of AHIPs is generally expected to be lower than that of WECPs. The contradictory results of our study may be explained by the small effective population size and random sampling errors in WECPs, resulting in higher inbreeding estimates for WECPs in recent generations [40].
We found that the ROH islands harboured several candidate genes controlling economically important traits in pigs. In particular, we identified 27 genomic regions with a high frequency of ROHs, harbouring 17 key candidate genes in WECPs and AHIPs. Furthermore, we identified eight candidate genes in the AHIPs, of which three (SULT1E1, LIPC, and MCEE) were involved in fat deposition, three (CSN1S1, TJP1, and ZNF366) were involved in reproduction, and two (STAP1 and DUSP1) were immune system-related. LIPC encodes hepatic lipase and affects the metabolism, composition, and expression of several lipoproteins [41,42]. SULT1E1, a negative regulator of adipogenesis [43], serves a strong regulatory function in lipid metabolism via the PPARγ pathway [44]. SULT1E1 is also reportedly linked to foetal development [45], and ablation of the murine SULT1E1 gene causes placental thrombosis and spontaneous foetal loss [46]. ZNF366 plays an important role in regulating the expression of target genes in response to oestrogen [47,48]. TJP1 has been related to testis weight, spermatogenesis, and the development of ovarian and cystic follicles [49,50]. CSN1S1 is an effective molecular marker for litter size in goat breeding [51]. STAP1 [52] and DUSP1 [53,54] are significantly associated with anti-inflammatory responses and immune infiltration in human autoimmune diseases. We also identified nine candidate genes in the WECPs, of which six (CDH13, PROX1, EGR2, CCNT2, SETD2, and MYL3) were related to muscular  FGF18, and FGF20) were involved in skeletal morphogenesis. Among these candidate genes, miR-15a [55] and miR-155-5p [56] inhibit skeletal muscle development and differentiation by targeting CCNT2. High expression levels of CDH13 promote muscle-type identity, as CDH13 plays an active role in myogenesis [57,58]. In pigs, MYL3 [59,60] and EGR2 [61,62] are associated with muscle formation and development. PROX1 is involved in muscle fibre conversion, and is a promising candidate gene affecting pork quality traits [59,63]. FGF18 [64,65] and FGF20 [66,67] are reported to play important roles in embryonic development, bone growth, and bone-related diseases. Moreover, FLVCR1 deficiency results in Diamond-Blackfan anaemia, often associated with skeletal malformations [68]. Based on the Pig QTL database, reproduction, fatness, and health traits overlapped in the ROH islands of AHIPs, while meat-and production-related traits were observed within ROH islands of WECPs. Overall, we found that the AHIP breeds were more adapted to fat deposition, disease resistance, and high fertility, whereas WECP pigs showed selection for production traits, such as muscular and skeletal development. Taken together, our results indicate that the WECP and AHIP breeds show adaptive differences in economically important traits.

Conclusions
In this study, we characterised the population genetic structure of WECPs and AHIPs and found that they had considerably different genetic backgrounds. Furthermore, the occurrence and distribution of ROHs were compared Fig. 4 Comparisons of the annotated genes in ROH hotspots in AHIP and WECP pigs. A Venn diagram of SNPs and related genes in ROH hotspots in AHIPs and WECPs. B Gene ontology (GO) analysis of the annotated genes in ROH hotspots in AHIPs and WECPs. Red italics represent GO terms associated with economically important traits in pigs. C The summary of GO analysis result (Fig. B) in 17 key candidate genes between AHIPs (eight) and WECPs (nine). The horizontal axis represents the gene name, and the vertical axis represents the GO term (selected from Fig. B) related to key candidate genes. The legend represents the P-value of the GO term. AHIPs, Anhui indigenous pig breeds; WECPs, Western commercial pig breeds across five AHIPs and five WECPs. Results revealed how diversity has evolved in the AHIP populations. F ROH and F HOM values were significantly lower in AHIPs than in WECPs, indicating that the breeding and conservation programmes were successful in AHIPs. Several genes with a high frequency of ROHs were identified. Among these, candidate genes in AHIPs were associated with fat deposition, disease resistance, and high fertility, whereas those in WECPs were related to muscular and skeletal development. Overall, our findings provide a helpful reference for selection and assortative mating programmes in pigs. Moreover, these results reveal a novel research direction regarding the population genetic structure of AHIP breeds, which might effectively help protect these valuable local varieties.

SNP genotyping and quality control
A total of 320 pigs were used in this study: 170 WECPs (Duroc, Landrace, Yorkshire, Berkshire, and Piétrain pig breeds) and 150 AHIPs from the Anhui Province, China. Detailed information on the sampled pig breeds in this study, including the regions of recollection, breed names and abbreviations, and sample size, are presented in Table S4. Genomic DNA was extracted from ear tissue and genotyped with the Illumina porcine 80 K SNP BeadChip (Illumina, San Diego, CA, USA). Only autosomal SNPs were used for further analyses. The PLINK software (v1.90) [69] was used for quality control of the data, and the following standards were set: (1) SNPs with a call rate of < 0.95 and unknown positions were removed (-geno 0.05); (2) SNPs with a minor allele frequency of < 0.05 were removed (-maf 0.05); (3) data from individuals with a call rate of < 0.90 were discarded (-mind 0.1); (4) Hardy-Weinberg Equilibrium (HWE) P-value < 1 × 10 -6 (-hwe 0.000001). The SNP genome coordinates were obtained from the Sus scrofa 11.1 porcine genome reference assembly. After genotype quality control, 1158 markers were excluded based on the HWE test (p ≤ 1 × 10 -6 ), 7231 SNPs failed the missingness test (GENO > 0.1), 9788 SNPs failed the frequency test (MAF < 0.05), yielding 320 individuals and 54,075 SNP for further analysis.

Population structure
The geographical distributions of five WECPs and five AHIPs were estimated using the ggmap package [70] in R statistical software. To illustrate the relationships among the ten pig breeds, PCA was performed using the GCTA software (-autosome -autosome-num 18 -make-grm -pca 3) [71]. A scatterplot was generated to visualise the first and second principal components based on a variance-standardised relationship matrix created using the PCA results. The ADMIXTURE software [72] was used to infer the most probable number of ancestral populations (K = 2-10) based on the SNP genotype data. A fivefold cross-validation (-cv) error for each K was used to select the optimal K. A phylogenetic tree was created for the ten pig breeds using the online tool, the Interactive Tree Of Life (iTOL, http:// itol2. embl. de/ perso nal_ page. cgi) [73].

Genomic inbreeding coefficients
ROHs were identified for each individual using the PLINK software (v1.90), which uses a sliding window technique to scan each individual's genotype at each marker position to detect homozygous segments [39]. We defined ROHs according to the following criteria: (1) the minimum ROH length was set to 1 Mb (-homozygkb 1000); (2) a minimum of 50 consecutive SNPs were included in an ROH (-homozyg-snp 50), which was calculated using the equation proposed by Lencz et al. [74]: where α is the percentage of false-positive ROHs (set to 0.05 in the present study), n s is the number of SNPs per individual, n i is the number of individuals, and het is the heterozygosity across all SNPs. After calculation, the minimum number of SNPs constituting an ROH was set to 50; (3) the maximum gap between consecutive SNPs was set to 1 Mb (-homozyg-gap 1000); (4) the minimum SNP density was set to 1 SNP every 100 kb (-homozygdensity 100); (5) the minimum number of SNPs in a sliding window was set to 50 (-homozyg-snp 50); (6) one heterozygous genotype and no more than five missing SNPs were allowed per window (-homozyg-window-het 1; -homozyg-window-missing 5); (7) the window threshold was set to 0.01 (-homozyg-window-threshold 0.01). In this study, we classified ROHs into five different categories according to their physical length: 1 to < 5 Mb, 5 to < 10 Mb, 10 to < 20 Mb, 20 to < 40 Mb and > 40 Mb. For each length category, we computed the frequency of ROH numbers and the average length of an ROH in every breed.

Inbreeding coefficient of ROH
To verify the accuracy of F ROH , we evaluated the genomic coefficients for the ten pig breeds using two methods: (1) PLINK v1.90 software was used to measure the inbreeding coefficient based on the difference between the l = log e α n s ×n i log e (1 − het) observed and expected numbers of homozygous genotypes (F HOM ) [74]. The inbreeding coefficient for an individual (F HOM ) was calculated as follows: where L is the number of genotyped autosomal SNPs, E is the number of homozygotes expected by chance, and O is the number of observed homozygotes. (2) Genomic inbreeding coefficients were also estimated based on ROH (F ROH ). The F ROH was calculated as follows: where L ROHi is the total length of ROH i on autosomes, and L auto is the autosomal genome length covered by the SNP chip. Furthermore, the correlation between F ROH and F HOM for each breed was calculated using Pearson's correlation.

Detection of common ROHs and gene annotation
To identify genomic regions with a high frequency of ROHs, we calculated the percentage of occurrences of SNPs in ROHs by counting the number of times an SNP was detected in those ROHs across individuals. In this study, the threshold used to define an ROH hotspot in the genome was 40%, in agreement with a previous report by Rui et al. [18]. Adjacent SNPs over this threshold were merged into genomic regions called ROH islands [75,76]. We used the porcine reference genome annotation file from the NCBI database (http:// asia. ensem bl. org/ Sus_ scrofa/ Info/ Index) to annotate the genes in the ROH islands. In addition, pig QTLdb (https:// www. anima lgeno me. org/ cgi-bin/ QTLdb/ SS/ index) was used to annotate the genes in the ROH islands. GO enrichment analysis of genes in the ROH islands was performed using g:Profiler (https:// biit. cs. ut. ee/ gprofi ler/ gost), and terms with a P-value greater than 0.05 were filtered. The biological function of each annotated gene within the ROH islands was determined through an extensive literature search. F HOM = i L ROH i L auto