- Research article
- Open access
Whole-genome resequencing provides insights into the population structure and domestication signatures of ducks in eastern China
BMC Genomics volume 22, Article number: 401 (2021)
Abstract
Background
Duck is an ancient domesticated animal with high economic value, used for its meat, eggs, and feathers. However, the origin of indigenous Chinese ducks remains elusive. To address this question, we performed whole-genome resequencing to explore the genetic relationships between domestic duck breeds and their potential wild ancestors in eastern China, and to understand how their genomes were shaped by different natural and artificial selective pressures.
Results
Here, we report the resequencing of 60 ducks from Chinese spot-billed ducks (Anas zonorhyncha), mallards (Anas platyrhynchos), Fenghua ducks, Shaoxing ducks, Shanma ducks and Cherry Valley Pekin ducks of eastern China (ten from each population) at an average effective sequencing depth of ~6× per individual. Population and demographic analyses revealed a deep phylogenetic split between wild (Chinese spot-billed ducks and mallards) and domestic ducks. By applying selective sweep analysis, we identified several candidate genes, important pathways and GO categories associated with artificial selection that were functionally related to cellular adhesion, type 2 diabetes, lipid metabolism, the cell cycle, liver cell proliferation, and muscle functioning in domestic ducks.
Conclusion
Genetic structure analysis showed a close genetic relationship between Chinese spot-billed ducks and mallards, supporting the view that Chinese spot-billed ducks contributed to the breeding of domestic ducks. During the long history of artificial selection, domestic ducks have developed complex biological adaptations to captivity.
Background
Domestication is the process by which animals adapt to a captive environment and to human interventions such as protection, food provisioning and managed breeding [1]. Compared to their wild ancestors, domestic animals show great variation in behavior, morphology and physiology in response to domestication, and this variation is the result of genetic changes across many generations. The genetic differentiation between domestic animals and their wild ancestors is influenced by multiple mechanisms, including selection, mutation, drift and gene flow [2]. Detecting selective signatures associated with domestication is important for understanding the genetic basis of both adaptation to new environments and rapid phenotypic change. In recent years, whole-genome resequencing has delivered a comprehensive view of the signatures left by domestication in species such as pigs [3], chickens [4], dogs [5] and yaks [6].
Chinese domestic ducks are among the earliest domesticated waterfowl in the world, dating back to 2228 years before present (YBP) [7]. China is famous for its abundance of waterfowl breeds; as many as 31 domestic duck breeds have been recognized. Owing to domestication and directional breeding, domestic ducks have many typical characteristics in morphology, behavior and production performance, such as reduced brain size [8], changes in leg morphology [9], decreased aggressive behavior [10] and higher egg productivity. Domestic ducks have been bred for various purposes, such as egg and/or meat production. Shaoxing and Shanma ducks are excellent Chinese egg-type duck breeds, characterized by small body size, early maturity and high productivity. In Chinese written history, the Shaoxing duck can be traced back to the Song Dynasty about 1000 years ago. After 50 years of systematic breeding, the egg production of Shaoxing ducks reached 300 eggs by 500 days of age [11]. The Shanma duck, another famous Chinese indigenous duck, has been domesticated for 400 years in Fujian Province [12]. The Fenghua (FH) duck is a special dual-purpose local duck breed in Zhejiang Province, which has a similar appearance to mallards. Unlike other domestic breeds, the Fenghua duck still retains some habits of wild ducks, such as seasonal reproduction, flying and high disease resistance, because of its short history of domestication. Chinese Pekin ducks were named Cherry Valley Pekin ducks after they were exported to the United Kingdom in 1872. After more than 100 years of intensive selection, Cherry Valley Pekin ducks are famous for their fast growth, high lean rate and high feed conversion ratio [13].
Although many studies have been conducted on the diversity and origin of Chinese domestic ducks by applying microsatellite markers, mitochondrial DNA sequencing and whole-genome resequencing, the origin and evolution of Chinese domestic ducks are still debated. Some scholars suggest that Chinese domestic ducks originated from wild mallards [14, 15], while others argue that domestic ducks might also originate from Chinese spot-billed ducks [16, 17]. Mallard is the most common wild duck species in China, which is of particular economic importance [18]. Chinese spot-billed duck is a close relative of mallard, with distributions partially overlapping in most of Japan, Korea, and northeastern China [19]. Owing to the observed hybridization of mallards and spot-billed ducks in East Asia [19], another hypothesis suggests that domestic ducks might originate from hybrids of mallards and spot-billed ducks [17, 20].
Ducks are not only economically important, but also serve as important non-model study systems in evolutionary biology [21]. Thus, elucidating the evolutionary history of the various domestic breeds is essential when attempting to understand how different selective regimes have shaped their genetic variation. Therefore, we sequenced the genomes of 60 individuals from two wild populations, the spot-billed ducks and mallards, and four indigenous Chinese breeds (Fenghua, Shaoxing, Shanma and Cherry Valley Pekin ducks) to explore the genetic relationships among wild and domestic ducks and identify the genomic footprints of selection during the domestication of native ducks.
Results
We selected 60 individuals from six breeds (mallard, Chinese spot-billed, Fenghua, Shaoxing, Shanma and Cherry Valley Pekin ducks) (Fig. 1 and Supplementary Table S1). Using the Illumina Genome Analyzer platform, we generated a total of 397.88 GB of clean data with an average of 6.63 GB per individual (Supplementary Table S2). A total of 2.5 billion reads mapped to 95.09% of the reference genome assembly with 6.52-fold average depth (Supplementary Table S3). We called 2,809,077 high-quality single nucleotide polymorphic sites (SNPs) for the 60 ducks; 63.92% (1.8 million) of the high-quality SNPs were located in intergenic regions, and only 1.94% (0.055 million) were located in exonic regions (Supplementary Table S4–5). We identified 42,463 synonymous and 12,084 nonsynonymous SNPs in exons, for a nonsynonymous/synonymous ratio of 0.28. In addition, 838,413 SNPs were shared among the six breeds (Supplementary Fig. S1).
Population genetic structure
To explore relatedness among the domestic ducks, we conducted a principal component analysis (PCA) based on genome-wide SNP data. The laying duck breeds (Shaoxing and Shanma ducks) and the meat duck breed (Cherry Valley Pekin duck) formed separate clusters that were also distinct from the wild populations (Chinese spot-billed duck and mallard) and the Fenghua duck (Fig. 2a, Supplementary Fig. S2). The neighbor-joining (NJ) tree revealed that the individuals from Chinese indigenous breeds were clustered into a subclade, suggesting they have a closer genetic relationship and potentially derive from a common ancestor (Fig. 2b). To estimate different ancestral proportions, we further performed a population structure analysis with FRAPPE by assuming K ancestral populations (Fig. 2c). When K = 2, a clear division was observed between wild and domestic ducks, with slight shared ancestry between these two groups. Moreover, Fenghua ducks appeared admixed, with individuals having on average 59% and 41% assignment probability to wild and domestic breeds, respectively, suggesting that they represent a wild × domestic duck hybrid population. When K = 5, there was a division between each group except Shaoxing and Shanma ducks.
Next, we used fineRADstructure [22] to further evaluate population structure by assessing individual coancestry plots across samples (Fig. 3). First, fineRADstructure recovered two major genetic clusters: one included Fenghua ducks, Chinese spot-billed ducks and mallards, and the second contained Shaoxing ducks, Shanma ducks and Cherry Valley Pekin ducks. Second, the resulting plot showed higher shared coancestry within each species than between species, and slightly higher coancestry levels were seen between mallards and Chinese spot-billed ducks, as well as between Shaoxing and Shanma ducks. These findings confirmed the PCA, phylogenetic tree and structure results, supporting their close evolutionary relationship [23,24,25]. Finally, Fenghua ducks showed similar coancestry levels with mallards and Chinese spot-billed ducks, although local records indicated that Fenghua ducks originated from mallards. Notably, some individuals showed a particularly high proportion of coancestry with others, which is unlikely to be explained by sibling status or artificial selection, and may be due to complex introgression patterns among these duck populations [26].
Patterns of genomic variation and linkage disequilibrium
The genome-wide average genomic diversity (θπ) values were 5.949 × 10⁻⁴ for mallard, 5.862 × 10⁻⁴ for Chinese spot-billed duck, 5.815 × 10⁻⁴ for Fenghua duck, 5.303 × 10⁻⁴ for Shaoxing duck, 5.462 × 10⁻⁴ for Shanma duck and 4.694 × 10⁻⁴ for Cherry Valley Pekin duck (Supplementary Table S6). These values were much lower than in other animals (Supplementary Table S7). The wild ducks had the greatest θπ and θW, suggesting that domestication reduces genetic diversity. Additionally, linkage disequilibrium (LD) analysis showed that the wild ducks had a faster decay of the pairwise correlation coefficient (r²) than the domestic ducks (Fig. 2d).
Demographic history
We employed the pairwise sequentially Markovian coalescent (PSMC) method [27] to infer fluctuations in the ancestral effective population sizes (Ne) of each breed in response to Quaternary climatic change (Fig. 4). From 1 million to 10 thousand years ago, all of the domestic breeds (Shaoxing, Shanma, Fenghua and Cherry Valley Pekin ducks) exhibited similar demographic trajectories, with a peak in ancestral Ne at 50–60 thousand years ago followed by distinct declines (Supplementary Fig. S3). The decline occurred ~60 thousand years ago, coinciding with the beginning of the Last Glacial Maximum [28]. The effective population sizes of mallard and spot-billed duck appear to have increased rapidly after ~40 and ~20 thousand years ago, respectively (Supplementary Fig. S3).
Genome-wide selective sweep test
To accurately detect the genomic footprints of selection, we pooled the domestic duck samples (Shaoxing, Shanma and Cherry Valley Pekin ducks) and compared them to the wild ducks (mallard and Chinese spot-billed duck), which are geographically close. Using the top 5% of the FST values and θπ ratio cutoffs (FST > 0.13 and log2(θπ ratio) ≥ 0.84, where the θπ ratio is θπ, wild duck/θπ, domestic duck), we identified 665 candidate domestication regions (CDRs) containing 387 genes under selection in the domestic ducks (Fig. 5a, Supplementary Table S8). We also calculated the Tajima’s D values of the selected genes, which were significantly lower than the values for other genes (Fig. 5b, c). In addition, ten candidate genes (Cmip, Tmem132b, Mphosph6, Smg7, Lyst, Zbtb37, Serpinc1, Npl, Tmem132c and Plcg2) ranking within the top 10 FST values with log2(θπ ratio) ≥ 0.84 were functionally involved in cellular adhesion, type 2 diabetes, lipid metabolism, the cell cycle, liver cell proliferation and muscle functioning [31,32,33,34,35,36] (Table 1).
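The outlier approach described above can be sketched as follows. This is a minimal illustration, assuming that each window already carries an FST value and per-group θπ values, and that the two 5% cutoffs are taken independently from the empirical distributions and then intersected. The function names, the dictionary keys and the quantile interpolation are assumptions made for this sketch and are not taken from the study's pipeline.

```python
import math


def quantile(values, q):
    """Empirical quantile with linear interpolation between order statistics."""
    s = sorted(values)
    pos = q * (len(s) - 1)
    lo, hi = math.floor(pos), math.ceil(pos)
    return s[lo] + (s[hi] - s[lo]) * (pos - lo)


def candidate_windows(windows, top=0.05):
    """Return indices of windows in the top tail of both statistics.

    windows: list of dicts with hypothetical keys 'fst', 'pi_wild', 'pi_dom'
    (pi_dom is assumed to be strictly positive).
    """
    ratios = [math.log2(w["pi_wild"] / w["pi_dom"]) for w in windows]
    fst_cut = quantile([w["fst"] for w in windows], 1 - top)
    ratio_cut = quantile(ratios, 1 - top)
    return [
        i
        for i, (w, r) in enumerate(zip(windows, ratios))
        if w["fst"] >= fst_cut and r >= ratio_cut
    ]
```

With real data, the two empirical cutoffs would play the role of the FST and log2(θπ ratio) thresholds reported above.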
To identify the active pathways in the domestication of ducks, the positively selected genes in domestic ducks were mapped to the canonical reference pathways in the KEGG database. The top three enriched pathways were “pantothenate and CoA biosynthesis” (2 genes, P = 0.02667), “FoxO signaling pathway” (6 genes, P = 0.03002), and “inositol phosphate metabolism” (4 genes, P = 0.03511) (Supplementary Fig. S4, Supplementary Table S9). The positively selected genes of domestic ducks were successfully annotated to 47 Gene Ontology (GO) categories belonging to three principal categories: cellular component, molecular function and biological process (Supplementary Fig. S5, Supplementary Table S10). In the “biological process” principal category, the most represented categories were “cellular process” (137 genes), followed by “single-organism process” (123 genes). In the “cellular component” principal category, the two most represented categories were “cell” (149 genes) and “cell part” (149 genes). In the “molecular function” principal category, the most represented category was “binding” (107 genes).
Positively selected genes involved in insulin signaling pathway
Using the top 5% of the FST values and θπ ratio cutoffs based on sliding 40 kb windows for the Shaoxing ducks compared to wild mallards, we identified 497 candidate domestication regions (CDRs) containing 311 genes with both high FST values and a high θπ ratio (Fig. 6a). Six genes exhibiting strong selective sweep signals were significantly over-represented in the insulin signaling pathway, including ectonucleotide pyrophosphatase/phosphodiesterase-1 (Enpp1), ectonucleotide pyrophosphatase/phosphodiesterase-3 (Enpp3), SHC adapter protein 4 (Shc4), SOS Ras/Rac guanine nucleotide exchange factor 1 (Sos1), neuroblastoma RAS viral oncogene homolog (Nras) and protein kinase cAMP-dependent type II regulatory subunit beta (Prkar2b).
Notably, we observed much higher FST values (Fig. 6c) and lower Tajima’s D values (Fig. 6d) for the target gene Enpp1 compared to those in the adjacent genomic regions, providing further support that the candidate genes were reliable. Eight SNPs were found in this sliding window (Fig. 6e). We also used transcriptome sequencing to investigate the molecular signatures of domestication and identified significant downregulation of Enpp1 expression in the muscle and liver tissues of Shaoxing ducks compared to mallards (Fig. 6b).
Transcriptome differences in muscle, liver and cerebellum between Shaoxing ducks and mallards
The Shaoxing duck is an outstanding representative of the local egg-laying duck breeds in China and contributes greatly to the Chinese waterfowl industry. To infer whether the potentially positively selected genes between mallards and Shaoxing ducks could also affect gene expression, we used an Illumina paired-end RNA-seq approach to sequence the breast muscle, liver and cerebellum of mallards and Shaoxing ducks. We obtained a total of 731 million clean reads, of which approximately 60.6% were successfully mapped to the duck genome (Supplementary Table S11). Compared with mallards, 319, 161 and 28 differentially expressed genes were identified in the muscle, liver and cerebellum of Shaoxing ducks, respectively (Supplementary Fig. S6, Supplementary Table S13–18). Six positively selected genes from the resequencing analysis, including Coq9, Adamts9, Zcchc24, Eya1, Enpp3 and Enpp1, were differentially expressed in muscle (Supplementary Fig. S10). However, only Enpp1 was found to be differentially expressed in liver. GO enrichment analysis was performed to discover the major functional categories represented in these genes. The enriched GO categories were related to cellular process, single-organism process, biological regulation, binding and catalytic activity (Supplementary Fig. S7, S8 and S9). A few KEGG pathways were significantly enriched in muscle, including oxidative phosphorylation, fatty acid degradation, and cardiac muscle contraction (Supplementary Table S12).
Discussion
Population structure
In this study, we carried out whole-genome resequencing of 60 individuals to explore the genetic relationships among domestic ducks and wild ducks in eastern China. PCA and structure analysis clearly distinguished the wild ducks from the domesticated ducks. Notably, individuals from Chinese spot-billed ducks and mallards clustered together in the PCA plot and were separated in the structure analysis only from K = 5, indicating a close relationship between them. Further, we constructed an NJ tree based on whole-genome variants to infer the phylogenetic relationships of these ducks. The result showed that the four domestic duck breeds belong to the same large branch, which was consistent with previous studies suggesting a single domestication of domestic ducks [15, 37]. Additionally, the phylogenetic tree confirmed that the Chinese spot-billed duck is a sister clade of the mallard. Moreover, we found that the Chinese spot-billed duck shared a relatively high degree of coancestry with the mallard. Taken together, these results supported that Chinese spot-billed ducks and mallards were weakly genetically differentiated, although they were quite different in morphological appearance (Fig. 1). This was not surprising, as hybridization is common between Chinese spot-billed ducks and mallards. Several mallard × Chinese spot-billed duck hybrids have recently been reported in Hong Kong, China [38], Khanka Lake, Russia [39], and Tokyo, Japan [19]. Asymmetric hybridization and sex-biased gene flow between Chinese spot-billed ducks and mallards were also confirmed [19, 25]. Due to the close genetic relationship between the Chinese spot-billed duck and the mallard, it was difficult to distinguish the role of the Chinese spot-billed duck in the origin of domestic ducks. Moreover, we found that the coancestry between the Chinese spot-billed duck and domestic ducks was similar to that between the mallard and domestic ducks. Taken together, our results indicated that the Chinese spot-billed duck also made a substantial genetic contribution to Chinese domestic ducks.
Demographic history
We carried out PSMC analysis to infer fluctuations in the historical effective population size (Ne) of the 6 breeds, and observed similar trajectories for the four Chinese domestic ducks, with an apparent expansion during the Penultimate Glaciation and the Last Interglaciation, and a decline between 50 and 60 thousand years ago (Fig. 4a). However, the mallard and Chinese spot-billed duck populations reached their pinnacle between 20 and 40 thousand years ago. The trend of Ne is similar to that in previous studies of other Chinese domestic ducks, in which Ne increased in the interglacial periods and decreased in the Pleistocene [37, 40]. In coastal regions such as eastern China, Quaternary glacial-interglacial changes in climate (Fig. 4b) and sea level (Fig. 4c) had major effects on terrestrial plant and animal communities. The population expansion during interglacial periods can be explained by the warm and humid weather [41]. Besides, a severe reduction of Ne approximately coinciding with the beginning of the Last Glacial Period, or occurring during this period, was observed in many avian populations, which may be due to climatic deterioration, habitat loss, and reduction of food supply [21]. Therefore, we believe that similar factors were responsible for the bottleneck of ducks during the Last Glacial Period.
Selection for domestication
The Shaoxing duck is a typical Chinese egg-type duck breed that has been under intense artificial selection for excellent egg production. In order to identify the candidate regions targeted by selection in Chinese native ducks during domestication, we scanned the genomes of Shaoxing ducks and mallards for regions with extreme FST and the highest θπ ratio. In our results, Enpp1, Enpp3, Shc4, Sos1, Nras and Prkar2b, which are related to the insulin signaling pathway, showed signals of positive selection in the Shaoxing duck. Enpp1 is a member of the ENPP family that directly interacts with the insulin receptor and blocks the insulin signaling pathway [42], serving as a gatekeeper of insulin action. Transcriptome results showed that the expression level of Enpp1 in the skeletal muscle and liver of Shaoxing ducks was significantly lower than that of mallards, suggesting that Enpp1 may have played a crucial role in duck domestication by improving insulin sensitivity. Enpp3 is positively associated with the serum ATP concentration, facilitating lipid deposition [43]. Enpp3 and Prkar2b were also identified as targets of selection during the domestication of Pekin ducks and other indigenous Chinese ducks [7]. The protein encoded by Prkar2b is a regulatory subunit of protein kinase A (PKA) and is involved in insulin resistance [44]. Shc4 (also known as ShcD) serves as a phosphotyrosine adapter molecule that induces Ras GTPase and mitogen-activated protein kinase (MAPK) activation [45]. In addition, Sos1 and Nras encode proteins that are involved in regulating the activation of the Ras/MAPK signaling pathway, which helps to control insulin signaling (Fig. 7).
Skeletal muscle plays an important role in regulating glucose uptake and body metabolism [48]. The association between increased muscle lipid content and insulin resistance has been confirmed [49], and improving insulin sensitivity has been observed to help increase muscle mass in songbirds [50]. Additionally, our earlier study confirmed that Shaoxing ducks had lower intramuscular fat content than mallards [51]. The positive selection of genes associated with insulin signaling and the decrease in muscle lipid content indicated that the insulin sensitivity of the Shaoxing duck was improved, with increasing muscle mass, during domestication, in line with breeding objectives.
Conclusion
In conclusion, we performed whole-genome resequencing to characterize the evolutionary origin of ducks in eastern China and the genome-wide signatures of artificial selection associated with domestication. We have shown that the Chinese spot-billed duck was closely related to the mallard and contributed to the origin of domestic ducks. Several candidate genes, important pathways and GO categories associated with artificial selection were functionally related to cellular adhesion, type 2 diabetes, lipid metabolism, the cell cycle, liver cell proliferation, and muscle functioning in domestic ducks. We found strong genomic evidence for the involvement of the insulin signaling pathway in the domestication of the Shaoxing duck. These results advance our understanding of the genetic relationships between domestic and wild ducks, reveal the genetic footprints of domestication and shed light on the genetic mechanisms underlying species adaptation to captivity.
Methods
Sampling
Blood samples from all 60 individual ducks (10 per breed) were collected from the wing vein using vacuum tubes containing EDTA-K2 as an anticoagulant. The spot-billed ducks, mallards and Fenghua ducks were captured in Fenghua City, Zhejiang Province, China (29°35′ N, 121°24′ E). The Shaoxing and Shanma ducks were collected in Zhuji City, Zhejiang Province, China (29°38′ N, 120°10′ E), and the Cherry Valley Pekin ducks were raised in Huzhou City, Zhejiang Province, China (30°41′ N, 120°19′ E). From the Shaoxing ducks and mallards, three randomly selected ducklings were killed by rapid decapitation followed by sterile dissection, and muscle and liver tissues were sampled and immediately snap frozen in liquid nitrogen.
Sequencing and quality control
A total of 60 ducks, which were sampled from Eastern China, were sequenced on the Illumina HiSeq 2000 platform (Illumina, San Diego CA, USA). We generated a total of 401.491 Gb of raw sequence data (supplementary Table S2).
Raw reads in fastq format were first processed for quality using in-house C scripts. Specifically, low-quality reads were filtered out according to the following criteria [52]: reads with ≥ 10% unidentified nucleotides (N); reads with > 50% of bases having a phred quality < 5; reads with > 10 nt aligned to the adapter, allowing ≤ 10% mismatches; and putative PCR duplicates generated by PCR amplification in the library construction process (read 1 and read 2 of two paired-end reads that were completely identical).
Consequently, 387.88 Gb were retained for assembly, of which 96.06% and 91.52% of the bases had quality scores ≥ Q20 and ≥ Q30, respectively.
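The N-content, base-quality and duplicate criteria listed above can be illustrated with a short sketch. It is not the in-house C code used in the study; it assumes Phred+33 quality encoding and non-empty reads, omits the adapter-alignment step, and uses function names chosen only for this example.

```python
def phred_scores(quality_string, offset=33):
    """Decode a FASTQ quality string (Phred+33 encoding is an assumption)."""
    return [ord(ch) - offset for ch in quality_string]


def passes_quality(seq, qual, max_n_frac=0.10, low_q=5, max_low_frac=0.50):
    """Apply the N-content and low-quality-base criteria to one read."""
    n_frac = seq.upper().count("N") / len(seq)
    if n_frac >= max_n_frac:  # reads with >= 10% N are discarded
        return False
    scores = phred_scores(qual)
    low_frac = sum(q < low_q for q in scores) / len(scores)
    return low_frac <= max_low_frac  # discard if > 50% of bases have Q < 5


def drop_duplicate_pairs(pairs):
    """Keep the first copy of each (read 1, read 2) pair with identical sequences."""
    seen = set()
    kept = []
    for read1, read2 in pairs:
        if (read1, read2) not in seen:
            seen.add((read1, read2))
            kept.append((read1, read2))
    return kept
```

A read pair would typically be retained only when both mates pass `passes_quality`, although the source does not state how mates were combined.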
Reads mapping and SNP calling
The remaining high-quality reads were mapped to the mallard (Anas platyrhynchos) reference genome (BGI_duck_1.0) [53] using the Burrows-Wheeler Aligner (version 0.7.8) [54] with the command line ‘aln -e 10 -t 4 -l 32 -i 15 -q 10’. SAMtools was used to remove duplicated reads in order to reduce mismatches generated by PCR amplification before sequencing.
After alignment, we used SAMtools [55] to carry out SNP calling. The ‘mpileup’ command was used to identify SNPs with the parameters ‘-q 1 -C 50 -S -D -m 2 -F 0.002’. The following filtering steps were applied to obtain high-quality SNPs: quality score ≥ 20; coverage depth ≥ 2 and ≤ 1000.
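The SNP filter above can be expressed as a simple predicate on VCF records. The sketch below assumes tab-separated VCF lines in which the site quality is in the QUAL column and the coverage depth is the `DP` tag of the INFO column; whether the study applied the depth limits to site-level or sample-level depth is not stated, so the site-level reading here is an assumption, as are the function names.

```python
def parse_depth(info):
    """Return the integer DP value from a VCF INFO string, or None if absent."""
    for field in info.split(";"):
        if field.startswith("DP="):
            return int(field[3:])
    return None


def keep_site(vcf_line, min_qual=20.0, min_dp=2, max_dp=1000):
    """True if a VCF data line passes the quality and depth thresholds."""
    cols = vcf_line.rstrip("\n").split("\t")
    qual = cols[5]  # QUAL is the sixth VCF column
    depth = parse_depth(cols[7])  # INFO is the eighth VCF column
    if qual == "." or depth is None:
        return False
    return float(qual) >= min_qual and min_dp <= depth <= max_dp
```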
Annotation of genetic variants
Using the ANNOVAR package [56], 2,809,077 high-quality SNPs were annotated according to the genome. Based on the genome annotation, SNPs were classified into several categories, such as exonic regions, intronic regions, splicing sites, upstream and downstream regions and intergenic regions. SNPs from coding exon regions were identified as either synonymous or nonsynonymous.
Principal component analysis
The software GCTA [57] was used for PCA to clarify the relationships among the 60 individuals. The significance level of the eigenvectors was determined using the Tracy-Widom test. The first three significant components were plotted (Supplementary Fig. S2); the separation of the points reflects, to a degree, the real structure of the populations.
Phylogenetic and population structure analysis
First, we inferred an individual-based neighbor-joining (NJ) tree from a data matrix of 2,809,077 SNPs using TreeBeST (http://treesoft.sourceforge.net/treebest.shtml#inno) based on the p-distance. A total of 1000 bootstrap replicates were used to evaluate the reliability of the branches.
Second, the population genetic structure of the 60 individuals was inferred by FRAPPE [58]. We set the number of clusters (K) from 2 to 6 and ran the analysis with 10,000 iterations.
Third, population structure was assessed using fineRADstructure [22], which calculates recent shared co-ancestry based on patterns of genomic similarity. The vcf file was transformed using hapsFromVCF module, and then the co-ancestry matrix was calculated and used to identify populations. The MCMC chain ran with a thinning interval of 1000, a burnin of 100,000, and 100,000 iterations.
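As an illustration of the p-distance that underlies the NJ tree in the first step, the sketch below computes a pairwise distance matrix from diploid SNP genotypes. It assumes genotypes coded as alternative-allele counts (0, 1 or 2, with `None` for missing calls) and defines the distance as the mean proportion of differing alleles over jointly genotyped sites; the exact definition implemented in TreeBeST may differ, and the function names are hypothetical.

```python
def p_distance(geno_a, geno_b):
    """Mean per-site allelic difference between two individuals.

    Genotypes are alt-allele counts (0/1/2); sites missing in either
    individual (None) are skipped.
    """
    diff = 0.0
    compared = 0
    for a, b in zip(geno_a, geno_b):
        if a is None or b is None:
            continue
        diff += abs(a - b) / 2
        compared += 1
    return diff / compared if compared else float("nan")


def distance_matrix(genotypes):
    """Symmetric matrix of pairwise p-distances for a list of individuals."""
    n = len(genotypes)
    return [
        [p_distance(genotypes[i], genotypes[j]) for j in range(n)]
        for i in range(n)
    ]
```

A matrix of this kind is the usual input to a neighbor-joining implementation.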
Linkage disequilibrium analysis
We compared the pattern of linkage disequilibrium (LD) among the 6 breeds using the 2.8 million high-quality SNPs. To estimate LD decay, we calculated the squared correlation coefficient (r²) between pairwise SNPs using the software Haploview [59]. The average r² value was calculated for pairwise markers in a 500-kb window and averaged across the whole genome.
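The r² statistic and its decay with distance can be sketched as below. For simplicity, the example assumes phased haplotypes coded 0/1 and positions sorted in ascending order, whereas Haploview estimates haplotype frequencies from genotype data; the 100-kb distance bins and the function names are choices made for this illustration only.

```python
def r_squared(hap_a, hap_b):
    """Squared correlation between two biallelic loci from 0/1 haplotypes."""
    n = len(hap_a)
    p_a = sum(hap_a) / n
    p_b = sum(hap_b) / n
    p_ab = sum(1 for a, b in zip(hap_a, hap_b) if a == 1 and b == 1) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:  # at least one locus is monomorphic
        return float("nan")
    d = p_ab - p_a * p_b  # coefficient of linkage disequilibrium D
    return d * d / denom


def ld_decay(positions, haplotypes, max_dist=500_000, bin_size=100_000):
    """Average r2 per distance bin for SNP pairs up to max_dist apart."""
    sums, counts = {}, {}
    for i in range(len(positions)):
        for j in range(i + 1, len(positions)):
            dist = positions[j] - positions[i]
            if dist > max_dist:
                break  # positions are sorted, so later pairs are farther apart
            r2 = r_squared(haplotypes[i], haplotypes[j])
            if r2 != r2:  # skip NaN
                continue
            b = dist // bin_size
            sums[b] = sums.get(b, 0.0) + r2
            counts[b] = counts.get(b, 0) + 1
    return {b * bin_size: sums[b] / counts[b] for b in sorted(sums)}
```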
Effective population size
We used the pairwise sequentially Markovian coalescent (PSMC) model, a hidden Markov model (HMM), to reconstruct the demographic history of the 60 individuals. First, we called the genotypes of each individual using SAMtools [54] with the ‘mpileup’ command and the parameters ‘-C 50 -D -S -m 2 -F 0.002’. Then, we used the program ‘fq2psmcfa’ to convert the consensus sequence to the required input format and ran PSMC with the parameters ‘-N30 -t15 -r5 -p "4+25*2+4+6"’. A mutation rate (μ) of 1.6 × 10⁻⁹ per bp per generation [14] and a generation time of 1 year were used for the analysis. In addition, we applied a bootstrapping approach, repeating sampling 100 times, to estimate the variance of the results.
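The role of the mutation rate and generation time can be made explicit with a short scaling sketch. PSMC reports time and population size in scaled units; the conversion below follows the convention commonly used when plotting PSMC output, under the assumption of the default 100-bp bin size of the input format. The function name and the numerical inputs in the usage example are hypothetical and are not values from this study.

```python
def scale_psmc(theta0, t_k, lambda_k, mu=1.6e-9, generation_years=1.0, bin_size=100):
    """Convert scaled PSMC output to years and effective population size.

    theta0   : per-bin scaled mutation rate estimated by PSMC
    t_k      : scaled time of an interval boundary (units of 2*N0 generations)
    lambda_k : relative population size in that interval
    """
    n0 = theta0 / (4.0 * mu * bin_size)  # reference effective size N0
    years = 2.0 * n0 * t_k * generation_years
    ne = n0 * lambda_k
    return years, ne
```

For example, `scale_psmc(6.4e-5, 0.5, 2.0)` corresponds to N0 = 100, giving 100 years and Ne = 200 for these deliberately small toy inputs.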
Selective sweep analysis
The nucleotide diversity (θπ), population-differentiation statistic (FST), Tajima’s D statistic and Watterson estimator (θW) were calculated in sliding windows of 40 kb with a 20 kb overlap between adjacent windows. The putative genomic regions under positive selection during domestication were extracted as those with both the largest differences in genetic diversity (log2(θπ ratio)) and FST values in the top 5%. We identified a total of 665 potential selective-sweep regions overlapping with 387 candidate genes in the pooled domestic ducks and 491 potential selective-sweep regions overlapping with 311 candidate genes in Shaoxing ducks, which were used for subsequent analysis and discussion.
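The window-based diversity statistics can be sketched as follows, using the standard estimators of θπ, θW and Tajima's D computed from alternative-allele counts at the sites within one window. The sketch assumes complete data (the same number of sampled chromosomes at every site); the function names and the handling of the final, shorter window are assumptions of this example rather than details of the study's software.

```python
import math


def sliding_windows(chrom_len, size=40_000, step=20_000):
    """Yield (start, end) windows of 40 kb that advance by 20 kb."""
    start = 0
    while start < chrom_len:
        yield start, min(start + size, chrom_len)
        start += step


def window_stats(alt_counts, n_chrom, window_bp):
    """Per-bp theta_pi and theta_W plus Tajima's D for one window."""
    seg = [c for c in alt_counts if 0 < c < n_chrom]  # segregating sites only
    s = len(seg)
    # Expected pairwise differences summed over sites
    pi = sum(2.0 * c * (n_chrom - c) / (n_chrom * (n_chrom - 1)) for c in seg)
    a1 = sum(1.0 / i for i in range(1, n_chrom))
    a2 = sum(1.0 / (i * i) for i in range(1, n_chrom))
    theta_w = s / a1
    if s == 0:
        tajima_d = float("nan")
    else:
        b1 = (n_chrom + 1) / (3.0 * (n_chrom - 1))
        b2 = 2.0 * (n_chrom**2 + n_chrom + 3) / (9.0 * n_chrom * (n_chrom - 1))
        c1 = b1 - 1.0 / a1
        c2 = b2 - (n_chrom + 2) / (a1 * n_chrom) + a2 / (a1 * a1)
        e1 = c1 / a1
        e2 = c2 / (a1 * a1 + a2)
        tajima_d = (pi - theta_w) / math.sqrt(e1 * s + e2 * s * (s - 1))
    return {
        "theta_pi": pi / window_bp,
        "theta_w": theta_w / window_bp,
        "tajima_d": tajima_d,
    }
```

An excess of rare alleles in a window, as expected after a selective sweep, gives θπ < θW and therefore a negative Tajima's D.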
Functional enrichment analysis
Gene Ontology (GO) term enrichment analysis of the genes under selection was performed with the GOseq R package, in which gene length bias was corrected; the same candidate genes were used for functional pathway analysis. GO terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways with a Benjamini-adjusted P-value less than 0.05 were considered significantly enriched.
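The multiple-testing adjustment mentioned above is illustrated here with the Benjamini-Hochberg step-up procedure, on the assumption that this is what "Benjamini adjusted" refers to. The function name is hypothetical and the sketch is independent of the GOseq package.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down, enforcing monotonicity
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

Terms whose adjusted value falls below 0.05 would then be reported as significantly enriched.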
RNA-seq and gene expression analysis
To infer whether the genes under selection could also affect gene expression between Shaoxing ducks and mallards, we compared gene expression in breast muscle, liver and cerebellum between these two groups. Three mature females each from the Shaoxing ducks and mallards were selected for transcriptomic analysis.
All samples were individually sequenced on the Illumina HiSeq 4000 sequencing platform. Perl scripts were used to ensure the quality of the raw data. The reference genome and the annotation file were downloaded from the ENSEMBL database (http://www.ensembl.org/index.html). We used Bowtie/Bowtie 2 to build the genome index and TopHat v2.0.12 to map clean data to the reference genome. HTSeq v6.0 was used to count the number of fragments for each gene in each sample. The expression level of genes in each sample was estimated by FPKM (Fragments Per Kilobase per Million mapped fragments). We used DEGseq v1.18.0 to analyze differential gene expression between Shaoxing ducks and mallards. Genes with q ≤ 0.05 and |log2Ratio| ≥ 1 were identified as differentially expressed genes.
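The FPKM normalization and the thresholds for differential expression can be written out as a brief sketch. The pseudocount added before taking the log ratio is an assumption introduced here to avoid division by zero and is not described in the source; the function names are likewise hypothetical, and the q-values are assumed to come from the differential expression software.

```python
import math


def fpkm(fragments, gene_length_bp, total_mapped_fragments):
    """Fragments per kilobase of transcript per million mapped fragments."""
    return fragments * 1e9 / (gene_length_bp * total_mapped_fragments)


def log2_ratio(expr_a, expr_b, pseudocount=1.0):
    """log2 expression ratio; the pseudocount is an assumption of this sketch."""
    return math.log2((expr_a + pseudocount) / (expr_b + pseudocount))


def is_differentially_expressed(log2_fold, q_value, q_cut=0.05, lfc_cut=1.0):
    """Apply the q <= 0.05 and |log2Ratio| >= 1 thresholds."""
    return q_value <= q_cut and abs(log2_fold) >= lfc_cut
```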
Availability of data and materials
The raw sequence data files discussed in this study have been deposited in the SRA under BioProject ID PRJNA599025 (https://dataview.ncbi.nlm.nih.gov/object/PRJNA599025).
Abbreviations
- YBP: Years before present
- SB: Spot-billed duck
- MA: Mallard
- FH: Fenghua duck
- SX: Shaoxing duck
- SM: Shanma duck
- CV: Cherry Valley Pekin duck
- SNP: Single nucleotide polymorphic site
- PCA: Principal component analysis
- NJ tree: Neighbor-joining tree
- PSMC: Pairwise sequentially Markovian coalescent
- GO: Gene ontology
- CDR: Candidate domestication region
- Enpp1: Ectonucleotide pyrophosphatase/phosphodiesterase-1
- Enpp3: Ectonucleotide pyrophosphatase/phosphodiesterase-3
- Shc4: SHC adapter protein 4
- Sos1: SOS Ras/Rac guanine nucleotide exchange factor 1
- Nras: Neuroblastoma RAS viral oncogene homolog
- Prkar2b: Protein kinase cAMP-dependent type II regulatory subunit beta
References
Price EO. Behavioral aspects of animal domestication. Q Rev Biol. 1984;59(59):1–32. https://doi.org/10.1086/413673.
Mclaughlin JF, Faircloth BC, Glenn TC, Winker K. Divergence, gene flow, and speciation in eight lineages of trans-Beringian birds. Mol Ecol. 2020;29(18):3526–42. https://doi.org/10.1111/mec.15574.
Li M, Tian S, Yeung CK, Meng X, Tang Q, Niu L, et al. Whole-genome sequencing of Berkshire (European native pig) provides insights into its origin and domestication. Sci Rep. 2014;4(4):4678. https://doi.org/10.1038/srep04678.
Rubin CJ, Zody MC, Eriksson J, Meadows JR, Sherwood E, Webster MT, et al. Whole-genome resequencing reveals loci under selection during chicken domestication. Nature. 2010;464(7288):587–91. https://doi.org/10.1038/nature08832.
Axelsson E, Ratnakumar A, Arendt ML, Maqbool K, Webster MT, Perloski M, et al. The genomic signature of dog domestication reveals adaptation to a starch-rich diet. Nature. 2013;495(7441):360–4. https://doi.org/10.1038/nature11837.
Qiang Q, Wang L, Wang K, Yang Y, Tao M, Wang Z, et al. Yak whole-genome resequencing reveals domestication signatures and prehistoric population expansions. Nat Commun. 2015;6(1):10283. https://doi.org/10.1038/ncomms10283.
Zhang Z, Jia Y, Almeida P, Mank JE, Tuinen M, Wang Q, et al. Whole-genome resequencing reveals signatures of selection and timing of duck domestication. GigaScience. 2018;7(4):1. https://doi.org/10.1093/gigascience/giy027.
Guay P, Iwaniuk AN. Captive breeding reduces brain volume in waterfowl (Anseriformes). Condor. 2008;110(2):276–84. https://doi.org/10.1525/cond.2008.8424.
Duggan BM, Hocking PM, Schwarz T, Clements DN. Differences in hindlimb morphology of ducks and chickens: effects of domestication and selection. Genet Sel Evol. 2015;47(1):88. https://doi.org/10.1186/s12711-015-0166-9.
Yang S, Zhou L, Lin W, Li X, Lu M, Liu C. Behavioral differentiation between Anas poecilorhyncha and domestic duck. J Agric Sci Technol. 2016;4:270–82. https://doi.org/10.17265/2161-6256/2016.04.007.
Ren J, Lu L, Liu X, Tao Z, Zhang C, Wang D, et al. Paternity assessment: application on estimation of breeding value in body-weight at first egg trait of egg-laying duck (Anas platyrhynchos). Mol Biol Rep. 2009;36(8):2175–81. https://doi.org/10.1007/s11033-008-9432-z.
Zhu Z, Miao Z, Chen H, Xin Q, Li L, Lin R, et al. Ovarian transcriptomic analysis of Shan Ma ducks at peak and late stages of egg production. Asian-Australas J Anim Sci. 2017;30(9):1215–24. https://doi.org/10.5713/ajas.16.0470.
Cherry P, Morris TR. Domestic duck production: science and practice. Oxfordshire, UK: CABI; 2008. https://doi.org/10.1079/9780851990545.0000.
Zhang Y, Chen Y, Zhen T, Huang Z, Chen C, Li X, et al. Analysis of the genetic diversity and origin of some Chinese domestic duck breeds. J Integr Agric. 2014;13(4):849–57. https://doi.org/10.1016/S2095-3119(13)60447-5.
Zhou Z, Li M, Cheng H, Fan W, Yuan Z, Gao Q, et al. An intercross population study reveals genes associated with body size and plumage color in ducks. Nat Commun. 2018;9(1):2648. https://doi.org/10.1038/s41467-018-04868-4.
Li H, Zhu W, Song W, Shu J, Han W, Chen K. Origin and genetic diversity of Chinese domestic ducks. Mol Phylogenet Evol. 2010;57(2):634–40. https://doi.org/10.1016/j.ympev.2010.07.011.
Chang H. Conspectus of genetic resources of livestock. Beijing: Chinese Agriculture Press; 1995.
Tu J, Si F, Xing X, Yue Z, Yang F. Determination and analysis of complete mitochondrial genome sequence of mallard (Anas platyrhychos). Mitochondrial DNA. 2012;27(4):682–285. https://doi.org/10.3109/19401736.2012.674121.
Kulikova IV, Zhuravlev YN, McCracken KG. Asymmetric hybridization and sex-biased gene flow between eastern spot-billed ducks Anas zonorhyncha and mallards A. platyrhynchos in the Russian Far East. Auk. 2004;121(3):930–49. https://doi.org/10.1093/auk/121.3.930.
Qiu X. China chicken breeds collection. Shanghai: Scientific and Technical Publishers; 1989.
Nadachowska-Brzyska K, Li C, Smeds L, Zhang G, Ellegren H. Temporal dynamics of avian populations during Pleistocene revealed by whole-genome sequences. Curr Biol. 2015;25(10):1375–80. https://doi.org/10.1016/j.cub.2015.03.047.
Milan M, Emiliano T, Daniel JL, Daniel F. RADpainter and fineRADstructure: population inference from RADseq data. Mol Biol Evol. 2018;35(5):1284–90. https://doi.org/10.1093/molbev/msy02.
Li H, Song W, Shu J, Chen K, Zhu W, Han W, et al. Genetic diversity and population structure of 10 Chinese indigenous egg-type duck breeds assessed by microsatellite polymorphism. J Genet. 2010;89(1):65–72. https://doi.org/10.1007/s12041-010-0012-3.
Lavretsky P, McCracken KG, Peters JL. Phylogenetics of a recent radiation in mallards and allies (Aves: Anas): inferences from a genomic transect and the multispecies coalescent. Mol Phylogenet Evol. 2014;70:402–411. https://doi.org/10.1016/j.ympev.2013.08.008.
Wang W, Wang Y, Lei F, Liu Y, Wang H, Chen J. Incomplete lineage sorting and introgression in the diversification of Chinese spot-billed ducks and mallards. Curr Zool. 2019;65(5):589–97. https://doi.org/10.1093/cz/zoy074.
Guo X, Wang Z, Wang S, Li H, Suwannapoom C, Wang J, et al. Genetic signature of hybridization between Chinese spot-billed ducks and domesticated ducks. Anim Genet. 2020;51(6):866–75. https://doi.org/10.1111/age.13002.
Li H, Durbin R. Inference of human population history from individual whole-genome sequences. Nature. 2011;475(7357):493–6. https://doi.org/10.1038/nature10231.
Zheng B, Xu Q, Shen Y. The relationship between climate change and quaternary glacial cycles on the Qinghai-Tibetan plateau: review and speculation. Quat Int. 2002;97-8(1):93–101. https://doi.org/10.1016/S1040-6182(02)00054-X.
Bintanja R, van de Wal RSW, Oerlemans J. Modelled atmospheric temperatures and global sea levels over the past million years. Nature. 2005;437(7055):125–8. https://doi.org/10.1038/nature03975.
Lambeck K, Chappell J. Sea level change through the last glacial cycle. Science. 2001;292(5517):679–86. https://doi.org/10.1126/science.1059549.
Ling H, Waterworth DM, Stirnadel HA, Pollin TI, Barter PJ, Kesäniemi YA, et al. Genome-wide linkage and association analyses to identify genes influencing adiponectin levels: the GEMS study. Obesity. 2009;17(4):737–44. https://doi.org/10.1038/oby.2008.625.
Zhou J, Qiu L, Jiang S, Zhou F, Huang J, Yang L, et al. Molecular cloning and mRNA expression of M-phase phosphoprotein 6 gene in black tiger shrimp (Penaeus monodon). Mol Biol Rep. 2013;40(2):1301–6. https://doi.org/10.1007/s11033-012-2173-z.
Keaton JM, Gao C, Guan M, Hellwege JN, Palmer ND, Pankow JS, et al. Genome-wide interaction with the insulin secretion locus MTNR1B reveals CMIP as a novel type 2 diabetes susceptibility gene in African Americans. Genet Epidemiol. 2018;42(6):559–70. https://doi.org/10.1002/gepi.22126.
Sanchez-Pulido L, Ponting CP. TMEM132: an ancient architecture of cohesin and immunoglobulin domains define a new family of neural adhesion molecules. Bioinformatics. 2018;34(5):721–4. https://doi.org/10.1093/bioinformatics/btx689.
Wen X, Tarailo-Graovac M, Brand-Arzamendi K, Willems A, Rakic B, Huijben K, et al. Sialic acid catabolism by N-acetylneuraminate pyruvate lyase is essential for muscle function. JCI Insight. 2018;3(24):122373. https://doi.org/10.1172/jci.insight.122373.
Ma D, Lian F, Wang X. PLCG2 promotes hepatocyte proliferation in vitro via NF-κB and ERK pathway by targeting bcl2, myc and ccnd1. Artif Cells Nanomed Biotechnol. 2019;47(1):3786–92. https://doi.org/10.1080/21691401.2019.1669616.
Guo X, He X, Chen H, Wang Z, Li H, Wang J, et al. Revisiting the evolutionary history of domestic and wild ducks based on genomic analyses. Zool Res. 2020;42(1):43–50. https://doi.org/10.24272/j.issn.2095-8137.2020.133.
Melvill DS. Apparent hybrid mallard X spot-billed ducks, Hong Kong bird report 1997, Hong Kong Bird Watching Soc; 1999.
Shibaev YV, Glushenko YN. Several examples of unusual bird behavior in Primorye region. Plants and animals of the Russian Far East. Ussuriysk: Ussury State Teacher’s College Press; 2001.
Wang L, Guo J, Xi Y, Ma S, Li Y, He H, et al. Understanding the genetic domestication history of the Jianchang duck by genotyping and sequencing of genomic genes under selection. G3-Genes Genom Genet. 2020;10(5):1469–75. https://doi.org/10.1534/g3.119.400893.
Rohling EJ, Grant K, Hemleben C, Siddall M, Hoogakker BAA, Bolshaw M, et al. High rates of sea-level rise during the last interglacial period. Nat Geosci. 2007;1(1):38–42. https://doi.org/10.1038/ngeo.2007.28.
Maddux BA, Goldfine ID. Membrane glycoprotein PC-1 inhibition of insulin receptor function occurs via direct interaction with the receptor alpha-subunit. Diabetes. 2000;49(1):13–9. https://doi.org/10.2337/diabetes.49.1.13.
Tsai SH, Kinoshita M, Kusu T, Kayama H, Okumura R, Ikeda K, et al. The ectoenzyme E-NPP3 negatively regulates ATP-dependent chronic allergic responses by basophils and mast cells. Immunity. 2015;42(2):279–93. https://doi.org/10.1016/j.immuni.2015.01.015.
Mangmool S, Denkaew T, Phosri S, Pinthong D, Parichatikanond W, Shimauchi T, et al. Sustained βAR stimulation mediates cardiac insulin resistance in a PKA-dependent manner. Mol Endocrinol. 2016;30(1):118–32. https://doi.org/10.1210/me.2015-1201.
Pasini L, Lanfrancone L. SHC4 (SHC (Src homology 2 domain containing) family, member 4). Atlas Genet Cytogenet Oncol Haematol. 2011;14(8):732–4. https://doi.org/10.4267/2042/44817.
Siddle K. Signalling by insulin and IGF receptors: supporting acts and new players. J Mol Endocrinol. 2011;47(1):1–10. https://doi.org/10.1530/JME-11-0022.
Mangmool S, Denkaew T, Parichatikanond W, Kurose H. β-Adrenergic receptor and insulin resistance in the heart. Biomol Ther. 2017;25(1):44–56. https://doi.org/10.4062/biomolther.2016.128.
Amoasii L, Sanchez-Ortiz E, Fujikawa T, Elmquist JK, Bassel-Duby R, Olson EN. NURR1 activation in skeletal muscle controls systemic energy homeostasis. Proc Natl Acad Sci. 2019;116(23):11299–308. https://doi.org/10.1073/pnas.1902490116.
Kelley DE, Goodpaster BH, Storlien L. Muscle triglyceride and insulin resistance. Annu Rev Nutr. 2002;22(1):325–46. https://doi.org/10.1146/annurev.nutr.22.010402.102912.
Xiong Y, Fan L, Hao Y, Cheng Y, Chang Y, Wang J, et al. Physiological and genetic convergence supports hypoxia resistance in high-altitude songbirds. PLoS Genet. 2020;16(12):e1009270. https://doi.org/10.1371/journal.pgen.1009270.
He J, Lu L, Tian Y, Tao Z, Wang D, Li J, et al. Analysis of intramuscular fat and fatty acids of different duck breeds and their association with SNPs of duck A-FABP gene. Can J Anim Sci. 2011;91(4):593–6. https://doi.org/10.4141/CJAS2011-032.
Chen C, Liu Z, Pan Q, Chen X, Wang H, Guo H, et al. Genomic analyses reveal demographic history and temperate adaptation of the newly discovered honey bee subspecies Apis mellifera sinisxinyuan n. ssp. Mol Biol Evol. 2016;33(5):1337–48. https://doi.org/10.1093/molbev/msw017.
Huang Y, Li Y, Burt DW, Chen H, Zhang Y, Qian W, et al. The duck genome and transcriptome provide insight into an avian influenza virus reservoir species. Nat Genet. 2013;45(7):776–83. https://doi.org/10.1038/ng.2657.
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–60. https://doi.org/10.1093/bioinformatics/btp324.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25(16):2078–9. https://doi.org/10.1093/bioinformatics/btp352.
Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164. https://doi.org/10.1093/nar/gkq603.
Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011;88(1):76–82. https://doi.org/10.1016/j.ajhg.2010.11.011.
Tang H, Quertermous T, Rodriguez B, Kardia SLR, Zhu X, Brown A, et al. Genetic structure, self-identified race/ethnicity, and confounding in case-control association studies. Am J Hum Genet. 2005;76(2):268–75. https://doi.org/10.1086/427888.
Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21(2):263–5. https://doi.org/10.1093/bioinformatics/bth457.
Acknowledgements
We thank Ningbo Aoji Agricultural Science and Technology Co. LTD, Zhejiang Zhuoan Poultry Co. LTD, and Guowei Poultry Industry Co. LTD for their great help in sampling.
Funding
This work was supported by the China Agriculture Research System of MOF and MARA and the Natural Science Foundation of China (31702106). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author information
Contributions
PF analyzed the main content of the data and performed the experiments with the help of HY, PW and LC. TZ collected samples with the help of JD, JS, ZT and GC. The whole work was guided by LY and LL. All authors read and approved the final manuscript.
Ethics declarations
Ethics approval and consent to participate
All the experimental procedures with ducks used in this study were reviewed and approved by the Animal Care and Use Committee of the Zhejiang University of Technology and Zhejiang Academy of Agricultural Sciences (reference number: 20200622075), and performed in accordance with the Regulations for the Management of Affairs Concerning Experimental Animals (Ministry of Science and Technology, China, revised in June 2004). All efforts were made to minimize animal suffering.
Consent for publication
Not applicable.
Competing interests
We certify that there is no conflict of interest with any financial organization regarding the material discussed in the manuscript.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1:
- Figure S1. The number of unique and shared SNPs among six duck groups shown by a Venn diagram.
- Figure S2. Principal component analysis (PCA).
- Figure S3. Demographic history of ducks.
- Figure S4. Distribution of KEGG pathways of domestic ducks, shown as a bar chart.
- Figure S5. Gene ontology (GO) classification for positively selected genes in domestication.
- Figure S6. The number of differentially expressed genes in muscle, liver and cerebellum.
- Figure S7. Functional classification of Gene Ontology terms of differentially expressed genes in muscle.
- Figure S8. Functional classification of Gene Ontology terms of differentially expressed genes in liver.
- Figure S9. Functional classification of Gene Ontology terms of differentially expressed genes in cerebellum.
- Figure S10. Venn diagram of selected genes (blue) and significantly expressed genes in muscle (yellow), liver (red) and cerebellum (green) of Shaoxing ducks.
- Table S1. Breeds included in the study and phenotypic description.
- Table S2. Statistics of genomic sequencing of six duck populations.
- Table S3. Summary of mapping and coverage of six duck populations.
- Table S4. Summary of SNPs of six duck populations included in the analyses.
- Table S5. Summary of the functional annotation statistics of SNPs in ducks by ANNOVAR.
- Table S6. θπ and θW for six duck populations.
- Table S7. Summary statistics for genomic nucleotide diversity in different species.
- Table S8. List of CDRs with top 5% highest FST values and log2 (θπ ratio) in domestic ducks.
- Table S9. The KEGG pathways of the loci under selection in domestic ducks (top 20).
- Table S10. The GO classification of the loci under selection in domestic ducks.
- Table S11. Summary of sequence mapping of three tissues in Shaoxing ducks and mallards.
- Table S12. KEGG pathways of differentially expressed genes in muscle of Shaoxing ducks.
- Table S13. The down-regulated genes in muscle of Shaoxing ducks (top 20).
- Table S14. The up-regulated genes in muscle of Shaoxing ducks (top 20).
- Table S15. The down-regulated genes in liver of Shaoxing ducks (top 20).
- Table S16. The up-regulated genes in liver of Shaoxing ducks (top 20).
- Table S17. The down-regulated genes in liver of Shaoxing ducks.
- Table S18. The up-regulated genes in liver of Shaoxing ducks.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Feng, P., Zeng, T., Yang, H. et al. Whole-genome resequencing provides insights into the population structure and domestication signatures of ducks in eastern China. BMC Genomics 22, 401 (2021). https://doi.org/10.1186/s12864-021-07710-2
DOI: https://doi.org/10.1186/s12864-021-07710-2