Deleted copy number variation of Hanwoo and Holstein using next generation sequencing at the population level
- Dong-Hyun Shin†1,
- Hyun-Jeong Lee†2,
- Seoae Cho4,
- Hyeon Jeong Kim4,
- Jae Yeon Hwang1,
- Chang-Kyu Lee1,
- JinYoung Jeong2,
- Duhak Yoon3 and
- Heebal Kim1, 4
© Shin et al.; licensee BioMed Central Ltd. 2014
Received: 13 October 2013
Accepted: 3 March 2014
Published: 27 March 2014
Copy number variation (CNV), a source of genetic diversity in mammals, has been shown to underlie biological functions related to production traits. Notwithstanding, there have been few studies conducted on CNVs using next generation sequencing at the population level.
Illumina NGS data were obtained for ten Holsteins, a dairy breed, and 22 Hanwoo, a beef breed. Sequence coverage across the 32 animals ranged from 13.58-fold to almost 20-fold. We detected a total of 6,811 deleted CNVs across the analyzed individuals (average length = 2732.2 bp), corresponding to 0.74% of the cattle genome (18.6 Mbp of variable sequence). By examining the overlap between CNV deletion regions and genes, we selected 30 genes with the highest deletion scores. These genes were found to be related to the nervous system, more specifically to nervous transmission, neuron motion, and neurogenesis. We regarded these genes as having been affected by the domestication process. Further analysis of the CNV genotyping information revealed 94 putative selected CNVs and 954 breed-specific CNVs.
This study provides useful information for assessing the impact of CNVs on cattle traits using NGS at the population level.
Keywords: Cattle, Hanwoo, Holstein, Copy number variation, NGS, Domestication, Selection signal
Since the completion of the bovine genome assembly [1–3], a large number of genetic variants, particularly single-nucleotide polymorphisms (SNPs), have become widely known, and commercial SNP panels have been developed for cattle. The discovery of SNPs in diverse cattle breeds has been further expanded [5, 6] by the recent availability of massively parallel sequencing technologies, called next-generation sequencing (NGS). SNPs and commercial SNP marker panels have been successfully used to identify genomic regions that potentially underlie the economic traits of cattle [7–9]. Another source of genetic variation in mammals comes from gains and losses of genomic sequence, the structural variants known as copy number variations (CNVs), that occur in more than two individuals. While SNPs are more frequently used in cattle breeding than CNVs, CNVs occupy a higher percentage of genomic sequence than SNPs.
Many studies have endeavored to understand CNVs in mammals, especially in humans [10–13] and rodents [14–17]. In particular, several CNVs have been shown to be important in both normal phenotypic variability and disease susceptibility in humans [18–22]. CNVs may have a potentially greater effect on phenotype, for example by changing gene structure and dosage, altering gene regulation, and exposing recessive alleles. These points are drawing attention to CNVs as structural variation that can account for diverse economically important traits in domestic animals. Deletions, the CNV type that is the focus of this study, are one of the five CNV types and, together with duplications, one of the two main classes. A previous study of cattle using next generation sequencing (NGS) reported that CNVs play a crucial role in diverse biological functions, such as pathogen and parasite resistance, lipid transport and metabolism, and breed-specific differences in adaptation, health, and production traits.
The focus of CNV studies has also extended to other domesticated animals, including dog, goat, cattle, pig, and sheep [24–33]. Considering the heritability of CNVs and their higher mutation rates, CNVs may be associated with, or may affect, animal health and production traits under recent selection. In cattle, partial deletion of the bovine gene ED1 causes anhidrotic ectodermal dysplasia. Bos taurus indicus can adapt to warm climates and is more resistant to tick infestation than Bos taurus taurus breeds. Likewise, beef and dairy cattle breeds display distinct patterns in selected metabolic pathways related to muscling, marbling, and milk composition traits. It is possible that CNVs are associated with these agriculturally important traits.
Until now, CNV screens were routinely performed by comparative genomic hybridization (CGH) and SNP arrays, and many studies have extensively reviewed their performances [36–39]. However, these methods, which are often affected by low probe density and cross-hybridization of repetitive sequence, were not able to detect CNVs at the whole genome level. A limited number of investigations in cattle CNV has been performed to detect CNVs using methods that include high-density aCGH and the 50 K SNP panel [25–27]. The recent advances of NGS and complementary analysis programs have provided better approaches to systematically identify CNVs at a deep genome-wide level than the currently available commercial SNP chip and aCGH methodologies [6, 40]. These sequence-based approaches, which are becoming more popular due to the ongoing developments and cost decreases in NGS, allow for CNV reconstruction at a higher effective resolution and sensitivity.
In this study, we attempted to detect genome-wide CNVs at the population level based on NGS data of 32 cattle. With UMD3.1 as the reference genome, we used Genome STRiP to detect CNVs at the population level in Hanwoo (22 individuals), a Korean beef breed, and Holstein (10 individuals), a dairy breed. We discovered 18.6 Mbp of deleted sequence relative to the reference genome. However, using Genome STRiP, we could only extract deleted CNVs from the population data. This study confirmed that CNVs are common, associated with deleted regions, and often occur in gene-rich regions in cattle. We analyzed genes related to CNVs using a deletion score in order to explore their potential function and contribution to domestication. In addition, we investigated selected CNVs using FST and breed-specific CNVs for traits related to beef and milk production (Additional file 1). By providing several types of information on cattle CNVs at the population level and presenting deleted CNV maps with breed-specific CNVs, we provide the basis for further studies into the role of deleted CNVs in the cattle genome.
Results and discussion
QTL related to CNV regions were identified using the Animal QTL database. We found that 2,220 out of 3,605 (61.58%) cattle QTL overlapped with 6,623 putative deleted CNVs. The index used to measure deletion density, the average distance between deletions, showed large variation (minimum: 1069.86 bp; maximum: 3,728,838 bp; median: 20693.06 bp; mean: 31433.17 bp). The top 30 QTL overlapping with CNVs are listed in Additional file 6. CNV deletion scores of the top 30 QTL were also highly variable (between 50 and 142). Six of the top QTL were directly related to meat production, while eight were associated with milk production. We also propose genes that overlap with the top 30 QTL (Additional file 6), which are mainly related to sensory perception, such as olfactory receptors.
In this study, we used 32 individuals of two cattle breeds, Hanwoo and Holstein, to detect CNVs at the population level. Hanwoo, Bos taurus coreanae, is a breed of cattle raised in Korea, which may be a hybrid of Bos taurus and Bos indicus. Hanwoo migrated and settled in the Korean Peninsula around 5,000 BC. It has been used both as a draft animal and as a source of meat, but over the past 40 years its main role has changed to beef production. Since the first official genetic breeding program for Hanwoo was started by the Korean government in 1979, the productivity of Hanwoo has improved substantially. In contrast, Holstein is a breed that has been strongly selected for milk production and currently has the highest dairy production. Holstein genetic resources are shared throughout the world through trade in semen and seed bulls. We used 22 Hanwoo and 10 Holstein for NGS CNV detection. Holstein individuals were selected using common global criteria, while Hanwoo were selected from two different regions to capture the complete genetic picture of the breed. The genetic difference between the Hanwoo individuals of the two populations was found to be small, so the 22 individuals were regarded as a single population in this study.
We showed that genes with higher deletion scores are more likely to be under genetic drift. Using the CNV deletion scores of the 32 individuals, we wanted to find out which genes have been affected by cattle domestication. Humans have applied strong selective pressure on each cattle breed through elaborate breeding strategies to form breeds that can provide products such as milk and meat. Animal breeding by humans has taken place over a short period, and cattle populations are usually produced by artificial insemination using a small number of seed bulls and many cows, both to maintain product quality and to manage bloodlines. From a genetics point of view, breeding can be regarded as a genetic diversity reduction event, much like a population bottleneck. We predicted in this study that deletion regions with beneficial adaptations might have arisen after this genetic diversity reduction event. The loss of variation leaves a surviving population that is favorable with regard to the selective pressures put on it, such as the production of milk or meat. The breeding strategies of each cattle breed share a common domestication process, so we wanted to capture the genic regions affected by general cattle domestication using deleted CNVs. Based on this assumption, we used deletion scores to select 33 significant Ensembl genes that were strongly affected by deleted CNVs. In the absence of additional information, we regarded the genes with higher deletion scores as being under neutral or diversifying selection.
We also wanted to discover QTL related to the deleted CNVs. Because QTL contain information about the regions related to each economic phenotype of cattle, we used them to suggest novel genetic regions, beyond genic regions, that are affected by deleted CNVs. We therefore used the QTL information in the Animal QTL database to detect wider regions of the genome that are affected by deleted CNVs and contain meaningful information. However, QTL mapping is a step prior to gene definition, and QTL regions are only roughly defined based on phenotype information. As the variance of QTL length was very large and longer QTL tended to have higher deletion scores (Additional file 17), we could not determine whether the high deletion scores of QTL were due to many actual deletion regions or simply to the length of the QTL. Therefore, we could not use the deletion scores of QTL to discover QTL affected by deleted CNVs. To overcome this problem, we used a new measurement, the average distance between deletions (QTL length/deletion score). However, the average distance between deletions in QTL was still very variable. It was not possible to create a proper distribution of the average distance between deletions in QTL because there were so many QTL regions relative to the total number of CNVs. The empirical p-values of QTL with very short average distances between deletions did not reach the commonly used significance criteria. However, considering previous studies that discovered important QTL regions overlapping with SNPs in GWAS or selective sweep studies, we supposed that QTL with very short average distances between deletions are likely to be meaningful. We therefore proposed the top 30 QTL selected by the average distance between deletions and regarded them as QTL affected by deleted CNVs.
We had guessed that QTL types related to CNV in this study would be highly variable, because QTL are roughly defined based on phenotype information and meat and milk traits are complex traits. As expected, the top 30 identified QTL from the QTL analysis had diverse traits. Additionally, as QTL is a region related to economic traits, we focused on the relationship between the region and their traits. We predicted that the gene content of QTL affected by deleted CNV was very important and that this information would supplement information on the genes selected by deletion score.
The domestication and subsequent selection by humans to create breeds have had an impact on the variation within the cattle genome. Strong selection for breed characteristics or productivity has created regions that have lost variation due to the fixation of advantageous mutations, known as selective sweep regions. We identified selective sweep regions in the cattle genome, but no study has yet explored these regions using CNVs. In this study, FST based on the CNV frequency spectra was used to identify and characterize regions of the cattle genome under selective sweep. Additionally, as mentioned earlier, the deletion score was used to estimate the genes affected by deleted CNVs in cattle domestication by considering the number of CNVs in each gene and the frequency of each CNV within the population. The selective sweep signal based on FST of deleted CNVs was used to estimate how each deleted CNV affects the trait difference between Holstein (for milk) and Hanwoo (for meat). Between the two examined cattle breeds, 94 putative sweep regions were identified. We assumed that economic traits, including beef and milk production, have historically been under strong selection. Based on this assumption, we wanted to explore CNVs under selective sweep for economic traits. The results were then used as the foundation for the selective sweep section of this study. The most significant deleted CNV (BovineCNV0531) was within the titin gene (TTN). Takahisa Yamada et al. (2009) reported, through a SNP association study in Japanese Black beef cattle, that TTN is involved in myofibrillogenesis. TTN was reported as a positional functional candidate gene responsible for marbling in beef. A comparison of the Japanese Black breed with the Holstein and Brown Swiss breeds showed that a SNP in TTN has been under strong selection pressure for high marbling. Therefore, even though the deletion was in an intron, we predict that BovineCNV0531 has been strongly affected by selection during breed formation.
Recently, NGS data have been used to discover breed-specific SNPs of domesticated animals. In a previous study on pigs, breed-specific SNPs were selected from NGS resequencing data and then validated using SNP chip data from many individuals before being applied in an assignment test. CNVs, however, are difficult to validate, because the CNVs in this study are structural variations at the population level that depend on the nature of the population, and there are no back-up data such as SNP chip data. Nevertheless, STRUCTURE analysis using the 6,811 CNVs could classify individuals into the two breeds, Hanwoo and Holstein (Additional file 18). Therefore, we wanted to know which CNVs were breed-specific and to understand the biological meaning of these breed-specific CNVs. We selected CNVs that belonged to only one breed and regarded them as breed-specific CNV candidates. In place of validation using back-up data, we then retained only CNVs with a frequency higher than 0.1 in each breed to minimize false positive breed-specific CNV calls.
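The two-step selection described above (present in only one breed, then frequency above 0.1 in that breed) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the genotype encoding (0, 1 or 2 deleted copies per animal), the reading of "frequency" as deletion allele frequency, and the names `deletion_frequency` and `breed_specific` are all assumptions, and the toy genotypes are invented.

```python
from typing import Dict, List, Optional

# Assumed encoding: number of deleted copies carried by each animal (0, 1 or 2).
Genotypes = Dict[str, List[int]]  # breed name -> per-animal deleted-copy counts


def deletion_frequency(copies: List[int]) -> float:
    """Deleted copies divided by chromosomes sampled (2 per animal)."""
    return sum(copies) / (2 * len(copies)) if copies else 0.0


def breed_specific(cnv: Genotypes, min_freq: float = 0.1) -> Optional[str]:
    """Return the breed if the deletion is seen in exactly one breed and its
    frequency there exceeds min_freq; otherwise return None."""
    carriers = [breed for breed, copies in cnv.items() if any(copies)]
    if len(carriers) != 1:
        return None  # absent everywhere, or shared between breeds
    breed = carriers[0]
    return breed if deletion_frequency(cnv[breed]) > min_freq else None


# Toy genotypes (not from the study): 22 Hanwoo and 10 Holstein animals.
toy_cnvs = {
    "cnvA": {"Hanwoo": [2] * 5 + [1] * 4 + [0] * 13, "Holstein": [0] * 10},
    "cnvB": {"Hanwoo": [1] + [0] * 21, "Holstein": [0] * 10},           # too rare
    "cnvC": {"Hanwoo": [1] * 6 + [0] * 16, "Holstein": [1] + [0] * 9},  # shared
}
calls = {name: breed_specific(g) for name, g in toy_cnvs.items()}
print(calls)
```

Only the first toy CNV is kept: the second falls below the frequency threshold, and the third is carried by both breeds.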
If deletions occur within coding regions of the listed genes, the missing functional domains of the proteins translated from those genes may be inferred. We were careful in making such inferences, as we did not have the phenotype information needed to conduct an association study of the relationship between genetic variants and traits. Therefore, we could not perform additional analyses or experiments to directly investigate the biological phenomena affected by deleted CNVs. Despite these limitations, we could identify some key points regarding the missing functional domains of proteins translated from genes largely affected by deleted CNVs.
First, as we only identified deleted CNVs, the results cover only a portion of the genes involved in cattle domestication. In Gene Ontology analysis using the top deletion score genes (23), many genes related to the nervous system, more specifically to nervous transmission, neuron motion, and neurogenesis, were identified (Figure 2). The 23 genes identified as being related to the nervous system may have played a role in the behavioral changes that occurred in cattle due to domestication. During domestication, humans selected for docility in cattle, leading to the loss of cattle’s wild nature. Although these genes do not directly code for behavior, they may encode molecular products that govern the functioning of the brain, which in turn controls character and behavior. A previous study reported that such variations in behavior shape the evolution of genomic elements that influence social behavior through the feedback of selection. The number of genes with a top 1% deletion score was 33, and the number of CNVs overlapping with these genes was 135. After comparing the CNV regions with the exon region information of the 33 genes, only one of the 135 overlapping CNVs (BovineCNV3796, chr2: 44486266–44830807) was in an exon region. BovineCNV3796 overlapped with 4 exon regions (ENSBTAE00000348420, ENSBTAE00000348416, ENSBTAE00000092579, ENSBTAE00000246220). The remaining 134 CNVs were in intron regions of the 33 genes with a top 1% deletion score. We assumed that the extra structure was needed to produce diverse genes during the evolutionary process and that CNVs affecting these processes remain as an evolutionary trace. Thus, through the 33 genes with the highest number of CNV deletions, we can observe evolutionary evidence of changes in important cattle character and behavior during domestication, possibly through missing functional domains of proteins translated from genes affected by deleted CNVs.
Additionally, we found that 16 protein-coding genes overlapped with the top 30 QTL identified using the average distance between deletions. These QTL were largely affected by deleted CNVs. Out of the 16 genes, four (OR10A7, OR2J3, OR6C75, OR6C76) were related to sensory perception as olfactory receptors (ORs). Studies of evolutionary changes in the number of ORs in other mammalian species reported that the cow has fewer genes in specific OR gene clusters. In addition, a Holstein CNV study reported many CNV losses in several OR genes. These studies support our finding that several top QTL overlapped with OR genes. Based on this result, we inferred that OR genes have been affected by the domestication process. Moreover, the rearing time for cattle was longer than that of other domesticated mammals, and we supposed that such differences among domesticated animals could remain in the OR genes. A previous study of cattle olfactory receptor genes reported significant variation in the genetic component of olfactory receptor systems among artiodactyl species, indicating that the selection pressure for maintaining the integrity of olfactory receptor genes was lower in cattle than in pigs. These results suggest that some CNVs in the selected QTL reflect the evolutionary process during domestication through missing functional domains of proteins translated from genes affected by deleted CNVs.
In the selective sweep analysis based on FST of deleted CNVs, 14 protein-coding genes overlapped with CNVs showing a strong selective sweep signal. Of these, seven genes (TTN, MATN3, DST, HDAC4, TSHR, CCDC141, and GALK2) have been reported as being related to meat or milk production [63–68, 101]. According to the genotype information for these seven CNVs, five had a higher deletion frequency in Hanwoo than in Holstein (TTN, HDAC4, TSHR, CCDC141, GALK2) and two (MATN3, DST) had a higher deletion frequency in Holstein. In particular, TTN, which encodes an abundant protein of striated muscle, is well known as the gene carrying a marbling-related SNP in Japanese Black beef cattle. We predict that the marbling SNP may have a negative effect on the TTN-mediated muscle production mechanism in favor of intramuscular fat. In this study, all Hanwoo had double deletions of the CNV related to the TTN gene (Additional file 8). Although CNV extraction in Hanwoo and FST calculation based on deleted CNV genotype information are trial procedures and not widely used methods, the result matched our expectation. Three (HDAC4, TSHR, CCDC141) of the other four genes with CNVs that were mainly deleted in Hanwoo were strongly related to muscle (Additional file 9). The last gene, GALK2, has been shown to be up-regulated during secretory activation at the initiation of milk production. In this study, 13 individuals had double deletions and 9 individuals had single deletions in Hanwoo, but in Holstein only 3 had single deletions and the remaining 7 individuals had no deletions. Among the CNVs that were mainly deleted in Holstein, MATN3 (BovineCNV3277) is related to genetic risk factors for osteoarthritis, which is related to dairy production. Nine of the 10 Holstein individuals in this study had more than one deletion, but none of the Hanwoo individuals had deletions.
These findings suggest that these CNVs have contributed to breed differentiation, perhaps through missing functional domains of proteins translated from genes affected by deleted CNVs.
In the case of breed-specific CNVs, we selected CNVs present in only one breed, so there were more breed-specific CNVs than CNVs found for breed differentiation, and it was difficult to discover the biological meaning behind them. In Hanwoo, GO analysis of genes overlapping with Hanwoo-specific CNVs revealed two clusters. Cluster 1 contains 29 genes and cluster 2 contains 11 genes, which are related to neuromuscular process and sensory perception, respectively (Figure 4). These terms are similar to those of the genes and QTL strongly affected by deleted CNVs. Therefore, we suggest that Hanwoo-specific CNVs reflect the evolutionary process that occurred during domestication. Additionally, in the case of beef cattle such as Hanwoo, humans have limited the space allowed for the cows in order to induce better marbling of the meat. Based on these facts, we predict that less sensitive individuals may have had an advantage over sensitive individuals in enduring this breeding environment in captivity. We therefore supposed that, due to this breeding history, genes related to sensory perception and response to oxygen had many deletions. In Holstein, two genes (NELL2, C6ORF10) related to Holstein-specific CNVs were down-regulated in milk production, and one gene (MATN3), related to dairy production, was reported to be a genetic risk factor for osteoarthritis (Additional file 14) [64, 88, 94]. MATN3 was also selected in the analysis of selective sweep signals based on FST of deleted CNVs. We predict that Holstein-specific deleted CNVs may control some biological processes that give rise to negative effects in dairy cattle. These results support the hypothesis that CNVs contribute to breed establishment through missing functional domains of proteins translated from genes affected by deleted CNVs.
Almost all of the CNV regions examined by PCR in the validation experiment were similar to the CNV regions from Genome STRiP (< 200 bp). However, three CNV regions, BovineCNV3797, BovineCNV3226 and BovineCNV0050, were not fully validated by PCR assays across both breeds and all surveyed individuals. In BovineCNV3797 and BovineCNV3226, the deletion alleles were not successfully amplified (case 3 in Additional file 19). This is probably because the extracted CNV regions extended over the primer locations (Additional file 15). Although the deleted allele was not confirmed, the wild-type allele was well defined in this case, and in the case of BovineCNV3226, individuals considered to have only a deletion allele did not produce any amplicons. Interestingly, the opposite case, case 4 in Additional file 19, was also present. PCR amplification detected only the BovineCNV0050-deleted allele (Additional file 15). We therefore carried out PCR again using primer pairs amplifying the CNV and its outside region to confirm the presence of the CNV-containing allele. However, this did not work and no non-deleted allele was amplified (data not shown). The BovineCNV0050 region contains undefined gap sequence, which could be the reason for the failed amplification. As in case 3, the results showing deleted alleles were well defined. When we calculated the CNV accuracy, these two cases were scored lower than cases 1 and 2 (0.7 vs. 1.0). The CNV accuracy examined in this study was about 80% (Figure 6).
Our study presents a description of deleted CNVs in cattle based on NGS data of 32 individuals from two breeds. A total of 6,811 deleted CNVs were identified in 22 Hanwoo and 10 Holstein individuals. We selected the top 33 genes that had high deletion scores and regarded them as being significantly involved in the domestication process. Their genetic functions were related to the nervous system, in particular nervous transmission, neuron motion and neurogenesis. The relationship between these 33 genes and the nervous system may be associated with the changes in behavior due to domestication. The top 30 QTL based on deleted CNVs were associated with diverse quantitative traits, including meat and milk production. The genes within the top QTL included olfactory receptor genes, which have been reported to be under lower selection pressure in cattle. We also discovered selection signals in 94 CNVs based on FST values. The top CNVs under selection included one in the TTN gene, which has a SNP strongly associated with myofibrillogenesis for marbling in Japanese Black beef cattle. In total, we detected 954 breed-specific CNVs, of which 767 were Hanwoo-specific and related to several biological processes, including phosphorylation, protein modification process, cell adhesion and maintenance, neuromuscular process, sensory perception, and response to oxygen. The other 187 CNVs were Holstein-specific and related to dairy production. Additionally, to confirm the CNV genotypes within some putative genes with an impact on the domestication of cattle, we performed PCR assays. The validation experiment showed that the CNV accuracy of this study is about 80%.
This study provides information on deleted CNVs across the cattle genome at the population level and suggests their possible roles in both domestication and recent breed selection. This study using deleted CNV at the population level is a trial step towards exploring the underlying genetics of economically important traits in cattle and understanding the genetic changes that occurred during domestication. However, further research into the genes related to CNVs and a comprehensive study on inserted CNVs is needed to form a more complete picture of the genetic structure variation in the bovine genome. Additionally, when the associations between CNV and economic traits in cows are identified, it will be possible to incorporate them into breeding programs for production enhancement in cattle.
All experimental procedures on animals in this study were performed in strict accordance with good animal practice as defined by the relevant national and/or local welfare bodies. In addition, all animal experiments were approved by the Institutional Animal Care and Use Committee of the National Institute of Animal Science (No. 2012-C-005, CNU-00300).
DNA sampling & resequencing process
Based on the breed history and breed-specific information, we obtained 22 Hanwoo and ten Holsteins for whole-genome resequencing. Individuals were selected as representatives of their breeds. Of the 22 Hanwoo, 11 individuals were from the Hanwoo Experiment Station, National Institute of Animal Science, Rural Development Administration, Korea, and the other 11 individuals were from Kyungpook National University, Korea. The ten Holsteins were obtained from the National Institute of Animal Science, Rural Development Administration, Korea. Blood was collected from each animal and treated with heparin to prevent clotting. Manufacturers’ instructions were followed to create a paired-end library. Paired-end sequence data were generated using HiSeq 2000 (Illumina, Inc). Paired-end sequence reads were mapped to the reference cattle genome UMD 3.1 with an aligner based on the Burrows-Wheeler transform and the FM-index (Bowtie2; version 2.1.0) using default settings. Three open-source packages were used for downstream processing and variant calling: Picard Tools, SAMtools, and the Genome Analysis Toolkit (GATK) (Additional file 20). All calls with a Phred-scaled quality of less than 20 were filtered out. The origin, features, and general sequencing information of the individual animals are summarized in Additional file 2.
Copy number variations extraction
The re-sequencing data of the 32 cows were aligned and CNVs were extracted from the combined dataset. The CNV extraction tool Genome STRucture in Population (Genome STRiP) was used to retrieve deletion calls of CNVs at the population level. Each CNV was genotyped, and the genotype quality was estimated based on genotype likelihoods. To ensure that only highly plausible variants were retained, we selected CNVs that passed all genotype quality thresholds in Genome STRiP. Genome STRiP has four filtering criteria for defining deleted CNVs. The definitions and default values of the four criteria are as follows: COHERENCE (incoherence metric > 0.01); COVERAGE (median normalized read depth of samples with observed evidentiary pairs < 1.0; this filter was used to remove calls in regions of unusually high sequence coverage across many samples); DEPTH (depth ratio ≤ 0.63, or depth ratio ≤ 0.8 and heterogeneity P value < 0.01); and DEPTHPVAL (depth p-value using a chi-squared test < 0.01). For Genome STRiP to define a CNV, the CNV must pass all four criteria. The number of CNVs decreased from 44,388 to 9,732 after applying these filtering criteria. After this step, we applied a secondary criterion to check individual quality for each CNV. In this filtering process, Genome STRiP used a genotype likelihood test. If all individuals did not pass this filter, we could not obtain the CNV genotype information. After removing the low quality CNVs, 6,811 deleted CNVs remained. We regarded these 6,811 CNVs as the cattle CNVs in this study for additional analyses (Additional file 21).
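As a rough illustration of how the four site-level criteria combine, the sketch below applies them to hypothetical call records. It reads the listed thresholds as the conditions a call must satisfy to be retained, and groups the DEPTH rule as "ratio ≤ 0.63, or (ratio ≤ 0.8 and heterogeneity P < 0.01)"; both readings are assumptions. The `DeletionCall` record and its field names are invented for illustration and are not Genome STRiP's own data model or API.

```python
from dataclasses import dataclass


@dataclass
class DeletionCall:
    """Hypothetical per-site summary; field names are illustrative only."""
    incoherence: float        # COHERENCE metric
    coverage: float           # median normalized read depth of evidentiary samples
    depth_ratio: float        # DEPTH ratio
    heterogeneity_p: float    # heterogeneity P value used by the DEPTH rule
    depth_p: float            # DEPTHPVAL chi-squared p-value


def passes_site_filters(call: DeletionCall) -> bool:
    """True only if the call satisfies all four criteria as stated in the text."""
    coherence_ok = call.incoherence > 0.01
    coverage_ok = call.coverage < 1.0
    depth_ok = call.depth_ratio <= 0.63 or (
        call.depth_ratio <= 0.8 and call.heterogeneity_p < 0.01
    )
    depth_pval_ok = call.depth_p < 0.01
    return coherence_ok and coverage_ok and depth_ok and depth_pval_ok


# Invented example calls, not data from the study.
candidates = [
    DeletionCall(0.20, 0.50, 0.55, 0.500, 0.001),  # passes every criterion
    DeletionCall(0.20, 0.50, 0.75, 0.001, 0.001),  # passes via the second DEPTH branch
    DeletionCall(0.20, 0.50, 0.75, 0.200, 0.001),  # fails DEPTH
    DeletionCall(0.20, 1.40, 0.55, 0.500, 0.001),  # fails COVERAGE
]
kept = [c for c in candidates if passes_site_filters(c)]
print(len(kept), "of", len(candidates), "calls retained")
```

In the study this site-level step reduced 44,388 candidate calls to 9,732, before the per-individual genotype likelihood filter was applied.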
Gene content of deleted copy number variations
Deletion score = \sum_{i=1}^{l} #Deletion_i
#Deletion_i = deletion number in 32 individuals for the i-th CNV in the gene (range 0 to 32)
l = number of CNVs in each gene
We assigned a deletion score to each gene that overlapped with CNVs (Additional file 22). To discover significant genes that overlap with CNVs that may be affected by the deleted CNV, we calculated empirical p-values for each CNV overlapping gene. We assumed that the distribution of total deletion score values of the 1,228 CNV overlapping genes was a normal distribution. The empirical p-value of each CNV overlapping gene was derived from this normal distribution. Then we selected genes with the top deletion scores (p-value < 0.01) as the representative genes related to cattle domestication. These genes were used to perform Gene Ontology (GO) analysis and pathway analysis in Database for Annotation, Visualization and Integrated Discovery (DAVID; version 6.7) .
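The scoring and ranking step can be sketched as below. Two assumptions are made and should be checked against the original formula: the deletion score of a gene is taken to be the sum, over the CNVs overlapping it, of the number of individuals carrying each deletion, and the p-value is taken as the upper tail of a normal distribution fitted to all gene scores (sample mean and sample standard deviation). The function names and the toy data are illustrative only.

```python
import math
from statistics import mean, stdev
from typing import Dict, List


def deletion_score(deletions_per_cnv: List[int]) -> int:
    """Assumed definition: sum of per-CNV deletion counts for one gene."""
    return sum(deletions_per_cnv)


def normal_upper_tail_p(scores: Dict[str, float]) -> Dict[str, float]:
    """P(X >= score) under a normal distribution fitted to all scores."""
    values = list(scores.values())
    mu, sd = mean(values), stdev(values)
    if sd == 0:
        raise ValueError("all scores are identical; no distribution to fit")
    return {
        gene: 0.5 * math.erfc((s - mu) / (sd * math.sqrt(2)))
        for gene, s in scores.items()
    }


# Toy input (not from the study): per-gene lists of deletion counts per CNV.
gene_cnvs = {
    "geneA": [3, 1],
    "geneB": [5],
    "geneC": [2, 2, 1],
    "geneD": [30, 28, 32],
}
scores = {gene: deletion_score(counts) for gene, counts in gene_cnvs.items()}
p_values = normal_upper_tail_p(scores)
for gene in sorted(p_values, key=p_values.get):
    print(gene, scores[gene], round(p_values[gene], 4))
```

With only four toy genes no p-value falls below 0.01; across the 1,228 CNV-overlapping genes of the study, the same cutoff was used to pick the top-scoring genes.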
Quantitative trait of deleted copy number variations
We compared CNV regions with the cattle QTL regions to explain the role of the extracted CNVs in quantitative traits. The quantitative trait content of each CNV was assessed by selecting QTL regions that overlapped with CNV regions in the 32 cows. The Animal QTL database was used to obtain all QTL region information. QTL traits of cattle can be largely divided into 12 traits with 3,605 loci. The length of QTL was found to be highly variable (minimum: 1000 bp; maximum: 134,956,528 bp; median: 208,803 bp; mean: 7,738,095 bp) (Additional file 23). The average distance between deletions, calculated as the QTL length divided by the deletion score, was defined as the CNV density within a QTL. The CNV density was calculated for all QTL related to cattle CNVs, and the top 30 QTL were selected as representative QTL related to cattle CNVs (Additional file 6).
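A minimal sketch of the density measure, assuming the QTL deletion score is the summed deletion count of every CNV whose interval overlaps the QTL, and that QTL are ranked by the shortest average distance. The `Cnv` and `Qtl` records, their field names, and the coordinates are invented for illustration.

```python
from typing import List, NamedTuple, Optional


class Cnv(NamedTuple):
    chrom: str
    start: int
    end: int
    n_deleted: int   # deletion count for this CNV among the sampled animals


class Qtl(NamedTuple):
    name: str
    chrom: str
    start: int
    end: int


def overlaps(q: Qtl, c: Cnv) -> bool:
    """Closed-interval overlap on the same chromosome."""
    return q.chrom == c.chrom and q.start <= c.end and c.start <= q.end


def average_distance(q: Qtl, cnvs: List[Cnv]) -> Optional[float]:
    """QTL length divided by its deletion score; None if no deletion overlaps."""
    score = sum(c.n_deleted for c in cnvs if overlaps(q, c))
    return (q.end - q.start) / score if score > 0 else None


def top_qtl(qtls: List[Qtl], cnvs: List[Cnv], n: int = 30) -> List[Qtl]:
    """QTL with the shortest average distance between deletions first."""
    scored = [(average_distance(q, cnvs), q) for q in qtls]
    scored = [(d, q) for d, q in scored if d is not None]
    return [q for d, q in sorted(scored, key=lambda pair: pair[0])[:n]]


# Toy data, not from the study.
cnvs = [Cnv("chr1", 1_000, 3_000, 20), Cnv("chr1", 50_000, 52_000, 5),
        Cnv("chr2", 10_000, 11_000, 8)]
qtls = [Qtl("short_dense", "chr1", 0, 10_000),
        Qtl("long_sparse", "chr1", 0, 1_000_000),
        Qtl("no_cnv", "chr3", 0, 5_000)]
print([q.name for q in top_qtl(qtls, cnvs)])
```

Dividing by the score is what keeps a long QTL from ranking highly merely because its length lets it overlap more deletions.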
Population structure analysis & phylogenetic inference
Two preliminary analyses were performed to infer the population structure of the 32 cows used in this study. First, the program STRUCTURE was used to evaluate the extent of substructuring between Holstein and Hanwoo. We determined that an initial burn-in of 10,000 iterations followed by 10,000 iterations for parameter estimation was sufficient to ensure convergence of the parameter estimates. To estimate the number of populations (the K parameter of STRUCTURE), the dataset was analyzed with K = 2 and K = 3 (Figure 5 a and b). Second, PCA was conducted on the CNV genotypes of the 32 cows using the statistical program R (Additional file 18).
To further examine the evolutionary history of the samples, we constructed a phylogenetic tree using a Bayesian inference (BI) approach. Bayesian phylogenetic inference is based on Bayes’s rule. It combines a prior distribution, which specifies the prior probability of different parameter values, with a likelihood function, which describes the probability of the data under different parameter values, and normalizes by the total probability of the data, summed and integrated over the parameter space. The resulting posterior distribution is the basis for inferring the phylogenetic tree. The phylogenetic analysis in this study was carried out using the BI method implemented in MrBayes 3.1.2 with the following options: nst: 6, rates: gamma, number of generations: 2,000,000, sample frequency: 100, number of chains: 4, and burn-in generation: 20,000. To estimate the reliability of the nodes, Bayesian posterior probability (BPP) values were calculated and are shown on the BI tree (Additional file 24).
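For reference, Bayes’s rule as applied to phylogenetic inference can be written in the following general form. The notation is introduced here and does not appear in the original text: τ denotes the tree topology, θ the remaining model parameters (branch lengths and substitution parameters), and D the data.

```latex
\[
P(\tau, \theta \mid D)
  = \frac{P(D \mid \tau, \theta)\, P(\tau, \theta)}{P(D)},
\qquad
P(D) = \sum_{\tau} \int P(D \mid \tau, \theta)\, P(\tau, \theta)\, d\theta
\]
```

The left-hand side is the posterior distribution, the numerator is the likelihood multiplied by the prior, and the denominator is the total probability of the data, summed over topologies and integrated over the continuous parameters.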
Identifying selection signal using FST
Wright defined several F coefficients that describe evolutionary processes. His definition was in terms of correlations among gametes, so we used Nei’s equivalent definitions in terms of deviations from expected heterozygosities, with the following notation:
Hobs = the observed frequency of heterozygotes
cj = relative size (proportion) of the jth subpopulation
pj = frequency of the deletion in the jth subpopulation
qj = frequency of the allele in the jth subpopulation
The estimated values of FST are shown in Figure 3.
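For illustration, the standard heterozygosity-based form of FST attributed to Nei is FST = (HT − HS) / HT, where HS is the size-weighted mean expected heterozygosity within subpopulations and HT is the expected heterozygosity of the pooled population. The Python sketch below assumes this standard form and a biallelic locus with qj = 1 − pj. Whether the authors used exactly this estimator is not stated in the text, and the input values are toy numbers.

```python
def fst(sizes, deletion_freqs):
    """Heterozygosity-based FST for one biallelic CNV.

    sizes          -- relative subpopulation sizes c_j (should sum to 1)
    deletion_freqs -- deletion frequency p_j in each subpopulation
    Assumes q_j = 1 - p_j for the other allele.
    """
    p_bar = sum(c * p for c, p in zip(sizes, deletion_freqs))
    h_total = 2.0 * p_bar * (1.0 - p_bar)
    h_sub = sum(c * 2.0 * p * (1.0 - p) for c, p in zip(sizes, deletion_freqs))
    if h_total == 0.0:
        # Monomorphic locus: FST is undefined; returning 0.0 is a choice made here.
        return 0.0
    return (h_total - h_sub) / h_total


# Relative sizes for 10 Holstein and 22 Hanwoo; the frequencies are toy values.
sizes = (10 / 32, 22 / 32)
fixed_difference = fst(sizes, (1.0, 0.0))  # deletion fixed in one breed only
no_difference = fst(sizes, (0.4, 0.4))     # identical frequencies in both breeds
```

A CNV whose deletion is fixed in one breed and absent from the other gives the maximum value, whereas identical frequencies in both breeds give a value of zero.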
Identification of breed-specific copy number variations
Each CNV that passed the applied filtering criteria was labeled as a putative breed-specific CNV if the deletion was present in only one of the two breeds. Among the putative breed-specific CNVs, those with a deletion frequency of more than 0.1 in the respective population were selected as breed-specific CNVs. Genes related to the breed-specific CNVs were selected, and Gene Ontology (GO) analysis was performed in the Database for Annotation, Visualization and Integrated Discovery (DAVID; version 6.7).
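The selection rule can be expressed as a short Python sketch. It assumes that the per-breed deletion frequencies have already been computed from the CNV genotypes and that the 0.1 threshold applies to the breed carrying the deletion. The function name and inputs are hypothetical.

```python
def breed_specific(freq_holstein, freq_hanwoo, min_freq=0.1):
    """Return the breed a CNV is specific to, or None.

    A CNV is putatively breed-specific when the deletion occurs in only one
    breed; it is retained when its frequency in that breed exceeds min_freq.
    """
    if freq_holstein > 0 and freq_hanwoo == 0:
        return "Holstein" if freq_holstein > min_freq else None
    if freq_hanwoo > 0 and freq_holstein == 0:
        return "Hanwoo" if freq_hanwoo > min_freq else None
    return None


# Toy frequencies (Holstein, Hanwoo), not values from the study.
examples = [(0.20, 0.0), (0.05, 0.0), (0.0, 0.50), (0.30, 0.10)]
labels = [breed_specific(h, k) for h, k in examples]
```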
We selected seven genes (TTN, SLIT3, KLHL1, NCAM2, MDGA2, EFNA5 and PRKG1) putatively affected by cattle domestication and performed PCR to confirm the 19 CNVs within these genes. The seven genes originally contained 25 CNVs, but six CNVs longer than 1.5 kb were excluded from this validation. Genomic DNA (gDNA) samples from the ten Holstein and 22 Hanwoo were used to validate the CNV regions called by Genome STRiP and to determine whether they were genuine CNV regions. The primer pairs were designed to lie outside the predicted CNV region or, for cases where only the deleted allele was detected, both inside and outside the CNV region (Additional file 20). Fifty nanograms of gDNA was used for PCR amplification, and the reaction was performed using a 2× PCR master mix solution (iNtRON Bio Technology, Seongnam, Gyeonggi, Korea) with 0.5 μM of each primer set. The amplification was performed under the following conditions: 1 cycle of 95°C for 5 min; 35 cycles of 95°C for 30 sec, annealing at 58 to 66°C for 30 sec, and 72°C for 1 min or 1 min 30 sec (Additional file 25); and 1 cycle of 72°C for 10 min. All PCR products were visualized on 1% ethidium bromide-stained gels run for 25 min.
Availability of supporting data
The data set supporting the results of this article is available in the NCBI SRA (PRJNA210523; Hanwoo from RDA in Suwon, PRJNA210521; Holstein from RDA in Suwon, PRJNA210519; Hanwoo from Gyeong-buk).
We thank our colleagues in the Laboratory of Bioinformatics and Population Genetics of Seoul National University for valuable advice on this study. This study was supported in part by “Cooperative Research Program for Agriculture Science & Technology Development (Project No. PJ008487)” and “the BioGreen 21 program (No. PJ008191)” of Rural Development Administration, Republic of Korea.
- Elsik CG, Tellam RL, Worley KC: The genome sequence of taurine cattle: a window to ruminant biology and evolution. Science. 2009, 324 (5926): 522-528.
- Liu Y, Qin X, Song X-ZH, Jiang H, Shen Y, Durbin KJ, Lien S, Kent MP, Sodeland M, Ren Y: Bos taurus genome assembly. BMC Genomics. 2009, 10 (1): 180. 10.1186/1471-2164-10-180.
- Zimin AV, Delcher AL, Florea L, Kelley DR, Schatz MC, Puiu D, Hanrahan F, Pertea G, Van Tassell CP, Sonstegard TS: A whole-genome assembly of the domestic cow, Bos taurus. Genome Biol. 2009, 10 (4): R42. 10.1186/gb-2009-10-4-r42.
- Matukumalli LK, Lawley CT, Schnabel RD, Taylor JF, Allan MF, Heaton MP, O'Connell J, Moore SS, Smith TP, Sonstegard TS: Development and characterization of a high density SNP genotyping assay for cattle. PLoS One. 2009, 4 (4): e5350. 10.1371/journal.pone.0005350.
- Eck SH, Benet-Pagès A, Flisikowski K, Meitinger T, Fries R, Strom TM: Whole genome sequencing of a single Bos taurus animal for single nucleotide polymorphism discovery. Genome Biol. 2009, 10 (8): R82. 10.1186/gb-2009-10-8-r82.
- Stothard P, Choi J-W, Basu U, Sumner-Thomson JM, Meng Y, Liao X, Moore SS: Whole genome resequencing of black Angus and Holstein cattle for SNP and CNV discovery. BMC Genomics. 2011, 12 (1): 559. 10.1186/1471-2164-12-559.
- Barendse W, Harrison B, Bunch R, Thomas M, Turner L: Genome wide signatures of positive selection: The comparison of independent samples and the identification of regions associated to traits. BMC Genomics. 2009, 10 (1): 178. 10.1186/1471-2164-10-178.
- Gibbs RA, Taylor JF, Van Tassell CP, Barendse W, Eversole KA, Gill CA, Green RD, Hamernik DL, Kappes SM, Lien S: Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds. Science. 2009, 324 (5926): 528-532.
- Hayes B, Chamberlain A, Maceachern S, Savin K, McPartlan H, MacLeod I, Sethuraman L, Goddard M: A genome map of divergent artificial selection between Bos taurus dairy cattle and Bos taurus beef cattle. Anim Genet. 2009, 40 (2): 176-184. 10.1111/j.1365-2052.2008.01815.x.
- Mills RE, Walter K, Stewart C, Handsaker RE, Chen K, Alkan C, Abyzov A, Yoon SC, Ye K, Cheetham RK: Mapping copy number variation by population-scale genome sequencing. Nature. 2011, 470 (7332): 59-65. 10.1038/nature09708.
- Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero MH, Carson AR, Chen W: Global variation in copy number in the human genome. Nature. 2006, 444 (7118): 444-454. 10.1038/nature05329.
- Conrad DF, Pinto D, Redon R, Feuk L, Gokcumen O, Zhang Y, Aerts J, Andrews TD, Barnes C, Campbell P: Origins and functional impact of copy number variation in the human genome. Nature. 2009, 464 (7289): 704-712.
- Altshuler DM, Gibbs RA, Peltonen L, Dermitzakis E, Schaffner SF, Yu F, Bonnen PE, De Bakker P, Deloukas P, Gabriel SB: Integrating common and rare genetic variation in diverse human populations. Nature. 2010, 467 (7311): 52-58. 10.1038/nature09298.
- Graubert TA, Cahan P, Edwin D, Selzer RR, Richmond TA, Eis PS, Shannon WD, Li X, McLeod HL, Cheverud JM: A high-resolution map of segmental DNA copy number variation in the mouse genome. PLoS Genet. 2007, 3 (1): e3. 10.1371/journal.pgen.0030003.
- Guryev V, Saar K, Adamovic T, Verheul M, Van Heesch SA, Cook S, Pravenec M, Aitman T, Jacob H, Shull JD: Distribution and functional impact of DNA copy number variation in the rat. Nat Genet. 2008, 40 (5): 538-545. 10.1038/ng.141.
- She X, Cheng Z, Zöllner S, Church DM, Eichler EE: Mouse segmental duplication and copy number variation. Nat Genet. 2008, 40 (7): 909-914. 10.1038/ng.172.
- Yalcin B, Wong K, Agam A, Goodson M, Keane TM, Gan X, Nellåker C, Goodstadt L, Nicod J, Bhomra A: Sequence-based characterization of structural variation in the mouse genome. Nature. 2011, 477 (7364): 326-329. 10.1038/nature10432.
- Aitman TJ, Dong R, Vyse TJ, Norsworthy PJ, Johnson MD, Smith J, Mangion J, Roberton-Lowe C, Marshall AJ, Petretto E: Copy number polymorphism in Fcgr3 predisposes to glomerulonephritis in rats and humans. Nature. 2006, 439 (7078): 851-855. 10.1038/nature04489.
- Fellermann K, Stange DE, Schaeffeler E, Schmalzl H, Wehkamp J, Bevins CL, Reinisch W, Teml A, Schwab M, Lichter P: A chromosome 8 gene-cluster polymorphism with low human beta-defensin 2 gene copy number predisposes to Crohn disease of the colon. Am J Hum Genet. 2006, 79 (3): 439-448. 10.1086/505915.
- le Maréchal C, Masson E, Chen J-M, Morel F, Ruszniewski P, Levy P, Férec C: Hereditary pancreatitis caused by triplication of the trypsinogen locus. Nat Genet. 2006, 38 (12): 1372-1374. 10.1038/ng1904.
- Stankiewicz P, Lupski JR: Structural variation in the human genome and its role in disease. Annu Rev Med. 2010, 61: 437-455. 10.1146/annurev-med-100708-204735.
- Yang Y, Chung EK, Wu YL, Savelli SL, Nagaraja HN, Zhou B, Hebert M, Jones KN, Shu Y, Kitzmiller K: Gene copy-number variation and associated polymorphisms of complement component C4 in human systemic lupus erythematosus (SLE): low copy number is a risk factor for and high copy number is a protective factor against SLE susceptibility in European Americans. Am J Hum Genet. 2007, 80 (6): 1037-1054. 10.1086/518257.
- Zhang F, Gu W, Hurles ME, Lupski JR: Copy number variation in human health, disease, and evolution. Annu Rev Genomics Hum Genet. 2009, 10: 451-481. 10.1146/annurev.genom.9.081307.164217.
- Bickhart DM, Hou Y, Schroeder SG, Alkan C, Cardone MF, Matukumalli LK, Song J, Schnabel RD, Ventura M, Taylor JF: Copy number variation of individual cattle genomes using next-generation sequencing. Genome Res. 2012, 22 (4): 778-790. 10.1101/gr.133967.111.
- Bae J, Cheong H, Kim L, NamGung S, Park T, Chun J-Y, Kim J, Pasaje C, Lee J, Shin H: Identification of copy number variations and common deletion polymorphisms in cattle. BMC Genomics. 2010, 11 (1): 232. 10.1186/1471-2164-11-232.
- Fadista J, Thomsen B, Holm L-E, Bendixen C: Copy number variation in the bovine genome. BMC Genomics. 2010, 11 (1): 284. 10.1186/1471-2164-11-284.
- Liu GE, Hou Y, Zhu B, Cardone MF, Jiang L, Cellamare A, Mitra A, Alexander LJ, Coutinho LL, Dell'Aquila ME: Analysis of copy number variations among diverse cattle breeds. Genome Res. 2010, 20 (5): 693-703. 10.1101/gr.105403.110.
- Chen W-K, Swartz JD, Rush LJ, Alvarez CE: Mapping DNA structural variation in dogs. Genome Res. 2009, 19 (3): 500-509.
- Fontanesi L, Beretti F, Martelli P, Colombo M, Dall'Olio S, Occidente M, Portolano B, Casadio R, Matassino D, Russo V: A first comparative map of copy number variations in the sheep genome. Genomics. 2011, 97 (3): 158-165. 10.1016/j.ygeno.2010.11.005.
- Fontanesi L, Beretti F, Riggio V, Dall’Olio S, Davoli R, Russo V, Portolano B: Copy number variation and missense mutations of the agouti signaling protein (ASIP) gene in goat breeds with different coat colors. Cytogenet Genome Res. 2009, 126 (4): 333-347. 10.1159/000268089.
- Kijas JW, Barendse W, Barris W, Harrison B, McCulloch R, McWilliam S, Whan V: Analysis of copy number variants in the cattle genome. Gene. 2011, 482 (1): 73-77.
- Nicholas TJ, Cheng Z, Ventura M, Mealey K, Eichler EE, Akey JM: The genomic architecture of segmental duplications and associated copy number variants in dogs. Genome Res. 2009, 19 (3): 491-499.
- Ramayo-Caldas Y, Castelló A, Pena R, Alves E, Mercadé A, Souza C, Fernández A, Perez-Enciso M, Folch J: Copy number variation in the porcine genome inferred from a 60 k SNP BeadChip. BMC Genomics. 2010, 11 (1): 593. 10.1186/1471-2164-11-593.
- Drögemüller C, Distl O, Leeb T: Partial deletion of the bovine ED1 gene causes anhidrotic ectodermal dysplasia in cattle. Genome Res. 2001, 11 (10): 1699-1705. 10.1101/gr.182501.
- Porto Neto LR, Jonsson NN, D’Occhio MJ, Barendse W: Molecular genetic approaches for identifying the basis of variation in resistance to tick infestation in cattle. Vet Parasitol. 2011, 180 (3): 165-172.
- LaFramboise T: Single nucleotide polymorphism arrays: a decade of biological, computational and technological advances. Nucleic Acids Res. 2009, 37 (13): 4181-4193. 10.1093/nar/gkp552.
- Lai WR, Johnson MD, Kucherlapati R, Park PJ: Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data. Bioinformatics. 2005, 21 (19): 3763-3770. 10.1093/bioinformatics/bti611.
- Pinto D, Darvishi K, Shi X, Rajan D, Rigler D, Fitzgerald T, Lionel AC, Thiruvahindrapuram B, MacDonald JR, Mills R: Comprehensive assessment of array-based platforms and calling algorithms for detection of copy number variants. Nat Biotechnol. 2011, 29 (6): 512-520. 10.1038/nbt.1852.
- Winchester L, Yau C, Ragoussis J: Comparing CNV detection methods for SNP arrays. Brief Funct Genomic Proteomic. 2009, 8 (5): 353-366. 10.1093/bfgp/elp017.
- Alkan C, Coe BP, Eichler EE: Genome structural variation discovery and genotyping. Nat Rev Genet. 2011, 12 (5): 363-376. 10.1038/nrg2958.
- Handsaker RE, Korn JM, Nemesh J, McCarroll SA: Discovery and genotyping of genome structural polymorphism by sequencing on a population scale. Nat Genet. 2011, 43 (3): 269-276. 10.1038/ng.768.
- Winther M, Berezin V, Walmod PS: NCAM2/OCAM/RNCAM: cell adhesion molecule with a role in neuronal compartmentalization. Int J Biochem Cell Biol. 2012, 44 (3): 441-446. 10.1016/j.biocel.2011.11.020.
- McIntyre JC, Titlow WB, McClintock TS: Axon growth and guidance genes identify nascent, immature, and mature olfactory sensory neurons. J Neurosci Res. 2010, 88 (15): 3243-3256. 10.1002/jnr.22497.
- Xu X-ZS, Wes PD, Chen H, Li H-S, Yu M, Morgan S, Liu Y, Montell C: Retinal targets for calmodulin include proteins implicated in synaptic transmission. J Biol Chem. 1998, 273 (47): 31297-31307. 10.1074/jbc.273.47.31297.
- Collingridge GL, Lester RA: Excitatory amino acid receptors in the vertebrate central nervous system. Pharmacol Rev. 1989, 41 (2): 143-210.
- Meldrum B, Garthwaite J: Excitatory amino acid neurotoxicity and neurodegenerative disease. Trends Pharmacol Sci. 1990, 11 (9): 379-387. 10.1016/0165-6147(90)90184-A.
- Bliss TV, Collingridge GL: A synaptic model of memory: long-term potentiation in the hippocampus. Nature. 1993, 361 (6407): 31-39. 10.1038/361031a0.
- Cartmell J, Schoepp DD: Regulation of neurotransmitter release by metabotropic glutamate receptors. J Neurochem. 2000, 75 (3): 889-907.
- Yagi T, Takeichi M: Cadherin superfamily genes: functions, genomic organization, and neurologic diversity. Genes Dev. 2000, 14 (10): 1169-1180.
- Davy A, Gale NW, Murray EW, Klinghoffer RA, Soriano P, Feuerstein C, Robbins SM: Compartmentalized signaling by GPI-anchored ephrin-A5 requires the Fyn tyrosine kinase to regulate cellular adhesion. Genes Dev. 1999, 13 (23): 3125-3135. 10.1101/gad.13.23.3125.
- Sasaki S, Shionoya A, Ishida M, Gambello MJ, Yingling J, Wynshaw-Boris A, Hirotsune S: A LIS1/NUDEL/cytoplasmic dynein heavy chain complex in the developing and adult nervous system. Neuron. 2000, 28 (3): 681-696. 10.1016/S0896-6273(00)00146-X.
- Brose K, Tessier-Lavigne M: Slit proteins: key regulators of axon guidance, axonal branching, and cell migration. Curr Opin Neurobiol. 2000, 10 (1): 95-102. 10.1016/S0959-4388(99)00066-5.
- Gleeson JG, Lin PT, Flanagan LA, Walsh CA: Doublecortin is a microtubule-associated protein and is expressed widely by migrating neurons. Neuron. 1999, 23 (2): 257-271. 10.1016/S0896-6273(00)80778-3.
- Bilimoria PM, Bonni A: Molecular control of axon branching. Neuroscientist. 2013, 19 (1): 16-24. 10.1177/1073858411426201.
- Rønn LCB, Hartz B, Bock E: The neural cell adhesion molecule (NCAM) in development and plasticity of the nervous system. Exp Gerontol. 1998, 33 (7): 853-864.
- Hara Y, Nomura T, Yoshizaki K, Frisén J, Osumi N: Impaired Hippocampal neurogenesis and vascular formation in ephrin-A5-deficient mice. Stem Cells. 2010, 28 (5): 974-983.
- Litwack ED, Babey R, Buser R, Gesemann M, O'Leary DD: Identification and characterization of two novel brain-derived immunoglobulin superfamily members with a unique structural organization. Mol Cell Neurosci. 2004, 25 (2): 263-274. 10.1016/j.mcn.2003.10.016.
- Nemes JP, Benzow KA, Koob MD: The SCA8 transcript is an antisense RNA to a brain-specific transcript encoding a novel actin-binding protein (KLHL1). Hum Mol Genet. 2000, 9 (10): 1543-1551. 10.1093/hmg/9.10.1543.
- Itoh A, Miyabayashi T, Ohno M, Sakano S: Cloning and expressions of three mammalian homologues of Drosophila slit suggest possible roles for Slit in the formation and maintenance of the nervous system. Mol Brain Res. 1998, 62 (2): 175-186. 10.1016/S0169-328X(98)00224-1.
- Yoneyama M, Kawada K, Shiba T, Ogita K: Endogenous nitric oxide generation linked to ryanodine receptors activates cyclic GMP/protein kinase G pathway for cell proliferation of neural stem/progenitor cells derived from embryonic hippocampus. J Pharmacol Sci. 2011, 115 (2): 182-195. 10.1254/jphs.10290FP.
- Nagae S, Tanoue T, Takeichi M: Temporal and spatial expression profiles of the Fat3 protein, a giant cadherin molecule, during mouse development. Dev Dyn. 2007, 236 (2): 534-543. 10.1002/dvdy.21030.
- Hu Z-L, Fritz ER, Reecy JM: AnimalQTLdb: a livestock QTL database tool set for positional QTL information mining and beyond. Nucleic Acids Res. 2007, 35 (suppl 1): D604-D609.
- Yamada T, Sasaki S, Sukegawa S, Yoshioka S, Takahagi Y, Morita M, Murakami H, Morimatsu F, Fujita T, Miyake T: Association of a single nucleotide polymorphism in titin gene with marbling in Japanese Black beef cattle. BMC Res Notes. 2009, 2 (1): 78. 10.1186/1756-0500-2-78.
- Heinola T, de Grauw J, Virkki L, Kontinen A, Raulo S, Sukura A, Konttinen Y: Bovine chronic osteoarthritis causes minimal change in synovial fluid. J Comp Pathol. 2013, 148 (4): 335-344. 10.1016/j.jcpa.2012.08.001.
- Cole J, Wiggans G, Ma L, Sonstegard T, Lawlor T, Crooker B, Van Tassell C, Yang J, Wang S, Matukumalli L: Genome-wide association analysis of thirty one production, health, reproduction and body conformation traits in contemporary US Holstein cows. BMC Genomics. 2011, 12 (1): 408. 10.1186/1471-2164-12-408.
- Pipes G, Bauman T, Brooks J, Comfort J, Turner C: Effect of season, sex and breed on the thyroxine secretion rate of beef cattle and a comparison with dairy cattle. J Anim Sci. 1963, 22 (2): 476-480.
- Fukuda T, Sugita S, Inatome R, Yanagi S: CAMDI, a novel disrupted in schizophrenia 1 (DISC1)-binding protein, is required for radial migration. J Biol Chem. 2010, 285 (52): 40554-40561. 10.1074/jbc.M110.179481.
- Mohammad MA, Hadsell DL, Haymond MW: Gene regulation of UDP-galactose synthesis and transport: potential rate-limiting processes in initiation of milk production in humans. Am J Physiol Endocrinol Metab. 2012, 303 (3): E365-E376. 10.1152/ajpendo.00175.2012.
- Perez R, Cañón J, Dunner S: Genes associated with long-chain omega-3 fatty acids in bovine skeletal muscle. J Appl Genet. 2010, 51 (4): 479-487. 10.1007/BF03208877.
- Moore TM, Garg R, Johnson C, Coptcoat MJ, Ridley AJ, Morris JD: PSK, a novel STE20-like kinase derived from prostatic carcinoma that activates the c-Jun N-terminal kinase mitogen-activated protein kinase pathway and regulates actin cytoskeletal organization. J Biol Chem. 2000, 275 (6): 4311-4322. 10.1074/jbc.275.6.4311.
- Fluckey JD, Knox M, Smith L, Dupont-Versteegden EE, Gaddy D, Tesch PA, Peterson CA: Insulin-facilitated increase of muscle protein synthesis after resistance exercise involves a MAP kinase pathway. Am J Physiol Endocrinol Metab. 2006, 290 (6): E1205-E1211. 10.1152/ajpendo.00593.2005.
- Aspenström P: A Cdc42 target protein with homology to the non-kinase domain of FER has a potential role in regulating the actin cytoskeleton. Curr Biol. 1997, 7 (7): 479-487. 10.1016/S0960-9822(06)00219-3.
- Yin H-Q, Kim M, Kim J-H, Kong G, Kang K-S, Kim H-L, Yoon B-I, Lee M-O, Lee B-H: Differential gene expression and lipid metabolism in fatty liver induced by acute ethanol treatment in mice. Toxicol Appl Pharmacol. 2007, 223 (3): 225-233. 10.1016/j.taap.2007.06.018.
- Sun H, Tonks NK: The coordinated action of protein tyrosine phosphatases and kinases in cell signaling. Trends Biochem Sci. 1994, 19 (11): 480-485. 10.1016/0968-0004(94)90134-1.
- Rodrigues GA, Falasca M, Zhang Z, Ong SH, Schlessinger J: A novel positive feedback loop mediated by the docking protein Gab1 and phosphatidylinositol 3-kinase in epidermal growth factor receptor signaling. Mol Cell Biol. 2000, 20 (4): 1448-1459. 10.1128/MCB.20.4.1448-1459.2000.
- Lannon CL, Sorensen PHB: ETV6-NTRK3: a chimeric protein tyrosine kinase with transformation activity in multiple cell lineages. Semin. Cancer Biol. 2005, 15: 215-223. 10.1016/j.semcancer.2005.01.003.
- Takada Y, Ye X, Simon S: The integrins. Genome Biol. 2007, 8 (5): 215. 10.1186/gb-2007-8-5-215.
- Veit G, Kobbe B, Keene DR, Paulsson M, Koch M, Wagener R: Collagen XXVIII, a novel von Willebrand factor A domain-containing protein with many imperfections in the collagenous domain. J Biol Chem. 2006, 281 (6): 3494-3504. 10.1074/jbc.M509333200.
- Rosato R, Veltmaat JM, Groffen J, Heisterkamp N: Involvement of the tyrosine kinase fer in cell adhesion. Mol Cell Biol. 1998, 18 (10): 5762-5770.
- Senetar MA, Moncman CL, McCann RO: Talin2 is induced during striated muscle differentiation and is targeted to stable adhesion complexes in mature muscle. Cell Motil Cytoskeleton. 2007, 64 (3): 157-173. 10.1002/cm.20173.
- Vidal F, Baudoin C, Miquel C, Galliano M-F, Christiano AM, Uitto J, Ortonne J-P, Meneguzzi G: Cloning of the laminin α3 chain gene (LAMA3) and identification of a homozygous deletion in a patient with Herlitz junctional epidermolysis bullosa. Genomics. 1995, 30 (2): 273-280. 10.1006/geno.1995.9877.
- Runswick SK, O'Hare MJ, Jones L, Streuli CH, Garrod DR: Desmosomal adhesion regulates epithelial morphogenesis and cell positioning. Nat Cell Biol. 2001, 3 (9): 823-830. 10.1038/ncb0901-823.
- Halbleib JM, Nelson WJ: Cadherins in development: cell adhesion, sorting, and tissue morphogenesis. Genes Dev. 2006, 20 (23): 3199-3214. 10.1101/gad.1486806.
- Marthiens V, Gavard J, Lambert M, Mège RM: Cadherin-based cell adhesion in neuromuscular development. Biol Cell. 2002, 94 (6): 315-326. 10.1016/S0248-4900(02)00005-9.
- Mitsui K, Nakajima D, Ohara O, Nakayama M: Mammalian fat3: a large protein that contains multiple cadherin and EGF-like motifs. Biochem Biophys Res Commun. 2002, 290 (4): 1260-1266. 10.1006/bbrc.2002.6338.
- Baik M, Etchebarne B, Bong J, VandeHaar M: Gene expression profiling of liver and mammary tissues of lactating dairy cows. Asian Austral J Animal Sci. 2009, 6: 871-884.
- Winters SJ, Moore JP: PACAP, an autocrine/paracrine regulator of gonadotrophs. Biol Reprod. 2011, 84 (5): 844-850. 10.1095/biolreprod.110.087593.
- Connor E, Siferd S, Elsasser T, Evock-Clover C, Van Tassell C, Sonstegard T, Fernandes V, Capuco A: Effects of increased milking frequency on gene expression in the bovine mammary gland. BMC Genomics. 2008, 9 (1): 362. 10.1186/1471-2164-9-362.
- Casey T, Plaut K: LACTATION BIOLOGY SYMPOSIUM: Circadian clocks as mediators of the homeorhetic response to lactation. J Anim Sci. 2012, 90 (3): 744-754. 10.2527/jas.2011-4590.
- Li H, Wang Z, Moore SS, Schenkel FS, Stothard P: Genome-wide scan for positional and functional candidate genes affecting milk production traits in Canadian Holstein cattle. 2010, Leipzig, Germany: Proc. 9th WCGALP, http://www.kongressband.de/wcgalp2010/assets/pdf/0535.pdf Accessed Nov, 2010.
- Dostaler-Touchette V, Bédard F, Guillemette C, Pothier F, Chouinard P, Richard F: Cyclic adenosine monophosphate (cAMP)-specific phosphodiesterase is functional in bovine mammary gland. J Dairy Sci. 2009, 92 (8): 3757-3765. 10.3168/jds.2009-2065.
- Bionaz M, Periasamy K, Rodriguez-Zas SL, Hurley WL, Loor JJ: A novel dynamic impact approach (DIA) for functional analysis of time-course omics studies: validation using the bovine mammary transcriptome. PLoS One. 2012, 7 (3): e32455. 10.1371/journal.pone.0032455.
- D’Alessandro A, Zolla L, Scaloni A: The bovine milk proteome: cherishing, nourishing and fostering molecular complexity. An interactomics and functional overview. Mol Biosyst. 2011, 7 (3): 579-597. 10.1039/c0mb00027b.
- Sadkowski T, Jank M, Zwierzchowski L, Oprządek J, Motyl T: Comparison of skeletal muscle transcriptional profiles in dairy and beef breeds bulls. J Appl Genet. 2009, 50 (2): 109-123. 10.1007/BF03195662.
- Watanabe N, Satoh Y, Fujita T, Ohta T, Kose H, Muramatsu Y, Yamamoto T, Yamada T: Distribution of allele frequencies at TTN g. 231054C > T, RPL27A g. 3109537C > T and AKIRIN2 c.* 188G > A between Japanese Black and four other cattle breeds with differing historical selection for marbling. BMC Res Notes. 2011, 4 (1): 10. 10.1186/1756-0500-4-10.
- Ramos A, Megens H, Crooijmans R, Schook L, Groenen M: Identification of high utility SNPs for population assignment and traceability purposes in the pig using high-throughput sequencing. Anim Genet. 2011, 42 (6): 613-620. 10.1111/j.1365-2052.2011.02198.x.
- Robinson GE, Fernald RD, Clayton DF: Genes and social behavior. Science. 2008, 322 (5903): 896-900. 10.1126/science.1159277.
- Niimura Y, Nei M: Extensive gains and losses of olfactory receptor genes in mammalian evolution. PLoS One. 2007, 2 (8): e708. 10.1371/journal.pone.0000708.
- Seroussi E, Glick G, Shirak A, Yakobson E, Weller J, Ezra E, Zeron Y: Analysis of copy loss and gain variations in Holstein cattle autosomes using BeadChip SNPs. BMC Genomics. 2010, 11 (1): 673. 10.1186/1471-2164-11-673.
- Lee K, Nguyen DT, Choi M, Cha S-Y, Kim J-H, Dadi H, Seo HG, Seo K, Chun T, Park C: Analysis of cattle olfactory subgenome: the first detail study on the characteristics of the complete olfactory receptor repertoire of a ruminant. BMC Genomics. 2013, 14 (1): 596. 10.1186/1471-2164-14-596.
- Youn H-D, Grozinger CM, Liu JO: Calcium regulates transcriptional repression of myocyte enhancer factor 2 by histone deacetylase 4. J Biol Chem. 2000, 275 (29): 22563-22567. 10.1074/jbc.C000304200.
- Langmead B, Salzberg SL: Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012, 9 (4): 357-359. 10.1038/nmeth.1923.
- McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M: The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20 (9): 1297-1303. 10.1101/gr.107524.110.
- Flicek P, Amode MR, Barrell D, Beal K, Brent S, Carvalho-Silva D, Clapham P, Coates G, Fairley S, Fitzgerald S: Ensembl 2012. Nucleic Acids Res. 2012, 40 (D1): D84-D90. 10.1093/nar/gkr991.
- Smedley D, Haider S, Ballester B, Holland R, London D, Thorisson G, Kasprzyk A: BioMart–biological queries made easy. BMC Genomics. 2009, 10 (1): 22. 10.1186/1471-2164-10-22.
- Huang DW, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2008, 4 (1): 44-57. 10.1038/nprot.2008.211.
- Pritchard JK, Stephens M, Donnelly P: Inference of population structure using multilocus genotype data. Genetics. 2000, 155 (2): 945-959.
- Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19 (12): 1572-1574. 10.1093/bioinformatics/btg180.
- Wright S: The genetical structure of populations. Ann Eugen. 1949, 15 (1): 323-354. 10.1111/j.1469-1809.1949.tb02451.x.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.