Skip to main content

Genome-wide association analysis of heifer livability and early first calving in Holstein cattle

Abstract

Background

The survival and fertility of heifers are critical factors for the success of dairy farms. The mortality of heifers poses a significant challenge to the management and profitability of the dairy industry. In dairy farming, achieving early first calving of heifers is also essential for optimal productivity and sustainability. Recently, Council on Dairy Cattle Breeding (CDCB) and USDA have developed new evaluations of heifer health and fertility traits. However, the genetic basis of these traits has yet to be thoroughly studied.

Results

Leveraging the extensive U.S dairy genomic database maintained at CDCB, we conducted large-scale GWAS analyses of two heifer traits, livability and early first calving. Despite the large sample size, we found no major QTL for heifer livability. However, we identified a major QTL in the bovine MHC region associated with early first calving. Our GO analysis based on nearby genes detected 91 significant GO terms with a large proportion related to the immune system. This QTL in the MHC region was also confirmed in the analysis of 27 K bull with imputed sequence variants. Since these traits have few major QTL, we evaluated the genome-wide distribution of GWAS signals across different functional genomics categories. For heifer livability, we observed significant enrichment in promotor and enhancer-related regions. For early calving, we found more associations in active TSS, active Elements, and Insulator. We also identified significant enrichment of CDS and conserved variants in the GWAS results of both traits. By linking GWAS results and transcriptome data from the CattleGTEx project via TWAS, we detected four and 23 significant gene-trait association pairs for heifer livability and early calving, respectively. Interestingly, we discovered six genes for early calving in the Bovine MHC region, including two genes in lymph node tissue and one gene each in blood, adipose, hypothalamus, and leukocyte.

Conclusion

Our large-scale GWAS analyses of two heifer traits identified a major QTL in the bovine MHC region for early first calving. Additional functional enrichment and TWAS analyses confirmed the MHC QTL with relevant biological evidence. Our results revealed the complex genetic basis of heifer health and fertility traits and indicated a potential connection between the immune system and reproduction in cattle.

Peer Review reports

Background

Heifers are young female cows that have not yet given birth to a calf and are the future of the dairy herd. Heifer health and fertility are crucial for the success of a dairy farm, as it directly affects milk production and the sustainability of a farm [1]. Healthy heifers are more likely to produce more milk once they begin lactating. In addition, heifers need to be healthy and well-cared for to conceive and carry a calf to term successfully. Unhealthy heifers may have difficulty getting pregnant, leading to lower milk production and fewer replacement cows for the farmer [2]. Diseased heifers may also require costly veterinary treatments that can add up quickly and cut into profits. Early first calving in heifers is also important for dairy farming, particularly for economic and environmental considerations, because it can reduce unproductive periods and increase lifetime production, faster generation turnover and selection progress, and improve reproductive efficiency [3]. In summary, heifer health and fertility are critical for a profitable and sustainable dairy operation [1].

The health and fertility of cows are complex issues that involve various factors, including nutrition, environment, physiology, and genetics [4,5,6]. Compared to cows, heifers generally encounter more challenges as heifers have not yet reached sexual maturity and are not yet capable of smooth reproduction and production. Moreover, heifers typically have additional nutritional requirements than cows, as they are still growing and developing. Despite the complexity of cattle health and fertility, many GWAS studies have been conducted to identify genomic regions and genes associated with health and fertility-related traits in cattle [7,8,9,10,11,12]. For instance, the bovine MHC region has been associated with cow livability and immune system-related diseases [7]. The ABCC9 and GC genes have been associated with pregnancy rate [8], while ARRDC3 was associated with growth and calving traits [4]. Heifer fertility and health traits are less studied than cows, mainly due to limited data availability.

Although the heritability of fertility and health traits tends to be relatively low, CDCB and the USDA Animal Genomics and Improvement Lab have been evaluating fertility and health-related traits using the large volume of data collected from the dairy industry (https://uscdcb.com/). Recently, they added heifer livability and early first calving to the evaluation system [13, 14]. Heifer livability represents the expected survival percentage of an animal’s female offspring from 2 days after birth up to 18 months of age in a herd with average management conditions. Larger, positive values of heifer livability are more favorable. It measures a heifer’s overall resistance to causes leading to mortality. Since the most common reasons for heifer death are digestive and respiratory diseases [15], heifer livability is primarily related to the resistance to these diseases and other causes of death. The heritability of heifer mortality has been estimated to be less than 1% in many studies [13, 16, 17]. Early first calving (EFC) is defined as the age at first calving. As a heifer fertility trait, the heritability of EFC is only 2–3% [14, 18, 19]. As part of the genetic evaluation process, traits have been corrected for management effect by CDCB, resulting in a PTA (Predicted Transmission Ability) that can be used directly for genetic studies. In this research, we aim to identify genes and genomic regions associated with these heifer traits using the large amount of genotype and phenotype data from the US dairy genomic database. To further boost power, we also included transcriptome and other functional genomics data for fine-mapping and validation.

Results

Large-sample GWAS of heifer livability and early calving

Our large-sample GWAS started with a discovery population of 3,649,734 genotyped Holstein cattle (336,386 bulls and 3,313,348 cows). After calculating deregressed predicted transmitting ability (PTA) as phenotype and editing, we included 510,318 and 768,645 animals for the GWAS of heifer livability and early first calving, respectively. All of the animals were imputed to 79 K SNPs, and we retained 73,554 SNPs after QC editing. We applied SLEMM [20] to perform the GWAS analyses that can efficiently run large-scale mixed models and incorporate variational residual variances for differential reliabilities of degressed PTAs. As a result, we found only one QTL region for each of the two traits (Fig. 1A and B C, and 1D). Nonetheless, for both traits, the P values for the majority of SNPs showed no inflation of test statistics and good quality of the results (Fig. 1A C). After removing SNPs with low minor allele frequency (MAF), only one SNP (ARS-BFGL-NGS-105563, P = 1.28e-07) passed the Bonferroni-corrected threshold for early calving (Table 1). Interestingly, this SNP is located near the Bovine MHC region on BTA 23 that encodes many fundamental molecules for regulating the immune response [21].

Fig. 1
figure 1

Large-scale GWAS results of heifer livability and early calving based on the 79 K SNPs. (A) Quantile–quantile (QQ) plot for heifer livability. (B) Manhattan plot for heifer livability. (C) QQ plot for early calving. (D) Manhattan plot for early calving. The red horizontal lines correspond to the genome-wide significance threshold

Table 1 GWAS results for two heifer traits based on 79 K SNPs

Despite the few QTL regions detected in the initial GWAS, we evaluated all SNPs passing the suggestive significance levels for functional annotation analyses. For heifer livability, we obtained 118 genes located within or overlapping the vicinity of leading SNPs (< 1 Mb) using BioMart in the Ensembl database (Ensembl Genes 106; Table S1). Several genes close to the top SNPs exhibited biological relevance for cow livability, including CHCHD7 and PLAG1, which are related to growth and development [22] and the LYN gene related to the regulation of innate and adaptive immune responses [23]. We also performed GO analysis by KOBAS [24] to determine the potential biological functions of these genes. As a result, 139 significant GO terms (P < 0.05) were found, with the top relevant terms being mineral absorption, homeostasis, metabolic process, and development (Table S2). According to existing studies and the cattle QTL database [25], in the upstream and downstream 1 Mb range of the top SNPs, many QTLs were previously associated with milk production, body type, and disease related traits in dairy cattle (Table S3).

For early first calving, we identified 596 genes within or near the associated SNPs (Table S4). Notably, the top associated SNPs were located within or near the bovine MHC region on BTA 23, indicating potential connections between the immune system and early first calving [26]. Many nearby genes were involved with immune functions and relevant biology for early calving, including ABCF1, ABHD16A, AGER, BOLA-NC1, BTN1A1, LTA, LTB, etc. We also performed the GO analysis based on these genes and detected 91 significant GO terms (P < 0.05) with a large proportion associated with immune processes (Table S5). Finally, previously reported QTL within 1 Mb of associated SNPs were associated with milk production, reproduction, body type, and disease-related traits in cattle (Table S6).

Sequence-level GWAS and fine mapping of heifer livability and early calving in 27,235 bulls

To refine the GWAS results, we conducted additional GWAS analyses with imputed sequence data for heifer livability and early first calving in 27,235 bulls that have highly accurate phenotypes. We used 3,148,506 imputed sequence SNPs as genotype and de-regressed PTAs as phenotype. After editing and filtering on reliability, we included 11,562 and 10,700 bulls for heifer livability and early calving, respectively. The QTL regions discovered in the large-sample GWAS were validated for both traits at the nominal significance level (Fig. 2A and B C, and 2D). Interestingly, sequence-level GWAS found some additional associations compared to low-density SNP data (Fig. 2B and D). As shown in Tables 2 and 16 SNPs passed the genome-wide threshold for heifer livability, and two SNPs passed the threshold for early calving. By checking the 1-Mb regions surrounding these associated SNPs, we identified many genes that were also detected in the large-sample GWAS, namely MOG, OR12D2E, OR12D3, OR2H1, OR5V1, OR5V1C, OR5V2, TRIM10, TRIM15 (Table S7).

Fig. 2
figure 2

GWAS results of heifer livability (n = 11,562) and early calving (n = 10,700) based on bulls with imputed sequence variants. (A) Quantile–quantile (QQ) plot for heifer livability. (B) Manhattan plot for heifer livability. (C) QQ plot for early calving. (D) Manhattan plot for early calving. The red horizontal lines correspond to the genome-wide significance threshold

Table 2 GWAS results based on imputed sequence variants for heifer livability (n = 11,562) and early calving (n = 10,700)

Functional enrichment analysis

We analyzed the enrichment of GWAS signals across SNPs in different functional genomic regions based on the 27 K bulls and imputed sequence data. We first categorized sequence variants into 14 groups based on the locations of 14 chromatin states reported previously [27], i.e., CTCF/Active_TSS, Active_TSS, CTCF/Promoter, Active_Promoter, Flanking_TSS, Promoter, Poised_Promoter, Active_Enhancer, CTCF/Enhancer, Primed_Enhancer, Active_Element, Insulator, Polycomb_Repressed, and Low_Signal. For heifer livability, we observed significant enrichment of variants in Active_Promoter, Promoter, CTCF/Enhancer, Primed_Enhancer, and Active_Element (Fig. 3). For early calving, we observed significant enrichment of associated variants in Active_TSS, Active_Element, and Insulator (Fig. 3).

We further investigated the enrichment of variants concerning their genomic locations (conserved) and genic annotations (CDS, intron, and UTR) inferred by SnpEff [28]. As a result, we observed significant enrichment of CDS and conserved variants in the GWAS results of both traits. For heifer livability, we observed enrichment of intron variants (Fig. 3). And for early calving, we observed significant enrichment of variants in the UTR regions (Fig. 3).

Fig. 3
figure 3

Enrichment of fine-mapping variants across functional annotations

Transcriptome-wide association study (TWAS)

TWAS seeks to identify trait-associated genes by testing for the association between a phenotype and the genetic components of gene expression levels [29]. By linking our GWAS results and existing transcriptome data from the CattleGTEx project [30] via a TWAS analysis, we detected four and 23 significant gene-trait association pairs for heifer livability and early calving, respectively (Fig. 4). Interestingly, we discovered six genes overlapped with 27 K bulls GWAS results for early calving in the Bovine MHC region, including two genes in lymph node tissue and one gene in blood, adipose, hypothalamus, and leukocyte (Table 3). In addition, the expression of OR12D2 in adipose was significantly associated with early calving (Table 3), consistent with previous findings that OR12D2 is linked with MHC [31].

Fig. 4
figure 4

Manhattan plots of transcriptome-wide association study (TWAS) for heifer livability and early calving

Table 3 Gene-trait association pairs detected by TWAS based on CattleGTEx database

Discussion

In this study, we identified genomic regions and candidate genes associated with heifer livability and early first calving using large-scale GWAS, functional enrichment analysis, and TWAS. We reported a major QTL in the bovine MHC region to be related to early first calving, suggesting a potential connection between the immune system and reproduction. TWAS using the CattleGTEx data confirmed the association and revealed several candidate genes in the bovine MHC locus.

Generally, health and fertility traits are expected to be complex traits with low heritability due to their multifactorial nature [32]. The two traits analyzed in this study, heifer livability and early first calving, also have low estimated heritability, 0.0223 and 0.0328, respectively. These two heifer traits can be more complex due to the unique characteristics of heifers. For instance, heifers are not yet fully developed and may not be ready for reproduction. With the large sample sizes used in this study, we only found one major QTL for early first calving but none for heifer livability. Still, when we explored the genome-wide enrichment of GWAS signals with functional genomic regions, we reported significant enrichment of the association signals in promoter and enhancer regions, indicating an exciting connection between the regulation of gene expression and these two complex traits.

The immune system plays a central role in protecting the body from pathogens and infections. Moreover, it has a role in fertility and reproduction. In females, the immune system is involved in the whole reproductive process, from the development and maturation of the egg to implantation and maintenance of the pregnancy [33]. The immune system must tolerate the developing fetus, which is genetically different from the mother, while still protecting against infections. If the immune system is overactive, it can cause infertility or miscarriage, while an underactive immune system can lead to increased susceptibility to infections and complications during pregnancy. These potential connections between the immune system and reproduction further support the MHC QTL and candidate genes with early first calving in heifers.

This study showcased the usefulness of functional genomics data in post-GWAS and fine-mapping studies in cattle. When the associated variants are located in non-coding or intergenic regions, functional genomics data like those from the FAANG [27] and CattleGTEx projects would be useful to provide information about the biological mechanisms underlying the associations. Integration of functional genomics data with GWAS may also boost the power of detection when the power of the original GWAS is limited. For instance, the TWAS results in this research provided additional evidence for the MHC QTL with early first calving.

Conclusion

Due to the complex genetic architecture of health and fertility traits, our large-scale GWAS analyses only detected a few major QTL for heifer livability and early first calving. Interestingly, the major QTL for early first calving is located in the bovine MHC region. This association was further supported by post-GWAS analyses and TWAS, indicating a connection between the immune system and early reproduction. Despite the low power for major QTL, we evaluated the distribution of GWAS signals across different functional genomic regions. We found significant enrichment in promoter and enhancer-related regions, which supports the contribution of gene regulation to the genetics of complex traits.

Methods

Data description

In this study, we conducted GWAS analyses with two datasets, a discovery dataset including 3,649,734 Holstein cattle (336,386 bulls and 3,313,348 cows) genotyped by various SNP chips and imputed to 79,060 SNPs and a fine-mapping dataset including 27,235 bulls genotyped by 50 K SNP chips and imputed to 3,148,506 sequence variants. The original SNP data of the discovery dataset were from multiple SNP chips with densities ranging from 3 to 50 K [34]. The CDCB and USDA AGIL laboratory routinely process the original genotype data and impute to 79 K common SNPs specifically selected for official evaluations using FindHap program [35]. For the discovery dataset, we applied PLINK 1.9 [36] to remove SNPs with call rates < 95%, minor allele frequencies (MAF) < 0.01, Hardy-Weinberg equilibrium (HWE) P < 10− 6, and to remove animals with > 5% missing genotypes. After this filtering, 73,554 SNPs and 3,520,002 animals (325,905 bulls and 3,194,097 cows) were retained for downstream analyses.

The phenotype data were part of the December 2021 genomic evaluations from the U.S. Council on Dairy Cattle Breeding (CDCB), which routinely calculates predicted transmitting ability (PTA) values for dairy cattle of multiple breeds. We only included Holstein data for this study. We used deregressed PTA values as phenotype in the GWAS of two traits, heifer livability and early first calving [37]. To ensure robustness and accuracy, we excluded animals with low reliability. The majority of filtered animals were young cows without any phenotypic records. Finally, the total number of animals used was 510,318 and 768,645 for heifer livability and early calving, respectively.

For the fine-mapping dataset, we obtained imputed sequence data of 27,235 bulls from previous studies [8]. Briefly, the imputation was conducted with FindHap v3 [35] and 444 Holstein bulls from the Run5 of 1000 Bull Genomes Project as reference. Stringent filtering and removal of intergenic SNPs resulted in an enriched set of 3,148,506 sequence variants. The imputation was highly accurate with an average percentage of consistent genotypes 96.7%. Similarly, we excluded animals with low reliability for deregressed PTA values, retaining 11,562 and 10,700 bulls for heifer livability and early calving, respectively. In this study, we only considered autosomal chromosomes BTA 1–29 from the Bos taurus ARS-UCD1.2 assembly [38].

GWAS analysis

We analyzed the discovery and fine-mapping datasets separately in the GWAS analysis. We performed the GWAS using a linear mixed model approach implemented in the SLEMM program [20]. SLEMM can handle large-scale (up to millions) genome-wide association studies while accounting for genomic relationships. In addition, SLEMM can model differences in the reliability between individual phenotypes using an error weight parameter to account for the variation of deregressed PTAs, which is calculated by 1/r2-1, where r2 is the reliability of deregressed PTAs.

After GWAS analysis, we retrieved genes within 1 Mb of the significant SNPs using BioMart in the Ensembl database (Ensembl Genes 106). We carried out Gene Ontology (GO) and Pathway analysis using KOBAS [24]. GO terms with a False Discovery Rate (FDR) less than 5% were considered statistically significant. Furthermore, we compared the regions within 1 Mb of the significant SNPs with our previous GWAS results [8] and the cattle QTLs in the Animal QTL database [25] to check if any associated genomic regions were previously reported.

Functional enrichment analysis with genome annotations

To evaluate the potential functions of the associated genomic regions, we explored the enrichment of GWAS results in different functional regions using the 27,235 bulls and imputed sequence variants. We performed enrichment analyses via MPH (MINQUE for Partitioning Heritability, https://github.com/jiang18/mph) with the annotations inferred by SnpEff [28] and 14 chromatin states from eight tissues reported by Kern et al. [27]. MPH is designed to partition SNP heritability with genotypes of related individuals or with long-spanning LDs. MPH is comparable to GREML in terms of accuracy, while being much faster and more memory efficient. It can do weighted analyses if residual variances are unequal and use many overlapping functional annotations. This approach included two steps: building genomic relationship matrices (GRMs) based on the different SNP annotation datasets, and partitioning SNP heritability accordingly. We set --min_maf and --min_hwe_pval as 0 and 1e-8 respectively. We calculated standard errors using the Delta method.

Transcriptome-wide association study (TWAS)

We performed TWAS analyses based on the 27 K bull data using S-PrediXcan [39] to link GWAS results with transcriptome data that we assembled in a previous study [30]. For the TWAS analyses, we used the CattleGTEx v.1 eQTL models [36]. For each trait, we imputed and harmonized GWAS summary statistics and then performed TWAS across 24 cattle tissues separately. We considered genes with P < 0.005 as suggestive significant.

Data Availability

The original genotype and phenotype data are owned by third parties and maintained by the Council on Dairy Cattle Breeding (CDCB). A request to CDCB is necessary for getting data access on research, which may be sent to: João Dürr, CDCB Chief Executive Officer (joao.durr@cdcb.us). All other data have been included in the manuscript and supplementary data.

References

  1. Moorey SE, Biase FH. Beef heifer fertility: importance of management practices and technological advancements. J Anim Sci Biotechnol. 2020;11(1):1–12.

    Article  Google Scholar 

  2. Wathes D, Pollott G, Johnson K, Richardson H, Cooke J. Heifer fertility and carry over consequences for life time production in dairy and beef cattle. Animal. 2014;8(s1):91–104.

    Article  PubMed  Google Scholar 

  3. Krpálková L, Cabrera V, Kvapilík J, Burdych J, Crump P. Associations between age at first calving, rearing average daily weight gain, herd milk yield and dairy herd production, reproduction, and profitability. J Dairy Sci. 2014;97(10):6573–82.

    Article  PubMed  Google Scholar 

  4. Wathes D, Brickell J, Bourne N, Swali A, Cheng Z. Factors influencing heifer survival and fertility on commercial dairy farms. animal 2008, 2(8):1135–1143.

  5. Kuhn M, Hutchison J, Wiggans G. Characterization of Holstein heifer fertility in the United States. J Dairy Sci. 2006;89(12):4907–20.

    Article  PubMed  CAS  Google Scholar 

  6. Zhang H, Wang Y, Chang Y, Luo H, Brito LF, Dong Y, Shi R, Wang Y, Dong G, Liu L. Mortality-culling rates of dairy calves and replacement heifers and its risk factors in Holstein cattle. Animals. 2019;9(10):730.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Freebern E, Santos DJ, Fang L, Jiang J, Parker Gaddis KL, Liu GE, VanRaden PM, Maltecca C, Cole JB, Ma L. GWAS and fine-mapping of livability and six Disease traits in Holstein cattle. BMC Genomics. 2020;21:1–11.

    Article  Google Scholar 

  8. Jiang J, Cole JB, Freebern E, Da Y, VanRaden PM, Ma L. Functional annotation and bayesian fine-mapping reveals candidate genes for important agronomic traits in Holstein bulls. Commun Biology. 2019;2(1):212.

    Article  Google Scholar 

  9. Nayeri S, Sargolzaei M, Abo-Ismail MK, May N, Miller SP, Schenkel F, Moore SS, Stothard P. Genome-wide association for milk production and female fertility traits in Canadian dairy Holstein cattle. BMC Genet. 2016;17(1):1–11.

    Article  Google Scholar 

  10. Tenghe A, Bouwman A, Berglund B, Strandberg E, de Koning D, Veerkamp R. Genome-wide association study for endocrine fertility traits using single nucleotide polymorphism arrays and sequence variants in dairy cattle. J Dairy Sci. 2016;99(7):5470–85.

    Article  PubMed  CAS  Google Scholar 

  11. Johnston D, Mukiibi R, Waters SM, McGee M, Surlis C, McClure JC, McClure MC, Todd CG, Earley B. Genome wide association study of passive immunity and Disease traits in beef-suckler and dairy calves on Irish farms. Sci Rep. 2020;10(1):1–10.

    Article  Google Scholar 

  12. Narayana SG, de Jong E, Schenkel FS, Fonseca PA, Chud TC, Powel D, Wachoski-Dark G, Ronksley PE, Miglior F, Orsel K. Underlying genetic architecture of resistance to mastitis in dairy cattle: a systematic review and gene prioritization analysis of genome-wide association studies. J Dairy Sci 2022.

  13. Neupane M, Hutchison J, Van Tassell C, VanRaden P. Genomic evaluation of dairy heifer livability. J Dairy Sci. 2021;104(8):8959–65.

    Article  PubMed  CAS  Google Scholar 

  14. Hutchison J, VanRaden P, Null D, Cole J, Bickhart D. Genomic evaluation of age at first calving. J Dairy Sci. 2017;100(8):6853–61.

    Article  PubMed  CAS  Google Scholar 

  15. Gulliksen S, Lie K, Løken T, Østerås O. Calf mortality in Norwegian dairy herds. J Dairy Sci. 2009;92(6):2782–95.

    Article  PubMed  CAS  Google Scholar 

  16. Fuerst-Waltl B, Sørensen M. Genetic analysis of calf and heifer losses in Danish holstein. J Dairy Sci. 2010;93(11):5436–42.

    Article  PubMed  CAS  Google Scholar 

  17. Weller JI, Gershoni M, Ezra E. Genetic and environmental analysis of female calf survival in the Israel Holstein cattle population. J Dairy Sci. 2021;104(3):3278–91.

    Article  PubMed  CAS  Google Scholar 

  18. Vergara O, Elzo M, Cerón-Muñoz M. Genetic parameters and genetic trends for age at first calving and calving interval in an Angus-Blanco Orejinegro-Zebu multibreed cattle population in Colombia. Livest Sci. 2009;126(1–3):318–22.

    Article  Google Scholar 

  19. Grossi D, Venturini G, Paz C, Bezerra L, Lôbo RB, Oliveira J, Munari D. Genetic associations between age at first calving and heifer body weight and scrotal circumference in Nelore cattle. J Anim Breed Genet. 2009;126(5):387–93.

    Article  PubMed  CAS  Google Scholar 

  20. Cheng J, Maltecca C, VanRaden PM, O’Connell JR, Ma L, Jiang J. SLEMM: million-scale genomic predictions with window-based SNP weighting. Bioinformatics. 2023;39(3):btad127.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  21. Ellis SA, Ballingall KT. Cattle MHC: evolution in action? Immunol Rev. 1999;167(1):159–68.

    Article  PubMed  CAS  Google Scholar 

  22. Nishimura S, Watanabe T, Mizoshita K, Tatsuda K, Fujita T, Watanabe N, Sugimoto Y, Takasuga A. Genome-wide association study identified three major QTL for carcass weight including the PLAG1-CHCHD7 QTN for stature in Japanese black cattle. BMC Genet. 2012;13:1–11.

    Article  CAS  Google Scholar 

  23. Xu Y, Harder KW, Huntington ND, Hibbs ML, Tarlinton DM. Lyn tyrosine kinase: accentuating the positive and the negative. Immunity. 2005;22(1):9–18.

    PubMed  Google Scholar 

  24. Xie C, Mao X, Huang J, Ding Y, Wu J, Dong S, Kong L, Gao G, Li C-Y, Wei L. KOBAS 2.0: a web server for annotation and identification of enriched pathways and Diseases. Nucleic Acids Res. 2011;39(suppl2):W316–22.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  25. Hu Z-L, Park CA, Reecy JM. Bringing the animal QTLdb and CorrDB into the future: meeting new challenges and providing updated services. Nucleic Acids Res. 2022;50(D1):D956–61.

    Article  PubMed  CAS  Google Scholar 

  26. Abrams ET, Miller EM. The roles of the immune system in women’s reproduction: evolutionary constraints and life history trade-offs. Am J Phys Anthropol. 2011;146(S53):134–54.

    Article  PubMed  Google Scholar 

  27. Kern C, Wang Y, Xu X, Pan Z, Halstead M, Chanthavixay G, Saelao P, Waters S, Xiang R, Chamberlain A. Functional annotations of three domestic animal genomes provide vital resources for comparative and agricultural research. Nat Commun. 2021;12(1):1821.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  28. Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6(2):80–92.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  29. Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BW, Jansen R, De Geus EJ, Boomsma DI, Wright FA. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet. 2016;48(3):245–52.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  30. Liu S, Gao Y, Canela-Xandri O, Wang S, Yu Y, Cai W, Li B, Xiang R, Chamberlain AJ, Pairo-Castineira E. A multi-tissue atlas of regulatory variants in cattle. Nat Genet. 2022;54(9):1438–47.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  31. Younger RM, Amadou C, Bethel G, Ehlers A, Lindahl KF, Forbes S, Horton R, Milne S, Mungall AJ, Trowsdale J. Characterization of clustered MHC-linked olfactory receptor genes in human and mouse. Genome Res. 2001;11(4):519–30.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  32. Ma L, Cole J, Da Y, VanRaden P. Symposium review: Genetics, genome-wide association study, and genetic improvement of dairy fertility traits. Journal of dairy science 2019, 102(4):3735–3743.

  33. Lee SK, Kim CJ, Kim D-J, Kang J-h. Immune cells in the female reproductive tract. Immune Netw. 2015;15(1):16–26.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Wiggans G, Cooper T, VanRaden P, Van Tassell C, Bickhart D, Sonstegard T. Increasing the number of single nucleotide polymorphisms used in genomic evaluation of dairy cattle. J Dairy Sci. 2016;99(6):4504–11.

    Article  PubMed  CAS  Google Scholar 

  35. VanRaden PM, Sun C, O’Connell JR. Fast imputation using medium or low-coverage sequence data. BMC Genet. 2015;16:1–12.

    Article  CAS  Google Scholar 

  36. Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 2015, 4(1):s13742-13015-10047-13748.

  37. Garrick DJ, Taylor JF, Fernando RL. Deregressing estimated breeding values and weighting information for genomic regression analyses. Genet Selection Evol. 2009;41:1–8.

    Article  Google Scholar 

  38. Rosen BD, Bickhart DM, Schnabel RD, Koren S, Elsik CG, Tseng E, Rowan TN, Low WY, Zimin A, Couldrey C. De novo assembly of the cattle reference genome with single-molecule sequencing. Gigascience. 2020;9(3):giaa021.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  39. Barbeira AN, Dickinson SP, Bonazzola R, Zheng J, Wheeler HE, Torres JM, Torstenson ES, Shah KP, Garcia T, Edwards TL. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun. 2018;9(1):1825.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank the Council on Dairy Cattle Breeding (CDCB; Bowie, MD), Cooperative Dairy DNA Repository (Verona, WI), and dairy industry contributors for providing data access.

Funding

This work was supported in part by the USDA National Institute of Food and Agriculture (NIFA) Agriculture and Food Research Initiative (AFRI) grant 2020-67015-31398 and 2021-67015-33409. This work was also supported in part by UDSA ARS appropriated projects 8042-31000-001-00-D, 8042-31000-002-00-D, and 8042-31310-078-00-D. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

Authors and Affiliations

Authors

Contributions

LM conceived the study. YG, AM, JJ, VI, JATV analyzed and interpreted data. YG and LM wrote the manuscript. JJ, MN, and GEL contributed tools and materials. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Li Ma.

Ethics declarations

Ethics approval and consent to participate

No live animals were used in this study, and Institutional Animal Care and Use Committee approval was not required.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gao, Y., Marceau, A., Iqbal, V. et al. Genome-wide association analysis of heifer livability and early first calving in Holstein cattle. BMC Genomics 24, 628 (2023). https://doi.org/10.1186/s12864-023-09736-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12864-023-09736-0

Keywords