Genomic structure and marker-derived gene networks for growth and meat quality traits of Brazilian Nelore beef cattle
- Maurício A. Mudadu†1, 2Email authorView ORCID ID profile,
- Laercio R. Porto-Neto†3,
- Fabiana B. Mokry†4,
- Polyana C. Tizioto4,
- Priscila S. N. Oliveira4,
- Rymer R. Tullio2,
- Renata T. Nassu2,
- Simone C. M. Niciura2,
- Patrícia Tholon2,
- Maurício M. Alencar2,
- Roberto H. Higa1,
- Antônio N. Rosa5,
- Gélson L. D. Feijó5,
- André L. J. Ferraz6,
- Luiz O. C. Silva5,
- Sérgio R. Medeiros5,
- Dante P. Lanna7,
- Michele L. Nascimento7,
- Amália S. Chaves7,
- Andrea R. D. L. Souza8,
- Irineu U. Packer7,
- Roberto A. A. TorresJr.5,
- Fabiane Siqueira5,
- Gerson B. Mourão7,
- Luiz L. Coutinho7,
- Antonio Reverter3 and
- Luciana C. A. Regitano2
© Mudadu et al. 2016
Received: 22 December 2015
Accepted: 25 February 2016
Published: 15 March 2016
The Erratum to this article has been published in BMC Genomics 2016 17:492
Nelore is the major beef cattle breed in Brazil with more than 130 million heads. Genome-wide association studies (GWAS) are often used to associate markers and genomic regions to growth and meat quality traits that can be used to assist selection programs. An alternative methodology to traditional GWAS that involves the construction of gene network interactions, derived from results of several GWAS is the AWM (Association Weight Matrices)/PCIT (Partial Correlation and Information Theory). With the aim of evaluating the genetic architecture of Brazilian Nelore cattle, we used high-density SNP genotyping data (~770,000 SNP) from 780 Nelore animals comprising 34 half-sibling families derived from highly disseminated and unrelated sires from across Brazil. The AWM/PCIT methodology was employed to evaluate the genes that participate in a series of eight phenotypes related to growth and meat quality obtained from this Nelore sample.
Our results indicate a lack of structuring between the individuals studied since principal component analyses were not able to differentiate families by its sires or by its ancestral lineages. The application of the AWM/PCIT methodology revealed a trio of transcription factors (comprising VDR, LHX9 and ZEB1) which in combination connected 66 genes through 359 edges and whose biological functions were inspected, some revealing to participate in biological growth processes in literature searches.
The diversity of the Nelore sample studied is not high enough to differentiate among families neither by sires nor by using the available ancestral lineage information. The gene networks constructed from the AWM/PCIT methodology were a useful alternative in characterizing genes and gene networks that were allegedly influential in growth and meat quality traits in Nelore cattle.
KeywordsGenotyping AWM PCIT GWAS
Brazilian livestock began with taurine cattle introduced from Europe in the colonial period, around 450 years ago. Several years later, starting in 1930 but mainly in 1960 and 1962, a Bos indicus cattle breed named Nelore was introduced in Brazil from India and the herd expanded to the majority of the territory due to its good adaptability to tropical climate. It is estimated that 7000 Nelore cattle were introduced from India, the majority were descendants from 6 main sires named Karvadi, Taj Mahal, Kurupathy, Golias, Godhavari and Rasta . The Nelore, a breed with white or gray pelage and short horns, is nowadays the most representative breed in Brazil and dominates the Brazilian beef cattle production with more than 75 % of the total herd which equates to more than 130 million heads of Nelore cattle. Artificial insemination is widely used in Brazil  and semen for only a few sires has been intensively used nationwide. With the advent of the Illumina BovineHD BeadChip (Illumina Inc., San Diego, CA), a high density SNP panel comprising over 770,000 markers, we were able to measure the genomic diversity of the Brazilian Nelore from the main commercial sire families and characterize their genomic relationship and genetic profiles across a series of phenotypes. Traditional phenotype-based selection has been focused primarily on improving growth and maternal traits . However, marker-assisted or genomic selection is particularly valuable for traits that have late, onerous and expensive measurement such as meat quality, yield and feed efficiency . Despite production advantages of Bos indicus composite breeds in tropical and subtropical environments, the inferiority of beef quality, especially tenderness, is a worrying issue for producers [5, 6]. It is known that Nelore breed show moderate heritability for meat quality  as well as for growth-related traits , which make the application of genomic tools feasible.
Genome-Wide Association Studies, or GWAS, are today a common practice in order to find associations of genome regions to phenotypes of interest such as meat quality traits [7, 9, 10]. However, GWAS are sometimes elusive regarding the low power of association of the markers to the trait and also to the association of regions of the genome that lack apparent relation to the trait being studied [11, 12]. Some meta-analysis approaches are appealing as they attempt to enrich GWAS analyses with information from other sources. One of these methods is the system’s biology AWM/PCIT methodology, which involves the construction of SNP-based gene network interactions, integrating results from several GWAS by means of the Association Weight Matrices (AWM) and Partial Correlation and Information Theory (PCIT) [13–16]. We took advantage of this methodology to explore growth and meat quality traits from 780 samples of Brazilian Nelore cattle, systematically sourced to represent 34 highly influential sires and their 746 half-sibs from across Brazil. Further, we inferred a gene network focused on growth and meat quality traits and found several genes implicated in the regulation of these phenotypes.
Genetic profile of Brazilian Nelore beef cattle
Genetic background of the sires
Number of Sibs
Lineage descendants (Sire + Sibs)
7 Indicus (23.33 %)
23 Taurus (76.66 %)
In addition to the PCA, we performed a hierarchical cluster analysis of the GRM and plotted the resulting heatmap (Fig. 1b). This analysis shows that the 34 families are well distinguished in the heatmap (grey squares in the central diagonal) and the sire relatedness to its sibs can be distinguished in some cases by a white cross inside the central squares (e.g. NE001710 family) or a white border line for some squares that follows the red lines in the upper palette representing the sires. In overall the heatmap is greyish than what would be expected taking into account that the sires were selected to be more unrelated with each other, suggesting that the half-sib families are genetically related. Wright’s F statistic (Fst) was applied to evaluate the degree of genetic differentiation among each sire with its progeny as one population (34 populations) and each lineage with its descendants as one population (17 populations) yielding values of Fst = 0.14 and Fst = 0.09, respectively, which can be considered a small level of genetic differentiation in this Nelore sample . The clustering analysis also put together the majority of the animals evidencing the relatedness of the families. Some darker grey shades can be seen within the progeny of individual NE001383, which is the individual that is more distant in the PCA analysis, showing that this sire and its half-sibs are more unrelated to all other individuals. Although this individual sire is from the Karvadi lineage and supposed to be related to the NE001394, NE001393, NE001360 and NE001385 (see Fig. 1c), in the PCA it is more distinguished from others. Overall, the lineage information obtained from pedigree was not useful to evidence any kind of relatedness between individuals with similar ancestry. For example, nor the PCA or the hierarchical clustering show any improvement in distinguishing related and unrelated animals regarding to its lineages (Additional file 1: Figure S1A and S1B). The hierarchical clustering also does not cluster individuals according to its ancestral lineages (see for example the cluster of individuals NE001383, NE001362 and NE001398, from the Karvadi, Taj Mahal and Godar Imp lineages). One explanation for the failure of these analyses to differentiate the half-sib families is the fact that the families do not have the same number of animals and the same goes for the lineage ancestry of the families (see Table 1). These distributions could affect the effectiveness of the GRM to provide enough information to generate a good differentiation in the PCA or hierarchical cluster analyses across lineages.
We obtained information of mitochondrial DNA (mtDNA) of 30 of the 34 sires from a previous study . Results show that 23 (76.66 %) of these have Bos taurus maternal ancestry (Table 1), which agrees with the hypothesis that most of the Brazilian Nelore herd was obtained by backcrossing with females of taurine origin . Although two of the most divergent sires, NE001398 and NE001362 were of indicine maternal ancestry, in overall the maternal ancestry seems not to have influenced the PCA results. For a better characterization of the genomic data under study we analyzed the haplotype structure of the population and estimated the extent of the linkage disequilibrium (LD), decay by physical genomic distance on autosomes, as the squared correlation coefficient (r 2) between SNP pairs. Additional file 2: Figure S2 shows that LD varies from r 2 = 0.58 between SNPs distance up to 1 kb for BTA5 to r 2 = 0.03 between SNPs distance of 500 kb to 1 Mb for BTA25. The average distance between SNPs for this LD study was 5.62 kb, with large gaps of 1.08 Mb on BTA7, and of 0.9 Mb and 0.84 Mb on BTA12 and on BTA27, respectively. At the distance of 10 kb to 25 kb, the overall average r 2 was ≈ 0.30, which is considered strong LD and can be used for QTL mapping , while at distance of 25 kb to 50 kb, the overall average r 2 was ≈ 0.20, which is sufficient to provide an accuracy of 0.85 for estimation of genomic breeding value . A summary of the haplotype block description is provided on Additional file 3: Table S1. From the SNPs considered for the LD and haplotype block study, 335,179 (75 %) were clustered into haplotype blocks constituted of 3 or more SNPs, spanning 1.35 MB (54 %) of the total autosomal genome size. There were 54,461 haplotype blocks with overall average length of 24.25 kb, BTA16 (882 kb) followed by BTA4 (783 kb), and BTA5 (669 kb) presented the longest haplotype blocks, the average number of SNPs into haplotype blocks was 6, with a maximum of 111 SNPs (BTA18). The haplotype block size and distribution along autosomes is provided in Additional file 4: Figure S3.
The effective population size (Ne) was estimated to be around 214 animals in average per chromosome, with a confidence interval of 13.5, one generation ago (assuming ≈ 7 years for each generation). This number is higher than the Ne obtained from pedigree records, varying from 98 to 68, calculated from past periods of time (from 1979 to 1998)  and from another study also using pedigree records, that shows Ne to be around 120 for a Brazilian Polled Nelore breed . We estimated Ne to be higher 10 generations ago (around 270 ± 16.86 animals, in average per chromosome), when the breed was introduced in Brazil. Ne shows a decreasing pattern over the last generations especially for the last two (see Additional file 5: Figure S4).
Gene networks for growth and meat quality traits
Trait data and GWAS results information
p ≤ 10−3
p ≤ 10−4
p ≤ 10−5
Estimates of inbreeding depression for meat quality traits. Estimates expressed as change in adjusted phenotype per 1 % increase, using a inbreeding coefficient derived from a genomic relationship matrix (FGRM), and a inbreeding coefficient derived from runs of homozygosity (FROH)
Genome-wide association studies
We did a literature survey of the top 10 genes associated to the lowest p-value TagSNPs for BFT, REA and TCW. For the later we searched for the involvement of the significant genes that could influence the total carcass weight phenotype. We found the gene Serotonin Receptor 2, (HTR2A) associated to the marker BovineHD1200005266 (BTA12, p = 1e-6) which plays a role in appetite control  and was found associated to birth weight in humans . We also found the Leukemia inhibitory factor receptor (LIFR) gene, associated to the marker BovineHD2000010215 (BTA20, p = 1.21e-6) which is involved in skeletal growth and bone formation and resorption in humans . For the BFT phenotype we searched for genes involved in studies related to adipose tissue and fat. We found the gene EPHA6 (ephrin receptor a6), associated to the marker BovineHD0100011825 (BTA1, p = 1.10e-6) found associated in a copy number variation (CNV) study to childhood obesity  and the gene TMEM182 (transmembrane protein 182), associated to the marker BovineHD1100002798 (BTA11, p = 1.5e-7) which is found up-regulated in brown adipose tissue during adipogenesis and myogenesis  and also found associated in another GWAS for BFT , discussed below. We found no specific genes related to REA in the literature, that could be influencing muscle growth and myogenesis, associated to any TagSNPs of the top 10 list. In a previous work from our group , GWAS were performed using a Bayesian approach, with a subset of the data used in this work (smaller number of animals and traits). We compared the set of SNP, associated with significance below 10−5 in our work, against the ones associated in this previous work, to try to find genes simultaneously associated in both studies. We found one common gene for LM, 37 common genes for LF, 15 common genes for BFT and 12 genes for REA (gene ids at Additional file 7: Spreadsheet S1). The accordance between both works for BFT, REA and LF were of 26 %, 38 % and 38 % respectively, of all genes with significance below 10−5. We could not estimate the accordance for the TCW trait as it was not used in this previous work.
To test for Gene Ontology (GO) enrichment, we ordered all TagSNPs according to the pleiotropy test  used. The ordered list of all corresponding genes was used as input to GOrilla  and results showed “cell adhesion” (GO:0007155, false discovery rate or FDR q-value = 2.8e-4), “biological adhesion” (GO:0022610, FDR q-value = 1.52e-4) and “developmental growth” (GO:0048589, FDR q-value = 1.29e-4) as the three GO terms from the biological process tree enriched with significant values after a FDR correction. Cell adhesion was also found significantly enriched for color of muscle and fat and for meat tenderness in our previous work . For the list of genes in each GO, check Additional file 8: Spreadsheet S2.
For a more holistic view of the genes influencing the eight phenotypes related to growth and meat quality we used the p-values and SNP additive effects (Z scored) obtained with the QXPAK.5 software to construct an AWM (see Additional file 9: Spreadsheet S3 for the list of SNP additive effects and p-values respectively used in the AWM). TCW was chosen as key phenotype for the AWM which was then used to calculate a pair-wise correlation matrix of all SNP effects. This correlation matrix was first filtered to maintain genes with correlation values equal or above 0.95. It was then used to run the PCIT methodology, which generated a gene network that consisted of 3371 genes (nodes) and 6961 gene relationships (edges) (data not shown). This is allegedly the network that contains genes more involved in growth traits because of the key phenotype (TCW), and also prone to be constituted of genes more involved with traits highly correlated to TCW, like REA and BFT. With less intensity, the network is supposed to contain genes involved with the other five phenotypes as they are less correlated to TCW. Additional file 10: Figure S6 shows a heatmap and hierarchical cluster obtained with the standardized SNP effects of the eight phenotypes that evidences that TCW, REA and BFT are more correlated than pH, CLO, LF, LM and DRE.
For visualization we used Cytoscape v3.1 and applied several visual filters to identify highly connected nodes. We suggest that the number of connections is a measure of the importance of the node to the function of the network as a whole. The assumption is that the more connections a gene has the more likely is its role regulating and influencing the function of other genes in the network. After applying the filters, the size of the node was changed to be proportional to its number of edges. The smaller the number of connections of a node, the smaller was the size of the node and vice-versa. This same rule is valid for the coloring: the more reddish, more connections and the more greenish, less connections. The obtained network shows in its center, a cluster of nodes with larger size and stronger red color intensity showing that these nodes are more interconnected and are probably more involved in key biological mechanisms underlying the phenotypes involved in the network (data not shown).
From the set of genes with direct edges to the trio of TF, there are some genes with remarkable evidence in literature relating them to growth traits, specially bone and muscle growth and adipogenesis. The ASB2 (ankyrin repeat and SOCS box containing 2), is a gene found to function in a negative regulation of muscle growth in salmon . The PROCR gene (protein C receptor, endothelial) was found to positively regulate other genes positively, in an osteoblastic cell line, functioning toward bone growth . The ALDH9A1 (aldehyde dehydrogenase 9 family, member A1), has a role on fetal growth in adult adipose tissue mass in bovine . CPEB4, or cytoplasmic polyadenylation element binding protein 4, is associated to human waist-hip ratio . The leucine-rich repeat LGI family, member 3, LGI3, is thought to have its function altered in obesity and to suppress adipogenesis . GPC6 (glypican 6) is implicated in bone growth . The FAM3C gene, or Family With Sequence Similarity 3, Member C was found associated to influence bone mineral density in humans . A gene, the IARS (isoleucyl-tRNA synthetase), was found related to perinatal weak calf syndrome, which involves intrauterine growth retardation and low birth weights . There are several other genes related to cell growth and proliferation, like ERGIC3 (ERGIC and golgi 3) , KHDRBS3 (KH domain containing, RNA binding, signal transduction associated 3) , CEP250 (centrosomal protein 250 kDa) , STIM2 (stromal interaction molecule 2) , HABP2 (hyaluronan binding protein 2) .
P-values of the SNPs assigned to the genes of the trio of TF (inside parentheses)
1.00E + 00
1.00E + 00
1.00E + 00
After the introduction of the Nelore breed in Brazil in the last century, there were some pedigree-based studies trying to characterize the founder lineages of the breed , but no genomic approach has been made since then. The introduction of the low and high density bovine genotyping panels, made possible the calculation and use of GRM matrices  in genomic similarity studies to create a genetic profile. Using only pedigree information from the 34 sires, the average inbreeding coefficient (FPED) was 3.23 % (± 3.9 SD) (calculation performed during experimental design, not shown), which is similar to the FGRM reported for sires, but lower than the FROH reported for sires. Comparing the inbreeding coefficients between sires and progeny, there is a clear decrease in inbreeding, which can only be attributed to the randomness of the genetic pool of the dams. According to recent literature, FGRM was determined as the optimal method for estimating genomic inbreeding coefficients as it can distinguish between markers that are IBS (identical by state) from markers that are IBD (identical by descendent) [61, 62]. The average FROH for either sires or progeny estimated in this study are higher than the ones reported for a Holstein population (3.8 ± 2.1 %) which have shown effects of inbreeding depression for traits in dairy cattle . Also, inbreeding depression appears not to have strongly affected the phenotypes (Table 3). This result agrees with the lower inbred estimation for the progeny compared to the sires. We were not able to cross test this hypothesis by estimating inbreeding depression for the traits among sires, as we did not collect their phenotypes, altough we would expect a stronger influence of inbreeding in this case.
The average r 2 values obtained in this study, considering SNPs with distance up to 150 kb apart, are higher than the values reported in the literature for Nelore and indicine breeds. Beyond 150 kb distance, the average r 2 values are similar to ones found in the literature [63, 64]. However, it is worth mentioning that it is difficult to make direct comparisons among LD studies, as LD levels vary due to many factors (sample size, marker type, marker density, and population history) . In this study, both maternal and paternal haplotypes were considered in contrast to studies that only consider maternal haplotypes, as it is our understanding that the Nelore population (and cattle production in general) presents higher influence from males than females and not considering paternal haplotypes would underestimate r 2 values. We believe that the backcrossing with taurine local breeds, evidenced by the mtDNA origin, could have influenced the values of LD and Ne estimated for this Nelore sample.
There are some reports in the literature considering haplotype block structure in cattle, but none was found for Nelore cattle. These reports stated that with the increase in marker density, there will be more haplotype blocks of smaller sizes which will respond for higher genome coverage [66–68]. A recent study using the high density SNP panel in crossbred cattle population reports 61 % of the genome covered by haplotype blocks, and a total of 78 % of SNPs clustered into haplotype blocks . The recent Ne found for Nelore in this study, is in agreement with reports of recent Ne for Holstein and Swiss Eringer breed varying from 80 to 150 individuals, and can go higher than 2000 animals for past Ne (about 2000 generations ago) [67, 70, 71]. Our results show that the diversity among sire families and lineages is not high enough to correctly separate those in a PCA, or hierarchical clustering analysis (Fig. 1). This is corroborated by the low Fst values obtained for sire families and lineages subpopulations. The uninformative values of Ne and Fst estimates were expected for the lineages as they do not represent isolated populations and should have been crosses among them.
Some of the traits used in this work were already analyzed for genome wide associations elsewhere using part of the data and with a different approach . Although the phenotype data is the same in both works, our sample size was larger and the software and algorithms used for association studies were different. We have not found larger accordance than 38 %, between the set of associated genes with significance of 1e-5 to BFT, REA and LM traits, studied in that work, although we could not verify this for TCW as this trait was not present in that work. However, the quality and consistency of the genotyping data were found reliable since the PCA against other breeds showed consistent results (Fig. 1a, inset). We also checked for the fluctuation of heterozygosity along the 224,969 TagSNPs and found reasonable smoothed values of heterozygosity fluctuating around 0.3 to 0.4 (Additional file 11: Figure S7). The traits correlations were also found reasonable (see Results). Furthermore some of the top TagSNPs that had lowest p-values in the GWAS, had the corresponding gene with functional biological correlation to the main growth traits, indicating that the GWAS was able to extract some of the signal out of the noise from the data. In addition, we also performed a gene enrichment test with GOrilla, after ordering the TagSNPs for the pleiotropic effect. The pleiotropic test is a methodology to try to rank genes that are most influencing the traits in the AWM. The results showed GO terms for biological and cell adhesion and developmental growth, which are again evidence that the terms are enriched for genes related mainly to growth which is the key phenotype of the AWM. Cell adhesion is a biological process that is also found significantly enriched in the work of Tizioto et al., 2013 . Cell adhesion is a process involved in the binding of a cell to a substrate which sustains its growth and therefore is intrinsically related to the growth process.
With a similar objective of the pleiotropy test, the construction of the gene networks with the AWM/PCIT [13, 35–37] is an alternative to analyze pure GWAS results as this methodology gathers the 8 GWAS results and constructs a gene network that influences the traits in a simultaneous way, but weighting accordingly to the correlations within the AWM. We decided to select TCW as the key trait for the AWM because of its reasonable heritability estimate and its congruent correlations to REA and BFT, together with the economic importance of growth traits for livestock. As the resulting network is too large (3371 nodes), we used a heuristic to divide the network into a sub-network in a way that it would be feasible to analyze and still gather most of the biological importance regarding these traits. This heuristic is implemented in an algorithm that selects the trio of TF that most span the network and its first degree nodes. As the coloring and shape of the nodes of the resulting sub-network are inherited from the original network, it can be noticed a majority of reddish and larger nodes (with more connections) indicating that the objective of the heuristic was achieved (Fig. 4). Also, a survey in the literature showed correlations between the trio of TF and several nodes from the sub-network to growth traits, suggesting that our methodology is correct. It is remarkable to note that some genes like VDR (which is the most connected TF from the network) was already associated to growth in a bovine GWAS  and many others have relations to the biology of growth, like proliferation and growth of muscle and bones and adipogenesis (e.g. LHX9, CPEB4, ASB2, and many others). Although comparing the results of single SNP association to the work of Tizioto et al. 2013 did not lead to more than 38 % of common associated genes, the genes from the sub-network are in 50 % of accordance with the genes associated to BFT and REA from that work. This suggests that the methodology that created this network was able to select genes that are more related to the traits involved with growth. We compared the genes from the sub-network to genes found in other studies [35–37] that also used the AWM/PCIT methodology and similar datasets (but different breeds). Although there were no great correlations, some genes in these studies were found to have similarities or to encode the same protein domains to some of the genes from the sub-network: metalloprotease and ankyrin domains (genes ADAMTS14 and ASB2 respectively), the RAB6B oncogene , the MER Proto-Oncogene Tyrosine Kinase (MERTK) and the SLC solute carrier gene family (SLC34A2) .
Taken together, these results are suggesting that the gene networks obtained are related to growth and meat quality traits and their genes should be thoroughly inspected to try to discover the biological mechanism beneath growth and meat quality phenotypes.
We used high density SNP panels to genotype 34 sires, and its half-sib families, totaling 780 animals. The sires were previously selected in order to represent most of the variability of the Nelore beef cattle in Brazil. We believe this is the first work that used a genomic approach in order to try to investigate the amount of diversity of the Brazilian Nelore breed by investigating its ancestral lineages. Results showed that the population studied was not structured enough in order to differentiate families using ancestral information of sires or lineages. Eight phenotypes related to growth and meat quality were used in whole genome association studies and the results were used to construct gene networks focused on growth, using the methodology of AWM/PCIT. Literature surveys showed that both the GWAS and the gene networks constructed had significant SNPs associated to genes related to growth in former studies like the VDR, LHX9, CPEB4, ASB2 and many others. These results suggests that the methodology used to construct the gene networks can be used as an alternative approach to standard GWAS, in order to reach for novel information and to try to understand the biological mechanisms and gene compositions that leads to complex phenotypes, like growth in beef cattle.
Pedigree and sire lineages
Aiming to select sires that represent the bulk of genetic variability within the Nelore breed in Brazil, we consulted the main insemination centers of the country, selecting the more commercially frequent Nelore sires (polled and horned), with semen value equal or inferior to R$50.0 (around US$25.0 at the time of the study) to represent the breeders preference. Afterwards, using information from the National Zootechnical File maintained by Embrapa Beef Cattle in partnership with ABCZ (Brazilian Association of Zebu Breeders) the pedigree of all animals were assembled and a relationship pedigree matrix was created (data not shown). The sires were selected by the following rules: having the most commercialized semen among the breeders association in Brazil and ii) being the most unrelated with each other according to the value of similarity from the pedigree relationship matrix.
A list of 34 sires was assembled representing almost all lineages of the Nelore breed in Brazil (see Table 1): Karvadi, Taj Mahal, Golias, Padhu-Akasamu, Kurupathy, NO, Godar, Godhavari, Mocho GR, Nagpur, Akasamu, Padhu, Lengruber (Mundo Novo), Visual, OB, IZ and IRCA. The sire lineage was surveyed by following information about the most ancestral sire from the paternal side. Our sample study was composed of 34 sires and half-sib families with 746 animals in a total of 780 animals. The 746 calves, sons of commercial cows (not pure Nelore), were born on five different ranches, where they were raised to around 21 months old, before allocation to individual or collective pens in a feedlot located in Sao Carlos, SP, Brazil; or in Campo Grande, MS, Brazil, in an interval of 3 years. The pedigree was visualized with Cytoscape version 3.1 .
The mtDNA genotype was characterized as taurine (GenBank AY526085) or indicine (GenBank AY126697) from total DNA by amplification of a 366 bp fragment of mtDNA 16S (rRNA) gene using allele-specific PCR. The complete methodology can be found in .
Where y is a vector with the values for a given phenotype, X and Z are incidence matrix, β is a vector for fixed effects composed of the first principal component used as co-variate, Slaughter Age also a co-variate, the contemporary group effect is represented by Origin * Year of Birth and Animal ID, representing the pedigree. g is a vector of additive genetic effects, normally distributed g ~ N(0, σ g 2 ). E represent the vector of residual effects, E ~ N(0, σ e 2 ).
Genotyping and quality control
All 34 sires and 746 half-sibs were genotyped with a high density SNP panel (Illumina Bovine HD SNP Chip). The DNA sample collection and the SNP chip genotyping were performed as previously described . We used quality control filters for minor allele frequency (5 %) and call rate for sample and SNPs (95 %). Filters were applied using PLINK  and Biocoductor/R [74, 75]. This filters yielded a dataset comprised of 449,203 SNPs which was used for linkage disequilibrium and haplotype block description using BEAGLE  for genotype phasing and missing genotype imputation and LDexplorer  for haplotype block recognition following Gabriel et al. 2002  algorithm, and for estimating effective population size (Ne) according to .
Genomic relationship matrix
The genomic relationship matrix (GRM) was constructed [60, 80] using the TagSNPs. The GRM was used to estimate genomic inbreeding coefficient (FGRM), estimated as the diagonal elements of the GRM , and to perform PCA analyzes with all 780 Nelore samples and the same TagSNPs for 46 samples of the Brahman breed, 36 samples of Hereford breed and 44 samples of Angus breed, genotyped with the same high density SNP panel from the Hapmap project .
Genome Wide Association Study (GWAS)
Where y ' is a vector containing the phenotype corrected for the contemporary group effect from the i-th animal to the j-th trait (check Table 2), X is the incidence matrix relating fixed effects in β' with observations in y' ij . β' is a vector for fixed effects composed of the first principal component used as co-variate, slaughter age also a co-variate, representing the pedigree. Z is the incidence matrix relating random additive polygenic effects in g with the observations in y' ij . g is a vector composed of the additive genetic effects, normally distributed g ~ N(0, Aσ g 2 ); were A represents the numerator relationship matrix derived from the pedigree. S k is the vector of genotypes for the k-th SNP across all animals. s jk is a vector that represents the additive effect of the k-th SNP on the j-th trait and E represents the vector of residual effects, E ~ N(0, σ e 2 ). QXPAK5 was run for the eight phenotypes using the TagSNP set. The p-values and additive genetic values for each SNP were obtained for each phenotype and used to construct the AWM matrix. The FDR (false discovery rate) was calculated using the formula of .
AWM/PCIT and gene networks
AWM is a methodology to explore the results obtained from several GWAS (many traits) and generate a matrix of genes (or SNPs) co-associated in a pair-wise manner to these traits. The rows in the AWM are composed of the genes (or SNPs) and the columns represent the traits. The AWM matrix is filled with the additive effects of the SNPs. AWM was used to select SNPs associated to the key trait (TCW) and also SNPs associated to several (three or more) other traits. This was done in order to keep SNPs that are important to “growth” in a holistic manner. Afterwards, a pair-wise Pearson’s correlation is calculated and the data is submitted to PCIT that is a network inference methodology used to construct the gene network. The AWM was constructed as described elsewhere . The correlation matrix was calculated for every entry of the AWM, using the standardized Z score (the additive SNP effect divided by the standard deviation) within the R environment. The correlation matrix was used as input for the PCIT package for R . A list of genes that correspond to each TagSNPs were obtained from Ensembl version 74 (http://www.ensembl.org), where genes were assigned to its nearest TagSNP. Gene networks generated by PCIT were visualized with Cytoscape version 3.1 . A transcription factor list was obtained from the Animal Transcription Factor Database  and the most connected trio of transcription factors from the gene network was obtained as described elsewhere .
Pleiotropy and gene enrichment
To test for pleiotropy we employed the approach described by . In brief, the effects of the 224,969 TagSNPs estimated from the GWAS were divided by their associated standard errors to obtain a t-value corresponding to the studentized SNP effects. A multi-trait test of the i-th TagSNP was performed by storing its studentized effects across the 8 traits in the 8 x 1 vector t i . Then, the quadratic form t i 'V − 1 t i , where V is the correlation matrix among the SNP effects, is distributed approximately as a chi-squared with 8 df under the null hypothesis that the TagSNP does not affect any of the traits . The TagSNPs where then ordered according to the results and the ordered list of corresponding genes were used as input to GOrilla .
Linkage disequilibrium and other analyses
where, c is the genetic distance between two SNP expressed in Morgans . Considering each SNP pairs located within 100Mbp of the same autosomal chromosomes, with physical distances between SNP converted to genetic distances by the simple assumption of 1 cM ~ 1 Mb were used for Ne estimation. Haploview was used for obtaining TagSNPs, which were used for deriving inbreeding coefficient by runs of homozygozity (FROH), calculated with PLINK, as described by . Heterozygosity changes along the 224,969 TagSNPs were calculated with PLINK and plotted in R using a smoothing function from the lokern package, one point for each 100 TagSNPs, as described elsewhere . The Wright’s F statistics (Fst) analyses were run with PLINK 1.9 (https://www.cog-genomics.org/plink2) using the TagSNPs of the 780 animals, separated: i) by lineage (17 subpopulations) and ii) by sire families (34 subpopulations).
All experimental procedures involving steers were approved by the Institutional Animal Care and Use Committee Guidelines from Brazilian Agricultural Research Corporation – EMBRAPA and sanctioned by the president of the committee, Dr. Rui Machado.
Availability of supporting data
The data set supporting the results of this article is included within the article (and its additional files). Genotypic data is available upon request depending on a signed declaration of exclusive research purpose.
Brazilian Association of Zebu Breeders
Association Weight Matrices
back fat thickness
false discovery rate
average genomic inbreeding coefficient from the GRM
average inbreeding coefficient derived from pedigree
inbreeding coefficient derived from runs of homozygosity
Wright’s f statistic
genomic relationship matrix
genome wide association studies
identical by descendent
identical by state
effective population size
principal component analysis
partial correlation and information theory
- r 2 :
squared correlation coefficient
rib eye area
single nucleotide polymorphism
total carcass weight
This study was conducted with funding from EMBRAPA (Macroprograma1, 01/2005) and FAPESP (process number 2012/23638-8). GBM, LLC, LCAR and MMA were granted CNPq fellowships. We thank Sean McWilliam, Marina R. S. Fortes, Edilson Guimaraes, Robson Rodrigues Santiago, Roselito F. da Silva, Fernando F. Cardoso, Flavia Aline Bressani, Wilson Malago Jr., Avelardo U. C. Ferreira, Michel E. B. Yamaguishi and Fabio D. Vieira for the help and technical assistance. The authors would like to acknowledge the collaborative efforts among EMBRAPA, University of Sao Paulo and CSIRO.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Magnabosco CU, et al. Catalogo de linhagens do germoplasma zebuino: raca Nelore, Documentos, 23. Brasilia: Embrapa-Cenargen; 1997. p. 1–52.Google Scholar
- Foote RH. The history of artificial insemination: Selected notes and notables. J Anim Sci. 2002;80(E-Suppl 2):1–10.Google Scholar
- Decker JE, et al. A novel analytical method, Birth Date Selection Mapping, detects response of the Angus (Bos taurus) genome to selection on complex traits. BMC Genomics. 2012;13:606.View ArticlePubMedPubMed CentralGoogle Scholar
- Bolormaa S, et al. Detection of chromosome segments of zebu and taurine origin and their effect on beef production and growth. J Anim Sci. 2011;89(7):2050–60.View ArticlePubMedGoogle Scholar
- O'Connor SF, et al. Genetic effects on beef tenderness in Bos indicus composite and Bos taurus cattle. J Anim Sci. 1997;75(7):1822–30.PubMedGoogle Scholar
- Johnson DD, et al. Effects of percentage Brahman and Angus breeding, age-season of feeding and slaughter end point on meat palatability and muscle characteristics. J Anim Sci. 1990;68(7):1980–6.PubMedGoogle Scholar
- Tizioto PC, et al. Genome scan for meat quality traits in Nelore beef cattle. Physiol Genomics. 2013;45(21):1012–20.View ArticlePubMedGoogle Scholar
- Utsunomiya YT, et al. Genome-wide association study for birth weight in Nellore cattle points to previously described orthologous genes affecting human and bovine height. BMC Genet. 2013;14:52.View ArticlePubMedPubMed CentralGoogle Scholar
- Kim Y, et al. Genome-wide association study reveals five nucleotide sequence variants for carcass traits in beef cattle. Anim Genet. 2011;42(4):361–5.View ArticlePubMedGoogle Scholar
- Bolormaa S, et al. A genome-wide association study of meat and carcass traits in Australian cattle. J Anim Sci. 2011;89(8):2297–309.View ArticlePubMedGoogle Scholar
- Barendse W. Haplotype analysis improved evidence for candidate genes for intramuscular fat percentage from a genome wide association study of cattle. PLoS One. 2011;6(12):e29601.View ArticlePubMedPubMed CentralGoogle Scholar
- Wang K, Li M, Hakonarson H. Analysing biological pathways in genome-wide association studies. Nat Rev Genet. 2010;11(12):843–54.View ArticlePubMedGoogle Scholar
- Fortes MR, et al. Association weight matrix for the genetic dissection of puberty in beef cattle. Proc Natl Acad Sci U S A. 2010;107(31):13642–7.View ArticlePubMedPubMed CentralGoogle Scholar
- Fortes MR, et al. Gene network analyses of first service conception in Brangus heifers: use of genome and trait associations, hypothalamic-transcriptome information, and transcription factors. J Anim Sci. 2012;90(9):2894–906.View ArticlePubMedGoogle Scholar
- Reverter A, Chan EK. Combining partial correlation and an information theory approach to the reversed engineering of gene co-expression networks. Bioinformatics. 2008;24(21):2491–7.View ArticlePubMedGoogle Scholar
- Watson-Haigh NS, Kadarmideen HN, Reverter A. PCIT: an R package for weighted gene co-expression networks based on partial correlation and information theory approaches. Bioinformatics. 2010;26(3):411–3.View ArticlePubMedGoogle Scholar
- Gibbs RA, et al. Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds. Science. 2009;324(5926):528–32.View ArticlePubMedGoogle Scholar
- Porto-Neto LR, et al. Genomic divergence of zebu and taurine cattle identified through high-density SNP genotyping. BMC Genomics. 2013;14:876.View ArticlePubMedPubMed CentralGoogle Scholar
- Paschou P, et al. PCA-correlated SNPs for structure identification in worldwide human populations. PLoS Genet. 2007;3(9):1672–86.View ArticlePubMedGoogle Scholar
- El Mousadik A, Petit RJ. High level of genetic differentiation for allelic richness among populations of the argan tree [Argania spinosa (L.) Skeels] endemic to Morocco. Theor Appl Genet. 1996;92(7):832–9.View ArticlePubMedGoogle Scholar
- Tizioto PC, et al. Caracterizacao do haplotipo mitocondrial de uma amostra de touros da raca nelore representativa das principais linhagens comercializadas no Brasil. Ciencia Animal. 2011;21(1):25–9.Google Scholar
- Meirelles FV, et al. Is the American Zebu really Bos indicus? Genet Mol Biol. 1999;22:543–6.View ArticleGoogle Scholar
- Ardlie KG, Kruglyak L, Seielstad M. Patterns of linkage disequilibrium in the human genome. Nat Rev Genet. 2002;3(4):299–309.View ArticlePubMedGoogle Scholar
- Meuwissen TH, Hayes BJ, Goddard ME. Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001;157(4):1819–29.PubMedPubMed CentralGoogle Scholar
- Faria FJ, et al. Pedigree analysis in the Brazilian Zebu breeds. J Anim Breed Genet. 2009;126(2):148–53.View ArticlePubMedGoogle Scholar
- Faria FJC, et al. Estrutura populacional da ra\cca Nelore Mocho. Arquivo Brasileiro de Medicina Veterin\'aria e Zootecnia. 2002;54:501–9.View ArticleGoogle Scholar
- Neumaier A, Groeneveld E. Restricted maximum likelihood estimation of covariances in sparse linear models. Genet Sel Evol. 1998;30(1):3–26.View ArticlePubMed CentralGoogle Scholar
- Perez-Enciso M, Misztal I. Qxpak.5: old mixed model solutions for new genomics problems. BMC Bioinformatics. 2011;12:202.View ArticlePubMedPubMed CentralGoogle Scholar
- Gasque G, et al. Small molecule drug screening in Drosophila identifies the 5HT2A receptor as a feeding modulation target. Sci Rep. 2013;3:srep02120.View ArticlePubMedPubMed CentralGoogle Scholar
- Sims NA, Johnson RW. Leukemia inhibitory factor: a paracrine mediator of bone metabolism. Growth Factors. 2012;30(2):76–87.View ArticlePubMedGoogle Scholar
- Glessner JT, et al. A genome-wide study reveals copy number variants exclusive to childhood obesity cases. Am J Hum Genet. 2010;87(5):661–6.View ArticlePubMedPubMed CentralGoogle Scholar
- Wu Y, Smas CM. Expression and regulation of transcript for the novel transmembrane protein Tmem182 in the adipocyte and muscle lineage. BMC Res Notes. 2008;1:85.View ArticlePubMedPubMed CentralGoogle Scholar
- Bolormaa S, et al. A multi-trait, meta-analysis for detecting pleiotropic polymorphisms for stature, fatness and reproduction in beef cattle. PLoS Genet. 2014;10(3):e1004198.View ArticlePubMedPubMed CentralGoogle Scholar
- Eden E, et al. GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists. BMC Bioinformatics. 2009;10:48.View ArticlePubMedPubMed CentralGoogle Scholar
- Ramayo-Caldas Y, et al. A marker-derived gene network reveals the regulatory role of PPARGC1A, HNF4G, and FOXP3 in intramuscular fat deposition of beef cattle. J Anim Sci. 2014;92(7):2832–45.View ArticlePubMedGoogle Scholar
- Widmann P, et al. A systems biology approach using metabolomic data reveals genes and pathways interacting to modulate divergent growth in cattle. BMC Genomics. 2013;14:798.View ArticlePubMedPubMed CentralGoogle Scholar
- Widmann P, et al. Systems biology analysis merging phenotype, metabolomic and genomic data identifies Non-SMC Condensin I Complex, Subunit G (NCAPG) and cellular maintenance processes as major contributors to genetic variability in bovine feed efficiency. PLoS One. 2015;10(4), e0124574.View ArticlePubMedPubMed CentralGoogle Scholar
- Reverter A, Fortes MR. Breeding and Genetics Symposium: building single nucleotide polymorphism-derived gene regulatory networks: Towards functional genomewide association studies. J Anim Sci. 2013;91(2):530–6.View ArticlePubMedGoogle Scholar
- Malloy PJ, et al. A unique insertion/duplication in the VDR gene that truncates the VDR causing hereditary 1,25-dihydroxyvitamin D-resistant rickets without alopecia. Arch Biochem Biophys. 2007;460(2):285–92.View ArticlePubMedGoogle Scholar
- Yamamoto Y, et al. Vitamin D receptor in osteoblasts is a negative regulator of bone mass control. Endocrinology. 2013;154(3):1008–20.View ArticlePubMedGoogle Scholar
- Gao Y, et al. Two novel SNPs in the coding region of bovine VDR gene and their associations with growth traits. J Genet. 2013;92(2):e53–9.PubMedGoogle Scholar
- Birk OS, et al. The LIM homeobox gene Lhx9 is essential for mouse gonad formation. Nature. 2000;403(6772):909–13.View ArticlePubMedGoogle Scholar
- Retaux S, et al. Lhx9: a novel LIM-homeodomain gene expressed in the developing forebrain. J Neurosci. 1999;19(2):783–93.PubMedGoogle Scholar
- Tzchori I, et al. LIM homeobox transcription factors integrate signaling events that control three-dimensional limb patterning and growth. Development. 2009;136(8):1375–85.View ArticlePubMedPubMed CentralGoogle Scholar
- Siles L, et al. ZEB1 imposes a temporary stage-dependent inhibition of muscle gene expression and differentiation via CtBP-mediated transcriptional repression. Mol Cell Biol. 2013;33(7):1368–82.View ArticlePubMedPubMed CentralGoogle Scholar
- Bower NI, Johnston IA. Discovery and characterization of nutritionally regulated genes associated with muscle growth in Atlantic salmon. Physiol Genomics. 2010;42A(2):114–30.View ArticlePubMedPubMed CentralGoogle Scholar
- Lee YJ, et al. Activated protein C differentially regulates both viability and differentiation of osteoblasts mediated by bisphosphonates. Exp Mol Med. 2013;45:e9.View ArticlePubMedPubMed CentralGoogle Scholar
- Taga H, et al. Cellular and molecular large-scale features of fetal adipose tissue: is bovine perirenal adipose tissue brown? J Cell Physiol. 2012;227(4):1688–700.View ArticlePubMedGoogle Scholar
- Heid IM, et al. Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution. Nat Genet. 2010;42(11):949–60.View ArticlePubMedPubMed CentralGoogle Scholar
- Kim HA, et al. Leucine-rich glioma inactivated 3 regulates adipogenesis through ADAM23. Biochim Biophys Acta. 2012;1821(6):914–22.View ArticlePubMedGoogle Scholar
- Dwivedi PP, Lam N, Powell BC. Boning up on glypicans-opportunities for new insights into bone biology. Cell Biochem Funct. 2013;31(2):91–114.View ArticlePubMedGoogle Scholar
- Zhang LS, et al. A follow-up association study of two genetic variants for bone mineral density variation in Caucasians. Osteoporos Int. 2012;23(7):1867–75.View ArticlePubMedGoogle Scholar
- Hirano T, et al. Mapping and exome sequencing identifies a mutation in the IARS gene as the cause of hereditary perinatal weak calf syndrome. PLoS One. 2013;8(5):e64036.View ArticlePubMedPubMed CentralGoogle Scholar
- Nishikawa M, et al. Identification and characterization of endoplasmic reticulum-associated protein, ERp43. Gene. 2007;386(1-2):42–51.View ArticlePubMedGoogle Scholar
- Baudet ML, et al. Growth hormone action in the developing neural retina: a proteomic analysis. Proteomics. 2008;8(2):389–401.View ArticlePubMedGoogle Scholar
- Kumar A, et al. CEP proteins: the knights of centrosome dynasty. Protoplasma. 2013;250(5):965–83.View ArticlePubMedGoogle Scholar
- Mancarella S, et al. Targeted STIM deletion impairs calcium homeostasis, NFAT activation, and growth of smooth muscle. FASEB J. 2013;27(3):893–906.View ArticlePubMedPubMed CentralGoogle Scholar
- Kannemeier C, et al. Factor VII-activating protease (FSAP) inhibits growth factor-mediated cell proliferation and migration of vascular smooth muscle cells. FASEB J. 2004;18(6):728–30.PubMedGoogle Scholar
- Sun CX, Robb VA, Gutmann DH. Protein 4.1 tumor suppressors: getting a FERM grip on growth regulation. J Cell Sci. 2002;115(Pt 21):3991–4000.View ArticlePubMedGoogle Scholar
- VanRaden PM. Efficient methods to compute genomic predictions. J Dairy Sci. 2008;91(11):4414–23.View ArticlePubMedGoogle Scholar
- Keller MC, Visscher PM, Goddard ME. Quantification of inbreeding due to distant ancestors and its detection using dense single nucleotide polymorphism data. Genetics. 2011;189(1):237–49.View ArticlePubMedPubMed CentralGoogle Scholar
- Bjelland DW, et al. Evaluation of inbreeding depression in Holstein cattle using whole-genome SNP markers and alternative measures of genomic inbreeding. J Dairy Sci. 2013;96(7):4697–706.View ArticlePubMedGoogle Scholar
- Espigolan R, et al. Study of whole genome linkage disequilibrium in Nellore cattle. BMC Genomics. 2013;14:305.View ArticlePubMedPubMed CentralGoogle Scholar
- Porto-Neto LR, Kijas JW, Reverter A. The extent of linkage disequilibrium in beef cattle breeds using high-density SNP genotypes. Genet Sel Evol. 2014;46:22.View ArticlePubMedPubMed CentralGoogle Scholar
- Pritchard JK, Przeworski M. Linkage disequilibrium in humans: models and data. Am J Hum Genet. 2001;69(1):1–14.View ArticlePubMedPubMed CentralGoogle Scholar
- Villa-Angulo R et al. High-resolution haplotype block structure in the cattle genome. BMC Genet. 2009;10:19.View ArticlePubMedPubMed CentralGoogle Scholar
- Qanbari S, et al. The pattern of linkage disequilibrium in German Holstein cattle. Anim Genet. 2010;41(4):346–56.PubMedGoogle Scholar
- Lu D, et al. Linkage disequilibrium in Angus, Charolais, and Crossbred beef cattle. Front Genet. 2012;3:152.View ArticlePubMedPubMed CentralGoogle Scholar
- Mokry FB, et al. Linkage disequilibrium and haplotype block structure in a composite beef cattle breed. BMC Genomics. 2014;15 Suppl 7:S6.View ArticlePubMedPubMed CentralGoogle Scholar
- Goddard ME, et al. Can the same genetic markers be used in multiple breeds? In: Proc. 8th World Congr. Genet. Appl. Livest. Prod., Belo Horizonte, Brazil. 2006. CD-ROM communication no. 22-16.Google Scholar
- Flury C, et al. Effective population size of an indigenous Swiss cattle breed estimated from linkage disequilibrium. J Anim Breed Genet. 2010;127(5):339–47.View ArticlePubMedGoogle Scholar
- Shannon P, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504.View ArticlePubMedPubMed CentralGoogle Scholar
- Purcell S, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75.View ArticlePubMedPubMed CentralGoogle Scholar
- Clayton D. snpStats: SnpMatrix and XSnpMatrix classes and methods. 2013.Google Scholar
- Higa RH, et al. An R script for quality control in genome-wide association studies, in Proc. X-Meeting 2011 7th International Conference of the Brazilian Association for Bioinformatics and Computational Biology (AB3C), Florian\'opolis, Brazil. 2011. Online Abstract #32.Google Scholar
- Browning SR, Browning BL. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet. 2007;81(5):1084–97.View ArticlePubMedPubMed CentralGoogle Scholar
- Taliun D, Gamper J, Pattaro C. Efficient haplotype block recognition of very long and dense genetic sequences. BMC Bioinformatics. 2014;15:10.View ArticlePubMedPubMed CentralGoogle Scholar
- Gabriel SB, et al. The structure of haplotype blocks in the human genome. Science. 2002;296(5576):2225–9.View ArticlePubMedGoogle Scholar
- Badke YM, et al. Estimation of linkage disequilibrium in four US pig breeds. BMC Genomics. 2012;13:24.View ArticlePubMedPubMed CentralGoogle Scholar
- Porto-Neto LR, et al. Detection of signatures of selection using Fst. Methods Mol Biol. 2013;1019:423–36.View ArticlePubMedGoogle Scholar
- Price AL, et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006;38(8):904–9.View ArticlePubMedGoogle Scholar
- Reverter A, Fortes MRS. Association weight matrix: a network-based approach towards functional genome-wide association studies. Methods Mol Biol. 2013;1019:437–47.View ArticlePubMedGoogle Scholar
- Zhang HM et al. AnimalTFDB: a comprehensive animal transcription factor database. Nucleic Acids Res. 2012;40(Database issue):D144–9.View ArticlePubMedGoogle Scholar
- Sved JA. Linkage disequilibrium and homozygosity of chromosome segments in finite populations. Theor Popul Biol. 1971;2(2):125–41.View ArticlePubMedGoogle Scholar