- Research article
- Open Access
Genomic insights into body size evolution in Carnivora support Peto’s paradox
BMC Genomics volume 22, Article number: 429 (2021)
The range of body sizes in Carnivora is unparalleled in any other mammalian order—the heaviest species is 130,000 times heavier than the lightest and the longest species is 50 times longer than the shortest. However, the molecular mechanisms underlying these huge differences in body size have not been explored.
Herein, we performed a comparative genomics analysis of 20 carnivores to explore the evolutionary basis of the order’s great variations in body size. Phylogenetic generalized least squares (PGLS) revealed that 337 genes were significantly related to both head body length and body mass; these genes were defined as body size associated genes (BSAGs). Fourteen positively-related BSAGs were found to be associated with obesity, and three of these were under rapid evolution in the extremely large carnivores, suggesting that these obesity-related BSAGs might have driven the body size expansion in carnivores. Interestingly, 100 BSAGs were statistically significantly enriched in cancer control in carnivores, and 15 of which were found to be under rapid evolution in extremely large carnivores. These results suggested that large carnivores might have evolved an effective mechanism to resist cancer, which could be regarded as molecular evidence to support Peto’s paradox. For small carnivores, we identified 15 rapidly evolving genes and found six genes with fixed amino acid changes that were reported to reduce body size.
This study brings new insights into the molecular mechanisms that drove the diversifying evolution of body size in carnivores, and provides new target genes for exploring the mysteries of body size evolution in mammals.
Carnivores, or the mammalian order Carnivora, feed primarily or exclusively on animal matter. They represent a highly diverse and successful group of mammals, and are at the top of the food chain. The order contains a total of 280 species in 11 families , which are widely distributed all over the world, covering most of the major land masses, rivers, and all of the oceans. Carnivores are well-known for their dietetic preferences, carnassial dentition, skull shape and body size .
Body size is closely related to factors such as habitat, life history, metabolism and risk of extinction . Previous studies revealed that the mass-specific basal metabolic rate of carnivores decreased with increasing body mass [4,5,6]. In terms of feeding habits, prey size and diversity increase with body size in predatory carnivores. Interestingly, a statistical analysis showed that, among terrestrial carnivores, the herbivorous species are relatively large, while the insectivorous species are relatively small . A typical case, the Ursidae, may have increased in size due to their diet, which includes a high proportion of fruits and vegetation . Importantly, the range of carnivore body size is unparalleled in any other mammalian orders [9, 10]. The largest carnivore—the male southern elephant seal (Mirounga leonina)—is more than 4,000 kg in body mass and over 5.8 m in length [11, 12], whereas the smallest carnivore—the least weasel (Mustela nivalis)—is only 29 g in body mass and 0.114 m in length [13, 14]. The difference between these two species is huge—over 130,000-fold in body mass and 50-fold in length—making the carnivores a good target for investigating the mechanism of mammalian body size evolution.
The formation of these highly discrepant body sizes of carnivores is probably a manifestation of adapting to their respective niche—i.e. each size has its own ecological advantages. Small carnivores have better reproductive efficiency, access to a wider variety of food and a greater ability to respond to environmental emergencies than do larger carnivores [15,16,17]. For example, small carnivores in Mustelidae and Viverridae adapted to exploit small rodent prey and invertebrates. Their smaller body allows them to move swiftly enough to follow and pounce on prey and be inconspicuous when hunting in open vegetation .
In contrast, a large body size can also bring a multitude of benefits, including the ability to exploit vast food resources, increased competitiveness, increased defense against predation, and extended longevity [18, 19]. However, a larger body size means more cells, which in turn theoretically means a higher risk of cancer, assuming that each cell has an equal risk of mutating . Notably, some extremely large carnivores such as the walrus (Odobenus rosmarus) and polar bear (Ursus maritimus), which can live for more than 40 years, were not found to have a higher risk of cancer than smaller species in Mustelidae, which have a lifespan of only several years . This phenomenon is well-known as Peto’s paradox [20, 22]. Their highly variable body sizes make carnivores an excellent group for testing Peto’s paradox at the molecular level.
Until now, the molecular mechanisms regulating the body size of carnivores remain poorly explored. Previous studies mainly focused on the intraspecific variation in body size, especially in the domestic dog (Canis lupus familiaris). A recent study showed that variants in IGF1, COL11A2, ITGA10 and ADAMTS17 contributed to height and segregate within specific dog breeds . The constantly updated high-quality genomes of carnivores provide new opportunities for studying the mechanism behind the huge variations in body size among carnivores. In the present study, a comparative genomic analysis was performed on 20 high-quality carnivore genomes. First, phylogenetic generalized least squares (PGLS) methods were used to scan for body-size-associated genes (BSAGs). Then, we determined the rapidly evolving genes (REGs) in small or extremely large carnivores and identified fixed amino acid changes in different body size groups. Finally, we tested whether selective pressure variation on cancer-related BSAGs among different carnivores partly verify Peto’s paradox. Since this study focused on a group of wild animals including many threatened or endangered species, and it is not possible or practical to collect fresh tissue samples. Therefore, no differentially expressed gene-related analysis was performed in this study. Using the results of from the above analyses, we hope to provide some novel insights into the molecular mechanism behind body size evolution in carnivores and mammals.
Genome-scanning of BSAGs and functional enrichment
A total of 6,667 one-to-one orthologous genes were identified in the genomes of the 20 carnivores and one cow (Fig. 1; Table S1) using Orthofinder and our in-house Perl scripts. PGLS revealed that 1,132 and 668 genes were significantly associated with head body length and body mass, respectively. The two-step calibration procedure (P value.all/robust/max < 0.05) revealed that 337 of these genes were significantly related to both head body length and body mass; these were defined as body-size-associated genes (BSAGs; Table S2). According to the tendency or the correlation slope, 256 genes showed a positive correlation and 81 genes exhibited a negative correlation.
Of the 256 positively-correlated BSAGs, functional enrichment analyses revealed that 62.9 % (161/256) were significantly enriched (P value < 0.05) in 164 GO terms (Table S3). 28.6 % (46/161) were annotated with metabolic processes, such as “NADP metabolic process,” “glycoprotein metabolic process” and “cellular amino acid metabolic process” (Fig. 2). 32.3 % (52/161) were significantly enriched in GO categories associated with growth and development, including “positive regulation of growth,” “cardiac chamber development,” “positive regulation of nervous system development” and “epidermis development” (Fig. 2). For instance, seven genes (ADAM10, DBN1, NTRK3, PPIB, MAP2K5, WNT2 and ZFPM2) that play key roles in maintaining normal organ or body development were enriched in the GO term “positive regulation of growth.” However, due to the scattered functions of these BSAGs, there were only 21 (8.2 %) genes significantly enriched in KEGG pathways (Table S3)—e.g. “cytokine-cytokine receptor interaction” and “fanconi anemia pathway.”
Of the negatively-correlated BSAGs, 29.6 % (24/81) genes were classified into GO categories such as “telomere maintenance,” “DNA repair,” “negative regulation of DNA metabolic process,” “negative regulation of cell activation” and “apoptotic signaling pathway” (Fig. 2; Table S3). Mapping these negatively-correlated genes onto the KEGG database did not yield any significantly-enriched terms.
We then manually looked up the biological functions of these BSAGs by searching the literature and multiple databases and identified 14 positively-correlated BSAGs (BRAP, CHCHD5, CPT1C, GPR1, LDLR, MAP2K5, PLEKHS1, SLC30A8, ST3GAL2, STX16, ZFHX3, ZGRF1, ZNF395 and ZPLD1) that were associated with “obesity” (Table S4), which is a manifestation of an enlarged-body-sized phenotype. For instance, a significant positive association between log (root−to−tip ω) and log (body mass) was tested in BRAP, STX16, ZGRF1 and ZPLD1 (Fig. 3), and all four genes were reported to cause obesity in humans. Furthermore, we also found 100 BSAGs that were associated with cancer control, including “DNA repair,” “cell cycle control, apoptosis, adhesion and autophagy” and “immune response” (Table S5). Among these, a total of 21 BSAGs (ADAM11, APC, BRCA2, CDH11, CERS2, DSC3, DTWD1, EPHB6, ERCC3, ERCC4, FANCC, HELQ, HRG, ING1, INTS6, POU6F2, STAG1, TEP1, TET1, TRMT2A and ZFHX3) were identified to be tumor suppressors according to the literature or CGC database (Table 1). In addition, 18 genes were related to immunocyte immune response, development and maturation—e.g. ITK and TNFRSF17 (Table S5).
REGs in different body sizes groups
“Branch model” implemented in Codeml of the program PAML 4.9e was used to identify rapidly evolving genes (REGs) in the small- and extremely large-body-sized groups (hereafter, any reference to a “small” or “large” group refers to body size). These results showed that divergent selective pressure might have acted on carnivores with contrasting body sizes.
In the small group, a total of 15 BSAGs were found to be under rapid evolution (Table S6). Among them, CTLA4 (cytotoxic T-lymphocyte associated protein 4) is related to lower body mass index in humans, which fits into the small phenotype category. In the end, after false discovery rate (FDR) correction, only five REGs were still significant: MAS1, CATSPERG, YTHDC2, SLC25A28 and ADGRF2 (Table S6; Table S7).
By contrast, 60 REGs were placed in the extremely large group (Table S6). Fifteen genes were significantly enriched in growth and development (such as nerve, muscle and cardiac development) or phenotypic changes in body size (ATP8B1, DIS3, POMGNT1, SLITRK5, ST3GAL2, TENM3, ZGRF1 and ZPLD1). Additionally, fifteen genes were associated with cancer control (Table S5); among these, three were related to immunity (MAGT1, RFXANK and SKAP2), two (ADGRL3 and TENM3) were related to cell adhesion, and two (ADAM11 and TEP1) were found to be tumor suppressor genes. Finally, 21 REGs maintained significance after correction for multiple testing using FDR (Table S6; Table S7).
Fixed amino acid changes in the extremely small group
Identifying fixed amino acid changes in a certain group may help explain the molecular mechanism behind the occurrence of a specific phenotype. In the present study, we identified six fixed amino acid changes in six genes in extremely small carnivores (CDC7, ENG, LIG4, MMP2, POLE and TSPAN8; Fig. 4), but none in the small or extremely large carnivores. These sites were located in the functional domains of their respective proteins identified by Pfam. For instance, a unique change, S513L, was located in the protein kinase domain of CDC7, and another mutation (S784P) was found in the DNA ligase IV domain of LIG4; both of these genes were associated with the reduced-body-sized phenotype in humans or mice.
Obesity-related genes contributing to increasing body size in carnivores
In Carnivora, some species in Pinnipedia and Fissipedia have evolved a relatively large body size, with some extremely large species weighing more than 350 kg, such as the walrus, northern sea lion, Weddell seal and polar bear. Due to their semi-aquatic life habits, species in Pinnipedia had evolved a large body size, which leads to an increased body surface area and reduction in rapid heat loss in water . Similarly, the polar bear is the largest extant bear to adapt to the cold Arctic regions, weighing 372 kg on average . Interestingly, polar bears and seals have relatively thick subcutaneous fat, which accounts for more than 30 % of their body weights, much more than that of other wild carnivores [33,34,35]. It has been suggested that the thick layer of fat covering pinnipeds and polar bears is an adaptation to the cold. In general, obesity refers to certain degree of overweight and a thick fat layer, which is caused by excessive fat accumulation ; a percent body fat ≥ 25 % for men and ≥ 30 % for women is an indicator of obesity .
In this study, we used PGLS to detect subtle variations among carnivores, as well as correlations between genes and phenotypes that are consistent across their phylogeny . The root-to-tip dN/dS method has been proved to be a powerful tool to detect gene-phenotype associations because it is more inclusive of evolutionary history of a locus and is therefore more suitable for regressions against phenotypic data from extant carnivores [38,39,40]. Thus, we could partly resolve the evolutionary mechanism of body size variation and identify key candidate genes that influence body size changes in Carnivora. Consequently, we found that variations in 14 positively-correlated BSAGs (BRAP, CHCHD5, CPT1C, GPR1, LDLR, MAP2K5, PLEKHS1, SLC30A8, ST3GAL2, STX16, ZFHX3, ZGRF1, ZNF395 and ZPLD1) were associated with “obesity” (Table S4). For instance, SNPs in BRAP (BRCA1 associated protein) were shown to associate with obesity and other metabolic abnormalities . STX16 (syntaxin 16) encoded a protein that is a member of the syntaxin or t-SNARE family, and deletion of this gene caused obesity and macrosomia in humans . Our results revealed that MAP2K5 (mitogen-activated protein kinase kinase 5) was enriched in the GO cluster “positive regulation of growth (GO:0045927),” and genetic variations in this gene were reported to cause childhood obesity . Furthermore, the evolutionary rates of 14 obesity-related BASGs in large carnivores were higher than those of small ones (Fig. 3). In particular, three obesity-related BSAGs (ST3GAL2, ZGRF1 and ZPLD1) were determined to have undergone rapid evolution in extremely large carnivores, although this was not significant in any of the three after FDR correction (Table S6). For instance, the evolutionary rate of ZPLD1 (zona pellucida-like domain containing 1) is 0.68632 in the extremely large carnivores, 5.5 times that identified in the control group. Deletions in ZPLD1 were proven to contribute to the genetic susceptibility of common childhood obesity . Mice lacking ST3GAL2 (ST3 beta-galactoside alpha-2,3-sialytransferase 2) have 50 % more fat mass and 9 % more lean body mass than the control . ZGRF1 (Zinc finger GRF-type containing 1) encodes a protein that contains GRF zinc finger (zf-GRF) and transmembrane domains, and a recent genome-wide and exome chip association study revealed that the gene is associated with adiposity . Thus, the 14 obesity-related BSAGs identified in this study may contribute to increased body size and accumulated body fat in large carnivores.
It is worth noting that some genes may not show correlation between evolutionary rates and body size across the entire Carnivora phylogeny but instead in some specific lineages within Carnivora. These genes may have been overlooked in the present analysis. Further analysis should be performed in the future to focus on some specific lineages with significant body size variation, which will reveal more mechanisms underlying body size evolution of Carnivora.
Molecular evidence for Peto’s paradox in carnivores?
Animal gigantism is a recurring phenomenon that seems to be influenced by resource availability and natural selection . Being larger brings organisms an array of advantages, but it also brings biological tradeoffs, including the increased risk of developing cancer due to having more cells . Surprisingly, some studies found that empirical cancer rates do not vary with body size, and large and long-lived animals actually have a lower risk of getting cancer than do smaller, shorter-lived animals; this phenomenon is called Peto’s paradox [20, 22, 48]. In recent years, Peto’s paradox has been studied in many large mammals, such as the bowhead whale and humpback whale [49, 50]. Their genomes provided the following major pieces of evidence related to cancer suppression: (1) multiple duplications of tumor suppressor genes; (2) positive selection in genes related to cancer and aging. It was thus suggested that large species might have evolved multiple mechanisms to suppress cancer.
Carnivores are relatively long-lived mammals, and, generally speaking, species in Pinnipedia have a longer lifespan than do those in Fissipedia. Some extremely large species of carnivores—e.g. walruses and polar bears—were reported to live 40 or more years in the wild . Within certain carnivore taxa, body size and lifespan also seem to be positively-correlated . For instance, sea otters (about 27.4 kg) may live as many as 27 years in the wild, whereas ferrets (about 975.6 g) live a much shorter time—about 11.1 years . The relatively large polar bears (maximum lifespan: 43.8 years) and brown bears (maximum lifespan: 40 years) also live longer than the smaller giant pandas (maximum lifespan: 36.8 years) . However, the molecular mechanism for maintaining longevity in large carnivores is not very clear, and there have been very few relevant studies on cancer in carnivores so far.
In the present study, we identified a total of 100 BSAGs in carnivores that were related to the cancer control process, including tumor suppressor, DNA repair and immunity (Table S5). We do not discuss differences in the expression levels of cancer-related BSAGs among carnivores. The number of cancer-related BSAGs accounted for 29.7 % of the total number of BSAGs, which was far higher than the proportion of cancer-related genes to the total number of functional genes in the human genome (723/21,306, 3.4 %) .
Among these cancer-related BSAGs, 21 were determined to be tumor suppressor genes in previous studies (Table 1). For example, APC (APC regulator of the WNT signaling pathway) encodes a multidomain protein that plays a crucial role in tumor suppression by antagonizing the WNT signaling pathway. Variants in APC would induce various kinds of cancer, such as colorectal and pancreatic cancers [53, 54]. ZFHX3 (zinc finger homeobox 3) is essential for regulating myogenic and neuronal differentiation, and it was reported to function as a tumor suppressor in several cancers . The evolutionary rates of these two genes are significantly positively-correlated with both body size parameters, suggesting that large carnivores have a higher evolutionary rate than do small ones.
In addition, we identified 15 cancer-related REGs in extremely large carnivores, including two tumor suppressor genes—ADAM11 and TEP1, of which ADAM11 was still significant after FDR correction. The evolutionary rate of ADAM11 (ADAM metallopeptidase domain 11) was 0.48527 in the extremely large carnivores, 15.4 times greater than that identified in the background group. This gene was previously identified as a candidate tumor suppressor gene in human breast cancer . Alterations in TEP1 (Telomerase associated protein 1) were confirmed to cause several types of tumors in humans, including brain, breast, prostate and lung cancers . Both ADAM11 and TEP1 had relatively higher evolutionary rates in extremely large carnivores than small ones and suggested an enhanced ability to suppress cancer.
Additionally, 16 BSAGs were found to be related to “DNA repair” (Table S5), and it is well known that deficits in DNA repair capacity might lead to genetic instability and carcinogenesis . A recent study revealed that HELQ (Helicase, POLQ-like) plays a critical role in replication-coupled DNA repair, germ cell maintenance and tumor suppression in mammals . Importantly, 18 immunity-related genes were identified in BSAGs of carnivores (Table S5), of which three exhibited elevated evolutionary rates (MAGT1, RFXANK and SKAP2). The evolutionary rate of MAGT1 (magnesium transporter 1), for example, was 0.33127 in extremely large carnivores, 6.1 times that of the background group. Loss of MAGT1 disrupts T cell signaling and leads to a novel human primary immunodeficiency  and, furthermore, overexpression of MAGT1 is associated with development and metastasis of colorectal cancer . Here, we obtained 100 cancer-related genes that were significantly associated with body size evolution in carnivores, including 15 cancer-related REGs that were identified in the extremely large group and which might protect the animal from cancer invasion, especially for large and long-lived species. These results might provide novel molecular evidence for Peto’s paradox with regard to carnivores.
Fixed amino acid changes in extremely small carnivores contribute to growth restriction
There are some extremely small species of carnivores, such as meerkats and ferrets, which have body masses less than 1 kg and body lengths under 50 cm. These two species are distantly related and belong to two suborders (Feliformia and Caniformia, respectively). This small size may allow the species to flourish. For instance, smaller carnivores could take advantage of food resources that are not available to some large animals to ensure survival when the environment changes dramatically and have more free energy and time to engage in activities that increase mating and reproductive success [15,16,17].
Compared with other carnivores in our dataset, six fixed amino acid changes from six genes were identified in extremely small group (Fig. 4): CDC7 (S513L), ENG (V367I), LIG4 (S784P), MMP2 (S579T), TSPAN8 (S178T) and POLE (E701D). These six genes have been shown to be related to the phenotype of reduced body size. While CDC7 (cell division cycle 7) was highly conserved throughout mammalian evolution, a fixed amino acid change (S513L) was identified in extremely small carnivores. CDC7 plays essential roles in initiating mitotic DNA replication, and previous study showed that CDC7−/− or low expression of the CDC7 protein leads to reduced body size with decreased cell proliferation in mice . Importantly, the fixed changes (S513L) were located in the protein kinase domain that functions as an on/off switch for many cellular processes, including metabolism, cell cycle progression and transcription .
Another fixed change (V367I) was examined in the Zona_pellucida domain of ENG (endoglin), and it was reported that Eng −/− mice were three times smaller than wild type mice at embryonic day 10.5 of development . The unique amino acid mutation (S784P) was determined in the key domain (DNA ligase IV) of the LIG4 (DNA ligase 4) gene that was reported mutations in humans or mice would cause growth failure and microcephaly, and this might be the result of activation of the DNA damage response, leading to a large amount of apoptosis during development .
A Fixed amino acid mutation was separately found in the MMP2 (S579T) and POLE (E701D) genes. Previous studies have shown that MMP2 (matrix metallopeptidase 2) knockout in mice and mutations in POLE (DNA polymerase epsilon, catalytic subunit) in humans cause short stature [65, 66]. Finally, we found a fixed difference at site 178 of the Tetraspannin domain in TSPAN8 between extremely small carnivores and others; genetic ablation of this gene in mice caused a reduction (-15.6 %) in the body weight of males fed a normal chow diet . Furthermore, the changes in S513L in CDC7 and S784P in LIG4 affect polarity and might cause radical changes in the three-dimensional structure and function of proteins . These six unique changes examined in the extremely small carnivores might have restricted body size growth.
Though body size regulation is an inherently complex process involving many genes and signaling pathways, and not all genes function in the same way in different species and show the same evolutionary pattern, the amino acid sites with fixed changes in different body size groups could still reveal whether the body size variations in different carnivores is driven by common genes or shared mechanism to some degree. Of course, functional experiments are needed in future to further test whether and how these changes cause growth retardation in Carnivora.
Mammalian order Carnivora exhibits a huge variation in body size—with an over 130,000-fold difference in body mass between the heaviest and lightest species and 50-fold difference in body length between the longest and shortest—but the molecular mechanisms underlying the disparities remain poorly explored. Here we scanned the genomes of 20 representative carnivores and found a total of 337 genes associated with body size. Our analyses showed that 14 obesity-related genes and three rapidly evolving genes might drive body size expansion. The results provided molecular evidence for Peto’s paradox—a lack of correlation between body size and cancer risk—based on 100 body-size-associated genes associated with cancer control and 15 cancer-related genes under rapid evolution in carnivores. By contrast, 15 rapidly evolving genes and unique amino acid changes in six genes might have restricted the growth of small carnivores. This study brings new insights into the molecular mechanisms that drove the diversifying evolution of body size in carnivores, and provides new target genes for exploring the mysteries of body size evolution in mammals.
Phenotype data and orthologous genes
High-quality genomes of 20 carnivores and one outgroup cow (Bos taurus) were downloaded from the NCBI database. Eight families in Carnivora (Felidae, Canidae, Mustelidae, Phocidae, Otariidae, Odobenidae, Herpestidae and Ursidae) were chosen to represent a diversity of body sizes. Two kinds of phenotype data—head body length (cm) and body mass (g)—were collected from the PanTHERIA database  and used for subsequent correlation analysis. Head body length means the length from the snout of the nose to the root of the tail for an animal . Head body length data were missing on the domestic cat (Felis catus) in the database, so we obtained it from the Animal Diversity Web resource . All the phenotype data came from adult individuals.
We then divided these carnivores into four groups—extremely large (body mass > 350 kg), small (body mass < 12 kg), extremely small (body mass < 1 kg) and medium-sized (remaining carnivores)—for subsequent analysis. High-confidence “one-to-one” orthologous gene clusters were identified using the OrthoFinder  pipeline, which applied an all-against-all BLSATP algorithm. For genes with various transcripts, the longest coding sequence was used. Transcripts that were shorter than 150 bp or with lengths that were not multiples of three were eliminated by our in-house Perl scripts. The sequences were aligned using Prank  at the codon level, and poorly aligned regions with gaps and non-homologous fragments were removed using Gblocks  with relatively strict parameters (“-t = c, -b5 = h”). High-quality multiple sequence alignment (MSA) files were used for subsequent analysis.
PGLS scanning the body-size-associated genes (BSAGs)
PGLS implemented in the “Caper” package in R  was used to test the potential association between the evolutionary rates of each gene and each piece of phenotype data (i.e. head body length and body mass). An ultrametric tree of 20 carnivores for PGLS analysis was obtained from the TimeTree website . The Brownian motion model was applied and the phylogenetic signal (λ parameter) was tested by the maximum likelihood (ML) method. The lambda (λ) value was used as a quantitative measure of phylogenetic signals . A λ value estimated to be 1 or near to 1 indicated that these genes showed a strong phylogenetic signal. To obtain more stringent correlation P values, we further employed an extra two-step calibration procedure (an alternative to multiple testing correction), as suggested by Ma et al. . Using ‘P value.all’ from the regression analysis for all carnivores, two P values were calculated: (1) ‘P value.robust’ from the regression, repeated after discarding the species with the largest residual error, and (2) ‘P value.max’ from the PGLS on the remaining species, to calculate the maximum P value after dropping each species on at a time. Genes that were significantly related to both head body length and body mass under the most stringent standard (P value.all < 0.05, P value.robust < 0.05, P value.max < 0.05) are defined as body-size-associated genes (BSAGs).
The evolutionary rate ω—the ratio of non-synonymous (dN) to synonymous substitutions (dS)—was estimated using the free-ratios model (model = 1) implemented in the CODEML program of PAML 4.9e . The root-to-tip ω of each species was calculated by averaging the ω from the ancestral carnivore to each terminal branch according to the method suggested by Montgomery et al. . If the value of dN or dS in each ω value is less than 0.0002, then we marked it as an outlier “n/a” to prevent it from effecting the integral root-to-tip ω adversely. In addition, all root-to-tip ω values were log10-transformed to improve normality for regression analysis .
Functional enrichment analysis
The functional annotation clustering tool Metascape  was used to perform Gene Ontology (GO) and KEGG pathway enrichments for the BSAGs list. GO categories were discovered and grouped into annotation clusters against a background of the human genome. All GO terms with an enrichment score (ES) > 1.3 (corresponding to a P value less than 0.05) were considered significantly enriched.
We also used literature searches and the GWAS Catalog, Human Phenotype Ontology (HPO; ), DisGeNET , RefSeq , Cancer Gene Census (CGC; ), Online Mendelian Inheritance in Man (OMIM; ) databases to explore potential biological functions of each BSAG in association with body size.
Testing for rapidly evolving genes (REGs)
To test whether divergent selective pressures acted on carnivores with contrasting body size based on the BSAGs determined above, we divided the 20 species into three groups: The first was small body-sized carnivores (body mass < 12 kg and body length < 1 m), which included five species: meerkat, Suricata suricatta; ferret Mustela putorius furo; domestic cat, Felis catus; Canada lynx, Lynx canadensis; and red fox, Vulpes vulpes. The second was extremely large carnivores (body mass > 350 kg), which included four species: polar bear, Ursus maritimus; walrus, Odobenus rosmarus: northern sea lion, Eumetopias jubatus; and Weddell seal, Leptonychotes weddellii). The third group comprised the remaining 11 species. A two-ratio model (model = 2) that allows different ω values within the foreground and background branches was used to evaluate the selective pressures in the three groups of species. The small- and extremely large-body-sized groups were regarded as separate foreground branches and the remaining 11 species were regarded as background branches. The null model—i.e. one-ratio model (model = 0)—assumed that all branches have the same ω. The likelihood ratio test (LRT) was used to compare nested likelihood models and the FDR method was used for multiple testing correction (P adjust). We defined genes as REGs if their ω of the foreground was higher than that of the background branches with P value < 0.05.
Identification of fixed amino acid changes in small, extremely small and large groups
To explore how changes in single amino acid sites contribute to body size development, we scanned all the orthologous genes set for the fixed amino acid changes in the small group (body mass < 12 kg, i.e. meerkat, ferret, domestic cat, Canada lynx and red fox), the extremely large group (i.e. polar bear, walrus, northern sea lion and Weddell seal) and extremely small species (body mass < 1 kg, i.e. meerkat and ferret) . FasParser  was used to pick out the fixed amino acid changes specific to these two groups (compared with other carnivores). For the extremely small group, the stoat Mustela erminea (GCA_009829155.1)  was added to improve the reliability of the identified amino acid sites. Amino acid sites that were the same in the three extremely small species and were consistently different in other carnivores were selected. Sites containing gaps were also excluded, and positions were corrected using human (Homo sapiens) amino acid sequences. Pfam 1.6  was used to determine whether the changes were located in the functional domains of the protein.
Availability of data and materials
The data generated and analyzed during this study are included in this article and its additional files, including 8 tables and 4 figures. All genome sequences used in this study are available on the NCBI database (https://www.ncbi.nlm.nih.gov/) under the accession numbers reported in Additional file: Table S1.
Basic local alignment search tool
Breast cancer type 1 susceptibility protein
Body size associated genes
False discovery rate
Growth hormone-releasing factor
Genome-Wide Association Studies
Kyoto Encyclopedia of Genes and Genomes
Likelihood ratio test
Phylogenetic analysis by maximum likelihood
Phylogenetic generalized least squares
- POLQ :
DNA polymerase theta
Rapidly evolving genes
Soluble NSF attachment protein receptor
Single Nucleotide Polymorphism
Nowak RM, Walker EP. Walker’s Mammals of the World. 6th ed. Baltimore: The Johns Hopkins University Press; 1999.
Meloro C, Clauss M, Raia P. Ecomorphology of Carnivora challenges convergent evolution. Org Divers Evol. 2015;15(4):711–20.
Clauset A, Erwin DH. The evolution and distribution of species body size. Science. 2008;321(5887):399–401.
Kleiber M. Body size and metabolic rate. Physiol Rev. 1947;27(4):511–41.
McNab BK. Food habits, energetics, and the population biology of mammals. Am Nat. 1980;116(1):106–24.
McNab BK. Basal rate of metabolism, body size, and food habits in the order Carnivora. In: Gittleman JL, editor. Carnivore behavior, ecology, and evolution. Boston: Springer; 1989. pp. 335–54.
Gittleman JL. Carnivore body size: ecological and taxonomic correlates. Oecologia. 1985;67(4):540–54.
Eisenberg JF. The mammalian radiations: an analysis of trends in evolution, adaptation, and behaviour. Chicago: The University of Chicago Press; 1981.
Savage RJ. Evolution in carnivorous mammals. Palaeontology. 1977;20:237–71.
Bekoff M, Daniels TJ, Gittleman JL. Life history patterns and the comparative social ecology of carnivores. Annu Rev Ecol Syst. 1984;15:191–232.
King JE. Seals of the world. British Museum (Natural History). Oxford: University Press, Oxford; 1991.
Hofmeyr G. Mirounga leonina. The IUCN Red List of Threatened Species. Gland, Switzerland: Species Survival Commission; 2015.
Heptner VG, Sludskii AA. Carnivores (Mustelidae and Procyonidae). In: Heptner VG, editor. Mammals of the Soviet Union. Washington, DC: Smithsonian Institution Libraries and National Science Foundation; 2002.
Van Valkenburgh B, Wayne RK. Carnivores. Curr Biol. 2010;20(21):R915-19.
Moors PJ. Sexual dimorphism in the body size of mustelids (Carnivora): the roles of food habits and breeding systems. Oikos. 1980;34:147–58.
King CM. The advantages and disadvantages of small size to weasels, Mustela species. In: Gittleman JL, editor. Carnivore behavior, ecology, and evolution. Boston: Springer; 1989. pp. 302–34.
Blanckenhorn WU. The evolution of body size: what keeps organisms small? Q Rev Biol. 2000;75(4):385–407.
Hone DW, Benton MJ. The evolution of large size: how does Cope’s Rule work? Trends Ecol Evol. 2005;20(1):4–6.
Vermeij GJ. Gigantism and its implications for the history of life. PLoS One. 2016;11(1):e0146092.
Peto R, Roe F, Lee P, Levy L, Clack J. Cancer and ageing in mice and men. Br J Cancer. 1975;32(4):411–26.
Estes JA. Adaptations for aquatic living by carnivores. In: Gittleman JL, editor. Carnivore behavior, ecology, and evolution. Boston: Springer; 1989. pp. 242–82.
Peto R. Epidemiology, multistage models, and short-term mutagenicity tests. Int J Epidemiol. 2016;45(3):621–37.
Bannasch DL, Baes CF, Leeb T. Genetic Variants Affecting Skeletal Morphology in Domestic Dogs. Trends Genet. 2020;36:598–609.
Fan S, Wang Y, Lu J, Zheng Y, Wu D, Zhang Z, Shan Q, Hu B, Li M, Cheng W. CERS2 suppresses tumor cell invasion and is associated with decreased V-ATPase and MMP-2/MMP-9 activities in breast cancer. J Cell Biochem. 2015;116(4):502–13.
Cui T, Yang L, Ma Y, Petersen I, Chen Y. Desmocollin 3 has a tumor suppressive activity through inhibition of AKT pathway in colorectal cancer. Exp Cell Res. 2019;378(2):124–30.
Son HJ, Choi EJ, Yoo NJ, Lee SH. Inactivating mutations of tumor suppressor genes KLOTHO and DTWD1 in colorectal cancers. Pathol Res Pract. 2020;216(2):152816–16.
Adelman CA, Lolo RL, Birkbak NJ, Murina O, Matsuzaki K, Horejsi Z, Parmar K, Borel V, Skehel JM, Stamp G. HELQ promotes RAD51 paralogue-dependent repair to avert germ cell loss and tumorigenesis. Nature. 2013;502(7471):381–84.
Zou X, Zhang D, Song Y, Liu S, Long Q, Yao L, Li W, Duan Z, Wu D, Liu L. HRG switches TNFR1-mediated cell survival to apoptosis in Hepatocellular Carcinoma. Theranostics. 2020;10(23):10434.
Lehalle D, Mosca-Boidron A-L, Begtrup A, Boute-Benejean O, Charles P, Cho MT, Clarkson A, Devinsky O, Duffourd Y, Duplomb-Jego L. STAG1 mutations cause a novel cohesinopathy characterised by unspecific syndromic intellectual disability. J Med Genet. 2017;54(7):479–88.
Chang Y-H, Nishimura S, Oishi H, Kelly VP, Kuno A, Takahashi S. TRMT2A is a novel cell cycle regulator that suppresses cell proliferation. Biochem Biophys Res Commun. 2019;508(2):410–15.
Clauset A. How large should whales be? PloS one. 2013;8(1):e53967.
Pond CM, Ramsay MA. Allometry of the distribution of adipose tissue in Carnivora. Can J Zool. 1992;70(2):342–47.
Pond CM. Morphological aspects and the ecological and mechanical consequences of fat deposition in wild vertebrates. Annu Rev Ecol Syst. 1978;9:519–70.
Mustonen A-M, Puukka M, Saarela S, Paakkonen T, Aho J, Nieminen P. Adaptations to fasting in a terrestrial mustelid, the sable (Martes zibellina). Comp Biochem Physiol A: Mol Integr Physiol. 2006;144(4):444–50.
Shero MR, Costa DP, Burns JM. Scaling matters: incorporating body composition into Weddell seal seasonal oxygen store comparisons reveals maintenance of aerobic capacities. J Comp Physiol B. 2015;185(7):811–24.
Apovian CM. Obesity: definition, comorbidities, causes, and burden. Am J Manag Care. 2016;22(Suppl 7):s176-85.
Shah NR, Braverman ER. Measuring adiposity in patients: the utility of body mass index (BMI), percent body fat, and leptin. PLoS One. 2012;7(4):e33308.
Muntané G, Farré X, Rodríguez JA, Pegueroles C, Hughes DA, de Magalhaes JP, Gabaldón T, Navarro A. Biological processes modulating longevity across primates: a phylogenetic genome-phenome analysis. Mol Biol Evol. 2018;35(8):1990–2004.
Montgomery SH, Capellini I, Venditti C, Barton RA, Mundy NI. Adaptive evolution of four microcephaly genes and the evolution of brain size in anthropoid primates. Mol Biol Evol. 2011;28(1):625–38.
Boddy AM, Harrison PW, Montgomery SH, Caravas JA, Raghanti MA, Phillips KA, Mundy NI, Wildman DE. Evidence of a conserved molecular response to selection for increased brain size in primates. Genome Biol Evol. 2017;9(3):700–13.
Imaizumi T, Ando M, Nakatochi M, Yasuda Y, Honda H, Kuwatsuka Y, Kato S, Kondo T, Iwata M, Nakashima T. Effect of dietary energy and polymorphisms in BRAP and GHRL on obesity and metabolic traits. Obes Res Clin Pract. 2018;12(1):39–48.
de Lange IM, Verrijn Stuart AA, van der Luijt RB, Ploos van Amstel HK, van Haelst MM. Macrosomia, obesity, and macrocephaly as first clinical presentation of PHP1b caused by STX16 deletion. Am J Med Genet A. 2016;170(9):2431–35.
Lv D, Zhang D-D, Wang H, Zhang Y, Liang L, Fu J-F, Xiong F, Liu G-L, Gong C-X, Luo F-H. Genetic variations in SEC16B, MC4R, MAP2K5 and KCTD15 were associated with childhood obesity and interacted with dietary behaviors in Chinese school-age population. Gene. 2015;560(2):149–55.
Moleres A, Martinez J, Marti A. Genetics of obesity. Curr Obes Rep. 2013;2(1):23–31.
Lopez PH, Aja S, Aoki K, Seldin MM, Lei X, Ronnett GV, Wong GW, Schnaar RL. Mice lacking sialyltransferase ST3Gal-II develop late-onset obesity and insulin resistance. Glycobiology. 2017;27(2):129–39.
Gao C, Wang N, Guo X, Ziegler JT, Taylor KD, Xiang AH, Hai Y, Kridel SJ, Nadler JL, Kandeel F. A comprehensive analysis of common and rare variants to identify adiposity loci in hispanic Americans: the IRAS Family Study (IRASFS). PLoS One. 2015;10(11):e0134649.
Nunney L. Size matters: height, cell number and a person’s risk of cancer. Proc Royal Soc B. 2018;285(1889):20181743.
Abegglen LM, Caulin AF, Chan A, Lee K, Robinson R, Campbell MS, Kiso WK, Schmitt DL, Waddell PJ, Bhaskara S. Potential mechanisms for cancer resistance in elephants and comparative cellular response to DNA damage in humans. JAMA. 2015;314(17):1850–60.
Keane M, Semeiks J, Webb AE, Li YI, Quesada V, Craig T, Madsen LB, van Dam S, Brawand D, Marques PI. Insights into the evolution of longevity from the bowhead whale genome. Cell Rep. 2015;10(1):112–22.
Tollis M, Robbins J, Webb AE, Kuderna LF, Caulin AF, Garcia JD, Bèrubè M, Pourmand N, Marques-Bonet T, O’Connell MJ. Return to the sea, get huge, beat cancer: an analysis of cetacean genomes including an assembly for the humpback whale (Megaptera novaeangliae). Mol Biol Evol. 2019;36(8):1746–63.
De Magalhaes J, Costa J. A database of vertebrate longevity records and their relation to other life-history traits. J Evol Biol. 2009;22(8):1770–74.
Futreal PA, Coin L, Marshall M, Down T, Hubbard T, Wooster R, Rahman N, Stratton MR. A census of human cancer genes. Nat Rev Cancer. 2004;4(3):177–83.
Horii A, Nakatsuru S, Miyoshi Y, Ichii S, Nagase H, Ando H, Yanagisawa A, Tsuchiya E, Kato Y, Nakamura Y. Frequent somatic mutations of the APC gene in human pancreatic cancer. Cancer Res. 1992;52(23):6696–98.
Sparks AB, Morin PJ, Vogelstein B, Kinzler KW. Mutational analysis of the APC/β-catenin/Tcf pathway in colorectal cancer. Cancer Res. 1998;58(6):1130–34.
O’Leary NA, Wright MW, Brister JR, Ciufo S, Haddad D, McVeigh R, Rajput B, Robbertse B, Smith-White B, Ako-Adjei D. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 2016;44(D1):D733-45.
Yamamoto S, Higuchi Y, Yoshiyama K, Shimizu E, Kataoka M, Hijiya N, Matsuura K. ADAM family proteins in the immune system. Immunol Today. 1999;20(6):278–84.
Kohno T, Takahashi M, Manda R, Yokota J. Inactivation of the PTEN/MMAC1/TEP1 gene in human lung cancers. Genes Chromosomes Cancer. 1998;22(2):152–56.
Goode EL, Ulrich CM, Potter JD. Polymorphisms in DNA repair genes and associations with cancer risk. Cancer Epidemiol Biomarkers Prev. 2002;11(12):1513–30.
Li FY, Lenardo MJ, Chaigne-Delalande B. Loss of MAGT1 abrogates a Mg2 + flux required for T cell signaling and leads to a novel human primary immunodeficiency. Magnes Res. 2011;24(3):109.
Zheng K, Yang Q, Xie L, Qiu Z, Huang Y, Lin Y, Tu L, Cui C. Overexpression of MAGT1 is associated with aggressiveness and poor prognosis of colorectal cancer. Oncol Lett. 2019;18(4):3857–62.
Kim JM, Masai H. Genetic dissection of mammalian Cdc7 kinase: cell cycle and developmental roles. Cell Cycle. 2004;3(3):298–302.
Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S. The protein kinase complement of the human genome. Science. 2002;298(5600):1912–34.
Li DY, Sorensen LK, Brooke BS, Urness LD, Davis EC, Taylor DG, Boak BB, Wendel DP. Defective angiogenesis in mice lacking endoglin. Science. 1999;284(5419):1534–37.
Klingseisen A, Jackson AP. Mechanisms and pathways of growth failure in primordial dwarfism. Genes Dev. 2011;25(19):2011–14.
Bauters D, Scroyen I, Van Hul M, Lijnen HR. Gelatinase A (MMP-2) promotes murine adipogenesis. Biochim Biophys Acta Gen Subj. 2015;1850(7):1449–56.
Pachlopnik Schmid J, Lemoine R, Nehme N, Cormier-Daire V, Revy P, Debeurme F, Debré M, Nitschke P, Bole-Feysot C, Legeai-Mallet L. Polymerase ε1 mutation in a human syndrome with facial dysmorphism, immunodeficiency, livedo, and short stature (“FILS syndrome”) Polymerase ε1 mutation in humans. J Exp Med. 2012;209(13):2323–30.
Champy M, Le Voci L, Selloum M, Peterson L, Cumiskey A, Blom D. Reduced body weight in male Tspan8-deficient mice. Int J Obesity. 2011;35(4):605–17.
Zhang J. Rates of conservative and radical nonsynonymous nucleotide substitutions in mammalian nuclear genes. J Mol Evol. 2000;50(1):56–68.
Jones KE, Bielby J, Cardillo M, Fritz SA, O’Dell J, Orme CDL, Safi K, Sechrest W, Boakes EH, Carbone C. PanTHERIA: a species-level database of life history, ecology, and geography of extant and recently extinct mammals: Ecological Archives E090-184. Ecology. 2009;90(9):2648–48.
Kolb H. Variation in the size of foxes in Scotland. Biol J Linn Soc. 1978;10(3):291–304.
Myers P, Espinosa R, Parr C, Jones T, Hammond G, Dewey T. The animal diversity web. 2006.http://animaldiversity.org. Accessed 20 Oct 2020.
Emms DM, Kelly S. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 2015;16(1):157.
Löytynoja A. Phylogeny-aware alignment with PRANK. Methods Mol Biol. 2014;1079:155–70.
Castresana J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000;17(4):540–52.
Orme D, Freckleton R, Thomas G, Petzoldt T, Fritz S, Isaac N, Pearse W. Caper: comparative analyses of phylogenetics and evolution in R. Methods Ecol Evol. 2012;3:145–51.
Kumar S, Stecher G, Suleski M, Hedges SB. TimeTree: a resource for timelines, timetrees, and divergence times. Mol Biol Evol. 2017;34(7):1812–19.
Pagel M. Inferring the historical patterns of biological evolution. Nature. 1999;401(6756):877–84.
Ma S, Yim SH, Lee S-G, Kim EB, Lee S-R, Chang K-T, Buffenstein R, Lewis KN, Park TJ, Miller RA. Organization of the mammalian metabolome according to organ function, lineage specialization, and longevity. Cell Metab. 2015;22(2):332–43.
Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24(8):1586–91.
Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, Benner C, Chanda SK. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10(1):1–10.
Köhler S, Carmody L, Vasilevsky N, Jacobsen JOB, Danis D, Gourdine J-P, Gargano M, Harris NL, Matentzoglu N, McMurry JA. Expansion of the Human Phenotype Ontology (HPO) knowledge base and resources. Nucleic Acids Res. 2019;47(D1):D1018-27.
Piñero J, Ramírez-Anguita JM, Saüch-Pitarch J, Ronzano F, Centeno E, Sanz F, Furlong LI. The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res. 2020;48(D1):D845-55.
Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 2005;33(Database issue):D514-17.
Sun YB. FasParser: a package for manipulating sequence data. Zool Res. 2017;38(2):110–12.
Rhie A, McCarthy SA, Fedrigo O, et al. Towards complete and error-free genome assemblies of all vertebrate species. Nature. 2021;592(7856):737–46.
Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, Potter SC, Punta M, Qureshi M, Sangrador-Vegas A. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res. 2016;44(D1):D279-85.
We thank members of the Jiangsu Key Laboratory for Biodiversity and Biotechnology, Nanjing Normal University, for their contributions to this paper. The authors thank Mr. Xinrong Xu, Dr. Yulin Gai, Dr. Yuan Mu, Dr. Simin Chai, and Dr. Zhenpeng Yu for some helpful discussion. Many thanks to Dr. Ran Tian for literature assistant and important advice. Special thanks to Dr. Di Sun and Dr. Weijian Guo for technical supports.
This study was supported by the Key Project of the National Natural Science Foundation of China (grant nos. 32030011, 31630071 to GY), NSFC (grant nos. 32070409, 31772448 to SX), National Key Programme of Research and Development, Ministry of Science and Technology (Grant number 2016YFC0503200 to GY and SX), the Qing Lan Project of Jiangsu Province to SX, and the Priority Academic Program Development of Jiangsu Higher Education Institutions to GY and SX. These finding bodies played no role in study design, data collection, analysis, interpretation of data, and in writing the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Genome version and phenotype data for the 20 carnivores used in our study.
Significant results for the gene-phenotype association tests for both head body length and body mass (P value.all/robust/max < 0.05).
GO (Biological Process) and KEGG enrichment results for positively- and negatively-correlated BSAGs.
Biological roles of 14 positively-correlated BSAGs that related to obesity.
Biological Function of 100 cancer-control-related BSAGs.
Rapidly evolving genes in small and extremely large carnivores.
All results produced by PGLS analysis and REGs test in this study.
About this article
Cite this article
Huang, X., Sun, D., Wu, T. et al. Genomic insights into body size evolution in Carnivora support Peto’s paradox. BMC Genomics 22, 429 (2021). https://doi.org/10.1186/s12864-021-07732-w
- Body size associated genes
- Peto’s paradox
- Rapidly evolving genes
- Fixed amino acid changes