- Research article
- Open Access
Evaluation of functionality for serine and threonine phosphorylation with different evolutionary ages in human and mouse
BMC Genomics volume 19, Article number: 431 (2018)
Rapid evolution of phosphorylation sites could provide raw materials of natural selection to fit the environment by rewiring the regulation of signal pathways. However, a large part of phosphorylation sites was suggested to be non-functional. Although the new-arising phosphorylation sites with little functional implications prevailed in fungi, the evolutionary performance of vertebrate phosphorylation sites remained elusive.
In this study, we evaluated the functionality of human and mouse phosphorylation sites by dividing them into old, median and young age groups based on the phylogeny of vertebrates. We found the sites in the old group were more likely to be functional and involved in signaling pathways than those in the young group. A smaller proportion of sites in the young group originated from aspartate/glutamate, which could restore the ancestral functions. In addition, both the phosphorylation level and breadth was increased with the evolutionary age. Similar to cases in fungi, these results implied that the newly emerged phosphorylation sites in vertebrates were also more likely to be non-functional, especially for serine and threonine phosphorylation in disordered regions.
This study provided not only insights into the dynamics of phosphorylation evolution in vertebrates, but also new clues to identify the functional phosphorylation sites from massive noisy data.
Genetic variations are the primary sources that contribute to the evolution of new phenotypes [1,2,3]. With the great advances in the high-throughput genome sequencing, substantial genomic variations across different species were characterized [4,5,6,7]. However, how the variations positively influence the phonotypic outcomes remains largely unexplored. Changes of protein phosphorylation, which is the most ubiquitous post-translational modification conducting cellular signals, have been drawing many attentions [8,9,10]. Although phosphorylation sites (phosphosites) are on average more conserved than non-phosphorylated counterparts in evolution, they exhibit rapid divergence among species due to their structural preference in disordered regions [11,12,13,14,15]. The high diversity of phosphosites would possibly rewire the cellular signaling transduction and response to environment, providing potential materials that natural selection could act upon. Based on the observation that genetic interactions between kinases and substrates were altered at a higher rate than average genes among three species of yeast, it was postulated that the evolution of phosphorylation regulation made crucial contribution to phenotypic fitness just as transcriptional regulation .
Nonetheless, another pervasive consequence of the rapid turnover of phosphosites would be that a large percentage of phosphosites are non-functional [16, 17]. Though there is little empirical evidence, it is possible in principle because of the limited specificity of kinases. This hypothesis was primarily supported by the fact that a large proportion (about 65%) of phosphosites with no characterized function evolved at a similar rate compared with non-phosphosites in disordered regions [16, 17]. Moreover, the evolutionary conservation of phosphosites was positively associated with phosphorylation stoichiometry but negatively associated with protein abundance in yeast, which could be explained by more accidental phosphorylation in abundant proteins . A recent study presented a more comprehensive landscape of phosphorylation across 18 fungal species . Through tracking the phylogeny of the phosphosites, the study revealed that while only about 2% of the phosphosites could be preserved longer than 700 million years, 69% were gained younger than 18 million years. Especially, relatively to the recently acquired phosphosites, the ancient ones are more likely to be functionally important, which implies the enrichment of noisy phosphorylation in the former.
The prevalence of young phosphosites with silent functions was observed in fungi, but the evolutionary pattern in more complex vertebrate species was elusive. After all, the sophisticated development program and communications among numerous cells in vertebrate species strongly depend on signaling pathways mediated by phosphorylation [10, 20,21,22,23,24] . In fact, it was shown that phosphosites involved in vertebrate-specific functional modules (VFMs) were even more conserved than those in basic functional modules (BFMs) . In this study, with the large-scale data of human and mouse phosphosites, we aimed to evaluate the functionality of phosphosites with different evolutionary ages across vertebrates.
Three evolutionary age groups of phosphosites
Most of phosphorylation data on vertebrates were generated from model organisms, such as human and mouse [15, 26]. Due to the unbalance of phosphorylation studies, it was infeasible to directly compare the phosphorylated status across different species . However, as is known that the appearance of phosphor-acceptor amino acids (serine, threonine or tyrosine) was essential for phosphorylation [8, 27], it was reasonably assumed that the later appearance time of those amino acids in evolution would represent younger phosphorylated status. In this study, we separately collected 196,093 human and 62,274 mouse phosphosites from the PhosphoSitePlus database (version: 2015.12) , and 5747 phosphosites were shared by the two species (Additional file 1: Figure S1). We reconstructed the ancestral sequences of the phosphorylation proteins across 8 vertebrates (see Methods). Based on the first appearance time of the phosphor-acceptor amino acids in the vertebrate species tree, we divided the phosphosites into three different age groups: old, median and young (Fig. 1a). In the old group, the phosphosites kept conserved across all the vertebrates and emerged earlier than 435 million years ago (MYA), whereas the phosphosites in the young group were generated during the divergence of the mammal species (about 96 MYA) (Fig. 1a). The phosphosites with evolutionary age between the old and young group were defined to the median group.
The basic characters of phosphosites in the three age groups were analyzed. For the three phosphor-acceptor amino acids, the serine and threonine showed relatively balanced distribution among the age groups, while the tyrosine was much more enriched in the old group (Fig. 1b). As the difference in bio-chemical properties of the serine/threonine and tyrosine and that they were phosphorylated by different types kinases, our results indicated their different evolutionary dynamics. In each age group, there were more phosphosites located in disordered regions than in the ordered regions, consistent with the knowledge that they were more likely to be recognized by kinases on the surface of protein [14, 15, 28, 29] (Fig. 1b). In the following analyses, we mainly focused on the phosphosites of serine/threonine in the disordered regions, from which most of the evolutionary diversity resulted across the species .
Functional annotations of phosphosites in different evolutionary age groups
To evaluate the functional potential of phosphosites in different age groups, we took several types of annotations into consideration. Firstly, we separately gathered human and mouse known-functional phosphosites from the PhosphoSitePlus database and polymorphic phosphosites in populations from dbSNP [26, 30] (See Methods). We compared the proportion of the known-functional and polymorphic phosphosites among the three age groups, respectively. For both human and mouse, the proportion of known-functional phosphosites was significantly raised with increase of the evolutionary age (human: p-value < 0.01, mouse: p-value < 0.01, Chi-squared test), while the proportion of the polymorphic phosphosites was significantly decreased with the increase of the evolutionary age (human: p-value < 0.01, mouse: p-value < 0.01, Chi-squared test, Fig. 2a). The trends were also significantly observed for both human tyrosine phosphosites and serine/threonine phosphosites in ordered regions (Additional file 1: Figure S2). These results were well in agreement with the cases in yeasts, which suggested that functional phosphosites in vertebrates tended to be older in evolution. Meanwhile, as polymorphic sites were more likely to be evolutionarily neutral, our results implicated that a higher proportion of young phosphosites might be non-functional.
Next, due to the limited functional annotation of phosphosites, we took the proteins where the phosphosites resided into consideration. Wang et al. divided phosphorylated proteins into two broad modules: basic functional modules (BFMs) that shared by both vertebrates and non-vertebrates, and vertebrate-specific functional modules (VFMs) . In this study, it was shown that the phosphosites in VFMs showed higher evolutionary conversation than those in BFMs, which was validated in both the human and mouse (p-value = 1.50e-09 and p-value < 2.2e-016, separately, Wilcox rank sum test, Additional file 1: Figure S3A). In particular, there was a larger percentage of young phosphosites distributed in the BFMs compared with the VFMs (human: p-value = 0.035, mouse: p-value = 9.17e-10, Chi-squared test, Fig. 2b). In addition, a smaller proportion of young phosphosites in the BFMs were known-functional compared with those in the VFMs (human: p-value = 5.63e-07, mouse: p-value = 1.87e-07, Fisher exact test, Additional file 1: Figure S3B). As the VFMs contained many signaling pathways related proteins in which functional phosphosites were more likely to be involved, the results also implied that phosphosites in the young group were less functionally important than those in the old group.
Enrichment analysis of ancestral state for the phosphosites
It has been reported that more phosphosites could evolve from aspartate/glutamate (Asp/Glu) residues than corresponding non-phosphosites significantly [31, 32]. As the unpredictability of ancestral state for the old group conserved across all the vertebrates, we only considered phosphosites in the median and young groups for the analysis. Following Pearlman et al. , we did the enrichment analysis of the ancestral state for the phosphosites, taking the corresponding non-phosphorylated residues in the same age groups as the control (see Methods). In both human and mouse datasets, we observed significantly more phosphor-serine with evolutionary transition from ancestral Asp/Glu residues in the disordered regions (Fig. 3a). However, the enrichment was not significant for phosphor-threonine, phosphor-tyrosine and those in ordered regions, which probably due to the small sample size (Additional file 1: Figures S4 and S5). In addition to those results, we also discovered that lysine (Lys), a positively charged amino acid, was enriched in the transition to phosphor-serine, probably due to the fact that phosphorylation could also act upon Lys [33,34,35].
Asp/Glu, the negatively charged amino acids, can sometimes mimic the function of the phosphorylated residues with negative charge [31, 36, 37]. Thus, it was assumed that the phosphorylation of the residues evolved from Asp/Glu could activate the ancestral function of proteins. Consistent with the assumption, we found that there was a significant higher proportion of phosphosites transited from Asp/Glu distributed in the median group than in the young group (human: p-value < 2.2e-16, mouse: p-value < 2.2e-16, Chi-squared test, Fig. 3b). We also observed the similar results for the phosphosites transited from Lys (human: p-value < 2.2e-16, mouse: p-value < 2.2e-16, Chi-squared test). These provided another evidence that younger phosphosites of serine in disordered regions were more likely to be non-functional.
Phosphorylation level and breath among tissues
As reported that in yeasts, phosphosites of higher stoichiometry were more conserved and functional [18, 38]. However, the relationship among quantitative features and evolutionary conservation was not systematically studied in the different tissues of vertebrates. In this study, we collected 14,954 mouse phosphosites with phosphorylation levels across nine different tissues . Then, we compared the maximum phosphorylation level and phosphorylation breadth (i.e., number of tissues where the phosphorylation was expressed) among the three age groups (see Methods). Meanwhile, as the quantitative features of phosphosites might be confounded by those of the whole protein, we excluded the bias resulted from protein abundance and breadth.
We identified that the maximum phosphorylation level in the young group was lower compared with the old group (p-value = 1.87e-10, Wilcox rank sum test, Fig. 4a), but the protein abundance between the two groups showed no difference (p-value = 0.41, Wilcox rank sum test, Additional file 1: Figure S6A). We also found that the phosphorylation breadth was lower in the young group than the old group (p-value = 7.52e-12, Wilcox rank sum test, Fig. 4b). Although the protein expression breath showed difference (p-value < 2.2e-16, Wilcox rank sum test, Additional file 1: Figure S6B), the ANONA analysis suggested that the contribution of the phosphorylation breadth could not be compensated (p-value < 0.01) taking into account the two factors simultaneously. Furthermore, there was a larger proportion of phosphosites with both low phosphorylation level and breadth in the young group compared with the old group (p-value = 5.1e-04, Chi-squared test, Additional file 1: Figure S7). And with the stepwise regression, the contribution of phosphorylation level could be compensated by the breadth, indicating the high correlation between phosphorylation level and breadth (r = 0.46, Pearson’s correlation). These results indicated that with the increase of evolutionary age, both the phosphorylation level and breadth tended to be increased, which could be more likely to keep essential functions.
It was interesting that phosphosites evolved from Asp/Glu displayed higher phosphorylation level and breadth than those originated from other amino acids (p-value = 1.89e-08 and p-value = 1.12e-05, separately, Wilcox rank sum test, Fig. 4C and D). This supported our argument that this type of phosphosites was more intended to be functional. However, this trend was not observed for the phosphosites originated from Lys (p-value = 0.31 and p-value = 0.24, separately, Wilcox rank sum test). Thus, quantitative features would be also helpful to prioritize functional phosphosites.
As with genetic variants, rapid changes of protein phosphorylation would play important roles in the evolution of new phenotypes. However, a large part of phosphosites was speculated to be non-functional due to the non-specific recognition of kinases [16, 17]. Computational methods integrating evolutionary conservation, structural preference and kinase recognition were proposed to explore functional phosphosites from large-scale data [40,41,42]. And a recent fungi study showed that young phosphosites prevailed and contributed to the major noisy of phosphorylation . In this study, we investigated whether the evolutionary age of phosphosites was also associated with their functionality in vertebrates by dividing the phosphosites into old, median and young groups based on their emergence time in the species tree.
We found several evidences that the functional potential of phosphosites was increased with their evolutionary age especially for serine and threonine in the disordered regions, which was separately proved in the human and mouse data. Firstly, the old group harbored a higher proportion of known-functional phosphosites and a lower proportion of polymorphic phosphosites than the young group. Secondly, in VFMs where phosphorylation was more essential, there was a lower fraction of young phosphosites than in BFMs. Thirdly, fewer phosphosites originating from Asp/Glu which could mimic their ancestral functions, were evolutionarily young. Finally, the phosphorylation level and breadth among different tissues were increased with evolutionary age. And the phosphorylation level is highly correlated with the breadth.
Consistent with the study in fungi, our results provided a comprehensive scenario for the evolution of phosphorylation. The high turnover rate in disordered regions facilitated the birth of new phosphosites. However, most of them would be non-functional and eliminated due to the lack of functional constrains. The rapid ‘try-and-error’ process provided potential materials for the innovation of phenotypic fitness, and only those phosphosites acquiring indispensable functions could be retained during the long evolutionary time. The scenario provided a new approach to screen out the functional ones from massive phosphosite data by a combination of several factors, including evolutionary age, ancestral state, phosphorylation level and breadth.
It is important to note that the insights of tyrosine phosphorylation and the phosphorylation in ordered regions were limited in our study. This was because that the evolutionary rates of the two types were much lower than the serine and threonine phosphorylation in disordered regions, leading that only a small amount of data was useful. However, it was also observed that for both tyrosine phosphosites and serine/threonine phosphosites in ordered regions, fewer known-functional and more polymorphic phosphosites were distributed in the young group, indicating that they were subjected to the similar pattern. Another possible concern was that we identified the ages based on the phylogeny of phospho-acceptor, which was an upper bound for the emergence of phosphorylation status. For example, it was possible that some phosphosites in the old group might be phosphorylated in recent time. Therefore more comprehensive phosphorylation datasets covering different species were necessary to infer the phosphorylation age more precisely.
In summary, the current study evaluated the functionality of human and mouse phosphosites, indicating that newly emerged vertebrate phosphosites were more likely to be non-functional, especially for serine and threonine phosphorylation in disordered regions. Meanwhile, our study provided a new evolutionary scenario of phosphorylation. New phosphosites were caused by the high turnover rate in disordered regions, and only those with indispensable functions could result in the innovation of phenotypic fitness during the long evolutionary time. We also provided useful insights to screen out the functional ones from massive phosphosite data by a combination of the evolutionary age, ancestral state, phosphorylation level and breadth.
Classification of Phosphosites
One hundred ninety-six thousand ninety-three human phosphosites of 16,031 proteins and 62,274 mouse phosphosites of 9600 proteins were gathered from the PhosphoSitePlus database (version: v2015.12) separately . To classify the phosphosites by the evolutionary ages, 8 vertebrate species were selected, including H. sapiens, P. troglodytes, M. musculus, R. norvergicus, B. taurus, G. gallus, X. tropicalis and D. rerio. We built the species tree and got the divergence time from TimeTree.org . The orthologous proteins of these species were found from InParanoid8, and multiple sequence alignments were performed with Clustal-omega [44, 45]. Then, the FastML.v3.1 software was used to reconstruct the ancestral sequences of phosphorylation proteins [46, 47]. The phophosites were filtered by the following rules: the number of orthologous sequences should be more than 4 to infer the ancestral state. Finally, we got 115,780 human phosphosites and 42,244 mouse phosphosites respectively (Additional file 1: Table S1).
The VSL2 software was used to identify the phosphosites in ordered or disordered regions . It calculated the disorder score (between 0 and 1) for each amino acid based on 26 amino acids-based features for a given protein sequence. The score of phosphosites in disordered regions should be greater than 5. Otherwise, the phosphosites were in ordered regions.
The human and mouse functional phosphosites were collected from a file named ‘Regulatory_sites.gz’ in the PhosphoSitePlus database (file name: Regulatory_sites, version: v2015.12) . We respectively got human and mouse polymorphic phosphosites by overlapping with human and mouse SNPs annotated from dbSNP138 using ANNOVAR (version: 20150619, GRCH38 and GRCM38) . The definition of BFMs and VFMs for phosphorylation proteins refered to Wang et al. . And Rate4sites  was used to calculate the evolutionary rate of phosphosites and compared the rates between BFMs and VFMs.
Enrichment of ancestral state
We explored whether phosphosites in the median and young group could significantly originate from some amino acid residues compared with corresponding non-phosphosites. The ancestral state of those sites was retrieved from the FastML.v3.1 results. For the control dataset, we collected serine, threonine and tyrosine sites which were not known to be phosphorylated from the same proteins within the age groups. We filtered the control dataset was filtered by the following criterion: 1) the sites overlapped with the verified and predicted phosphosites from dbPTM were excluded; 2) we eliminated the potential phosphosites predicted by Networkin with the score greater than 2 were eliminated. According to the method of Pearlman et al., we calculated the ratio and p-value of enrichment for each ancestral amino acid by bootstrap sampling with 1000 times. For every sampling, the distribution of phosphosites in disordered regions was balanced between phosphosites and control datasets.
Fourteen thousand nine hundred fifty-four mouse phosphosites with quantification data were gathered in nine tissues . Overlapping with the age groups in disordered regions, we respectively got 2947, 2359 and 2310 sites in the old, median and young groups. Among the age groups, we respectively compared the maximum phosphorylation level across the tissue and the number of tissues where the phosphosites were expressed. In order to exclude the influence of protein abundance and breadth, the ANOVA analysis was performed ting both the phosphorylation and protein quantification into account.
vertebrate-specific functional modules
basic functional modules
million years ago
glutamate; Lys: lysine.
Wapinski I, Pfeffer A, Friedman N, Regev A. Natural history and evolutionary principles of gene duplication in fungi. Nature. 2007;449(7158):54–61.
Beltrao P, Trinidad JC, Fiedler D, Roguev A, Lim WA, Shokat KM, Burlingame AL, Krogan NJ. Evolution of phosphoregulation: comparison of phosphorylation patterns across yeast species. PLoS Biol. 2009;7(6):e1000134.
Landry CR, Lemos B, Rifkin SA, Dickinson WJ, Hartl DL. Genetic properties influencing the evolvability of gene expression. Science. 2007;317(5834):118.
Soon WW. Hariharan M. High-throughput sequencing for biology and medicine: Snyder MP; 2013.
Kircher M, Kelso J. High-throughput DNA sequencing--concepts and limitations. Bioessays News & Reviews in Molecular Cellular & Developmental Biology. 2010;32(6):524–36.
Purcell S, Neale B, Toddbrown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75.
Durbin R: 1000 genome project: a map of human genome variation from population scale sequencing. 2010.
Cohen P. The origins of protein phosphorylation. Nat Cell Biol. 2002;4(5):E127.
Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M. Global, in vivo, and site-specific phosphorylation dynamics in signaling networks. Cell. 2006;127(3):635.
Johnson LN. The regulation of protein phosphorylation. Biochem Soc Trans. 2009;37(4):627–41.
Malik R, Nigg EA, Körner R. Comparative conservation analysis of the human mitotic phosphoproteome. Bioinformatics. 2008;24(12):1426.
Chen SC, Chen FC, Li WH. Phosphorylated and nonphosphorylated serine and threonine residues evolve at different rates in mammals. Molecular Biology & Evolution. 2010;27(11):2548–54.
Boekhorst J, Breukelen BV, Heck AJ, Snel B. Comparative phosphoproteomics reveals evolutionary and functional conservation of phosphorylation across eukaryotes. Genome Biol. 2008;9(10):R144.
Jiménez JL, Hegemann B, Hutchins JR, Peters JM, Durbin R. A systematic comparative and structural analysis of protein phosphorylation sites based on the mtcPTM database. Genome Biol. 2007;8(5):1–20.
Gnad F, Ren S, Cox J, Olsen JV, Macek B, Oroshi M, Mann M. PHOSIDA (phosphorylation site database): management, structural and evolutionary investigation, and prediction of phosphosites. Genome Biol. 2007;8(11):R250.
Lienhard GE. Non-functional phosphorylations? Trends Biochem Sci. 2008;33(8):351.
Landry CR, Levy ED, Michnick SW. Weak functional constraints on phosphoproteomes. Trends in Genetics Tig. 2009;25(5):193.
Tan CS, Bader GD. Phosphorylation sites of higher stoichiometry are more conserved. Nat Methods. 2012;9(4):317. author reply 318
Romain A, RAR-M S, Haas KM, Hsu JI, Viéitez C, Solé C, Swaney DL, Stanford LB, Liachko I, Böttcher R, Dunham MJ, de Nadal E, Posas F, Beltrao P, Villén J. Evolution of protein phosphorylation across 18 fungal species. Science. 2016;354(6309):229–32.
Ubersax JA, Jr FJ. Mechanisms of specificity in protein phosphorylation. Nat Rev Mol Cell Biol. 2007;8(7):530–41.
Pawson T, Scott JD. Protein phosphorylation in signaling--50 years and counting. Trends Biochem Sci. 2005;30(6):286.
Karin M. Signal transduction from the cell surface to the nucleus through the phosphorylation of transcription factors. Curr Opin Cell Biol. 1994;6(3):415.
Johnson LN, Barford D. The effects of phosphorylation on the structure and function of proteins. Annual Review of Biophysics & Biomolecular Structure. 1993;22(22):199.
Hardie DG. AMP-activated/SNF1 protein kinases: conserved guardians of cellular energy. Nat Rev Mol Cell Biol. 2007;8(10):774–85.
Wang Z, Ding G, Geistlinger L, Li H, Liu L, Zeng R, Tateno Y, Li Y. Evolution of protein phosphorylation for distinct functional modules in vertebrate genomes. Mol Biol Evol. 2011;28(3):1131–40.
Hornbeck PV, Kornhauser JM, Tkachev S, Zhang B, Skrzypek E, Murray B, Latham V, Sullivan M. PhosphoSitePlus: a comprehensive resource for investigating the structure and function of experimentally determined post-translational modifications in man and mouse. Nucleic Acids Res. 2012;40:Database issue):261–70.
Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S. The protein kinase complement of the human genome. Science. 2002;298(5600):1912.
Iakoucheva LM, Radivojac P, Brown CJ, O'Connor TR, Sikes JG, Obradovic Z, Dunker AK. The importance of intrinsic disorder for protein phosphorylation. Nucleic Acids Res. 2004;32(3):1037.
Davis FP. Phosphorylation at the Interface. Structure. 2011;19(12):1726.
Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 2000;29(1):308.
Pearlman SM, Serber Z, Ferrell JE Jr. A mechanism for the evolution of phosphorylation sites. Cell. 2011;147(4):934–46.
Kurmangaliyev YZ, Goland A, Gelfand MS. Evolutionary patterns of phosphorylated serines. Biol Direct. 2011;6(1):1–7.
Besant P, Attwood P, Piggott M. Focus on Phosphoarginine and Phospholysine. Curr Protein Pept Sci. 2009;10(6):536–50.
Bertran-Vicente J, Serwa RA, Schumann M, Schmieder P, Krause E, Hackenberger CP. Site-specifically phosphorylated lysine peptides. J Am Chem Soc. 2014;136(39):13622–8.
Cieśla J, Frączyk T, Rode W. Phosphorylation of basic amino acid residues in proteins: important but easily missed. Acta Biochim Pol. 2011;58(2):137.
Thorsness PE, Koshland DE. Inactivation of isocitrate dehydrogenase by phosphorylation is mediated by the negative charge of the phosphate. J Biol Chem. 1987;262(22):10422–5.
Cooper JA, Sefton BM, Hunter T:  Detection and quantification of phosphotyrosine in proteins. Methods Enzymol 1983, 99(99):387.
Levy ED, Michnick SW, Landry CR. Protein abundance is key to distinguish promiscuous from functional phosphorylation based on evolutionary information. Philos Trans R Soc Lond Ser B Biol Sci. 2012;367(1602):2594–606.
Huttlin EL, Jedrychowski MP, Elias JE, Goswami T, Rad R, Beausoleil SA, Villén J, Haas W, Sowa ME, Gygi SP. A tissue-specific atlas of mouse protein phosphorylation and expression. Cell. 2010;143(7):1174.
Xiao Q, Miao B, Jie B, Zhen W, Li Y. Corrigendum: prioritizing functional phosphorylation sites based on multiple feature integration. Sci Rep. 2016;6:24735.
Shen N, Wang Z, Ge D, Zhang G, Li Y. Prediction of functional phosphorylation sites by incorporating evolutionary information. Protein & Cell. 2012;3(9):675–90.
Beltrao P, Albanèse V, Kenner LR, Swaney DL, Burlingame A, Villén J, Lim WA, Fraser JS, Frydman J, Krogan NJ. Systematic functional prioritization of protein posttranslational modifications. Cell. 2012;150(2):413.
Hedges SB, Marin J, Suleski M, Paymer M, Kumar S. Tree of life reveals clock-like speciation and diversification. Molecular Biology & Evolution. 2015;32(4):835–45.
Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, Mcwilliam H, Remmert M, Söding J. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal omega. Mol Syst Biol. 2011;7(7):539.
Sonnhammer EL, G Ö: InParanoid 8: orthology analysis between 273 proteomes, mostly eukaryotic. Nucleic Acids Res 2015, 43(Database issue):234–239.
Pupko T, Pe'Er I, Shamir R, Graur D. A fast algorithm for joint reconstruction of ancestral amino acid sequences. Molecular Biology & Evolution. 2000;17(6):890.
Haim A, Osnat P, Adi DF, Ofir C, Gina C, Oren Z, Tal P. FastML: a web server for probabilistic reconstruction of ancestral sequences. Nucleic Acids Res. 2012;40:Web Server issue):580–4.
Peng K, Radivojac P, Vucetic S, Dunker AK, Obradovic Z. Length-dependent prediction of protein intrinsic disorder. Bmc Bioinformatics. 2006;7(1):1–17.
Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164.
Tal Pupko REB, Mayrose I, Glaser F, Ben-Tal N. Rate4Site: an algorithmic tool for the identification of functional regions in proteins by surface mapping of evolutionary determinants within their homologues. Bioinformatics. 2002;18:S71–7.
The authors thank the members of the YL lab for helpful discussion.
This work was supported by the National Key R&D Program of China (2017YFA0505500, 2016YFC0901704), the National Natural Science Foundation of China (31301032), the Youth Innovation Promotion Association CAS (2017325) and the Shanghai Postdoctoral Scientific Program (13R21417300). The funding bodies had no role in the design of the study and collection, analysis and interpretation of data and in writing the manuscript.
Availability of data and materials
BM was responsible for evolutionary analysis of the phosphorylation data. QX carried out the polymorphic phosphorylation sites. WC carried out the data collection and interpretation. ZW conceived this research and drafted the manuscript. YL conceived the research and revised the paper. All authors read and approved the final manuscript.
Ethics approval and consent to participate
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. The shared phosphosites between human and mouse. Figure S2. The functional annotations for human tyrosine phosphosites and serine/threonine phosphosites in ordered regions. Figure S3. The difference of conservation and functional annotations between BFMs and VFMs. Figure S4. Enrichment analysis of amino acids in the transition to phosphosites for human. Figure S5. Enrichment analysis of amino acids in the transition to phosphosites for mouse. Figure S6. The distribution of protein abundance and breath in mouse. Figure S7. The phosphosites with both low phosphorylation level and breadth significantly enriched in young group compared with old group. Table S1. The number distribution of human and mouse phosphosites. (DOCX 226 kb)
About this article
Cite this article
Miao, B., Xiao, Q., Chen, W. et al. Evaluation of functionality for serine and threonine phosphorylation with different evolutionary ages in human and mouse. BMC Genomics 19, 431 (2018). https://doi.org/10.1186/s12864-018-4661-6