- Research article
- Open Access
The TriTryp Phosphatome: analysis of the protein phosphatase catalytic domains
BMC Genomicsvolume 8, Article number: 434 (2007)
The genomes of the three parasitic protozoa Trypanosoma cruzi, Trypanosoma brucei and Leishmania major are the main subject of this study. These parasites are responsible for devastating human diseases known as Chagas disease, African sleeping sickness and cutaneous Leishmaniasis, respectively, that affect millions of people in the developing world. The prevalence of these neglected diseases results from a combination of poverty, inadequate prevention and difficult treatment. Protein phosphorylation is an important mechanism of controlling the development of these kinetoplastids. With the aim to further our knowledge of the biology of these organisms we present a characterisation of the phosphatase complement (phosphatome) of the three parasites.
An ontology-based scan of the three genomes was used to identify 86 phosphatase catalytic domains in T. cruzi, 78 in T. brucei, and 88 in L. major. We found interesting differences with other eukaryotic genomes, such as the low proportion of tyrosine phosphatases and the expansion of the serine/threonine phosphatase family. Additionally, a large number of atypical protein phosphatases were identified in these species, representing more than one third of the total phosphatase complement. Most of the atypical phosphatases belong to the dual-specificity phosphatase (DSP) family and show considerable divergence from classic DSPs in both the domain organisation and sequence features.
The analysis of the phosphatome of the three kinetoplastids indicates that they possess orthologues to many of the phosphatases reported in other eukaryotes, including humans. However, novel domain architectures and unusual combinations of accessory domains, suggest distinct functional roles for several of the kinetoplastid phosphatases, which await further experimental exploration. These distinct traits may be exploited in the selection of suitable new targets for drug development to prevent transmission and spread of the diseases, taking advantage of the already extensive knowledge on protein phosphatase inhibitors.
The parasitic protozoa Trypanosoma cruzi, Trypanosoma brucei and Leishmania major are the causative agents of human diseases known as Chagas disease, African sleeping sickness and cutaneous Leishmaniasis, respectively, that affect millions of people in developing countries (Central and South America, sub-Saharan Africa, Asia and parts of Europe) . A major factor in the prevalence of these diseases is poverty and lack of medical resources in areas already afflicted by poor social support and threatened economies. Currently, there are no vaccines to prevent the spread of these diseases and many of the drugs available for treatment are highly toxic and require hospitalisation. In the past several years, new efforts have polarised the attention of laboratories worldwide to tackle the control of these neglected diseases. The genome sequences for these pathogenic kinetoplastids, and recently for two more Leishmania species, have now been reported [2–5] representing a major step forward to expand our understanding of their biology. This information provides a great opportunity to analyse particular gene families and to compare them with those of the host, with the aim to identify new targets for pharmaceutical intervention.
The sequenced kinetoplastid parasites have very different life cycles. Each parasite uses a different insect as their transmitting vector and prefers different conditions in which to survive and reproduce after infecting a human host. L. major is an intracellular parasite, seeking to invade macrophages and T. cruzi, also intracellular, invades and replicates in many cell-types, including macrophages and fibroblasts. T. brucei is an exclusively extracellular parasite that resides in the bloodstream of the mammalian host. As the parasitic life cycle takes these organisms through widely differing environments, frequent and substantial adaptive changes are required in many cell processes, resulting in changes in gene expression, protein levels and protein modifications [6–8]. A well-documented posttranslational modification is protein phosphorylation, which exhibits marked changes during development of these parasites [9, 10].
Reversible protein phosphorylation is one of the most important biological mechanisms for the regulation of adaptive responses to intra- and extra-cellular signals in both eukaryotes and prokaryotes. Many cellular signalling pathways are modulated through the antagonistic activities of highly specific protein kinases and protein phosphatases that control a number of processes including metabolic pathways, cell-cell communication, cell growth and proliferation and gene transcription. Because of their essential roles, mutated forms of these proteins are often involved in disease states in mammals [11–14] or result in a severe reduction in virulence and infection of pathogenic bacteria [15–17]. The specific roles of protein phosphatases in unicellular protists, in particular protozoan parasites like Trypanosomes and Leishmania are less understood.
Recent work has identified several protein phosphatases and their roles in regulation of parasite development. Tb PTP1 phosphatase is a master switch in controlling differentiation in T. brucei . In particular, this phosphatase inhibits the transition from the G0-arrested stumpy form in the blood stream to the procyclic form in the tsetse fly. Tb PTP1 inactivation by genetic or chemical means releases such inhibition and allows the cells to progress through differentiation to the procyclic form. Another protein tyrosine phosphatase, not orthologous to Tb PTP1, has been identified in L. major (Lm PTP1) that allows L. major amastigotes to survive in mice . This may be an important factor in virulence, enabling the invading pathogen to survive in a host.
Other phosphatases characterised include PP1 from T. cruzi , PP1 and PP2A in T. brucei , PP5 in T. brucei  and protein phosphatases with EF-Hands (PPEF) in T. cruzi, T. brucei and L. major , each with various roles in the biology and development of kinetoplastids. All this work highlights the importance of protein phosphatases in the regulation of essential developmental aspects in the life cycle of pathogenic kinetoplastids.
Protein phosphatases are traditionally classified according to their substrate preferences, including serine/threonine phosphatases (STP), tyrosine-specific phosphatases (PTP), dual-specificity phosphatases (DSP), that dephosphorylate phospho-serine, phospho-threonine and phospho-tyrosine substrates, lipid phosphatases (PTEN type and Myotubularins), and the low molecular weight PTP (LMW-PTP). The presence of specific conserved motifs in the catalytic domain as well as additional regulatory or targeting domains allow these types to be recognised and classified into different subfamilies [24–27]. We have applied this knowledge on domain architecture and conserved catalytic motifs as the basis for the phosphatase domain classification.
A recent study of the TriTryp kinome  provides an overview of the kinase complement in the three parasites, and highlights important traits in the distribution of the different families of kinases. For a full understanding of phosphorylation-mediated events we also require knowledge of their counterpart protein phosphatases. We describe in this work the protein phosphatase complements of the TriTryp genomes. We employed an ontology-based classification tool that we have successfully used in the past to classify phosphatases in the Human and Aspergillus fumigatus genomes  to compile the phosphatomes of T. cruzi, T. brucei and L. major. We found that these organisms have an unusual composition of phosphatases with the PTP family being greatly reduced while the STP family has expanded by comparison with human phosphatases. We have also identified novel domain architectures in several phosphatases with potentially new functions and a number of unique and atypical phosphatases. With less than 30 phosphatases characterised in kinetoplastids out of more than 250, the information contained in the TriTryp phosphatome should stimulate further experimentation that would lead to a much more complete understanding on the biology of these important parasites. The significant divergence from human phosphatases indicates that these enzymes may be suitable targets for the development of specific inhibitors with therapeutic applications.
Results and discussion
The TriTryp phosphatome
Following the ontology classification, a total of 252 protein phosphatases were identified: 86 in T. cruzi, 78 in T. brucei and 88 in L. major (full list in Additional file 1). Overall there is a great deal of similarity across the three genomes regarding the number of phosphatases of each type (Figure 1, Table 1). The three kinetoplastids have representatives from all the major families of phosphatases: STPs, PTPs, DSPs and the lipid phosphatases PTEN and Myotubularins. Missing genes include the low molecular weight PTP (LMW-PTP) and the eyes absent (EYA) phosphatase. Consistent with the evolutionary divergence of kinetoplastid parasites, a high proportion of phosphatases were identified that have no clear orthologues in other genomes reported to date. We found that atypical and kinetoplastid-specific phosphatases amount to 36% for T. cruzi, 39% for T. brucei and a 41% for L. major of the total of phosphatases, most of them belonging to the STP and DSP subfamilies (Table 1).
The proportion of each subfamily of phosphatases is also significantly different from the human, S. cerevisiae and plant phosphatome (Figure 1). The most striking feature is the small proportion of PTPs present in the TriTryp phosphatome. This resembles only that of the plant A. thaliana. Whereas in humans and S. cerevisiae, PTPs comprise 31% and 16%, respectively of their phosphatase complement, A. thaliana and the kinetoplastids have a very small proportion: 2% for A. thaliana (2 proteins), 2.3% for T. cruzi (2 proteins), 2.6% for T. brucei (2 proteins) and 3.4% for L. major (3 proteins). When the proportion of STPs is analysed, there are also significant differences to the other species. T. cruzi has 65%, T. brucei, 69%, L. major, 66%, S. cerevisiae, 61% and A. thaliana 81%, whereas the human genome only has 24%. The DSP family proportion in the kinetoplastids is similar to humans and S. cerevisiae: T. cruzi has 23% DSPs, T. brucei, 24% and L. major, 25% compared with 31% for the human DSPs and 18% in S. cerevisiae. T. cruzi has a higher proportion of PTEN and myotubularin phosphatases compared to the other two parasites, with 3 more PTEN sequences than L. major and 4 more than T. brucei. The low proportion of PTPs, together with the large proportion of STPs and of atypical and kinetoplastid-specific phosphatases, suggests that phosphorylation-mediated mechanisms in these parasites may have a different emphasis than in vertebrates. It is interesting that the extremely low proportion of PTPs seems to be compensated by an expansion of the STP family. This may be related to the absence of tyrosine-specific kinases encoded in the kinetoplastid genomes  whose activity may be replaced by dual-specificity protein kinases. Overall it seems that there is still much to uncover about the functions of phosphatases in organisms distantly related to the mammalian model, traditionally accepted as the standard.
Protein tyrosine phosphatases
PTPs contain single polypeptide chains that form the catalytic domain and they are usually decorated with accessory subdomains (for example, SH2, Rhodanese, Ig, FN) critical for specific regulation or subcellular location [25, 29]. PTPs are recognised by a highly conserved active-site motif, CX5R, necessary for a Cys-based mechanism of catalysis, assisted by a conserved Asp residue. The rest of the catalytic domain differs significantly between subfamilies. In addition to the classical PTPs a number of atypical phosphatases exist that lack catalytic activity (STYX or pseudophosphatases) . All TriTryp PTPs identified have a single PTP domain (Figure 2) with the conserved active-site motif, CX5R, but with no extracellular regions or trans-membrane regions predicted, and without any additional recognisable regulatory or targeting domains commonly found in human PTPs. A sequence analysis shows that kinetoplastid PTPs fall into three separate groups (see Additional File 2) based on the conservation of the 10 landmark motifs known to be important for catalysis, substrate binding and maintenance of the three-dimensional fold characteristic of PTPs . Group 1 contains the sequences that are the most similar to human phosphatase domains. A member of this group is present in L. major (Lm PTP1) with an orthologous syntenic gene in T. cruzi (Tc PTP1) (see Additional file 3). However, T. brucei lacks an orthologue of this protein, suggesting that it may have a role in intracellular parasitism. This is consistent with recent functional analysis of LmPTP1 demonstrating reduced virulence of amastigote forms upon genetic ablation . Group 2 contains three proteins, Tb PTP1, Lm PTP2 and Tc PTP2 (Note that, despite the nomenclature, Tb PTP1 is not the orthologue of Tc PTP1 and Lm PTP1). We have recently characterised Tb PTP1 as a tyrosine specific PTP with a critical role in controlling T. brucei differentiation . These Group 2 PTPs lack motif 2 (DX2RVXL) in the phosphatase domain and contain up to six kinetoplastid-specific regions in both the pre-catalytic and catalytic domain of the protein. Distinct specific motifs are also found in Group 1 PTPs with slight sequence variations (see Additional file 4). The function of these regions is unknown but may be potentially important in substrate recognition or regulation. Group 3 (kinetoplastid-specific PTPs, kPTPs) sequences show the most interesting variations of the PTP domain with substitutions in most motifs and a deletion between motifs 7 and 8. Substitutions were detected in the structural motifs (motifs 2–7) of five hydrophobic residues-required for core stability-, by hydrophilic and basic residues. Altogether, these changes may have a considerable effect on the stability of the PTP domain and perhaps this is compensated by alternative folding arrangements or local conformational adjustments. These may become clear once structural information on these enzymes is available. Phylogenetic analysis of the PTP sequences (Figure 3) confirms the presence of three clades, which are distantly related to human, S. cerevisiae and A. thaliana PTPs.
Because the analysis of the TriTryp kinome revealed that these organisms do not have tyrosine kinases, determining the exact roles of the uncharacterised PTPs has become more important, given their potential role in the regulation of parasite biology. Our analysis did not identify any potential receptor PTPs, which is consistent with the lack of receptor tyrosine kinases. The sequence differences described above and the low homology to human PTPs suggest that these phosphatases may be suitable targets for the design of specific and selective inhibitors against parasitic infection and transmission.
TriTryp DSPs show a large variety of domain architectures, some of them completely novel when compared to DSPs in other organisms (Figure 2). Sequence analysis of the type 2 DSP domains in the TriTryp genomes shows a group of parasite proteins with clear matches to classic eukaryotic DSPs (eDSPs). These sequences show good conservation of the five classic motifs in DSPs . The eDSP group includes PRL (phosphatase of regenerating liver) and its isoforms, and cdc14 (cell division cycle 14) (Table 1, Figure 4).
eDSPs: cdc14 and PRLs
PRL phosphatases (PRL1, -2, -3) play important roles in cell growth and cellular transformation, potentially through regulation of spindle dynamics [31, 32]. They show sequence similarity to cdc14, a DSP that regulates mitotic exit [33, 34] and PTEN, a tumour suppressor , but they posses a unique C-terminal prenylation site or CAAX sequence, essential for localisation to the plasma membrane and early endosomes . PRL phosphatases are found in the three kinetoplastids and all of them contain prenylation C-terminal CAAX sequences. PRL-1 in T. cruzi, has been characterised and it was found to be farnesylated at the C-terminus and to localise to endocytic membranes . The functional role of this protein in trypanosomes is still unclear.
Atypical DSPs (aDSPs)
The majority of TriTryp DSPs show significant differences to classic DSPs, and have either unusual domain organisations (Figure 2), or are lacking conservation of sequence features in their catalytic domain (see Additional file 5), we refer to this group as atypical DSPs (aDSPs). In the first category, we found three subfamilies of phosphatases with unusual domain organisation: 1) a catalytic DSP domain with additional Leucine Rich Repeats (LRRs) (types 4 and 5 in Figure 2); 2) a DSP domain with two pseudokinase domains (see below) and LRRs (types 6 and 7, Figure 2), we named this subfamily kinatases (from kin ase and phosphatase); and 3) a DSP domain with an ankyrin domain. The presence of LRRs in a phosphatase sequence is very unusual. Related examples are only found in A. thaliana, which contains a large family of receptor protein kinases with LRRs  and also in some human kinases, for example, LRRK1 . The only other example of a phosphatase with LRRs is the human protein PHLPP, which dephosphorylates Akt and promotes apoptosis . This protein contains LRRs with a protein phosphatase 2C domain. No sequences have been reported to date containing both LRRs and DSP domains or kinase domains with LRRs and DSP domains, making these domain organisations unique to the kinetoplastids.
The LRR regions from each of the kinetoplastid sequences were investigated using sequence similarity searching. Interestingly, the LRRs in the kinatase group have homology to LRR modules in SHOC2/SUR-8 . The LRR-DSPs match SHOC-2, or Soc-2 proteins, but the E-values for all the matches are much higher (>0.001). SHOC2 is a Ras-binding scaffold protein that enhances Ras-MAP kinase signal transduction by facilitating the interaction between Ras and Raf . Other matches include a human sequence (Q9Y4C4_HUMAN, TrEMBL database) annotated as "Malignant fibrous histiocytoma amplified sequence", which contains a Ras GTPase domain, and plant intracellular Ras-group related LRR proteins (PIRLs) , related to Ras-binding proteins in animals and yeast. All three kinatases and all three LRR-DSPs also have near-top matches to bacterial LRR-containing sequences annotated as small GTP-binding proteins (e.g. Trichodesmium erythraeum Q10Y31_TRIEI, Magnetococcus A0L4U3_MAGSM, and Anabaena variabilis Q3MD20_ANAVT). Overall, the kinatases and LRR-DSPs show similarity to LRR proteins involved in regulation of Ras-mediated signalling pathways and small GTP-binding proteins in bacteria, suggesting that they may share similar functional roles in the parasites and act as scaffolding proteins in signalling pathways.
Analysis of the kinase domains in the kinatases indicates that they are likely to be enzymatically inactive, and we refer to them as pseudokinases for consistency with the previous classification. Comparison to eukaryotic protein kinases highlights the lack of residues essential for catalytic activity and substrate binding (Figure 5). For example, the glycine triad (GXGXXG) necessary for ATP binding is only present in one of the T. cruzi kinase domains, and the catalytic Asp residue in the active site HRD motif is substituted by other residues in all of the kinase domains. The first kinase domain in the kinatases also has substitutions in the magnesium-binding DFG motif. Although these pseudokinase domains might have lost the capability to phosphorylate substrates, they may still perform important regulatory roles in signalling pathways as has been reported for other pseudokinases . The presence of the LRR modules strongly suggests a role in protein-protein interactions, while the addition of an active DSP domain and pseudokinase domains hints to a sophisticated regulatory function, which has not been reported in higher eukaryotes. A similar domain architecture (kinase + phosphatase) is present in other protozoan organisms such as Giardia lamblia, Tetrahymena thermophila, and Dictyostelium discoideum, indicating evolutionary conservation of a these proteins. Sequences with LRRs and DSP domains only, are also present in other protozoa: Entamoeba histolytica and Dictyostelium discoideum suggesting that this family may be also specific to protozoa. It will be interesting to further explore the functional relevance of these novel DSPs in kinetoplastid signalling.
The last type of atypical domain organisation is found in T. cruzi and contains a DSP domain with an additional ankyrin domain (type 8, Figure 2). Ankyrin is a common protein-protein interaction domain found in proteins involved in transcription initiation, cell cycle regulation and signalling . There are few examples of PTPs with Ankyrin domains. One of these is the human protein tyrosine phosphatase PTN20 variant 11 (Q4JDK8_HUMAN). Others include uncharacterised sequences from C. elegans and C. briggsae (Q22668_CAEEL and Q628I2_CAEBR). No DSPs have been reported with ankyrin domains thus this appears to be a novel domain architecture only present in T. cruzi.
In a second category of aDSPs, we found a number of sequences that are particularly different from eukaryotic DSPs with low homology in one or more functional DSP motifs, and with amino acid substitutions in the active site P-loop (see Additional file 5). These sequences can be grouped into 6 further subgroups. Groups 1, 2 and 3 contain basic residues Arg or Lys in the P-loop signature, a trait found in PTEN lipid phosphatases, although in the phylogenetic tree they cluster independently of PTEN. We refer to them as lipid-like aDSPs (Figure 4 and Table 1). Group 3 have similar features in the P-loop to the triple-specificity phosphatase MptpB in M. tuberculosis . Groups 4 and 5 lack the catalytic Arg in the P-loop and are missing either the D-motif or the R-motif. This group clusters around the lipid-like DSPs (aDSP in Figure 4). Another group (group 6 in Additional file 5) has substitutions of the catalytic Cys residue in the P-loop, a diagnostic of inactive phosphatases, also called STYX phosphatases. STYX phosphatases are evolutionarily conserved pseudophosphatase modules  that, like pseudokinases are enzymatically inactive, but are functionally important as modulators of phosphorylation-dependent signalling. Their actions may be implicated in either competitive or cooperative mechanisms of phosphosubstrate binding and controlling subcellular location as reported for STYX and inactive myotubularins [47, 48] and the inactive phosphatase domains of receptor PTPs .
MAP kinase phosphatases (MKPs) are critical regulatory proteins in stress-dependent and hormone-related signalling pathways, and are highly conserved in vertebrates. However, other eukaryotes, like yeast and plants, have distinct MKPs apparently unrelated to the vertebrate analogues [50–53]. There are no obvious orthologues of human MKPs in kinetoplastids. Instead, a number of sequences were found to have close homology to reported MKP phosphatases in Arabidopsis (At MKP1, At DSPTP1). This is supported both by blast searches and clustering in the phylogenetic tree (Figure 6). These sequences segregate into two groups in the tree (Figure 6), one of them includes the atypical DSPs with LRR motifs and the kinatases. The other group clusters with AtMKP1. One sequence in the second group, Tc00.1047053509099.20, has a KIM-like motif (kinase interacting motif) , although it does not contain the remaining motifs, found in classic MKPs  and shared with different MAP kinase (MAPK) effectors , or the rhodanese domain found in human MKPs. Furthermore, the close resemblance to plant MKPs highlights their divergence from the mammalian genes. The interesting combination with protein interacting modules, like LRR (discussed above) and pseudokinase domains, suggests a role in the regulation of signalling pathways, likely of the Ras-dependant MAPK type. There are a total of 42 protein kinases in the T. brucei, T. cruzi and L. major, which are thought to be regulated as MAPKs  and it is possible that the aDSP-MKPs identified here play a role in their regulation.
Kinetoplastid-specific DSPs (kDSP)
A significant number of sequences in the DSP family share most of the classic DSP motifs, although they have no clear homology to any reported DSP and so they appear to be kinetoplastid-specific (Table 1, Figure 4). To determine kDSP function, further experimental characterisation will be needed.
Special features and accessory domains
A feature of the kinetoplastid DSPs is that most of them, including the eDSPs, are significantly longer than human DSPs. The same is true for several kinases, in particular MAPKs [56, 57]. A calculation of the average human DSP length gave 392aa whereas the figures for the kinetoplastids were: T. cruzi, 501aa; T. brucei, 519aa; and L. major, 700aa. Many of the parasite DSPs, have either N- or C- terminal extensions, some of them with no significant matches (E-value < 0.01) to annotated proteins such that no functions for these extra regions can be assigned. Other extensions contain accessory motifs or domains including the LRRs, pseudokinase domains and ankyrin domains previously described. Another example is the presence of a CAAX box, consisting of a cysteine (C), two aliphatic residues (A, A) and any other amino acid (X) in the PRL phosphatases. The cysteine directs protein prenylation important for localisation of the PRLs.
In contrast, some TriTryp DSPs, lack typical accessory modules found in mammalian genes. Such is the case of the putative MKPs that lack the kinase-binding rhodanese domain or CH2 domain  in the N-terminal region. This is also missing in mammalian low molecular weight MKPs, although they are still capable of dephosphorylating MAPKs. In the kinetoplastid MKPs the presence of LRRs and pseudokinase domains may replace the specific binding role of such domains. Thus, the kinetoplastid PTPs and DSPs seem to have adopted different domain organisations and strategies to fulfil analogous functions to mammalian phosphatases. A similar trend was also observed for protein kinases, which in trypanosomes lack the same type of accessory domains as their human orthologues. In fact, four of the five most common Pfam domains in human protein kinases are absent in the trypanosomatids kinome .
Lipid phosphatases (PTEN and myotubularins)
PTEN and myotubularins (MTMs) are DSP phosphatases with activity against phosphoinositides. This peculiar substrate versatility is crucial to their identified biological roles in mammals as tumour suppressors (for PTEN), and as regulators of endosomal trafficking (in the case of MTMs). Their role in lipid dephosphorylation and membrane targeting may also be important in different aspects of kinetoplastid metabolism. PTEN-like phosphatases were identified in the three kinetoplastids (Table 1) with two different architectures: the first has a DSP domain and a calcium-binding region at its C-terminus (type 9, Figure 2) and the second contains only the DSP domain (type 10, Figure 2) and shows lower homology to eukaryotic PTENs. The existence of two types of PTEN phosphatases is also reflected in the phylogenetic analysis, even when the DSP domain alone is considered (Figure 7). The type 9 group is closest to human PTEN and contains 4 T. cruzi and 1 L. major sequence (Figure 7). Interestingly, T. brucei does not seem to have an orthologue of human PTEN, but it has a PTEN-like protein similar to the other two parasites.
Two subfamilies of MTMs were identified that possess the typical domain organisation of MTMs, with a phosphatase domain and N-terminal lipid-binding GRAM domain (type 11, Figure 2), or with and additional PI3P binding FYVE domain (type 12, Figure 2). Each of the parasites has one sequence of each type. Overall, the six kinetoplastid MTM sequences have high similarity to human MTMs across the length of the phosphatase domain. However, they do not individually cluster with any human MTMs, hence direct orthologues cannot be determined (Figure 7). It is interesting that only two MTM sequences are present in each of these organisms as this is one of the largest subfamilies of DSPs in humans (14 genes). Along with the classical types of DSPs, the MTM family appears to have been substantially expanded in higher eukaryotes. MTMs, which are relatively large sequences in humans, are larger still in the kinetoplastids. Most human MTMs are around 450–700aa with a few over 1,000aa. The smallest TriTryp MTM is 872aa (Tb927.1.3300) and the largest is 3,246 aa (LmjF12.0320) with a long N-terminus extension that has no predicted domain or motif matches.
Cdc25 and Arc2 reductases
Cdc25 (cell division cycle 25) phosphatases are widely spread in metazoans and known to be responsible for the activation of cyclin dependent kinases (CDKs) . This is an essential step in progression to mitosis that controls the G2 to M checkpoint. The kinetoplastid cell cycle is also regulated by CDK-related kinases (CRKs)  and 10–11 CRKs have been identified in the TriTryp kinome . Although modulation of CRK activity is still not well understood, the presence of other cell cycle regulatory phosphatases like PP1, PP2A and cdc14 suggest that similar mechanisms may be conserved in kinetoplastids.
In our analysis, two sequences, Tc00.1047053508707.20 and LmjF32.2740, matched the rhodanese-like domain similar to the cdc25 phosphatase catalytic domain. No significant match was found in T. brucei (< 0.01). Orthologues of LmjF32.2740 are also present in L. infantum and L. brazilensis (see Additional file 6).
Rhodanese domains are also found in sulfur-transferases and in the Acr2 type of arsenate reductases, indicating a common evolutionary lineage with phosphatases [61, 62]. In addition, they all share a similar Cys-based catalytic mechanism and a conserved CX5R active site motif . In fact, LmjF32.2740 is homologous to eukaryotic arsenate reductases from S. cerevisiae, Sc ACR2, and Arabidopsis, At ACR2 (formerly annotated as cdc25). Consistent with this, LmjF32.2740 (Lm ACR2) has been found to functionally complement the arsenate-sensitive phenotype of the Sc ACR2 deletion in S. cerevisiae. Furthermore, it confers sensitivity to the antimonial drug Pentostam, by virtue of its antimonial reductase activity in addition to the arsenate reductase activity . Recently, it has been reported that purified Lm ACR2 also exhibits apparent phosphatase activity in vitro . Tc00.1047053508707.20 remains uncharacterised, but its similarity to Lm ACR2 predicts it may have arsenate reductase activity. Phylogenetic analysis further supports the idea that cdc25-like sequences in Leishmania and T. cruzi, are more closely related to eukaryotic Acr2 reductases than to mammalian cdc25s (see Additional file 6). The active site motif of the kinetoplastid sequences is clearly different in composition and properties to the mammalian cdc25 that have acidic residues (E FSSE), but similar to Acr2s with hydrophobic residues (A XSLV).
The functional role of cdc25 in kinetoplastids, as suggested for plants , may be replaced instead by the activity of B type cyclin-dependent kinases (cdck-B). The nearest homologues to the cdck-B in Arabidopsis are the Cdc2-related kinases (CRK1–3, ) identified in T. brucei and L. mexicana. These CDKs are critically involved in cell cycle control mechanisms similarly to Cdc25s in other eukaryotes [67–69].
LMW-PTPs and ArsC reductases
Three kinetoplastid sequences (Tb09.160.2100, LmjF01.0200 and Tc00.1047053504797.120) were initially found to match the InterPro LMW-PTP domain (IPR000106). These sequences were aligned with more than 250 LMW-PTPs from different organisms and also with a group of related bacterial arsenate reductases (ArsC type) for phylogenetic analysis (data not shown). The prokaryote ArsC reductases are unrelated to eukaryote Acr2 reductases, and yet they also share the active site P-loop signature motif CX5R with PTPs and have remarkable structural similarity to LMW-PTPs . The kinetoplastid sequences formed a separate cluster in the phylogenetic tree originating from the same branch as those of the bacterial ArsC but clearly distinct from the LMW-PTPs. The phylogram (see Additional file 6) also shows that cdc25 and Acr2 (yeast, plants and kinetoplastids) are clearly related but distinct from the ArsC reductases found in bacteria and in kinetoplastids.
The three kinetoplastid ArsC sequences have two conserved long insertions. Interestingly, the kinetoplastid ArsC sequences lack the catalytic Cys residue in the active site (substituted by Gly), and the two landmark Cys residues also important in catalysis [71, 72]. However, they contain the catalytic Arg (P-loop) and Asp (general acid in catalysis) residues. This suggests that they may be inactive enzymes or that they use an alternative catalytic mechanism.
ArsC homologues are not present in vertebrates or plants, which use Acr2 instead for metalloid detoxification. It is intriguing then that T. cruzi and L. major have retained the bacterial arsenate reductases ArsC in addition to acquiring the eukaryotic Acr2 and that the Acr2 gene is not present in T. brucei, – with the syntenic region of the genome in that parasite instead containing two predicted amino acid transporter genes (see Additional file 7). Recently, the presence of two putative ArsC proteins in A. fumigatus has been reported , suggesting that some lower eukaryotes may require both systems for arsenate reduction.
STPs are multi-subunit complexes, which combine a catalytic subunit with various regulatory and targeting subunits [26, 74] providing selective substrate specificity, subcellular localisation and regulation of enzymatic activity. STPs share a similar catalytic mechanism and are structurally related, although based on metal ion dependence and sequence homology they can be divided into three main subfamilies: PPP, PPM and FCP . The PPP and PPM subfamilies share structural similarity and a common mechanism of catalysis, with conserved acidic residues that coordinate the metal ions essential for activity. In humans, a small number of genes encode for STP catalytic domains. Conversely, in C. elegans  and Arabidopsis , STPs represent between 50–80% of the phosphatome. We restricted our analysis to the catalytic phosphatase domains for this class. The STP family of phosphatases is the largest represented in kinetoplastids accounting for more than one half of the whole complement of protein phosphatases. Typical domain architectures were identified (Figure 2) containing the conserved phosphatase catalytic domain either alone or with accessory domains like tetratricopeptide repeats (TPRs) that mediate protein-protein interactions, found in PP5 or calcium binding EF-hand domains, found in PPEF proteins. Many kinetoplastid STPs, like the DSPs, are longer proteins than their human relatives, with extensions of unknown function.
PPP phosphatases PP1 and PP2A are involved in the regulation of a number of signalling pathways, including MAPK-dependent networks, cell cycle regulation, glycogen metabolism and microtubule organisation at centrosomes [77–81]. The general pattern that identifies members of the PPP family contains motifs from three separate regions GD XH G – GD XVDRG – GN H E  (residues in bold coordinate metal ions at the catalytic site and the underlined His is the proton donor in catalysis).
There are clearly identifiable clades in the PPP phylogenetic tree for the PP1, PP2A, PP2B, PP4, PP5 and PPEF phosphatases and these groups are well supported statistically (Figure 8).
Protein phosphatase 1 (PP1)
The PP1 group contains a large group of kinetoplastid sequences: 7 in T. cruzi, 8 in T. brucei and 8 in L. major. Two of the T. cruzi PP1catalytic subunits (PP1α and PP1β) have been characterised . Inhibitor studies suggested that in T. cruzi epimastigotes, PP1 has an important role in the completion of cell division and in the maintenance of cell shape . Similar roles were previously proposed for PP1 in higher eukaryotes . The functional roles of the T brucei PP1 and PP2A phosphatases have also been studied . The combined RNAi ablation of 7 PP1s and the PP2A catalytic subunits in procyclic forms, resulted in a slow growth phenotype but did not lead to lethality. This is in contrast with previous experiments in which trypanosomes treated with okadaic acid (OA), a well known inhibitor of PP1 and PP2A enzyme activity, show defects in cytokinesis and organelle genome segregation (multinuclear cells with single kinetoplast) . The seemingly conflicting findings of these two reports (absence of multinucleated cells with RNAi vs. multinucleated cells after okadaic acid treatment) suggests that additional OA sensitive PPP enzyme activities are present in T. brucei although incomplete ablation by RNAi might also be responsible.
Interestingly, 3 PP1c genes in T. cruzi, 4 PP1c genes in T. brucei and 5 PP1c genes in L. major are found in tandem gene arrays. In T. brucei the repeat structure of the array is disrupted through extensive divergence in the flanking and coding sequences. The gene encoding PP1-7 (Tb927.4.3560) has evolved unique UTRs and is separated from the other 3 members (PP1-6 (Tb927.4.3620), PP1-5 (Tb927.4.3630) and PP1-4 (Tb927.4.3640)) by the insertion of two unrelated tandem-arranged genes (each encoding translational elongation factor 1 beta). This organisation retains orthology in both T. cruzi and L. major . Kinetoplastid genes do not generally possess individual promoters and, therefore, cannot be up-regulated through transcriptional initiation. Instead, where high transcript levels are required, genes may be duplicated to form tandem gene arrays, which are co-transcribed. The PP1s in the arrays might needed to be expressed on a certain level, working as "housekeeping genes" while the other PP1 homologues would be in charge of the "fine tuning" of various signal transduction events.
Protein phosphatase 2A and 2B (PP2A and PP2B)
Two groups of PP2A phosphatases were identified for each parasite. One group is more closely related to human, S. cerevisiae and plant PP2As than the other (Figure 8). A member of the second group, PP2A from T. cruzi (Tc00.1047053511021.10), has already been characterised as important for the complete transformation of trypomastigotes into amastigotes during the life cycle of the parasite . Similarly to PP2A, two groups of PP2B are present in kinetoplastids. One group clusters with the human and S. cerevisiae homologues, while the second triplet forms a separate cluster (Figure 8). Sequences in the second cluster have mutations in catalytic residues suggesting they may be pseudophosphatases.
Protein phosphatase 4 and 6 (PP4 and PP6)
Homologues to human PP4 and PP6 have not yet been characterised in any of these kinetoplastids. We found a well-supported PP4 cluster that included a kinetoplastid triplet together with the human and A. thaliana PP4 sequences (Figure 8). PP6 orthologues were found in T. cruzi (Tc00.1047053510687.40) and L. major (LmjF34.4190) by blast search, but there is apparently no T. brucei PP6 orthologue.
Protein phosphatase 5 (PP5)
Homologues of PP5 in T. brucei, T. cruzi and L. major cluster with human and S. cerevisiae PP5 sequences (Figure 8) and they all contain tetratricopeptide (TPR) modules (Figure 2). Analysis of the T. brucei PP5 (Tb PP5)  highlights that the TPRs of this protein are actually similar to those in fungi. Tb PP5 expression is regulated during cell cycle progression and it is important for normal cell growth .
Protein phosphatases with EF Hands (PPEF)
Close to the PP5 sequences are the protein phosphatases with EF-hands (PPEF) (or PP7) (Figure 8). Kinetoplastid PPEF phosphatases are found clustering with human PPEFs, showing that there are two PPEF proteins in T. cruzi and T. brucei but only one in L. major. As there are two human PPEF proteins this suggests either that duplication of the PPEF gene occurred before kinetoplastids diverged and that L. major has lost a copy or that two duplication events occurred independently, in kinetoplastids and during higher eukaryote development. A recent publication discusses the PPEF family of kinetoplastid phosphatases , showing that these enzymes are N-myristoylated and constitutively expressed through all parasites life cycle.
In many eukaryotic organisms there exist "non-conventional" PPPs. These have greater similarity to bacterial enzymes than to other eukaryotic PPP proteins . Alphs (A paH-l ike ph osphatases) are a group of eukaryotic PPPs that have greater similarity to bacterial diadenosine tetraphosphatases than to other members of the PPP family and several kinetoplastid PPP sequences fall into this category (see Additional file 8). These Alphs have specific mutations in the second conserved motif, GDXVDRG, in particular, the substitution of the second Asp for a neutral residue and substitution of the Arg for Lys . These mutations are found in 3 T. cruzi, 3 T. brucei and 2 L. major sequences (Additional file 8).
Kinetoplastid-specific PPPs (kPPP)
Several kinetoplastid PPP sequences have amino acid substitutions of catalytically important residues (Additional file 8). These pseudophosphatases may not be able to carry out dephosphorylation, although they may still have kinetoplastid-specific roles. These sequences form part of a large group of kinetoplastid-specific PPPs (kPPP, Figure 8). The long branch-lengths of several of the proteins in this group and also the Alphs indicate that these kPPP sequences have diverged much more over time than the others. The most interesting aspect of the kPPP group is that sequence similarity searches show some of these sequences have greater similarity to plant and fungal phosphatases, in particular to the BSU1 and BSL types of A. thaliana phosphatases. BSU1 is a nuclear protein phosphatase that modulates the cell response to plant steroid hormones . One of these kinetoplastid-specific sequences, Tb927.6.4630, has previously been identified as a "Shelph" with similarity to PPP phosphatases from Shewanella, a psychrophillic bacteria . Another Shelph exists in L. major, LmjF31.2630 that appears in the PP1 section of the tree (Figure 8).
Despite differences in their primary sequences, the three-dimensional structures of PPP and PPM proteins are very similar and they share a common catalytic mechanism . PPM phosphatases depend on Mg2+ or Mn2+ for catalytic activity. A set of 11 conserved motifs has been identified within the PPM/PP2C family of phosphatases . Motifs 1, 2, 5, 6, 8 and 11 are most conserved in eukaryotes and form the pattern: ED – DG H [AG] – GD – GD – DG – D N (conserved Asp residues coordinate the metal ions Mg2+ or Mn2+ essential for catalysis). The analysis of the PP2C sequences from kinetoplastidsshows that all of them possess the Asp residues in the motifs above, with the only exception of Tc00.1047053504163.10, Tb10.70.1410 and LmjF36.1260, which are missing some of the acidic residues in the metal binding site. These have been previously designated as 'PP2C-like'  because they do not possess all of the PP2C motifs.
The TriTryp PPM family is similar to the human as there are 15 PPMs in human and 14 in T. cruzi, 13 in T. brucei and 15 in L. major. A. thaliana, however, has a larger expansion of this family  with 63 genes. From the phylogenetic analysis (Figure 9) we observed that plant PPM phosphatases, with few exceptions, form distinct clades separate from human, yeast and kinetoplastid sequences. Parasite PPMs have greater similarity to human and yeast sequences, although direct human homologues are not clear from the tree.
PP2C phosphatases are negative regulators of stress-regulated signalling mediated by PKA and the p38 and JNK MAPK pathways [93, 94] in mammals. In yeast and plants, PP2C proteins are also involved in modulating stress response signalling [95–97]. More recently an important role in cell cycle regulation has been reported for human PP2C in dephosphorylation of Cdk2 and Cdk6 , and in cell cycle arrest in Xenopus . Other functions of PP2Cs include the regulation of cytoskeleton organisation  and development . A Leishmania PP2C, Lc PP2C, was shown to be present in both the infective promastigote and tissue amastigote stages of L. chagasi and L. amazonensis. The catalytic properties of LcPP2C were found to be similar to eukaryotic PP2C enzymes with respect to Mg2+ dependence and insensitivity to the inhibitor okadaic acid .
This family comprises a group of phosphatases that dephosphorylate the carboxy-terminal domain (CTD) of RNA polymerase II and that interact with transcription factor TFIIF [103, 104]. In yeast, FCP1 is an essential gene . We found FCP homologues in all three genomes: 13 in T. cruzi, 14 in T. brucei and 13 in L. major matching human and S. cerevisiae proteins (see Additional File 1). These phosphatases dephosphorylate serine residues of the conserved "heptad repeats" YSPTSPS at the CTD of the polymerase, which is required for recycling of the polymerase at the end of transcription. T. brucei has an RNA polymerase II, but this protein lacks the heptad repeats at its C-terminus, as found in most eukaryotes. Despite the lack of heptad repeats, the T. brucei RNA polymerase II is believed to be phosphorylated at alternative sites – perhaps through the C-terminal region which is rich in serine and tyrosine residues-, that may be targeted by the identified CTD phosphatase homologues [105, 106].
Overall, there is a remarkable expansion of the kinetoplastid STP family in comparison to other eukaryotic genomes. Importantly, this highlights the prominent role of serine/threonine phosphorylation in the physiology of these parasites that overwrites the importance of tyrosine phosphorylation as reported in mammalian organisms. This is consistent with the lack of tyrosine kinases  and matches the situation in plants .
The fluctuation of protein phosphorylation in kinetoplastids is well documented and in many instances is clearly related to stage-specific events or cell cycle regulation. As the functional characterisation of protein kinases is quickly expanding, a better knowledge of the phosphatase complement of these unicellular parasites is essential to understand the complexity and regulation of many cellular processes controlled by phosphorylation. Overall we identified more than 250 protein phosphatase domains in the three kinetoplastids, which represents about 50% of the kinase genes and apparently indicates a lower level of complexity for this type of enzymes. Further complexity may result from combinations with interacting regulatory subunits, particularly in the STP family or by stage-specific control of their expression levels.
The analysis of the TriTryp phosphatome provides valuable information for future experimental studies and highlights many interesting differences with other eukaryotic genomes, such as the low proportion of PTPs and the expansion of the STP family. Interestingly, a larger number of atypical protein phosphatases have been identified in these species, comprising approximately one third of the total. Most of the atypical phosphatases belong to the DSP family, and show considerable divergence from classic DSPs. Novel domain architectures suggest potential functional roles for the LRR containing DSPs as scaffolds in cellular events. Many kinetoplastid phosphatases are longer than human phosphatases and those extensions may contain unidentified functionalities. Sequence extensions have also been found in PP2 phosphatases in Plasmodium falciparum  and in a large number of proteins from pathogenic microorganisms . It appears that the presence of such extensions has been conserved through evolution and may have an important role in pathogenesis.
Between the three kinetoplastids there are interesting differences, potentially related to their different environments and parasitic mode. T. brucei, the only extracellular pathogen, seems to have distinct traits, with less phosphatase genes than T. cruzi and L. major. Finally, the STP family appears to be extended in the TriTryp genome in comparison to humans, maybe to compensate for the lower number of PTPs. This correlates well with results from the TriTryp kinome study showing that these parasites are lacking tyrosine specific kinases. Overall, a number of important differences in the phosphatome of the TriTryp suggest that phosphorylation-dependent processes in these organisms may have a distinct organisation and physiological imprint that is worth a more detailed experimental exploration. Moreover, these distinct traits may be exploited in the selection of new targets for drug-design and development of therapeutic compounds taking advantage of the existing specific protein phosphatase inhibitors.
The three protein data sets were obtained from GeneDB  (versions released in May 2006). The T. cruzi genome is a hybrid of two very closely related species and so many genes are present in two very similar forms. Using the Clusters of orthologous genes data  and sequence similarity searching, the shortest of each protein duplicate was removed. The phosphatase classification process was performed in a similar way to the method used previously to classify phosphatases from the human and Aspergillus fumigatus genomes . Four components are required for the ontology classification: the phosphatase ontology (written in the Web Ontology Language OWL ), a description logic reasoner (Racer Pro version 1.9.0) , the Instance Store  and the domain compositions of all protein sequences, obtained through InterProScan , (version 13.1 of the InterPro database) . The OWL ontology for the protein phosphatases was constructed using the Protégé ontology editor , which enables different types of protein phosphatase family members to be defined in terms of their domain architectures. These ontological descriptions of types of phosphatase were combined with the InterProScan domain information for individual proteins from each genome and the classification against the ontology was carried out through the Instance Store. The Instance Store combines a description logic reasoner and a relational database and provides an interface through which the data can be queried to return the results of the classification. The reasoner checks for logical inconsistencies in the ontology and performs the classification of individual phosphatases according to the types described in the ontology. The relational database of the Instance Store enables large numbers of protein instances to be stored. This system provides, in essence, an extension to the InterProScan domain-matching tool, as it is able to work at the level of a whole genome and place proteins in defined classes.
Using dual-specificity protein phosphatases (DSPs) as an example of the process, they are defined in the ontology as any protein possessing, amongst other features, a generic PTP domain (based on the HCX5R consensus motif) (IPR000387) and also a more specific DSP domain (IPR000340). When the Instance Store is queried for all DSPs, it will return a list of all protein sequences that matched both of these domains in InterProScan . The types of phosphatase described in the ontology vary from the general (tyrosine phosphatase) to the specific (R2A Phosphatase). The classification process will place any phosphatase to the most specific type possible. Those proteins not fully classified are either new types not yet completely described by the ontology, or types for which there are no InterPro descriptions. To make sure no phosphatases had been missed using the ontology method, the SMART domain database  was scanned and sequence similarity searching was used (using BlastP programs from both NCBI and Swiss-Prot/TrEMBL ). Note that searches using more recent versions of InterPro may give slightly different results to those reported in this work due to updating of InterPro entries and also entries in the databases scanned by InterPro.
Domain architecture analysis
This was performed with the information obtained from InterProScan . Quality control methods were employed to distinguish true domain matches from low scoring matches. Each domain match was submitted for sequence similarity searching against the Swiss-Prot/TrEMBL database  to determine whether they matched other sequences with the same domain. As InterPro is a secondary domain database and searches other bioinformatics databases, the original motifs or domains descriptions and criteria were investigated for each InterPro match. For example, if the InterPro domain was based on a PRINTS fingerprint  entry then we investigated how many motifs of the fingerprint did actually match the query sequence. If an InterPro domain match had support from several other databases, then it was deemed to have good support and was included in this study.
Sequence analysis of the phosphatase catalytic domains
Multiple alignments were produced with ClustalX  and manually edited with BioEdit  to obtain the most robust alignments for phylogenetic analyses. Detailed sequence analyses were performed for each kinetoplastid phosphatase subfamily. Motifs previously described as conserved in other eukaryotic catalytic domains were compared to the kinetoplastid phosphatase catalytic domains [25, 26, 82, 91, 92, 120–122]. Further sequence similarity searching and motif analysis was done for the "kinatases" to analyse the features of the kinase domain and the conservation of the kinase-phosphatase domain architecture in other organisms. From BlastP searching at Swiss-Prot/TrEMBL , the closest annotated sequences to the kinetoplastid kinatases were found to be mammalian calcium-dependent kinase kinases (CaMKKs). A ClustalX multiple alignment was created of the kinase domains of CaMKKs from several eukaryotes (human, mouse, rat, X. laevis, D. rerio, C. elegans, D. melanogaster, D. discoideum, S. cerevisiae and A. thaliana) to identify conserved motifs. This alignment was manually edited and the conserved residues determined using descriptions of the 11 conserved kinase subdomains. The InterPro and SMART databases were queried to determine if kinatases were present in any other species.
Additional sequence similarity searching was performed for the DSPs containing Leucine Rich Repeats (LRRs) using BlastP . The LRR regions from the six kinetoplastid sequences (LRR-DSPs and kinatases) were analysed for the closest matches in other organisms, particularly human. A BlastP search was also done for the region containing an ankyrin domain in Tc00.1047053510265.70 and the InterPro database queried for any other ankyrin phosphatase domain containing proteins.
A previously established and thorough phylogenetics approach was used  to produce the evolutionary trees. The different ClustalX alignments created from the phosphatase catalytic domain sequence analyses were used for the phylogenetic tree construction. All human, S. cerevisiae and A. thaliana sequences from each phosphatase subfamily were included in the alignments, unless the sequences were fragments or had deletions in important conserved regions. These sequences were included as markers and to give some functional definition to the clades in each tree determining the similarity between T. cruzi, T. brucei and L. major protein phosphatases with those from other eukaryotes of different complexities. BioEdit was used to manually improve the alignments and remove all gap-containing sites before the trees were created.
ClustalX neighbour-joining (NJ) trees were produced for each of the alignments and the reliability of these was tested using three separate methods: Bayesian analysis using MrBayes (version 3.1.2, ), maximum likelihood and maximum parsimony methods using the PHYLIP package (version 3.63)  and Tree-Puzzle (version 5.2, . ClustalX NJ trees were used in favour of PHYLIP Neighbour-Joining as the PHYLIP NJ trees differed from results with the other three methods. This was particularly noticeable in groups of three sequences with one sequence from each parasite, as ClustalX-NJ trees, and Maximum Likelihood and Maximum Parsimony methods would group the two trypanosome sequences together with the Leishmania sequence less closely related, although PHYLIP's NJ methods would consistently show other orientations.
The MrBayes programme Markov Chain Monte Carlo, was used to generate an optimal tree using Bayesian methods. For Maximum Likelihood analysis, firstly gamma correction values were generated from Tree Puzzle . This gives a more accurate model of amino acid substitutions as it models the rate of evolution against the frequency of sites with that rate. The value for each sequence alignment was input into the PHYLIP program, PROML. Global rearrangement was also used to improve the results as this option allows the programme to re-evaluate the placing of each sequence within the tree. Maximum Parsimony was performed using SEQBOOT, PROTPARS and CONSENSE from PHYLIP. For both ClustalX Neighbour Joining and Maximum parsimony 1000 bootstrap replicates were produced to give improved statistical values for the consensus trees. Final consensus trees were produced for every phosphatase alignment integrating results from each method. Bootstrap and Bayesian clade credibility values of 70% and above are shown in the figures.
For the MKP and lipid phosphatases phylogenetic analyses a preliminary NJ tree was produced containing all DSPs and lipid phosphatases from the three kinetoplastids, human, yeast and plant. The MKP and lipid phosphatase-containing regions were of most interest so only sequences from these groups were used for the full analysis using all four methods.
A separate analysis was performed for the kinetoplastid sequences originally identified as Low Molecular Weight PTPs (LMW-PTPs) and Cdc25. A ClustalX multiple sequence alignment included human and yeast LMW-PTP, bacterial ArsC reductases, human and mouse Cdc25 and Arc2 reductases from S. cerevisiae and plants, together with the putative kinetoplastids Cdc25 and LMW-PTPs. A phylogenetic tree was calculated as above.
dual specificity phosphatase
eukaryotic-like dual specificity phosphatase
atypical dual specificity phosphatase
kinetoplastid-specific dual specificity phosphatase
protein tyrosine phosphatase
kinetoplastid-specific protein tyrosine phosphatase
Phosphatase and tensin homolog
low molecular weight protein tyrosine phosphatase
Eyes absent phosphatase
phosphatase of regenerating liver
Mitogen-activated protein kinase phosphatase
cell division cycle phosphatase 25
kinetoplastid-specific phospho-protein phosphatase
protein phosphatase, magnesium or manganese-dependent
TFIIF-stimulated CTD phosphatase
Protein phosphatase with EF-Hand domains
Leucine Rich Repeat.
El-Sayed NM, Myler PJ, Bartholomeu DC, Nilsson D, Aggarwal G, Tran AN, Ghedin E, Worthey EA, Delcher AL, Blandin G, Westenberger SJ, Caler E, Cerqueira GC, Branche C, Haas B, Anupama A, Arner E, Aslund L, Attipoe P, Bontempi E, Bringaud F, Burton P, Cadag E, Campbell DA, Carrington M, Crabtree J, Darban H, da Silveira JF, de Jong P, Edwards K, Englund PT, Fazelina G, Feldblyum T, Ferella M, Frasch AC, Gull K, Horn D, Hou L, Huang Y, Kindlund E, Klingbeil M, Kluge S, Koo H, Lacerda D, Levin MJ, Lorenzi H, Louie T, Machado CR, McCulloch R, McKenna A, Mizuno Y, Mottram JC, Nelson S, Ochaya S, Osoegawa K, Pai G, Parsons M, Pentony M, Pettersson U, Pop M, Ramirez JL, Rinta J, Robertson L, Salzberg SL, Sanchez DO, Seyler A, Sharma R, Shetty J, Simpson AJ, Sisk E, Tammi MT, Tarleton R, Teixeira S, Van Aken S, Vogt C, Ward PN, Wickstead B, Wortman J, White O, Fraser CM, Stuart KD, Andersson B: The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas disease. Science. 2005, 309 (5733): 409-415. 10.1126/science.1112631.
Berriman M, Ghedin E, Hertz-Fowler C, Blandin G, Renauld H, Bartholomeu DC, Lennard NJ, Caler E, Hamlin NE, Haas B, Bohme U, Hannick L, Aslett MA, Shallom J, Marcello L, Hou L, Wickstead B, Alsmark UC, Arrowsmith C, Atkin RJ, Barron AJ, Bringaud F, Brooks K, Carrington M, Cherevach I, Chillingworth TJ, Churcher C, Clark LN, Corton CH, Cronin A, Davies RM, Doggett J, Djikeng A, Feldblyum T, Field MC, Fraser A, Goodhead I, Hance Z, Harper D, Harris BR, Hauser H, Hostetler J, Ivens A, Jagels K, Johnson D, Johnson J, Jones K, Kerhornou AX, Koo H, Larke N, Landfear S, Larkin C, Leech V, Line A, Lord A, Macleod A, Mooney PJ, Moule S, Martin DM, Morgan GW, Mungall K, Norbertczak H, Ormond D, Pai G, Peacock CS, Peterson J, Quail MA, Rabbinowitsch E, Rajandream MA, Reitter C, Salzberg SL, Sanders M, Schobel S, Sharp S, Simmonds M, Simpson AJ, Tallon L, Turner CM, Tait A, Tivey AR, Van Aken S, Walker D, Wanless D, Wang S, White B, White O, Whitehead S, Woodward J, Wortman J, Adams MD, Embley TM, Gull K, Ullu E, Barry JD, Fairlamb AH, Opperdoes F, Barrell BG, Donelson JE, Hall N, Fraser CM, Melville SE, El-Sayed NM: The genome of the African trypanosome Trypanosoma brucei. Science. 2005, 309 (5733): 416-422. 10.1126/science.1112642.
Ivens AC, Peacock CS, Worthey EA, Murphy L, Aggarwal G, Berriman M, Sisk E, Rajandream MA, Adlem E, Aert R, Anupama A, Apostolou Z, Attipoe P, Bason N, Bauser C, Beck A, Beverley SM, Bianchettin G, Borzym K, Bothe G, Bruschi CV, Collins M, Cadag E, Ciarloni L, Clayton C, Coulson RM, Cronin A, Cruz AK, Davies RM, De Gaudenzi J, Dobson DE, Duesterhoeft A, Fazelina G, Fosker N, Frasch AC, Fraser A, Fuchs M, Gabel C, Goble A, Goffeau A, Harris D, Hertz-Fowler C, Hilbert H, Horn D, Huang Y, Klages S, Knights A, Kube M, Larke N, Litvin L, Lord A, Louie T, Marra M, Masuy D, Matthews K, Michaeli S, Mottram JC, Muller-Auer S, Munden H, Nelson S, Norbertczak H, Oliver K, O'Neil S, Pentony M, Pohl TM, Price C, Purnelle B, Quail MA, Rabbinowitsch E, Reinhardt R, Rieger M, Rinta J, Robben J, Robertson L, Ruiz JC, Rutter S, Saunders D, Schafer M, Schein J, Schwartz DC, Seeger K, Seyler A, Sharp S, Shin H, Sivam D, Squares R, Squares S, Tosato V, Vogt C, Volckaert G, Wambutt R, Warren T, Wedler H, Woodward J, Zhou S, Zimmermann W, Smith DF, Blackwell JM, Stuart KD, Barrell B, Myler PJ: The genome of the kinetoplastid parasite, Leishmania major. Science. 2005, 309 (5733): 436-442. 10.1126/science.1112680.
Peacock CS, Seeger K, Harris D, Murphy L, Ruiz JC, Quail MA, Peters N, Adlem E, Tivey A, Aslett M, Kerhornou A, Ivens A, Fraser A, Rajandream MA, Carver T, Norbertczak H, Chillingworth T, Hance Z, Jagels K, Moule S, Ormond D, Rutter S, Squares R, Whitehead S, Rabbinowitsch E, Arrowsmith C, White B, Thurston S, Bringaud F, Baldauf SL, Faulconbridge A, Jeffares D, Depledge DP, Oyola SO, Hilley JD, Brito LO, Tosi LR, Barrell B, Cruz AK, Mottram JC, Smith DF, Berriman M: Comparative genomic analysis of three Leishmania species that cause diverse human disease. Nat Genet. 2007, 39 (7): 839-847. 10.1038/ng2053.
Andrade LO, Andrews NW: The Trypanosoma cruzi-host-cell interplay: location, invasion, retention. Nat Rev Microbiol. 2005, 3 (10): 819-823. 10.1038/nrmicro1249.
Matthews KR: The developmental cell biology of Trypanosoma brucei. J Cell Sci. 2005, 118 (Pt 2): 283-290. 10.1242/jcs.01649.
Atwood JA, Weatherly DB, Minning TA, Bundy B, Cavola C, Opperdoes FR, Orlando R, Tarleton RL: The Trypanosoma cruzi proteome. Science. 2005, 309 (5733): 473-476. 10.1126/science.1110289.
Dell KR, Engel JN: Stage-specific regulation of protein phosphorylation in Leishmania major. Mol Biochem Parasitol. 1994, 64 (2): 283-292. 10.1016/0166-6851(94)00030-1.
Parsons M, Valentine M, Deans J, Schieven GL, Ledbetter JA: Distinct patterns of tyrosine phosphorylation during the life cycle of Trypanosoma brucei. Mol Biochem Parasitol. 1991, 45 (2): 241-248. 10.1016/0166-6851(91)90091-J.
Tonks NK: Protein tyrosine phosphatases: from genes, to function, to disease. Nat Rev Mol Cell Biol. 2006, 7 (11): 833-846. 10.1038/nrm2039.
Tian Q, Wang J: Role of serine/threonine protein phosphatase in Alzheimer's disease. Neurosignals. 2002, 11 (5): 262-269. 10.1159/000067425.
Laporte J, Blondeau F, Buj-Bello A, Mandel JL: The myotubularin family: from genetic disease to phosphoinositide metabolism. Trends Genet. 2001, 17 (4): 221-228. 10.1016/S0168-9525(01)02245-4.
Zhang ZY: Protein tyrosine phosphatases: prospects for therapeutics. Curr Opin Chem Biol. 2001, 5 (4): 416-423. 10.1016/S1367-5931(00)00223-4.
Black DS, Marie-Cardine A, Schraven B, Bliska JB: The Yersinia tyrosine phosphatase YopH targets a novel adhesion-regulated signalling complex in macrophages. Cell Microbiol. 2000, 2 (5): 401-414. 10.1046/j.1462-5822.2000.00061.x.
Lin SL, Le TX, Cowen DS: SptP, a Salmonella typhimurium type III-secreted protein, inhibits the mitogen-activated protein kinase pathway by inhibiting Raf activation. Cell Microbiol. 2003, 5 (4): 267-275. 10.1046/j.1462-5822.2003.t01-1-00274.x.
Singh R, Rao V, Shakila H, Gupta R, Khera A, Dhar N, Singh A, Koul A, Singh Y, Naseema M, Narayanan PR, Paramasivan CN, Ramanathan VD, Tyagi AK: Disruption of mptpB impairs the ability of Mycobacterium tuberculosis to survive in guinea pigs. Mol Microbiol. 2003, 50 (3): 751-762. 10.1046/j.1365-2958.2003.03712.x.
Szoor B, Wilson J, McElhinney H, Tabernero L, Matthews KR: Protein tyrosine phosphatase TbPTP1: A molecular switch controlling life cycle differentiation in trypanosomes. J Cell Biol. 2006, 175 (2): 293-303. 10.1083/jcb.200605090.
Nascimento M, Zhang WW, Ghosh A, Houston DR, Berghuis AM, Olivier M, Matlashewski G: Identification and characterization of a protein-tyrosine phosphatase in Leishmania: Involvement in virulence. J Biol Chem. 2006, 281 (47): 36257-36268. 10.1074/jbc.M606256200.
Orr GA, Werner C, Xu J, Bennett M, Weiss LM, Takvorkan P, Tanowitz HB, Wittner M: Identification of novel serine/threonine protein phosphatases in Trypanosoma cruzi: a potential role in control of cytokinesis and morphology. Infect Immun. 2000, 68 (3): 1350-1358. 10.1128/IAI.68.3.1350-1358.2000.
Erondu NE, Donelson JE: Characterization of trypanosome protein phosphatase 1 and 2A catalytic subunits. Mol Biochem Parasitol. 1991, 49 (2): 303-314. 10.1016/0166-6851(91)90074-G.
Chaudhuri M: Cloning and characterization of a novel serine/threonine protein phosphatase type 5 from Trypanosoma brucei. Gene. 2001, 266 (1-2): 1-13. 10.1016/S0378-1119(01)00367-5.
Mills E, Price HP, Johner A, Emerson JE, Smith DF: Kinetoplastid PPEF phosphatases: dual acylated proteins expressed in the endomembrane system of Leishmania. Mol Biochem Parasitol. 2007, 152 (1): 22-34. 10.1016/j.molbiopara.2006.11.008.
Alonso A, Rojas A, Godzik A, Mustelin T: The dual-specific protein tyrosine phosphatase family. Protein Phosphatases. Edited by: Springer-Verlag. 2004, 5: [http://springerlink.com/content/pua0916rwjwh/?p=e48631ac1b3b45358a2de7edfe7a436d&pi=14]
Andersen JN, Mortensen OH, Peters GH, Drake PG, Iversen LF, Olsen OH, Jansen PG, Andersen HS, Tonks NK, Moller NP: Structural and evolutionary relationships among protein tyrosine phosphatase domains. Mol Cell Biol. 2001, 21 (21): 7117-7136. 10.1128/MCB.21.21.7117-7136.2001.
Cohen PTW: Overview of protein serine/threonine phosphatases. Protein Phosphatases. Edited by: Hohmann S. 2004, Springer-Verlag, 5:
Wolstencroft K, Lord P, Tabernero L, Brass A, Stevens R: Protein classification using ontology classification. Bioinformatics. 2006, 22 (14): e530-8. 10.1093/bioinformatics/btl208.
Parsons M, Worthey EA, Ward PN, Mottram JC: Comparative analysis of the kinomes of three pathogenic trypanosomatids: Leishmania major, Trypanosoma brucei and Trypanosoma cruzi. BMC Genomics. 2005, 6: 127-10.1186/1471-2164-6-127.
Alonso A, Sasin J, Bottini N, Friedberg I, Osterman A, Godzik A, Hunter T, Dixon J, Mustelin T: Protein tyrosine phosphatases in the human genome. Cell. 2004, 117 (6): 699-711. 10.1016/j.cell.2004.05.018.
Wishart MJ, Dixon JE: Gathering STYX: phosphatase-like form predicts functions for unique protein-interaction domains. Trends Biochem Sci. 1998, 23 (8): 301-306. 10.1016/S0968-0004(98)01241-9.
Diamond RH, Cressman DE, Laz TM, Abrams CS, Taub R: PRL-1, a unique nuclear protein tyrosine phosphatase, affects cell growth. Mol Cell Biol. 1994, 14 (6): 3752-3762.
Cates CA, Michael RL, Stayrook KR, Harvey KA, Burke YD, Randall SK, Crowell PL, Crowell DN: Prenylation of oncogenic human PTP(CAAX) protein tyrosine phosphatases. Cancer Lett. 1996, 110 (1-2): 49-55. 10.1016/S0304-3835(96)04459-X.
Taylor GS, Liu Y, Baskerville C, Charbonneau H: The activity of Cdc14p, an oligomeric dual specificity protein phosphatase from Saccharomyces cerevisiae, is required for cell cycle progression. J Biol Chem. 1997, 272 (38): 24054-24063. 10.1074/jbc.272.38.24054.
Kaiser BK, Zimmerman ZA, Charbonneau H, Jackson PK: Disruption of centrosome structure, chromosome segregation, and cytokinesis by misexpression of human Cdc14A phosphatase. Mol Biol Cell. 2002, 13 (7): 2289-2300. 10.1091/mbc.01-11-0535.
Li J, Yen C, Liaw D, Podsypanina K, Bose S, Wang SI, Puc J, Miliaresis C, Rodgers L, McCombie R, Bigner SH, Giovanella BC, Ittmann M, Tycko B, Hibshoosh H, Wigler MH, Parsons R: PTEN, a putative protein tyrosine phosphatase gene mutated in human brain, breast, and prostate cancer. Science. 1997, 275 (5308): 1943-1947. 10.1126/science.275.5308.1943.
Zeng Q, Si X, Horstmann H, Xu Y, Hong W, Pallen CJ: Prenylation-dependent association of protein-tyrosine phosphatases PRL-1, -2, and -3 with the plasma membrane and the early endosome. J Biol Chem. 2000, 275 (28): 21444-21452. 10.1074/jbc.M000453200.
Cuevas IC, Rohloff P, Sanchez DO, Docampo R: Characterization of farnesylated protein tyrosine phosphatase TcPRL-1 from Trypanosoma cruzi. Eukaryot Cell. 2005, 4 (9): 1550-1561. 10.1128/EC.4.9.1550-1561.2005.
Shiu SH, Bleecker AB: Receptor-like kinases from Arabidopsis form a monophyletic gene family related to animal receptor kinases. Proc Natl Acad Sci U S A. 2001, 98 (19): 10763-10768. 10.1073/pnas.181141598.
Korr D, Toschi L, Donner P, Pohlenz HD, Kreft B, Weiss B: LRRK1 protein kinase activity is stimulated upon binding of GTP to its Roc domain. Cell Signal. 2006, 18 (6): 910-920. 10.1016/j.cellsig.2005.08.015.
Gao T, Furnari F, Newton AC: PHLPP: a phosphatase that directly dephosphorylates Akt, promotes apoptosis, and suppresses tumor growth. Mol Cell. 2005, 18 (1): 13-24. 10.1016/j.molcel.2005.03.008.
Li W, Han M, Guan KL: The leucine-rich repeat protein SUR-8 enhances MAP kinase activation and forms a complex with Ras and Raf. Genes Dev. 2000, 14 (8): 895-900.
Selfors LM, Schutzman JL, Borland CZ, Stern MJ: soc-2 encodes a leucine-rich repeat protein implicated in fibroblast growth factor receptor signaling. Proc Natl Acad Sci U S A. 1998, 95 (12): 6903-6908. 10.1073/pnas.95.12.6903.
Forsthoefel NR, Cutler K, Port MD, Yamamoto T, Vernon DM: PIRLs: a novel class of plant intracellular leucine-rich repeat proteins. Plant Cell Physiol. 2005, 46 (6): 913-922. 10.1093/pcp/pci097.
Boudeau J, Miranda-Saavedra D, Barton GJ, Alessi DR: Emerging roles of pseudokinases. Trends Cell Biol. 2006, 16 (9): 443-452. 10.1016/j.tcb.2006.07.003.
Bork P: Hundreds of ankyrin-like repeats in functionally diverse proteins: mobile modules that cross phyla horizontally?. Proteins. 1993, 17 (4): 363-374. 10.1002/prot.340170405.
Beresford N, Patel S, Armstrong J, Szoor B, Fordham-Skelton AP, Tabernero L: MptpB, a virulence factor from Mycobacterium tuberculosis, exhibits triple-specificity phosphatase activity. Biochem J. 2007, 406 (1): 13-18. 10.1042/BJ20070670.
Wishart MJ, Dixon JE: The archetype STYX/dead-phosphatase complexes with a spermatid mRNA-binding protein and is essential for normal sperm production. Proc Natl Acad Sci U S A. 2002, 99 (4): 2112-2117. 10.1073/pnas.251686198.
Kim SA, Vacratsis PO, Firestein R, Cleary ML, Dixon JE: Regulation of myotubularin-related (MTMR)2 phosphatidylinositol phosphatase by MTMR5, a catalytically inactive phosphatase. Proc Natl Acad Sci U S A. 2003, 100 (8): 4492-4497. 10.1073/pnas.0431052100.
Buist A, Zhang YL, Keng YF, Wu L, Zhang ZY, den Hertog J: Restoration of potent protein-tyrosine phosphatase activity into the membrane-distal domain of receptor protein-tyrosine phosphatase alpha. Biochemistry. 1999, 38 (3): 914-922. 10.1021/bi981936b.
Gupta R, Huang Y, Kieber J, Luan S: Identification of a dual-specificity protein phosphatase that inactivates a MAP kinase from Arabidopsis. Plant J. 1998, 16 (5): 581-589. 10.1046/j.1365-313x.1998.00327.x.
Ulm R, Ichimura K, Mizoguchi T, Peck SC, Zhu T, Wang X, Shinozaki K, Paszkowski J: Distinct regulation of salinity and genotoxic stress responses by Arabidopsis MAP kinase phosphatase 1. Embo J. 2002, 21 (23): 6483-6493. 10.1093/emboj/cdf646.
Martin H, Flandez M, Nombela C, Molina M: Protein phosphatases in MAPK signalling: we keep learning from yeast. Mol Microbiol. 2005, 58 (1): 6-16. 10.1111/j.1365-2958.2005.04822.x.
Nordle AK, Rios P, Gaulton A, Pulido R, Attwood TK, Tabernero L: Functional assignment of MAPK phosphatase domains. Proteins. 2007, 69 (1): 19-31. 10.1002/prot.21477.
Pulido R, Zuniga A, Ullrich A: PTP-SL and STEP protein tyrosine phosphatases regulate the activation of the extracellular signal-regulated kinases ERK1 and ERK2 by association through a kinase interaction motif. Embo J. 1998, 17 (24): 7337-7350. 10.1093/emboj/17.24.7337.
Tanoue T, Adachi M, Moriguchi T, Nishida E: A conserved docking motif in MAP kinases common to substrates, activators and regulators. Nat Cell Biol. 2000, 2 (2): 110-116. 10.1038/35000065.
Ellis J, Sarkar M, Hendriks E, Matthews K: A novel ERK-like, CRK-like protein kinase that modulates growth in Trypanosoma brucei via an autoregulatory C-terminal extension. Mol Microbiol. 2004, 53 (5): 1487-1499. 10.1111/j.1365-2958.2004.04218.x.
Wiese M: Leishmania MAP kinases - Familiar proteins in an unusual context. Int J Parasitol. 2007, 37 (10): 1053-1062. 10.1016/j.ijpara.2007.04.008.
Keyse SM, Ginsburg M: Amino acid sequence similarity between CL100, a dual-specificity MAP kinase phosphatase and cdc25. Trends Biochem Sci. 1993, 18 (10): 377-378. 10.1016/0968-0004(93)90092-2.
Boutros R, Dozier C, Ducommun B: The when and wheres of CDC25 phosphatases. Curr Opin Cell Biol. 2006, 18 (2): 185-191. 10.1016/j.ceb.2006.02.003.
Hammarton TC: Cell cycle regulation in Trypanosoma brucei. Mol Biochem Parasitol. 2007, 153 (1): 1-8. 10.1016/j.molbiopara.2007.01.017.
Bordo D, Bork P: The rhodanese/Cdc25 phosphatase superfamily. Sequence-structure-function relations. EMBO Rep. 2002, 3 (8): 741-746. 10.1093/embo-reports/kvf150.
Mukhopadhyay R, Rosen BP: Arsenate reductases in prokaryotes and eukaryotes. Environ Health Perspect. 2002, 110 Suppl 5: 745-748.
Zhou Y, Messier N, Ouellette M, Rosen BP, Mukhopadhyay R: Leishmania major LmACR2 is a pentavalent antimony reductase that confers sensitivity to the drug pentostam. J Biol Chem. 2004, 279 (36): 37445-37451. 10.1074/jbc.M404383200.
Zhou Y, Bhattacharjee H, Mukhopadhyay R: Bifunctional role of the leishmanial antimonate reductase LmACR2 as a protein tyrosine phosphatase. Mol Biochem Parasitol. 2006, 148 (2): 161-168. 10.1016/j.molbiopara.2006.03.009.
Boudolf V, Inze D, De Veylder L: What if higher plants lack a CDC25 phosphatase?. Trends Plant Sci. 2006, 11 (10): 474-479. 10.1016/j.tplants.2006.08.009.
Mottram JC, Smith G: A family of trypanosome cdc2-related protein kinases. Gene. 1995, 162 (1): 147-152. 10.1016/0378-1119(95)00350-F.
Hassan P, Fergusson D, Grant KM, Mottram JC: The CRK3 protein kinase is essential for cell cycle progression of Leishmania mexicana. Mol Biochem Parasitol. 2001, 113 (2): 189-198. 10.1016/S0166-6851(01)00220-1.
Hammarton TC, Clark J, Douglas F, Boshart M, Mottram JC: Stage-specific differences in cell cycle control in Trypanosoma brucei revealed by RNA interference of a mitotic cyclin. J Biol Chem. 2003, 278 (25): 22877-22886. 10.1074/jbc.M300813200.
Tu X, Wang CC: The involvement of two cdc2-related kinases (CRKs) in Trypanosoma brucei cell cycle regulation and the distinctive stage-specific phenotypes caused by CRK3 depletion. J Biol Chem. 2004, 279 (19): 20519-20528. 10.1074/jbc.M312862200.
Bennett MS, Guan Z, Laurberg M, Su XD: Bacillus subtilis arsenate reductase is structurally and functionally similar to low molecular weight protein tyrosine phosphatases. Proc Natl Acad Sci U S A. 2001, 98 (24): 13577-13582. 10.1073/pnas.241397198.
Liu J, Gladysheva TB, Lee L, Rosen BP: Identification of an essential cysteinyl residue in the ArsC arsenate reductase of plasmid R773. Biochemistry. 1995, 34 (41): 13472-13476. 10.1021/bi00041a026.
Messens J, Hayburn G, Desmyter A, Laus G, Wyns L: The essential catalytic redox couple in arsenate reductase from Staphylococcus aureus. Biochemistry. 1999, 38 (51): 16857-16865. 10.1021/bi9911841.
Nierman WC, Pain A, Anderson MJ, Wortman JR, Kim HS, Arroyo J, Berriman M, Abe K, Archer DB, Bermejo C, Bennett J, Bowyer P, Chen D, Collins M, Coulsen R, Davies R, Dyer PS, Farman M, Fedorova N, Feldblyum TV, Fischer R, Fosker N, Fraser A, Garcia JL, Garcia MJ, Goble A, Goldman GH, Gomi K, Griffith-Jones S, Gwilliam R, Haas B, Haas H, Harris D, Horiuchi H, Huang J, Humphray S, Jimenez J, Keller N, Khouri H, Kitamoto K, Kobayashi T, Konzack S, Kulkarni R, Kumagai T, Lafon A, Latge JP, Li W, Lord A, Lu C, Majoros WH, May GS, Miller BL, Mohamoud Y, Molina M, Monod M, Mouyna I, Mulligan S, Murphy L, O'Neil S, Paulsen I, Penalva MA, Pertea M, Price C, Pritchard BL, Quail MA, Rabbinowitsch E, Rawlins N, Rajandream MA, Reichard U, Renauld H, Robson GD, Rodriguez de Cordoba S, Rodriguez-Pena JM, Ronning CM, Rutter S, Salzberg SL, Sanchez M, Sanchez-Ferrero JC, Saunders D, Seeger K, Squares R, Squares S, Takeuchi M, Tekaia F, Turner G, Vazquez de Aldana CR, Weidman J, White O, Woodward J, Yu JH, Fraser C, Galagan JE, Asai K, Machida M, Hall N, Barrell B, Denning DW: Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus. Nature. 2005, 438 (7071): 1151-1156. 10.1038/nature04332.
Bollen M: Combinatorial control of protein phosphatase-1. Trends Biochem Sci. 2001, 26 (7): 426-431. 10.1016/S0968-0004(01)01836-9.
Plowman GD, Sudarsanam S, Bingham J, Whyte D, Hunter T: The protein kinases of Caenorhabditis elegans: a model for signal transduction in multicellular organisms. Proc Natl Acad Sci U S A. 1999, 96 (24): 13603-13610. 10.1073/pnas.96.24.13603.
Kerk D, Bulgrien J, Smith DW, Barsam B, Veretnik S, Gribskov M: The complement of protein phosphatase catalytic subunits encoded in the genome of Arabidopsis. Plant Physiol. 2002, 129 (2): 908-925. 10.1104/pp.004002.
Dombradi V, Axton JM, Brewis ND, da Cruz e Silva EF, Alphey L, Cohen PT: Drosophila contains three genes that encode distinct isoforms of protein phosphatase 1. Eur J Biochem. 1990, 194 (3): 739-745. 10.1111/j.1432-1033.1990.tb19464.x.
Alessi DR, Gomez N, Moorhead G, Lewis T, Keyse SM, Cohen P: Inactivation of p42 MAP kinase by protein phosphatase 2A and a protein tyrosine phosphatase, but not CL100, in various cell lines. Curr Biol. 1995, 5 (3): 283-295. 10.1016/S0960-9822(95)00059-5.
Helps NR, Brewis ND, Lineruth K, Davis T, Kaiser K, Cohen PT: Protein phosphatase 4 is an essential enzyme required for organisation of microtubules at centrosomes in Drosophila embryos. J Cell Sci. 1998, 111 ( Pt 10): 1331-1340.
Gotz J, Probst A, Ehler E, Hemmings B, Kues W: Delayed embryonic lethality in mice lacking protein phosphatase 2A catalytic subunit Calpha. Proc Natl Acad Sci U S A. 1998, 95 (21): 12370-12375. 10.1073/pnas.95.21.12370.
Suzuki Y, Lanner C, Kim JH, Vilardo PG, Zhang H, Yang J, Cooper LD, Steele M, Kennedy A, Bock CB, Scrimgeour A, Lawrence JC, DePaoli-Roach AA: Insulin control of glycogen metabolism in knockout mice lacking the muscle-specific protein phosphatase PP1G/RGL. Mol Cell Biol. 2001, 21 (8): 2683-2694. 10.1128/MCB.21.8.2683-2694.2001.
Barton GJ, Cohen PT, Barford D: Conservation analysis and structure prediction of the protein serine/threonine phosphatases. Sequence similarity with diadenosine tetraphosphatase from Escherichia coli suggests homology to the protein phosphatases. Eur J Biochem. 1994, 220 (1): 225-237. 10.1111/j.1432-1033.1994.tb18618.x.
Ceulemans H, Bollen M: Functional diversity of protein phosphatase-1, a cellular economizer and reset button. Physiol Rev. 2004, 84 (1): 1-39. 10.1152/physrev.00013.2003.
Li Z, Tu X, Wang CC: Okadaic acid overcomes the blocked cell cycle caused by depleting Cdc2-related kinases in Trypanosoma brucei. Exp Cell Res. 2006, 312 (18): 3504-3516. 10.1016/j.yexcr.2006.07.022.
Das A, Gale M, Carter V, Parsons M: The protein phosphatase inhibitor okadaic acid induces defects in cytokinesis and organellar genome segregation in Trypanosoma brucei. J Cell Sci. 1994, 107 ( Pt 12): 3477-3483.
Jackson AP: Tandem gene arrays in Trypanosoma brucei: comparative phylogenomic analysis of duplicate sequence variation. BMC Evol Biol. 2007, 7: 54-10.1186/1471-2148-7-54.
Gonzalez J, Cornejo A, Santos MR, Cordero EM, Gutierrez B, Porcile P, Mortara RA, Sagua H, Da Silveira JF, Araya JE: A novel protein phosphatase 2A (PP2A) is involved in the transformation of human protozoan parasite Trypanosoma cruzi. Biochem J. 2003, 374 (Pt 3): 647-656. 10.1042/BJ20030215.
Andersen JF, Ribeiro JM: A secreted salivary inositol polyphosphate 5-phosphatase from a blood-feeding insect: allosteric activation by soluble phosphoinositides and phosphatidylserine. Biochemistry. 2006, 45 (17): 5450-5457. 10.1021/bi052444j.
Andreeva AV, Kutuzov MA: Widespread presence of "bacterial-like" PPP phosphatases in eukaryotes. BMC Evol Biol. 2004, 4: 47-10.1186/1471-2148-4-47.
Mora-Garcia S, Vert G, Yin Y, Cano-Delgado A, Cheong H, Chory J: Nuclear protein phosphatases with Kelch-repeat domains modulate the response to brassinosteroids in Arabidopsis. Genes Dev. 2004, 18 (4): 448-460. 10.1101/gad.1174204.
Das AK, Helps NR, Cohen PT, Barford D: Crystal structure of the protein serine/threonine phosphatase 2C at 2.0 A resolution. Embo J. 1996, 15 (24): 6798-6809.
Bork P, Brown NP, Hegyi H, Schultz J: The protein phosphatase 2C (PP2C) superfamily: detection of bacterial homologues. Protein Sci. 1996, 5 (7): 1421-1425.
Takekawa M, Maeda T, Saito H: Protein phosphatase 2Calpha inhibits the human stress-responsive p38 and JNK MAPK pathways. Embo J. 1998, 17 (16): 4744-4752. 10.1093/emboj/17.16.4744.
Hanada M, Kobayashi T, Ohnishi M, Ikeda S, Wang H, Katsura K, Yanagawa Y, Hiraga A, Kanamaru R, Tamura S: Selective suppression of stress-activated protein kinase pathway by protein phosphatase 2C in mammalian cells. FEBS Lett. 1998, 437 (3): 172-176. 10.1016/S0014-5793(98)01229-0.
Maeda T, Wurgler-Murphy SM, Saito H: A two-component system that regulates an osmosensing MAP kinase cascade in yeast. Nature. 1994, 369 (6477): 242-245. 10.1038/369242a0.
Shiozaki K, Russell P: Counteractive roles of protein phosphatase 2C (PP2C) and a MAP kinase kinase homolog in the osmoregulation of fission yeast. Embo J. 1995, 14 (3): 492-502.
Rodriguez PL: Protein phosphatase 2C (PP2C) function in higher plants. Plant Mol Biol. 1998, 38 (6): 919-927. 10.1023/A:1006054607850.
Cheng A, Kaldis P, Solomon MJ: Dephosphorylation of human cyclin-dependent kinases by protein phosphatase type 2C alpha and beta 2 isoforms. J Biol Chem. 2000, 275 (44): 34744-34749. 10.1074/jbc.M006210200.
Ofek P, Ben-Meir D, Lavi S: An inducible system to study the growth arrest properties of protein phosphatase 2C. Methods Enzymol. 2003, 366: 338-347.
Hishiya A, Ohnishi M, Tamura S, Nakamura F: Protein phosphatase 2C inactivates F-actin binding of human platelet moesin. J Biol Chem. 1999, 274 (38): 26705-26712. 10.1074/jbc.274.38.26705.
Strovel ET, Wu D, Sussman DJ: Protein phosphatase 2Calpha dephosphorylates axin and activates LEF-1-dependent transcription. J Biol Chem. 2000, 275 (4): 2399-2403. 10.1074/jbc.275.4.2399.
Burns JM, Parsons M, Rosman DE, Reed SG: Molecular cloning and characterization of a 42-kDa protein phosphatase of Leishmania chagasi. J Biol Chem. 1993, 268 (23): 17155-17161.
Archambault J, Pan G, Dahmus GK, Cartier M, Marshall N, Zhang S, Dahmus ME, Greenblatt J: FCP1, the RAP74-interacting subunit of a human protein phosphatase that dephosphorylates the carboxyl-terminal domain of RNA polymerase IIO. J Biol Chem. 1998, 273 (42): 27593-27601. 10.1074/jbc.273.42.27593.
Kobor MS, Archambault J, Lester W, Holstege FC, Gileadi O, Jansma DB, Jennings EG, Kouyoumdjian F, Davidson AR, Young RA, Greenblatt J: An unusual eukaryotic protein phosphatase required for transcription by RNA polymerase II and CTD dephosphorylation in S. cerevisiae. Mol Cell. 1999, 4 (1): 55-62. 10.1016/S1097-2765(00)80187-2.
Chapman AB, Agabian N: Trypanosoma brucei RNA polymerase II is phosphorylated in the absence of carboxyl-terminal domain heptapeptide repeats. J Biol Chem. 1994, 269 (7): 4754-4760.
Dasgupta A, Sharma S, Das A, Sarkar D, Majumder H: Carboxy terminal domain of the largest subunit of RNA polymerase II of Leishmania donovani has an unusually low number of phosphorylation sites. Med Sci Monit. 2002, 8 (5): CR341-50.
Garcia A, Cayla X, Barik S, Langsley G: A family of PP2 phosphatases in Plasmodium falciparum and parasitic protozoa. Parasitol Today. 1999, 15 (3): 90-92. 10.1016/S0169-4758(99)01393-9.
Cherkasov A, Lee SJ, Nandan D, Reiner NE: Large-scale survey for potentially targetable indels in bacterial and protozoan proteins. Proteins. 2006, 62 (2): 371-380. 10.1002/prot.20631.
GeneDB homepage. [http://www.genedb.org/]
Horrocks I, Patel-Schneider PF, van Harmelen F: From SHIQ and RDF to OWL: The making of a web ontology language. Journal of Web Semantics. 2003, 1 (1): 7-26.
Racer Systems GmbH & Co. KG: RacerPro. [http://www.racer-systems.com/]1.9.0
Bechhofer S, Horrocks I, Turi D: The OWL instance store: System description. Proc of the 20th Int Conf on Automated Deduction. Edited by: Nieuwenhuis R. 2005, Springer, 177-181.
Zdobnov EM, Apweiler R: InterProScan--an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001, 17 (9): 847-848. 10.1093/bioinformatics/17.9.847.
Apweiler R, Attwood TK, Bairoch A, Bateman A, Birney E, Biswas M, Bucher P, Cerutti L, Corpet F, Croning MD, Durbin R, Falquet L, Fleischmann W, Gouzy J, Hermjakob H, Hulo N, Jonassen I, Kahn D, Kanapin A, Karavidopoulou Y, Lopez R, Marx B, Mulder NJ, Oinn TM, Pagni M, Servant F, Sigrist CJ, Zdobnov EM: InterPro--an integrated documentation resource for protein families, domains and functional sites. Bioinformatics. 2000, 16 (12): 1145-1150. 10.1093/bioinformatics/16.12.1145.
Letunic I, Copley RR, Schmidt S, Ciccarelli FD, Doerks T, Schultz J, Ponting CP, Bork P: SMART 4.0: towards genomic data integration. Nucleic Acids Res. 2004, 32 (Database issue): D142-4. 10.1093/nar/gkh088.
Bairoch A, Apweiler R: The SWISS-PROT protein sequence data bank and its new supplement TREMBL. Nucleic Acids Res. 1996, 24 (1): 21-25. 10.1093/nar/24.1.21.
Attwood TK: The PRINTS database: a resource for identification of protein families. Brief Bioinform. 2002, 3 (3): 252-263. 10.1093/bib/3.3.252.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 25 (24): 4876-4882. 10.1093/nar/25.24.4876.
Hall T: BioEdit. [http://www.mbio.ncsu.edu/BioEdit/bioedit.html]7.0.5
Lee JO, Yang H, Georgescu MM, Di Cristofano A, Maehama T, Shi Y, Dixon JE, Pandolfi P, Pavletich NP: Crystal structure of the PTEN tumor suppressor: implications for its phosphoinositide phosphatase activity and membrane association. Cell. 1999, 99 (3): 323-334. 10.1016/S0092-8674(00)81663-3.
Begley MJ, Taylor GS, Kim SA, Veine DM, Dixon JE, Stuckey JA: Crystal structure of a phosphoinositide phosphatase, MTMR2: insights into myotubular myopathy and Charcot-Marie-Tooth syndrome. Mol Cell. 2003, 12 (6): 1391-1402. 10.1016/S1097-2765(03)00486-6.
Kamenski T, Heilmeier S, Meinhart A, Cramer P: Structure and mechanism of RNA polymerase II CTD phosphatases. Mol Cell. 2004, 15 (3): 399-407. 10.1016/j.molcel.2004.06.035.
Altschul SF GW Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.
Huxley-Jones J, Robertson DL, Boot-Handford RP: On the origins of the extracellular matrix in vertebrates. Matrix Biol. 2007, 26 (1): 2-11. 10.1016/j.matbio.2006.09.008.
Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics. 2001, 17 (8): 754-755. 10.1093/bioinformatics/17.8.754.
Felsenstein J: PHYLIP. [http://evolution.genetics.washington.edu/phylip.html]
Strimmer K, von Haeseler A: Quartet puzzling: a quartet maximum likelihood method for reconstructing tree topologies. Mol Biol and Evolution. 1996, 13: 964-969.
Stern A, Privman E, Rasis M, Lavi S, Pupko T: Evolution of the metazoan protein phosphatase 2C superfamily. J Mol Evol. 2007, 64 (1): 61-70. 10.1007/s00239-006-0033-y.
Laporte J, Blondeau F, Buj-Bello A, Tentler D, Kretz C, Dahl N, Mandel JL: Characterization of the myotubularin dual specificity phosphatase gene family from yeast to human. Hum Mol Genet. 1998, 7 (11): 1703-1712. 10.1093/hmg/7.11.1703.
Sakumoto N, Mukai Y, Uchida K, Kouchi T, Kuwajima J, Nakagawa Y, Sugioka S, Yamamoto E, Furuyama T, Mizubuchi H, Ohsugi N, Sakuno T, Kikuchi K, Matsuoka I, Ogawa N, Kaneko Y, Harashima S: A series of protein phosphatase gene disruptants in Saccharomyces cerevisiae. Yeast. 1999, 15 (15): 1669-1679. 10.1002/(SICI)1097-0061(199911)15:15<1669::AID-YEA480>3.0.CO;2-6.
Farkas I, Dombradi V, Miskei M, Szabados L, Koncz C: Arabidopsis PPP family of serine/threonine phosphatases. Trends Plant Sci. 2007, 12 (4): 169-176. 10.1016/j.tplants.2007.03.003.
Carver TJ, Rutherford KM, Berriman M, Rajandream MA, Barrell BG, Parkhill J: ACT: the Artemis Comparison Tool. Bioinformatics. 2005, 21 (16): 3422-3423. 10.1093/bioinformatics/bti553.
We are grateful to Dr. P. Eyers for useful discussions. R.B. is a recipient of a BBSRC Strategic Studentship award (BBS/Q/Q/2004/06222). This work was partially funded by Wellcome Trust grant 068793 and the EU Research Training Network MRTN-CT-2006-035830.
RB is responsible for data collection, phylogenetic analysis and drafting of the manuscript. JHJ contributed to the production and analysis of phylogenetic trees; HT, HM, BS contributed to several aspects of data analysis; RS is responsible for the creation of the phosphatase ontology and supervision of the computational methods; KM contributed to data analysis, biological insights and the writing of the manuscript. LT conceived the study, analysed data and wrote the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Table S1. A list of all kinetoplastid sequences described in this study. Sequences are listed in subgroups with the systematic GeneDB ID used in this analysis. Swiss-Prot IDs are also provided. As the T. cruzi proteome contains sequences from two closely related species, the 'duplications' have been listed where present. 'X' means there is no duplication. InterPro domain definitions: IPR000387, Tyr-specific phosphatase; IPR000242, Protein tyrosine phosphatase; IPR003595, Protein tyrosine phosphatase – catalytic; IPR000340, Dual-specificity phosphatase; IPR000719, Protein kinase; IPR011009, Protein kinase-like; IPR001611, Leucine-rich repeat; IPR003591, Leucine-rich repeat, typical subtype; IPR002110, Ankyrin; IPR004861, Protein tyrosine phosphatase SIW14-like; IPR008973, C2 calcium/lipid-binding region CaLB; IPR014019, Phosphatase tensin type; IPR014020, C2 tensin-type; IPR010569, Myotubularin-related; IPR011011, Zinc finger FYVE/PHD-type; IPR000306, Zinc finger FYVE-type; IPR001763, Rhodanese-like; IPR006186, Serine/threonine-specific protein phosphatase and bis(5-nucleosyl)-tetraphosphatase; IPR004843, Metallophosphoesterase; IPR001440, Tetratricopeptide TPR_1; IPR011236, Protein phosphatase 5; IPR013026, Tetratricopeptide region; IPR011990, Tetratricopeptide-like helical; IPR013235, Serine/Threonine phosphatase PPP5; IPR002048, Calcium-binding EF-hand; IPR011992, EF-Hand type; IPR013105, Tetratricopeptide TPR_2; IPR015655, Protein phosphatase 2C; IPR001932, Protein phosphatase 2C-related; IPR014045, Protein phosphatase 2C N-terminal; IPR010822, Sporulation stage II, protein E C-terminal; IPR000222, PP2C manganese/magnesium aspartate binding site; IPR004274, NLI interacting factor; IPR011948, Dullard-like phosphatase. Domains highlighted in red were removed from the analysis as they had poor matches and the query sequence lacked similarity to other sequences containing the same domain: IPR000834, Peptidase M14 carboxypeptidase A; ATP-dependent helicase DEAD-box; IPR003006, Immunoglobulin MHC motif; IPR006035, Ureohydrolase; IPR001356, Homeobox; IPR000106, Protein tyrosine phosphatase low molecular weight; IPR000560, Histidine acid phosphatase; IPR013323, SIAH-type; IPR001395, Aldo/keto reductase; IPR008984, SMAD/FHA; IPR001865, Ribosomal protein S2; IPR005829, Sugar transporter superfamily; IPR001360, Glycoside hydrolase family 1; IPR002086, Aldehyde dehydrogenase. (XLS 56 KB)
Additional file 3: Figure S1. Synteny of PTPs in the three kinetoplastids. Comparison of the corresponding syntenic regions from the L. major, T. cruzi and T. brucei genomes around Lm PTP1. Analyses were conducted via TBlastX using the Artemis comparison tool  with an E value of and default Gap settings. Outputs were manually annotated using GeneDB annotations. (PDF 615 KB)
Additional file 4: Table S3. Kinetoplastid-specific motifs in TriTryp classical Tyr-specific PTPs. Sequence motifs conserved in kinetoplastid PTPs but not in other eukaryotes. PcT1 and PcT2 are located at the N-terminal region before the phosphatase catalytic domain. T1–T4 are located in the phosphatase catalytic domain. (DOC 56 KB)
Additional file 6: Figure S2 Phylogenetic tree of LMW-PTPs, mammalian Cdc25 with arsenate reductases type ArsC and ACR2. Sequences were inferred using the Neighbour-Joining method of PHYLIP. LMW-PTPs from Homo sapiens (HUMAN_P24666) and Saccharomyces cerevisiae (YEAST_P40347) are shown. ArsC reductases are included from Bacillus subtilis (BACSU_P45947) and Staphylococcus aureus (STAAU_P0A006). Kinetoplastid sequences found grouping with these ArsC reductases include T. brucei (Tb09.160.2100), T. cruzi (Tc00.1047053504797.120), and L. major (LMJ_0020). CDC25s from H. sapiens (HUMAN_P30307) and Rattus norvegicus (RAT_P48966) are shown. A second group of kinetoplastid sequences including Leishamania infantum (LEIIN_A4I895), L. major (LmjF32.2740), Leishmania braziliensis (LbrM32_V2.2980), and T. cruzi (Tc00.1047053508707.20) were found to group with the ACR2 reductases from S. cerevisiae (YEAST_Q06597), and the plants Arabidopsis thaliana (ARATH_Q8GY31), and Oryza sativa (ORYSJ_Q9AV34). The tree is unrooted and the scale indicating amino acid replacements per site is shown. (PDF 245 KB)
Additional file 7: Figure S3. Synteny of Cdc25-like Acr2 between L. major and T. brucei. Comparison of the corresponding syntenic regions from the L. major and T. brucei genomes around the ACR2 phosphatase (LmjF32.2740). Analyses were conducted via TBlastX using the Artemis comparison tool  with an E value of 1.00 and default Gap settings. Outputs were manually annotated using GeneDB annotations. (PDF 659 KB)
Additional file 8: Table S5. Motif analysis for the PPP family of Ser/Thr phosphatases. Residues in red are substitutions from the conserved pattern: GD XH G – GD XVDRG – GN HE (residues in bold coordinate metal ions at the catalytic site and the underlined His is the proton donor in catalysis). (DOC 48 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.