The protein-phosphatome of the human malaria parasite Plasmodium falciparum
© Wilkes and Doerig; licensee BioMed Central Ltd. 2008
Received: 08 April 2008
Accepted: 15 September 2008
Published: 15 September 2008
Malaria, caused by the parasitic protist Plasmodium falciparum, represents a major public health problem in the developing world. The P. falciparum genome has been sequenced, which provides new opportunities for the identification of novel drug targets. We report an exhaustive analysis of the P. falciparum genomic database (PlasmoDB) aimed at identifying and classifying all protein phosphatases (PP) in this organism.
Using a variety of bioinformatics tools, we identified 27 malarial putative PP sequences within the four major established PP families, plus 7 sequences that we predict to dephosphorylate "non-protein" substrates. We constructed phylogenetic trees to position these sequences relative to PPs from other organisms representing all major eukaryotic phyla except Cercozoans (for which no full genome sequence is available). Predominant observations were: (i) P. falciparum possessed the smallest phosphatome of any of the organisms investigated in this study; (ii) no malarial PP clustered with the tyrosine-specific subfamily of the PTP group (iii) a cluster of 7 closely related members of the PPM/PP2C family is present, and (iv) some P. falciparum protein phosphatases are present in clades lacking any human homologue.
The considerable phylogenetic distance between Apicomplexa and other Eukaryotes is reflected by profound divergences between the phosphatome of malaria parasites and those of representative organisms from all major eukaryotic phyla, which might be exploited in the context of efforts for the discovery of novel targets for antimalarial chemotherapy.
Eukaryotic protein phosphatases
The reversible phosphorylation of proteins represents a ubiquitous regulatory mechanism for diverse pathways and systems in eukaryotic cells. The process is controlled by a balance between the antagonistic activities of protein kinases, which catalyse the phosphorylation of serine, threonine or tyrosine residues predominantly (reviewed in [1, 2]), and more marginally of other residues, notably histidine [3, 4], and those of protein phosphatases, which cleave the monophosphate esters from the phosphorylated form of the same residues (reviewed in [4–6]). A large range of kinases have been identified, which seem to have arisen by multiple gene duplication events with subsequent selection . In contrast the number of different protein phosphatase catalytic subunits is much lower than that of kinases, and phosphatases are in general less discriminating than most kinases in substrate selectivity. This lack of specificity combined with high catalytic efficiency suggest that a 'naked' protein phosphatase activity is potentially toxic . The specificity and regulation of many of these enzymes is in fact mediated by accessory proteins (the phosphatase regulatory subunits), a wide variety of which interact with the relatively small repertoire of catalytic subunits (this is not the case for the PTP group, see below). As a consequence, it is speculated that the total number of protein phosphatase holoenzymes involved in regulatory pathways matches, or even exceeds the protein kinase repertoire [8–10]. There are four broad families of protein phosphatases with distinct evolutionary histories:
1. The PPP group. PPP sequences (P hospho-P rotein P hosphatases) are highly conserved, and constitute perhaps the most highly conserved set of sequences across the eukaryotic kingdom [11, 12]. They encode a wide variety of phosphatase activities directed not only at phosphoproteins but at other substrates as well. The dependency of these enzymes on Mn2+, Ca2+ and/or Co2+ led to members of this group being called metallophosphatases. The PPP group, which constitutes a subgroup of metallophosphatases, is the most extensively studied type of protein phosphatase. Classically these enzymes were classified into three major groups, PP1, PP2A and PP2B, defined in terms of substrate specificity and inhibitor sensitivity . This classification has been extended in recent years with the identification of a range of sequences related to, but distinct from, PP2A, and a of series of sequences which diverged from the other PPPs early in the evolutionary history of the eukaryotes [6, 14]. Thus the PPP family (reviewed in ) now comprises as many as eight distinct subtypes of serine/threonine phosphatases: PP1, PP2A, PP2B (calcineurin, PP3), PP4, PP5, PP6, PP7 and the plant-specific BSU subfamily, which is closely related to PP1 and characterised by the presence of a diagnostic Kelch motif . Among these subtypes, PP2, -4 and -6 are closely related to each other and have been grouped in a distinct subfamily .
Furthermore, a family of bacterial-like PPP sequences found in eukaryotes (including in P. falciparum) has recently been described . Whereas three highly conserved motifs (GDXHG, GDXXDRG and GNH [E/D]) mediating metal coordination in the active centre are considered as the signature of the PPP family, sequences showing no similarities to the known PPP phosphatases beyond the presence of the GDXHG and GDXXDRG motifs were identified in Plants, Plasmodium, Trypanosoma and some fungi. This revealed the existence in eukaryotes of "non-conventional" branches of the PPP family (reviewed in ).
2. The PPM/PP2C group comprises a highly diverse, evolutionarily recent set of enzymes with Mg2+ or Mn2+-dependent serine/threonine phosphatase activities. The active forms appear to be highly diverse monomeric polypeptides which in many cases possess regulatory domains in C- or N-terminal extensions. A number of defined motifs and conserved residues relate to binding of activating metal ions, water and phosphate groups, as in the PPP type enzymes, but there is no discernable sequence homology between the two groups, despite remarkable structural similarity . A major part of the functions of PP2C (PPM) activities in a variety of species appears to be to modulate stress responses [5, 19]. PPM enzymes form part of a superfamily that includes bacterial forms (SpoIIE) and a mitochondrial pyruvate dehydrogenase phosphatase. The PPM family of protein phosphatases is greatly expanded in plants .
3. The PTP (Protein Tyrosine Phosphatase) superfamily, which is subdivided into three main families: the tyrosine-specific phosphatases, the dual-specificity PTPs (which include the cdc25-like, the Ccdc14 and the MAPK phosphatase groups [20–22]), and the low molecular weight phosphatases . The tyrosine and dual-specificity phosphatases are involved in signalling, cell growth and differentiation, and in the control of cell cycle progression (for example, cdc25 is a major regulator of cyclin-dependent kinase activity , and cdc14 regulates mitosis exit by dephosphorylating CDK targets ). The enzymes share a common catalytic mechanism mediated by cysteine, arginine and aspartic acid residues. Supplementary domains assist in targeting and substrate specificity , in contrast to most other types of phosphatases, which require interaction with regulatory proteins for proper substrate binding.
4. The NIF group (N LI i nteracting f actor-like phosphatase) includes the FCP1 (TF IIF-associating C-terminal domain (CTD) p hosphatase 1) and SCP (S mall C TD p hosphatases) [25, 26]. These phosphatases are responsible for dephosphorylation of the carboxy-terminal domain (CTD) of RNA polymerase II and interact with the transcription factor TFIIF [27, 28]. The function appears to be the dephosphorylation of serine residues within the conserved heptad repeat in the C-terminal, which is required to reactivate the polymerase after termination of transcription. The NIF phosphatases have a DxDx(T/V) motif in the active site .
The case for a Plasmodium phosphatome study
Malaria remains a major public health problem in tropical and subtropical regions, bearing a huge socio-economic impact on affected countries, most of which are in the developing world. Malaria parasites have a complex life cycle. Infection of human beings by Plasmodium falciparum, the species responsible for the lethal form of human malaria, begins with the bite of an infected Anopheles mosquito, which delivers sporozoites into the bloodstream. These cells establish an infection inside hepatocytes, where they undergo intense multiplication generating several thousand merozoites, a process called exo-erythrocytic schizogony. The merozoites invade erythrocytes, where they also undergo schizogony, the process that is responsible for malaria pathogenesis. Some merozoites, however, arrest the cell cycle and differentiate into male or female gametocytes, which are infective to the mosquito. Once ingested by the insect, the gametocytes develop into gametes and fuse into a zygote. Further development in the mosquito involves a process of sporogony, producing sporozoites that accumulate in the salivary glands and are now ready to infect a new human host (see http://www.malaria.org for information on malaria).
The study of signalling processes (in particular those involving protein phosphorylation/dephosphorylation) in malaria parasites presents considerable interest, both in terms of fundamental biology (how does a eukaryote that is phylogenetically very distant from model organisms regulate growth, proliferation, differentiation and transition between its complex developmental stages?) and in terms of the search for urgently needed novel drug targets [30, 31]. The sequencing of the P. falciparum genome  and the availability of an interactive genomic database (PlasmoDB, http://www.plasmodb.org)  have dramatically facilitated the identification of potential targets. Probabilistic models of peptide domains sharing an evolutionary history (Hidden Markov Models, HMMs) permit the rapid scanning of a set of conceptual translations from any organism whose genomic sequence is available . The plasmodial kinome has thus been characterised and highlighted profound divergences from the kinomes of other eukaryotes [35, 36]. Although a number of studies have been published on individual phosphatases of malaria parasites (see below), a full "phosphatome" analysis has not been reported, while studies of both the kinomes  and phosphatomes  of other major parasitic unicellular eukaryotes, the trypanosomatids, have recently been published. Here, we use the Pfam collection of HMMs  to investigate the phosphatome of P. falciparum in relation to that of members from all major groups of the eukaryotic kingdom .
Results and discussion
Protein phosphatase-encoding genes in representative organisms from major eukaryotic groups
A high-resolution version of Figure 3 is available as a PNG file (see additional file 7)
PPP group – Metallophosphatases
Constitution of the PPP dataset and construction of a phylogenetic tree
A list of the P. falciparum sequences in the four groups of protein phosphatases, with the PlasmoDB annotation available at the time of analysis.
protein serine/threonine phosphatase
serine/threonine protein phosphatase, putative
PP1-like protein serine/threonine phosphatase
serine/threonine protein phosphatase, putative
erythrocyte membrane-associated antigen
HT; SP; API
protein phosphatase, putative
serine/threonine protein phosphatase, putative
serine/threonine protein phosphatase pfPp5
RNA lariat debranching enzyme, putative
DNA repair exonuclease, putative
vacuolar protein sorting 29, putative
acid phosphatase, putative
protein phosphatase 2c-like protein, putative
hypothetical protein, conserved
Protein phosphatase 2C, putative
protein phosphatase 2C
protein phosphatase 2c, putative
protein phosphatase, putative
Protein phosphatase 2C
Protein phosphatase 2C, putative
protein phosphatase, putative
protein phosphatase 7 homolog, putative
dual-specificity protein phosphatase, putative
protein tyrosine phosphatase, putative
nif-like protein, putative
hypothetical protein, conserved
NLI interacting factor-like phosphatase, putative
The "PPP sequences" region of the tree is shown in greater detail in Figure 3B. All families of the PPP type protein phosphatases as identified by Cohen et al.  are represented in P. falciparum, as well as an additional type found only in plants and containing the Kelch motif , with which the plasmodial sequence PF14_0630 [A] clearly clusters [throughout the article, the capital letter in square brackets following a PlasmoDB identifier refers to the labelling on the figures]; this is the only P. falciparum sequence occurring in a PPP group with no homologues in humans.
Previously characterised P. falciparum metallophosphatases
Many of the plasmodial sequences in this group have been the subjects of previous reports in the literature:
PF14_0630 [A] (BSU subfamily)
This protein has been first identified as a PP1-related enzyme, which is confirmed by the position of this sequence in our phylogenetic tree (Fig. 3A); this enzyme was called PfPPαg. Subsequently, it was found that PfPPα has close relatives in plants, which, like the latter enzyme, encode tandem Kelch motifs in their N-terminal extension; the name "PPKLs" (Protein Phosphatases with Kelch-Like domains) was suggested to designate members of this subfamily of PP1 enzymes . The PPKL gene structure is conserved in homologous sequences from the Apicomplexans Cryptosporidium hominis, Toxoplasma gondii and Theileria parva (one sequence per genome), as well as in the plants Arabidopsis thaliana and Oryza sativa (3 and 4 occurrences respectively). Kelch motifs form distinctive 'propeller like' tertiary structures proposed to mediate interactions with regulatory subunits . At least one of the three A. thaliana gene products is found in the nucleus and appears to be involved in regulating the signal from the brassinosteroid plant hormones . The limited distribution of PPKLs (these proteins have been found only in Plants and Apicomplexa, which is consistent with our phylogenetic tree) is reminiscent with that of other gene families and in line with the proposed photosynthetic ancestry of Apicomplexa . The absence of PPKLs in Opisthokonts suggests PF14_0630 might be a target for parasite-selective inhibition.
PF14_0142 [B] (PP1 type)
This protein exhibits the properties of a typical PP1 phospho-serine/threonine phosphatase, and an inhibitor profile consistent with PP1 type activity (IC50 values for tautomycin, I-1, I-2 and okadaic acid being 0.8, 400, 7 and 100 nM, respectively) . The protein appears to be expressed in all life cycle stages as judged by Western blot analysis. Microarray analysis indicates a small reduction in expression during the mid-trophozoite stage. RNAi of this sequence resulted in the ablation of PP1 expression, as well as in the impairment of parasite growth (as measured by 3H-hypoxanthine incorporation); the subsequent finding that P. falciparum does not possess the molecular machinery that mediates RNA interference makes these data difficult to interpret. However, the function of this protein was subsequently confirmed in vivo through complementation of a yeast mutant deficient in PP1 activity .
PF14_0224 [C] (PP7 subgroup)
This protein has been described previously as PfPPJ . The phosphatase activity is okadaic acid-resistant, and catalysis requires Mn2+ but no other cations (Mg2+ or Ca2+). Sequence analysis confirmed the presence of the usual metal coordinating, phosphate binding and water activation motifs, but indicated substantial differences from the PP1, PP2A and calcineurin subgroups. This is consistent with our assignment of the sequence to the PP7 subgroup, which appears to have diverged from the other grouping very early in the evolution of the eukaryotes . Similar to PF08_0129 discussed above, subsequent analysis demonstrated the primary PF14_0224 translation product to be much larger than the PP catalytic domain, with two EF-hand motifs that must be occupied by calcium for the enzyme to become fully active . The small size originally predicted for PfPPJ was due to a spurious stop codon in the original cDNA , but a fragment corresponding in size to this is apparently produced by post-translational processing detected by Western blotting.
PFC0595c [D] (PP2/4/6 type)
This protein has been described previously as PfPPβ . The initial sequence analysis assigned this enzyme to the PP2A group of protein phosphatases. Our analysis suggests, however, that the sequence is a member of the closely related PP2/4/6 family (with closer clustering with the PP4 subgroup), which has been implicated in cell cycle regulation . Although gametocyte-specific expression of PFC0595c mRNA expression was originally reported, microarray data  indicate that the gene is expressed at all stages of the asexual cycle, as well as in sporozoites and gametocytes.
PF08_0129 [F] (PP3, PP2B or calcineurin subgroup)
Our phylogenetic analysis indicates this sequence to be the only one encoding a calcineurin-type enzyme (can) in the P. falciparum genome. A calcineurin type activity (okadaic acid insensitive, calcium dependent) which is non-competitively inhibited by cyclosporine/cyclophilin has been described in the parasite  and subsequently attributed to the protein encoded by PF08_0129, which contains a calmodulin-binding domain. The protein appears to be subject to post-translational proteolysis producing a constitutively active core from a large precursor. A putative regulatory subunit of calcineurin (CnB) was identified in the context of the same study .
PFI1245c [G] (PP2 subgroup)
Previously described by Dobson et al , this enzyme activity was potently inhibited by okadaic acid (IC50 ~ 0.2 nM), and required Mn2+ for activity. These properties led to its classification as a member of the PP2A group. Our phylogenetic analysis supports the assignment of this protein to the PP2 group of protein phosphatases. The same group then identified PfARP (a spartate-r ich p rotein), a plasmodial protein with significant similarity to the I2PP2A family of inhibitors of mammalian PP2A . PfARP was able to inhibit PFI1245c, but none of the four other P. falciparum protein phosphatases tested .
MAL13P1.274 [I] (PP5 subgroup)
This protein has been reported previously independently by two groups [55, 56]. The activity is sensitive to nanomolar concentrations of okadaic acid. The sequence of the polypeptide comprises a nuclear targeting sequence at its N-terminus, as well as TPR (tetratricopeptide) repeats, which have an autoinhibitory effect on phosphatase activity; in other systems, this inhibition is relieved by binding unsaturated fatty acids, and indeed, purified recombinant MAL13P1.274 protein, like the native protein enriched from P. falciparum extracts, exhibited phosphatase activity that can be enhanced by arachidonic and oleic acids.
Uncharacterised P. falciparum PPPs
The PFI1360c [H] peptide is to our knowledge not described in the literature. Our phylogenetic analysis (Fig 3A) indicates that PFI1360c is most closely related to the PP2/4/6 subgroup, although it emerges at the very base of the cluster, and is therefore relatively divergent from other members of this subgroup. Members of this subgroup are involved in a variety of functions in metazoans, including centrosome maturation, spliceosome assembly, chromatin modification, and regulation of NF-κB and mTOR signalling pathways . As mentioned above, two plasmodial sequences (PF14_0660 [O] and PFL0300c [P]) cluster close to the "She wanella-l ike ph osphatases ("Shelphs") group, confirming a prior report that P. falciparum possesses two members of this bacterial-like phosphatase family . No functional studies have been reported on these two enzymes; likewise, we are not aware of any published biochemical studies of any of the 7 phosphatases predicted to act on non-protein substrates (sequences J-P in Fig. 3A) present in the tree.
Constitution of the PPM dataset
A HMM search of the P. falciparum peptide sequence set using the PF00481 (Protein phosphatase 2C) HMM profile produced 10 hits. Markov clustering of the P. falciparum sequences, along with the domain-conformant set from the model genomes, was performed to generate a tree.
The PPM phylogenetic tree
A high-resolution version of this Figure is available as a PNG file (see additional file 8)
Previously characterised P. falciparum PPMs
The PP2c-type phosphatase PF11_0396 [G] is the only PPM from P. falciparum reported in the literature . It has been implicated in the regulation of the nucleotide exchange activity of translation elongation factor 1B, antagonising its in vitr o phosphorylation by mammalian protein kinase C . In contrast to the monomeric nature of other PPM enzymes, maximal activity is associated with homodimerization of the peptide. The P. falciparum PP2c-conformant sequences are found in two regions separated by over 400 residues. Mamoun et al. proposed that the peptide contains two distinct PP2c type domains, each capable of enzymatic activity on phosphoserine or – threonine . In this model the dimeric enzyme presents four active sites. Detailed examination of the sequence indicates that only one full set of the conserved functional and structural groups is present in a single polypeptide, and that this complete set is distributed between the two distinct regions. However, evidence that the two peptides interact 'head to tail' may indicate that the regions in the different peptides complement each other, to produce two effective active sites. Such an arrangement may not be uncommon in Plasmodium. Two other PPM-related sequences (PFE1010w [E] and MAL8P1.109 [H]) show evidence of the same split of the PP2c domain, with shorter inserts (200 and 140 residues respectively); there is no experimental data on the function of the latter sequences. It is noteworthy that both are members of the same exclusive Apicomplexan-specific cluster mentioned above that also contains PF11_0396 [G].
PTP Tyrosine phosphatase-like group
Constitution of the PTP dataset
Searching the P. falciparum peptide set with Pfam-derived HMM profiles of the PTP superfamily identified a small number of sequences conforming to dual-specificity phosphatases (DSPs) : [PFC0380w, PF14_0524 (fragment) and PF14_0525 (fragment)], and two low scoring hit to tyrosine phosphatases (Y-phosphatases)  (PF11_0139, PF11_0281) (See Table 1 and Additional file 3). The two fragments PF14_0524 and PF14_0525 are immediately adjacent on the genome, and have similar expression profiles, suggesting that the stop codon separating them may be a misread, or may be read through in translation, as has been shown to be the case for at least one P. falciparum gene displaying an internal stop codon . One of the atypical protein kinases of the Apicomplexan-specific FIKK family has the same configuration, with a stop codon interrupting an otherwise complete catalytic domain [35, 64]. For further analyses, a hybrid sequence (labelled PF14_052x) was constructed by joining the two sequences. The locus has recently been re-annotated in PlasmoDB: a gene model called "PF14_024_changed" is now proposed, which generates a single predicted polypeptide with a full phosphatase domain encompassing sequences that were previously separated into PF14_0524 and PF14_0525.
The PTP phylogenetic tree
A high-resolution version of this Figure is available as a PNG file (see additional file 9)
Previously characterised P. falciparum PTPs
Two of the four P. falciparum PTPs have been the subject of biochemical investigations [67, 68]. The PFC0380w [B] polypeptide was assigned to the DSP subgroup, and like other members of this subgroup, contains a functional Zn2+-binding domain in addition to its phosphatase catalytic domain. Recombinant PFC0380w exhibits phosphatase activity on both phosphoserine and phosphotyrosine, in line with its assignment to the DSP family.
PF11_0139 [C] belongs to the PRL ("P rotein of R egenerating L iver") group . This sequence possesses the CaaX C-terminal motif for farnesylation, a distinguishing feature of this group of phosphatases (the attachment of a farnesyl group generally promotes membrane association to the target protein). It was recently demonstrated that this motif in PF11_0139 (called PfPRL in this study) is indeed the target of farnesyl transferase activity purified from parasite extracts, and that recombinant PfPRL displays phosphatase activity. Interestingly, in merozoites PfPRL co-localises with AMA-1, a membrane-associated protein associated with invasion .
To our knowledge, nothing has been published on the other two P. falciparum sequences appearing in the tree. It is noteworthy that PF11_0281 [D] does not cluster with any branch containing sequences from other Eukaryotes.
Protein tyrosine phosphatase-like proteins (PTPL; Pfam PF04387) constitute a small family of proteins structurally related to PTPs, but the substitution of proline for an essential arginine in the catalytic site renders these polypeptides catalytically inactive. MAL13P1.168 is the only P. falciparum sequence containing a PTP-like motif . While the present paper was in revision, a phylogenetic analysis of PTPs in protozoan parasites was published , whose conclusion are essentially in agreement with our own data with respect to the representation of P. falciparum sequences in the various families of the PTP group.
The NIF group
A high-resolution version of this Figure is available as a PNG file (see additional file 10)
Missing phosphatase groups
A high-resolution version of this Figure is available as a PNG file (see additional file 11)
Other protein phosphatase groups for which we found no evidence in Plasmodium are the Tyrosine phosphatases (see Fig. 5), the Low Molecular Weight phosphatases , the cdc14 phosphatases  and the Styx phosphatases. Styx sequences are related to those of PTPs and do recognise phosphotyrosine residues, but are non-catalytic proteins (the catalytic cysteine is replaced by a glycine). Because of their similarity to PTPs, human Styx sequences were picked up in our HMM search and are highlighted on the tree in Fig. 5. However, P. falciparum does not possess obvious Styx homologues. Finally, an HMM search of the P. falciparum database using the PFAM profile for Myotubularin (MTM) family of lipid phosphatases  yielded only hits with very low scores, indicating that the parasite does not encode members of this family.
The putative P. falciparum phosphoprotein phosphatase sequences were examined for the presence of signal peptides targeting proteins to various cellular compartments. PlasmoDB records the presence of apicoplast targeting sequences , the signal peptides predicted by SignalP , and the motif directing proteins to the host erythrocyte [77, 78]; the presence of these motifs on PP sequences is indicated in Table 1. In addition the set of sequences was analysed by the PlasMit algorithm http://gecco.org.chemie.uni-frankfurt.de/plasmit/ for putative mitochondrial targeting. No sequence demonstrated unequivocal (high stringency) mitochondrial targeting with this algorithm, not even the peptide associated with the TIM50 mitochondrial translocase (PF07_0110); however seven sequences yielded a lower score that is still compatible with mitochondrial targeting (in order of probability: PFE0795c > PF07_0110 > PF11_0362 > MAL8P1.109 > PF10_0093 > MAL13P1.44 > PFI1245c). The (presumably inactive, see above) PF10_0124 is borderline in this respect and is classified as non-mitochondrial. It is relevant to repeat here, as pointed out above, that the PP5-like MAL13P1.174 possesses a nuclear localisation signal. It is important to emphasise (i) that the presence or absence of targeting motifs on any sequence is dependent on gene predictions and can vary with database re-annotations, and (ii) that the functionality of such motifs should be verified experimentally.
Associated domains and motifs
In addition to the accessory domain instances described earlier, the only sequence containing associated domains with homologues in other organisms is that annotated as erythrocyte membrane-associated antigen (PF10_0177). In the sequence we originally downloaded and used in our analyses, this large polypeptide had an EF-hand domain and a putative acid protease domain in addition to the phosphatase domain. In a recent re-annotation of this locus, however, the open reading frames encoding the phosphatase and protease domains are proposed to be split and expressed as distinct genes. Other domain combinations are discussed above.
A "protein phosphatase" keyword query on PlasmoDB yielded 18 entries in the annotated P. falciparum genome, to be compared to the 27 PP sequences we retrieved using HHM searches. Using "phosphatase" as a query, PlasmoDB yielded 30 entries, a very similar number to the total number (34) of sequences of enzymes that phosphorylate proteins and non-protein substrates we found using HMMs. This indicates that in this instance the Plasmodb annotation is remarkably accurate, but detailed annotation of these genes can be improved – we are addressing this issue with the database curators.
We based our approach on HMM searches using established profiles, which would of course miss any "cryptic", non-HMM-conforming enzymes. The list we propose here must therefore be viewed as the minimal complement of functional protein phosphatases.
The ratio of protein kinases to protein phosphatases in P. falciparum is close to 2:1, in line with the smaller numbers of phosphatase catalytic domains (compared with those of kinase catalytic domains) present in other eukaryotes. The A. thaliana phosphatome contains a large number of PPMs (linked to modulation of stress responses through the MAPK pathway [19, 21, 54]), which may be linked to the observation that Plant genomes contain a much larger number of genes coding for receptor kinases than other organisms (reviewed in ). Similarly, PTPs linked to intercellular signalling, and antagonistic to a large repertoire of tyrosine kinases are vastly expanded in the mammalian phosphatome (Fig. 2). In contrast, the complement of phosphatases in P. falciparum does not include any markedly expanded family other than the 7-member cluster of PPMs described above, despite a major expansion of FIKK type kinases observed previously [35, 64]. Interestingly, the diversity of PP types represented in the malarial phosphatome is relatively high despite a comparatively small number of enzymes, which is explained by our observation that subtypes in the four PP groups are represented by one member only; this is particularly apparent in the PPP group, where subtypes are frequently represented by one member only. Thus the parasite maintains a large functional capability despite a small phosphatome. We have not addressed here the identification of protein phosphatase regulatory subunits. Undoubtedly the parasite possesses many such polypeptides, which are likely to considerably increase functional diversity (reviewed in ). It will be fascinating to explore the functional implications, in terms of both specific biochemical processes (signalling, motility, cell cycle and transcription control, transport, among many others) and overall parasite development, of the antagonism between specific instances of protein phosphorylation and dephosphorylation. Importantly, phosphatases are gaining recognition as potential targets for chemotherapeutic intervention , and have been estimated to represent 4% of the druggable human genome; in particular, PTPs appear an important new target for cancer therapy, notably for melanoma (reviewed in ). Thus, the P. falciparum phosphatases, like the plasmodial protein kinases , might well, in the near future, join the cohort of potential targets for novel antimalarials.
Selection of Hidden Markov Models for protein phosphatase catalytic domains
Profiles used, unaltered, to mine the various genomes for conformant sequences.
Rhodanese domain (CDC25)
Serine/Threonine; Many proteins with catalytic sites conforming to the PF00149 profile hydrolyse phosphate esters on substrates other than phosphoproteins.
Two tyrosine phosphatases (PF00102; Y-phosphatase, PF03162; Y-phosphatase2), and a dual-specificity serine/threonine/tyrosine phosphatase activity (PF00782; DSP) are closely related and are grouped in a single clan, the protein tyrosine-phosphatase superfamily (also referred to as PTP) . An additional low molecular weight tyrosine phosphatase [PF01451] with limited sequence similarity, but possessing the characteristic PTP motif, is also listed. Serine-threonine phosphatase activities are found in two distinct groups: a highly conserved group conforming to the Metallophosphatase family (Pfam profilePF00149; note that this family includes a wide range of phosphatase activities in addition to protein phosphatases; the protein phosphatase activities are classified as PPP type) and a structurally unrelated (though catalytically similar) group, the PP2C family (Pfam profile PF00481, PPM).
Identification of catalytic domains
Catalytic domains were identified by use of the hmmsearch option of HMMER  using Hidden Markov Profiles appropriate to the domain of interest using moderately stringent criteria (Expect value [-E] of 10-3, database record number [-Z] 100000). The initial search used the global model for each domain type, although the local model was subsequently used where appropriate if multiple or fragmented domains were found.
Extraction of profile conformant sequences (PP domain plus short flanking sequences)
Peptide sequences were aligned under the guidance of an appropriate HMM profile  using the hmmalign option of HMMER. Alignment output in ClustalW format was trimmed down to those blocks encompassing match states to the profile and ungapped Fasta formatted sequences extracted from this sub-set of the alignment. (T_coffee seq_reformat option).
Multiple sequence alignment
MSA of a given sequence set was performed by three independent methods; ClustalW , t-coffee  and hmmalign  guided by the appropriate profile. The alignments used the default settings for each method. Alignments were combined under t-coffee, and quality of alignment assessed.
Clustering of model genome peptide sequences with identified P. falciparum sequences
The eukaryotic kingdom is extremely diverse, and molecular analysis confidently identifies eight major groups within this diversity. For a broad phylogenetic and evolutionary analysis of the protein phosphatases present in P. falciparum the translated gene products (June 2006 versions) were downloaded from completed genome projects of the following species:
Homo sapiens (Opisthokonts)
Dictyostelium discoideum (amoebazoa)
Arabidopsis thaliana (viridiplantae)
Plasmodium falciparum (alveolates)
Thalassiosira pseudonana (heterokonts)
Trypanosoma brucei (discicristates)
Giardia lamblia (excavates)
At the time of writing, no genomic information was available for any member of the cercozoan group. Translated peptide sets for the six model genomes were combined and subjected to the HMMER hmmsearch option using the above criteria. Identified sequences were retrieved using the blast  fastacmd option, and domain conformant subsequences extracted. Appropriate subsequences derived from P. falciparum sequences were added to the dataset. An all against all blastp (-e 0.01) of the sequence set was performed, and Markov clustering of the output performed under control of the Tribe package . The inflation parameter (-I) was 1.7, a value which demonstrates a reasonable discrimination without fragmenting clusters to an unusable degree
High quality Multiple Sequence Alignments of the catalytic domains were prepared as described above, and columns displaying low consistency (score < 5) or significant numbers of gaps (> 15%) removed. Alternate neighbour joining phylogenies were visualised using Neighbour-Net, implemented on SplitsTree version 4 .
This work was made possible by the availability of the P. falciparum genome database PlasmoDB. We are indebted to all members of the team which contributed to the development of this database, which is proving an invaluable tool for molecular research on malaria. Financial support for the Plasmodium Genome Consortium was provided by the Burroughs Wellcome Fund, the Wellcome Trust, the National Institutes of Health (NIAID) and the U.S. Department of Defence, Military Infectious Diseases Research Program. Financial Support for PlasmoDB was provided by the Burroughs Wellcome Fund. We thank Dr. J. Chevalier (Service Scientifique de l'Ambassade de France à Londres) for continuing interest and support. The idea of undertaking the present study was stimulated by the invited participation of C.D. to a FASEB meeting on phosphatases organised by M. Tremblay (McGill Cancer Center) and D. Virshup (Duke-NUS) – many thanks to them!
Work in the C.D. laboratory is supported by INSERM, the European Commission (FP6 Integrated Project ANTIMAL and Network of Excellence BioMalPar), a grant from the Novartis Institute for Tropical Diseases (NITD, Singapore) and benefits from the Wellcome Trust core funding towards the Wellcome Centre for Molecular Parasitology, which also supports J.W.
- Hanks SK: Genomic analysis of the eukaryotic protein kinase superfamily: a perspective. Genome Biol. 2003, 4 (5): 111-PubMedPubMed CentralView ArticleGoogle Scholar
- Hanks SK, Quinn AM: Protein kinase catalytic domain sequence database: identification of conserved features of primary structure and classification of family members. Methods Enzymol. 1991, 200: 38-62.PubMedView ArticleGoogle Scholar
- West AH, Stock AM: Histidine kinases and response regulator proteins in two-component signaling systems. Trends Biochem Sci. 2001, 26 (6): 369-376.PubMedView ArticleGoogle Scholar
- Klumpp S, Krieglstein J: Reversible phosphorylation of histidine residues in vertebrate proteins. Biochim Biophys Acta. 2005, 1754 (1–2): 291-295.PubMedView ArticleGoogle Scholar
- Barford D, Das AK, Egloff MP: The structure and mechanism of protein phosphatases: insights into catalysis and regulation. Annu Rev Biophys Biomol Struct. 1998, 27: 133-164.PubMedView ArticleGoogle Scholar
- Gallego M, Virshup DM: Protein serine/threonine phosphatases: life, death, and sleeping. Curr Opin Cell Biol. 2005, 17 (2): 197-202.PubMedView ArticleGoogle Scholar
- Miranda-Saavedra D, Barton GJ: Classification and functional annotation of eukaryotic protein kinases. Proteins. 2007, 68 (4): 893-914.PubMedView ArticleGoogle Scholar
- Cohen PT: Novel protein serine/threonine phosphatases: variety is the spice of life. Trends Biochem Sci. 1997, 22 (7): 245-251.PubMedView ArticleGoogle Scholar
- Cohen PT, Chen MX, Armstrong CG: Novel protein phosphatases that may participate in cell signaling. Adv Pharmacol. 1996, 36: 67-89.PubMedView ArticleGoogle Scholar
- Bollen M: Combinatorial control of protein phosphatase-1. Trends Biochem Sci. 2001, 26 (7): 426-431.PubMedView ArticleGoogle Scholar
- Orgad S, Brewis ND, Alphey L, Axton JM, Dudai Y, Cohen PT: The structure of protein phosphatase 2A is as highly conserved as that of protein phosphatase 1. FEBS Lett. 1990, 275 (1–2): 44-48.PubMedView ArticleGoogle Scholar
- Barton GJ, Cohen PT, Barford D: Conservation analysis and structure prediction of the protein serine/threonine phosphatases. Sequence similarity with diadenosine tetraphosphatase from Escherichia coli suggests homology to the protein phosphatases. Eur J Biochem. 1994, 220 (1): 225-237.PubMedView ArticleGoogle Scholar
- Wera S, Hemmings BA: Serine/threonine protein phosphatases. Biochem J. 1995, 311 (Pt 1): 17-29.PubMedPubMed CentralView ArticleGoogle Scholar
- Andreeva AV, Kutuzov MA: PPP family of protein Ser/Thr phosphatases: two distinct branches?. Mol Biol Evol. 2001, 18 (3): 448-452.PubMedView ArticleGoogle Scholar
- Kutuzov MA, Andreeva AV: Protein Ser/Thr phosphatases with kelch-like repeat domains. Cell Signal. 2002, 14 (9): 745-750.PubMedView ArticleGoogle Scholar
- Cohen PT, Philp A, Vazquez-Martin C: Protein phosphatase 4 – from obscurity to vital functions. FEBS Lett. 2005, 579 (15): 3278-3286.PubMedView ArticleGoogle Scholar
- Andreeva AV, Kutuzov MA: Widespread presence of "bacterial-like" PPP phosphatases in eukaryotes. BMC Evol Biol. 2004, 4: 47-PubMedPubMed CentralView ArticleGoogle Scholar
- Das AK, Helps NR, Cohen PT, Barford D: Crystal structure of the protein serine/threonine phosphatase 2C at 2.0 A resolution. Embo J. 1996, 15 (24): 6798-6809.PubMedPubMed CentralGoogle Scholar
- Schweighofer A, Hirt H, Meskiene I: Plant PP2C phosphatases: emerging functions in stress signaling. Trends Plant Sci. 2004, 9 (5): 236-243.PubMedView ArticleGoogle Scholar
- Boutros R, Dozier C, Ducommun B: The when and wheres of CDC25 phosphatases. Curr Opin Cell Biol. 2006, 18 (2): 185-191.PubMedView ArticleGoogle Scholar
- Owens DM, Keyse SM: Differential regulation of MAP kinase signalling by dual-specificity protein phosphatases. Oncogene. 2007, 26 (22): 3203-3213.PubMedView ArticleGoogle Scholar
- Trinkle-Mulcahy L, Lamond AI: Mitotic phosphatases: no longer silent partners. Curr Opin Cell Biol. 2006, 18 (6): 623-631.PubMedView ArticleGoogle Scholar
- Raugei G, Ramponi G, Chiarugi P: Low molecular weight protein tyrosine phosphatases: small, but smart. Cell Mol Life Sci. 2002, 59 (6): 941-949.PubMedView ArticleGoogle Scholar
- Fauman EB, Saper MA: Structure and function of the protein tyrosine phosphatases. Trends Biochem Sci. 1996, 21 (11): 413-417.PubMedView ArticleGoogle Scholar
- Yeo M, Lin PS: Functional characterization of small CTD phosphatases. Methods Mol Biol. 2007, 365: 335-346.PubMedGoogle Scholar
- Yeo M, Lin PS, Dahmus ME, Gill GN: A novel RNA polymerase II C-terminal domain phosphatase that preferentially dephosphorylates serine 5. J Biol Chem. 2003, 278 (28): 26078-26085.PubMedView ArticleGoogle Scholar
- Kobor MS, Greenblatt J: Regulation of transcription elongation by phosphorylation. Biochim Biophys Acta. 2002, 1577 (2): 261-275.PubMedView ArticleGoogle Scholar
- Suh MH, Ye P, Zhang M, Hausmann S, Shuman S, Gnatt AL, Fu J: Fcp1 directly recognizes the C-terminal domain (CTD) and interacts with a site on RNA polymerase II distinct from the CTD. Proc Natl Acad Sci USA. 2005, 102 (48): 17314-17319.PubMedPubMed CentralView ArticleGoogle Scholar
- Hausmann S, Shuman S: Defining the active site of Schizosaccharomyces pombe C-terminal domain phosphatase Fcp1. J Biol Chem. 2003, 278 (16): 13627-13632.PubMedView ArticleGoogle Scholar
- Tilley L, Davis TM, Bray PG: Prospects for the treatment of drug-resistant malaria parasites. Future Microbiol. 2006, 1: 127-141.PubMedView ArticleGoogle Scholar
- Doerig C, Meijer L: Antimalarial drug discovery: targeting protein kinases. Expert Opin Ther Targets. 2007, 11 (3): 279-290.PubMedView ArticleGoogle Scholar
- Gardner MJ, Hall N, Fung E, White O, Berriman M, Hyman RW, Carlton JM, Pain A, Nelson KE, Bowman S: Genome sequence of the human malaria parasite Plasmodium falciparum. Nature. 2002, 419 (6906): 498-511.PubMedView ArticleGoogle Scholar
- Stoeckert CJ, Fischer S, Kissinger JC, Heiges M, Aurrecoechea C, Gajria B, Roos DS: PlasmoDB v5: new looks, new genomes. Trends Parasitol. 2006, 22 (12): 543-546.PubMedView ArticleGoogle Scholar
- Eddy SR: Profile hidden Markov models. Bioinformatics. 1998, 14 (9): 755-763.PubMedView ArticleGoogle Scholar
- Ward P, Equinet L, Packer J, Doerig C: Protein kinases of the human malaria parasite Plasmodium falciparum: the kinome of a divergent eukaryote. BMC Genomics. 2004, 5 (1): 79-PubMedPubMed CentralView ArticleGoogle Scholar
- Anamika , Srinivasan N, Krupa A: A genomic perspective of protein kinases in Plasmodium falciparum. Proteins. 2005, 58 (1): 180-189.PubMedView ArticleGoogle Scholar
- Parsons M, Worthey EA, Ward PN, Mottram JC: Comparative analysis of the kinomes of three pathogenic trypanosomatids: Leishmania major, Trypanosoma brucei and Trypanosoma cruzi. BMC Genomics. 2005, 6: 127-PubMedPubMed CentralView ArticleGoogle Scholar
- Brenchley R, Tariq H, McElhinney H, Szoor B, Huxley-Jones J, Stevens R, Matthews K, Tabernero L: The TriTryp phosphatome: analysis of the protein phosphatase catalytic domains. BMC Genomics. 2007, 8: 434-PubMedPubMed CentralView ArticleGoogle Scholar
- Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, Eddy SR, Griffiths-Jones S, Howe KL, Marshall M, Sonnhammer EL: The Pfam protein families database. Nucleic Acids Res. 2002, 30 (1): 276-280.PubMedPubMed CentralView ArticleGoogle Scholar
- Baldauf SL: The deep roots of eukaryotes. Science. 2003, 300 (5626): 1703-1706.PubMedView ArticleGoogle Scholar
- Mora-Garcia S, Vert G, Yin Y, Cano-Delgado A, Cheong H, Chory J: Nuclear protein phosphatases with Kelch-repeat domains modulate the response to brassinosteroids in Arabidopsis. Genes Dev. 2004, 18 (4): 448-460.PubMedPubMed CentralView ArticleGoogle Scholar
- Li JL, Baker DA: A putative protein serine/threonine phosphatase from Plasmodium falciparum contains a large N-terminal extension and five unique inserts in the catalytic domain. Mol Biochem Parasitol. 1998, 95 (2): 287-295.PubMedView ArticleGoogle Scholar
- Adams J, Kelso R, Cooley L: The kelch repeat superfamily of proteins: propellers of cell function. Trends Cell Biol. 2000, 10 (1): 17-24.PubMedView ArticleGoogle Scholar
- Waller RF, McFadden GI: The apicoplast: a review of the derived plastid of apicomplexan parasites. Curr Issues Mol Biol. 2005, 7 (1): 57-79.PubMedGoogle Scholar
- Kumar R, Adams B, Oldenburg A, Musiyenko A, Barik S: Characterisation and expression of a PP1 serine/threonine protein phosphatase (PfPP1) from the malaria parasite, Plasmodium falciparum: demonstration of its essential role using RNA interference. Malar J. 2002, 1 (1): 5-PubMedPubMed CentralView ArticleGoogle Scholar
- Bhattacharyya MK, Hong Z, Kongkasuriyachai D, Kumar N: Plasmodium falciparum protein phosphatase type 1 functionally complements a glc7 mutant in Saccharomyces cerevisiae. Int J Parasitol. 2002, 32 (6): 739-747.PubMedView ArticleGoogle Scholar
- Dobson S, Bracchi V, Chakrabarti D, Barik S: Characterization of a novel serine/threonine protein phosphatase (PfPPJ) from the malaria parasite, Plasmodium falciparum. Mol Biochem Parasitol. 2001, 115 (1): 29-39.PubMedView ArticleGoogle Scholar
- Kumar R, Musiyenko A, Oldenburg A, Adams B, Barik S: Post-translational generation of constitutively active cores from larger phosphatases in the malaria parasite, Plasmodium falciparum: implications for proteomics. BMC Mol Biol. 2004, 5: 6-PubMedPubMed CentralView ArticleGoogle Scholar
- Li JL, Baker DA: Protein phosphatase beta, a putative type-2A protein phosphatase from the human malaria parasite Plasmodium falciparum. Eur J Biochem. 1997, 249 (1): 98-106.PubMedView ArticleGoogle Scholar
- Bastians H, Ponstingl H: The novel human protein serine/threonine phosphatase 6 is a functional homologue of budding yeast Sit4p and fission yeast ppe1, which are involved in cell cycle regulation. J Cell Sci. 1996, 109 (Pt 12): 2865-2874.PubMedGoogle Scholar
- Le Roch KG, Zhou Y, Blair PL, Grainger M, Moch JK, Haynes JD, De La Vega P, Holder AA, Batalov S, Carucci DJ: Discovery of gene function by expression profiling of the malaria parasite life cycle. Science. 2003, 301 (5639): 1503-1508.PubMedView ArticleGoogle Scholar
- Dobson S, May T, Berriman M, Del Vecchio C, Fairlamb AH, Chakrabarti D, Barik S: Characterization of protein Ser/Thr phosphatases of the malaria parasite, Plasmodium falciparum: inhibition of the parasitic calcineurin by cyclophilin-cyclosporin complex. Mol Biochem Parasitol. 1999, 99 (2): 167-181.PubMedView ArticleGoogle Scholar
- Li M, Guo H, Damuni Z: Purification and characterization of two potent heat-stable protein inhibitors of protein phosphatase 2A from bovine kidney. Biochemistry. 1995, 34 (6): 1988-1996.PubMedView ArticleGoogle Scholar
- Dobson S, Kumar R, Bracchi-Ricard V, Freeman S, Al-Murrani SW, Johnson C, Damuni Z, Chakrabarti D, Barik S: Characterization of a unique aspartate-rich protein of the SET/TAF-family in the human malaria parasite, Plasmodium falciparum, which inhibits protein phosphatase 2A. Mol Biochem Parasitol. 2003, 126 (2): 239-250.PubMedView ArticleGoogle Scholar
- Dobson S, Kar B, Kumar R, Adams B, Barik S: A novel tetratricopeptide repeat (TPR) containing PP5 serine/threonine protein phosphatase in the malaria parasite, Plasmodium falciparum. BMC Microbiol. 2001, 1: 31-PubMedPubMed CentralView ArticleGoogle Scholar
- Lindenthal C, Klinkert MQ: Identification and biochemical characterisation of a protein phosphatase 5 homologue from Plasmodium falciparum. Mol Biochem Parasitol. 2002, 120 (2): 257-268.PubMedView ArticleGoogle Scholar
- Carniol K, Ben-Yehuda S, King N, Losick R: Genetic dissection of the sporulation protein SpoIIE and its role in asymmetric division in Bacillus subtilis. J Bacteriol. 2005, 187 (10): 3511-3520.PubMedPubMed CentralView ArticleGoogle Scholar
- Chakraborty N, Ohta M, Zhu JK: Recognition of a PP2C interaction motif in several plant protein kinases. Methods Mol Biol. 2007, 365: 287-298.PubMedGoogle Scholar
- Mamoun CB, Sullivan surDJ Jr, Banerjee R, Goldberg DE: Identification and characterization of an unusual double serine/threonine protein phosphatase 2C in the malaria parasite Plasmodium falciparum. J Biol Chem. 1998, 273 (18): 11241-11247.PubMedView ArticleGoogle Scholar
- Mamoun CB, Goldberg DE: Plasmodium protein phosphatase 2C dephosphorylates translation elongation factor 1beta and inhibits its PKC-mediated nucleotide exchange activity in vitro. Mol Microbiol. 2001, 39 (4): 973-981.PubMedView ArticleGoogle Scholar
- Roma-Mateo C, Rios P, Tabernero L, Attwood TK, Pulido R: A novel phosphatase family, structurally related to dual-specificity phosphatases, that displays unique amino acid sequence and substrate specificity. J Mol Biol. 2007, 374 (4): 899-909.PubMedView ArticleGoogle Scholar
- Dewang PM, Hsu NM, Peng SZ, Li WR: Protein tyrosine phosphatases and their inhibitors. Curr Med Chem. 2005, 12 (1): 1-22.PubMedView ArticleGoogle Scholar
- Bischoff E, Guillotte M, Mercereau-Puijalon O, Bonnefoy S: A member of the Plasmodium falciparum Pf60 multigene family codes for a nuclear protein expressed by readthrough of an internal stop codon. Mol Microbiol. 2000, 35 (5): 1005-1016.PubMedView ArticleGoogle Scholar
- Schneider AG, Mercereau-Puijalon O: A new Apicomplexa-specific protein kinase family: multiple members in Plasmodium falciparum, all with an export signature. BMC Genomics. 2005, 6 (1): 30-PubMedPubMed CentralView ArticleGoogle Scholar
- Dorin D, Semblat JP, Poullet P, Alano P, Goldring D, Whittle C, Patterson S, Whittle C, Chakrabarti D, Doerig C: PfPK7, an atypical MEK-related protein kinase, reflects the absence of typical three-component MAP kinase pathways in the human malaria parasite Plasmodium falciparum. Mol Microbiol. 2005, 55 (1): 184-196.PubMedView ArticleGoogle Scholar
- Dorin D, Alano P, Boccaccio I, Ciceron L, Doerig C, Sulpice R, Parzy D, Doerig C: An atypical mitogen-activated protein kinase (MAPK) homologue expressed in gametocytes of the human malaria parasite Plasmodium falciparum. Identification of a MAPK signature. Journal of Biological Chemistry. 1999, 274 (42): 29912-29920.PubMedView ArticleGoogle Scholar
- Kumar R, Musiyenko A, Cioffi E, Oldenburg A, Adams B, Bitko V, Krishna SS, Barik S: A zinc-binding dual-specificity YVH1 phosphatase in the malaria parasite, Plasmodium falciparum, and its interaction with the nuclear protein, pescadillo. Mol Biochem Parasitol. 2004, 133 (2): 297-310.PubMedView ArticleGoogle Scholar
- Pendyala PR, Ayong L, Eatrides J, Schreiber M, Pham C, Chakrabarti R, Fidock DA, Allen CM, Chakrabarti D: Characterization of a PRL protein tyrosine phosphatase from Plasmodium falciparum. Mol Biochem Parasitol. 2008, 158 (1): 1-10.PubMedView ArticleGoogle Scholar
- Stephens BJ, Han H, Gokhale V, Von Hoff DD: PRL phosphatases as potential molecular targets in cancer. Mol Cancer Ther. 2005, 4 (11): 1653-1661.PubMedView ArticleGoogle Scholar
- Andreeva AV, Kutuzov MA: Protozoan protein tyrosine phosphatases. Int J Parasitol. 2008Google Scholar
- Yu X, Chini CC, He M, Mer G, Chen J: The BRCT domain is a phospho-protein binding domain. Science. 2003, 302 (5645): 639-642.PubMedView ArticleGoogle Scholar
- Rudolph J: Cdc25 phosphatases: structure, specificity, and mechanism. Biochemistry. 2007, 46 (12): 3595-3604.PubMedView ArticleGoogle Scholar
- Doerig C, Endicott J, Chakrabarti D: Cyclin-dependent kinase homologues of Plasmodium falciparum. Int J Parasitol. 2002, 32 (13): 1575-1585.PubMedView ArticleGoogle Scholar
- Clague MJ, Lorenzo O: The myotubularin family of lipid phosphatases. Traffic. 2005, 6 (12): 1063-1069.PubMedView ArticleGoogle Scholar
- Foth BJ, Ralph SA, Tonkin CJ, Struck NS, Fraunholz M, Roos DS, Cowman AF, McFadden GI: Dissecting apicoplast targeting in the malaria parasite Plasmodium falciparum. Science. 2003, 299 (5607): 705-708.PubMedView ArticleGoogle Scholar
- Emanuelsson O, Brunak S, von Heijne G, Nielsen H: Locating proteins in the cell using TargetP, SignalP and related tools. Nat Protoc. 2007, 2 (4): 953-971.PubMedView ArticleGoogle Scholar
- Marti M, Good RT, Rug M, Knuepfer E, Cowman AF: Targeting malaria virulence and remodeling proteins to the host erythrocyte. Science. 2004, 306 (5703): 1930-1933.PubMedView ArticleGoogle Scholar
- Hiller NL, Bhattacharjee S, van Ooij C, Liolios K, Harrison T, Lopez-Estrano C, Haldar K: A host-targeting signal in virulence proteins reveals a secretome in malarial infection. Science. 2004, 306 (5703): 1934-1937.PubMedView ArticleGoogle Scholar
- Castells E, Casacuberta JM: Signalling through kinase-defective domains: the prevalence of atypical receptor-like kinases in plants. J Exp Bot. 2007, 58 (13): 3503-3511.PubMedView ArticleGoogle Scholar
- Ventura JJ, Nebreda AR: Protein kinases and phosphatases as therapeutic targets in cancer. Clin Transl Oncol. 2006, 8 (3): 153-160.PubMedView ArticleGoogle Scholar
- Easty D, Gallagher W, Bennett DC: Protein tyrosine phosphatases, new targets for cancer therapy. Curr Cancer Drug Targets. 2006, 6 (6): 519-532.PubMedView ArticleGoogle Scholar
- Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, Moxon S, Marshall M, Khanna A, Durbin R: Pfam: clans, web tools and services. Nucleic Acids Res. 2006, D247-251. 34 Database
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22 (22): 4673-4680.PubMedPubMed CentralView ArticleGoogle Scholar
- Notredame C, Higgins DG, Heringa J: T-Coffee: A novel method for fast and accurate multiple sequence alignment. J Mol Biol. 2000, 302 (1): 205-217.PubMedView ArticleGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402.PubMedPubMed CentralView ArticleGoogle Scholar
- Enright AJ, Van Dongen S, Ouzounis CA: An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 2002, 30 (7): 1575-1584.PubMedPubMed CentralView ArticleGoogle Scholar
- Huson DH, Bryant D: Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2006, 23 (2): 254-267.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.