A new family of phosphoinositide phosphatases in microorganisms: identification and biochemical analysis
© Beresford et al. 2010
Received: 4 January 2010
Accepted: 2 August 2010
Published: 2 August 2010
Phosphoinositide metabolism is essential to membrane dynamics and impinges on many cellular processes, including phagocytosis. Modulation of phosphoinositide metabolism is important for pathogenicity and virulence of many human pathogens, allowing them to survive and replicate in the host cells. Phosphoinositide phosphatases from bacterial pathogens are therefore key players in this modulation and constitute attractive targets for chemotherapy. MptpB, a virulence factor from Mycobacterium tuberculosis, has phosphoinositide phosphatase activity and a distinct active site P-loop signature HCXXGKDR that shares characteristics with eukaryotic lipid phosphatases and protein tyrosine phosphatases. We used this P-loop signature as a "diagnostic motif" to identify related putative phosphatases with phosphoinositide activity in other organisms.
We found more than 200 uncharacterised putative phosphatase sequences with the conserved signature in bacteria, with some related examples in fungi and protozoa. Many of the sequences identified belong to recognised human pathogens. Interestingly, no homologues were found in any other organisms including Archaea, plants, or animals. Phylogenetic analysis revealed that these proteins are unrelated to classic eukaryotic lipid phosphatases. However, biochemical characterisation of those from Listeria monocytogenes and Leishmania major demonstrated that, like MptpB, they have phosphatase activity towards phosphoinositides. Mutagenesis studies established that the conserved Asp and Lys in the P-loop signature (HCXXGKDR) are important in catalysis and substrate binding respectively. Furthermore, we provide experimental evidence that the number of basic residues in the P-loop is critical in determining activity towards poly-phosphoinositides.
This new family of enzymes in microorganisms shows distinct sequence and biochemical characteristics to classic eukaryotic lipid phosphatases and they have no homologues in humans. This study provides a foundation for examining the biological role of this new family of phosphatases and their potential as pharmaceutical targets against infectious diseases.
Phosphatidylinositols are important metabolites present in animal cell membranes. They can be phosphorylated at three main positions of the inositol head (D-3, 4, and 5) to generate seven different phosphoinositides (PIs). Their metabolism is tightly controlled by phospholipases that in some instances generate second messengers such as inositol 1,4,5-trisphosphate (Ins(1,4,5)P3) and diacylglycerol (DAG). In addition, kinases and phosphatases control the levels of the mono-, di- or tri-phosphorylated forms [1–5]. PI phosphatases belong to different families of enzymes; PTEN and myotubularin inositol 3-phosphatases (3-ptases) belong to the protein tyrosine phosphatase (PTP) superfamily and have both protein and lipid phosphatase activities [6, 7]. Inositol 4-phosphatases (4-ptases) share with PTPs the conserved active site signature CX5R (P-loop motif), where the Cys is the nucleophile in catalysis and the Arg binds the phosphate group in the substrate. The inositol 5-phosphatases (5-ptases) are Mg2+-dependent enzymes related to endonucleases (reviewed in [6, 7]).
PIs are implicated in the regulation of a wide variety of cellular functions, including signal transduction, membrane dynamics, cytoskeleton arrangements, endocytosis, endosome trafficking, permeability and transport across membranes [1, 3]. PIs are also important in facilitating phagocytosis of pathogenic bacteria and the subsequent phagosomal maturation leading to bacterial decay. This process is mediated by PI(3)P and PI(3,5)P2. Pathogenic bacteria have evolved sophisticated strategies to escape the innate immune response, using PI metabolising proteins such as phosphatases and kinases to block phagosome maturation. Some of these bacterial phosphatases, such as SigD/SopB, IpgD and SapM, have phosphoinositide activity that alters PI levels in the host membranes [8–10].
Mycobacterium tuberculosis (Mtb) is an exceptionally successful pathogen that is able to survive in the host for a lifetime. Mtb enters macrophages in a phosphoinositide-3-kinase (PI3K) dependent manner and uses lipid analogues (LAM) and PI metabolising enzymes (SapM) to keep PI(3)P levels low, which prevents recruitment of Rab7 and phagosomal maturation. MptpB is a secreted phosphatase critical for Mtb survival in host cells. We have recently shown that it has protein and PI phosphatase activity and could act together with SapM to control PI(3)P levels in the host.
MptpB has a unique active site P-loop motif (160-167): HCFAGKDR, which is strictly conserved in other mycobacterial species, where the Asp165 is an essential catalytic residue and Lys164 is important for PI binding. This signature shares similarities with that of eukaryotic 3-ptases myotubularin (MTM), HCSDGWDR, and PTEN, HCKAGKGR, despite a low overall sequence homology (10.6% for MTMR2 and 9.8% for PTEN). The MTM/PTEN family of 3-ptases is conserved in mammals, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, yeast, kinetoplastids and several protozoa [16–20]. However, homologous 3-ptases have not been identified in bacteria. Instead, some pathogenic bacteria have multifunctional 5-ptases such as SopB from Salmonella and IpgD from Shigella. The P-loop features and catalytic profile of MptpB, together with the lack of MTM and PTEN orthologues in bacteria, suggested that MptpB-like phosphatases may constitute an alternative type of PI phosphatase in prokaryotes. This hypothesis prompted us to further investigate the existence of MptpB-related protein sequences in other organisms.
In this study, bioinformatics explorations revealed a large family of MptpB-related sequences present mostly in bacteria, with some in fungi and protozoa. Several of the sequences identified are found in recognised intracellular pathogens. No MptpB homologues were found in Archaea, plants or animals. We biochemically characterised selected proteins from different organisms (two bacterial and one protozoan) and confirmed that they have phosphatase activity towards both phosphorylated peptides and PIs. Mutagenesis of conserved P-loop residues confirmed a Cys-based mechanism of catalysis, where the conserved Asp is an essential catalytic residue. Furthermore, we provide evidence for the importance of key residues in the P-loop as determinants of PI substrate specificity. Substitution of P-loop residues to mimic those of eukaryotic 3-ptases resulted in mutant enzymes with enhanced phosphatase activity and new poly-phosphate PI specificity.
Phylogenetic analysis shows that this new family of PI phosphatases is not related to the classic eukaryotic lipid phosphatases like MTMs/PTEN. Moreover, these phosphatases differ from other known PI and inositol phosphatases in their catalytic profile, sequence and domain organisation, showing distinct and unusual features; we have therefore designated them atypical lipid phosphatases (ALPs).
Classification of sequences identified through Blast analysis and ScanProsite
Sequences were identified in both Gram positive (48%) and Gram negative (51%) bacteria. The MptpB P-loop signature is conserved in all sequences except 7: the Cys residue is absent in 3 sequences (2 from Burkholderia phytofirmans and 1 from Mycobacterium paratuberculosis) and the Asp residue is absent in 4 sequences (Bacteroides ovatus, Bacteroides uniformis, Bacteroides fragilis, Bacteroides thetaiotaomicron). As both residues are important for catalysis, these 7 sequences are probably inactive phosphatases and were excluded from subsequent analyses.
The domain organisation of this new family differs markedly from that of eukaryotic lipid phosphatases, which generally contain a number of characteristic modules for membrane targeting and lipid binding (PH, C2, GRAM, FYVE). In contrast, ALPs contain the phosphatase domain only and lack any classic lipid binding modules (Figure 2B), suggesting that they use alternative mechanisms for lipid binding and membrane anchoring, possibly mediated by effector proteins.
The occurrence of a single eukaryotic sequence in the Blast search was intriguing, but the high conservation of the P-loop motif suggested that other eukaryotic sequences may still retain this signature, albeit with low overall homology across the rest of the sequence. To test this, we carried out a ScanProsite search using the active site motif "HCXXGKDR". The search returned a total of 209 sequences, of which 163 were bacterial and 46 were from lower eukaryotes (Table 1). The majority of eukaryotic sequences identified were from fungi (40), with 4 Leishmania sequences (2 L. infantum and 2 L. major), 1 sequence from Caenorhabditis elegans and 1 sequence from Trichomonas vaginalis. A total of 31% of the sequences identified in ScanProsite were also present in the Blast hit list, indicating that in the remaining sequences the overall sequence identity to MptpB was less than 30%. Interestingly, more than 30 sequences of the total list belong to pathogens.
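The motif search can be approximated locally with a regular expression, reading each X in "HCXXGKDR" as any amino acid. The following is a minimal sketch rather than the ScanProsite implementation: the function name `find_p_loop` and the restriction to the 20 standard residues are assumptions made for illustration, and the example P-loop sequences are those quoted in the text.

```python
import re

# "HCXXGKDR" with X = any residue, written as a regular expression.
# Restricting X to the 20 standard amino acids is an assumption; sequence
# databases may also contain ambiguity codes such as B, Z or X.
P_LOOP = re.compile(r"HC[ACDEFGHIKLMNPQRSTVWY]{2}GKDR")


def find_p_loop(sequence):
    """Return (1-based start, matched motif) for each P-loop hit in a protein sequence."""
    return [(m.start() + 1, m.group()) for m in P_LOOP.finditer(sequence.upper())]


# P-loop sequences quoted in the text
examples = {
    "MptpB": "HCFAGKDR",
    "MTM": "HCSDGWDR",
    "PTEN": "HCKAGKGR",
    "SopB/IpgD": "NCKSGKDR",
}

for name, loop in examples.items():
    status = "match" if find_p_loop(loop) else "no match"
    print(f"{name}: {loop} -> {status}")
```

Under these assumptions only the MptpB-type loop matches, which is consistent with MTM, PTEN and SopB/IpgD being absent from the hit list.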
Other known bacterial lipid and inositol phosphatase sequences were missing from our hit list. This is the case for SopB/SigD and IpgD, which have a slightly different motif, NCKSGKDR, and low sequence identity to MptpB (~11%). Also absent are the mammalian 4-ptases (with the motif XCKSAKDR) and 5-ptases that lack the CX5R consensus motif.
We previously reported that MptpB had both protein phosphatase and lipid phosphatase activity. We also established that Cys160, Lys164, Asp165 and Arg166 in the P-loop are important residues involved in catalysis or substrate binding. The conservation of these residues in the ALP sequences suggested they could have a similar enzymatic profile, which could be relevant to PI metabolism. To test this hypothesis, two prokaryotic sequences and one eukaryotic sequence from our hit list were selected for enzymatic analysis, all of them belonging to intracellular pathogens: Lmo1800 and Lmo1935 from L. monocytogenes and LM1 from L. major. These sequences occupy different locations within the phylogenetic tree (Figure 3) and had sequence identities of only 22-24% to MptpB. Despite their relatively poor sequence homology, the P-loop motif is almost identical to that of MptpB, with only one substitution of F for T in position two.
Activity of ALPs on phosphosubstrates. Specific activity (nmoles PO4/min/mg); the enzyme and substrate labels of this table are missing, and the values are listed in their original order:
46.94 ± 0.31; 45.47 ± 1.52; 57.47 ± 0.96; 63.38 ± 0.71; 2.89 ± 0.55; 0.34 ± 0.26; 0.48 ± 0.02; 6.71 ± 0.43; 3.35 ± 0.16; 0.35 ± 0.03; 3.90 ± 0.66; 0.63 ± 0.09; 0.26 ± 0.05; 0.58 ± 0.16; 0.51 ± 0.23; 0.2 ± 0.02; 0.37 ± 0.14; 0.95 ± 0.26; 0.12 ± 0.03; 1.07 ± 0.57; 1.0 ± 0.12; 3.17 ± 1.36; 0.57 ± 0.21; 0.69 ± 0.05; 0.42 ± 0.04; 0.78 ± 0.34; 6.48 ± 0.40; 9.68 ± 3.29; 1.24 ± 0.07; 0.45 ± 0.1; 0.15 ± 0.05
Another residue in the P-loop, Lys164 in MptpB, is involved in substrate binding and is particularly important for dephosphorylation of PI(3,5)P2. Mutation of the cognate residues Lmo1800-Lys193 and LM1-Lys151 to Ala resulted in a significant loss of activity towards PI(3,5)P2, but the mutants retained 10% of the WT activity for PI(3)P (Figure 5B).
Having established the importance of the residues in positions 5 and 6, we then wanted to explore the importance of Lys in position 2 for substrate specificity. We mutated the position 2 residues in MptpB and related enzymes to Lys to see if we could convert a group 2 lipid phosphatase into a group 1 phosphatase with similar substrate specificity. We found that in MptpB, mutation of the residue in position 2, Phe161, to Lys results in a mutant enzyme with new activity for the di-phosphorylated substrates PI(3,4)P2 (8.5 nmoles min-1 mg-1) and PI(4,5)P2 (7.0 nmoles min-1 mg-1), and even with activity towards the tri-phosphorylated PI(3,4,5)P3 (2.2 nmoles min-1 mg-1) (Figure 5B). Note that the WT enzyme does not dephosphorylate any of these substrates.
Mutation of the equivalent position in Lmo1800 (Thr190) to Lys resulted in increased dephosphorylation of PI(3)P (1.4-fold), a 3-fold increase in activity towards PI(3,5)P2 and low activity for PI(3,4)P2 (0.2 nmoles min-1 mg-1). For LM1, mutation of Thr148 to Lys also increased activity for PI(3)P (1.5-fold) but not towards di- or tri-phosphorylated PIs, suggesting that other factors, possibly conformational arrangements in the active site of Lmo1800 and LM1, determine substrate preference.
In this study, we have used the P-loop signature of the bacterial lipid phosphatase MptpB as a "diagnostic motif" to identify related putative lipid phosphatases. We found a whole new family of predicted phosphatases in bacteria, with some instances in fungi and protozoa. Interestingly, no homologues were found in other organisms such as Archaea, plants or animals. These proteins have not previously been characterised functionally and at least one third of them are not annotated. The existence of such a large family of putative lipid phosphatases in bacteria is unexpected. Very few bacteria, with the exception of actinobacteria, produce inositols or phosphoinositides; therefore they generally lack the enzymes involved in inositol metabolism. On the other hand, some pathogenic bacteria secrete PI metabolising enzymes, like kinases and phosphatases, that they use to subvert PI signalling in the host cells, promoting virulence and survival in the infected cells [9, 10]. Although the biological role of the ALPs identified in this study remains unknown, it is interesting to note that many of the sequences belong to pathogenic microorganisms, such as Listeria, Yersinia, Klebsiella, Clostridium, Leishmania, Candida, and Aspergillus.
Phylogenetic analysis showed a number of clusters, distinct for the bacterial and fungal/protozoan species, and that ALPs are clearly different to classic mammalian lipid 3-ptases, MTMs and PTEN or the PFAs. Additionally, ALPs lack the typical lipid anchoring domains commonly found in mammalian lipid phosphatases.
Biochemical characterisation of selected sequences from the intracellular pathogens L. monocytogenes and L. major confirmed our prediction that these are active phosphatases, able to dephosphorylate phosphoinositides. The broader PI activity of ALPs contrasts both with the more restrictive specificity observed in MTMs/PTEN 3-ptases and with the capability to dephosphorylate di- and tri-phosphoinositides found in the SopB/IpgD 5-ptases.
Clear differences were observed in the activity profile between the two proteins in L. monocytogenes, Lmo1800 and Lmo1935. While Lmo1935 has preference for peptide substrates, Lmo1800 exhibits a similar activity profile as MptpB, with DSP activity and broad PI specificity. This suggests that the two Lmo proteins could have distinct functional roles and provides an explanation for the presence of the two genes in Listeria.
The catalytic function of critical P-loop residues in classic eukaryotic lipid phosphatases has previously been determined. In MTMs, position 6 in the P-loop is occupied by an Asp residue (HCSDGWDR), which functions as the general acid in catalysis. Similar findings were reported for MptpB. Here we established the importance of the conserved Asp in the ALP proteins, as mutation to Ala results in mutant enzymes with compromised activity, suggesting that this residue has a role in catalysis similar to that shown for MptpB and MTMs.
In PTEN, the presence of several basic residues (HCKAGKGR) facilitates the binding of highly negatively charged PI substrates and in particular of PI(3,4,5)P3, its preferred substrate. A similar motif, CKSXKDR, is also found in the SopB/IpgD inositol phosphatases and mammalian 4-ptases, which have a preference for poly-phosphate PIs. However, ALPs contain only two basic residues (Lys and Arg) in the P-loop and preferentially dephosphorylate mono-phosphorylated PIs, with no activity towards PI(3,4,5)P3. Here we have shown, firstly, that the conserved Lys in position 5 of the P-loop is important for efficient dephosphorylation of PI(3,5)P2 and is probably involved in substrate binding, consistent with the proposed role of the cognate Lys164 in MptpB and Lys128 in PTEN. Secondly, we showed that the number of basic residues is critical for dephosphorylation of poly-phosphate PIs. We mutated the residues in position 2 of the P-loop to Lys to mimic the extra basic residue in PTEN and SopB. Mutation of Phe161 in MptpB, Thr190 in Lmo1800 and Thr148 in LM1 to Lys resulted in enhanced activity towards mono- and di-phospho PIs in Lmo1800 and MptpB, and new activity towards PI(3,4,5)P3 for MptpB (Figure 5B). This is an important finding as, historically, the role of P-loop residues has been restricted to enzyme catalysis and catalytic efficiency. In this study we demonstrate that single amino acid substitutions can result in subtle changes in substrate preferences, which supports the notion that the active site loop sequence can be used as a diagnostic of substrate specificity. New insights regarding the structural constraints that determine substrate binding in this new family of phosphatases should be revealed when the three-dimensional structures are determined.
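The basic-residue argument can be illustrated with a short sketch that counts Lys and Arg in each P-loop and applies the position 2 substitution in silico. The helper names `count_basic` and `substitute` are hypothetical, positions are numbered from the catalytic Cys (Cys = 1) as in the text, and His is left out of the count because the text treats only Lys and Arg as the basic residues of the loop. The sketch says nothing about activity, which was measured experimentally.

```python
BASIC = set("KR")  # Lys and Arg; His is deliberately not counted (assumption following the text)


def count_basic(p_loop):
    """Number of Lys/Arg residues in a P-loop sequence."""
    return sum(residue in BASIC for residue in p_loop)


def substitute(p_loop, position, new_residue):
    """Replace one residue, numbering positions from the first Cys (Cys = position 1)."""
    index = p_loop.index("C") + position - 1
    return p_loop[:index] + new_residue + p_loop[index + 1:]


# P-loop sequences quoted in the text; Lmo1800 carries T instead of F in position 2
loops = {
    "MptpB": "HCFAGKDR",
    "Lmo1800": "HCTAGKDR",
    "PTEN": "HCKAGKGR",
    "SopB/IpgD": "NCKSGKDR",
}

for name, loop in loops.items():
    print(f"{name}: {loop} has {count_basic(loop)} basic residues")

# Position 2 to Lys, as in the F161K mutant of MptpB
mutant = substitute(loops["MptpB"], 2, "K")
print(f"MptpB position 2 -> Lys: {mutant} has {count_basic(mutant)} basic residues")
```

With this numbering the position 2 substitution raises the Lys/Arg count of the MptpB loop from two to three, the same count as the PTEN and SopB/IpgD loops.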
Overall, this new family of enzymes shows both distinct sequence features and biochemical characteristics. ALPs have a unique P-loop signature and exhibit broad PI specificity. In addition, they have low sequence homology and different domain organisation to classic lipid phosphatases. The biological role of these proteins remains to be elucidated, but their potential role in PI metabolism suggests interesting links with established mechanisms of pathogenesis in other microorganisms such as M. tuberculosis, Leishmania, Listeria, Salmonella, and Shigella [9, 33], and a potential to be exploited as targets in anti-infective treatments given their low homology to any other human phosphatases.
Two approaches were taken to identify MptpB-related sequences: a) Blast searches of SwissProt and TrEMBL using the full length sequence of MptpB and b) a ScanProsite search of SwissProt and TrEMBL http://www.expasy.ch/tools/scanprosite using the signature motif "HCXXGKDR". In the Blast searches the sequences selected had E-values between 10^-11 and 10^-131 (> 30% sequence identity). Multiple sequence alignments of MptpB-related phosphatases were performed using ClustalX and manually edited using the programs BioEdit (Hall T: Bioedit http://www.mbio.ncsu.edu/BioEdit/bioedit.html) and Cinema. Phylogenetic trees were calculated using a maximum likelihood program, RAxML, with 100 bootstrap replicates. Trees were produced using the program Treeview. Analysis of the domain architecture was performed using the InterProScan and SMART databases.
All the proteins used in the biochemical characterisation were produced in E. coli as recombinant proteins as described below:
The open reading frame of Rv0153c, encoding MptpB, was amplified from M. tuberculosis H37Rv DNA and cloned into a pET28a vector (Novagen) to generate an N-terminal His6-tagged expression construct. Site directed mutagenesis to generate the F161A, F161K, K164A and D165N mutants was carried out using the QuikChange kit (Stratagene). Each construct (WT and mutants) was transformed into E. coli BL21 (DE3) and expression was induced using an auto-induction method at 25°C for 16 h. His-tagged MptpB WT and mutants were purified by standard nickel affinity chromatography on a 5 ml HiTrap column (Amersham Bioscience) in binding buffer (50 mM Hepes, 500 mM NaCl, pH 7) and eluted with 300 mM imidazole.
The open reading frames encoding Lmo1800 and Lmo1935 were amplified from Listeria monocytogenes EGD-e DNA and cloned into pGEX-6P-1 (GE Healthcare) to generate N-terminal glutathione S-transferase (GST) tagged expression constructs. Site directed mutagenesis to generate the T190K, K193A and D194A mutants of Lmo1800 and the D220A mutant of Lmo1935 was carried out using QuikChange. Each construct (WT and mutants) was transformed into E. coli BL21 and expressed using an auto-induction method at 25°C for 16 h. The expressed GST-Lmo1800 and GST-Lmo1935 were purified by glutathione sepharose affinity chromatography. The supernatant from the bacterial lysate was loaded onto a 1 ml GST Trap HP column (GE Healthcare) in binding buffer (50 mM Hepes, 500 mM NaCl, pH 7) and the protein was recovered following on-column cleavage of the GST-tag with PreScission protease (GE Healthcare). For initial activity assays against phospho-peptides and PIs we used tag-less proteins (Lmo1800 WT and Lmo1935 WT). Subsequent assays used GST-tagged proteins (WT and mutants of Lmo1800 and Lmo1935) eluted from the glutathione sepharose column with 10 mM glutathione. No significant differences in activity were observed for the fusion proteins.
The initial pET14 construct of Leishmania major LmjF22.0250 (SP code Q4QBX1) was a kind gift from Wesley Van Voorhis, Fred Buckner and Erin Quartley of the Structural Genomics of Pathogenic Protozoa consortium. This construct contained a frame shift, which was subsequently removed by site directed mutagenesis. Site directed mutagenesis to generate the T148K, K151A and D152A mutants was carried out using QuikChange. Each construct (WT and mutants) was transformed into E. coli BL21 (DE3) and expression was induced using an auto-induction method at 25°C for 16 h. The expressed His6-LM1 WT and mutants were purified by nickel affinity chromatography. The supernatant from the bacterial lysate was loaded onto a 1 ml HiTrap column (Amersham Bioscience) in binding buffer (50 mM Hepes, 500 mM NaCl, pH 7) and the protein was eluted with 300 mM imidazole.
Tb PTP1 was amplified from T. brucei DNA and cloned into pET28a (Novagen) as previously reported [42, 15]. Recombinant Tb PTP1 was expressed in E. coli strain BL21 DE3 Codon+ RIPL, grown in LB broth at 37°C and induced at 30°C with 0.4 mM IPTG. Purification of His-tagged Tb PTP1 was performed by nickel affinity chromatography using the same method as for MptpB.
The malachite green assay was used to determine the amount of free phosphate during the dephosphorylation assays with a range of substrates: phospho-Tyr peptides from EGFR (DADEpYLIPQQG) and insulin receptor (TRDIpYETDYYRK), phospho-Ser peptide (RRApSVA), phospho-Thr peptide (KRpTIRR) (Alta Bioscience, University of Birmingham, http://www.altabioscience.bham.ac.uk), pNPP (Sigma), the PIs diC8-PI(3)P, diC8-PI(3,4)P2, diC8-PI(3,5)P2, diC8-PI(4)P, diC8-PI(4,5)P2, diC8-PI(5)P and diC8-PI(3,4,5)P3 (Echelon Bioscience), and phospho-amino acids, adenosine 5'-monophosphate, inosine 5'-monophosphate, phosphorylcholine chloride, phosphorylethanolamine, glycerol 2-phosphate and sodium pyrophosphate (Sigma). Each reaction was prepared in triplicate in a 96-well microplate, containing 50 μl of reaction mix with 5 μg enzyme (MptpB, Lmo1800, Lmo1935, LM1) or 20 μg GST-Lmo enzymes in buffer (50 mM Tris, 50 mM BisTris, 100 mM sodium acetate, pH 6) and 100-125 μM substrate. The reactions were incubated for 15 minutes at 37°C prior to the addition of 50 μl malachite green reagent (Sigma) and further incubated for 10 minutes at room temperature. The absorbance was subsequently read at 620 nm and the mean calculated. Control reactions containing no enzyme were included to measure the background level of phosphate. A phosphate standard curve was produced using known amounts of phosphate (25-3000 pmoles of Sigma phosphate standard solution). Experimental points were interpolated on the standard curve to calculate the amount of phosphate released, which was then used to calculate the specific activity (SA, nmoles min-1 mg-1).
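The conversion from absorbance to specific activity can be sketched as follows. This is an illustration under stated assumptions, not the authors' analysis: it assumes the standard curve is linear over the range used, the absorbance readings are invented for the example, and the function names `fit_line` and `specific_activity` are hypothetical. Only the 15 minute incubation, the 5 μg enzyme load and the 25-3000 pmol standard range come from the text.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def specific_activity(a620_sample, a620_blank, slope, intercept,
                      minutes=15.0, enzyme_ug=5.0):
    """Specific activity in nmol phosphate per minute per mg enzyme.

    Both readings are converted to pmol phosphate on the standard curve;
    the no-enzyme control is then subtracted as background.
    """
    pmol_sample = (a620_sample - intercept) / slope
    pmol_blank = (a620_blank - intercept) / slope
    released_nmol = (pmol_sample - pmol_blank) / 1000.0
    enzyme_mg = enzyme_ug / 1000.0
    return released_nmol / (minutes * enzyme_mg)


# Hypothetical standard curve: pmol phosphate versus mean A620
standards_pmol = [25, 250, 1000, 3000]
standards_a620 = [0.0575, 0.125, 0.35, 0.95]
slope, intercept = fit_line(standards_pmol, standards_a620)

# Hypothetical mean readings for one enzyme/substrate pair and its no-enzyme control
sa = specific_activity(a620_sample=0.50, a620_blank=0.08,
                       slope=slope, intercept=intercept)
print(f"Specific activity: {sa:.2f} nmoles min-1 mg-1")
```

For the GST-tagged enzymes the same function would be called with `enzyme_ug=20.0`, matching the 20 μg load stated above.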
PTP: protein tyrosine phosphatase
ALP: atypical lipid phosphatase
PTEN: phosphatase and tensin homologue deleted on chromosome 10
pNPP: p-nitrophenol phosphate
EGFR: epidermal growth factor receptor
We would like to thank Wesley Van Voorhis, Fred Buckner and Erin Quartley of the Structural Genomics of Protozoan Parasites consortium for their generous gift of the original Leishmania plasmid, Balazs Szöor for the TbPTP1 clone, and Simon Whelan and Daniel Money for their valuable support with RAxML. This study was supported by the MRC (ref. G0701233) and STFC with a CASE award to N. B. and in part by BBSRC (BB/C516828/1).
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.