- Research article
- Open Access
Comparative analysis of the kinomes of three pathogenic trypanosomatids: Leishmania major, Trypanosoma brucei and Trypanosoma cruzi
BMC Genomicsvolume 6, Article number: 127 (2005)
The trypanosomatids Leishmania major, Trypanosoma brucei and Trypanosoma cruzi cause some of the most debilitating diseases of humankind: cutaneous leishmaniasis, African sleeping sickness, and Chagas disease. These protozoa possess complex life cycles that involve development in mammalian and insect hosts, and a tightly coordinated cell cycle ensures propagation of the highly polarized cells. However, the ways in which the parasites respond to their environment and coordinate intracellular processes are poorly understood. As a part of an effort to understand parasite signaling functions, we report the results of a genome-wide analysis of protein kinases (PKs) of these three trypanosomatids.
Bioinformatic searches of the trypanosomatid genomes for eukaryotic PKs (ePKs) and atypical PKs (aPKs) revealed a total of 176 PKs in T. brucei, 190 in T. cruzi and 199 in L. major, most of which are orthologous across the three species. This is approximately 30% of the number in the human host and double that of the malaria parasite, Plasmodium falciparum. The representation of various groups of ePKs differs significantly as compared to humans: trypanosomatids lack receptor-linked tyrosine and tyrosine kinase-like kinases, although they do possess dual-specificity kinases. A relative expansion of the CMGC, STE and NEK groups has occurred. A large number of unique ePKs show no strong affinity to any known group. The trypanosomatids possess few ePKs with predicted transmembrane domains, suggesting that receptor ePKs are rare. Accessory Pfam domains, which are frequently present in human ePKs, are uncommon in trypanosomatid ePKs.
Trypanosomatids possess a large set of PKs, comprising approximately 2% of each genome, suggesting a key role for phosphorylation in parasite biology. Whilst it was possible to place most of the trypanosomatid ePKs into the seven established groups using bioinformatic analyses, it has not been possible to ascribe function based solely on sequence similarity. Hence the connection of stimuli to protein phosphorylation networks remains enigmatic. The presence of numerous PKs with significant sequence similarity to known drug targets, as well as a large number of unusual kinases that might represent novel targets, strongly argue for functional analysis of these molecules.
Trypanosomatid pathogens of humans include Trypanosoma brucei, Trypanosoma cruzi and Leishmania major, causative agents of African sleeping sickness, Chagas disease, and cutaneous leishmaniasis respectively . Trypanosoma brucei lives extracellularly in the human host, primarily in the bloodstream and cerebrospinal fluid. African sleeping sickness, which is estimated to afflict 300,000–500,000 people per year in sub-Saharan Africa, with a disease burden of 1.6 million disability adjusted life years (DALYs), is invariably fatal unless treated . Trypanosoma cruzi, which is found in Latin America, results in a disease burden of 650,000 DALYs. This parasite can invade most types of nucleated cells. About 30% of infected individuals progress to a chronic phase that culminates in heart disease and mega syndrome . Of those infected it is estimated the 50,000 will die each year. Leishmania parasites result in a disease burden of 2.3 million DALYs, with greater than 80,000 deaths/year and cause a variety of diseases depending on the infecting species. The most dangerous manifestation is the visceral disease known as kala azar, caused by L. donovani. Kala azar is re-emerging in India in a particularly aggressive form that is resistant to standard treatment . No vaccine has been approved for any of these diseases and many of the drugs in use are highly toxic and prone to the development of drug resistance. There is therefore an urgent need to identify new drug targets and the recent completion of the genome sequence of the three model trypanosomatids, T. brucei, T. cruzi and L. major, can be exploited in this regard [5–7].
During development the parasites pass through different environments. Each species is carried by a different insect vector, in which the parasite undergoes specific developmental changes that allow it to infect the human host. For example, Leishmania parasites move from the sandfly midgut up to the mouthparts, then into the human host where they invade macrophages and live within a phagolysosome. In each environment, the parasites respond with significant changes in their metabolic and protein profile. The signal transduction pathways mediating these changes remain unknown. Only a few receptor-like proteins have been identified, primarily receptor adenylate cyclases with an extracellular putative ligand binding domain and an intracellular catalytic domain [8, 9]. Intermediate steps of signal transduction in the parasites have not been defined, although genomic analysis shows that they possess numerous molecules predicted to bind second messengers, as well as protein kinases and phosphatases . The culmination of the signaling pathways is unlikely to be at the level of transcription, since most genes are transcribed in polycistronic units with little evidence for regulation [7, 10, 11]. Many changes in protein phosphorylation during the parasite developmental cycles have been documented [12–14]. The parasites also possess an integrated cell cycle that coordinates the inheritance of the single mitochondrion, flagellum, and nucleus [15, 16].
Protein kinases (PKs) are key mediators of signal transduction, transmitting environmental cues and coordinating intracellular processes. Eukaryotic protein kinases (ePKs) are categorized by the amino acid sequence of their catalytic domains. Broadly, ePKs fall into two superfamilies: protein serine/threonine kinases and protein tyrosine kinases. The former are ubiquitous in eukaryotes. The latter are present in all metazoa for which the genome sequence is available, but relatively few examples have been found in unicellular eukaryotes [17, 18]. However, protein tyrosine phosphorylation has been well documented in trypanosomatids [13, 14, 19, 20]. Mammalian receptor protein kinases are generally tyrosine kinases , while all known plant receptor kinases are serine/threonine kinases [22, 23]. Receptor kinases are activated by ligands, facilitating intercellular communication within multicellular organisms. Parasites that live in multicellular hosts could conceivably use similar mechanisms to respond to host or parasite ligands, although such reactions have not been defined at the molecular level.
Six major groups of ePKs have been defined on the basis of sequence similarity of the catalytic domains: AGC, CAMK, CMGC, TK, TKL, STE . ePKs that do not fall into these groups are categorized as "Other". Within each group (including "Other"), multiple families have been defined. Interestingly, the substrate preferences break into groups along the same lines: for example AGC and CAMK kinases tend to phosphorylate motifs containing basic residues, CMGC kinases often are proline-directed, while CK1 and CK2 kinases phosphorylate motifs with acidic residues . Additional features that correlate with group assignments include responses to other mediators such as ligands (receptor TK), calcium (CAMK), and certain second messengers (AGC). Relatively few protein kinases have been studied in detail with respect to expression and function in each of the trypanosomatids (see Additional file 1).
Atypical PKs (aPKs) are not closely related to ePKs at the sequence level, lacking the 11 subdomains that define ePKs. They include a variety of molecules that have been shown to have protein kinase catalytic activity in specific systems. Among the aPKs, the most well-characterized are the PIKK kinases, which have catalytic domains resembling those of lipid kinases in sequence . Interestingly, the RIO and alpha groups show remnants of many of the ePK subdomain motifs [26, 27]. The other atypical kinases require further study for definitive analysis of their activity.
An analysis of partial genomic sequence suggested that trypanosomatids might differ considerably from the host in signaling mechanisms, lacking typical signaling receptors with the exception of adenylate cyclases, as well as SH2 domains and transcription factors . These speculations have been borne out by the completed genomic sequences of the three trypanosomatids known as the TriTryps: T. brucei , T. cruzi , and L. major , as briefly discussed in the T. cruzi genome paper . In this report, we present a detailed examination of the TriTryp kinome.
Results and discussion
The TriTryp kinome
To identify all protein kinase genes in the three trypanosomatid genomes, we searched GeneDB  for all genes bearing Pfam protein kinase domains, as well as by BLAST using representatives of all major protein kinase gene families, including aPKs. All ePKs were examined for the presence of the 11 characterized subdomains, and specifically for the presence of the key lysine in subdomain 2 and aspartic acids in subdomains 6 and 7. The genomic analysis revealed 179, 156, and 171 ePKs and 17, 20, 19 aPKs in L. major, T. brucei and T. cruzi respectively. These numbers suggest that phosphorylation is an important mechanism for cellular regulation in all three trypanosomatids and are considerably larger than that described for another intracellular parasite that transits diverse environments; Plasmodium falciparum. P. falciparum possesses 65 ePKs and 20 ePK-related sequences, designated FIKK [30–32]. The latter have not yet been shown to have protein kinase activity . The activation of many ePKs requires phosphorylation in the activation loop between subdomains 7 and 8. These kinases are typically marked by an RD motif within subdomain 6 . In T. brucei, 130 of the 156 ePKs are RD kinases, further supporting the concept that phosphorylation networks are complex and important in these organisms.
We examined the relationship of the trypanosomatid ePKs to the groups and families of kinase domains of human, worm, fly, and yeast using the available datasets . Most ePKs had a highly significant BLAST score against at least one member of the 4-kinome dataset. For example, 58% of the T. cruzi ePKs had an E-value of at least 10-40, and 77% had a score of at least 10-30 against a member of this dataset (Figure 1). Based on BLAST E-values, as well as phylogenetic inference (see below), assignments to ePK groups and families were made (Table 1, Additional file 1). We generated phylogenetic trees of the kinase domains of the entire T. brucei ePK kinome, seeding the tree with human and yeast PKs to facilitate classification (Figure 2 shows the MRBAYES tree). The trees were generally consistent with the BLAST assignments (the few that did not match are marked by an asterisk in the tree). Of note are several unique kinases that are on long branches originating near the center of the tree, indicating their high divergence from other ePKs.
In general, the TriTryp kinomes are closely related. COGS are c lusters of o rthologous g enes, as revealed by analysis of mutual BLASTP hits across the genomes. The majority (68%) of the ePK genes reside in COGS that contain members from each of the three species (Additional file 1). Conversely, only a small number of genes appear to be unique to a species (20 in L. major, 11 in T. cruzi, and 3 in T. brucei, Additional file 1). As with other genes in these organisms, members of these COGs are generally syntenic among the three species, furthering the concept that the molecules are orthologous. Figure 3 compares the representation in the major groups and families of human protein kinases with those of L. major and Table 1 shows the representation of groups and families among the TriTryps, with further details including systematic gene names provided in Additional file 1.
Protein tyrosine kinases
A key difference between host and parasite kinomes is the complete lack of ePKs that map to the tyrosine kinase (TK) and tyrosine kinase-like (TKL) groups in the trypanosomatids. Representatives of the former in humans include receptor protein kinases such as the insulin receptor and cytosolic kinases such as src. The latter group contains ePKs such as RAF1 and TGFβR2. We also found no evidence of the receptor guanylyl cyclase (RGC) group of proteins, which are structurally related to protein kinases. These groups of ePKs are also absent in malaria parasites, which further lack the STE group of kinases. Interestingly, it has been reported recently that the genome of the unicellular protist Entamoeba histolytica encodes TKs with SH2 domains, TKLs, and a large family of putative receptor serine-threonine ePKs . As E. histolytica also possesses genes encoding putative 7 transmembrane receptors and heterotrimeric G proteins, which the trypanosomatids lack , the mechanisms regulating cell signaling appear to be very different among the parasitic protozoa.
As noted above, phosphorylation on tyrosine is well documented in trypanosomatids. We propose that this activity is likely to be due to the action of atypical tyrosine kinases such as Wee1 and dual-specificity kinases that can phosphorylate serine, threonine, and tyrosine. Multiple members of the dual specificity kinase families (DYRKs, CLKs, and STE7) are present in the trypanosomatid genomes. Although Wee1 is functionally a tyrosine kinase, it most closely resembles serine/threonine kinases such as Chk1 and cAMP-dependent kinases in structure and primary amino acid sequence . In yeast and higher eukaryotes Wee1 phosphorylates a conserved tyrosine residue in the ATP binding pocket of CDK1 (cdc2), inactivating the protein kinase. This mechanism is likely to be conserved in the three trypanosomatids, since there are two Wee1 family members in L. major and T. cruzi and one in T. brucei. In addition, CRK3, the putative functional CDK1 homologue in trypanosomatids [36–38], contains a conserved tyrosine residue in the same subdomain as the human CDK1 regulatory tyrosine [39, 40]. In T. brucei, 18 other CMGC members also have this tyrosine within subdomain 1 (Additional file 2), reiterating the potential for widespread regulation of protein kinase activity via tyrosine phosphorylation. The potential conservation in regulatory mechanisms for CDK activity between yeast, mammals and trypanosomatids may not extend to all protozoa, as the putative P. falciparum Wee1 lacks a key active site residue suggesting it may not be active  and dual-specificity ePKs appear to be absent. In addition, no tyrosine phosphorylation has been demonstrated to date in that species. The existence of other unusual protein tyrosine kinases in trypanosomatids is an intriguing possibility given the large number of protein kinases in the trypanosomatid kinomes that cannot be easily placed into typical ePK groups or families (see below).
Serine-threonine protein kinases
Poorly represented groups: CAMK and AGC
The CAMK and AGC groups are relatively poorly represented within trypanosomatid genomes as compared to humans. The CAMK group (which includes the Ca+2/calmodulin regulated kinases and AMP-dependent kinase, AMPK) is small in trypanosomatids with only 13 CAMKs predicted to be active in T. cruzi, 14 in T. brucei, and 16 in L. major. In contrast, the human genome encodes 74 CAMKs. A phylogenetic tree of the trypanosomatid CAMK and CAMK-like unique kinase domains is shown in Figure 4. Also included are along with representatives of each family of CAMKs from human, two yeast CAMKs and a plasmodial calcium dependent CAMK. Of the 19 trypanosomatid CAMK genes identified, 13 have representatives in each species as determined by COG analysis, and supported by phylogenetic trees. An additional CAMK-like kinase, marked as unique due to its low E-value in BLAST analysis, was also conserved, as was a COG in which two of the three orthologues were predicted to be inactive. The tree shown in Figure 4 also shows a characteristic common to all trees in which groups of ePKs from the trypanosomatids were compared: the trypanosomatid ePKs falling within COGS formed a tight cluster with high confidence, and were more distantly related to ePKs from humans or yeast.
BLAST analysis against the 4-kinome dataset indicated that about half of the trypanosomatid CAMKs belong to the CAMKL subfamily, the remainder were not assigned to a specific family. The phylogenetic trees generally agreed with these predictions, and supported the further classification of two sets of trypanosomatid genes as members of the AMPK subfamily. In other organisms, AMPKs are regulated by AMP and hence are involved in metabolic sensing . In addition, two sets of the predicted CAMK kinases contain EF hand sequences, which may provide for sensitivity to Ca+2 (marked in Figure 4). This juxtaposition of a protein kinase domain with EF hand motifs is characteristic of CDPKs, a group of calcium dependent protein kinases that are prominent in plants  and in P. falciparum [30, 31, 43], but which are absent in humans and yeast. However, the phylogenetic inference does not support the clustering of these trypanosomatid CAMKs with the plasmodial calcium dependent kinase CDPK1. These trypanosomatid genes are likely therefore to encode a novel class of EF-hand containing ePKs.
The AGC group includes ePKs structurally related to protein kinases that respond to second messengers: protein kinase A (responsive to cAMP), protein kinase G (responsive to cGMP), and protein kinase C (responsive to diacyl glycerol). Normalized to kinome size, trypanosomatids have approximately half as many AGC kinases as humans. The parasite genomes encode 3 AGC kinases that are related to PKA. However, T. brucei PKA appears to be activated by cGMP rather than cAMP . Also within the AGC group are the NDR kinases. BLAST analysis and phylogenetic tree inference indicates that T. brucei possesses two NDR family kinases. One of these is conserved and syntenic among the trypanosomatids, while the other, PK50 (Tb10.70.2260), is specific to T. brucei. This molecule is a functional homologue of Schizosaccharomyces pombe Orb6  and interacts with MOB1 to form an active kinase complex that has a potential role in cytokinesis, but not mitosis . Whether the conserved NDR kinases also interact with MOB1 is not yet known. The remainder of the AGC kinases could not be assigned to a specific family by sequence alone, except for one RSK-like sequence. Phylogenetic inference of the T. brucei sequences carried out as detailed in Methods supports these general conclusions and provided no indication of trypanosomatid-specific clusters (data not shown).
Over-represented groups: CMGC and STE
The CMGC group and the STE group are relatively well represented within these trypanosomatid genomes as compared to humans. Examples of CMGC kinases include ePKs such as cyclin-dependent kinases (CDKs), MAP kinases (MAPKs), and dual specificity CLK and DYRK kinases. Trypanosomatids have a large number of these kinases (e.g., 45 in L. major as compared to 61 in humans). All of the CMGC families identified in humans are also represented in trypanosomatids, as indicated by BLAST analysis and phylogenetic inference. The CDK family is relatively large in trypanosomatids with 11 members in T. brucei and L. major and 10 in T. cruzi. This complexity may reflect the problem of dividing a highly polarized cell with an elaborate cytoskeleton and a single mitochondrion, along with an integral link between cell cycle control and life cycle differentiation. Despite the existence of a large number of CDK family members (named CRK for c dc2-r elated k inase), only 2 have been shown to be essential for cell cycle progression in trypanosomatids. CRK3 in complex with the CYC6 mitotic cyclin is essential for G2/M phase progression and is the functional homologue of CDK1 [36–38, 47]. CRK3 in complex with CYC2 is essential for G1 progression [48, 49]. A PHO80-like cyclin and a B-type cyclin control the cell cycle of the procyclic form of Trypanosoma brucei , while TbCRK1 is also an essential gene required for G1 phase progression [38, 47, 51, 52]. However, the roles of CRKs in the cell cycle is complex, with functional differences between bloodstream and procyclic form T. brucei as revealed by RNAi knockdown studies [37, 38, 47, 48]. CRK7 has the highest level of sequence identity to CDK7 of mammals. CDK7, in complex with cyclin H and MAT1, is a CDK-activating kinase (CAK) that phosphorylates the T-residue of CDKs (e.g., T160 of human CDK1). No cyclin H or MAT1 orthologues can be identified in trypanosomatids based on sequence, so it remains to be determined if CRK7 is a functional cyclin-dependent kinase or indeed if it has CAK activity. However, many CRKs, including CRK1, 2, 3, 6, 7, 8, 9 and 12, have a conserved T-loop residue, suggesting that the CRKs might be activated in vivo by a CAK activity .
Interestingly, T. cruzi possesses a large number of genes encoding CRK7 isoforms (counted only as one unique gene in our analyses). These are dispersed near the telomeres of many chromosomes, being adjacent to a retrotransposon hotspot protein gene. Of the 27 sequences identified, most contain all of the catalytic residues, although a few are truncated. The biological significance of this gene amplification is not known, however expansion of gene families within subtelomeric regions of trypanosomatid chromosomes is a feature of these genomes in general.
Two families of CMGC kinases phosphorylate serine/arginine rich motifs in serine-arginine rich SR proteins, which function in RNA processing and splicing in many higher organisms: SRPKs  and the dual specificity CLKs . Two SRPKs  and four or five CLKs are encoded within each trypanosomatid genome. Given the major role of RNA processing and turnover in modulating gene expression in trypanosomatids, these families of CMGC kinases may be of key interest in studying parasite gene regulation. Two GSKs, which are drug targets in diabetes and neurological diseases , are also present.
A large number of MAPK-related genes are also present in trypanosomatids, possibly reflecting a role of 3-component MAP kinase cascades in coordinating responses to environmental cues (Table 1). Among these genes are those which are most closely related in sequence to the MAPK family, and those which are most closely related to the CDKL and RCK families. These latter families possess the residues characteristic for the regulation of MAPKs and hence are considered to be part of a MAPK superfamily by some authors , even though they are more similar in sequence to CDKs. Among the identified MAPK-like predicted proteins, two sets lack the predicted regulatory motifs (LmjF13.0780 and its T. cruzi orthologue; LmjF03.210 and its T. brucei and T. cruzi orthologues, Table S1) and hence must be regulated in a distinct manner. Thus the total complement of protein kinases likely to be regulated as MAPKs numbers 14 in T. brucei, 13 in T. cruzi, and 15 in L. major.
The parasites clearly find themselves in environments that vary substantially in temperature, pH, nutrients, and stresses during their developmental cycle. An elaborate phosphorylation signaling system to respond to those changes may be a key strategy of this group of organisms. Many MAPKs, and those kinases likely to regulate them (see below), appear to be involved in developmentally regulated processes in trypanosomatids. Nine MAPKs have been cloned and analyzed from L. mexicana (LmxMPK1-9) and their mRNAs abundances are developmentally regulated [58, 59]. LmxMPK1 is essential for amastigote, but not promastigote proliferation , while LmxMPK9 is involved in regulating flagellar length, a stage-regulated function in Leishmania . Three MAPKs have been analyzed in T. brucei. KFR1, an ERK-like MAPK, has been proposed to be involved in the proliferation of bloodstream form trypanosomes and is the first trypanosomatid ePK reported to be regulated by a specific extracellular molecule, interferon γ [61, 62]. TbMAPK2, also ERK-like, is not essential for proliferation of the bloodstream form trypanosome, but is important for successful differentiation . Mutants lacking TbMAPK2 have delayed kinetics of differentiation from the bloodstream form to the procyclic form; the resulting procyclic forms undergo cell cycle arrest. TbECK1, which has characteristics of both MAPKs and CDKs, and was named T. brucei E RK-like, C DK-like protein k inase , falls into the CDKL family by phylogenetic analysis (this study). This kinase appears to be essential in all life cycle stages analyzed . TbECK1 has an unusual C-terminal extension and overexpression of TbECK1 lacking the C-terminal extension in procyclic trypanosomes leads to a significant reduction in growth, suggesting an important role in cell cycle control. The C-terminal extension appears to act as a cis-acting negative regulator of protein kinase. The roles of many trypanosomatid MAPKs remain to be explored.
MAPKs are activated by phosphorylation within the activation loop, typically both on a tyrosine and a threonine. This phosphorylation is mediated by MAP kinase kinases (MAP2Ks), which are members of the STE7 family, one of the three major families of STE group kinases that are generally described as upstream regulators of MAP kinase cascades. Although only two STE7 genes were assigned through BLAST analysis, phylogenetic inference revealed that five sets of orthologues cluster with good confidence into the STE7 family (Figure 5), suggesting that they may function as MAP2Ks in this organism. A previously identified Leishmania mexicana MAP2K, LmxPK4, has a potential role in parasite differentiation , while another LmxMKK, is involved in the maintenance of flagellar length . STE11 family ePKs often function as MAP3Ks and are especially numerous in the trypanosomatids. Several of the STE11 kinases formed trypanosomatid-specific clusters in our phylogenetic analyses. Another cluster was found to be ubiquitous amongst the trypanosomatid, yeast and human kinomes. LmMRK1 (LmjF32.0120) is an essential STE11 family kinase . In contrast to STE11, the STE20 family kinases, many of which function as MAP4Ks, are relatively rare in trypanosomatids. Another arm of the MAPK activation pathways is mediated by RAF1, a TKL kinase. The TKL group of protein kinases is absent in trypanosomatids. It is clear that sequence data alone cannot accurately predict specific three-component signaling pathways in the trypanosomatids – detailed biochemical analyses will be required. Nonetheless, taken together, these findings provide interesting insight into trypanosomatid-specific aspects of MAP kinase cascades. The STE group of kinases is relatively expanded in trypanosomatids, with 34 members in L. major. In contrast, it is either absent or highly abbreviated in the malaria parasite [30, 31], once again highlighting the differences amongst protozoan lineages.
Other serine/threonine kinases
The NEK family of ePKs shows a significant expansion within trypanosomatids, having 20–22 members (compared to the 15 representatives in the human genome). The NEK kinases have been relatively little studied in model systems, but several appear to be involved in cell cycle  and cytoskeletal functions [69, 70]. Some of the NEK kinases appear to function in cascades, with human NEK9 phosphorylating and activating NEK6 and NEK7 . Indeed, all of the T. brucei NEK kinases possess the RD motif in subdomain 6, which is an indicator that phosphorylation in the activation loop is likely to be required for maximal activity. As with most of the human NEK kinases, the catalytic domain is situated at or near the N-terminus of the T. brucei NEK kinases. Phylogenetic analysis of the 20 T. brucei NEK kinases shows that the parasite kinases do not form tight clusters with the NEK kinases represented in the 4-kinome database nor with the NEK kinases of the protozoan P. falciparum (Figure 6), although in two of three phylogenetic methods implemented (MRBAYES and PHYML Likelihood), one of the T. brucei NEK kinases (Tb06.2N9.460) did cluster with a plasmodial kinase (MAL6P1.56). In contrast, several clusters of NEK kinases across yeast and metazoa were identified: e.g., ScKIN3, DmNEK2 and HsNEK2 form a highly supported clade, as do HsNEK10 and CePQN25. Of particular interest to us was the identification of a trypanosomatid-specific clade containing 12 of the T. brucei NEK kinases, which was supported by all of the methodologies. The trypanosomatid NEK kinases have perhaps a modestly higher preponderance of accessory domains compared to other trypanosomatid kinases (see below). For example, several possess a coiled coil region downstream of the catalytic domain (Tb03.27C5.650, Tb05.26K5.430, Tb10.61.2330, and Tb10.70.7860). This feature is also found in human NEK1 and NEK2. Several other trypanosomatid NEK kinases have a C-terminal PH domain, a combination not described in the NEK kinases of other species. These kinases lie within the trypanosomatid-specific clade. The roles of the trypanosomatid NEK kinases have not been studied in any detail, although at least one is known to be developmentally regulated  and one has a role in basal body duplication (D. Robinson, personal communication).
Among the families of "Other" protein kinases represented in trypanosomatids, several have been shown to be involved in cell division in various organisms (AUR, Aurora; PLK, polo-like kinases; Wee1) DNA replication/repair (TLK, also ATM/ATR atypical kinases described below), and stress responses (PEK family). Activators of CAMKs (CAMK kinases, CAMKK) are also present, as are multiple CK1 and CK2 isoforms (formerly known as casein kinase I and II) [73, 74]. A member of the VPS15 family (involved in v acuolar p rotein s orting), was also identified, although the Leishmania orthologue may not be catalytically active. One representative of the ULK family kinases was found in each trypanosomatid. ULK kinases are involved in autophagy in yeast  and in pattern formation and development in multicellular organisms .
A significant number of ePKs were classified as unique, as they showed no clear affinity to any known group or family within the 4-kinome dataset. For example, a number of T. cruzi ePKs which had significant matches to the protein kinase Pfam domain signature (Pfam 00069) did not show any distinct similarity to specific kinases in the 4-kinome dataset. Of this group, half had E-values of 10-35 or better against the Pfam domain (Figure 7). On the other hand, approximately one-third of the T. cruzi unique kinases showed relatively poor matches against the Pfam domain (E-values ≤ 10-16), but nonetheless were observed to possess a complete subdomain structure as well as the required catalytic residues. The ePKs classified as unique were the least conserved among the trypanosomatids, with 63% being absent in at least one of the three species. As such, the unique kinases are likely to represent instances of lineage-specific evolution defined by gene gain and/or loss in these organisms. Such divergent kinases may provide a set of useful protein kinase drug targets, since they have no closely related homologues in the host.
Membrane kinases interfacing with the environment?
Most mammalian receptor kinases belong to the tyrosine kinase group, a group which is lacking in trypanosomatids. However, in plants, most receptor kinases are serine/threonine kinases. Bearing this in mind, we searched the T. brucei genome for genes bearing the protein kinase Pfam domain plus the annotation of a transmembrane domain. Ten candidates fit the criteria (see Additional file 3), these were spread among a variety of ePK groups, with a somewhat higher representation among the STE kinases. At this juncture, there is no evidence that any domain of these molecules is displayed on the parasite surface, where it might respond to host or parasite derived ligands. Alternatively, if surface-localized, the kinase could phosphorylate host or parasite molecules to modify their environment. We note with interest previous reports of an ectokinase with a substrate profile characteristic of CK1 in Leishmania [77, 78]. Intriguingly, one of the L. major CK1 genes identified in this analysis encodes a protein with a predicted signal anchor sequence (LmjF17.1780). Assessing whether any parasite protein kinases interface with the host environment is an important arena for future experimental studies.
Inactive protein kinases
Approximately 8% of the ePKs of each species are predicted to be catalytically inactive, based on the presence of mutations in essential residues (K in subdomain 2 and D in subdomains 6 and 7). Most of these possess an orthologue in at least one other trypanosomatids. Of the 13 T. brucei ePKs predicted to be catalytically inactive, 11 are mutated to a predicted non-catalytic form in each of the three species. Genome-wide, the level of amino acid sequence identity among COG members averages 61 +/- 7% between T. brucei and T. cruzi , with a similar level of identity for a sampling of ePKs (60% +/- 7%). The ePKs predicted to be inactive show a lower level of identity at 44% +/- 8%. Hence, while conserved, these sequences are somewhat more divergent across species.
We also estimated the synonymous (Ks) and nonsynonymous nucleotide (Ka) nucleotide substitution rates in T. brucei versus T. cruzi genes encoding ePKs predicted to be catalytically active or inactive. The Ka/Ks ratio (sometimes designated as dN/dS) can reflect the selective constraints on a gene. Ka/Ks = 1 is expected for genes evolving neutrally. Ka/Ks < 1 is thought to indicate selection to remove amino acid replacements. In the rare cases where the Ka/Ks > 1, selection for amino acid divergence is usually invoked. For a random subset of 90 active ePKs, Ka = 0.336, Ks = 4.822 and Ka/Ks = 0.077. For the inactive ePKs, these figures were 0.535, 9.639, and 0.110, respectively. These data indicate that in both sets synonymous mutations are highly preferred. Nonetheless, the Ka and Ka/Ks were significantly different in the active versus inactive datasets (p = 0.0003 and p = 0.0045, Mann-Whitney U-test, two tailed). There was no statistically significant difference in the calculated Ks scores between these two datasets (p = 0.3). These findings suggest that the encoded proteins continue to play a significant functional role within the organisms, although the predicted lack of catalytic activity indicates this role is likely to be via a distinct mechanism, such as regulation via protein-protein interaction. Indeed, a recent analysis has shown that inactive protein kinases are not an exception in metazoa and that a few have evolved novel functions, some of which might be involved in processes that enhance the complexity of regulatory phosphorylation networks .
A characteristic of human ePKs is the presence of accessory domains. Indeed, over 50% of human protein kinases have additional Pfam domains, and more are found when criteria are relaxed . We examined all of the trypanosomatid ePKs for significant matches to additional Pfam domains (Table 2). In the case of L. major, only 25 ePKs possessed additional Pfam domains that met the default cutoff. Three additional ePKs, which had orthologues in T. brucei or T. cruzi that had a significant Pfam domain, were found to possess partial motifs, and others had domains of lower significance. The accessory domains were generally conserved among members of a COG. Notably four of the five most common Pfam domains on human ePKs are absent in the trypanosomatid kinome: Ig, fn3, SH2, and SH3. Ig and fn3 domains are generally extracellular domains that interact with ligands, so their absence may not be surprising given the paucity (or absence) of receptor ePKs in trypanosomatids. SH2 domains interact with phosphotyrosine, and their absence in the trypanosomatid genomes could suggest a co-evolution with dedicated tyrosine kinases. SH3 domains bind to proline rich sequences.
Several unusual domain combinations are found in trypanosomatid protein kinases (Table 2, examples shown in Figure 8). A search of eight eukaryotic kinomes using the KinG Kinases in Genomes Resource  revealed that some accessory domains found in trypanosomatids that were not associated with ePK catalytic domains in other species. For example, LmjF35.4000, which is a Leishmania-specific gene, contains a unqiue ePK catalytic domain, along with three TPR motifs (which are present on some plant receptor kinases) along with a domain associated with ubiquitin transferase (HECT), a domain not seen in the sampled genomes. A T. cruzi ePK, Tc00.1047053511727.210, has a TPR motif and HAMP domain (found on various signaling proteins including histidine kinases), both of which are recognizable on the T. brucei orthologue, but not on the L. major orthologue. Others domains are associated with ePKs of different classification. For example, the very large STE kinase LmjF15.1200, has an unusual juxtaposition of two domains related to cyclic nucleotide binding and a PAS domain, associated with signal sensing. In other species, the PAS domain is found on CAMK and TKL kinases, while the cNMP domain is restricted to AGC kinases. The L. major domain structure is likely conserved in T. cruzi, although the coding region is interrupted by a contig break. No orthologue is present in T. brucei. Some other accessory domains are found on similar groups of kinases (RWD, PX). Despite the paucity of identified domains, the trypanosomatid ePKs are generally considerably larger than the 250 aa kinase domain. For example, half of T. cruzi ePKs are larger than 64 kDa, and 38 are larger than 100 kDa.
Atypical protein kinases
The parasites possess a complement of atypical protein kinases, including representatives of all of the more well-characterized families: RIO, alpha, PIKK and PDK (Figure 9 and Additional file 4), although no functional analyses have been carried out to date on any representative aPK from trypanosomatids. The RIO family of atypical kinases is related to ePKs, but RIO proteins lack the sequences known to be involved in peptide binding in ePKs . Nonetheless, the catalytic residues are present. Trypanosomatids possess two RIO proteins, which are clearly assigned to the RIO1 and RIO2 subfamilies. In other organisms both RIO1 and RIO2 are required for ribosomal biogenesis, and RIO is involved in cell cycle progression . Interestingly, the similarity between human and trypanosomatid RIO2 extends into the N-terminus, where the structure of the human enzyme shows a winged helix-turn-helix motif . Such helix-turn-helix motifs are often found on DNA binding proteins such as transcription factors, a class of proteins which are rare in trypanosomatids.
The alpha kinases, so named because they phosphorylate their substrates within alpha helices, show a small amount of sequence similarity to ePKs, with conservation of the catalytic residues in subdomains 2, 6, and 7 . One set of the alpha kinases in trypanosomatids is comprised of small molecules, being little more than the 241 aa alpha domain. This type of alpha kinase is present in all three species. However, L. major possesses two additional alpha kinase genes, which are very large (>1000 amino acids). Interestingly, three of the four L. major alpha kinases are found on a 12 kb segment on chromosome 36. None of the alpha kinases appear to be fused to an ion channel, as is the case for certain vertebrate alpha kinases .
The PIKK kinases represent a particularly interesting family in which the protein kinase domain structurally resembles that of phosphatidylinositol 3-kinases . In addition to the kinase domain, these proteins also have FAT and FATC motifs which are not found in the lipid kinases. The similarity between the trypanosomatid PIKK kinases and those in the 4-kinome dataset is highly significant, with E-values of 10-90 or better. PIKK kinases are quite large in general and those in T. brucei are no exception, ranging in predicted size from 271 to 468 kDa. The parasites possess clear homologues to the specific PIKK kinases involved in genome surveillance: ATM and ATR . They also have four kinases that belong to the FRAP family (this family includes FRAP and mTOR). TOR (target of the immunosuppressive agent rapamycin) modulates translation and cell cycle in response to nutrient and growth signals . Multiple drugs targeting mTOR are in trials for the treatment of various cancers .
The trypanosomatids contain 3 genes encoding putative pyruvate dehydrogenase kinases (PDK). In mammals, the activity of mitochondrial pyruvate dehydrogenase is tightly regulated by multi-site serine phosphorylation of the E1α subunit . Despite their exclusive phosphorylation of serine residues, the PDKs lack the domains characteristic of ePKs. Rather, these kinases have two distinct domains. A C-terminal domain that shares structural conservation with the GHKL ATPase/kinase superfamily (including members of the histidine kinase family) and an N-terminal domain that resembles a histidine phosphoryl transfer domain of bacterial two component systems . The presence of an active pyruvate dehydrogenase in trypanosomatids with an E1α subunit suggests that regulation of activity by phosphorylation is likely to be conserved in these species.
The analysis presented here shows that trypanosomatids possess a large complement of protein kinases, indicating that protein phosphorylation is a key mechanism for regulation of parasite processes. In metazoa and yeast, the ultimate targets of many signaling cascades are transcription factors, which then trigger the expression of new sets of genes. In contrast, since trypanosomatids indiscriminately transcribe most genes in large polycistronic units, signaling cascades in these organisms must function in post-transcriptional regulation. Key regulators of specific mRNA turnover are still being sought, and we propose that protein kinases are major players in these processes. We also propose that trypanosomatids, more than many other organisms, rely on the phosphorylation of the downstream molecules that perform stage-specific and cell-cycle specific functions. Phosphorylation has been shown to modulate protein turnover, localization, interaction and activity for various molecules in eukaryotes. Both ePKs and aPKs are the targets of major drug discovery efforts in chronic human diseases [56, 84, 87, 88]. Exploiting the knowledge and resources generated in those efforts could provide new answers in the search for new drugs to combat trypanosomatid diseases. A major effort to understand the functions of individual protein kinases will allow increased focus on key molecules. We suggest that PKs closely related to human drug targets would be a useful first set to be explored. However, perhaps just as useful could be the group of unique kinases, which show little resemblance to human PKs.
All ePKs were retrieved from GeneDB  through a combination of searches with the protein kinase Pfam domain, BLAST analysis using diverse ePKs, and examination of COGS. L. major (version 5.0, Feb 2005); T. brucei (version 4.0, Feb 2005) and T. cruzi (version 3.0, July 2004) were the final datasets used in this study. In the case of T. cruzi, in which the genome strain is a hybrid, the two presumed alleles were identified through analysis of COGs , and counted as one gene, even though up to 7% sequence allelic sequence divergence occurs in this strain . All ePKs were examined for the presence of the 11-subdomain structure, and the presence of lysine in subdomain 2 and aspartic acid in subdomains 6 and 7, which are required for catalysis [24, 89]. Those lacking these residues were categorized as catalytically inactive. Similarly, a few ePKs lacked any sequence resembling subdomain 1, which functions in ATP binding, and were also categorized as catalytically inactive. Atypical PKs were identified by BLAST analysis using representatives of each group of aPKs from other species as queries.
Each ePK was analyzed for the presence of additional domains by hidden Markov model analysis of the Pfam database . The default cutoffs were used.
All predicted ePKs from each species were analyzed by BLAST analysis against the 4-kinome dataset comprised of all human, Saccharomyces cerevisiae, Drosophila melanogaster, and Caenorhabditis elegans protein kinase domains . Since phylogenetic inference indicated that members of trypanosomatid COGs were more closely related to each other than to ePKs of the 4-kinome dataset, all COG members were classified congruently (the sole exception we found was two kinases that shared extensive homology outside of the kinase domain, LmjF15.1200 and Tc00.1047053505977.13). Assignments required an E-value difference of 5 logs or more between groups or families of ePKs for one of the trypanosomatid orthologues. When all trypanosomatid orthologues had E-values poorer that 10-16, or had similar E-values for different groups of ePKs, those ePKs were designated "unique".
Due to the relatively low degree of sequence conservation at the nucleotide level within some of these families, phylogenetic inference was carried out on amino acid alignment, rather than attempting alignment at the nucleotide level. Kinase domains were identified by analysis of alignments with the Pfam protein kinase domain, and extended manually as needed. Insertions larger than 25 amino acids were identified and removed prior to subsequent analyses. The SAM (Sequence Alignment and Modeling System using Hidden Markov Model (HMM) software was used to build HMMs representing the kinase domains of the gene families discussed in this paper . These trained models were then used to identify residues capable of discriminating between the various domain families. In addition, by aligning the sequences of the kinase domains to these models, we created multiple sequence alignments of these gene families. These alignments were then visually inspected to verify that all subdomains were appropriately aligned, as well as to allow removal of both gene-specific insertions (in addition to those previously removed) and deletions and phylogenetically uninformative residues. These edited HMM-generated alignments were used as the starting point for phylogenetic reconstruction of these domain families.
Phylogenetic analysis of the kinase domains of these proteins was carried out using a variety of techniques. As a preliminary step in the phylogenetic investigation of our dataset, we used the Neighbour-joining approach as defined by Saitou and Nei as implemented in ClustalX . Due to the relatively high degree of divergence that might be expected within our dataset we used the correct for multiple substitutions option in our analysis. Bootstrapping was carried out on the dataset with 1,000 replicates.
These aligned amino acid sequences were also subjected to parsimony analysis using PAUP*, version 4.0b8 [93, 94]. Given the relatively large number of taxa in this dataset, the use of an exhaustive search was not possible. In its place, an heuristic search strategy was employed to attempt to find the best tree by reducing the set of trees examined and just calculating the score for likely trees. It should be noted that this method is not guaranteed to identify the most parsimonious trees from the sequences. We carried out 100 random stepwise addition sequences of taxa, each with TBR swapping and MAXTREES set to 10,000. Parsimony bootstrapping on the dataset was performed with 1,000 replicates and the same settings, except that only 10 random stepwise addition sequences were used per bootstrap replicate.
The same datasets were analysed using a Maximum Likelihood approach, as implemented in the web available PHYML application [95, 96]. Analysis using the WAG amino acid substitution model inferred the starting tree. The proportion of invariable sites and the gamma distribution parameter were estimated by maximizing the likelihood of the phylogeny. The number of substitution rate categories was set at four for these analyses. Non-parametric bootstrap analysis was then carried out on the original data set with 500 replicates (the upper limit available for this web service). Majority-rule consensus trees were created for each of the three methodologies outlined above.
Finally, MRBAYES was used to carry out a Bayesian analysis of our data . We used the WAG amino acid substitution model (based on nuclear genes and globular protein sequences, respectively), with a gamma rate distribution estimated from the data set to infer the phylogeny of our dataset. Starting from random trees, four parallel Markov chains were run to sample trees using the Markov Chain Monte Carlo (MCMC) principle. In general, 1,000,000 generations were run; after the burn-in phase, every 100th tree was saved.
Estimation of Ka/Ks
Analysis of the synonymous/non synonymous substitution rates required construction of pairwise, codon aligned, sequence alignments. These were generated using the PAL2NAL web server , which converted full-length amino acid alignments and the corresponding DNA sequences into a codon-based DNA alignment. The amino acid sequence alignments were obtained using the Water programme from the EMBOSS package using default parameters . Estimation of Ka/Ks (dN/dS) ratios was then carried out by maximum likelihood using the pairwise codon-based substitution model in Codeml, which is part of the Phylogenetic Analysis by Maximum Likelihood (PAML) suite of programs .
atypical protein kinase
clustered orthologous groups
eukaryotic protein kinase, PK, protein kinase, TriTryp, the trypanosomatids Leishmania major, Trypanosoma brucei, and Trypanosoma cruzi.
TDR Homepage. 2005, [http://www.who.int/tdr]
Lejon V, Buscher P: Review Article: Cerebrospinal fluid in human African trypanosomiasis: a key to diagnosis, therapeutic decision and post-treatment follow-up. Trop Med Int Health. 2005, 10: 395-403. 10.1111/j.1365-3156.2005.01403.x.
Higuchi ML, Benvenuti LA, Martins RM, Metzger M: Pathophysiology of the heart in Chagas' disease: current status and new developments. Cardiovasc Res. 2003, 60: 96-107. 10.1016/S0008-6363(03)00361-4.
Murray HW: Treatment of visceral leishmaniasis in 2004. Am J Trop Med Hyg. 2004, 71: 787-794.
El-Sayed NMA, Myler PJ, Bartholomeu D, Nilsson D, Aggarwal G, Tran AN, Ghedin E, Worthey EA, Delcher A, Blandin G, Westenberger S, Haas B, Caler E, Cerqueira G, Arner E, Aslund L, Bontempi E, Branche C, Bringaud F, Campbell D, Carrington M, Crabtree JS, Darban H, Edwards K, Englund P, Feldblyum T, Ferella M, Frasch C, Kindlund E, Klingbeil MM, Kluge S, Koo HL, Lacerda D, McCulloch R, McKenna A, Mizuno Y, Mottram J, Ochaya S, Pai G, Parsons M, Pettersson U, Pop M, Luis Ramirez J, Salzberg S, Tammi M, Tarleton RL, Teixeira SM, Van Aken S, Wortman J, Stuart KD, Andersson B, Anapuma A, Attipoe P, Burton P, Cadag E, Franco da Silva J, de Jong P, Fazelinia G, Gull K, Horn D, Hou L, Huang Y, Levin MJ, Lorenzi H, Louie T, Machado CR, Nelson S, Osoegawa K, Pentony M, Rinta J, Robertson L, Sanchez DO, Seyler A, Sharma R, Shetty J, Simpson AJ, Sisk E, Vogt C, Ward P, Wickstead B, White O, Fraser CM, Stuart KD, Andersson B: The genome sequence of Trypanosoma cruzi, etiological agent of Chagas' disease. Science. 2005, 309: 409-415. 10.1126/science.1112631.
Berriman M, Ghedin E, Hertz-Fowler C, Blandin G, Lennard NJ, Bartholomeu D, Renauld HJ, Caler E, Hamlin N, Haas B, Harris BR, Hannick L, Barrell B, Donelson J, Hall N, Fraser CM, Melville SE, El-Sayed N, Böhme UC, Shallom J, Aslett M, Hou L, Atkin B, Barron AJ, Bringaud F, Brooks K, Cherevach I, Chillingworth T, Churcher C, Clark LN, Corton CH, Cronin A, Davies R, Doggett J, Djikeng A, Feldblyum T, Fraser A, Goodhead I, Hance Z, Harper AD, Hauser H, Hostetler J, Jagels K, Johnson D, Johnson J, Jones C, Kerhornou A, Koo H, Larke N, Larkin C, Leech V, Line A, MacLeod A, Mooney P, Moule S, Mungall K, Norbertczak H, Ormond D, Pai G, Peterson J, Quail MA, Rajandream MA, Reitter C, Sanders M, Schobel S, Sharp S, Simmonds M, Simpson AJ, Tallon L, Turner CM, Tait A, Tivey A, Van Aken S, Walker D, Wanless D, White B, White O, Whitehead S, Wortman J, Barry JD, Fairlamb AH, Field MC, Gull K, Landfear S, Marcello L, Martin DM, Opperdoes F, Ullu E, Whickstead B, Alsmark C, Arrowsmith C, Carrington M, Embley TM, Ivens A, Lord A, Morgan GM, Peacock CS, Rabbinowitsch E, Salzberg S, Wang S, Woodward J, Adams MD, T. Martin Embley TM, Gull K, Ullu E, Barry JD, Fairlamb AH, Opperdoes F, Barrell BG, Donelson JE, Hall N, Fraser CM, Melville SE, El-Sayed NM: The genome of the African trypanosome, Trypanosoma brucei. Science. 2005, 309: 416-422. 10.1126/science.1112642.
Ivens AC, Peacock C, Worthey EA, Murphy L, Aggarwal G, Berriman M, Sisk E, Hertz-Fowler C, Quail MA, Harris D, Rajandream MA, Davies RM, Anupama A, Bason N, Attipoe P, Collins M, Cadag E, Cronin A, Fazelinia G, Fosker N, Fraser A, Huang Y, Knights A, Larke N, Litvin L, Lord A, Louie T, Munden H, Nelson S, Norbertczak H, Oliver K, O'Neill S, Pentony M, Price C, Rabbinowitsch E, Rinta J, Robertson L, Saunders D, Seeger K, Seyler A, Sharp S, Sivam D, Vogt C, Warren T, Woodward J, Fuchs M, Gabel C, Mueller-Auer S, Schaefer M, Rieger M, Bauser C, Bothe G, Pohl TM, Masuy D, Purnelle B, Goffeau A, Borzym K, Klages S, Beck A, Reinhardt R, Duesterhoeft A, Hilbert H, Wedler H, Bianchettin G, Ciarloni L, Tosato V, Bruschi C, Aert R, Robben J, Volckaert G, Wambutt R, Zimmerman W, Zhou S, Schwartz DC, Shin H, Schein J, Marra M, Ruiz JC, Cruz A, Dobson DE, Beverley SM, Clayton C, Coulson RM, Frasch AC, De Gaudenzi J, Horn D, Matthews K, Michaeli S, Smith DF, Blackwell JM, Stuart KD, Barrell B, Myler PJ, Apostolou Z, Goble A, Kube M, Rutter S, Squares R, Squares S, Mottram JC, Muller-Auer S, Munden H, Nelson S, Norbertczak H, Oliver K, O'Neil S, Pentony M, Pohl TM, Price C, Purnelle B, Quail MA, Rabbinowitsch E, Reinhardt R, Rieger M, Rinta J, Robben J, Robertson L, Ruiz JC, Rutter S, Saunders D, Schafer M, Schein J, Schwartz DC, Seeger K, Seyler A, Sharp S, Shin H, Sivam D, Squares R, Squares S, Tosato V, Vogt C, Volckaert G, Wambutt R, Warren T, Wedler H, Woodward J, Zhou S, Zimmermann W, Smith DF, Blackwell JM, Stuart KD, Barrell B, Myler PJ: The genome of the kinetoplastid parasite, Leishmania major. Science. 2005, 309: 436-42. 10.1126/science.1112680.
Bieger B, Essen LO: Structural analysis of adenylate cyclases from Trypanosoma brucei in their monomeric state. EMBO J. 2001, 20: 433-445. 10.1093/emboj/20.3.433.
Alexandre S, Paindavoine P, Hanocq-Quertier J, Paturiaux-Hanocq F, Tebabi P, Pays E: Families of adenylate cyclase genes in Trypanosoma brucei. Mol Biochem Parasitol. 1996, 77: 173-182. 10.1016/0166-6851(96)02591-1.
Clayton CE: Life without transcriptional control? From fly to man and back again. EMBO J. 2002, 21: 1881-1888. 10.1093/emboj/21.8.1881.
Martinez-Calvillo S, Nguyen D, Stuart K, Myler PJ: Transcription initiation and termination on Leishmania major chromosome 3. Eukaryot Cell. 2004, 3: 506-517. 10.1128/EC.3.2.506-517.2004.
Aboagye-Kwarteng T, Ole-Moiyoi OK, Lonsdale-Eccles JD: Phosphorylation differences among proteins of bloodstream developmental stages of Trypanosoma brucei brucei. Biochem J. 1991, 275: 7-14.
Dell KR, Engel JN: Stage-specific regulation of protein phosphorylation in Leishmania major. Mol Biochem Parasitol. 1994, 64: 283-292. 10.1016/0166-6851(94)00030-1.
Parsons M, Valentine M, Deans J, Schieven G, Ledbetter JA: Distinct patterns of tyrosine phosphorylation during the life cycle of Trypanosoma brucei. Mol Biochem Parasitol. 1991, 45: 241-248. 10.1016/0166-6851(91)90091-J.
Hammarton TC, Mottram JC, Doerig C: The cell cycle of parasitic protozoa: potential for chemotherapeutic exploitation. Prog Cell Cycle Res. 2003, 5:91-101.: 91-101.
McKean PG: Coordination of cell cycle and cytokinesis in Trypanosoma brucei. Curr Opin Microbiol. 2003, 6: 600-607. 10.1016/j.mib.2003.10.010.
Shiu SH, Li WH: Origins, lineage-specific expansions, and multiple losses of tyrosine kinases in eukaryotes. Mol Biol Evol. 2004, 21: 828-840. 10.1093/molbev/msh077.
Loftus B, Anderson I, Davies R, Alsmark UC, Samuelson J, Amedeo P, Roncaglia P, Berriman M, Hirt RP, Mann BJ, Nozaki T, Suh B, Pop M, Duchene M, Ackers J, Tannich E, Leippe M, Hofer M, Bruchhaus I, Willhoeft U, Bhattacharya A, Chillingworth T, Churcher C, Hance Z, Harris B, Harris D, Jagels K, Moule S, Mungall K, Ormond D, Squares R, Whitehead S, Quail MA, Rabbinowitsch E, Norbertczak H, Price C, Wang Z, Guillen N, Gilchrist C, Stroup SE, Bhattacharya S, Lohia A, Foster PG, Sicheritz-Ponten T, Weber C, Singh U, Mukherjee C, El Sayed NM, Petri WAJ, Clark CG, Embley TM, Barrell B, Fraser CM, Hall N: The genome of the protist parasite Entamoeba histolytica. Nature. 2005, 433: 865-868. 10.1038/nature03291.
Parsons M, Ledbetter JA, Schieven GL, Nel AE, Kanner SB: Developmental regulation of pp44/46, tyrosine-phosphorylated proteins associated with tyrosine/serine kinase activity in Trypanosoma brucei. Mol Biochem Parasitol. 1994, 63: 69-78. 10.1016/0166-6851(94)90009-4.
Salotra P, Ralhan R, Sreenivas G: Heat-stress induced modulation of protein phosphorylation in virulent promastigotes of Leishmania donovani. Int J Biochem Cell Biol. 2000, 32: 309-316. 10.1016/S1357-2725(99)00134-X.
Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S: The protein kinase complement of the human genome. Science. 2002, 298: 1912-1934. 10.1126/science.1075762.
Walker JC, Zhang R: Relationship of a putative receptor protein kinase from maize to the S-locus glycoproteins of Brassica. Nature. 1990, 345: 743-746. 10.1038/345743a0.
Shiu SH, Karlowski WM, Pan R, Tzeng YH, Mayer KF, Li WH: Comparative analysis of the receptor-like kinase family in Arabidopsis and rice. Plant Cell. 2004, 16: 1220-1234. 10.1105/tpc.020834.
Hanks SK, Hunter T: Protein kinases 6: The eukaryotic protein kinase superfamily: Kinase (catalytic) domain structure and classification. FASEB J. 1995, 9: 576-596.
Abraham RT: PI 3-kinase related kinases: 'big' players in stress-induced signaling pathways. DNA Repair (Amst). 2004, 3: 883-887. 10.1016/j.dnarep.2004.04.002.
Angermayr M, Bandlow W: RIO1, an extraordinary novel protein kinase. FEBS Lett. 2002, 524: 31-36. 10.1016/S0014-5793(02)02993-9.
Drennan D, Ryazanov AG: Alpha-kinases: analysis of the family and comparison with conventional protein kinases. Prog Biophys Mol Biol. 2004, 85: 1-32. 10.1016/S0079-6107(03)00060-9.
Parsons M, Ruben L: Pathways involved in environmental sensing in trypanosomatids. Parasitol Today. 2000, 16: 56-62. 10.1016/S0169-4758(99)01590-2.
GeneDB. 2005, [http://www.genedb.org/]
Ward P, Equinet L, Packer J, Doerig C: Protein kinases of the human malaria parasite Plasmodium falciparum: the kinome of a divergent eukaryote. BMC Genomics. 2004, 5: 79-10.1186/1471-2164-5-79.
Anamika, Srinivasan N, Krupa A: A genomic perspective of protein kinases in Plasmodium falciparum. Proteins. 2005, 58: 180-189. 10.1002/prot.20278.
Schneider AG, Mercereau-Puijalon O: A new Apicomplexa-specific protein kinase family : multiple members in Plasmodium falciparum, all with an export signature. BMC Genomics. 2005, 6: 30-10.1186/1471-2164-6-30.
Johnson LN, Noble MEM, Owen DJ: Active and inactive protein kinases: Structural basis for regulation. Cell. 1996, 85: 149-158. 10.1016/S0092-8674(00)81092-2.
Kinase.com. 2005, [http://188.8.131.52/]
Squire CJ, Dickson JM, Ivanovic I, Baker EN: Structure and inhibition of the human cell cycle checkpoint kinase, Wee1A kinase: an atypical tyrosine kinase with a key role in CDK1 regulation. Structure (Camb ). 2005, 13: 541-550. 10.1016/j.str.2004.12.017.
Hassan P, Fergusson D, Grant KM, Mottram JC: The CRK3 protein kinase is essential for cell cycle progression of Leishmania mexicana. Mol Biochem Parasitol. 2001, 113: 189-198. 10.1016/S0166-6851(01)00220-1.
Hammarton TC, Clark J, Douglas F, Boshart M, Mottram JC: Stage-specific differences in cell cycle control in Trypanosoma brucei revealed by RNA interference of a mitotic cyclin. J Biol Chem. 2003, 278: 22877-22886. 10.1074/jbc.M300813200.
Tu X, Wang CC: The involvement of two cdc2-related kinases (CRKs) in Trypanosoma brucei cell cycle regulation and the distinctive stage-specific phenotypes caused by CRK3 depletion. J Biol Chem. 2004, 279: 20519-20528. 10.1074/jbc.M312862200.
Mottram JC, Smith G: A family of trypanosome cdc2-related protein kinases. Gene. 1995, 162: 147-152. 10.1016/0378-1119(95)00350-F.
Grant KM, Hassan P, Anderson JS, Mottram JC: The crk3 gene of Leishmania mexicana encodes a stage-regulated cdc2-related histone H1 kinase that associates with p12cks1. J Biol Chem. 1998, 273: 10153-10159. 10.1074/jbc.273.17.10153.
Kemp BE, Stapleton D, Campbell DJ, Chen ZP, Murthy S, Walter M, Gupta A, Adams JJ, Katsis F, van Denderen B, Jennings IG, Iseli T, Michell BJ, Witters LA: AMP-activated protein kinase, super metabolic regulator. Biochem Soc Trans. 2003, 31: 162-168.
Cheng SH, Willmann MR, Chen HC, Sheen J: Calcium signaling through protein kinases. The Arabidopsis calcium-dependent protein kinase gene family. Plant Physiol. 2002, 129: 469-485. 10.1104/pp.005645.
Zhao Y, Pokutta S, Maurer P, Lindt M, Franklin RM, Kappes B: Calcium-binding properties of a calcium-dependent protein kinase from Plasmodium falciparum and the significance of individual calcium-binding sites for kinase activation. Biochem. 1994, 33: 3714-3721. 10.1021/bi00178a031.
Shalaby T, Liniger M, Seebeck T: The regulatory subunit of a cGMP-regulated protein kinase A of Trypanosoma brucei. Eur J Biochem. 2001, 268: 6197-6206. 10.1046/j.0014-2956.2001.02564.x.
Garcia-Salcedo JA, Nolan DP, Gijon P, Gomez-Rodriguez J, Pays E: A protein kinase specifically associated with proliferative forms of Trypanosoma brucei is functionally related to a yeast kinase involved in the co-ordination of cell shape and division. Mol Microbiol. 2002, 45: 307-319. 10.1046/j.1365-2958.2002.03019.x.
Hammarton TC, Lillico SG, Welburn SC, Mottram JC: Trypanosoma brucei MOB1 is required for accurate and efficient cytokinesis but not for exit from mitosis. Mol Microbiol. 2005, 56: 104-116. 10.1111/j.1365-2958.2005.04542.x.
Tu X, Wang CC: Pairwise knockdowns of cdc2-related kinases (CRKs) in Trypanosoma brucei identified the CRKs for G1/S and G2/M transitions and demonstrated distinctive cytokinetic regulations between two developmental stages of the organism. Eukaryot Cell. 2005, 4: 755-764. 10.1128/EC.4.4.755-764.2005.
Hammarton TC, Engstler M, Mottram JC: The Trypanosoma brucei cyclin, CYC2, is required for cell cycle progression through G1 phase and for maintenance of procyclic form cell morphology. J Biol Chem. 2004, 279: 24757-24764. 10.1074/jbc.M401276200.
Li Y, Li Z, Wang CC: Differentiation of Trypanosoma brucei may be stage non-specific and does not require progression of cell cycle. Mol Microbiol. 2003, 49: 251-265. 10.1046/j.1365-2958.2003.03575.x.
Li Z, Wang CC: A PHO80-like cyclin and a B-type cyclin control the cell cycle of the procyclic form of Trypanosoma brucei. J Biol Chem. 2003, 278: 20652-20658. 10.1074/jbc.M301635200.
Mottram JC, Kinnaird JH, Shiels B, Tait A, Barry JD: A novel CDC2-related protein kinase from Leishmania mexicana, lmmCRK1, is post-translationally regulated during the life-cycle. J Biol Chem. 1993, 268: 21044-21052.
Mottram JC, McCready BP, Brown KG, Grant KM: Gene disruptions indicate an essential function for the LmmCRK1 cdc2-related kinase of Leishmania mexicana. Mol Microbiol. 1996, 22: 573-582. 10.1046/j.1365-2958.1996.00136.x.
Naula C, Parsons M, Mottram JC: Protein kinases as drug targets in trypanosomes and Leishmania. Biochim Biophys Acta. 2005, in press:
Portal D, Lobo GS, Kadener S, Prasad J, Espinosa JM, Pereira CA, Tang Z, Lin RJ, Manley JL, Kornblihtt AR, Flawia MM, Torres HN: Trypanosoma cruzi TcSRPK, the first protozoan member of the SRPK family, is biochemically and functionally conserved with metazoan SR protein-specific kinases. Mol Biochem Parasitol. 2003, 127: 9-21. 10.1016/S0166-6851(02)00299-2.
Duncan PI, Stojdl DF, Marius RM, Scheit KH, Bell JC: The Clk2 and Clk3 dual-specificity protein kinases regulate the intranuclear distribution of SR proteins and influence pre-mRNA splicing. Exp Cell Res. 1998, 241: 300-308. 10.1006/excr.1998.4083.
Cohen P, Goedert M: GSK3 inhibitors: development and therapeutic potential. Nat Rev Drug Discov. 2004, 3: 479-487. 10.1038/nrd1415.
Miyata Y, Nishida E: Distantly related cousins of MAP kinase: biochemical properties and possible physiological functions. Biochem Biophys Res Commun. 1999, 266: 291-295. 10.1006/bbrc.1999.1705.
Wiese M, Wang Q, Gorcke I: Identification of mitogen-activated protein kinase homologues from Leishmania mexicana. Int J Parasitol. 2003, 33: 1577-1587. 10.1016/S0020-7519(03)00252-2.
Wiese M: A mitogen-activated protein (MAP) kinase homologue of Leishmania mexicana is essential for parasite survival in the infected host. EMBO J. 1998, 17: 2619-2628. 10.1093/emboj/17.9.2619.
Bengs F, Scholz A, Kuhn D, Wiese M: LmxMPK9, a mitogen-activated protein kinase homologue affects flagellar length in Leishmania mexicana. Mol Microbiol. 2005, 55: 1606-1615. 10.1111/j.1365-2958.2005.04498.x.
Hua S, Wang CC: Differential accumulation of a protein kinase homolog in Trypanosoma brucei. J Cell Biochem. 1994, 54: 20-31. 10.1002/jcb.240540104.
Hua SB, Wang CC: Interferon-gamma activation of a mitogen-activated protein kinase, KFR1, in the bloodstream form of Trypanosoma brucei. J Biol Chem. 1997, 272: 10797-10803. 10.1074/jbc.272.16.10797.
Muller IB, Domenicali-Pfister D, Roditi I, Vassella E: Stage-specific requirement of a mitogen-activated protein kinase by Trypanosoma brucei. Mol Biol Cell. 2002, 13: 3787-3799. 10.1091/mbc.E02-02-0093.
Ellis J, Sarkar M, Hendriks E, Matthews K: A novel ERK-like, CRK-like protein kinase that modulates growth in Trypanosoma brucei via an autoregulatory C-terminal extension. Mol Microbiol. 2004, 53: 1487-1499. 10.1111/j.1365-2958.2004.04218.x.
Kuhn D, Wiese M: LmxPK4, a mitogen-activated protein kinase kinase homologue of Leishmania mexicana with a potential role in parasite differentiation. Mol Microbiol. 2005, 56: 1169-1182. 10.1111/j.1365-2958.2005.04614.x.
Wiese M, Kuhn D, Grunfelder CG: Protein kinase involved in flagellar-length control. Eukaryot Cell. 2003, 2: 769-777. 10.1128/EC.2.4.769-777.2003.
Agron PG, Reed SL, Engel JN: An essential, putative MEK kinase of Leishmania major. Mol Biochem Parasitol. 2005, 142: 121-125. 10.1016/j.molbiopara.2005.03.007.
O'Connell MJ, Krien MJ, Hunter T: Never say never. The NIMA-related protein kinases in mitotic control. Trends Cell Biol. 2003, 13: 221-228. 10.1016/S0962-8924(03)00056-4.
Mahjoub MR, Qasim RM, Quarmby LM: A NIMA-related kinase, Fa2p, localizes to a novel site in the proximal cilia of Chlamydomonas and mouse kidney cells. Mol Biol Cell. 2004, 15: 5172-5186. 10.1091/mbc.E04-07-0571.
Liu S, Lu W, Obara T, Kuida S, Lehoczky J, Dewar K, Drummond IA, Beier DR: A defect in a novel Nek-family kinase causes cystic kidney disease in the mouse and in zebrafish. Development. 2002, 129: 5839-5846. 10.1242/dev.00173.
Belham C, Roig J, Caldwell JA, Aoyama Y, Kemp BE, Comb M, Avruch J: A mitotic cascade of NIMA family kinases. Nercc1/Nek9 activates the Nek6 and Nek7 kinases. J Biol Chem. 2003, 278: 34897-34909. 10.1074/jbc.M303663200.
Gale MJJ, Carter V, Parsons M: Translational control mediates the developmental regulation of the Trypanosoma brucei Nrk protein kinase. J Biol Chem. 1994, 269: 31659-31665.
Park JH, Brekken DL, Randall AC, Parsons M: Molecular cloning of Trypanosoma brucei CK2 catalytic subunits: the alpha isoform is nucleolar and phosphorylates the nucleolar protein Nopp44/46. Mol Biochem Parasitol. 2002, 119: 97-106. 10.1016/S0166-6851(01)00407-8.
Spadafora C, Repetto Y, Torres C, Pino L, Robello C, Morello A, Gamarro F, Castanys S: Two casein kinase 1 isoforms are differentially expressed in Trypanosoma cruzi. Mol Biochem Parasitol. 2002, 124: 23-36. 10.1016/S0166-6851(02)00156-1.
Matsuura A, Tsukada M, Wada Y, Ohsumi Y: Apg1p, a novel protein kinase required for the autophagic process in Saccharomyces cerevisiae. Gene. 1997, 192: 245-250. 10.1016/S0378-1119(97)00084-X.
Therond P, Alves G, Limbourg-Bouchon B, Tricoire H, Guillemet E, Brissard-Zahraoui J, Lamour-Isnard C, Busson D: Functional domains of fused, a serine-threonine kinase required for signaling in Drosophila. Genetics. 1996, 142: 1181-1198.
Sacerdoti-Sierra N, Jaffe CL: Release of ecto-protein kinases by the protozoan parasite Leishmania major. J Biol Chem. 1997, 272: 30760-30765. 10.1074/jbc.272.49.30760.
Vieira LL, Sacerdoti-Sierra N, Jaffe CL: Effect of pH and temperature on protein kinase release by Leishmania donovani. Int J Parasitol. 2002, 32: 1085-1093. 10.1016/S0020-7519(02)00067-X.
El-Sayed NMA, Myler PJ, Blandin G, Berriman M, Crabtree J, Aggarwal G, Caler E, Renauld HJ, Worthey EA, Hertz-Fowler C, Ghedin E, Peacock C, Bartholomeu D, Haas B, Tran AN, Wortman J, Alsmark UCM, Angiuoli S, Anupama A, Badger J, Bringaud F, Cadag E, Carlton J, Cerqueira G, Creasy T, Delcher AL, Djikeng A, Embley TM, Hauser C, Ivens AC, Kummerfield SK, Pereira-Leal JB, Nilsson D, Peterson J, Salzberg S, Shallom J, Silva JC, Sundaram J, Westenberger S, White O, Melville S, Donelson JE, Andersson B, Stuart KD, Hall N: Comparative genomics of the trypanosomatids. Science. 2005, 309: 404-409. 10.1126/science.1112181.
Pils B, Schultz J: Inactive enzyme-homologues find new function in regulatory processes. J Mol Biol. 2004, 340: 399-404. 10.1016/j.jmb.2004.04.063.
KinG Kinases in Genome. 2005, [http://hodgkin.mbu.iisc.ernet.in/~king/index.html]
LaRonde-LeBlanc N, Wlodawer A: Crystal structure of A. fulgidus Rio2 defines a new family of serine protein kinases. Structure (Camb ). 2004, 12: 1585-1594. 10.1016/j.str.2004.06.016.
Garber PM, Vidanes GM, Toczyski DP: Damage in transition. Trends Biochem Sci. 2005, 30: 63-66. 10.1016/j.tibs.2004.12.004.
Bjornsti MA, Houghton PJ: The TOR pathway: a target for cancer therapy. Nat Rev Cancer. 2004, 4: 335-348. 10.1038/nrc1362.
Reed LJ, Damuni Z, Merryfield ML: Regulation of mammalian pyruvate and branched-chain alpha-keto acid dehydrogenase complexes by phosphorylation-dephosphorylation. Curr Top Cell Regul. 1985, 27: 41-49.
Kato M, Chuang JL, Tso SC, Wynn RM, Chuang DT: Crystal structure of pyruvate dehydrogenase kinase 3 bound to lipoyl domain 2 of human pyruvate dehydrogenase complex. EMBO J. 2005, 24: 1763-1774. 10.1038/sj.emboj.7600663.
Force T, Kuida K, Namchuk M, Parang K, Kyriakis JM: Inhibitors of protein kinase signaling pathways: emerging therapies for cardiovascular disease. Circulation. 2004, 109: 1196-1205. 10.1161/01.CIR.0000118538.21306.A9.
Kumar S, Boehm J, Lee JC: p38 MAP kinases: key signalling molecules as therapeutic targets for inflammatory diseases. Nat Rev Drug Discov. 2003, 2: 717-726. 10.1038/nrd1177.
Bossemeyer D: Protein kinases--Structure and function. FEBS Lett. 1995, 369: 57-61. 10.1016/0014-5793(95)00580-3.
Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer EL, Studholme DJ, Yeats C, Eddy SR: The Pfam protein families database. Nucleic Acids Res. 2004, 32: D138-D141. 10.1093/nar/gkh121.
Barrett C, Hughey R, Karplus K: Scoring hidden Markov models. Comput Appl Biosci. 1997, 13: 191-199.
Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4: 406-425.
PAUP*. 2005, [http://paup.csit.fsu.edu/index.html]
Swofford DL, Waddell PJ, Huelsenbeck JP, Foster PG, Lewis PO, Rogers JS: Bias in phylogenetic estimation and its relevance to the choice between parsimony and likelihood methods. Syst Biol. 2001, 50: 525-539. 10.1080/106351501750435086.
PHYML - A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. 2005, [http://atgc.lirmm.fr/phyml/]
Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52: 696-704. 10.1080/10635150390235520.
Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics. 2001, 17: 754-755. 10.1093/bioinformatics/17.8.754.
PAL2NAL. 2005, [http://www.bork.embl-heidelberg.de/pal2nal]
Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000, 16: 276-277. 10.1016/S0168-9525(00)02024-2.
Yang Z: PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci. 1997, 13: 555-556.
The authors thank Daniel Nilsson for assistance in COG analyses, particularly with respect to the T. cruzi genome. We also thank Peter Myler and Christiane Hertz-Fowler for advice on parasite genomics, the TriTryp Genome Consortium for their considerable effort that made this work possible and Tansy Hammarton for comments on the manuscript. We appreciate helpful comments from Gerard Manning regarding protein kinase classification. This work was supported in part by NIH R01 AI31077 (MP), the M.J. Murdock Charitable Trust (MP), the Medical Research Council (JCM) and Wellcome Trust (JCM).
MP carried out the analysis of T. brucei and T. cruzi protein kinases and drafted the manuscript. EAW performed the phylogenetic inference analyses. PNW carried out the analysis of L. major protein kinases. JCM contributed to interpretation of the data and the writing of the manuscript. MP and JCM conceived and coordinated the study. All authors read and approved the final manuscript.
Electronic supplementary material
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.