- Research article
- Open Access
The translation initiation complex eIF3 in trypanosomatids and other pathogenic excavates – identification of conserved and divergent features based on orthologue analysis
BMC Genomics volume 15, Article number: 1175 (2014)
The initiation of translation in eukaryotes is supported by the action of several eukaryotic Initiation Factors (eIFs). The largest of these is eIF3, comprising of up to thirteen polypeptides (eIF3a through eIF3m), involved in multiple stages of the initiation process. eIF3 has been better characterized from model organisms, but is poorly known from more diverged groups, including unicellular lineages represented by known human pathogens. These include the trypanosomatids (Trypanosoma and Leishmania) and other protists belonging to the taxonomic supergroup Excavata (Trichomonas and Giardia sp.).
An in depth bioinformatic search was carried out to recover the full content of eIF3 subunits from the available genomes of L. major, T. brucei, T. vaginalis and G. duodenalis. The protein sequences recovered were then submitted to homology analysis and alignments comparing them with orthologues from representative eukaryotes. Eleven putative eIF3 subunits were found from both trypanosomatids whilst only five and four subunits were identified from T. vaginalis and G. duodenalis, respectively. Only three subunits were found in all eukaryotes investigated, eIF3b, eIF3c and eIF3i. The single subunit found to have a related Archaean homologue was eIF3i, the most conserved of the eIF3 subunits. The sequence alignments revealed several strongly conserved residues/region within various eIF3 subunits of possible functional relevance. Subsequent biochemical characterization of the Leishmania eIF3 complex validated the bioinformatic search and yielded a twelfth eIF3 subunit in trypanosomatids, eIF3f (the single unidentified subunit in trypanosomatids was then eIF3m). The biochemical data indicates a lack of association of the eIF3j subunit to the complex whilst highlighting the strong interaction between eIF3 and eIF1.
The presence of most eIF3 subunits in trypanosomatids is consistent with an early evolution of a fully functional complex. Simplified versions in other excavates might indicate a primordial complex or secondary loss of selected subunits, as seen for some fungal lineages. The conservation in eIF3i sequence might indicate critical functions within eIF3 which have been overlooked. The identification of eIF3 subunits from distantly related eukaryotes provides then a basis for the study of conserved/divergent aspects of eIF3 function, leading to a better understanding of eukaryotic translation initiation.
The initiation stage of protein synthesis in eukaryotes is a complicated process, which involves a great number of different macromolecules and requires the action of multiple eukaryotic Initiation Factors, eIFs. Many eIFs from model organisms such as Drosophila, budding yeast, mouse and Arabidopsis, have been well characterized and their role in translation initiation described. Some of these are single polypeptide factors whilst others are complexes of multiple subunits which can be very large and perform various roles in translation (more recently reviewed in [1–3]).
eIF3 is the largest of the initiation factors, both in size and in number of subunits, being active during multiple steps of the translation initiation process. In mammals, it is composed of 13 subunits (eIF3a through eIF3m) whilst in the budding yeast S. cerevisae, a reduced eIF3 is present composed of five essential subunits (eIF3a, eIF3b, eIF3c, eIF3g and eIF3i) and the non-essential eIF3j (reviewed in [2, 4–6]). The essential S. cerevisae subunits defined the core eIF3 complex present in all eukaryotes investigated so far. Nevertheless, in vitro reconstitution of mammalian eIF3 have implicated three conserved subunits (eIF3a, eIF3b, eIF3c) and three non-conserved subunits (eIF3e, eIF3f and eIF3h) as being required for eIF3 function, and constituting a functional mammalian eIF3 core, whilst indicating that the two conserved subunits eIF3g and eIF3i might be dispensable . Indeed eIF3a, eIF3b and eIF3c have been proposed to occupy a central position in the overall mammalian eIF3 structure. In the absence of the loosely associated eIF3j, the mammalian complex seems to be organized in three stable modules: the first (module i), formed by the eIF3a, eIF3b, eIF3g and eIF3i subunits, resembling the budding yeast eIF3 core; a second module (ii), formed by subunits eIF3c, eIF3d, eIF3e, eIF3k and eIF3l; and a third smaller module (iii), formed by subunits eIF3f, eIF3h and eIF3m [8, 9]. Cryo-electron microscopic reconstruction of eIF3 has revealed that its subunits are organized in an anthropomorphic shape with five appendages and which shows surface complementarity to the platform of the 40S ribosomal subunit .
To accomplish its function, many of the eIF3 subunits participate in direct protein-protein interactions with other eIFs as well as ribosomal proteins, and also bind directly to RNA (reviewed in [2, 5]). Six of the mammalian eIF3 subunits (eIF3a, eIF3c, eIF3e, eIF3k, eIF3l and eIF3m) contain a PCI domain, a hallmark of related protein complexes such as the lid of the 19S regulatory particle of the 26S proteasome and the COP9 signalosome, which is involved in protein-protein interactions ([11, 12]; reviewed in ). Recent evidence has also implicated the PCI domain in the binding of two distinct yeast eIF3 subunits (eIF3a and eIF3c) to RNA [14, 15] although in humans the same subunits were found to bind to RNA (HCV IRES) through independent RNA-binding HLH motifs . Two other eIF3 subunits (eIF3f and eIF3h), containing the MPN domain, are members of a second family of proteins which also include subunits of the 26S proteasome and the signalosome, reflecting a common evolutionary origin for all three complexes ([11, 12, 17]; also reviewed in ). It has been proposed that the six PCI and two MPN containing subunits constitute a structural core which binds to other eIF3 subunits, translation initiation factors and the 40S ribosomal subunit [18, 19]. Indeed a functional reconstitution of the human eIF3 has shown that these subunits can form a stable octameric complex , pointing out to a spontaneous assembly pathway for the eIF3 complex which, nevertheless, is compatible with the three modules concept described above.
Most of what is known in regard to eIF3 and its role in translation initiation within eukaryotes is derived from very few organisms, despite the fact that a remarkable diversity within the translation apparatus has been noticed across the different eukaryotic groups . Not many studies, however, have been carried out focusing on excavates, unicellular protists which diverged very early from better studied unicellular and multicellular eukaryotes. These include many pathogenic species of medical and veterinary importance, such as those belonging to the family Trypanosomatidae (genus Leishmania and Trypanosoma) and even more divergent eukaryotes. The trypanosomatids have been seen to display unique biological characteristics rarely seen in eukaryotes. These include a scarcity of promoters for the protein coding genes, transcription of mRNAs in long polycystronic units, trans splicing of these polycystronic messages into monocystronic mature mRNAs and a lack of transcriptional control of their gene expression (recently reviewed in [21–24]). Regarding translation initiation, the study in trypanosomatids of eIF4E and eIF4G, two other initiation factors known to partner with eIF3 in the recruitment of mRNA by the ribosome, has highlighted novel and conserved events when compared to other eukaryotes (reviewed in [25, 26]). However, little is known regarding the eIF3 subunits of trypanosomatids and to what degree they are conserved in sequence or function when compared to better known eukaryotes.
The availability of complete or nearly complete genomic sequences for several selected trypanosomatids [27–29], as well as for other excavates of medical importance, such as Giardia sp.  and Trychomonas sp. , has generated new opportunities to investigate basic biological processes in pathogenic organisms which are distantly placed within the eukaryotic lineage. Studying translation initiation in these organisms may not only help define divergent features in specific stages, useful in the identification of potential therapeutical targets, but may also enhance the understanding regarding the evolution and conservation of the whole process throughout the eukaryotic lineages. Here, various bioinformatic tools were applied in order to perform a systematic search of the genomes of selected organisms for homologues of the various eIF3 subunits. The main targets were the two most complete trypanosomatid genomes available, those from Leishmania major  and Trypanosoma brucei , but studies were also carried out using the Giardia duodenalis (synonymous to G. lamblia and G. intestinalis –  and Trichomonas vaginalis sequences, as well as those from model eukaryotic and prokaryotic organisms. Sequences identified were validated through different approaches, including the biochemical purification of the Leishmania eIF3 complex followed by subsequent proteomic analysis, and used to investigate conserved and divergent features. Our results indicate a substantial degree of conservation of the eIF3 subunits in most eukaryotes whilst highlighting the occurrence of multiple instances of complex simplification. It also suggests a likely central role for the eIF3b, eIF3c and eIF3i subunits in eIF3 function.
Bioinformatic identification of eIF3 subunits within selected genome sequences
A detailed de novo search was carried out for sequences corresponding to eIF3 subunits present within the proteomes of the four unicellular pathogens chosen for this study (T. brucei, L. major, T. vaginalis and G. duodenalis), as well as representative organisms used for comparison (described in the Methods section). Identification of these sequences was validated using as reference the annotated human eIF3 subunits. This validation generated the results shown in Table 1, providing an overview on the conservation of individual subunits throughout representative eukaryotic lineages. The human eIF3 subunits were also used for pair-wise comparisons aiming to evaluate the degree of identity/similarity between the sequences identified from the four target organisms and their mammalian counterparts, summarized in Table 2.
Orthologues to all thirteen subunits were found in lineages representing the three major groups of multicellular eukaryotes, metazoans (Homo sapiens), plants (Arabidopsis thaliana) and filamentous fungi (Aspergillus niger), confirming a substantial degree of conservation for the eIF3 complex. Using the bioinformatics approach, eleven orthologues for eIF3 subunits were found in both L. major and T. brucei, missing one PCI (eIF3m) and one MPN subunit (eIF3f) when compared to higher eukaryotes. A likely eIF3f orthologue was found experimentally in Leishmania, as discussed later in the text, and both L. major and T. brucei orthologues were included in Tables 1 and 2. The presence of a nearly complete eIF3 complex in trypanosomatids suggests that it would be structurally similar to the human eIF3 and contrasts with the substantial reduction in the number of eIF3 subunits found in T. vaginalis and G. duodenalis. Only five eIF3 subunits were found in T. vaginalis including eIF3b, eIF3d and eIF3i plus one subunit each with a PCI (eIF3c) and MPN (eIF3h) domain, comprising subunits from all three described mammalian eIF3 modules . A similar result was also generated from the analysis with the G. duodenalis sequences, where only four subunits were identified. A minimal eIF3 then seems to be present in this organism, consisting of subunits eIF3b and eIF3i of the first mammalian eIF3 module plus a single PCI subunit (eIF3c) and, somewhat unexpectedly, an eIF3j orthologue. Another relevant observation is the identification of a putative eIF3i homologue from M. jannaschii, with this being the single eIF3 subunit which is found to have an Archaean homologue. Specific results generated in this study from the bioinformatic analysis of each eIF3 subunit are presented separately below.
The largest eIF3 subunit, eIF3a (also known as TIF32/RPG1), is characterized by the presence of a centrally localized PCI domain followed by a Spectrin repeat domain, typical of the Spectrin family of Actin-binding proteins (reviewed in ). eIF3a orthologues were identified from both L. major or T. brucei (named EIF3A), although only after searches carried out using the HMMs based approach, with both trypanosomatid polypeptides found being annotated as hypothetical proteins within the TriTrypDB database (Table 1). No equivalent sequences were found from either G. duodenalis or T. vaginalis, despite extensive searches using different bioinformatics approaches. When compared with the human eIF3a, the putative trypanosomatid orthologues display a low level of overall similarity (Table 2), but when they were included in a multiple alignment with other eIF3a orthologues a number of conserved amino acids, spaced throughout their length, are observed, as well as the characteristic PCI and Spectrin repeat domains (Additional file 1: Figure S1). Noteworthy is the presence of an extended C-terminus in the human protein, which is missing from the trypanosomatid orthologues and is much reduced in eIF3a sequences from other eukaryotes. Overall similarity between the different sequences is greater within their N-terminuses, implying a greater conservation in binding properties to the 40S subunit, eIF3c and eIFs involved in start codon recognition, all of which are associated with the N-terminal half of eIF3a [34, 35]. Conservation within the C-terminal halves, which include the PCI and Spectrin domains, is less pronounced (Figure 1A and B), although one of the longest conserved motifs in the alignment is found within the Spectrin domain, with seven out of ten amino acids strictly conserved between the trypanosomatid and human sequences. This Spectrin domain has recently been shown to mediate binding of human eIF3a to eIF3b and eIF3i  but, apart from the single phenylalanine residue (F760 in human eIF3a), shown to participate in the interaction with eIF3b and also found in the two trypanosomatid sequences, the overall similarity within the segment already implicated in binding to eIF3b and eIF3i is very low.
Also known as PRT1, eIF3b (reviewed in ) is a large scaffolding protein within the eIF3 complex, characterized by the presence of a non-canonical RRM domain . The C-terminus of eIF3b consists of a very large segment defined as a WD repeat domain, which serves as a platform where proteins interact and participates in the formation of multiprotein complexes . This WD repeat domain assumes a structure which has been recently solved and consists of a nine bladed β-propeller which interacts with the 40S ribosomal subunit . As previously described , and in contrast to eIF3a, eIF3b orthologues are clearly identifiable in both L. major and T. brucei. The HMMs search also returned putative eIF3b orthologues for T. vaginalis and G. duodenalis, with the two proteins containing both RRM and WD repeat domains, but sharing little overall sequence identity with other homologues (Tables 1 and 2). When these proteins were analyzed by a multiple alignment, the conservation in sequence between the protozoa eIF3b orthologues and those from multicellular eukaryotes (fungi, mammals and plants) is more pronounced at the C-terminal WD repeat domain and within the protein’s center, whilst the conservation within the RRM domain is more restricted to its N-terminal end (Figure 2A and B and Additional file 1: Figure S2). The RRM of eIF3b has been involved in several protein-protein interactions required for eIF3b function, such as binding to eIF3a, eIF3j and eIF3e [37, 41, 42]. In contrast, its extreme C-terminus, including the end of the WD repeat domain, has been shown in S. cerevisiae to be involved in binding to eIF3g and eIF3i, with the interaction between eIF3b and eIF3i characterized in more detail [43, 44]. In the various protozoan homologues investigated here, and with the exception of a single aromatic residue implicated in this interaction (W713 in the yeast protein in the Figure) and found to be conserved in all eIF3b orthologues aligned, the C-terminus of eIF3b implicated in the interaction with eIF3i is poorly conserved.
The third eIF3 subunit, eIF3c (NIP1), interacts with eIF3a and the two proteins form a dimer which is central to the eIF3 complex, since together they interact with each of the remaining PCI and MPN domain-containing subunits in human eIF3, as well as with the conserved eIF3b/eIF3g/eIF3i subcomplex . eIF3c orthologues were clearly found in both trypanosomatids investigated, with two identical genes observed encoding the T. brucei protein. In contrast, putative orthologues for this eIF3 subunit from T. vaginalis and G. duodenalis were only identified through the HMM search and both are annotated as hypothetical proteins (Tables 1 and 2). When compared with the other two large eIF3 subunits, eIF3c is more conserved than eIF3a but not as much as eIF3b, although the degree of conservation is still low. Through the multiple alignment analysis, overall conservation between the various eIF3c homologues can be seen to be more concentrated at their last third, encompassing the PCI domain (Additional file 1: Figure S3), with little conservation within the segment which in yeast eIF3c has been implicated in the interaction with eIF3a (residues 157–370 of the yeast protein – ). A noteworthy feature for both trypanosomatid orthologues is the very glycine rich region at their C-terminuses, present also in plant eIF3c (and also in the human protein but with a reduced number of glycines), but missing from the fungi proteins as well as from the orthologues from T. vaginalis and G. duodenalis. Also of interest is the conservation of the N-terminal ends of most eIF3c orthologues, with two consecutive phenyalanines strictly conserved in all eIF3c sequences, with the exception of the G. duodenalis homologue. In yeast, the eIF3c N-terminus has been implicated in its binding to eIF5 [45, 46], so the conservation observed may highlight residues involved in this interaction. Likewise, conserved residues found within the segment implicated in binding to eIF1 in all eIF3c homologues studied, again with the exception of the G. duodenalis homologue, may pinpoint residues involved in this interaction (Figure 3A and B).
The non-core subunit eIF3d (also called Moe1 or p66), from mammals, was studied very early on and seen to be able to cross-link to 18S RNA , with the protein’s segment responsible for the binding to RNA been mapped to within its first 120 residues . Despite its absence in S. cerevisae, clear eIF3d orthologues are present in trypanosomatids with overall conservation similar to that observed for the previous subunits (Table 2). In addition a candidate for T. vaginalis was found during the HMMs search analysis, although no G. duodenalis eIF3d orthologue was identified. An alignment of the various eIF3d sequences reveals the existence of several sets of conserved amino acids distributed throughout their length which are good candidates for studies investigating eIF3d function (Additional file 1: Figure S4). These include various residues at the proteins’ N-terminal ends which, with the exception of the T. vaginalis orthologue, are strictly conserved and might be involved in the interaction with RNA, as well as the acidic C-terminus, absent from the S. pombe orthologue. eIF3d orthologues have been previously reported from trypanosomatids , but those in fact are possibly eIF3g orthologues (see below), containing a RRM domain, and do not align with true eIF3d proteins.
eIF3e (p48, Int-6) is another non-core eIF3 subunit, characterized by a carboxi-terminal PCI domain (reviewed in [4, 48]), and which may have a role in regulating eIF3 function (; reviewed in ). It was first identified as the protein product of a gene which is the site of frequent integration of the mouse mammary tumor virus (int-6) and only later identified as an eIF3 subunit [51, 52]. eIF3e orthologues were also found in L. major and T. brucei which are in general more conserved than the previous eIF3 subunits (Table 2), but no orthologues could be found in either T. vaginalis or G. duodenalis. The comparison between the trypanosomatid sequences with other eukaryotic eIF3e orthologues reveals various conserved elements, within the PCI domain but also in the protein’s N-terminal half (Additional file 1: Figure S5). All orthologues share a very conserved segment which coincides with the N-terminus of the human protein (Figure 4) and where a previously reported Nuclear Export Signal (NES) has been mapped .
The two eIF3 subunits containing the MPN domain, eIF3f and eIF3h, associate as a dimmer and both seem to interact with eIF3m, forming the third stable eIF3 module . These two eIF3 subunits are each more closely related to the two MPN containing proteasome and COP9 signalosome subunits than to each other. eIF3f is more related to the proteasome subunit Rpn8 (also known as Mov34, p40 or subunit 7) and signalosome subunit Csn6, whilst eIF3h is more related to proteasomal subunit Rpn11 (also known as pad1 or p47) and signalosome subunit Csn5 (reviewed in ). In S. cerevisae these two eIF3 subunits are absent although the equivalent proteasomal subunits can be identified as well as a Csn5 homologue; in S. pombe, both eIF3 and proteasomal subunits are present as well as the signalosome Csn5; and in plants and animals both the eIF3 subunits and their two proteasomal and signalosome counterparts are found (also reviewed in [4, 5, 54–56]). BLAST searches of the L. major and T. brucei genome databases with the sequences of either eIF3f or eIF3h yielded proteins which were annotated as proteasomal subunits, with eIF3f finding as best hit a putative Rpn8 orthologue whilst eIF3h found a candidate Rpn11 orthologue. Nevertheless, through the HMMs search approach applied here, candidate eIF3h orthologues were found from both L. major and T. brucei (encoded by two neighboring genes), annotated as hypothetical proteins. As described below, the biochemical characterization of the Leishmania eIF3 yielded yet another conserved hypothetical protein which, upon blast searches against the non-redundant protein sequence databases from GenBank, displayed similarities against eIF3f orthologues. Despite not having an identifiable MPN domain it was considered a likely eIF3f orthologue and included in the analysis below. HMMs searches carried out for T. vaginalis and G. duodenalis yielded only two MPN containing homologues from G. duodenalis, annotated as potential proteasome subunits, and three from T. vaginalis, one of which was identified as a putative eIF3h orthologue.
Considering the high degree of similarity in sequence between the various MPN proteins, and to better evaluate their true relationships, a phylogenetic tree was built based on the alignment of multiple MPN containing proteins from the organisms chosen for this study. Sequences from both eIF3 subunits were included and compared with their putative trypanosomatid (eIF3f and eIF3h) and T. vaginalis (eIF3h only) orthologues, as well as with known orthologues from their counterparts found in the 26S proteasome and COP9 signalosome complexes and less defined MPN-containing proteins from the four protozoan genomes studied here. As shown in Figure 5, most of the eIF3f orthologues, including the putative trypanosomatid proteins, loosely group together as part of a large group which also includes the Rpn8/Rpn7 and Csn6 orthologues from various organisms. The latter proteins seem to be more conserved and form more robust subgroups which include likely Rpn8/Rpn7 orthologues from both trypanosomatids and T. vaginalis plus a less conserved MPN-containing homologue from G. duodenalis (Gdu-RPN7 in the figure). The various eIF3f sequences seem to be more divergent and their grouping is accompanied by low bootstraps, but an alignment carried out comparing known and putative eIF3f orthologues confirm the presence of conserved elements shared by all proteins and supporting their identification (Additional file 1: Figure S6A). For the eIF3h orthologues, they also group together, as part of a larger group which includes the various known Rpn11 and Csn5 orthologues as well as MPN containing Rpn11 orthologues from all four protozoans. Overall conservation for these proteins is greater than that observed for the eIF3f orthologues and related proteins and the grouping is validated by more robust bootstraps which includes the T. vaginalis eIF3h. The trypanosomatid eIF3h orthologues are more divergent and their positioning with the Rpn11/Csn5/eIF3h group has a low bootstrap, but when the various eIF3h sequences were aligned conserved elements were found throughout their lengths which substantiate their identification (Additional file 1: Figure S6B). Based on the evidence presented, Tables 1 and 2 include both trypanosomatid eIF3f and eIF3h orthologues and the T. vaginalis eIF3h in the comparisons carried out with the other parasite eIF3 subunits. Additional file 1: Figure S7 also provides a scheme for both T. brucei orthologues, highlighting the position of the MPN domain.
The fourth subunit of the S. cerevisiae eIF3 core complex, eIF3g (TIF35 or p44 in humans), is an essential protein in both budding and fission yeasts species which has been implicated as a participant in translation reinitiation (reviewed in [4, 5]) and seen to be required for the scanning phase of the process . Putative eIF3g orthologues were found in T. brucei and in L. major using the HMM search (Tables 1 and 2 - a schematic representation of the T. brucei orthologue is shown in Additional file 1: Figure S7), but none were identified from either T. vaginalis or G. duodenalis, implying that functional eIF3-like complexes may occur in the absence of this otherwise essential subunit. The two trypanosomatid proteins have been previously reported as possible eIF3d homologues  but this was likely a nomenclature error. The alignment with other eukaryotic eIF3g homologues (Additional file 1: Figure S8) shows that the conservation within the trypanosomatid sequences is mainly restricted to their C-terminal RRM domain, the major eIF3g feature, shown to be required for its RNA binding activity and for the protein’s role in mRNA scanning in yeast [57–59]. Nevertheless, minor segments of similarity are also observed within their N-terminal half, previously implicated in binding to eIF3i . Noteworthy is the absence in the trypanosomatid proteins of the region encompassing the Zinc-Finger motif, previously implicated in the ability of plant’s eIF3g to bind to partners such as eIF4B and the viral Transactivator protein (TAV) of caulimoviruses [60, 61].
eIF3i (p36/TRIP1 in humans and TIF34/Sum1 in yeast) is another eIF3 core component conserved between yeast and humans, and thus also essential for translation in vivo (reviewed in [2, 5]). It is characterized by the presence of a WD repeat domain (as described for eIF3b), consisting of seven defined WD repeats which cover nearly all of its length and which are mostly conserved in different eukaryotic species . eIF3i orthologues in trypanosomatids are clearly identifiable with size and features similar to other eukaryotic sequences and their similarity with the human protein is the highest among the eIF3 subunits (Table 2). T. vaginalis and G. duodenalis orthologues were also identified and, by using the HMM search, a putative M. jannaschii eIF3i orthologue was also found (Table 1). eIF3i then was the single eIF3 subunit from which a representative orthologue was found for this Archean organism and, with exception of E. coli, all other organisms investigated in this study were found to have eIF3i orthologues. The M. jannaschii homologue is only found in a restricted number of Archaen species and differs greatly in size and sequence from all other eukaryotic eIF3i orthologues but a phylogenetic tree built comparing these with their nearest WD containing homologues from various organisms shows a grouping with eIF3i (Figure 6A), although it is not clear at this stage if this grouping has any relevant implications regarding its function. Aligning the various eukaryotic eIF3i sequences reveal that they all have similar size and segments of similarities are seen throughout the sequences but the conservation is increased near the proteins’ N- and C-terminal ends (Additional file 1: Figure S9). The various amino acid residues mapped to the C-terminal end of yeast eIF3i and found to mediate the interaction with eIF3b  were also investigated (Figure 6B and C). From a total of twelve residues involved in the eIF3i-eIF3b interaction, highlighted in the Figure, eight are conserved in the eIF3i of trypanosomatids, most of which are not found in either of the T. vaginalis or G. duodenalis orthologues. Likewise, two residues implicated in the interaction with eIF3g  are also conserved in the trypanosomatid eIF3i orthologues.
eIF3j (also called p35 in humans and HCR1 in S. cerevisiae) is the only non-core, non-essential eIF3 subunit found in S. cerevisiae. It is a highly conserved subunit which is loosely associated to the eIF3 complex and might play a role in mediating its binding to the 40S ribosomal subunit (reviewed in [2, 4, 5]). More recently eIF3j has been confirmed as non-essential in human cells  and has been implicated in events associated with control of translation termination and stop codon read-through in yeast . Putative eIF3j homologues were found not only in both trypanosomatids investigated (annotated as hypothetical proteins) but also in G. duodenalis although no homologue was identified from T. vaginalis (Table 1). These eIF3j orthologues, however, display a very low level of identity when compared with better characterized eukaryotic eiF3j orthologues from mammals and yeast (Additional file 1: Figure S10), and they are all annotated as hypothetical proteins. A distinctive feature is the much conserved, negatively charged, C-terminal end in all proteins, which is somewhat more diverged in the L. major orthologue. The presence of eIF3j homologues in divergent eukaryotes highlight the relevant role it plays in translation initiation despite its non-essential nature in S. cerevisae.
eIF3k, eIF3l and eIF3m
These are the most recently characterized of the eIF3 subunits, all three harboring a PCI domain and found in animal, plant and filamentous fungi species, but with two of them (eIF3l and eIF3k) absent from S. cerevisae and S. pombe  and the third, eIF3m (first called GA-17 in humans and Csn7b in S. pombe), missing from S. cerevisae but otherwise essential for translation in fission yeast . eIF3k, the smallest non-core eIF3 subunit, is found in both trypanosomatid species studied, with sizes similar to those observed for this protein from other organisms, although no candidate T. vaginalis and G. duodenalis eIF3k sequences were identified (Tables 1 and 2). The conservation between the protozoan eIF3k orthologues and the human protein is very low but the sequences are conserved within the trypanosomatid family, encompassing its PCI domain, and the alignment with other eukaryotic proteins confirms the low similarity between the sequences (Additional file 1: Figures S7 and S11A). Clear orthologues in both L. major and T. brucei were also found for eIF3l (originally called HSPC021 in humans) with sizes and features similar to those present in other eukaryotes and overall conservation comparable to those observed for other eIF3 subunits (Tables 1 and 2). As for eIF3k, no likely orthologues were found in either T. vaginalis or G. duodenalis but the alignment of the trypanosomatid proteins with other eIF3l sequences reveals conserved elements throughout their length (Additional file 1: Figures S7 and S11B), including the previously described tetratricopeptide (TPR) repeat and PCI domain regions . In contrast to the other PCI containing eIF3 subunits, no clear eIF3m orthologues were found within the trypanosomatid sequences and no T. vaginalis or G. duodenalis orthologues were found either. Using the HMM search, a single eIF3m homologue for both T. brucei and L. major was found (Tb927.10.15720 and LmjF.19.1120, respectively), which, however, clustered with 26S proteasome subunits upon sequence alignment and phylogenetic tree building with related sequences (data not shown).
Biochemical characterization of the LeishmaniaeIF3 complex
To validate the bioinformatic characterization of the various trypanosomatid eIF3 subunits, the sequence encoding the L. major EIF3E was amplified, cloned and expressed as an N-terminally his-tagged protein in E. coli. The resultant recombinant protein was then used to immunize rabbits and produce a specific polyclonal anti-serum which recognizes a single band of ~45 kDa in whole protein extracts of L. major and also L. infantum (not shown). Considering the high degree of conservation within the N-terminus of the various eukaryotic eIF3e orthologues (see Figure 4), and the postulated role for this segment in mediating the nuclear localization of the human protein , this anti-serum was first used to confirm the subcelullar localization of its orthologue in L. major promastigotas. As shown in Figure 7A, the Leishmania protein was found to strictly localize to the cellular cytoplasm, ruling out any nuclear localization but compatible with its role in translation initiation. The anti-serum was then used in immunoprecipitation assays, this time using total cell extracts from L. infantum, aiming to purify the whole eIF3 complex. An analysis of the precipitated samples through western-blots with the same serum confirmed the efficiency of the procedure for the eIF3e orthologue with none of it coming down in a control immunoprecipitation carried out with the pre-immune serum (Figure 7B). These samples were then submitted to mass spectrometry analysis in order to confirm the subunit composition of the Leishmania eIF3. Figure 7C summarizes the results derived from the mass spectrometry analysis of three sets of replicates comparing the anti-EIF3E antibodies with the pre-immune control. These results confirm the presence of 10 of the eIF3 subunits identified by the bioinformatic analysis and indicate the presence of a candidate eIF3f orthologue which cannot be identified by the HMMs based searches. They also highlight the strong association between the eIF3 complex and eIF1, the single other translation initiation factor which co-precipitated in this assay, whilst confirming the lack of association of the putative eIF3j orthologue with the trypanosomatid eIF3 complex.
The results from both Leishmania and Trypanosoma species are consistent with an early appearance of a fully functional eIF3 complex during the evolution of the eukaryotic lineages. The lack of identifiable orthologues to selected eIF3 subunits from T. vaginalis and G. duodenalis might be, at least in some cases, a consequence of too much divergence in sequence which prevented their proper identification purely through bioinformatic analysis. Indeed the identification, through biochemical approaches, of a putative eIF3f orthologue in trypanosomatids which was overlooked by the bioinformatic search supports this hypothesis. When compared with their Leishmania and Trypanosoma orthologues, the sequences of the eIF3 subunits found for T. vaginalis and G. duodenalis are in general less conserved, which is consistent with an earlier divergence from the main line of eukaryotic evolution or a faster evolution divergence rate for these organisms, as indicated by their classification within the supergroup Excavata . Nevertheless, considering the lack of evidence for some of the most conserved eIF3 subunits (such as eIF3e or eIF3g) in both T. vaginalis and G. duodenalis, possibly a consequence of a secondary loss of subunits rather than reflecting an earlier evolutionary stage, the evidence presented indicates a much simplified eIF3 complex for these organisms. The absence of an eIF3a orthologue in these two protists is nonetheless striking, considering that it is the largest of the eIF3 subunits and the large number of interactions in which it is involved and which are critical for eIF3 function. Since an eIF3j orthologue is found in G. duodenalis, and considering the limited but consistent homology seen between the entire length of eIF3j and part of eIF3a , plus some functional overlap observed between the two proteins , a possible explanation would be for the G. duodenalis eIF3j to perform some or most of the functions carried out by the eIF3a subunit in other eukaryotes. For this organism at least, a simplification in some roles has been seen for other biological processes  and the data shown here is consistent with a simplified eIF3 complex based mainly on the eIF3b, eIF3c and eIF3i subunits.
A secondary loss of selected subunits from the eIF3 complex definitely seems to have happened for the budding yeast S. cerevisae, which lacks several subunits found in more primitive organisms (eIF3d, eIF3e, eIF3h, eIF3k and eIF3l) and appears to have suffered a drastic reduction in complexity during its evolution. Even the eIF3 from the fission yeast S. pombe seems to be in an intermediate situation between the S. cerevisae and filamentous fungi, since it includes five subunits (eIF3d, eIF3e, eIF3f, eIF3h and eIF3m) absent from the budding yeast eIF3 but is missing two subunits (eIF3k and eIF3l) which are present in most eukaryotes, including A. niger and both trypanosomatid lineages. Critical differences in eIF3 function between the S. cerevisae and human complexes have already been seen in the way it interacts with its eIF4G partner, an interaction critical for the mRNA recruitment by the ribosome (reviewed in ). In humans this interaction is mediated by a direct binding between eIF4G and three eIF3 subunits (eIF3c, eIF3d and eIF3e)  whilst in yeast this is done indirectly, through eIF5 . Overall these results indicate a substantial flexibility in eIF3 function which has been differentially exploited by different organisms.
eIF3i has been shown to localize to the periphery of the eIF3 complex [8, 72], but is the most conserved eIF3 subunit, indicating both an ancient role in translation initiation (and an origin which may precede the evolution of early eukaryotes) and also a conservation of critical functions as part of the eIF3 complex. The presence of putative eIF3b and eIF3c orthologues in all eukaryotic organisms investigated thus far also highlights the central role that these proteins are likely to have within the eIF3 complex. In yeast, eIF3b-eIF3i-eIF3g form a ternary complex that interacts with eIF3c and eIF3a  and which has also been confirmed in mammalian cells [8, 36]. The ternary complex lacks any of the proteins belonging to the PCI/MPN octamer core of eIF3 but with eIF3a it forms the mammalian module i of eIF3, capable of maintaining on its own the ability to promote mRNA recruitment to the ribosomes . Within this module, eIF3a seems to be critical for the formation and stability of the remaining eIF3 subunits and its function would have seem to be essential for any eIF3 function. Nevertheless, provided that it is indeed missing from G. duodenalis or T. vaginalis, the resulting eIF3 complex in these organisms would have to be stabilized solely by direct interactions between eIF3b and eIF3i and between eIF3b and eIF3c. Indeed the lack of conservation of residues in eIF3b which are involved in its interaction with eIF3i (despite a conservation of the eIF3b binding residues in eIF3i) might indicate significant divergence in this interaction in different organisms. Under this scenario, it might be possible that during its evolution the eIF3 complex started with a single PCI subunit (eIF3c) and no subunit containing a MPN domain. Other proteins with these specific domains were then acquired, with some at least deriving from the proteasome lid or signalsome subunits since four eIF3 subunits (eIF3f, eIF3h, eIF3k, eIF3l) have clear counterparts within these two complexes . This model implies a significant contribution of eIF3i for the function of the whole eIF3 complex, at least in more primitive eukaryotes, however it is in disagreement with early evidence indicating that eIF3i and eIF3g are dispensable for several key functions of eIF3 in translation initiation in yeast  and that neither subunit is required for active complex formation in mammals , a discrepancy which needs to be resolved.
The in silico and experimental data presented here highlight very relevant features regarding the eIF3 complex and its conservation in most, if not all eukaryotic lineages. The systematic approach carried out aiming to properly identify the different eIF3 subunits in two distinct trypanosomatid lineages, and also in T. vaginalis and G. duodenalis, provides a framework for future studies focusing on unique aspects of the translation machinery in these pathogens. Considering the lack of information regarding eIF3 structure and function in more divergent eukaryotes, the sequence analysis performed here can also help pinpoint conserved elements, in different subunits, so far overlooked but which might have relevant functional roles and are in need of a proper investigation. Subsequent experimental approaches to investigate and validate relevant differences found in comparison with other eukaryotes will be very useful not only to understand unique aspects of translation initiation, and its regulation, in these divergent eukaryotes but also may contribute to the understanding of the whole process, and how it can vary between different groups.
Organisms and sequences
In order to identify orthologues to subunits of the eIF3 complex in Trypanosomatids and lower eukaryotes, a range of predicted proteomes were downloaded for twelve organisms in February 25, 2013 including: Homo sapiens [taxid: 9606], Caenorhabditis elegans [taxid: 6239], Arabidopsis thaliana [taxid: 3702], Aspergilus niger [taxid: 425011], Schizosaccharomyces pombe [taxid: 284812], Saccharomyces cerevisiae [taxid: 559292], Leishmania major [taxid: 347515], Trypanosoma brucei [taxid: 185431], Trichomonas vaginalis [taxid: 412133], Giardia duodenalis [taxid: 5741], Methanocaldococcus jannaschii [taxid: 2190] and Escherichia coli [taxid: 1010810]. Within these organisms, four of them are excavates, two trypanosomatids (L. major and T. brucei) plus T. vaginalis and G. duodenalis. An Archea (M. jannaschii) and a bacterium (E. coli) species were also included, with the bacterium to be used as negative controls. Both trypanosomatid proteomes were downloaded from TritrypDB, whilst the T. vaginalis and G. duodenalis proteomes were downloaded from TrichDB and GiardiaDB, respectively, and all other proteomes were downloaded from NCBI ftp site. All accession numbers for the sequences included in the alignments and in the phylogenetic analysis described below (including not only the various orthologues to the eIF3 subunits but also other relevant sequences) are discriminated in Additional file 2: Table S1.
All likely eIF3 subunits from trypanosomatid species were named in capital letters following the proposed nomenclature for trypanosomatid proteins . Abbreviations for the various organisms investigated are as follows: Hsa – Homo sapiens; Cel – Caenorhabditis elegans; Ath – Arabidopsis thaliana; Ani – Aspergillus niger; Spo – Schizosaccharomyces pombe; Sce – Saccharomyces cerevisae; Lmj – Leishmania major; Tbr – Trypanosoma brucei, Tva – Trichomonas vaginalis; Gdu– Giardia duodenalis, Mje – Methanocaldococcus jannaschii.
Search for eIF3 subunits using Hidden Markov Models
To identify the eIF3 subunits within the proteomes of the selected organisms, all proteins derived from each chosen organism were clustered into orthologue groups. The OrthoMCL program , which uses the MCL (Markov Cluster) algorithm , was employed for this task. The human (H. sapiens) predicted proteome was then used as reference to search for the thirteen subunits of human eIF3 complex, and these proteins were used to recover the orthologue groups defined by the OrthoMCL tool. From a total of 213,686 protein sequences derived from the selected genome sequences, 23,271 orthologue groups were predicted based on OrthMCL analysis. When a search was made for subunits of the human eIF3 complex, 20 proteins were returned, and they were distributed into 13 orthologue groups.
All protein sequences present in each orthologue group were extracted, and these were used as input for the multiple alignment tool called MAFFT  (default settings). The alignments were manually inspected, and proteins, which were decreasing the alignment quality, were taken off from the data. Finally a multiple alignment of each orthologue group was used as input to hmmbuild, a program from the HMMER package, version 3.0 . HMMER was used to build Hidden Markov Models (HMMs) based on multiple alignments, and the HMMs used to search for distantly related proteins within the various organisms’ proteomes using the hmmsearch tool (part of the HMMER package). A cutoff of 0.001 for hit significance (e-value < = 0.001) was used.
Phylogenetic analysis of eIF3 subunits
In order to define how the identified orthologues for selected subunits (eIF3f, eIF3h and eIF3i) relate to each other as well as to closely related homologues, proteins recovered by the HMM searches were aligned by MAFFT (default settings). The alignments were automatically edited by Trimal  to keep just phylogenetically informative sites. The evolutionary model which best fits for each alignment was predicted by ProtTest . Subsequently, the phylogenetic trees were built with PhyML tool using the Maximum Likelihood method . The branch support for each tree was given by non-parametric bootstrap analysis using 1000 replicates.
Cloning and protein expression methods
The L. major DNA fragment coding for its eIF3e orthologue was amplified by PCR from total genomic DNA flanked by restriction sites for the enzymes BamH I and Xho I (5′ primer - GTG GGA TCC ATG GAC ATG CTA ACG AAG CTG; 3′ primer - TGC TCG AGT TAA CGC ATA ACG GTG TCT AGC TT; restriction sites in italic) and cloned into the same sites of the expression plasmid pRSETa (Life Technologies®). Recombinant L. major EIF3E was expressed with an N-terminal histidine tag in Escherichia coli BL-21star cells (Life Technologies®) followed by purification with Ni-NTA Agarose beads (QIAGEN®) and quantification as previously described .
Serum production and immunological procedures
The rabbit anti-serum generated against L. major EIF3E was produced through the immunization of a New Zealand white rabbit with the recombinant his-tagged protein using standard procedures. All the experimental procedures required for this immunization were approved by the “Ethics Committee for the Use of Animals on Research” from the Fundacao Oswaldo Cruz (CEUA-FIOCRUZ), license number L-053/08 to work with Oryctolagus cuniculus, and follow the ethical principles on animal experimentation defined by the Brazilian College of Animal Experimentation (COBEA). The serum generated was tested against both the recombinant protein and native Leishmania extracts through western-blotting and validated by comparison with the pre-immune serum. Antibodies derived from this serum, affinity purified against recombinant EIF3E, were then used to perform indirect immunofluorescence assays. These were carried out using the anti-rabbit IgG Alexa Fluor 488 as secondary antibody (Life Technologies®) and exponentially grown L. major promastigote cells, as described .
Immunoprecipitation and proteomic analysis of LeishmaniaeIF3 subunits
Cytoplasmic extracts prepared from exponentially grown Leishmania infantum promastigotes were used in immunoprecipitation (IP) assays carried out with the affinity purified anti-EIF3E antibodies and the pre-immune serum used as negative control. Extract preparation and IPs were essentially performed as previously described , using 30 μl of protein A sepharose pre-incubated with either anti-EIF3E antibodies or the pre-immune serum (roughly 70 μg of total IgG in both), prior to the incubation with 200 μl (0.5 to 1.0 ODs at 260 nm) of the cytoplasmic extract. Proteins bound to the beads were eluted in SDS-PAGE and an aliquot validated for the presence/absence of Leishmania EIF3E through western blotting using the anti-EIF3E serum. For mass spectrometry (MS) both sets of eluted proteins were loaded unto 15% SDS-PAGE gels and allowed to migrate into the resolving gel, when the electrophoresis was interrupted. Gel slices containing the whole protein content loaded were excised and submitted to an in-gel tryptic digestion, followed by peptide elution and desalting at a homemade C18 stage-tip . The peptides were analyzed by electrospray tandem mass spectrometry (ESI MS/MS), performed with an EASY nLC 1000 (Thermo Scientific), using a 15-cm fused silica emitter (75 μm inner diameter) in-house packed with reversed-phase ReproSil-Pur C18-AQ 3 μm resin (Dr. Maisch GmbH), connected to a LTQ Orbitrap XL ETD (Thermo Scientific) (mass spectrometry facility RPT02H PDTIS/Carlos Chagas Institute - Fiocruz Parana) mass spectrometer equipped with a nanoelectrospray ion source (Phoenix S&T). For data analyses, the MaxQuant platform (version 184.108.40.206)  was used for peak list picking, protein identification and validation. Protein identification was based on the L. infantum protein sequence databases (L. infantum JPCM5, version 6 from September 11, 2013 available at TriTrypDB). For validation, a minimum of six amino acids for peptide length and two peptides per protein were required. In addition, a false discovery rate (FDR) threshold of 0.01 (using the decoy database approach) was applied at both peptide and protein levels. To confirm the specificity of the IP assays, for each polypeptide, the ratio between the average intensities (from three independent experiments) generated for the anti-EIF3E and control IPs was first determined. The base 2 logarithms of the values produced were then calculated and only those >5 were considered. In addition, in order to have a statistical support for the results found, a t-Test was performed comparing the natural logarithm of the signal intensities from the triplicate experiments for the anti-EIF3E and control IPs and only those polypeptides having p-values <0.05 were considered.
Availability of supporting data
The data sets supporting the results from this article are included within the main manuscript and within its two additional file(s).
Aitken CE, Lorsch JR: A mechanistic overview of translation initiation in eukaryotes. Nat Struct Mol Biol. 2012, 19: 568-576. 10.1038/nsmb.2303.
Valasek LS: ‘Ribozoomin’–translation initiation from the perspective of the ribosome-bound eukaryotic initiation factors (eIFs). Curr Protein Pept Sci. 2012, 13: 305-330. 10.2174/138920312801619385.
Hinnebusch AG: The scanning mechanism of eukaryotic translation initiation. Annu Rev Biochem. 2014, 83: 779-812. 10.1146/annurev-biochem-060713-035802.
Dong Z, Zhang JT: Initiation factor eIF3 and regulation of mRNA translation, cell growth, and cancer. Crit Rev Oncol Hematol. 2006, 59: 169-180. 10.1016/j.critrevonc.2006.03.005.
Hinnebusch AG: eIF3: a versatile scaffold for translation initiation complexes. Trends Biochem Sci. 2006, 31: 553-562. 10.1016/j.tibs.2006.08.005.
Marchione R, Leibovitch SA, Lenormand JL: The translational factor eIF3f: the ambivalent eIF3 subunit. Cell Mol Life Sci. 2013, 70: 3603-3616. 10.1007/s00018-013-1263-y.
Masutani M, Sonenberg N, Yokoyama S, Imataka H: Reconstitution reveals the functional core of mammalian eIF3. EMBO J. 2007, 26: 3373-3383. 10.1038/sj.emboj.7601765.
Zhou M, Sandercock AM, Fraser CS, Ridlova G, Stephens E, Schenauer MR, Yokoi-Fong T, Barsky D, Leary JA, Hershey JW, Doudna JA, Robinson CV: Mass spectrometry reveals modularity and a complete subunit interaction map of the eukaryotic translation factor eIF3. Proc Natl Acad Sci U S A. 2008, 105: 18139-18144. 10.1073/pnas.0801313105.
Wagner S, Herrmannova A, Malik R, Peclinovska L, Valasek LS: Functional and biochemical characterization of human eIF3 in living cells. Mol Cell Biol. 2014, 34 (16): 3041-3052. 10.1128/MCB.00663-14.
Siridechadilok B, Fraser CS, Hall RJ, Doudna JA, Nogales E: Structural roles for human translation factor eIF3 in initiation of protein synthesis. Science. 2005, 310: 1513-1515. 10.1126/science.1118977.
Hofmann K, Bucher P: The PCI domain: a common theme in three multiprotein complexes. Trends Biochem Sci. 1998, 23: 204-205. 10.1016/S0968-0004(98)01217-1.
Aravind L, Ponting CP: Homologues of 26S proteasome subunits are regulators of transcription and translation. Protein Sci. 1998, 7: 1250-1254. 10.1002/pro.5560070521.
Pick E, Hofmann K, Glickman MH: PCI complexes: Beyond the proteasome, CSN, and eIF3 Troika. Mol Cell. 2009, 35: 260-264. 10.1016/j.molcel.2009.07.009.
Kouba T, Rutkai E, Karaskova M, Valasek L: The eIF3c/NIP1 PCI domain interacts with RNA and RACK1/ASC1 and promotes assembly of translation preinitiation complexes. Nucleic Acids Res. 2012, 40: 2683-2699. 10.1093/nar/gkr1083.
Khoshnevis S, Gunišová S, Vlčková V, Kouba T, Neumann P, Beznosková P, Ficner R, Valášek LS, et al: Structural integrity of the PCI domain of eIF3a/TIF32 is required for mRNA recruitment to the 43S pre-initiation complexes. Nucleic Acids Res. 2014, 42: 4123-4139. 10.1093/nar/gkt1369.
Sun C, Querol-Audí J, Mortimer SA, Arias-Palomo E, Doudna JA, Nogales E, Cate JH: Two RNA-binding motifs in eIF3 direct HCV IRES-dependent translation. Nucleic Acids Res. 2013, 41: 7512-7521. 10.1093/nar/gkt510.
Asano K, Vornlocher HP, Richter-Cook NJ, Merrick WC, Hinnebusch AG, Hershey JW: Structure of cDNAs encoding human eukaryotic initiation factor 3 subunits. Possible roles in RNA binding and macromolecular assembly. J Biol Chem. 1997, 272: 27042-27052. 10.1074/jbc.272.43.27042.
Sun C, Todorovic A, Querol-Audí J, Bai Y, Villa N, Snyder M, Ashchyan J, Lewis CS, Hartland A, Gradia S, Fraser CS, Doudna JA, Nogales E, Cate JH: Functional reconstitution of human eukaryotic translation initiation factor 3 (eIF3). Proc Natl Acad Sci U S A. 2011, 108: 20473-20478. 10.1073/pnas.1116821108.
Querol-Audi J, Sun C, Vogan JM, Smith MD, Gu Y, Cate JH, Nogales E: Architecture of human translation initiation factor 3. Structure. 2013, 21: 920-928. 10.1016/j.str.2013.04.002.
Hernandez G, Proud CG, Preiss T, Parsyan A: On the diversification of the translation apparatus across eukaryotes. Comp Funct Genomics. 2012, 2012: 256848-
Fernandez-Moya SM, Estevez AM: Posttranscriptional control and the role of RNA-binding proteins in gene regulation in trypanosomatid protozoan parasites. Wiley Interdiscip Rev RNA. 2010, 1: 34-46.
Alsford S, du Bois K, Horn D, Field MC: Epigenetic mechanisms, nuclear architecture and the control of gene expression in trypanosomes. Expert Rev Mol Med. 2012, 14: e13-
Kramer S: Developmental regulation of gene expression in the absence of transcriptional control: the case of kinetoplastids. Mol Biochem Parasitol. 2012, 181: 61-72. 10.1016/j.molbiopara.2011.10.002.
Clayton CE: Networks of gene expression regulation in Trypanosoma brucei. Mol Biochem Parasitol. 2014, 195 (2): 96-106. 10.1016/j.molbiopara.2014.06.005.
Jagus R, Bachvaroff TR, Joshi B, Place AR: Diversity of eukaryotic translational Initiation Factor eIF4E in protists. Comp Funct Genomics. 2012, 2012: 134839-
Zinoviev A, Shapira M: Evolutionary conservation and diversification of the translation initiation apparatus in trypanosomatids. Comp Funct Genomics. 2012, 2012: 813718-
Ivens AC, Peacock CS, Worthey EA, Murphy L, Aggarwal G, Berriman M, Sisk E, Rajandream MA, Adlem E, Aert R, Anupama A, Apostolou Z, Attipoe P, Bason N, Bauser C, Beck A, Beverley SM, Bianchettin G, Borzym K, Bothe G, Bruschi CV, Collins M, Cadag E, Ciarloni L, Clayton C, Coulson RM, Cronin A, Cruz AK, Davies RM, De Gaudenzi J: The genome of the kinetoplastid parasite, Leishmania major. Science. 2005, 309: 436-442. 10.1126/science.1112680.
Berriman M, Ghedin E, Hertz-Fowler C, Blandin G, Renauld H, Bartholomeu DC, Lennard NJ, Caler E, Hamlin NE, Haas B, Böhme U, Hannick L, Aslett MA, Shallom J, Marcello L, Hou L, Wickstead B, Alsmark UC, Arrowsmith C, Atkin RJ, Barron AJ, Bringaud F, Brooks K, Carrington M, Cherevach I, Chillingworth TJ, Churcher C, Clark LN, Corton CH, Cronin A: The genome of the African trypanosome Trypanosoma brucei. Science. 2005, 309: 416-422. 10.1126/science.1112642.
El-Sayed NM, Myler PJ, Blandin G, Berriman M, Crabtree J, Aggarwal G, Caler E, Renauld H, Worthey EA, Hertz-Fowler C, Ghedin E, Peacock C, Bartholomeu DC, Haas BJ, Tran AN, Wortman JR, Alsmark UC, Angiuoli S, Anupama A, Badger J, Bringaud F, Cadag E, Carlton JM, Cerqueira GC, Creasy T, Delcher AL, Djikeng A, Embley TM, Hauser C, Ivens AC: Comparative genomics of trypanosomatid parasitic protozoa. Science. 2005, 309: 404-409. 10.1126/science.1112181.
Morrison HG, McArthur AG, Gillin FD, Aley SB, Adam RD, Olsen GJ, Best AA, Cande WZ, Chen F, Cipriano MJ, Davids BJ, Dawson SC, Elmendorf HG, Hehl AB, Holder ME, Huse SM, Kim UU, Lasek-Nesselquist E, Manning G, Nigam A, Nixon JE, Palm D, Passamaneck NE, Prabhu A, Reich CI, Reiner DS, Samuelson J, Svard SG, Sogin ML: Genomic minimalism in the early diverging intestinal parasiteGiardia lamblia. Science. 2007, 317: 1921-1926. 10.1126/science.1143837.
Carlton JM, Hirt RP, Silva JC, Delcher AL, Schatz M, Zhao Q, Wortman JR, Bidwell SL, Alsmark UC, Besteiro S, Sicheritz-Ponten T, Noel CJ, Dacks JB, Foster PG, Simillion C, Van de Peer Y, Miranda-Saavedra D, Barton GJ, Westrop GD, Müller S, Dessi D, Fiori PL, Ren Q, Paulsen I, Zhang H, Bastida-Corcuera FD, Simoes-Barbosa A, Brown MT, Hayes RD, Mukherjee M: Draft genome sequence of the sexually transmitted pathogenTrichomonas vaginalis. Science. 2007, 315: 207-212. 10.1126/science.1132894.
Plutzer J, Ongerth J, Karanis P: Giardia taxonomy, phylogeny and epidemiology: Facts and open questions. Int J Hyg Environ Health. 2010, 213: 321-333. 10.1016/j.ijheh.2010.06.005.
Saletta F, Suryo RY, Richardson DR: The translational regulator eIF3a: the tricky eIF3 subunit!. Biochim Biophys Acta. 1806, 2010: 275-286.
Valasek L, Nielsen KH, Hinnebusch AG: Direct eIF2-eIF3 contact in the multifactor complex is important for translation initiation in vivo. EMBO J. 2002, 21: 5886-5898. 10.1093/emboj/cdf563.
Valasek L, Mathew AA, Shin BS, Nielsen KH, Szamecz B, Hinnebusch AG: The yeast eIF3 subunits TIF32/a, NIP1/c, and eIF5 make critical connections with the 40S ribosome in vivo. Genes Dev. 2003, 17: 786-799. 10.1101/gad.1065403.
Dong Z, Qi J, Peng H, Liu J, Zhang JT: Spectrin domain of eukaryotic Initiation Factor 3a is the docking site for formation of the a:b:i:g subcomplex. J Biol Chem. 2013, 288: 27951-27959. 10.1074/jbc.M113.483164.
ElAntak L, Tzakos AG, Locker N, Lukavsky PJ: Structure of eIF3b RNA recognition motif and its interaction with eIF3j: structural insights into the recruitment of eIF3b to the 40 S ribosomal subunit. J Biol Chem. 2007, 282: 8165-8174. 10.1074/jbc.M610860200.
Smith TF, Gaitatzes C, Saxena K, Neer EJ: The WD repeat: a common architecture for diverse functions. Trends Biochem Sci. 1999, 24: 181-185. 10.1016/S0968-0004(99)01384-5.
Liu Y, Neumann P, Kuhle B, Monecke T, Schell S, Chari A, Ficner R: Translation initiation factor eIF3b contains a nine-bladed beta-propeller and interacts with the 40S ribosomal subunit. Structure. 2014, 22: 923-930. 10.1016/j.str.2014.03.010.
De Gaudenzi J, Frasch AC, Clayton C: RNA-binding domain proteins in Kinetoplastids: a comparative analysis. Eukaryot Cell. 2005, 4: 2106-2114. 10.1128/EC.4.12.2106-2114.2005.
Nielsen KH, Valasek L, Sykes C, Jivotovskaya A, Hinnebusch AG: Interaction of the RNP1 motif in PRT1 with HCR1 promotes 40S binding of eukaryotic initiation factor 3 in yeast. Mol Cell Biol. 2006, 26: 2984-2998. 10.1128/MCB.26.8.2984-2998.2006.
Shalev A, Valásek L, Pise-Masison CA, Radonovich M, Phan L, Clayton J, He H, Brady JN, Hinnebusch AG, Asano K: Saccharomyces cerevisiae protein Pci8p and human protein eIF3e/Int-6 interact with the eIF3 core complex by binding to cognate eIF3b subunits. J Biol Chem. 2001, 276: 34948-34957. 10.1074/jbc.M102161200.
Asano K, Phan L, Anderson J, Hinnebusch AG: Complex formation by all five homologues of mammalian translation initiation factor 3 subunits from yeast Saccharomyces cerevisiae. J Biol Chem. 1998, 273: 18573-18585. 10.1074/jbc.273.29.18573.
Herrmannová A, Daujotyte D, Yang JC, Cuchalová L, Gorrec F, Wagner S, Dányi I, Lukavsky PJ, Valásek LS: Structural analysis of an eIF3 subcomplex reveals conserved interactions required for a stable and proper translation pre-initiation complex assembly. Nucleic Acids Res. 2012, 40: 2294-2311. 10.1093/nar/gkr765.
Asano K, Clayton J, Shalev A, Hinnebusch AG: A multifactor complex of eukaryotic initiation factors, eIF1, eIF2, eIF3, eIF5, and initiator tRNA(Met) is an important translation initiation intermediate in vivo. Genes Dev. 2000, 14: 2534-2546. 10.1101/gad.831800.
Karaskova M, Gunisova S, Herrmannova A, Wagner S, Munzarova V, Valasek L: Functional characterization of the role of the N-terminal domain of the c/Nip1 subunit of eukaryotic initiation factor 3 (eIF3) in AUG recognition. J Biol Chem. 2012, 287: 28420-28434. 10.1074/jbc.M112.386656.
Nygard O, Westermann P: Specific interaction of one subunit of eukaryotic initiation factor eIF-3 with 18S ribosomal RNA within the binary complex, eIF-3 small ribosomal subunit, as shown by cross-linking experiments. Nucleic Acids Res. 1982, 10: 1327-1334. 10.1093/nar/10.4.1327.
von Arnim AG, Chamovitz DA: Protein homeostasis: a degrading role for Int6/eIF3e. Curr Biol. 2003, 13: R323-R325. 10.1016/S0960-9822(03)00238-0.
Yahalom A, Kim TH, Roy B, Singer R, von Arnim AG, Chamovitz DA: Arabidopsis eIF3e is regulated by the COP9 signalosome and has an impact on development and protein translation. Plant J. 2008, 53: 300-311.
Kim T, Hofmann K, von Arnim AG, Chamovitz DA: PCI complexes: pretty complex interactions in diverse signaling pathways. Trends Plant Sci. 2001, 6: 379-386. 10.1016/S1360-1385(01)02015-5.
Marchetti A, Buttitta F, Miyazaki S, Gallahan D, Smith GH, Callahan R: Int-6, a highly conserved, widely expressed gene, is mutated by mouse mammary tumor virus in mammary preneoplasia. J Virol. 1995, 69: 1932-1938.
Asano K, Kinzy TG, Merrick WC, Hershey JW: Conservation and diversity of eukaryotic translation initiation factor eIF3. J Biol Chem. 1997, 272: 1101-1109. 10.1074/jbc.272.2.1101.
Guo J, Sen GC: Characterization of the interaction between the interferon-induced protein P56 and the Int6 protein encoded by a locus of insertion of the mouse mammary tumor virus. J Virol. 2000, 74: 1892-1899. 10.1128/JVI.74.4.1892-1899.2000.
Browning KS, Gallie DR, Hershey JW, Hinnebusch AG, Maitra U, Merrick WC, Norbury C: Unified nomenclature for the subunits of eukaryotic initiation factor 3. Trends Biochem Sci. 2001, 26: 284-10.1016/S0968-0004(01)01825-4.
Schwechheimer C: The COP9 signalosome (CSN): an evolutionary conserved proteolysis regulator in eukaryotic development. Biochim Biophys Acta. 2004, 1695: 45-54. 10.1016/j.bbamcr.2004.09.023.
Wei N, Serino G, Deng XW: The COP9 signalosome: more than a protease. Trends Biochem Sci. 2008, 33: 592-600. 10.1016/j.tibs.2008.09.004.
Cuchalova L, Kouba T, Herrmannova A, Danyi I, Chiu WL, Valasek L: The RNA recognition motif of eukaryotic translation initiation factor 3 g (eIF3g) is required for resumption of scanning of posttermination ribosomes for reinitiation on GCN4 and together with eIF3i stimulates linear scanning. Mol Cell Biol. 2010, 30: 4671-4686. 10.1128/MCB.00430-10.
Block KL, Vornlocher HP, Hershey JW: Characterization of cDNAs encoding the p44 and p35 subunits of human translation initiation factor eIF3. J Biol Chem. 1998, 273: 31901-31908. 10.1074/jbc.273.48.31901.
Hanachi P, Hershey JW, Vornlocher HP: Characterization of the p33 subunit of eukaryotic translation initiation factor-3 from Saccharomyces cerevisiae. J Biol Chem. 1999, 274: 8546-8553. 10.1074/jbc.274.13.8546.
Park HS, Browning KS, Hohn T, Ryabova LA: Eucaryotic initiation factor 4B controls eIF3-mediated ribosomal entry of viral reinitiation factor. EMBO J. 2004, 23: 1381-1391. 10.1038/sj.emboj.7600140.
Park HS, Himmelbach A, Browning KS, Hohn T, Ryabova LA: A plant viral “reinitiation” factor interacts with the host translational machinery. Cell. 2001, 106: 723-733. 10.1016/S0092-8674(01)00487-1.
Beznoskova P, Cuchalova L, Wagner S, Shoemaker CJ, Gunisova S, von der HT: Translation Initiation Factors eIF3 and HCR1 control translation termination and stop codon read-through in yeast cells. PLoS Genet. 2013, 9: e1003962-10.1371/journal.pgen.1003962.
Burks EA, Bezerra PP, Le H, Gallie DR, Browning KS: Plant initiation factor 3 subunit composition resembles mammalian initiation factor 3 and has a novel subunit. J Biol Chem. 2001, 276: 2122-2131. 10.1074/jbc.M007236200.
Zhou C, Arslan F, Wee S, Krishnan S, Ivanov AR, Oliva A, et al: PCI proteins eIF3e and eIF3m define distinct translation initiation factor 3 complexes. BMC Biol. 2005, 3: 14-10.1186/1741-7007-3-14.
Morris-Desbois C, Rety S, Ferro M, Garin J, Jalinot P: The human protein HSPC021 interacts with Int-6 and is associated with eukaryotic translation initiation factor 3. J Biol Chem. 2001, 276: 45988-45995. 10.1074/jbc.M104966200.
Walker G, Dorrell RG, Schlacht A, Dacks JB: Eukaryotic systematics: a user's guide for cell biologists and parasitologists. Parasitology. 2011, 138: 1638-1663. 10.1017/S0031182010001708.
Valasek L, Phan L, Schoenfeld LW, Valaskova V, Hinnebusch AG: Related eIF3 subunits TIF32 and HCR1 interact with an RNA recognition motif in PRT1 required for eIF3 integrity and ribosome binding. EMBO J. 2001, 20: 891-904. 10.1093/emboj/20.4.891.
Chiu WL, Wagner S, Herrmannova A, Burela L, Zhang F, Saini AK, et al: The C-terminal region of eukaryotic translation initiation factor 3a (eIF3a) promotes mRNA recruitment, scanning, and, together with eIF3j and the eIF3b RNA recognition motif, selection of AUG start codons. Mol Cell Biol. 2010, 30: 4415-4434. 10.1128/MCB.00280-10.
Ankarklev J, Jerlstrom-Hultqvist J, Ringqvist E, Troell K, Svard SG: Behind the smile: cell biology and disease mechanisms of Giardia species. Nat Rev Microbiol. 2010, 8: 413-422.
Villa N, Do A, Hershey JW, Fraser CS: Human eukaryotic Initiation Factor 4G (eIF4G) protein binds to eIF3c, −d, and -e to promote mRNA recruitment to the ribosome. J Biol Chem. 2013, 288: 32932-32940. 10.1074/jbc.M113.517011.
Asano K, Shalev A, Phan L, Nielsen K, Clayton J, Valásek L, Donahue TF, Hinnebusch AG: Multiple roles for the C-terminal domain of eIF5 in translation initiation complex assembly and GTPase activation. EMBO J. 2001, 20: 2326-2337. 10.1093/emboj/20.9.2326.
Damoc E, Fraser CS, Zhou M, Videler H, Mayeur GL, Hershey JW, Doudna JA, Robinson CV, Leary JA: Structural characterization of the human eukaryotic initiation factor 3 protein complex by mass spectrometry. Mol Cell Proteomics. 2007, 6: 1135-1146. 10.1074/mcp.M600399-MCP200.
Khoshnevis S, Hauer F, Milon P, Stark H, Ficner R: Novel insights into the architecture and protein interaction network of yeast eIF3. RNA. 2012, 18: 2306-2319. 10.1261/rna.032532.112.
Phan L, Schoenfeld LW, Valasek L, Nielsen KH, Hinnebusch AG: A subcomplex of three eIF3 subunits binds eIF1 and eIF5 and stimulates ribosome binding of mRNA and tRNA(i)Met. EMBO J. 2001, 20: 2954-2965. 10.1093/emboj/20.11.2954.
Clayton C, Adams M, Almeida R, Baltz T, Barrett M, Bastien P, Belli S, Beverley S, Biteau N, Blackwell J, Blaineau C, Boshart M, Bringaud F, Cross G, Cruz A, Degrave W, Donelson J, El-Sayed N, Fu G, Ersfeld K, Gibson W, Gull K, Ivens A, Kelly J, Vanhamme L, Myler P: Genetic nomenclature forTrypanosoma and Leishmania. Mol Biochem Parasitol. 1998, 97: 221-224. 10.1016/S0166-6851(98)00115-7.
Li L, Stoeckert CJ, Roos DS: OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 2003, 13: 2178-2189. 10.1101/gr.1224503.
Enright AJ, Van DS, Ouzounis CA: An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 2002, 30: 1575-1584. 10.1093/nar/30.7.1575.
Katoh K, Misawa K, Kuma K, Miyata T: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30: 3059-3066. 10.1093/nar/gkf436.
Eddy SR: Profile hidden Markov models. Bioinformatics. 1998, 14: 755-763. 10.1093/bioinformatics/14.9.755.
Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T: trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009, 25: 1972-1973. 10.1093/bioinformatics/btp348.
Abascal F, Zardoya R, Posada D: ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005, 21: 2104-2105. 10.1093/bioinformatics/bti263.
Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O: New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010, 59: 307-321. 10.1093/sysbio/syq010.
Dhalia R, Reis CR, Freire ER, Rocha PO, Katz R, Muniz JR, Standart N, de Melo Neto OP: Translation initiation in Leishmania major: characterisation of multiple eIF4F subunit homologues. Mol Biochem Parasitol. 2005, 140: 23-41. 10.1016/j.molbiopara.2004.12.001.
da Costa Lima TD, Moura DM, Reis CR, Vasconcelos JR, Ellis L, Carrington M, Figueiredo RC, de Melo Neto OP: Functional characterization of three leishmania poly(a) binding protein homologues with distinct binding properties to RNA and protein partners. Eukaryot Cell. 2010, 9: 1484-1494. 10.1128/EC.00148-10.
Rappsilber J, Ishihama Y, Mann M: Stop and go extraction tips for matrix-assisted laser desorption/ionization, nanoelectrospray, and LC/MS sample pretreatment in proteomics. Anal Chem. 2003, 75: 663-670. 10.1021/ac026117i.
Cox J, Mann M: MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol. 2008, 26: 1367-1372. 10.1038/nbt.1511.
Work in OPDMN’s lab was funded with grants provided by the Brazilian funding agencies FACEPE (APQ-0239-2.02/12) and CNPq (475471/2008-3 and 480899/2013-4). LAA, ECN, TDCL, ERF, RK and CRSR received studentships from CNPq, FACEPE or CAPES. The authors thank the Program for Technical Development of Health Inputs-PDTIS-FIOCRUZ for the use of its facilities, the mass spectrometry facility RPT02H, at the Carlos Chagas Institute – Fiocruz Parana, and the automatic sequencing facility RPT01C, at the Research Center Aggeu Magalhaes – Fiocruz Pernambuco.
The authors declare that they have no competing interests.
AMR did most of the final data analysis and did part of the writing of the manuscript. LAA, ECN and TDCL were responsible for the various experiments associated with cloning and expression of the Leishmania EIF3E, antibody production, immunolocalization and IPs. FKM performed the analysis by mass spectrometry of the immunoprecipitated samples. ERF and CRSR contributed with the original work design and with the sequence analysis of the eIF3 subunits. OPDMN contributed during most stages of the work design and execution and wrote the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Figure S1: Protein sequence alignment of eIF3a orthologues. Red and blue boxes represent the PCI and Spectrin domains, respectively. Figure S2. Protein sequence alignment of eIF3b orthologues. Red, green and blue boxes represent the RRM and WD domains and the eIF3i binding region, respectively. Figure S3. Protein sequence alignment of eIF3c orthologues. Red, blue and green boxes represent the eIF5 and eIF1 binding regions and the PCI domain, respectively. Figure S4. Protein sequence alignment of eIF3d orthologues. Figure S5. Protein sequence alignment of eIF3e orthologues. Blue and red boxes represent the NES and PCI domain, respectively. Figure S6. Protein sequence alignment of eIF3f and eIF3h orthologues. (A) Alignment of eIF3f orthologues. (B) Alignment of eIF3h orthologues. The red box represents the MPN domain. Figure S7. Schematic representation of the T. brucei EIF3F, EIF3G, EIF3H, EIF3K and EIF3L subunits. The domains/motifs are boxed and colored blue (EIF4F and EIF3H MPN domains), yellow (EIF3G RRM domain), red (EIF3K and EIF3L PCI domains) and brown (EIF3L TPR region). Figure S8. Protein sequence alignment of eIF3g orthologues. The red box represents the RRM domain. The Zinc Finger motif, when found, is highlighted (blue box). Figure S9. Protein sequence alignment of eIF3i orthologues. The red box represents the eIF3b binding region. Figure S10. Protein sequence alignment of eIF3j orthologues. Figure S11. Protein sequence alignment of eIF3k and eIF3l orthologues. (A) Alignment of the eIF3k orthologues. (B) Alignment of the eIF3l orthologues. The conserved TPR region and PCI domains are also highlighted (blue and red boxes, respectively). (DOCX 570 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Rezende, A.M., Assis, L.A., Nunes, E.C. et al. The translation initiation complex eIF3 in trypanosomatids and other pathogenic excavates – identification of conserved and divergent features based on orthologue analysis. BMC Genomics 15, 1175 (2014). https://doi.org/10.1186/1471-2164-15-1175
- Translation initiation factor
- Protein synthesis