Evolutionary analysis of the ENTH/ANTH/VHS protein superfamily reveals a coevolution between membrane trafficking and metabolism
© De Craene et al.; licensee BioMed Central Ltd. 2012
Received: 20 December 2011
Accepted: 22 June 2012
Published: 2 July 2012
Membrane trafficking involves the complex regulation of proteins and lipids intracellular localization and is required for metabolic uptake, cell growth and development. Different trafficking pathways passing through the endosomes are coordinated by the ENTH/ANTH/VHS adaptor protein superfamily. The endosomes are crucial for eukaryotes since the acquisition of the endomembrane system was a central process in eukaryogenesis.
Our in silico analysis of this ENTH/ANTH/VHS superfamily, consisting of proteins gathered from 84 complete genomes representative of the different eukaryotic taxa, revealed that genomic distribution of this superfamily allows to discriminate Fungi and Metazoa from Plantae and Protists. Next, in a four way genome wide comparison, we showed that this discriminative feature is observed not only for other membrane trafficking effectors, but also for proteins involved in metabolism and in cytokinesis, suggesting that metabolism, cytokinesis and intracellular trafficking pathways co-evolved. Moreover, some of the proteins identified were implicated in multiple functions, in either trafficking and metabolism or trafficking and cytokinesis, suggesting that membrane trafficking is central to this co-evolution process.
Our study suggests that membrane trafficking and compartmentalization were not only key features for the emergence of eukaryotic cells but also drove the separation of the eukaryotes in the different taxa.
KeywordsMembrane trafficking Cytokinesis Metabolism Comparative genomic Eukaryotic evolution Phylogeny
Intracellular compartments represented by membrane-delineated regions with specific lipids and proteins contents are characteristic of eukaryotic cells. Membrane trafficking connects all compartments allowing on the one hand lipids and proteins synthesized in the endoplasmic reticulum (ER) to reach their intended organelle and on the other hand exchanges with the extracellular medium. This makes membrane trafficking an important object of evolutionary studies, since these mechanisms are fundamentally inherent to the multi-organelle status of eukaryotes[2, 3]. Among the different intracellular compartments, endosomes are central, since they are at the crossroads of several trafficking pathways and should therefore contain the vestiges of the first eukaryotic endomembrane system, a key factor for later evolution. Endosomes are at the intersection of the endocytic, phagocytic, Golgi to lysosome trafficking (also termed VPS for vacuolar protein sorting), autophagy and plasma membrane recycling pathways. At the endosomes, effectors and cargo proteins following these different pathways are sorted to reach their final destination, henceforth we will gather this under the name endosomal system. Specific key regulators of the endosomal system contain an ENTH (Epsin N-terminal homology), an ANTH (AP180 N-terminal homology) or a VHS (Vps27, Hrs and STAM) domain at their N-terminus. Structural analyses resulted in the grouping of these proteins in the ENTH/ANTH/VHS superfamily, while functional analyses and sequence homologies led to their further classification into families and subfamilies[7–10].
These regulators also termed adaptors localize at the Golgi, endosomal or plasma membrane and function in pairs of proteins from different subfamilies such as the ANTH Sla2/HIP1 and the ENTH Ent1-2/Epsin1,2,3 in endocytosis. They are required for cargo sorting into vesicles and recruitment of scaffold or accessory trafficking effectors. Indeed, most ENTH and ANTH domains are lipid binding domains specifically interacting with a given phosphoinositide enriched at the target membrane. Cargo recruitment is mediated by interaction either with ubiquitin tagging the cargo (for endocytosis and late endosomal/multivesicular body (MVB) sorting or a peptide-motif as for the mannose-6-phosphate receptor (for transport of soluble lysosomal enzymes to endosomes)[11, 12]. Moreover, to ensure the correct assembly and budding of vesicles, most plasma membrane and Golgi localized adaptors also interact with clathrin through their C-terminal clathrin binding sites[12, 13]. Indeed, clathrin, the major component of vesicle coats at the plasma membrane and Golgi, depends on such adaptors for its membrane recruitment.
In addition to being studied experimentally, many protein families involved in trafficking have been analyzed in evolutionary studies[2, 3]. For example, extensive studies of the COP (coat protein complex) proteins required for the formations of the vesicles at early secretory pathway (ER, Golgi trafficking) showed that they have a similar structure to clathrin coat, suggesting an evolutionary conservation of the vesicular coat structure[14, 15]. Other in-depth investigations include the GTPase (Rab) and SNARE (SNAP (Soluble NSF Attachment Protein) REceptor) families that are required for the formation of the trafficking vesicles and their fusion with membranes ([4, 16]. All these studies highlighted the determinant role played by these proteins in the evolution of the endomembrane system. On the other hand, Field and co-workers focused on assessing the presence/absence profile of trafficking proteins belonging to many families using two representative organisms of each eukaryotic taxa. They showed that in the endosomal system some effectors arose later than others during evolution, Vps27/Hrs and Hse1/STAM (VHS proteins) forming the ESCRT-0 (endosomal sorting complex required for transport) complex or the GGA proteins required for exit from the Golgi (VHS proteins) or the plasma membrane epsins (ENTH proteins) are specific to Fungi and Metazoa (Opisthokonta)[17, 18].
Here we performed an extensive analysis of ENTH/ANTH/VHS superfamily members found in the proteomes of 84 fully sequenced organisms representing different eukaryotic taxa. Analysis of the presence/absence profiles of the different subfamilies revealed that the ENTHA, PICALM and GGA subfamilies were present prior to the divergence in the various eukaryotic taxa. Moreover, the genomic distribution of this superfamily perfectly reflects the dichotomy between Opisthokonta and Plantae. Comparative genomics of four proteomes and phylogenetic analyses revealed that a similar dichotomy was also observed for other protein families involved in the endosomal system but also in cytokinesis and metabolism. Further analyses of these protein families show that the endosomal system was a key process linking both metabolism and cytokinesis. Based on these results, we suggest an innovative evolutionary scenario, where the endosomal system drove the separation of Fungi and Metazoa from Plantae and Protists.
Clustering of the ENTH/ANTH/VHS protein sequences
We analyzed the protein sequences of ENTH/ANTH/VHS superfamily members found in the genomes and proteomes of 84 fully sequenced Metazoa, Fungi, Amoebazoa, Plantae, Excavata, Euglenozoa, Chromista and Rhizaria taxa (Genome OnLine Database website November 2011). Proteins with an ANTH, ENTH or VHS domain were gathered by extensive BLAST (BLASTP, TBLASTN and PSI-BLAST) searches and used to generate a Multiple Alignment of Complete Sequences (MACS) composed of 1134 proteins manually adjusted according to the structural data available on the distinct ENTH, ANTH and VHS proteins. At least one protein was found in all organisms except for one Fungi (E. cuniculi) and three Chromista (H. parasitica, T. pseudonana and C. hominis). Analysis of the MACS allowed the clustering of all the proteins in: i) 4 VHS subfamilies GGA (Golgi-localized, gamma-ear-containing, ARF-binding proteins), STAM (signal transducing adaptor molecule (SH3 domain and ITAM motif)), VHS (Vps27/Hrs/STAM) and TOM (target of myb1), ii) 2 ANTH subfamilies ANTH and PICALM (Phosphatidylinositol binding clathrin assembly protein), and iii) 4 ENTH subfamilies Epsin, EpsinR and two newly identified subfamilies of unknown function. To facilitate the discussion, we named the Golgi/endosome EpsinR subfamily: ENTHA, the endocytic Epsin subfamily: ENTHB, the new vertebrate specific epsins (human ENTD1): ENTHC and the new fungi specific epsins (S. cerevisiae Ent4): ENTHD.
Finally, an important point results from the analysis of the Plantae, Euglenozoa and Amoebozoa proteins belonging to the VHS family. While our phylogenetic tree clustered these proteins with the opisthokonta GGA subfamily, previous studies have classified them in the TOM subfamily due to a domain organization similar to the metazoan TOM, i.e. presence of a GAT (GGA and Tom1) domain and absence of a GAE (gamma-adaptin ear) domain‐. To definitively assign these proteins to either the GGA or the TOM subfamily, we aligned their GAT domain and a robust phylogenetic tree was calculated (Additional file1: Figure S1). This tree showed a clear clustering of all Plantae and protist GAT domains on the same branch as Metazoa and Fungi GGA proteins whereas the Metazoan GAT domain of TOM proteins clustered on a separate branch. Confidence in this classification was acquired by analyzing both our alignment and the crystal structures of these domains. Our alignment showed that the extreme N-terminus of TOM proteins had a conserved PF domain and an extra amino acid between ∝-helices 3 and 4 compared to Metazoa GGA proteins or the Plantae and Protists proteins with a VHS domain (Additional file2: Figure S2). Furthermore, the resolved structure of TOM1 VHS domain shows that the conserved PF domain is a ∝-helical turn and ∝-helix 6 of GGA is broken in TOM proteins[24, 25]. These results clearly classify Amoebozoa, Plantae and Euglenozoa VHS proteins in the GGA subfamily and suggest that the GAE domain was a later acquisition of Opisthokonta GGA proteins.
As a whole, GGA-containing proteins revealed a complex evolution as illustrated by the fungal GGA branch which is nested in the VHS subfamily cluster and closer to the Metazoa than Fungi VHS domain (Figure1). Since Fungal GGA proteins display the same domain composition and function as the Mammalian ones, this result suggests that the VHS domain, specific to the Opisthokonta taxa, evolved from the GGA domain and certainly resulted from duplication. Furthermore, the STAM and TOM subfamily, respectively specific to Opisthokonta and Metazoa, are more recent acquisitions that evolved from the VHS subfamily.
Genomic distribution of the ENTH/ANTH/VHS superfamily members
The ENTHB subfamily (Human Epsin1-3 and yeast Ent1-2), which is required for endocytosis, is specific to Opisthokonta (Metazoa and Fungi) (Figure2), suggesting that this subfamily was acquired more recently probably by duplication of the ENTHA subfamily. In addition, we identified two new Opisthokonta subfamilies of unknown function: ENTHC found in most vertebrates except Aves and ENTHD in the fungal subphylum Saccharomycotina (Figure2).
Members of the ANTH family are also found in most taxa excluding the Chromista and Excavata, with the PICALM subfamily composed of Metazoa AP180 and CALM and Fungi yAP1801-2 being the most conserved. This is in agreement with a previous study showing that AP180 is an ancient component of the endocytic system. In addition, our results show that the second ANTH subfamily (Metazoa HIP1(HSC70 Interacting Protein) and HIP1R, Fungi Sla2 and Amoeboza HIP1R), characterized by the I/LWEQ actin binding domain, is specific to Opisthokonta and Amoebozoa which form the Unikonts. We do notice some exceptions with the Fungi Rhizopus oryzae and animalia Monosiga brevicollis, Trichoplax adhaerens, Hydra magnapapillata, Nematostella vectensis and Amphimedon queenslandica missing the ANTH proteins probably due to protein loss (Figure2).
Analysis of the presence/absence profile in the VHS family shows that VHS and STAM subfamilies, forming the endosomal sorting complex ESCRT-0 are specific to Opisthokonta (Figure2), as previously observed[18, 23]. Here we also show that TOM proteins are specific to Metazoa and certainly a recent acquisition. Moreover in vertebrates, STAM and TOM proteins are duplicated or even triplicated, whereas the VHS subfamily contains only one member despite its crucial role in endosomal sorting. This is even more surprising since this unique VHS protein directly interacts with the duplicated STAM proteins to form the ESCRT-0 complex. Nonetheless, total absence of the ESCRT-0 complex is observed in the Fungi Rhizopus oryzae or partial in animalia Hydra magnapapillata and Amphimedon queenslandica which have a TOM but no STAM (Figure2). Trichoplax adhaerens on the other hand has a STAM but no TOM protein. Thus, such organisms could be ideally suited to better characterize the function of these proteins.
Plantae, which undergo endocytosis and endosomal sorting[31, 32], possess only ENTHA, PICALM and GGA subfamilies, all of which had a complex evolution as illustrated by the high number of duplications in each subfamily that we could group in several types (Figure2). Despite being unable to assign clear functions to these different types, we could speculate on the potential functions of some ENTHA types. Indeed, Plantae ENTHA1, ENTHA2 types (A. thaliana Epsin1 and Epsin2/EpsinR2 respectively) and ENTHA3 cluster with S. cerevisiae Ent3 (yEnt3) and human EpsinR and could thus function in Golgi-to-endosomal sorting, while ENTHA4 type proteins may have a different role in trafficking, possibly endocytosis. Interestingly, the A. thaliana AtEpsin1 protein was shown to be required for Golgi sorting of vacuolar protein (similar functions as hEpsinR and yEnt3) and AtEpsinR2 binds to PtdIns3P suggesting an endosomal sorting function (similar as yEnt3)[33, 34].
Comparative genomic analysis
It is noteworthy that among the 245 proteins, 35 proteins share two of the three functions (i.e. intracellular trafficking and metabolism or cytokinesis)(Figure3D and Additional file3: Figure S3). Analysis of these 35 proteins shows that 20 proteins are involved in membrane trafficking and metabolism, 10 in trafficking and cytokinesis but only 4 in metabolism and cytokinesis and, 1 in the three processes (Figure3D). These results further support the hypothesis according to which membrane trafficking was a key factor in eukaryotic evolution and probably influenced the evolution of the metabolic and cytokinesis functions.
All eukaryotes share a similar intracellular organization with the nucleus and membrane-bound organelles. Thus, they depend on similar requirements to transport newly synthesized proteins from the ER to their target organelle. Numerous phylogenetic studies have shown that many ubiquitous trafficking effectors, such as small GTPases, syntaxins, coat components or adaptors (such as COP or clathrin), the lipid PtdIns-kinases and -phosphatases involved in endocytosis and MVB sorting or the C2 domain found in many proteins involved in intracellular trafficking, the MABP (MVB12-associated β prism) domain present in ESCRT-I/MVB12 subunits and in other trafficking proteins and the UMA (UBAP1-MVB12-associated) domain found in regulators of ESCRT function, are highly conserved throughout the eukaryotic lineage[17, 18, 21, 38]‐. The ENTH/ANTH/VHS superfamily of proteins is involved in three different trafficking pathways i.e. Golgi to endosomes, endosomal sorting and endocytic pathways forming the endosomal system. Here, we have revealed that although participating in the same pathways as the previously mentioned trafficking proteins, the different members and families of the ENTH/ANTH/VHS superfamily do not share the same evolutionary pattern.
Our results support the hypothesis that the ENTHA domain is the foundation of the superfamily since it is present in 80 out of the 84 studied eukaryotic organisms. In the Chromista and Excavata taxa, this domain is even the unique representative of the ENTH/ANTH/VHS function, thus suggesting that, in these organisms, multiple tasks might be performed by the unique ENTHA-containing protein. Accordingly, it is to note that, in Trypanosoma brucei (Euglenozoa), the ENTHA-containing protein localizes both as EpsinR (ENTHA) and Epsin (ENTHB) and is required for endocytosis. In Plantae, we could define four different types of ENTHA (ENTHA1-4) (Figure2). The A. thaliana Epsin1 (ENTHA1 in our analysis) and EpsinR2/Epsin2 (ENTHA2) proteins, were linked to functions in the Golgi to vacuole pathway[33, 34]. The same could apply for ENTHA3, which is closely related to ENTHA1 and ENTHA2 in our analysis. We also identified another ENTHA member, the ENTHA4 that clusters on another branch (Figure1) and may represent the epsin required for endocytosis. All these evidences support our hypothesis of a multi-functional role in trafficking for this ENTHA subfamily in organisms lacking the ENTHB subfamily. The Plantae GGA and PICALM domains diverged early on in at least two branches (giving rise to different types, Figure1 and2). The A. thaliana AP180 (At1g05020, NP_563726) protein (Picalm6 in our analysis) was shown to be involved in clathrin-mediated endocytosis and specifically recruited at the plasma membrane upon PdtIns(4,5)P2 production[45, 46]. Based on the phylogenetic tree (Figure1) and on the MACS of the full length proteins, this Picalm6 could be functionally redundant in endocytosis with the Picalm4-5 proteins (At4g32285, At2g25430; At4g02650, At1g03050), whereas the Picalm9-10 proteins (At1g68110, At1g25240, At1g14686, At2g01920; At4g40080, At5g10410, At5g65370) that are clustered on a separated branch of the phylogenetic tree might be required for other plant endocytic functions. Here, we also show that the Plantae, Euglenozoa and Amoebozoa VHS proteins belong to the GGA subfamily, even though they lack the GAE C-terminal domain, what led to their classification in the TOM subfamily‐. The biological and trafficking function of the TOM proteins are very different from the ones displayed by the GGA proteins. Indeed, the GGA proteins are required for protein sorting at the trans Golgi network, via direct binding between their VHS domain and sorting motifs present on the cargos[48, 49]. Since Plantae GGA proteins are clustered in different types, we propose that some perform Opisthokonta GGA functions, while others mimic the VHS and STAM functions that are required for cargo sorting at the endosomes in conjunction with the ESCRT-I complex and the Vps4 ATPase[18, 50]. The Euglenozoa and Amoebozoa GGA protein could be multifunctional for cargo sorting at the Golgi and at the endosomes, since in D. discoidium it interacts with DdTsg101 an endosomal ESCRT-I subunit and with clathrin that is required for Golgi trafficking. In conclusion, the large duplication of these three ENTHA, PICALM and GGA subfamilies in Plantae might allow the different subtypes to function at different steps of trafficking, and thus did not require the emergence of other subfamilies. Therefore, this study should advance the understanding of the ENTH/ANTH/VHS superfamily proteins in Plantae and in protists.
Our analysis of a large number of fully sequenced genomes representative of the different eukaryotic taxa allowed us to observe a strong correlation between the presence/absence/duplication of members of this superfamily and the eukaryotic evolutionary tree (Figure2). Indeed, most of the different eukaryotic kingdoms (Opisthokonta, Amoebozoa, Plantae, Euglenozoa) can easily be distinguished based solely on the number of subfamilies populated by at least one protein (Figure2). Only the Excavata and Chromista kingdoms could not be distinguished from each other having only the ENTHA protein (Figure2). Moreover, among the Opisthokonta, Animalia and Fungi kingdoms can be clustered in two separate groups, based on the presence/absence of the TOM subfamily. The TOM subfamily as well as the VHS and STAM subfamily probably results from the duplication and divergence of the GGA subfamily to fulfill the needs of a more complex organelle organization. Several in vivo studies on different model organisms have shown that in Opisthokonta vesicular budding and cargo sorting at the different steps of the endosomal system requires the combination of two proteins from different ENTH/ANTH/VHS subfamilies‐). This implies that these partners had to coevolve. This hypothesis is supported by our results showing that the absence of one partner results also in the lack of the other protein, such as the pair ENTHB/ANTH or VHS/STAM (S.c. Ent1/Sla2 or H.s. Hrs/STAM). For most Plantae proteins identified in this analysis the biological function is unknown, however a similar pairing rule could apply between the protein types.
Our genomic distribution also suggests that membrane trafficking was a driving force for the eukaryotic diversification in different taxa. This hypothesis was tested by comparative genomics and GO enrichment analyses and resulted in the identification of membrane trafficking, metabolism and cytokinesis as the only cellular functions differentiating Opisthokonta (H. sapiens and S. cerevisiae) from Plantae (A. thaliana) and Amoebozoa (E. histolytica), only metabolic functions were common to S. cerevisiae and A. thaliana, and none could statistically be considered common to S. cerevisiae and E. histolytica. Among the 75 proteins identified as Opisthokonta specific and involved in trafficking, 30 were involved in the endosomal system supporting our hypothesis that this process contributed to the evolution of Opisthokonta. It is also noteworthy that of these same 75 proteins, 20 have overlapping functions in intracellular trafficking and metabolism and 10 in intracellular trafficking and cytokinesis, a specificity of Opisthokonta. In both cases, the majority of proteins are involved in endosomal network. These results further support the endosomal system as an evolutionary driving force hypothesis.
In conclusion, although intracellular trafficking connects the same organelles in all eukaryotes and involves common regulators such as Rab GTPases, SNAREs or Clathrin, the ENTH/ANTH/VHS superfamily shows a distinct distribution between Opisthokonta and Plantae. Furthermore, without being able to assign a function to proteins with a TOM domain we could definitely conclude that they are a Metazoa specific subfamily of proteins and that ENTHA and GGA (involved in Golgi to endosome transport) and PICALM (involved in endocytosis) were present in eukaryotes prior to their evolution in the different taxa. Finally, our results show that membrane trafficking was tightly linked to cytokinesis and metabolism in Opisthokonta, this strongly supports a central role of membrane trafficking in the formation of the Opisthokonta and Amoebozoa taxa. This also suggests that membrane trafficking was not only a key acquisition for eukaryogenesis, but was also crucial for the emergence of different eukaryotic taxa, probably by allowing complex innovative metabolic pathways to be organized in compartments to protect cell homeostasis.
Generation of the multiple alignment of complete sequences
Amino acid sequences of proteins with an ENTH, ANTH or VHS domain were gathered for 20 Metazoa (Homo sapiens, Mus musculus, Rattus norvegicus, Gallus gallus, Meleagris gallopavo, Taeniopygia guttata, Tetraodon nigroviridis, Denio rerio, Xenopus laevis, Xenopus tropicalis, Drosophila melanogaster, Anopheles gambia, Caenorhabditis elegans, Caenorhabditis briggsae, Strongylocentrotus purpuratus, Nematostella vectensis, Trichoplax adhaerens, Monosiga brevicollis, Hydra magnipapillata and Ciona intestinalis), 12 Fungi (Saccharomyces cerevisiae, Ashbya gossypii, Schizosaccharomyces pombe, Candida albicans, Candida glabrata, Kluyveromyces lactis, Debaryomyces hansenii, Yarrowia lipolytica, Neurospora crassa, Rhizopus oryzae, Cryptococcus neoformans and Encephalitozoon cuniculi), 21 plants (Arabidopsis thaliana, Populus trichocarpa, Aquilegia coerulea, Carica papaya, Citrus clementina, Citrus sinesis, Cucumis sativa, Eucalyptus grandis, Glycina max, Manihot esculenta, Medicago truncatula, Mimulus guttatus, Prunus persica, Ricinus communis, Brachypodium distachyon, Setaria italica, Sorghum bicolor, Zea mays, Physcomitrella patens, Selaginella moellendorfii and Oryza sativa), 1 brown alga ( Aureoccocus anophagefferens), 1 red alga (Cyanidioschyzon merolae), 6 green algae (Micromonas pusilla, Ostreoccocus taurii, Ostreoccocus lucimarinus, Chlamydomonas reinhardtii, Volvox carteri and Chlorella vulgaris) and 23 protists (Babesia bovis, Guillardia theta, Hyaloperonospora parasitica, Phytophtora infestans, Plasmodium knowlesi, Phaeodactylum tricornutum, Theileria annulata, Toxoplasma gondii, Naegleria gruberi, Bigelowiella natans, Trypanosoma brucei, Trypanosoma cruzi, Leishmania major, Dictyostelium discoideum, Entamoeba histolytica, Tetrahymena thermophila, Giardia lamblia, Plasmodium falciparum, Plasmodium yoelii, Theileria parva, Cryptosporidium parvum, Thalassiosira pseudonana and Cryptosporidium hominis) from the NCBI, Ensembl and the Joint Genome Institute (JGI) databanks. Sequences were blasted (using BLASTP, TBLASTN and PSI-BLAST to ascertain the absence of any missing protein) and aligned using PipeAlign cascade and were manually adjusted.
The ∝ − helices of the ENTH, ANTH and VHS domains were localized on the Multiple Alignment of Complete Sequences (MACS) based on the protein structures found in the PDB database. Structure of TOM is 1ELK, VHS is 1DVP, Epsin1 is 1INZ, GGA1 is 1JWF, CALM is 1HF8 and STAM is 1X5B (by Tochio, N., Koshiba, S., Inoue, M., Kigawa, T. and Yokoyama, S. for the RIKEN Structural Genomics/Proteomics Initiative (RSGI)).
To identify the domains and motifs present in the proteins, the MACS was scanned using Interproscan and the InterPro database.
In order to reconstruct the phylogeny, we used the ∝ − helices 2 to 7 of the VHS, ENTH and ANTH domains. We excluded the following sequences because of incompleteness or poor prediction: R. norvegicus Tom1 np_001011994, O. Sativa np_001062546, X. tropicalis Tom1 np_001079451 and T. nigroviridis ANTH2. We used the neighbor joining algorithm implemented in Phylowin with 500 bootstraps to generate the phylogenetic tree. For tree visualization and editing we used iTOL.
Reciprocal best hit analysis
We used the proteomes of S. cerevisiae fromhttp://www.yeastgenome.org/, H. sapiens fromhttp://www.ncbi.nlm.nih.gov, A. thaliana fromhttp://www.plantgdb.org/AtGDB/ and E. histolytica from thehttp://amoebadb.org to generate our database. Each of the 95 077 proteins was compared to the whole database using BLASTP with a cutoff E value of 10-10. This allowed the classification of the proteins in 15 groups according to the number of organisms in which each protein had an ortholog.
Proteins from the yeast-human group were then manually curated to eliminate proteins with coverage lower than 20% by BLASTP. The remaining 245 proteins were analyzed by AmiGO Term Enrichment on the Gene Ontology website using the Saccharomyces cerevisiae database. The cutoff for the p-value was set to 10-4. The GO category was retained only if it comprises more than 3 proteins among the 245 proteins analyzed. We also manually searched the SGD (Saccharomyces Genome Database) database to verify their proper assignment into a given GO category and to complete the functional analysis for the proteins that did not form GO categories since less than 3 proteins shared the same GO term. The yeast-tale cress (280 proteins) and yeast-amoeba (20 proteins) groups, not manually curated, were also analyzed by AmiGO Term Enrichment on the Gene Ontology website using the Saccharomyces cerevisiae database.
We thank L. Bianchetti and L. Poidevin (IGBMC) for their technical help; J. Morvan and J. Muller for critical discussions to improve the manuscript. J.-O.D.C. was supported by an FRM Postdoctoral fellowship. S.F. laboratory is supported by grants from the Centre National de la Recherche Scientifique (CNRS, ATIP-CNRS 05–00932 and ATIP-Plus-CNRS 2008–3098), Université de Strasbourg (UdS), Fondation Recherche Médicale (FRM INE20051105238 and FRM Comité Alsace 2006CX67-1), Association pour la Recherche sur le Cancer (ARC JR/MLD/MDV-CR306/7901), Agence Nationale de la Recherche (ANR-07-BLAN-0065), Sidaction (13065-01-00/AO016-1) and O.P. laboratory by INSERM, CNRS and ANR(ANR-07-BLAN-0065) vers (ANR-07-BLAN-0065; ANR-10-BINF-03).
- Furthauer M, Gonzalez-Gaitan M: Tales of 1001 functions: the multiple roles of membrane trafficking in development. Traffic. 2009, 10: 781-782. 10.1111/j.1600-0854.2009.00931.x.View ArticlePubMed
- de Duve C: The origin of eukaryotes: a reappraisal. Nat Rev Genet. 2007, 8: 395-403. 10.1038/nrg2071.View ArticlePubMed
- Cavalier-Smith T: Origin of the cell nucleus, mitosis and sex: roles of intracellular coevolution. Biol Direct. 2010, 5: 7-10.1186/1745-6150-5-7.PubMed CentralView ArticlePubMed
- Jekely G: Origin of eukaryotic endomembranes: a critical evaluation of different model scenarios. Adv Exp Med Biol. 2007, 607: 38-51. 10.1007/978-0-387-74021-8_3.View ArticlePubMed
- Jovic M, Sharma M, Rahajeng J, Caplan S: The early endosome: a busy sorting station for proteins at the crossroads. Histol Histopathol. 2010, 25: 99-112.PubMed CentralPubMed
- Itoh T, De Camilli P: BAR, F-BAR (EFC) and ENTH/ANTH domains in the regulation of membrane-cytosol interfaces and membrane curvature. Biochim Biophys Acta. 2006, 1761: 897-912. 10.1016/j.bbalip.2006.06.015.View ArticlePubMed
- Hyman J, Chen H, Di Fiore PP, De Camilli P, Brunger AT: Epsin 1 undergoes nucleocytosolic shuttling and its eps15 interactor NH(2)-terminal homology (ENTH) domain, structurally similar to Armadillo and HEAT repeats, interacts with the transcription factor promyelocytic leukemia Zn(2) + finger protein (PLZF). J Cell Biol. 2000, 149: 537-546. 10.1083/jcb.149.3.537.PubMed CentralView ArticlePubMed
- Mao Y, Nickitenko A, Duan X, Lloyd TE, Wu MN, Bellen H, Quiocho FA: Crystal structure of the VHS and FYVE tandem domains of Hrs, a protein involved in membrane trafficking and signal transduction. Cell. 2000, 100: 447-456. 10.1016/S0092-8674(00)80680-7.View ArticlePubMed
- Ford MG, Pearse BM, Higgins MK, Vallis Y, Owen DJ, Gibson A, Hopkins CR, Evans PR, McMahon HT: Simultaneous binding of PtdIns(4,5)P2 and clathrin by AP180 in the nucleation of clathrin lattices on membranes. Science. 2001, 291: 1051-1055. 10.1126/science.291.5506.1051.View ArticlePubMed
- Maldonado-Baez L, Wendland B: Endocytic adaptors: recruiters, coordinators and regulators. Trends Cell Biol. 2006, 16: 505-513. 10.1016/j.tcb.2006.08.001.View ArticlePubMed
- Zhu Y, Doray B, Poussu A, Lehto VP, Kornfeld S: Binding of GGA2 to the lysosomal enzyme sorting motif of the mannose 6-phosphate receptor. Science. 2001, 292: 1716-1718. 10.1126/science.1060896.View ArticlePubMed
- Shih SC, Katzmann DJ, Schnell JD, Sutanto M, Emr SD, Hicke L: Epsins and Vps27p/Hrs contain ubiquitin-binding domains that function in receptor endocytosis. Nat Cell Biol. 2002, 4: 389-393. 10.1038/ncb790.View ArticlePubMed
- Legendre-Guillemin V, Wasiak S, Hussain NK, Angers A, McPherson PS: ENTH/ANTH proteins and clathrin-mediated membrane budding. J Cell Sci. 2004, 117: 9-18. 10.1242/jcs.00928.View ArticlePubMed
- Devos D, Dokudovskaya S, Alber F, Williams R, Chait BT, Sali A, Rout MP: Components of coated vesicles and nuclear pore complexes share a common molecular architecture. PLoS Biol. 2004, 2: e380-10.1371/journal.pbio.0020380.PubMed CentralView ArticlePubMed
- Gurkan C, Stagg SM, Lapointe P, Balch WE: The COPII cage: unifying principles of vesicle coat assembly. Nat Rev Mol Cell Biol. 2006, 7: 727-738. 10.1038/nrm2025.View ArticlePubMed
- Kloepper TH, Kienle CN, Fasshauer D: SNAREing the basis of multicellularity: consequences of protein family expansion during evolution. Mol Biol Evol. 2008, 25: 2055-2068. 10.1093/molbev/msn151.View ArticlePubMed
- Field MC, Gabernet-Castello C, Dacks JB: Reconstructing the evolution of the endocytic system: insights from genomics and molecular cell biology. Adv Exp Med Biol. 2007, 607: 84-96. 10.1007/978-0-387-74021-8_7.View ArticlePubMed
- Leung KF, Dacks JB, Field MC: Evolution of the multivesicular body ESCRT machinery; retention across the eukaryotic lineage. Traffic. 2008, 9: 1698-1716. 10.1111/j.1600-0854.2008.00797.x.View ArticlePubMed
- Duncan MC, Costaguta G, Payne GS: Yeast epsin-related proteins required for Golgi-endosome traffic define a gamma-adaptin ear-binding motif. Nat Cell Biol. 2003, 5: 77-81. 10.1038/ncb901.View ArticlePubMed
- Eugster A, Pecheur EI, Michel F, Winsor B, Letourneur F, Friant S: Ent5p is required with Ent3p and Vps27p for ubiquitin-dependent protein sorting into the multivesicular body. Mol Biol Cell. 2004, 15: 3031-3041. 10.1091/mbc.E03-11-0793.PubMed CentralView ArticlePubMed
- Winter V, Hauser MT: Exploring the ESCRTing machinery in eukaryotes. Trends Plant Sci. 2006, 11: 115-123. 10.1016/j.tplants.2006.01.008.PubMed CentralView ArticlePubMed
- Blanc C, Charette SJ, Mattei S, Aubry L, Smith EW, Cosson P, Letourneur F: Dictyostelium Tom1 participates to an ancestral ESCRT-0 complex. Traffic. 2009, 10: 161-171. 10.1111/j.1600-0854.2008.00855.x.View ArticlePubMed
- Herman EK, Walker G, van der Giezen M, Dacks JB: Multivesicular bodies in the enigmatic amoeboflagellate Breviata anathema and the evolution of ESCRT 0. J Cell Sci. 2011, 124: 613-621. 10.1242/jcs.078436.PubMed CentralView ArticlePubMed
- Misra S, Beach BM, Hurley JH: Structure of the VHS domain of human Tom1 (target of myb 1): insights into interactions with proteins and membranes. Biochemistry. 2000, 39: 11282-11290. 10.1021/bi0013546.View ArticlePubMed
- Shiba T, Takatsu H, Nogi T, Matsugaki N, Kawasaki M, Igarashi N, Suzuki M, Kato R, Earnest T, Nakayama K, Wakatsuki S: Structural basis for recognition of acidic-cluster dileucine sequence by GGA1. Nature. 2002, 415: 937-941. 10.1038/415937a.View ArticlePubMed
- Bonifacino JS: The GGA proteins: adaptors on the move. Nature reviews Molecular cell biology. 2004, 5: 23-32.View ArticlePubMed
- McCann RO, Craig SW: The I/LWEQ module: a conserved sequence that signifies F-actin binding in functionally diverse proteins from yeast to mammals. Proc Natl Acad Sci U S A. 1997, 94: 5679-5684. 10.1073/pnas.94.11.5679.PubMed CentralView ArticlePubMed
- Stechmann A, Cavalier-Smith T: Rooting the eukaryote tree by using a derived gene fusion. Science. 2002, 297: 89-91. 10.1126/science.1071196.View ArticlePubMed
- Bache KG, Brech A, Mehlum A, Stenmark H: Hrs regulates multivesicular body formation via ESCRT recruitment to endosomes. J Cell Biol. 2003, 162: 435-442. 10.1083/jcb.200302131.PubMed CentralView ArticlePubMed
- Bache KG, Raiborg C, Mehlum A, Stenmark H: STAM and Hrs are subunits of a multivalent ubiquitin-binding complex on early endosomes. J Biol Chem. 2003, 278: 12513-12521. 10.1074/jbc.M210843200.View ArticlePubMed
- Schellmann S, Pimpl P: Coats of endosomal protein sorting: retromer and ESCRT. Curr Opin Plant Biol. 2009, 12: 670-676. 10.1016/j.pbi.2009.09.005.View ArticlePubMed
- Kitakura S, Vanneste S, Robert S, Lofke C, Teichmann T: Tanaka H. 2011, Clathrin Mediates Endocytosis and Polar Distribution of PIN Auxin Transporters in Arabidopsis. Plant Cell, Friml J
- Song J, Lee MH, Lee GJ, Yoo CM, Hwang I: Arabidopsis EPSIN1 plays an important role in vacuolar trafficking of soluble cargo proteins in plant cells via interactions with clathrin, AP-1, VTI11, and VSR1. Plant Cell. 2006, 18: 2258-2274. 10.1105/tpc.105.039123.PubMed CentralView ArticlePubMed
- Lee GJ, Kim H, Kang H, Jang M, Lee DW, Lee S, Hwang I: EpsinR2 interacts with clathrin, adaptor protein-3, AtVTI12, and phosphatidylinositol-3-phosphate. Implications for EpsinR2 function in protein trafficking in plant cells. Plant Physiol. 2007, 143: 1561-1575. 10.1104/pp.106.095349.PubMed CentralView ArticlePubMed
- Lahr DJ, Grant J, Nguyen T, Lin JH, Katz LA: Comprehensive phylogenetic reconstruction of amoebozoa based on concatenated analyses of SSU-rDNA and actin genes. PLoS One. 2011, 6: e22780-10.1371/journal.pone.0022780.PubMed CentralView ArticlePubMed
- Nakada-Tsukui K, Saito-Nakano Y, Husain A, Nozaki T: Conservation and function of Rab small GTPases in Entamoeba: annotation of E. invadens Rab and its use for the understanding of Entamoeba biology. Exp Parasitol. 2010, 126: 337-347. 10.1016/j.exppara.2010.04.014.View ArticlePubMed
- Smith S, Guillen N: In Microbiology Monographs. Volume 17. Organelles and Trafficking in Entamoeba histolytica. Edited by: Souza W. 2010, Springer, Berlin / Heidelberg, 149-173.
- Anantharaman V, Aravind L: The GOLD domain, a novel protein module involved in Golgi function and secretion. Genome Biol. 2002, 3: research0023-PubMed CentralPubMed
- Jekely G: Small GTPases and the evolution of the eukaryotic cell. Bioessays. 2003, 25: 1129-1138. 10.1002/bies.10353.View ArticlePubMed
- Dacks JB, Field MC: Evolution of the eukaryotic membrane-trafficking system: origin, tempo and mode. J Cell Sci. 2007, 120: 2977-2985. 10.1242/jcs.013250.View ArticlePubMed
- Lecompte O, Poch O, Laporte J: PtdIns5P regulation through evolution: roles in membrane trafficking?. Trends Biochem Sci. 2008, 33: 453-460. 10.1016/j.tibs.2008.07.002.View ArticlePubMed
- Gabernet-Castello C: Dacks JB. 2009, The Single ENTH-Domain Protein of Trypanosomes; Endocytic Functions and Evolutionary Relationship with Epsin. Traffic, Field MC
- Zhang D, Aravind L: Identification of novel families and classification of the C2 domain superfamily elucidate the origin and evolution of membrane targeting activities in eukaryotes. Gene. 2010, 469: 18-30. 10.1016/j.gene.2010.08.006.PubMed CentralView ArticlePubMed
- de Souza RF, Aravind L: UMA and MABP domains throw light on receptor endocytosis and selection of endosomal cargoes. Bioinformatics. 2010, 26: 1477-1480. 10.1093/bioinformatics/btq235.PubMed CentralView ArticlePubMed
- Barth M, Holstein SE: Identification and functional characterization of Arabidopsis AP180, a binding partner of plant alphaC-adaptin. J Cell Sci. 2004, 117: 2051-2062. 10.1242/jcs.01062.View ArticlePubMed
- Zhao Y, Yan A, Feijo JA, Furutani M, Takenawa T, Hwang I, Fu Y, Yang Z: Phosphoinositides regulate clathrin-dependent endocytosis at the tip of pollen tubes in Arabidopsis and tobacco. Plant Cell. 2010, 22: 4031-4044. 10.1105/tpc.110.076760.PubMed CentralView ArticlePubMed
- Chen X, Irani NG, Friml J: Clathrin-mediated endocytosis: the gateway into plant cells. Curr Opin Plant Biol. 2011, 14: 674-682. 10.1016/j.pbi.2011.08.006.View ArticlePubMed
- Misra S, Puertollano R, Kato Y, Bonifacino JS, Hurley JH: Structural basis for acidic-cluster-dileucine sorting-signal recognition by VHS domains. Nature. 2002, 415: 933-937. 10.1038/415933a.View ArticlePubMed
- Shiba T, Takatsu H, Nogi T, Matsugaki N, Kawasaki M, Igarashi N, Suzuki M, Kato R, Earnest T, Nakayama K, Wakatsuki S: Structural basis for recognition of acidic-cluster dileucine sequence by GGA1. Nature. 2002, 415: 937-941. 10.1038/415937a.View ArticlePubMed
- Robinson DG, Jiang L, Schumacher K: The endosomal system of plants: charting new and familiar territories. Plant Physiol. 2008, 147: 1482-1492. 10.1104/pp.108.120105.PubMed CentralView ArticlePubMed
- Baggett JJ, D’Aquino KE, Wendland B: The Sla2p talin domain plays a role in endocytosis in Saccharomyces cerevisiae. Genetics. 2003, 165: 1661-1674.PubMed CentralPubMed
- Bilodeau PS, Winistorfer SC, Kearney WR, Robertson AD, Piper RC: Vps27-Hse1 and ESCRT-I complexes cooperate to increase efficiency of sorting ubiquitinated proteins at the endosome. J Cell Biol. 2003, 163: 237-243. 10.1083/jcb.200305007.PubMed CentralView ArticlePubMed
- Raiborg C, Stenmark H: The ESCRT machinery in endosomal sorting of ubiquitylated membrane proteins. Nature. 2009, 458: 445-452. 10.1038/nature07961.View ArticlePubMed
- Plewniak F, Bianchetti L, Brelivet Y, Carles A, Chalmel F, Lecompte O, Mochel T, Moulinier L, Muller A, Muller J: PipeAlign: A new toolkit for protein family analysis. Nucleic Acids Res. 2003, 31: 3829-3832. 10.1093/nar/gkg518.PubMed CentralView ArticlePubMed
- Koshiba S, Kigawa T, Kikuchi A, Yokoyama S: Solution structure of the epsin N-terminal homology (ENTH) domain of human epsin. J Struct Funct Genomics. 2002, 2: 1-8. 10.1023/A:1011397007366.View ArticlePubMed
- Zdobnov EM, Apweiler R: InterProScan an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001, 17: 847-848. 10.1093/bioinformatics/17.9.847.View ArticlePubMed
- Galtier N, Gouy M, Gautier C: SEAVIEW and PHYLO_WIN: two graphic tools for sequence alignment and molecular phylogeny. Comput Appl Biosci. 1996, 12: 543-548.PubMed
- Letunic I, Bork P: Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation. Bioinformatics. 2007, 23: 127-128. 10.1093/bioinformatics/btl529.View ArticlePubMed
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralView ArticlePubMed
- Gouet P, Courcelle E, Stuart DI, MÃÂ©toz F: ESPript: analysis of multiple sequence alignments in PostScript. Bioinformatics. 1999, 15: 305-8. 10.1093/bioinformatics/15.4.305.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.