- Research article
- Open Access
The drug:H+ antiporters of family 2 (DHA2), siderophore transporters (ARN) and glutathione:H+antiporters (GEX) have a common evolutionary origin in hemiascomycete yeasts
BMC Genomics volume 14, Article number: 901 (2013)
The Saccharomyces cerevisiae 14-spanner Drug:H+ Antiporter family 2 (DHA2) are transporters of the Major Facilitator Superfamily (MFS) involved in multidrug resistance (MDR). Although poorly characterized, DHA2 family members were found to participate in the export of structurally and functionally unrelated compounds or in the uptake of amino acids into the vacuole or the cell. In S. cerevisiae, the four ARN/SIT family members encode siderophore transporters and the two GEX family members encode glutathione extrusion pumps. The evolutionary history of DHA2, ARN and GEX genes, encoding 14-spanner MFS transporters, is reconstructed in this study.
The translated ORFs of 31 strains from 25 hemiascomycetous species, including 10 pathogenic Candida species, were compared using a local sequence similarity algorithm. The constraining and traversing of a network representing the pairwise similarity data gathered 355 full size proteins and retrieved ARN and GEX family members together with DHA2 transporters, suggesting the existence of a close phylogenetic relationship among these 14-spanner major facilitators. Gene neighbourhood analysis was combined with tree construction methodologies to reconstruct their evolutionary history and 7 DHA2 gene lineages, 5 ARN gene lineages, and 1 GEX gene lineage, were identified. The S. cerevisiae DHA2 proteins Sge1, Azr1, Vba3 and Vba5 co-clustered in a large phylogenetic branch, the ATR1 and YMR279C genes were proposed to be paralogs formed during the Whole Genome Duplication (WGD) whereas the closely related ORF YOR378W resides in its own lineage. Homologs of S. cerevisiae DHA2 vacuolar proteins Vba1, Vba2 and Vba4 occur widespread in the Hemiascomycetes. Arn1/Arn2 homologs were only found in species belonging to the Saccharomyces complex and are more abundant in the pre-WGD species. Arn4 homologs were only found in sub-telomeric regions of species belonging to the Sacharomyces sensu strictu group (SSSG). Arn3 type siderophore transporters are abundant in the Hemiascomycetes and form an ancient gene lineage extending to the filamentous fungi.
The evolutionary history of DHA2, ARN and GEX genes was reconstructed and a common evolutionary root shared by the encoded proteins is hypothesized. A new protein family, denominated DAG, is proposed to span these three phylogenetic subfamilies of 14-spanner MFS transporters.
The Saccharomyces cerevisiae major facilitator superfamily (MFS) of transporters involved in multidrug resistance (MDR), i.e., in the simultaneous resistance to a wide range of structurally and functionally unrelated cytotoxic chemicals , were classified in three protein families as deduced from the sequence analysis of the open reading frames identified after the release of the whole genome sequence [2–6]. Two of these families are the 12-spanner drug:H+ antiporter family 1 (DHA1) and the 14-spanner drug:H+ antiporter family 2 (DHA2) [7, 8] while the third was called the Unknown Major Facilitator (UMF) family . Following the demonstration that four S. cerevisiae UMF members encoded siderophore transporters [9–12], these proteins were reassigned to a new protein family, denominated the ARN family (also known as the SIT family) [13, 14]. Arn2p and Arn4p (also known as Enb1p) have high siderophore substrate specificity for the bacterial catecholate enterobactin and for triacetylfusarinine C, respectively, while Arn1 and Arn3p (also known as Sit1p) show a broad and overlapping siderophore substrate specificity . Although the other two proteins of the UMF family are highly similar to the ARN transporters [15, 16], no experimental evidence has been obtained supporting their involvement in siderophore transport [17, 18]. More recently, these two proteins were demonstrated to function as glutathione extrusion pumps (GEX) and designated Gex1p and Gex2p .
The S. cerevisiae DHA2 family comprises ten proteins encoded by ATR1, YMR279C, YOR378W, SGE1, AZR1, VBA1, VBA2, VBA3, VBA4 and VBA5 genes , but the physiological functions of the majority of these proteins are still unclear. ATR1 was the first S. cerevisiae DHA2 gene to be biochemically characterized and found to confer resistance to aminotriazole and 4-nitroquinoline-1-oxide (4-NQO) [20–22]. Later, a screen for yeast genes conferring resistance to boron revealed that the plasma membrane Atr1p was the main exporter for this element . The ORF YMR279C was proposed to encode a back-up boron pump  while ORF YOR378W is not required for boron tolerance [23, 24] but determines yeast resistance to cycloheximide and streptomycin and sensitivity to rapamycin [25, 26]. Vba1p, Vba2p and Vba3p were found to be involved in vacuolar uptake of basic amino acids, mediating the transport of histidine and lysine into the vacuole  and Vba2p was also found to catalyze the vacuolar transport of arginine . Although Vba4p was localized at the vacuolar membrane , no significant differences in the vacuolar uptake of basic amino acids were registered in a ∆vba4 mutant compared with the parental strain  and the physiological function of Vba4p remains unknown . With the exception of five amino acid residues and the presence of an extra peptide of 124 amino acids in the N terminus, Vba5p exhibits the same sequence as Vba3p . However, differently from Vba3p, Vba5p localizes exclusively at the plasma membrane where it catalyzes the uptake of lysine and arginine into the cell  and VBA5 overexpression was shown to lead to increased susceptibility of yeast cells to 4-NQO and quinidine . The transcription level of S. cerevisiae VBA3 gene was found to be highly induced under low-iron conditions [30, 31]. The AZR1 gene encodes a plasma membrane transporter required for yeast adaptation to low-molecular-weight organic acids, in particular to acetic acid, and to the antifungals ketoconazole and fluconazole and to polymyxin B [32, 33]. The SGE1 gene encodes a plasma membrane transporter presumably involved in the expulsion of dye molecules possessing a large unsaturated domain that stabilizes a permanent positive charge, such as 10-N-nonyl acridine orange, crystal violet, ethidium bromide and malachite green [34–36]. SGE1 gene also confers resistance to methylmethane sulfonate  and was reported to be present in multiple copies in the genomes of S. cerevisiae strains involved in industrial production of fuel ethanol or saké [37, 38]. Knq1p, a functionally characterized DHA2 transporter of Kluyveromyces lactis, was found to be involved in oxidative stress response and iron homeostasis [39, 40]. This protein was found to define a new branch in a phylogenetic tree constructed using the DHA2 proteins encoded in the genomes of 5 hemiascomycete yeasts .
The evolutionary history of the DHA1 genes present in the genome of 13 hemiascomycete yeast species was reconstructed by combining building tree methodologies with gene neighbourhood analysis . Gene neighbourhood analysis is a comparative genome based-approach used to infer gene lineages [42, 43]. In the present study, we undertook the clustering of the amino acid sequences of a total of 172,422 translated ORFs obtained from 31 sequenced yeast strains from 25 different hemiascomycetous species to construct a homogenous protein classification system which was used to trace back the evolutionary history of genes encoding DHA2 transporters in the Hemiascomycetes. Combined with tree construction methods, this approach allowed the most comprehensive phylogenetic characterization of the hemiascomycetous DHA2 proteins available to date. The results obtained during this study also suggest that the DHA2, ARN and GEX transporters are closely related families.
Hemiascomycete yeast genomes
The translated ORFs of the 31 sequenced hemiascomycetous strains analysed in this work were retrieved from the genome databases indicated in Table 1. These 31 hemiascomycetous strains correspond to 25 different species, 14 of which belong to the Saccharomyces complex, 9 to the CTG complex and 2 are the early-divergent hemiascomycetes, Pichia pastoris and Yarrowia lipolytica. Henceforth, the four letters code shown in Table 1 for species abbreviation will be used to designate both yeast genes and species. The letter displayed after the first four letters is used to abbreviate the strain name when the genome of more than one strain from a given species is available or when the genome of the same strain was sequenced by different research centres. To uniformize the annotation used, translated ORFs are always represented with small letters.
Sequence clustering of the translated ORFs
The comparative genomic approach used in this study is based on the sequence clustering of all translated ORFs of the 31 sequenced yeast strains. This required the compilation and organization of a total of 172,422 translated ORFs. These translated ORFs were organized into a blast database and compared all-against-all using blastp algorithm made available in blast2 package . The blastp algorithm used gapped alignment with the following parameter: expectation value (10-30), open gap (−1), extend gap (−1), threshold for extending hits (11) and word size (3). This approach generated a total of 31 million pairwise alignments. In order to handle this amount of data, sequence clustering was formulated as a graph traversal problem, where the nodes are the translated ORFs and the edges indicate the existence of pairwise sequence similarity between amino acid sequences. Classification of the translated ORFs into clusters was achieved by breadth-first traversing this network at different e-value thresholds, ranging from E-30 to E-12.
Gene neighbourhood analysis
During this work, a MySQL genome database was built compiling a series of genomic information regarding the previously mentioned translated ORFs. This genomic information includes gene name, chromosome/contig, sequence clustering classification, gene start position, gene end position, amino acid sequence length and encoded amino acid sequences. The package “sqldf”  and complementing scripting in R language was used to retrieve fifteen neighbour genes on each side of the query genes as well as the corresponding sequence clustering classification from this hemiascomycetous genome database. The rationale of synteny analysis resides on the assumption that two genes of different yeast species whose translation products belong to the same sequence cluster (homologues by similarity) will be members of the same gene lineage if they share at least one pair of neighbours that are also homologous to each other by similarity [42, 43]. The process is reiterated for all possible heterospecific pairwise comparisons of homologues deduced from the sequence clusters. The sequence clustering classification of the thirty genes neighbouring each query gene was done using a conservative blastp e-value of E-30 to limit the number of false positive sequences incorporated together with true cluster members. When further evidences were needed to corroborate dubious synteny connections between genes, sequence clustering was performed at a less restrictive e-value threshold (E-15).
The chromosome neighbourhood of the query genes was converted into a format adequate for import into the Cytoscape environment . In the resulting networks, nodes represent query genes and edges represent pairs of neighbouring genes classified in the same sequence cluster. Useful biological information indicated below was imported into the Cytoscape network as edges attributes. The existence of synteny between query genes was verified through the analysis of network topology (number of shared neighbour pairs) and the biological information associated with the corresponding edges. The advantage of this framework is that it allows scrutinizing the synteny relationships established between genes in a simple mathematical context of network topology exploration. Three sources of biological information were used to assess the strength of each neighbour pair connection : i) closeness of the connecting neighbours in relation to the query genes, ii) sequence similarity between connecting neighbours and iii) dimension of the sequence cluster to which the homologous neighbours belong; small dimension of the sequence cluster indicates that it is small the probability that two homologous neighbours are in the vicinity of two query genes by chance.
Topology prediction, sequence alignment and phylogenetic tree building
The topology of the amino acid sequences was analysed using HMMTOP and TMHMM 2.0 bioinformatics tools [47, 48]. For each amino acid sequence not showing 14 TMS and with less than 490 residues in length, the TMS range was predicted by visual analysis of the topology probability and protein hydrophobicity plots generated by TMHMM 2.0 and TOPPRED 2 [49, 50], respectively.
Multiple alignments of the amino acid sequences were calculated by MUSCLE  and processed using the PHYLIP package . PROTDIST/NEIGHBOUR and PROML packages were used to generate the phylogenetic trees based on distance and maximum likelihood methods, respectively. The Dendroscope application was used for tree visualization .
Sequence identity and similarity shared between protein pairs was assessed using an all-against-all Needleman-Wunsch alignment approach. This algorithm was run using the needle package available in the EMBOSS suite . All needle pairwise alignments made in this work used default values for the gap open and gap extension parameters, 10.0 and 0.5, respectively. After the construction of the DHA2 phylogenetic tree, all-against-all Needleman-Wunsch alignments were also constructed for the members of each phylogenetic cluster.
Identification of the DHA2 proteins in 31 hemiascomycetous strains
The constraining and traversing of a pairwise similarity network allowed the identification of the DHA2 proteins encoded in the genomes of 31 hemiascomycetous strains. The functionally characterized Atr1 protein was used as starting node for the network traversal. Analysis of the plot representing the number of sequences retrieved at different e-values shows the existence of four distinct blastp clustering ranges. The first range occurs between e-values E-30 to E-21, gathering 68 amino acid sequences highly similar to the starting node Atr1p (group 1), including S. cerevisiae ORFs YMR279C and YOR378W (Figure 1B). In the second blastp clustering range (e-values from E-20 to E-18) occurs the merge of the amino acid sequences comprised in groups 3 and 4 (182 and 143 members, respectively) with those of group 1 (Figure 1B), originating a total of 393 sequences. Group 3 comprises the S. cerevisiae DHA2 transporters Sge1, Azr1, Vba1, Vba2, Vba3, Vba4 and Vba5. Group 4 comprises the amino acid sequences of S. cerevisiae ARN and GEX 14-spanner MFS transporters. Members of clusters 3 and 4 are linked by many connections, indicating the existence of strong homology between amino acid segments of these transporters. In the third blastp clustering range (e-values from E-17 to E-15), group 2, comprising the biochemically characterized DHA2 transporter of K. lactis species, Knq1p, is merged with groups 1, 3 and 4. The resulting supergroup contains a total of 402 amino acid sequences. Analysis of the number of TMS shown by these amino acid sequences (see Additional file 1) together with the construction of a phylogenetic tree (see Additional file 2) confirmed that all true 14-spanner transporters were gathered at this blastp clustering range. In the fourth blastp clustering range (e-values equal or bellow E-14), false positive amino acid sequences are incorporated with the true 14-spanner MFS proteins.
The joint retrieval of the DHA2, ARN and GEX proteins by this in silico approach suggests the existence of a close phylogenetic relationship linking these transporters. Consistent with this indication, the DHA2, ARN and GEX proteins encoded in the genome of S. cerevisiae S288C strain were clustered in a single protein family, CL3C0009, by the Génolevures Consortium . These results were confirmed by breadth-first traversing this network using the functionally characterized Sge1, Vba1, Vba4, Arn1, Arn3, Arn4, Gex1 and Knq1 proteins as starting nodes. For the sake of clarity, only the results of Atr1p, Knq1p, Sge1p and Arn1p are shown in Figure 1A since the remaining amino acid sequences clustered with one of these four proteins (Figure 1B).
Phylogenetic analysis of the hemiascomycetous DHA2, ARN and GEX transporters
The analysis of the hydrophobicity and topology of the 14-spanner proteins gathered in the third blastp clustering range revealed that 355 of these comprised full-size transporters while 47 were fragments (see Additional file 1). A phylogenetic tree representing the full-size DHA2, ARN and GEX proteins was built and divided into 20 clusters, labelled from A to T (Figure 2A,B and Additional file 3 for protein/translated ORFs names). Since the distance and the maximum likelihood methods originated similar phylogenetic trees regarding cluster composition (see Additional file 4 and Additional file 5), only the tree obtained using the distance method is shown (Figure 2). The stability of cluster composition of the phylogenetic trees obtained by the distance and maximum likelihood methods supported the division of the phylogenetic tree into the 20 clusters. To avoid tree artefacts resulting from root positioning, the pist_igi19985888 protein was manually chosen as root for the phylogenetic tree (cluster A) since this protein does not cluster together with any of the other amino acid sequences. The neddle package of EMBOSS suite was used to make all possible pairwise alignment combinations between the full-size 14-spanner MFS-MDR proteins. Pairwise sequence comparisons revealed that the sequence identity of transporters residing in the same phylogenetic cluster ranged from 36.5% to 94.2% whereas sequence similarity ranged from 53.0% to 97.1%.
Using as reference the DHA2, ARN and GEX transporters encoded in the S. cerevisiae S288C genome, this phylogenetic tree (Figure 2) was used to assign sequence homology to the 14-spanner transporters encoded in the genomes of the remaining hemiascomycetous strains (Table 2 and Additional file 1). For example, regarding the siderophore transporters arsenal present in the genome of the ten Candida pathogenic species considered in this study, the analysis of Table 2 shows that C. albicans and C. dubliniensis only possess CaArn1 type of siderophore transporters (cluster R), C. tropicalis and C. parapsilosis exhibit both CaArn1 and Arn3 types of siderophore transporters (cluster R and cluster P, respectively) and C. glabrata, an opportunistic yeast pathogen belonging to the Saccharomyces complex, only exhibits one siderophore transporter (the ortholog of Arn1 gene). Eight translated ORFs in the genome of C. albicans SC5314 were iden-tified as bona fide DHA2 transporters: caal_a_19.304, caal_a_19.2350, caal_a_19.1942, caal_a_19.4779, caal_a_19.3444, caal_a_19.1308, caal_a_19.7554 and caal_a_19.7336. An additional ORF, orf19.2923, was proposed before as encoding a DHA2 protein  but the corresponding amino acid sequence shares high similarity with S. cerevisiae ORF YMR155W, a putative protein of unknown function, identified as interacting with Hsp82p . Consistent with our proposal, the protein classification system developed by the Génolevures consortium also clustered YMR155W amino acid sequence in the GL3C0730 family while the S. cerevisiae DHA2 proteins are grouped in the GL3R0009 family. No association could be found between the presence of particular genes encoding DHA2, ARN and GEX transporters in the yeast genomes examined and the corresponding species pathogenicity (Table 2 and Additional file 6 and Additional file 7).
The family functional classification of each S. cerevisiae 14-spanner MFS transporter was retrieved from the Transporter Classification Database (TCDB)  and the information was added to Figure 2. The TCDB classification divides the genes that have been considered as encoding the S. cerevisiae DHA2 transporters into two families. “The Drug:H+ Antiporter-2 (14 Spanner) (DHA2) Family” (2.A.1.3), comprising ATR1, SGE1, AZR1 and VBA3 genes and ORF YOR378W, and “The Vacuolar Basic Amino Acid Transporter (V-BAAT) Family” (2.A.1.48), comprising VBA1, VBA2 and VBA4 genes. The VBA5 gene and the ORF YMR279C, historically considered as members of the DHA2 protein family , do not have family classification in TCDB database. The proteins encoded by ARN1, ARN2, ARN3 and ARN4 genes reside in a single TCDB family, “The Siderophore-Iron Transporter (SIT) Family” (2.A.1.16). The GEX1 and GEX2 encoded transporters are also not included in TCDB database.
Identification of DHA2, ARN and GEX gene lineages in the Hemiascomycetes
Gene neighbourhood analysis of the chromosome environment where the DHA2, ARN and GEX genes reside allowed the identification of thirteen gene lineages (see Additional file 8). This analysis involved the representation of synteny between genes in a network framework and, subsequently, the exploitation of the network topology in Cytoscape software environment. DHA2 genes encoding transporters present in phylogenetic clusters A, G, H, I and L did not reside in a conserved chromosome environment, as it is the case of the genes encoding siderophore transporters residing in clusters M and Q. However, in general, genes belonging to a given lineage encode transporters present in the same phylogenetic cluster. The DHA2, ARN and GEX gene lineages spanning the species of the Saccharomyces complex, CTG complex and the early-divergent hemiascomycetes, P. pastoris and Y. lipolytica, are detailed bellow. The order of speciation of the yeasts belonging to the CTG complex adopted in this work was based on the order used in previous phylogenetic studies on Hemiascomycetes [58–61].
DHA2 gene lineages
The gene neighbourhood analysis allowed the identification of seven DHA2 gene lineages in the 31 hemiascomycetous strains examined (Figures 3, 4, 5, 6, 7A and 7B). Five of these lineages include the ten DHA2 genes encoded by the genome of S. cerevisiae S288C reference strain.
Lineage 1 comprises seven sublineages (Figure 3). With the exception of pist_igi19985888 gene, a member of cluster A, lineage 1 comprises cluster B-encoding genes. In the Saccharomyces complex, three sublineages converge on S. cerevisiae SGE1, AZR1 and VBA3/VBA5 genes. The S. cerevisiae VBA3 and VBA5 genes are paralogs originated in a duplication event occurring after S. mikatae speciation. The sequenced genomes of Z. rouxii, K. polysporus, C. glabrata and S. castellii species do not possess SGE1 or VBA3/VBA5 homologs, suggesting that these genes were acquired by the ancestral of the Saccharomyces sensu strictu group (SSSG) by lateral transference. In addition, no synteny was observed between the AZR1 homologs of Lachancea and those of SSSG species. In the CTG complex, three sublineages converge on C. albicans CaSGE1 gene and on ORFs caal_a_19.3444 and caal_a_19.4779. Two sublineages encompass DHA2 genes belonging to CTG haploid sub-group species while the origin of the third sublineage is more recent. The cluster B ORF of the early-divergent species P. pastoris (pipa_3g03370) resides in a distinct chromosome environment.
With the exception of five amino acid residues and the presence of an extra peptide of 124 amino acids in the N terminus, the amino acid sequences encoded by VBA5 and VBA3 genes are identical. Due to this fact, the plasma membrane localization of Vba5p was hypothesized to be dependent on the presence of the extra N-terminal amino acid sequence . The analysis of the amino acid sequences of the VBA3 and VBA5 homologs showed that, with exception of VBA3 gene (encoded in the S. cerevisiae S288C genome) and ORF sace_e_3474 (encoded in the S. cerevisiae YJM789 genome), all these genes carry a similar extra N-terminal peptide (see Additional file 9). In addition, the analysis of translated DNA sequence of the upstream region of VBA3 gene and ORF sace_e_3474 showed that their N-terminal peptides are still encoded in the genomes of the corresponding S. cerevisiae strains. The N-terminal peptide of ORF sace_e_3474 is miss-predicted due to the localization of this ORF in the extremity of the DNA contig (its sequence is partly truncated) while the coding sequence of the N-terminal peptide in the VBA3 gene is disrupted by a stop codon (see Additional file 10).
Lineage 2 comprises the homologs of S. cerevisiae VBA1/VBA2 genes (cluster C). This lineage is divided into four sublineages (Figure 4A). One extends from ergo2b04004 to the S. cerevisiae VBA1 gene, encompassing genes from all species belonging to the Saccharomyces complex considered in this study. The genomes of K. polysporus, C. glabrata and S. castellii lack a VBA2 homolog, resulting in a lineage discontinuity occurring in the transition from pre- to post-WGD species. The third sublineage is composed by two K. waltii and K. thermotolerans genes and by a tandem repeat present in Z. rouxii genome. The last sublineage spans genes of the CTG haploid species. The VBA1/VBA2 homolog of the early-divergent hemiascomycetes P. pastoris (pipa_3g02865) does not share common neighbours with the remaining cluster C members.
Lineage 3 comprises the homologs of S. cerevisiae VBA4 gene (cluster D). Only the genomes of C. glabrata and S. castellii lack a cluster D-encoding gene (or fragment) in the 31 hemiascomycetous strains considered in this study. The sublineage encompassing the Saccharomyces complex species extends from ergo2g10076 to the S. cerevisiae VBA4 gene (Figure 4B). The paralog of VBA4 gene was quickly lost after the WGD event. The chromosome environment where the VBA4 homologs reside in the species of the CTG complex is highly conserved, encompassing also ORF yali0e18095 belonging to the early-divergent hemiascomycete yeast species Y. lipolytica. Two ORFs classified in cluster D present in the P. pastoris genome (pipa_4g02960 and pipa_1g01100) do not share common neighbours with the remaining lineage 3 genes.
Lineage 4 comprises the homologs of S. cerevisiae ATR1 gene and ORF YMR279C (cluster E). The chromosome environment where ATR1 and YMR279C homologs reside is conserved (Figure 5). The evolutionary history of these genes reproduces a typical WGD pattern, where a pre-WGD lineage splits into two sublineages, each of which gave rise to the S. cerevisiae ATR1 and YMR279C paralogs (Figure 6A). Regarding the CTG complex, cluster E-encoding genes show a linear evolutionary history converging on ORF caal_a_19.304. The early-divergent hemiascomycetes P. pastoris and Y. lipolytica do not possess cluster E members.
Lineage 5 comprises the homologs of S. cerevisiae ORF YOR378W (cluster F). This lineage divides into two sublineages (Figure 6B). The sublineage spanning the species of the Saccharomyces complex shows a discontinuity occurring in the transition from pre- to post-WGD species. Although klwa_034-snap.4 and klth0c10560 share two common neighbours with the post-WGD YOR378W homologs, the size of this discontinuity suggests that the species of the SSSG have acquired their cluster F-encoding genes by lateral gene transfer, plausibly from a pre-WGD yeast. The sublineage spanning the species of the CTG complex converges on C. albicans ORF caal_a_19.2350. The genomes of C. lusitaniae and the early-divergent hemiascomycetes P. pastoris and Y. lipolytica lack a cluster F-encoding gene.
Lineage 6 comprises the homologs of K. lactis KNQ1 gene (cluster J). Besides K. lactis, only the genomes of SSSG and Lachancea species possess cluster J-encoding genes. However, with the exception of JAY-291 strain, the S. cerevisiae strains considered in this work lack a cluster J-encoding gene. The lack of KNQ1 homologs by Z. rouxii, K. polysporus, C. glabrata, S. castellii and S. kudriavzevii species suggests that the ancestral of the SSSG have acquired their cluster F-encoding genes by lateral gene transfer, plausibly from a pre-WGD donor species (Figure 7A). The genomes of both early-divergent hemiascomycetes lack a cluster F-encoding gene.
Lineage 7 comprises genes of species belonging to the CTG complex (cluster K). A duplication event occurring after the speciation of C. parapsilosis originated two sublineages, converging each on C. albicans ORFs caal_a_19.7554 and on caal_a_19.7336 (Figure 7B). With the exception of yali0d20196, cluster K-encoding genes reside in a conserved chromosome environment.
ARN gene lineages
This study identified four lineages comprising genes encoding siderophore transporters (Figures 7C,D and 8). Three of these lineages comprise the four ARN genes encoded in the genome of S. cerevisiae S288C reference strain [9–12] and one additional lineage comprises the sole C. albicans gene encoding a siderophore transporter [62, 63]. The members of the ARN gene lineages, as described in , reside in the phylogenetic cluster 2.A.1.16.Z1.
Lineage 9 comprises the homologs of S. cerevisiae ARN4 gene (cluster O). These genes are only found in the genomes of SSSG species (Figure 7C). Of the five S. cerevisiae strains considered in this work, only the genomes of S288C reference strain and of two wine yeast isolates, the RM11-1A and EC1118 strains, exhibit ARN4 homologs.
Lineage 10 comprises the homologs of S. cerevisiae ARN3 gene (cluster P). The chromosome environment where cluster P-encoding genes reside is sparsely conserved, splitting lineage 10 into five different sublineages (Figure 8A). One sublineage spans the species belonging to the CTG, although C. albicans, C. dubliniensis and L. elongisporus lack cluster P-encoding genes. Regarding species belonging to the Saccharomyces complex, two sublineages encompass genes from post-WGD species while the remaining two encompass genes from pre-WGD species. The origin of the sublineage containing the S. cerevisiae ARN3 gene can be traced back to klpo_358.4. The amino acid sequence of pipa_3g03640 is similar to sakl0g06600 protein, although it does not share common neighbours with the remaining cluster P members.
Lineage 11 comprises the homologs of CaARN1 gene (cluster R). These genes are not related to the S. cerevisiae ARN1 gene since they do not share common neighbours and do not group in the same phylogenetic cluster. Although this lineage spans mainly genes of CTG diploid species, the haploid species P. stipitis also possesses one cluster R ORF (Figure 7D).
Lineage 13 comprises the homologs of S. cerevisiae ARN1/ARN2 genes (cluster T). Four sublineages exist within lineage 13 (Figure 8B), all spanning genes of species belonging to the Saccharomyces complex. The origin of the sublineage containing the S. cerevisiae ARN1 gene could be retraced to a Z. rouxii ORF (zyro0g07414). The K. polysporus genome lacks a cluster T-encoding gene suggesting that, after the WGD event, the ancestral of the post-WGD species quickly lost one gene duplicate. The genomes of Lachancea species and of K. lactis show an abundant number of cluster T-encoding genes, sparsely syntenic. Although the species of the CTG complex lack full-size cluster T-encoding genes, the genome of C. guilliermondii show a gene fragment whose amino acid sequence is highly similar to these transporters. The genomes of the early-divergent hemiascomycetes P. pastoris and Y. lipolytica lack cluster T-encoding genes.
The phylogenetic cluster M comprises proteins showing sequence similarity to siderophore transporters (Figure 2). As described by Diffels et al. , genes encoding cluster M transporters are only present in Y. lipolytica genome and reside in the phylogenetic cluster 2.A.1.16.Z2. Pairwise similarity searches using cluster M amino acid sequences against the Aspergillus Genome Database showed that these proteins share high sequence similarity with Aspergillus nidulans MirC (e-value 4E-76), MirA (e-value 5E-58) and MirB (e-value 5E-58) proteins, three biochemically characterized siderophore transporters [14, 64, 65].
The amino acid sequence of members of phylogenetic cluster N are closely related to those of siderophore transporters (Figure 2). Diffels et al.  reported that these proteins group in two different phylogenetic clusters (2.A.1.16.Z3 and 2.A.1.16.Z4). The gene neighbourhood analysis allowed the reconstruction of the evolutionary history of cluster N-encoding genes (lineage 8). All species belonging to the SSSG possess a cluster N transporter and the corresponding genes reside in a conserved chromosome environment (Figure 9A). However, with exception of the JAY-291 strain, all S. cerevisiae strains considered in this work lack a cluster N-encoding gene. Cluster N members are also found in species belonging to the CTG haploid sub-group and in the early-divergent hemiascomycetes P. pastoris.
GEX gene lineages
Lineage 12 comprises the homologs of S. cerevisiae GEX1/GEX2 genes (cluster S). The members of the GEX gene lineage reside in the phylogenetic cluster 2.A.1.16.Z1 (as described in ). Several previous studies indicated that the amino acid sequences of Gex1p and Gex2p are highly similar to those of Arn1 and Arn2 proteins [15, 16, 18] and the analysis of the phylogenetic tree representing the 14-spanner DHA2, ARN and GEX sub-families confirmed this observation (Figure 2). The close resemblance between the members of cluster T (Arn1/Arn2 homologs) and cluster S (Gex1/Gex2 homologs) suggests that these two functionally distinct groups of 14-spanner MFS transporters have been differentiated from the same ancestral gene. Interestingly, only the genomes of three species belonging to the SSSG and of three pre-WGD species were found to encode cluster S-encoding genes in the Hemiascomycetes clade (Figure 9B). This lineage divides into three sublineages, two containing each S. cerevisiae GEX gene while the third one comprises the GEX homologs present in the genomes of K. lactis, K. thermotolerans and K. waltii. The discontinuity occurring in lineage 12 in the transition from pre- to post-WGD species suggests that the cluster S-encoding genes were acquired by the ancestral of the SSSG by lateral gene transfer, presumably from a pre-WGD species.
A combined approach using classical phylogenetic tree building methods and gene neighbourhood analysis was used to reconstruct the evolution of the DHA2 genes in the Hemiascomycetes. This study considered twenty additional hemiascomycetous species to those examined by Gbelska et al., which did not included gene neighbourhood analysis. The 12 cluster classification of the DHA2 subfamily proposed in our phylogenetic study considerably expands the previous study . Members of the phylogenetic clusters B (Sge1/Azr1/Vba3/Vba5), C (Vba1/Vba2), D (Vba4), E (Atr1/YMR279C) and F (YOR378W) were found in the majority of the hemiascomycetous species analysed in this study, strongly suggesting that these DHA2 proteins may sustain important biological functions.
The comparative genomics approach adopted in this work allowed understanding the evolutionary relationships between three S. cerevisiae DHA2 genes encoding related amino acid sequences: ATR1 and the uncharacterized ORFs YMR279C and YOR378W. This approach revealed that the ATR1 and YMR279C genes are ohnolog genes (lineage 4) while YOR378W resides in its own lineage (lineage 5). Although the genes comprised in these two lineages do not share common neighbours, this does not exclude the existence of a common evolutionary origin rooting deep in the Fungi phylogenetic tree. The close phylogenetic relationship of ORF YMR279C with the ATR1 gene and the fact that its constitutive expression confers resistance to boron in yeast cells through the decrease of the intracellular levels of this element , is consistent with the hypothesis that ORF YMR279C is a boron extrusion pump acting as a back-up of the Atr1p transporter . The proteins belonging to cluster E present in the genomes of the pre-WGD yeast species are more similar to Atr1p than to the ORF YMR279C encoded protein. This may suggest that the ancestral function carried out by these two genes was preserved by Atr1p while the putative ORF YMR279C boron back-up function is, presumably, a new metabolic feature acquired through functional divergence of one of the redundant gene copies produced at the WGD event.
Although a few DHA2 family transporters have been described as recognizing substrates of biological significance, most of those known to be required for resistance to several drugs and other xenobiotic compounds do not have an assigned biological role yet . Remarkably, the expression of a number of DHA2 transporters has been found to confer increased susceptibility, rather than resistance, to specific chemical compounds as well. This is the case of the VBA5 gene whose overexpression in S. cerevisiae sensitizes the cells to the action of 4-NQO and quinidine  or of the ORF YOR378W whose overexpression leads to increased yeast susceptibility to rapamycin . These observations reinforces the idea that the drug pump model used to explain the physiological functions associated to the Major Facilitator Superfamily Multidrug Resistance (MFS-MDR) transporters is too simplistic .
The phylogenetic study described here strongly suggests that the previous cluster classification of the ARN and GEX members  should be revised. Since the time of the previous phylogenetic study, the GEX1 and GEX2 genes were shown to encode glutathione exchangers, a biological role that makes them physiologically apart from the ARN proteins. The functional classification of the ARN and GEX proteins into distinct subfamilies cannot ignore the fact that the encoding genes share highly related amino acid sequences (Figure 2) and that their expression is activated under conditions of iron depletion, although GEX gene expression only occurs under extreme iron scarcity . The main transcription regulator of the expression of the four ARN genes is Aft1p  while the expression of GEX1 gene is under control of the transcription factor Aft2p . The S. cerevisiae AFT1 and AFT2 genes are paralogs  that specialized during yeast evolution to perform overlapping but not redundant functions . The close amino acid sequence similarity between the ARN and GEX proteins suggests that the encoding genes may share a common evolutionary origin and that posterior divergence led to their differentiation, both in sequence and regulation, to fulfill different physiological functions.
Consistent with the notion that siderophore uptake is strongly dependent on the genetic background of the yeast strain , the present study also uncovered important variations in the arsenal of siderophore transporters encoded in genomes of different S. cerevisiae strains. While S288C and EC1118 strains do possess both ARN1 and ARN2 genes, the genomes of the remaining S. cerevisiae strains examined in this study only have the ARN1 gene suggesting that the former strains may have acquired the ARN2 gene by lateral gene transfer, presumably from a pre-WGD donor species. The genomes of S. cerevisiae strains JAY291 and YJM789 also lack an ARN4 homolog. Interestingly, ARN4 homologs only exist in SSSG species, all residing in sub-telomeric regions. These chromosomal regions are thought to serve as nursery for new genes and to provide a reservoir where new haplotypes and new gene functions can be created . However, the inspection of the chromosome neighbourhood where each ARN4 homolog resides did not provide any clue regarding the origin of this hypothetical primordial gene.
The synteny and similarity data suggests that lateral gene transfer and gene duplication were the main evolutionary forces responsible for the expansion of genes encoding DHA2, ARN and GEX transporters in the Hemiascomycetes. Lateral gene transfer is proposed to have occur in lineage 1 (SGE1 and VBA3/VBA5 homologs), lineage 2 (VBA2 homologs), lineage 5 (YOR378W homologs), lineage 6 (KNQ1 homologs), lineage 8 (cluster N members), lineage 12 (GEX1/GEX2 homologs) and lineage 13 (ARN2 homologs). Gene duplication, the primary source of new genes necessary for the evolution of functional novelty , was found to be a frequent event in the majority of DHA2, ARN and GEX gene lineages reconstructed in this study. The consistent evolutionary pattern of a surplus of duplicate genes belonging to the same phylogenetic cluster found to occur in certain hemiascomycetous genomes raises the question of how many of these genes are still functionally redundant and how many have already been co-opted through neofunctionalization or sub-functionalization to fulfil new physiological functions in the corresponding yeast strains. Although the majority of the genes encoding DHA2, ARN and GEX transporters are not essential in laboratorial optimal conditions, the widespread occurrence of lateral transfer and duplication events of these genes during the evolution of the hemiascomycetes suggest that the encoded proteins may sustain important physiological functions in the diverse range of ecological niches occupied by these yeasts in nature. Considering that multidrug resistance and iron uptake are major determinants of yeast virulence, the identification of the complete set of DHA2 and ARN transporters present in the genomes of ten Candida pathogenic species provides potential new molecular targets for antifungal drug development.
The phylogenetic results emerging from this work together with new experimental results recently reported in the literature concerning the DHA2, ARN and GEX transporters should be considered for the eventual revision of protein family classification used in TCDB database . In specific, the biochemically characterized S. cerevisiae Gex1, Gex2, Vba5, C. albicans CaArn1 and K. lactis Knp1 transporters with a demonstrated function in yeast physiology should be included in TCDB database. Moreover, the finding that ORF YMR279C is the paralog of ATR1 gene with origin in the WGD event and that it is also involved in boron homeostasis suggest its classification in the same TCDB family of ATr1p.
This study provides evidence for a close amino acid sequence similarity between the DHA2, ARN and GEX proteins. The fact that these 14-spanner transporters of the Major Facilitator Superfamily are hypothesized to recognize distinct substrates and may have different subcellular localization, at the plasma membrane, the vacuole membrane, post-Golgi vesicles and late endosomal vesicles [14, 18, 19, 71], suggests that it is unlikely that their sequence similarity may result from convergent evolution. The hypothesis that DHA2, ARN and GEX transporters share a common evolutionary root is the explanation that better fits the results of this phylogenetic study and we propose a new family to accommodate the DHA2, ARN and GEX proteins, DAG, spanning these three phylogenetic subfamilies of 14-spanner MFS transporters. This hypothesis is corroborated by the fact that these three subfamilies appeared during the evolutionary transition giving birth to the Dikarya fungi (unpublished results). Subsequently, selection, radiation and neofunctionalization of the initial ancestral genes encoding these 14-spanner MFS transporters gave rise to the functions associated with them, spanning the MDR phenomenon, amino acid transport, boron homeostasis, siderophore transport and glutathione exchange.
A total of 172,422 translated ORFs encoded in the genomes of 31 sequenced yeast strains from 25 hemiascomycetous species were gathered in this study. The corresponding amino acid sequences were compared using the blastp algorithm, generating a total of 31 million pairwise alignments, represented as a network. A functionally characterized DHA2 protein, Atr1p, was used as starting node to breadth-first traverse this network at different e-value thresholds. 14-spanner Major Facilitator Superfamily transporters involved in siderophore import  and glutathione export  were gathered together with the DHA2 proteins, supporting the concept that the genes encoding the DHA2, ARN and GEX proteins share a common evolutionary origin. The new protein family spanning these three phylogenetic subfamilies was denominated the DAG protein family and a phylogenetic tree representing the full-size DAG proteins was built. Gene neighbourhood analysis of the chromosome environment where the DHA2, ARN and GEX genes reside allowed the identification of seven DHA2 gene lineages, five ARN gene lineages and one GEX gene lineage. Lateral gene transfer and gene duplication were important mechanisms underlying the evolution of the DAG genes in the Hemiascomycetes.
Availability of supporting data
The data sets supporting the results of this article are available in the TreeBASE Repository with a study Accession URL http://purl.org/phylo/treebase/phylows/study/TB2:S15039.
Whole genome duplication
- Saccharomyces complex:
Saccharomyces sensu lato group
- CTG complex:
Group of species that translate the CUG codon into serine instead of leucine
Multiple drug resistance
Major facilitator superfamily
Drug:H+ antiporters of family 1
Drug:H+ antiporters of family 2
Unknown major facilitator
Anhydromevalonyl residues linked to Nδ-ornithine
Siderophore iron transport
Transporter classification database
Group comprising the DHA2, ARN and GEX transporters
Saccharomyces sensu strictu group
Horizontal gene transfer.
Hayes JD, Wolf CR: Molecular genetics of drug resistance. 1997, Amsterdam, the Netherlands: Harwood Academic
Goffeau A, Barrell BG, Bussey H, Davis RW, Dujon B, Feldmann H, Galibert F, Hoheisel JD, Jacq C, Johnston M, et al: Life with 6000 genes. Science. 1996, 274 (5287): 546-10.1126/science.274.5287.546. 563–547
Goffeau A, Park J, Paulsen IT, Jonniaux JL, Dinh T, Mordant P, Saier MH: Multidrug-resistant transport proteins in yeast: complete inventory and phylogenetic characterization of yeast open reading frames with the major facilitator superfamily. Yeast. 1997, 13 (1): 43-54. 10.1002/(SICI)1097-0061(199701)13:1<43::AID-YEA56>3.0.CO;2-J.
Nelissen B, De Wachter R, Goffeau A: Classification of all putative permeases and other membrane plurispanners of the major facilitator superfamily encoded by the complete genome of Saccharomyces cerevisiae. FEMS Microbiol Rev. 1997, 21 (2): 113-134. 10.1111/j.1574-6976.1997.tb00347.x.
Nelissen B, Mordant P, Jonniaux JL, De Wachter R, Goffeau A: Phylogenetic classification of the major superfamily of membrane transport facilitators, as deduced from yeast genome sequencing. FEBS Lett. 1995, 377 (2): 232-236. 10.1016/0014-5793(95)01380-6.
Paulsen IT, Sliwinski MK, Nelissen B, Goffeau A, Saier MH: Unified inventory of established and putative transporters encoded within the complete genome of Saccharomyces cerevisiae. FEBS Lett. 1998, 430 (1–2): 116-125.
Saier MH, Reddy VS, Tamang DG, Vastermark A: The transporter classification database. Nucleic Acids Res. 2013, In press
Reddy VS, Shlykov MA, Castillo R, Sun EI, Saier MH: The major facilitator superfamily (MFS) revisited. FEBS J. 2012, 279 (11): 2022-2035. 10.1111/j.1742-4658.2012.08588.x.
Lesuisse E, Simon-Casteras M, Labbe P: Siderophore-mediated iron uptake in Saccharomyces cerevisiae: the SIT1 gene encodes a ferrioxamine B permease that belongs to the major facilitator superfamily. Microbiology. 1998, 144 (Pt 12): 3455-3462.
Heymann P, Ernst JF, Winkelmann G: Identification of a fungal triacetylfusarinine C siderophore transport gene (TAF1) in Saccharomyces cerevisiae as a member of the major facilitator superfamily. Biometals. 1999, 12 (4): 301-306. 10.1023/A:1009252118050.
Heymann P, Ernst JF, Winkelmann G: A gene of the major facilitator superfamily encodes a transporter for enterobactin (Enb1p) in Saccharomyces cerevisiae. Biometals. 2000, 13 (1): 65-72. 10.1023/A:1009250017785.
Heymann P, Ernst JF, Winkelmann G: Identification and substrate specificity of a ferrichrome-type siderophore transporter (Arn1p) in Saccharomyces cerevisiae. FEMS Microbiol Lett. 2000, 186 (2): 221-227. 10.1111/j.1574-6968.2000.tb09108.x.
Yun CW, Tiedeman JS, Moore RE, Philpott CC: Siderophore-iron uptake in Saccharomyces cerevisiae. Identification of ferrichrome and fusarinine transporters. J Biol Chem. 2000, 275 (21): 16354-16359. 10.1074/jbc.M001456200.
Haas H, Eisendle M, Turgeon BG: Siderophores in fungal physiology and virulence. Annu Rev Phytopathol. 2008, 46: 149-187. 10.1146/annurev.phyto.45.062806.094338.
Sá-Correia I, Tenreiro S: The multidrug resistance transporters of the major facilitator superfamily, 6 years after disclosure of Saccharomyces cerevisiae genome sequence. J Biotechnol. 2002, 98 (2–3): 215-226.
Diffels JF, Seret ML, Goffeau A, Baret PV: Heavy metal transporters in Hemiascomycete yeasts. Biochimie. 2006, 88 (11): 1639-1649. 10.1016/j.biochi.2006.08.008.
Yun CW, Ferea T, Rashford J, Ardon O, Brown PO, Botstein D, Kaplan J, Philpott CC: Desferrioxamine-mediated iron uptake in Saccharomyces cerevisiae. Evidence for two pathways of iron uptake. J Biol Chem. 2000, 275 (14): 10709-10715. 10.1074/jbc.275.14.10709.
Dhaoui M, Auchere F, Blaiseau PL, Lesuisse E, Landoulsi A, Camadro JM, Haguenauer-Tsapis R, Belgareh-Touze N: Gex1 is a yeast glutathione exchanger that interferes with pH and redox homeostasis. Mol Biol Cell. 2011, 22 (12): 2054-2067. 10.1091/mbc.E10-11-0906.
Sá-Correia I, dos Santos SC, Teixeira MC, Cabrito TR, Mira NP: Drug:H+ antiporters in chemical stress response in yeast. Trends Microbiol. 2009, 17 (1): 22-31. 10.1016/j.tim.2008.09.007.
Kanazawa S, Driscoll M, Struhl K: ATR1, a Saccharomyces cerevisiae gene encoding a transmembrane protein required for aminotriazole resistance. Mol Cell Biol. 1988, 8 (2): 664-673.
Gompel-Klein P, Mack M, Brendel M: Molecular characterization of the two genes SNQ and SFA that confer hyperresistance to 4-nitroquinoline-N-oxide and formaldehyde in Saccharomyces cerevisiae. Curr Genet. 1989, 16 (2): 65-74. 10.1007/BF00393397.
Gompel-Klein P, Brendel M: Allelism of SNQ1 and ATR1, genes of the yeast Saccharomyces cerevisiae required for controlling sensitivity to 4-nitroquinoline-N-oxide and aminotriazole. Curr Genet. 1990, 18 (1): 93-96. 10.1007/BF00321122.
Kaya A, Karakaya HC, Fomenko DE, Gladyshev VN, Koc A: Identification of a novel system for boron transport: Atr1 is a main boron exporter in yeast. Mol Cell Biol. 2009, 29 (13): 3665-3674. 10.1128/MCB.01646-08.
Bozdag GO, Uluisik I, Gulculer GS, Karakaya HC, Koc A: Roles of ATR1 paralogs YMR279c and YOR378w in boron stress tolerance. Biochem Biophys Res Commun. 2011, 409 (4): 748-751. 10.1016/j.bbrc.2011.05.080.
Butcher RA, Bhullar BS, Perlstein EO, Marsischky G, LaBaer J, Schreiber SL: Microarray-based method for monitoring yeast overexpression strains reveals small-molecule targets in TOR pathway. Nat Chem Biol. 2006, 2 (2): 103-109. 10.1038/nchembio762.
Alamgir M, Erukova V, Jessulat M, Azizi A, Golshani A: Chemical-genetic profile analysis of five inhibitory compounds in yeast. BMC Chem Biol. 2010, 10: 6-10.1186/1472-6769-10-6.
Shimazu M, Sekito T, Akiyama K, Ohsumi Y, Kakinuma Y: A family of basic amino acid transporters of the vacuolar membrane from Saccharomyces cerevisiae. J Biol Chem. 2005, 280 (6): 4851-4857.
Wiederhold E, Gandhi T, Permentier HP, Breitling R, Poolman B, Slotboom DJ: The yeast vacuolar membrane proteome. Mol Cell Proteomics. 2009, 8 (2): 380-392.
Shimazu M, Itaya T, Pongcharoen P, Sekito T, Kawano-Kawada M, Kakinuma Y: Vba5p, a novel plasma membrane protein involved in amino acid uptake and drug sensitivity in Saccharomyces cerevisiae. Biosci Biotechnol Biochem. 2012, 76 (10): 1993-1995. 10.1271/bbb.120455.
Shakoury-Elizeh M, Tiedeman J, Rashford J, Ferea T, Demeter J, Garcia E, Rolfes R, Brown PO, Botstein D, Philpott CC: Transcriptional remodeling in response to iron deprivation in Saccharomyces cerevisiae. Mol Biol Cell. 2004, 15 (3): 1233-1243.
Philpott CC, Leidgens S, Frey AG: Metabolic remodeling in iron-deficient fungi. Biochim Biophys Acta. 2012, 1823 (9): 1509-1520. 10.1016/j.bbamcr.2012.01.012.
Rieger KJ, El-Alama M, Stein G, Bradshaw C, Slonimski PP, Maundrell K: Chemotyping of yeast mutants using robotics. Yeast. 1999, 15 (10B): 973-986. 10.1002/(SICI)1097-0061(199907)15:10B<973::AID-YEA402>3.0.CO;2-L.
Tenreiro S, Rosa PC, Viegas CA, Sá-Correia I: Expression of the AZR1 gene (ORF YGR224w), encoding a plasma membrane transporter of the major facilitator superfamily, is required for adaptation to acetic acid and resistance to azoles in Saccharomyces cerevisiae. Yeast. 2000, 16 (16): 1469-1481. 10.1002/1097-0061(200012)16:16<1469::AID-YEA640>3.0.CO;2-A.
Ehrenhofer-Murray AE, Wurgler FE, Sengstag C: The Saccharomyces cerevisiae SGE1 gene product: a novel drug-resistance protein within the major facilitator superfamily. Mol Gen Genet. 1994, 244 (3): 287-294.
Jacquot C, Julien R, Guilloton M: The Saccharomyces cerevisiae MFS superfamily SGE1 gene confers resistance to cationic dyes. Yeast. 1997, 13 (10): 891-902. 10.1002/(SICI)1097-0061(199708)13:10<891::AID-YEA138>3.0.CO;2-U.
Ehrenhofer-Murray AE, Seitz MU, Sengstag C: The Sge1 protein of Saccharomyces cerevisiae is a membrane-associated multidrug transporter. Yeast. 1998, 14 (1): 49-65. 10.1002/(SICI)1097-0061(19980115)14:1<49::AID-YEA199>3.0.CO;2-T.
Ogihara F, Kitagaki H, Wang Q, Shimoi H: Common industrial sake yeast strains have three copies of the AQY1-ARR3 region of chromosome XVI in their genomes. Yeast. 2008, 25 (6): 419-432. 10.1002/yea.1596.
Babrzadeh F, Jalili R, Wang C, Shokralla S, Pierce S, Robinson-Mosher A, Nyren P, Shafer RW, Basso LC, de Amorim HV, et al: Whole-genome sequencing of the efficient industrial fuel-ethanol fermentative Saccharomyces cerevisiae strain CAT-1. Mol Genet Genomics. 2012, 287 (6): 485-494. 10.1007/s00438-012-0695-7.
Takacova M, Imrichova D, Cernicka J, Gbelska Y, Subik J: KNQ1, a Kluyveromyces lactis gene encoding a drug efflux permease. Curr Genet. 2004, 45 (1): 1-8. 10.1007/s00294-003-0449-5.
Marchi E, Lodi T, Donnini C: KNQ1, a Kluyveromyces lactis gene encoding a transmembrane protein, may be involved in iron homeostasis. FEMS Yeast Res. 2007, 7 (5): 715-721. 10.1111/j.1567-1364.2007.00235.x.
Gbelska Y, Krijger JJ, Breunig KD: Evolution of gene families: the multidrug resistance transporter genes in five related yeast species. FEMS Yeast Res. 2006, 6 (3): 345-355. 10.1111/j.1567-1364.2006.00058.x.
Dias PJ, Seret ML, Goffeau A, Sá-Correia I, Baret PV: Evolution of the 12-spanner drug:H+ antiporter DHA1 family in hemiascomycetous yeasts. OMICS. 2010, 14 (6): 701-710. 10.1089/omi.2010.0104.
Seret ML, Diffels JF, Goffeau A, Baret PV: Combined phylogeny and neighborhood analysis of the evolution of the ABC transporters conferring multiple drug resistance in hemiascomycete yeasts. BMC Genomics. 2009, 10: 459-10.1186/1471-2164-10-459.
Tatusova TA, Madden TL: BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. FEMS Microbiol Lett. 1999, 174 (2): 247-250. 10.1111/j.1574-6968.1999.tb13575.x.
sqldf: Perform SQL Selects on R Data Frames. R package version 0.4-2. http://CRAN.R-project.org/package=sqldf,
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13 (11): 2498-2504. 10.1101/gr.1239303.
Tusnady GE, Simon I: Principles governing amino acid composition of integral membrane proteins: application to topology prediction. J Mol Biol. 1998, 283 (2): 489-506. 10.1006/jmbi.1998.2107.
Sonnhammer EL, von Heijne G, Krogh A: A hidden Markov model for predicting transmembrane helices in protein sequences. Proc Int Conf Intell Syst Mol Biol. 1998, 6: 175-182.
von Heijne G: Membrane protein structure prediction. Hydrophobicity analysis and the positive-inside rule. J Mol Biol. 1992, 225 (2): 487-494. 10.1016/0022-2836(92)90934-C.
Neron B, Menager H, Maufrais C, Joly N, Maupetit J, Letort S, Carrere S, Tuffery P, Letondal C: Mobyle: a new full web bioinformatics framework. Bioinformatics. 2009, 25 (22): 3005-3011. 10.1093/bioinformatics/btp493.
Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.
Felsenstein J: PHYLIP - Phylogeny Inference Package (Version 3.2). Cladistics. 1989, 5: 164-166.
Huson DH, Richter DC, Rausch C, Dezulian T, Franz M, Rupp R: Dendroscope: an interactive viewer for large phylogenetic trees. BMC Bioinforma. 2007, 8: 460-10.1186/1471-2105-8-460.
Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000, 16 (6): 276-277. 10.1016/S0168-9525(00)02024-2.
Sherman DJ, Martin T, Nikolski M, Cayla C, Souciet JL, Durrens P: Génolevures: protein families and synteny among complete hemiascomycetous yeast proteomes and genomes. Nucleic Acids Res. 2009, 37 (Database issue): D550-D554.
Gaur M, Puri N, Manoharlal R, Rai V, Mukhopadhayay G, Choudhury D, Prasad R: MFS transportome of the human pathogenic yeast Candida albicans. BMC Genomics. 2008, 9: 579-10.1186/1471-2164-9-579.
Millson SH, Truman AW, King V, Prodromou C, Pearl LH, Piper PW: A two-hybrid screen of the yeast proteome for Hsp90 interactors uncovers a novel Hsp90 chaperone requirement in the activity of a stress-activated mitogen-activated protein kinase, Slt2p (Mpk1p). Eukaryot Cell. 2005, 4 (5): 849-860. 10.1128/EC.4.5.849-860.2005.
Diezmann S, Cox CJ, Schonian G, Vilgalys RJ, Mitchell TG: Phylogeny and evolution of medical species of Candida and related taxa: a multigenic analysis. J Clin Microbiol. 2004, 42 (12): 5624-5635. 10.1128/JCM.42.12.5624-5635.2004.
Butler G, Rasmussen MD, Lin MF, Santos MA, Sakthikumar S, Munro CA, Rheinbay E, Grabherr M, Forche A, Reedy JL, et al: Evolution of pathogenicity and sexual reproduction in eight Candida genomes. Nature. 2009, 459 (7247): 657-662. 10.1038/nature08064.
Jeffries TW, Grigoriev IV, Grimwood J, Laplaza JM, Aerts A, Salamov A, Schmutz J, Lindquist E, Dehal P, Shapiro H, et al: Genome sequence of the lignocellulose-bioconverting and xylose-fermenting yeast Pichia stipitis. Nat Biotechnol. 2007, 25 (3): 319-326. 10.1038/nbt1290.
Rolland T, Dujon B: Yeasty clocks: dating genomic changes in yeasts. C R Biol. 2011, 334 (8–9): 620-628.
Ardon O, Bussey H, Philpott C, Ward DM, Davis-Kaplan S, Verroneau S, Jiang B, Kaplan J: Identification of a Candida albicans ferrichrome transporter and its characterization by expression in Saccharomyces cerevisiae. J Biol Chem. 2001, 276 (46): 43049-43055. 10.1074/jbc.M108701200.
Hu CJ, Bai C, Zheng XD, Wang YM, Wang Y: Characterization and functional analysis of the siderophore-iron transporter CaArn1p in Candida albicans. J Biol Chem. 2002, 277 (34): 30598-30605. 10.1074/jbc.M204545200.
Oberegger H, Schoeser M, Zadra I, Abt B, Haas H: SREA is involved in regulation of siderophore biosynthesis, utilization and uptake in Aspergillus nidulans. Mol Microbiol. 2001, 41 (5): 1077-1089.
Haas H, Schoeser M, Lesuisse E, Ernst JF, Parson W, Abt B, Winkelmann G, Oberegger H: Characterization of the Aspergillus nidulans transporters for the siderophores enterobactin and triacetylfusarinine C. Biochem J. 2003, 371 (Pt 2): 505-513.
Conde e Silva N, Goncalves IR, Lemaire M, Lesuisse E, Camadro JM, Blaiseau PL: KlAft, the Kluyveromyces lactis ortholog of Aft1 and Aft2, mediates activation of iron-responsive transcription through the PuCACCC Aft-type sequence. Genetics. 2009, 183 (1): 93-106. 10.1534/genetics.109.104364.
Courel M, Lallet S, Camadro JM, Blaiseau PL: Direct activation of genes involved in intracellular iron use by the yeast iron-responsive transcription factor Aft2 without its paralog Aft1. Mol Cell Biol. 2005, 25 (15): 6760-6771. 10.1128/MCB.25.15.6760-6771.2005.
Lesuisse E, Blaiseau PL, Dancis A, Camadro JM: Siderophore uptake and use by the yeast Saccharomyces cerevisiae. Microbiology. 2001, 147 (Pt 2): 289-298.
Fairhead C, Dujon B: Structure of Kluyveromyces lactis subtelomeres: duplications and gene content. FEMS Yeast Res. 2006, 6 (3): 428-441. 10.1111/j.1567-1364.2006.00033.x.
Conant GC, Wolfe KH: Turning a hobby into a job: how duplicated genes find new functions. Nat Rev Genet. 2008, 9 (12): 938-950. 10.1038/nrg2482.
Kosman DJ: Molecular mechanisms of iron uptake in fungi. Mol Microbiol. 2003, 47 (5): 1185-1197. 10.1046/j.1365-2958.2003.03368.x.
We thank André Goffeau for fruitful discussions. This research was supported by Fundação para a Ciência e a Tecnologia (FCT) (contract: PEst-OE/EQB/LA0023/2011_research line: Systems and Synthetic Biology and a post-doctoral grant (SFRH/BD/23331/2005) to PJD).
The authors declare that they have no competing interests.
PJD carried out phylogenetic tree construction and gene neighbourhood analysis and built the pairwise similarity network. ISC conceived and supervised this study and together with PJD wrote the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Potential 14-spanner MFS-MDR proteins gathered from the 31 hemiascomycetous yeasts analysed during this work. This table shows the gene/ORF name, acronym, protein sequence, protein sequence length and phylogenetic cluster classification. It also indicates whether the translated ORF was considered to comprise a true 14-spanner MFS-MDR protein and if the corresponding amino acid sequence was used in the construction of the phylogenetic tree. Both HMMTOP 2.1 and TMHMM 2.0 were used for topology prediction of the amino acid sequences under analysis (HMMTOP_pred indicates number of predicted TMS, HMMTOP_N_top indicates the N-terminal topology prediction, TMHMM_pred indicates number of predicted TMS, TMHMM_topology details where topology changes along the protein sequence, TMHMM_First60 indicates the expected number of amino acids in transmembrane helices in the first 60 amino acids of the protein). For amino acid sequences showing less than 490 amino acids in length, this table also indicates whether the protein sequence was gathered or not at blastp e-values of E-15, E-17, E-20, E-25 and E-30 (Y = yes, N = no) and the TMS range predicted by visual analysis of the topology probability and protein hydrophobicity plots generated by TMHMM 2.0 and TOPPRED 2, respectively, for each amino acid sequence. (XLS 364 KB)
Radial phylogram representing the 920 amino acid sequences gathered at an e-value level of E-14 as described in Figure
Additional file 2: 1. Besides the 14-spanner MFS-MDR transporters, 508 DHA1 amino acid sequences and 10 non-membrane proteins were recovered at this similarity threshold. (JPEG 494 KB)
Radial phylogram showing the amino acid sequence similarity distances between the 355 full-size 14-spanner MFS transporters.
Additional file 3: Protein and translated ORF names are shown. The name of the S. cerevisiae and C. albicans members is indicated as well as the biochemically characterized Knq1 transporter of K. lactis. The gene and species annotation adopted in this study uses the four letters code described in Table 1. (JPEG 4 MB)
Additional file 6: S. cerevisiae DHA2, ARN and GEX genes and genes present in the genomes of the most virulent Candida species.(PDF 8 KB)
Additional file 7: S. cerevisiae DHA2, ARN and GEX genes and genes present in the genomes of the less virulent Candida species.(PDF 8 KB)
Additional file 8: Chromosome environment of the DHA2, ARN and GEX genes gathered from thirty-one hemiascomycetous yeasts. Gene neighbourhood is shown with a 30-gene window. The following genomic information is displayed in two tables: a) gene name and b) protein family name. Each box framed in red represents a gene. Adjacent boxes represent the gene neighbours. Yellow background represents genes not belonging to the phylogenetic cluster associated to the lineage. Homologous neighbours, based on our protein family classification, are highlighted in the same colour. (PDF 6 MB)
Additional file 9: S. cerevisiae VBA3/VBA5 genes and by the fragments sami_a_c798_9285 and saku_c1383.2.(ZIP 3 KB)
Additional file 10: S. cerevisiae VBA3 gene and ORF sace_e_3474.(DOC 56 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Dias, P.J., Sá-Correia, I. The drug:H+ antiporters of family 2 (DHA2), siderophore transporters (ARN) and glutathione:H+antiporters (GEX) have a common evolutionary origin in hemiascomycete yeasts. BMC Genomics 14, 901 (2013). https://doi.org/10.1186/1471-2164-14-901
- Multidrug resistance (MDR)
- Hemiascomycete yeasts
- Major facilitator superfamily (MFS)
- 14-spanner MFS transporters
- DHA2 transporters
- ARN transporters
- GEX transporters
- Comparative genomics
- Phylogenetic analysis
- Gene neighbourhood analysis