The transcriptome of Candida albicans mitochondria and the evolution of organellar transcription units in yeasts

Kolondra, Adam; Labedzka-Dmoch, Karolina; Wenda, Joanna M.; Drzewicka, Katarzyna; Golik, Pawel

doi:10.1186/s12864-015-2078-z

Research article
Open access
Published: 21 October 2015

The transcriptome of Candida albicans mitochondria and the evolution of organellar transcription units in yeasts

Adam Kolondra¹,
Karolina Labedzka-Dmoch¹,
Joanna M. Wenda¹,
Katarzyna Drzewicka¹ &
…
Pawel Golik ORCID: orcid.org/0000-0001-7814-482X^1,2

BMC Genomics volume 16, Article number: 827 (2015) Cite this article

3856 Accesses
22 Citations
4 Altmetric
Metrics details

Abstract

Background

Yeasts show remarkable variation in the organization of their mitochondrial genomes, yet there is little experimental data on organellar gene expression outside few model species. Candida albicans is interesting as a human pathogen, and as a representative of a clade that is distant from the model yeasts Saccharomyces cerevisiae and Schizosaccharomyces pombe. Unlike them, it encodes seven Complex I subunits in its mtDNA. No experimental data regarding organellar expression were available prior to this study.

Methods

We used high-throughput RNA sequencing and traditional RNA biology techniques to study the mitochondrial transcriptome of C. albicans strains BWP17 and SN148.

Results

The 14 protein-coding genes, two ribosomal RNA genes, and 24 tRNA genes are expressed as eight primary polycistronic transcription units. We also found transcriptional activity in the noncoding regions, and antisense transcripts that could be a part of a regulatory mechanism. The promoter sequence is a variant of the nonanucleotide identified in other yeast mtDNAs, but some of the active promoters show significant departures from the consensus. The primary transcripts are processed by a tRNA punctuation mechanism into the monocistronic and bicistronic mature RNAs. The steady state levels of various mature transcripts exhibit large differences that are a result of posttranscriptional regulation. Transcriptome analysis allowed to precisely annotate the positions of introns in the RNL (2), COB (2) and COX1 (4) genes, as well as to refine the annotation of tRNAs and rRNAs. Comparative study of the mitochondrial genome organization in various Candida species indicates that they undergo shuffling in blocks usually containing 2–3 genes, and that their arrangement in primary transcripts is not conserved. tRNA genes with their associated promoters, as well as GC-rich sequence elements play an important role in these evolutionary events.

Conclusions

The main evolutionary force shaping the mitochondrial genomes of yeasts is the frequent recombination, constantly breaking apart and joining genes into novel primary transcription units. The mitochondrial transcription units are constantly rearranged in evolution shaping the features of gene expression, such as the presence of secondary promoter sites that are inactive, or act as “booster” promoters, simplified transcriptional regulation and reliance on posttranscriptional mechanisms.

Background

Recent advances in nucleic acid sequencing technology brought a wealth of knowledge on genomics of many interesting and important species that were not among the model organisms of classical molecular biology. Among eukaryotic organisms, ascomycetous yeasts represent a veritable evolutionary laboratory, spanning hundreds of millions years [1] of highly divergent evolution; and their relatively small and compact genomes provide significant insights for comparative genomics [2]. The mitochondrial genome of yeasts is of considerable interest, both for its importance for the cell metabolism, and for its evolutionary history. Nucleo-mitochondrial interactions were shown to contribute to speciation [3–6] and are an important factor in intraspecific variation [7].

The mitochondrial genetic system is a legacy of the bacterial origins of these organelles [8], its coding and regulatory capacity in modern cells is, however, significantly reduced. In yeasts, mtDNA encodes several key subunits of the oxidative phosphorylation (OXPHOS) system – apocytochrome b from Complex III, subunits 1–3 of the cytochrome oxidase (Complex IV), and subunits 6, 8 and 9 of the ATP synthase (Complex V). In a number of hemiascomycetous yeasts the mitochondrial genome also contains genes encoding for subunits of Complex I (NADH dehydrogenase) [9], they were, however, lost in the most extensively studied Saccharomycetaceae family which includes the model species Saccharomyces cerevisiae [2]. Complex I genes are also absent from the mitochondrial genome of Schizosaccharomyces pombe, which belongs to a separate and distant ascomycetous lineage (Taphrinomycotina). Additionally, yeast mtDNA typically contains genes for two ribosomal RNAs and a complete set of tRNAs. A single ribosomal protein gene, and a sequence encoding the RNA subunit of mitochondrial RNase P are found in some, but not all yeast mtDNAs [10, 11].

Typical features of the mitochondrial genome expression, shared by all Eukaryotes, include polycistronic primary transcription units, often containing both mRNA and functional RNA sequences, that are posttranscriptionally processed by various mechanisms, including tRNA excision by RNase P and tRNase Z (tRNA punctuation) [12], transcribed from simple promoter sequences by a single RNA polymerase related to bacteriophage enzymes [13]. Additional RNA processing mechanisms, described in yeast include exoribonucleolytic trimming [14–17], and processing at specific sequence signals at the 3’ end of mRNAs [18–22].

In spite of a broadly similar core set of encoded genes, the size and organization of yeast mitochondrial genomes vary significantly, from the compact (19 kb) mtDNA of Schizosaccharomyces pombe, to the much larger (up to 75–85 kb, depending on the strain) mitochondrial genome of Saccharomyces cerevisiae. Frequent rearrangements change the gene order even between closely related species [23–28], and the presence of optional introns [29] further contributes to the evolutionary variability of yeast mitochondrial genomes. The structural variability of yeast mitochondrial genomes also entails considerable differences in the organization of gene expression, with the exception of a few model species there is, however, little experimental evidence concerning functional aspects of organellar gene expression in the majority of these divergent organisms.

Two phylogenetically distant yeast species, Saccharomyces cerevisiae and Schizosaccharomyces pombe are currently used as model organisms for mitochondrial genetics and molecular biology [19, 30, 31] and have provided a wealth of data on various aspects of organellar biology and the nucleo-mitochondrial interactions. Recently, next generation sequencing (NGS) provided the first insights into the complete transcriptome of S. cerevisiae mitochondria [32]. They are also representative of the two major types of the mitochondrial genome expression organization. In S. cerevisiae the 35 genes are expressed as 11 separate polycistronic units [32, 33], separated by long noncoding stretches of DNA in the genome, whereas in S. pombe the compact mtDNA is transcribed as two [19] primary transcripts. Two long primary transcripts are also a feature of animal mitochondrial genomes, and are thus believed to represent the ancestral state, at least in Opisthokonts [19].

The study of evolution of mitochondrial gene expression is hindered by the paucity of experimental data in species other than the two well-studied model yeasts. S. cerevisiae is the only representative of the Hemiascomycetes where systematic studies of mitochondrial gene expression were performed, and for the vast majority of the remaining species the only information available is inferred in silico from genomic sequences [10, 26, 34–38]. Outside the Saccharomycetaceae family, only fragmentary mitochondrial gene expression studies were performed in yeasts such as Yarrowia lipolytica [39] and Magnusiomyces capitatus, where a curious translational bypassing mechanism was recently identified [40]. In spite of extensive studies on the mitochondrial chromosome structure and evolution [27, 28, 35, 38, 41–45], and on mtDNA replication [45–47], there are virtually no experimental data on mitochondrial gene expression in yeasts of the so-called “CTG clade”. This monophyletic clade, distant from the well-known Saccharomyces group, consists of species that translate CUG as serine instead of leucine in their nuclear genetic code [41, 48–50] and contains many pathogenic species, including Candida albicans.

C. albicans is a common commensal of humans and other vertebrates, but in certain situations, particularly in immunocompromised individuals, it can be a source of opportunistic infections and is the most frequent fungal pathogen of humans [51–55]. In addition to the medical interest, C. albicans is emerging as an attractive model organism for molecular biology and comparative genomics [46, 51, 52, 56–63]. The study of mitochondrial biogenesis and nucleo-mitochondrial interactions is one of the areas, where it can provide significant insights beyond those gained from work on S. cerevisiae, which exhibits many peculiar specific adaptations in its metabolism, many of which evolved in Saccharomycetaceae as a result of the whole genome duplication [64, 65]. Unlike S. cerevisiae, C. albicans does not exhibit the Crabtree effect, and thus maintains active mitochondrial respiration in the presence of glucose [56], even though it tolerates anaerobiosis and respiratory-deficient mutants are viable [61, 62, 66–68]. It is a petite-negative species, that does not tolerate loss of mtDNA [61, 69], and its mitochondrial genome contains genes encoding Complex I subunits [9, 70], lost from S. cerevisiae, as well as from S. pombe. Important insights into the mechanisms of mtDNA replication [46, 47, 61] or mitochondrial protein import [62] were obtained using C. albicans as a model organism. Additionally, mitochondrial metabolism has been linked to pathogenicity, as respiratory deficient strains were shown to be unable to form biofilms [68], lack virulence [71] or exhibit decreased pathogenicity [72] and increased sensitivity to oxidative stress induced by photodynamic therapy [73].

In spite of this, expression of mitochondrial genes has not been studied to date in C. albicans, and the putative promoters, genes, and introns in the mtDNA of this yeast were only inferred from in silico comparisons to other species. In order to provide the necessary background for the study of mitochondrial genes in C. albicans, and to gain insights into the evolution of mtDNA expression organization in yeasts, we performed systematic analysis of the C. albicans mitochondrial transcriptome using deep sequencing, followed by low-throughput analysis of mitochondrial transcripts using RT-PCR, 5’-RACE and Northern hybridization. Our results provided evidence for the composition of polycistronic transcription units, active promoter sequences, processing by tRNA punctuation, split genes, and possible regulatory function of GC-rich sequence elements.

Results and discussion

RNA-seq analysis of Candida albicans mitochondrial transcriptome

Sample collection, library preparation and sequencing

Mitochondria isolated from two commonly used laboratory strains of C. albicans (BWP17 [59] and SN148 [57]) were used to prepare RNA sequencing libraries. We also sequenced libraries from total C. albicans RNA in order to account for possible artifacts related to the mitochondrial purification protocol. We did not, however, observe any significant differences in mitochondrial transcripts between the two approaches (see Supplementary Results and Discussion in Additional file 1).

Unlike the nuclear mRNAs, but similarly to bacterial transcripts, the primary products of organellar transcription contain a 5’-pyrophosphate [74, 75]. In order to enrich the RNA sequencing products in reads corresponding to the 5’ ends of primary transcripts, an aliquot of each RNA preparation was therefore treated with Tobacco Alkaline Phosphatase (TAP) prior to library preparation. As expected, more reads mapping to regions immediately downstream of identified transcription start sites were observed in libraries prepared from TAP-treated samples, but otherwise the results were very similar to those obtained without treatment with TAP (Additional file 1: Figure S1).

Reference sequence and read mapping

Reads were mapped to the complete mtDNA sequence of C. albicans strain SC5314 [GenBank:AF285261.1] [70]. The 40.4 kb mitochondrial genome of C. albicans contains two inverted repeat regions of 6.8 kb with identical nucleotide sequence [76, 77]. In order to maintain the directionality of transcript reads, we removed the second repeat region (IRb) from the reference sequence. In total, about 24 million reads were mapped to the reference mtDNA sequence in all our experiments. When RNA from purified mitochondria was used, more than 50 % of all reads could be unambiguously aligned to the reference sequence. Reads that mapped to more than one position in the reference without the second repeat (less than 1 % of all aligned reads) were discarded. Similar RNA sequencing performance was reported in the study of the S. cerevisiae mitochondrial transcriptome [32], where approximately 9 million reads of ~18 million were mapped to the reference. In the human mitochondrial transcriptome [78] only 14 % of RNA-seq reads from mitochondrial preparations aligned to mtDNA. The statistics of the RNA sequencing and mapping of the C. albicans mitochondrial transcriptome are presented in detail in Supplementary Results and Discussion (Additional file 1: Table S1).

The majority (>90 %) of reads that did not map to the mtDNA reference could be unambiguously mapped to the nuclear genome sequence, mostly to the highly expressed rRNA and tRNA genes. The most plausible explanation for this result is copurification of cytosolic ribosomes with the mitochondrial outer membrane, which was also observed in the study of human mitochondrial transcriptome [78]. Import of nuclear-encoded transcripts into mitochondria was shown in multiple systems [79] and cannot be excluded in the case of C. albicans, but the results obtained using mitochondrial preparations are not sufficient to draw such conclusions.

The aligned reads did not show any nucleotide differences from the reference, indicating that the BWP17 and SN148 strains are identical to SC5314 with respect to their mtDNA nucleotide sequence. Independently cultured BWP17 and SN148 strains gave virtually identical RNA-seq results in all our experiments (Additional file 1: Figure S1). For subsequent analysis we therefore pooled reads from both strains in silico, even though they were always grown and used to prepare sequencing libraries separately to control the reproducibility of the results.

The mitochondrial genome of C. albicans is more compact than that of S. cerevisiae, and coding regions cover the majority of its sequence. The only larger noncoding stretches (approximately 5.6 kb each) are located in the inverted repeat regions [45, 70, 76, 77]. Consequently, RNA-seq reads map to the majority of the mtDNA sequence, although they do not cover the entire genome (Fig. 1). All the annotated genes are clearly expressed, additional transcriptional activity is also evident in the noncoding regions of the inverted repeats, but it is limited to a few short stretches.

The orientation of strand-specific sequencing reads matches the direction of transcription of a given transcription unit (for transcripts of genes found in the inverted repeat regions this becomes apparent only after the second copy of the repeat is removed from the reference sequence). In total, >96 % of reads map to the appropriate sense strand of the transcript. Reads that map in the antisense orientation constitute only a minor fraction of total reads (~3 %), and they do not form contiguous mappings, suggesting that mainly one strand is transcribed in each of the transcription units, or that any putative antisense (“mirror”) RNAs are very rapidly and efficiently degraded. Whereas many of these rare antisense reads can be artifacts of sequencing or mapping, some of them, clustering near the ends of transcripts or putative processing sites, might indicate the presence of RNA-based regulatory mechanisms (Additional file 1: Figure S2).

Promoters and transcription units

Identification of putative transcription units

Transcription units were identified based on the existence of contiguous overlapping reads from the sense strand, taking into account reads mapping across introns. The 14 protein-coding genes (not including the putative intronic ORFs), two ribosomal RNA genes, and 24 tRNA genes are expressed as eight primary polycistronic transcription units on both strands of the genome (Figs. 1 and 2). The short coding region (SCR) contains one transcription unit (TU1) consisting of RNL, COX2, flanked by two tRNAs, and the NAD6/NAD1 bicistron. This transcript ends with a GC-rich hairpin sequence at the boundary of the inverted repeat (IRa). The inverted repeats contain two short transcription units, transcribed from opposite strands: TU2 consisting of COX3 and tRNA-Lys, and TU3 – the only primary transcript containing only tRNAs (five). In addition, two transcripts of unknown function, sharing no similarity with any known mitochondrial gene and devoid of potential open reading frames are observed in the noncoding region of the inverted repeat. Their biological function is currently unknown. The long coding region (LCR) begins with TU4, transcribed in the reverse orientation and containing all three mitochondrial ATP genes, as well as four tRNAs. ATP8 and ATP6 are translated from a single bicistronic transcript. ATP9 is separated from the ATP8,6 bicistron by the tRNA-Ile sequence. The remaining transcription units (TU5-TU8) are expressed in the same orientation, but our results indicate that they have independent promoters, and do not form a single long transcript. The continuity of the identified polycistronic transcription units was confirmed using RT-PCR (Supplementary Results and Discussion, Additional file 1).

Analysis of reads mapping in the 5’ regions of these transcription units allowed us to identify the putative promoter sequences, shown in Fig. 2. The promoter consensus sequence spans nucleotides −8 to +1 (Fig. 2d), and bears general similarity to known nonanucleotide mitochondrial promoters of S. cerevisiae [80–82] and other yeasts [83–87]. All but two of the promoters show very tight adherence to the WWATAGGTA consensus (Fig. 2c); the TU1 promoter upstream of the RNL gene is the only one, where the +1 nucleotide is G instead of A, whereas the putative transcription start site of TU7, upstream of NAD4L significantly deviates from the consensus at several positions, yet has strong experimental support in RNA-seq and 5’-RACE results. Putative promoters of TU7 and TU4 are also the only two where the −5 nucleotide is not T. Interestingly, the unusual TU7 promoter sequence shows some similarity to the bacteriophage T7 promoter [88], with the +1 A of the C. albicans transcript corresponding to the −1 position of the T7 consensus (Fig. 2c). Additionally, we observed some conservation of the first 5 nucleotides of the primary transcript (Fig. 2e), with A at positions +1 and +2 in the majority of them and G or C at +3 and +4.

In addition to the promoter sites upstream of the eight identified transcription units, multiple additional sequences conforming to the promoter consensus were found. They are shown in Fig. 2b, and listed in Additional file 2: Table S2. Three such putative promoters, located in the noncoding region of the inverted repeats may be responsible for the transcriptional activity observed in these unannotated regions. Some of the others are located within the transcription units in the sense orientation, upstream of the tRNA gene clusters. As the steady state levels of tRNAs are much higher than those of mRNAs, it is possible that they are auxiliary “booster” promoters ensuring adequate expression of tRNAs located downstream of an mRNA sequence in a polycistronic transcription unit. The sharp increase in the number of reads mapping to tRNAs could, however, also be a result of rapid transcript processing and different stability of tRNA and mRNA sequences. Very high levels of mature tRNAs make detection of unprocessed primary transcripts difficult, and definitive conclusions about any biological relevance of these additional promoter sequences cannot be drawn.

The promoter sequences localized in the transcription unit sequences in an antisense orientation could be involved in regulation, or could simply be inactive. Mapping of RNA-seq reads to the reverse strand of the genomic reference revealed that indeed, some antisense reads do appear downstream of these intragenic reverse-orientation promoters, although the coverage is very low compared to the sense strand expression. For example, a reverse promoter in the COX1 gene generates antisense reads mapping to the first 336 nt of the transcript (Additional file 1: Figure S2). Regulation by short antisense transcripts was observed in prokaryotic systems [89–91], and it is tempting to speculate that the reverse orientation promoters and antisense reads observed in the C. albicans mitochondrial transcriptome hint at the presence of similar mechanisms.

Presence of sequences conforming to the promoter consensus, but not active in transcription was observed in other yeast species, like S. cerevisiae [32] and S. pombe [19]. High density of potential promoters is a feature of the majority of yeast mtDNAs, although experimental data on their activity exist only in S. cerevisiae, S. pombe, and as of this study, in C. albicans. Assuming that the low density of promoters and the presence of two or three primary transcripts, observed, among others, in S. pombe [19], Candida parapsilosis [35], filamentous ascomycetes [92–95] and Metazoa [96] is indeed the more ancestral mode of transcription, creation of new transcription start sites had to occur many times in the history of yeast mitochondrial genomes.

Identification of the primary transcription start sites by 5’-RACE

We applied the 5’-RACE protocol used to identify transcription start sites in prokaryotic systems [74, 75], including organelles [75], based on the observation that the 5’ ends of primary transcripts, but not RNA processing products, contain a triphosphate group which inhibits adapter ligation, and results in quantitative differences in the obtained PCR product between samples treated with Tobacco Alkaline Phosphatase (TAP) and untreated.

We applied this strategy to each of the eight primary transcription units (TU1-TU8). Additionally, we attempted to confirm the activity of two putative secondary “booster” promoters localized upstream of tRNA-Arg_1 (tR(UCU)) and RNS sequences, inside TU5 and TU6, respectively. Untreated and TAP-treated RNA from purified mitochondria was ligated to an oligonucleotide adapter and reverse-transcribed as described in Methods. 5’-RACE products were visualized on agarose gels (Fig. 3) and also cloned into a plasmid vector after excision from the gel. The inserts (usually 5–20 individual clones) were sequenced to determine the exact 5’ nucleotide, to which the adapter had been ligated.

Analysis of the 5’-RACE products of transcripts containing the RNL rRNA sequence indicates that they start at the position identified by RNA sequencing, downstream of the noncanonical promoter, with G as the +1 nucleotide. The amount of amplified product increased significantly in TAP-treated RNA preparation, suggesting that this corresponds to a bona fide transcription start site. Similarly, the primary transcription start sites of TU5 (upstream of COX1), TU6 (upstream of COB) and TU7 (upstream of NAD4L) were clearly confirmed by 5’-RACE (Fig. 3a). TU7 is particularly interesting, as its transcription starts at a site that significantly deviates from the nonanucleotide consensus promoter sequence (Fig. 2c). The results of 5’-RACE for this region were, however, unambiguous, with a clear increase in the amplified product after TAP treatment. Additionally, sequencing of the cloned 5’-RACE products resulted in 8/10 clones starting at the predicted +1 position (the remaining two had one nucleotide at the 5’ end truncated) for the TAP treated preparation; whereas only 1 in 5 clones from the untreated preparation started in the predicted region. These results clearly indicate that the unusual TU7 promoter is indeed an active transcription start site.

The results for the transcription units that start with tRNA sequences are more difficult to interpret, as these transcripts are immediately efficiently processed by tRNA excision, and the mature tRNAs are generally stable and present at high steady-state levels. Nevertheless, 5’-RACE amplification of the 5’ terminus of the TU4 transcription unit clearly shows a band corresponding to the predicted molecular weight of the product starting at the position of the indicated consensus promoter, that is apparent only in TAP treated preparations. Three additional bands correspond in size to the predicted products of RNA processing by tRNA punctuation, and they are present in TAP-treated and untreated reactions in comparable amounts (Fig. 3).

In the case of TU2, the major visible amplified product, obtained using starters annealing to the COX3 coding sequence, corresponds to the processed transcript with the tRNA (tK(UUU)) already excised, as evidenced by the sequence of cloned inserts. The location of the start site of this transcription unit is, however, quite clear, as there is only one putative promoter sequence in the correct orientation in the appropriate upstream region, and the mapping of RNA sequencing reads matches the identified consensus nonanucleotide promoter location. Similarly, in the case of TU8 the sequenced 5’-RACE products correspond to the transcript already processed at the beginning of the second tRNA (tL(UAG)). Both RNA sequencing reads and 5’-RACE analysis indicate, however, that TU8 is transcribed separately from the preceding transcription unit (TU7), and the identified promoter, conforming to the nonanucleotide consensus, is the only plausible candidate for its transcription start site.

The case of TU3, the only transcription unit consisting solely of tRNAs, is more complicated, as there are two potential consensus promoters, one 6 bp, and the other 99 bp upstream of the first tRNA sequence (tL(UAA)). Sequencing of the cloned 5’-RACE products amplified from TAP-treated RNA resulted in two (out of 11) clones starting at the +1 nucleotide of the closer promoter. The remaining clones corresponded to the products of tRNA excision, and no clone with a sequence starting at the upstream promoter was identified. Analysis of RNA sequencing revealed, however, that even though the vast majority of reads start immediately upstream of tL(UAA), there are detectable reads spanning the upstream region and appearing to originate from the first, upstream promoter. It is therefore likely that both promoters upstream of TU3 are active, but it’s the second one that contributes to the bulk of transcription. Interestingly, the steady state level of the second tRNA in this transcription unit (tY(GUA)) is much higher than that of the first one (tL(UAA)). The interplay of the two promoters, together with RNA processing and degradation, could contribute to the observed differential expression of tRNAs encoded in the same transcription unit. Similar multiple promoters have been identified in plant mitochondria [75].

Transcription initiation from the putative promoters located inside the primary transcription units could not be confirmed using RT-PCT (Supplementary Results and Discussion, Additional file 1: Figure S3).

The mature transcripts, introns, and RNA processing

Differences in the expression of mitochondrial genes transcribed as polycistronic primary transcripts

Counting reads mapping to the annotations in the reference genome allowed to compare the expression levels of different transcripts, calculated as RPKM values. The steady state levels of mitochondrial RNAs vary greatly, sometimes by several orders of magnitude (Fig. 4, Additional file 3: Table S3). Generally, the most abundant transcripts are some of the tRNAs, followed by rRNAs, although five of the tRNAs have lower RPKM values than rRNAs (Fig. 4b). Differences between the RPKM values of different tRNAs can be as large as about two thousand-fold (between tY(GUA) and tL(UAG), Fig. 4b). Differences in expression of protein-coding transcripts are not as large as between noncoding RNAs (Fig. 4a) with ATP9, COX1 and COX2 displaying highest RPKM values, and NAD3 and NAD4L the lowest (the difference between ATP9 and NAD3 is about 276-fold). The two rRNAs of the small and large subunits of the mitoribosome, encoded by the RNS and RNL genes, respectively, are expressed at the same level (16 309 and 15 162 RPKM, respectively, about 7 % difference), consistent with the 1:1 stoichiometry of the subunits.

Significant differences between steady state levels of RNAs expressed from a single polycistronic transcription unit were also observed in the S. cerevisiae [32] and human [78] mitochondrial transcriptomes, and are most likely due to differences in stability and degradation [32, 78, 97, 98]. Transcriptional attenuation was suggested as an additional mechanism in cases, when the relative transcript abundance decreases in the 5’ to 3’ direction [98], but this does not seem to be the general rule in C. albicans mitochondria (Figs. 2b and 4). Additional putative promoter sequences, found inside the primary transcription units described above (Fig. 2b), such as those upstream of tRNA-Ile (tI(GAU)) in TU4, upstream of the cluster of 5 tRNAs downstream of COX1 in TU5, upstream of the RNS gene in TU6 and upstream of the three tRNAs downstream of NAD5 in TU7, could also play a role in generating differences in expression of genes located on the same main transcription unit by boosting the levels of tRNAs and rRNAs.

Reproducibility of the expression data was confirmed by comparing results obtained from two C. albicans strains grown and analyzed separately, as well as mitochondrial and total RNA preparations (Supplementary Results and Discussion, Additional file 1: Figure S1).

Mature monocistronic and bicistronic transcripts are generated by tRNA punctuation

The mitochondrial genome of C. albicans encodes 14 proteins (not including the putative proteins encoded in introns) that function in the complexes of the OXPHOS system: 7 subunits of Complex I (NAD1, NAD2, NAD3, NAD4, NAD4L, NAD5, NAD6), 1 Complex III subunit (COB), 3 Complex IV subunits (COX1, COX2, COX3), and 3 subunits of the ATP synthase (ATP6, ATP8, ATP9). Additionally, we confirmed the presence and expression of both ribosomal RNAs of the small and large subunit, encoded by the RNL and RNS genes, respectively, and the full set of 24 tRNAs (Table 1). One protein coding gene (COX3) and six tRNA genes are located in the inverted repeat regions, and thus are annotated twice in the reference genome sequence. Analysis of the tRNA sequencing reads reveals the presence of the non-templated posttranscriptionally added 3’ terminal CCA trinucleotide [99] in all 24 of them, confirming that they are active and processed into mature tRNAs, and allowing precise mapping of the 3’ ends of respective genes. The tRNAs encoded by the C. albicans mitochondrial genome are consistent with the genetic code that differs from the standard in using UGA for Trp (instead of stop) [100]. This codon assignment is common in mitochondria of many fungi, protozoa and other organisms, and is described as the “mold, protozoan, and coelenterate mitochondrial code and the Mycoplasma/Spiroplasma code”, also known as “translation table 4”, at NCBI (http://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi), but is different from the code used in S. cerevisiae and related species [101].

Table 1 C. albicans mitochondrial tRNAs. Systematic names are according to the guidelines published in the Candida Genome Database (http://www.candidagenome.org, [105]). Transcription units are numbered according to Figs. 1 and 2

Full size table

Interestingly, the gene encoding the RNA component of RNase P cannot be found in C. albicans mtDNA [11], suggesting that the enzyme is probably composed purely of protein subunits, like in many animal and plant organellar systems [102–104].

Transcription unit TU3 is the only one comprised exclusively of five tRNA genes, the remaining 19 tRNAs are contained within polycistronic units, comprising also rRNA and mRNA genes (Fig. 2b). Cleavage of precursors containing tRNA sequences by endonucleases RNase P and tRNase Z provides a likely mechanism liberating flanking transcripts, known as the tRNA punctuation model, first described in human mitochondria [12, 78]. In C. albicans mitochondria, tRNA punctuation is sufficient to account for all the cleavages necessary to produce the final mRNA and rRNA molecules. Excision of tRNAs liberates monocistronic transcripts of RNL, COX2, COX3, ATP9, COX1, COB, RNS, and NAD4, whereas NAD6 with NAD1, ATP8 with ATP6, NAD2 with NAD3, and NAD4L with NAD5, would remain as co-translated bicistronic mRNAs (Figs. 1 and 2). The predicted open reading frames of NAD genes encoded on bicistronic transcripts are adjacent, and overlap by one nucleotide (the last A of the stop codon becomes A of the presumed ATG initiation codon), whereas ATP8 and ATP6 are separated by 125 nt of noncoding sequence. We used Northern blot hybridization to confirm the predicted sizes of these mature transcripts (Supplementary Results and Discussion, Additional file 1: Figure S4).

In S. cerevisiae an additional mechanism is responsible for processing at the 3’ termini of mRNAs by cleavage at a conserved AU-rich dodecamer sequence [18, 21, 32], whereas in S. pombe a C-rich stretch (the “C-core”) acts as the signal for the cleavage and maturation of the 3’ ends of mRNAs and the small subunit rRNA [19]. Additional putative cleavage signal sequences, consisting of GC rich 17-mers, were recently identified in S. cerevisiae [32]. In C. albicans we found no evidence for such mechanisms operating in addition to tRNA punctuation, which is sufficient to explain the generation of all observed mature transcripts and is therefore likely the sole mechanism responsible for fragmenting of primary transcripts into the final RNAs, like in the vertebrate system.

Introns in the mitochondrial genes of C. albicans

Introns are a common feature of organellar genes, and as mobile genetic elements they make an important contribution to the evolutionary dynamics of yeast mitochondrial genomes [29]. In silico analysis (http://www.candidagenome.org, [105]), based on the conservation of the protein-coding sequences, and similarity to orthologous sequences from Candida parapsilosis [35], indicated that in C. albicans there are four introns in the COX1 gene, and two introns in the COB gene. Long-range mapping of RNA sequencing reads obtained in our experiments fully confirmed those predicted splicing sites, and provided first experimental evidence for the exon-intron structure of these two genes (Fig. 5). Additionally, we identified two introns in the large subunit rRNA (RNL) gene (Fig. 5c). These introns were not annotated in the reference genome, nor in the Candida Genome Database, but their presence was indicated in a complete genomic sequence of another C. albicans strain (CBS 562, [GenBank: KC993188.1], unpublished). This brings the total intron number in C. albicans mitochondrial genes to eight. The four introns of COX1 are predicted to be group I introns by the in silico annotation server MFannot (http://megasun.bch.umontreal.ca/cgi-bin/mfannot/mfannotInterface.pl, [29]), whereas the two COB introns cannot be reliably assigned to a group by automatic prediction tools. MFannot predicts that the two introns in the RNL gene are also group I (IA) introns.

The four introns of COX1 and the first of the two introns of COB contain ORFs in frame with the preceding exon. Such arrangement is often found in mitochondrial introns of Fungi, and it entails synthesis of a fusion protein encoded by the intron and the preceding exon(s) [32, 106, 107]. The intron-encoded proteins are involved in intron mobility (homing) and/or in splicing [108–114]. Analysis using the NCBI conserved domain database (CDD, [115]) revealed that all four putative proteins encoded by the introns of the COX1 gene contain the LAGLIDADG endonuclease domain [116], that is found in proteins exhibiting RNA maturase (splicing), DNA endonuclease (homing), or both maturase and endonuclease activities. Sequence analysis alone cannot distinguish between these activities, as sometimes only a few amino acid substitutions determine whether a protein of this family acts as a maturase, endonuclease or a bifunctional maturase/endonuclease [117–119]. The protein encoded by the first intron of the COB gene contains signature motif of another class of homing endonucleases – the GYI-YIG superfamily [120].

GC-rich elements in the C. albicans mitochondrial genome and transcriptome

The mitochondrial genomes of yeasts are generally AT-rich, and C. albicans (32 % GC) is no exception. The GC-rich fragments are often found dispersed in the genome as relatively short palindromic clusters that can potentially fold into hairpin structures [10, 26, 33–36, 121–124], and may also constitute mobile genetic elements and/or regulatory signals.

In C. albicans we identified about 40 such elements, dispersed throughout the entire mitochondrial genome, but generally localized outside coding sequences (Fig. 6, listed in Additional file 4: Table S4). The length and sequence of different GC rich sequence elements are variable, and their putative function (if any) can also be different. Tentatively, they can be grouped into three classes (Fig. 6a). Elements from the first group can be folded into a distinctive structure consisting of two adjacent stem-loops (Fig. 6b), resembling the double-hairpin elements (DHEs) identified in other fungal mitochondrial genomes, and considered to be mobile genetic elements [34, 122–125]. Another group resembles the DHEs, but the two stem loops are separated by a stretch of 40–60 nucleotides of AT-rich sequence that does not form a stable structure (an example is a structure in the unannotated putative transcript in the inverted repeat region, described in the next section). It is not clear whether they originated from the canonical DHEs by an insertion, or if they are unrelated. Finally, some GC-rich elements form a single stem-loop structure (Fig. 6c).

Some of the GC-rich hairpin forming sequences are located in the regions between the primary transcription units, whereas others are found within these units, but always between genes. Their presence could be related to the evolutionary history of the mitochondrial genome, where genes and groups of genes were often shuffled by recombination. Particularly, DHEs and DHE-like elements with insertions between the hairpins, could be related to the recombinational events shaping the evolution of mitochondrial genomes [34, 122–125].

A role in gene expression could be postulated for putative hairpin forming sequences located in or near the transcribed regions. Many such elements can be found in the intergenic regions separating the distinct primary transcription units, and thus they may play a role in transcription termination. For example, in the short region (190 bp) separating the last gene (tS(GCU)) of TU7 and the promoter of TU8, the forward mapping sequencing reads clearly stop at a GC-rich hairpin forming sequence (Fig. 6d), which thus appears to function as a terminator. Another GC-rich sequence, forming a double hairpin structure, is located in the region of TU8 promoter. GC-rich palindromes can also be found at the ends of other transcription units, as well as in the regions separating the opposite strand promoters of TU2 and TU3, as well as TU4 and TU5 (Fig. 6a). Such sequences can thus function as terminators and/or elements isolating the separate transcription units. Several GC-rich elements can also be found in the noncoding region of the inverted repeats, where we observed significant transcriptional activity (described in the next section).

Another potentially interesting example is the element found inside TU4, between the coding sequences of ATP8 and ATP6 (Fig. 6c). These two open reading frames are translated from a single bicistronic mature transcript, but unlike in the other bicistronic mRNAs, where the two genes are immediately adjacent to each other, there is a noncoding region of 125 nt separating the two ORFs. The GC-rich hairpin is located upstream of ATP6, and could be a target for a translational activator or a processing/stability factor.

Transcripts of unknown function originating from inverted repeat regions

In addition to transcripts mapping to the annotated genes of the C. albicans mitochondrial genome, we found significant transcriptional activity in the unannotated noncoding region of the inverted repeats. At least three putative promoter sequences could be responsible for driving transcription in this region (Figs. 7 and 2b). The first transcribed region, about 1300 bp long, is flanked by two promoter nonanucleotides (Fig. 7), one in the forward orientation, and the other in reverse. Both seem to be active, as the forward strand reads prevail immediately downstream of the first, forward promoter, while reverse orientation reads originate from the second, reverse strand promoter. Two GC-rich palindromic stem-loops, separated by 60 nt, can be found between the forward and reverse regions. The biological significance of this curious arrangement is currently unknown. Expression of this region is not particularly strong, but the number of reads that map there is significantly above the background. Transcription of the second region appears to be driven by a single reverse strand promoter, conforming to the nonanucleotide consensus (Fig. 7), and results in a short (~300 nt) putative RNA product. The core region of this transcript appears to be strongly expressed and stable, based on the very high number of RNA sequencing reads that map there.

Neither of these putative transcribed sequences contain any open reading frames, and they do not show significant homology to any of the known functional RNAs. It is therefore not clear, whether they represent bona fide functional or regulatory transcripts. The inverted repeats are associated with frequent recombination and replication [46, 61]. Even though replication initiation in C. albicans mitochondria is driven by homologous recombination, rather than by classical RNA priming, RNA:DNA duplexes are apparent in the mtDNA preparations, and can somehow be involved in the replication process [46].

Evolutionary dynamics of transcriptional organization in yeast mitochondria

Mitochondrial genomes of yeasts, including the relatives of C. albicans grouped in the CTG clade, exhibit a great diversity in their genetic organization, in spite of a mostly conserved gene content [27, 28]. In order to investigate the effect of those rearrangements on the organization of gene expression units, we chose a set of mtDNA sequences of 11 CTG clade Candida species representative of different mitochondrial gene orders, 4 Candida species from early branching hemiascomycete lineages, with Yarrowia lipolytica as the early branch outgroup, inferred their phylogenetic relationships, and analyzed the alterations in gene order in the context of the primary transcription units inferred from our transcriptome study. The list of mtDNA sequences used in this comparison, including accession numbers and references is provided as supplementary data (Additional file 6: Table S6). The phylogenetic tree (Supplementary Results and Discussion, Additional file 1: Figure S5) is consistent with earlier hemiascomycete phylogenies [28, 126, 127] and shows a monophyletic CTG clade, with C. vartiovaarae, C. norvegica, C. santjacobensis, C. salmanticensis, and Y. lipolytica outside this clade as early branching lineages.

In general, this comparison indicates that the grouping of mitochondrial genes into polycistronic transcripts is not stable in evolution, and undergoes significant rearrangements even in relatively close species. There are, however, shorter blocks of synteny that seem to be conserved as they are shuffled into different combinations. One notable example is the pairing of ATP8 and ATP6, translated from a single mRNA, which seems to be a common feature of all the mitochondrial genomes in hemiascomycetous yeasts. The synteny of these two genes extends beyond Fungi to vertebrate mtDNAs, but is not universal in Opisthokonts, as it is broken in the mitochondrial genomes of Schizosaccharomycetales [123] and Zygomycetes [125]. In mtDNAs of the analyzed yeast species we also observed that the genes encoding Complex I subunits: NAD6 with NAD1, NAD2 with NAD3, and NAD4L with NAD5, remain closely linked and probably co-translated from a single mature transcript, as the predicted ORFs either overlap (like in C. albicans) or are separated by only a few nucleotides. Unlike the ATP8-ATP6 synteny, however, the NAD gene pairs are not conserved in mtDNAs of, for example, filamentous ascomycetes or Metazoa.

All the genes encoded in mtDNAs of Y. lypolitica and C. norvegica are transcribed from the same strand, and in C. vartiovaareae only the rps3 gene (encoding a mitoribosome subunit, absent in the CTG clade yeasts mtDNAs) is reversed. With the exception of the ATP8-ATP6 and NAD genes, the gene order in mtDNAs of the early branching yeasts shows no patterns of similarity to the members of the CTG clade. On the other hand, the mitochondrial genomes of CTG clade Candida species contain blocks of genes that are syntenic and correspond to blocks identified in previous studies [28, 35]. The way these blocks are assembled into transcription units seems, however, to highly variable (Fig. 8).

One such block corresponds to the first transcription unit of C. albicans mtDNA and consists of RNL, trnA, COX2, trnN, NAD6, and NAD1. It was identified as the ancestral gene order in the analysis of Valach et al. [28]. In some genomes it underwent further rearrangements, for example in C. sojae the trnA, COX2 gene pair moved to a different location in the genome, whereas in C. orthopsilosis, C. oxycetoniae and C. maltosa the trnN, NAD6, NAD1 block got transposed. Interestingly, in C. alai this block is interrupted by the insertion of another conserved block consisting of trnS, trnL, and NAD4 which corresponds to C. albicans TU8. Only in C. maltosa the synteny of trnN, NAD6, NAD1, found even in the species not belonging to the CTG clade, and thus probably ancestral to all the analyzed mtDNAs, is broken.

Other examples of synteny include the blocks consisting of trnG, trnC, trnP, ATP6, and ATP8 (trnP, ATP6, and ATP8 were duplicated in C. alai); trnK and COX3 (constituting the transcription unit TU2 of C. albicans); trnI and ATP9; trnM and RNS (not conserved in C. alai), and blocks consisting of tRNA genes, such as trnL,Y,H,T,E (TU3 of C. albicans), trnQ,D,S, and others (Fig. 8). The COB and COX1 genes appear to undergo evolutionary shuffling on their own, independently of any other gene.

These evolutionary units of mitochondrial genome organization show only partial correlation with the primary transcription units. Whereas C. albicans TU1, TU2, TU3 and TU8 correspond to putative ancestral synteny blocks, TU4, TU5, TU6 and TU7 are composed of two or three of such blocks each (Fig. 8). The presence of secondary consensus promoters of uncertain activity inside the transcription units (Fig. 2b), as well as of GC-rich sequence elements (Fig. 6a) does coincide, however, with the boundaries of these blocks and is probably related to their evolution. The case of the DHE-like element upstream of tRNA-Met_1 sequence is particularly interesting. Downstream of the preceding COB gene, a DHE-like structure is followed by a nonanucleotide sequence (GTATAGGTA) closely related to the promoter consensus, and then the tRNA. RNA-seq reads indicate that these still belong to the same primary transcript, they could, however, show an evolutionary intermediate step that could lead to the formation of an independent transcription unit. Alternatively, it could be a vestige of a recent rearrangement that brought tRNA-Met_1 and RNS into this transcription unit. A similar arrangement can be observed in the region of TU8 promoter (Fig. 6d), where a DHE-like sequence includes the active promoter of TU8. This transcription unit is separated from the preceding transcript (TU7) by only a short intergenic stretch of about 140 bp that contains two GC-rich hairpin-forming elements. This suggests that a close link exists between evolutionary rearrangements mediated by the DHE-like elements and the reorganization of transcription in yeast mitochondrial genomes. Involvement of tRNA genes and their promoters is also a common theme in the evolutionary history of yeast mitochondrial genome organization.

Conclusions

The mitochondrial genome organization shows great evolutionary variability, even in closely related organisms. As the promoter consensus sequences are short, and can be found at multiple sites in the genome often not corresponding to active transcription start sites, in silico analysis alone is not sufficient to draw conclusions about the number of active promoters. The study of the mitochondrial genome expression and its evolution must therefore involve transcriptome analysis in addition to mtDNA sequencing.

In the first experimental study of the expression of Candida albicans mitochondrial genome, we demonstrated that 14 protein coding genes, 24 tRNAs and 2 rRNAs are expressed as eight polycistronic primary transcription units that are processed into mature transcripts by excision of tRNAs. Introns are found in three of the genes (two in LSU rRNA, two in COB and four in COX1). Steady state levels of mature RNAs vary by orders of magnitude, even for sequences transcribed from the same promoter, suggesting RNA degradation as the main regulatory mechanism. The promoter consensus is a novel variant of the nonanucleotide sequence typical of yeast mitochondrial promoters. In addition to the main promoters driving the eight primary transcription units, there are multiple secondary promoter consensus sequences, that are either inactive, or could act as “boosters”. Their presence could be a vestige of former primary transcription start sites, that became redundant after new transcription units were formed. It may also facilitate the creation of new, separate transcription units after the extant polycistrons are broken up by genomic rearrangements. The fact, that the promoter sequences are short, and even significant departures from the consensus are tolerated, as evidenced by the active promoter of TU7 (Fig. 2), makes creation of new transcription start sites a likely evolutionary event.

It is clear that the polycistronic transcription units of yeast and animal mitochondria are not direct descendants of bacterial operons that can still be found in the mtDNA of jakobid protists [128, 129]. It has been suggested that the ancestral mitochondrial genome organization, at least in Opisthokonts, involved few promoters and very long primary transcripts [19]. This is the model of expression still found in Metazoa [96], S. pombe [19], and filamentous ascomycetes [92–95]. It could be also postulated for early branching Hemiascomycetes, like Y. lipolytica, as all the coding sequences are on the same genomic strand, there is, however, little experimental data about mitochondrial transcription in this organism [39].

Two primary transcripts were also suggested for Candida parapsilosis [35] and related species, this hypothesis is, however, based mostly on in silico analysis of the mtDNA sequence, and not on direct experimental study of transcription. Given the extraordinary evolutionary dynamics of the mitochondrial genome organization in yeasts, it is entirely possible that the putative two primary transcripts of C. parapsilosis are not directly descended from the ancestral state, but were formed as a result of rearrangements of multiple transcription units.

The main evolutionary force shaping the mitochondrial genomes of hemiascomycetous yeasts is the frequent recombination, constantly breaking apart and joining genes into primary transcription units. Even within the cells of the same species, as it was found in C. albicans, a mitochondrial genome can exist in forms differing in the gene order [46]. Long repeat regions [28] and GC-rich hairpin forming sequences [34, 122–125] are often involved in these recombinational events. Our study suggests that tRNA genes and their promoters, often with associated GC-rich elements, are also an important source of rearrangements. The conserved modules that undergo shuffling often start with a tRNA gene, as it is in the case of trnA and COX2; trnK and COX3; trnI and ATP9; trnM and RNS; trnG, trnC, trnP, ATP6, and ATP8; trnS, trnL, and NAD4; trnN, NAD6, and NAD1, and the groups consisting only of tRNA genes.

Short and promiscuous transcription initiation sites, apparent in yeast mtDNAs ensure that gene expression can adapt to the rapidly changing gene order. This means, however, that transcription initiation can have only a limited role in gene-specific regulatory mechanisms, and that posttranscriptional mechanisms should play a key role in the regulation of mitochondrial gene expression. The steady state levels of mature transcripts vary greatly among RNAs generated from the same primary transcript in C. albicans (Fig. 4), as well as in S. cerevisiae [32] and human [78] mitochondrial transcriptomes, and are most likely regulated by stability and degradation [32, 78, 97, 98], and possibly transcriptional attenuation [98]. Translational regulation (reviewed in [130]) is also an important mechanism controlling protein synthesis in yeast mitochondria. Entire families of proteins, such as the pentatricopeptide repeat (PPR) family, evolved in Eukaryotes as factors required to maintain and regulate the genetic system of the mitochondrial endosymbiont [131–135]. In order to gain a complete understanding of the evolution of the mitochondrial genetic systems, it will be necessary to follow the comparative genomics not only with the transcriptomic analysis, but also with an in-depth study of nucleo-mitochondrial interactions in a variety of taxonomically diverse organisms. C. albicans appears to be a good candidate for a new model system to study the functioning of the mitochondrial genome, and the overview of its transcriptome provides the necessary foundation for further investigations.

Methods

Strains and media

Candida albicans BWP17(ura3::imm434/ura3::imm434; his1::hisG/his1::hisG; arg4::hisG/arg4::hisG) [59] and SN148 (arg4/arg4; leu2/leu2; his1/his; ura3::imm434/ura3::imm434; iro1:: imm434/iro1:: imm434) [57] strains were grown in YPGal medium (1 % yeast extract, 2 % peptone and 2 % galactose) containing 80 μg/ml uridine at 37 °C until logarithmic growth phase.

Mitochondria isolation and RNA preparation

Mitochondria were isolated from log-phase liquid culture by differential centrifugation as described previously [31]. Purified intact mitochondria were treated with 10 μg of RNase A (Thermo Scientific) at 37 °C for 5 min to remove co-purified cytoplasmic RNA. Isolation of mitochondrial RNA was performed from purified mitochondria using the hot phenol procedure [136] or by fenozol extraction (A&A Biotechnology).

Alternatively, total cellular RNA extracted using the hot phenol procedure [136] was treated with Ribo-Zero Gold rRNA Removal Kit (Yeast) (Epicentre) to deplete cytoplasmic rRNAs according to the manufacturer’s protocol, and used to construct total RNA libraries.

RNA samples were treated with DNAse (Roche) in the presence of RNase inhibitor (Ribolock, Thermo) in the manufacturer’s recommended buffer (Roche). DNA-free RNA was then extracted with phenol/chloroform/octanol (25:24:1), precipitated from the aqueous phase by ethanol/sodium acetate (pH 5.2) and dissolved in water. The quality of each preparation was assessed by conventional agarose gel electrophoresis and using BioAnalyzer.

Library preparation and RNA-seq

RNA-seq libraries were prepared using the Ion Total RNA-Seq Kit v2 (Life Technologies) starting with 400–500 ng of either mitochondrial or total RNA, according to the manufacturer’s protocol. For mitochondrial RNA libraries the RNA fragmentation step was shortened to 4 min, preserving intact tRNAs. After the RNA fragmentation step each sample was divided into two equal aliquots, and one was treated with 0.5 U (mitochondrial RNA) or 0.75 U (total RNA) of tobacco acid pyrophospatase (TAP) (Epicentre Technologies) at 37 °C for 30 min. RNA quality and library construction was monitored using BioAnalyzer 2100 (Agilent Technologies) according to the manufacturer’s protocol.

The libraries were sequenced on the Ion Torrent Proton™ NGS System on a P1 chip using the Template OT2 200 Kit for template preparation and Ion PI™ Sequencing 200 Kit for sequencing (Life Technologies), all according to the manufacturer’s instructions. This method produces single-end, strand-specific reads. Raw sequencing data were processed using the Torrent Suite™ Software (Life Technologies). Barcode removal and quality trimming were performed in Torrent Suite™ using default parameters (30 % QC threshold, reads <25 nt rejected). The resulting reads range from 25 to 351–367 nt, with a mean of about 70 nt. The processed reads were exported as FASTQ files and imported into CLC Genomics Workbench 8 (http://www.clcbio.com) for the mapping, analysis and visualization of sequencing results. The RNA-seq workflow used in this software is based on the methodology of Mortazavi et al. [137]. The complete mtDNA sequence of C. albicans strain SC5314 [GenBank:AF285261.1], with additional feature annotations from the Candida Genome Database (Candida Genome Database, http://www.candidagenome.org, [105]) was used as the reference for read mapping. For analyses involving the nuclear genome, Assembly 22 of the C. albicans SC5314 genome sequence [138] was used as reference. Reads were mapped to both strands the entire reference sequence, including intergenic regions using default parameters (mismatch cost 2, indel cost 3, length and similarity fractions 0.8). Following the removal of the second copy of the inverted repeat region from the reference sequence, only uniquely mapping reads were counted. Expression values for annotated genes were calculated as RPKM [137].

RT-PCR

Total cellular RNA was extracted and treated with DNAse as described above. 5 μg of DNA-free RNA was reverse transcribed by Maxima™ Reverse Transcriptase (Thermo Scientific) with random hexamers. 2 μl of tenfold diluted RT product was amplified in 28 PCR cycles in 20 μl with 0.5 U of Phusion polymerase (Thermo Scientific) and 10 pmol specific primers (Additional file 5: Table S5), and analyzed by agarose gel electrophoresis.

Northern blot analysis

Northern hybridization was performed essentially as described previously [139]. 2.5 μg of mitochondrial RNA was separated on a 1 % denaturing formaldehyde gel, transferred onto Nytran N nylon membrane (GE Healthcare) and hybridized with the appropriate probe. Fragments of respective mitochondrial genes amplified by PCR and cloned into plasmid vectors were radiolabeled with α-³²P-dATP using the Nick Translation System (Invitrogen). For the detection of ATP6, ATP8, ATP9 oligonucleotide probes labeled with γ-³²P-ATP using the T4 polynucleotide kinase (New England Biolabs) were used. All the probes used in Northern blot analysis are listed in Additional file 5: Table S5.

5’-RACE analysis of transcription start sites

Primary transcript 5’ termini were determined by 5’-RACE technique according to Bensing et al. [74] with minor modifications.

5’ triphosphates were converted to monophosphates by treatment of 5 μg of mitochondrial RNA with 10 units of tobacco acid pyrophospatase (TAP) (Epicentre Technologies) at 37 °C for 60 min in the presence of 40 U of RNase inhibitor (Thermo Scientific) in the appropriate buffer (50 mM sodium acetate pH 6.0, 10 mM EDTA, 1 % β-mercaptoethanol, 0.1 % TRITON X-100). Control RNA was incubated under the same conditions without TAP enzyme. RNA was then extracted with phenol/chloroform/octanol (25:24:1), precipitated from the aqueous phase by ethanol/sodium acetate (pH 5.2) and dissolved in water. RNA was subsequently ligated with 100 pM of chimeric DNA-RNA adapter (44 nt, blocked at the 5’ end with 5'-O-methyl-2'-deoxythymidine, last three nucleotides were RNA) at 17 °C for 12 h with 50 U of T4 RNA ligase (New England Biolabs), in the presence of 150 μM ATP and 80 U of RNase inhibitor (Thermo Scientific), in the appropriate buffer (50 mM Tris HCl, pH 7.5, 10 mM MgCl₂, 4 mM DTT, 10 % DMSO). Ligation was followed by phenol/chloroform/octanol extraction, and ethanol/sodium acetate precipitation. RNA was dissolved in water and reverse-transcribed using 2 pmol gene specific primers and Super Script III Reverse Transcriptase (Invitrogen), according to the manufacturer’s instructions.

The products of reverse transcription were amplified by PCR, with 1 μl of RT reaction as a template, 25 pmol of each nested gene specific and adapter specific primer, 250 μM of each dNTP, 0.5 units of Phusion Polymerase (Thermo Scientific) in 25 μl of the manufacturer’s recommended buffer. Cycling conditions were 98 °C/3 min, 28 cycles of 98 °C/ 30 s, 55 °C/30 s, 72 °C/30 s, 72 °C/7 min. PCR products were separated on 3 % high resolution agarose gels (Polskie Agarozy), excised, eluted from the gel (Thermo Scientific GeneJET kit) and cloned into pJET1.2 Cloning Vector (Thermo Scientific CloneJET). Bacterial clones containing the plasmids with the inserts of proper size were identified by BglII digestion. Inserts were analyzed by Sanger sequencing with vector specific primers at the IBB PAS sequencing facility (http://oligo.ibb.waw.pl/).

Each 5’-RACE reaction was performed for two independent RNA preparations (from BWP17 and SN148 Candida albicans strains). A list of all oligos and primers used in 5’-RACE reactions is provided in Additional file 5: Table S5.

Phylogenetic analysis

The mtDNA sequences used in the comparative analysis, with accession numbers and references are listed in Additional file 6: Table S6. Concatenated amino acid sequences encoded by 14 mitochondrial protein coding genes were aligned with MUSCLE version 3.8.31 [140] (3911 amino acid sites after manual removal of regions containing gaps in the alignment), and the tree was inferred using PhyloBayes (MPI version 1.5a) [141, 142] using the CAT-GTR model. Two MCMC chains were run in parallel for 5000 cycles, at which point the maximum discrepancy (maxdiff) value reached 0.009 (maxdiff <0.1 is considered sufficient). First 1000 trees in each chain were discarded as burnin, and one in two of the remaining trees were sampled for posterior consensus. The tree was rooted using the Y. lipolytica sequence as outgroup.

Availability of supporting data

The datasets supporting the results of this article are available in the [Sequence Read Archive], [SRP056472 http://www.ncbi.nlm.nih.gov/Traces/study/?acc=SRP056472]

Abbreviations

mtDNA:: Mitochondrial DNA
OXPXOS:: Oxidative phosphorylation
TU:: Transcription unit
IR:: Inverted repeat
DHE:: Double-hairpin element
TAP:: Tobacco alkaline phosphatase
RT-PCR:: Reverse transcription polymerase chain reaction
5’ RACE:: Rapid amplification of 5′ complementary DNA ends
NGS:: Next generation sequencing

References

Taylor JW, Berbee ML. Dating divergences in the Fungal Tree of Life: review and new analyses. Mycologia. 2006;98:838–49.
Article PubMed Google Scholar
Dujon B. Yeast evolutionary genomics. Nat Rev Genet. 2010;11:512–24.
Article CAS PubMed Google Scholar
Chou J-Y, Leu J-Y. Speciation through cytonuclear incompatibility: insights from yeast and implications for higher eukaryotes. Bioessays. 2010;32:401–11.
Article CAS PubMed Google Scholar
Lee H-Y, Chou J-Y, Cheong L, Chang N-H, Yang S-Y, Leu J-Y. Incompatibility of nuclear and mitochondrial genomes causes hybrid sterility between two yeast species. Cell. 2008;135:1065–73.
Article CAS PubMed Google Scholar
Levin L, Blumberg A, Barshad G, Mishmar D. Mito-nuclear co-evolution: the positive and negative sides of functional ancient mutations. Front Genet. 2014;5:448.
Article PubMed Central PubMed CAS Google Scholar
Spírek M, Poláková S, Jatzová K, Sulo P. Post-zygotic sterility and cytonuclear compatibility limits in S. cerevisiae xenomitochondrial cybrids. Front Genet. 2015;5:454.
PubMed Central PubMed Google Scholar
Paliwal S, Fiumera AC, Fiumera HL. Mitochondrial-nuclear epistasis contributes to phenotypic variation and coadaptation in natural isolates of Saccharomyces cerevisiae. Genetics. 2014;198:1251–65.
Article CAS PubMed PubMed Central Google Scholar
Lang BF, Gray MW, Burger G. Mitochondrial genome evolution and the origin of eukaryotes. Annu Rev Genet. 1999;33:351–97.
Article CAS PubMed Google Scholar
Nosek J, Fukuhara H. NADH dehydrogenase subunit genes in the mitochondrial DNA of yeasts. J Bacteriol. 1994;176:5622–30.
PubMed Central CAS PubMed Google Scholar
Jung PP, Schacherer J, Souciet J-L, Potier S, Wincker P, de Montigny J. The complete mitochondrial genome of the yeast Pichia sorbitophila. FEMS Yeast Res. 2009;9:903–10.
Article CAS PubMed Google Scholar
Seif ER, Forget L, Martin NC, Lang BF. Mitochondrial RNase P RNAs in ascomycete fungi: lineage-specific variations in RNA secondary structure. RNA. 2003;9:1073–83.
Article PubMed Central CAS PubMed Google Scholar
Ojala D, Montoya J, Attardi G. tRNA punctuation model of RNA processing in human mitochondria. Nature. 1981;290:470–4.
Article CAS PubMed Google Scholar
Jaehning JA. Mitochondrial transcription: is a pattern emerging? Mol Microbiol. 1993;8:1–4.
Article CAS PubMed Google Scholar
Szczesny RJ, Borowski LS, Malecki M, Wojcik MA, Stepien PP, Golik P. RNA degradation in yeast and human mitochondria. Biochim Biophys Acta. 1819;2012:1027–34.
Google Scholar
Hoffmann B, Nickel J, Speer F, Schafer B: The 3' ends of mature transcripts are generated by a processosome complex in fission yeast mitochondria. J Mol Biol. 2008;377:1024–1037.
Fekete Z, Ellis TP, Schonauer MS, Dieckmann CL. Pet127 governs a 5“ - > 3-”exonuclease important in maturation of apocytochrome b mRNA in Saccharomyces cerevisiae. J Biol Chem. 2008;283:3767–72.
Article CAS PubMed Google Scholar
Wiesenberger G, Fox TD. Pet127p, a membrane-associated protein involved in stability and processing of Saccharomyces cerevisiae mitochondrial RNAs. Mol Cell Biol. 1997;17:2816–24.
Article PubMed Central CAS PubMed Google Scholar
Hofmann TJ, Min J, Zassenhaus HP. Formation of the 3' end of yeast mitochondrial mRNAs occurs by site-specific cleavage two bases downstream of a conserved dodecamer sequence. Yeast. 1993;9:1319–30.
Article CAS PubMed Google Scholar
Schafer B, Hansen M, Lang BF. Transcription and RNA-processing in fission yeast mitochondria. RNA. 2005;11:785–95.
Article PubMed Central PubMed CAS Google Scholar
Min J, Zassenhaus HP. Identification of a protein complex that binds to a dodecamer sequence found at the 3' ends of yeast mitochondrial mRNAs. Mol Cell Biol. 1993;13:4167–73.
Article PubMed Central CAS PubMed Google Scholar
Osinga KA, De Vries E, der Horst Van GTJ, Tabak HF. Processing of yeast mitochondrial messenger RNAs at a conserved dodecamer sequence. EMBO J. 1984;3:829–34.
PubMed Central CAS PubMed Google Scholar
Zhu H, Conrad-Webb H, Liao XS, Perlman PS, Butow RA. Functional expression of a yeast mitochondrial intron-encoded protein requires RNA processing at a conserved dodecamer sequence at the 3' end of the gene. Mol Cell Biol. 1989;9:1507–12.
Article PubMed Central CAS PubMed Google Scholar
Cardazzo B, Rinaldi T, Frontali L, Carignani G, Palleschi C. Evolution of mitochondrial genomes in yeast: a study of mitochondrial divergence in two closely related species, Saccharomyces douglasii and Saccharomyces cerevisiae. Mol Biol Evol. 1997;14:200–3.
Article CAS PubMed Google Scholar
Tian GL, Macadre C, Kruszewska A, Szczesniak B, Ragnini A, Grisanti P, et al. Incipient mitochondrial evolution in yeasts. I. The physical map and gene order of Saccharomyces douglasii mitochondrial DNA discloses a translocation of a segment of 15,000 base-pairs and the presence of new introns in comparison with Saccharomyces cerevisiae. J Mol Biol. 1991;218:735–46.
Article CAS PubMed Google Scholar
Tian GL, Michel F, Macadre C, Slonimski PP, Lazowska J. Incipient mitochondrial evolution in yeasts. II. The complete sequence of the gene coding for cytochrome b in Saccharomyces douglasii reveals the presence of both new and conserved introns and discloses major differences in the fixation of mutations in evolution. J Mol Biol. 1991;218:747–60.
Article CAS PubMed Google Scholar
Jung PP, Friedrich A, Souciet J-L, Louis V, Potier S, de Montigny J, et al. Complete mitochondrial genome sequence of the yeast Z`Pichia farinosa and comparative analysis of closely related species. Curr Genet. 2010;56:507–15.
Article CAS PubMed Google Scholar
Valach M, Pryszcz LP, Tomaska L, Gacser A, Gabaldón T, Nosek J. Mitochondrial genome variability within the Candida parapsilosis species complex. Mitochondrion. 2012;12:514–9.
Article CAS PubMed Google Scholar
Valach M, Farkas Z, Fricova D, Kovac J, Brejova B, Vinar T, et al. Evolution of linear chromosomes and multipartite genomes in yeast mitochondria. Nucleic Acids Res. 2011;39:4202–19.
Article PubMed Central CAS PubMed Google Scholar
Lang BF, Laforest M-J, Burger G. Mitochondrial introns: a critical view. Trends Genet. 2007;23:119–25.
Article CAS PubMed Google Scholar
Lipinski KA, Kaniak-Golik A, Golik P. Maintenance and expression of the S. cerevisiae mitochondrial genome--from genetics to evolution and systems biology. Biochim Biophys Acta. 1797;2010:1086–98.
Google Scholar
Chiron S, Gaisne M, Guillou E, Belenguer P, Clark-Walker GD, Bonnefoy N. Studying mitochondria in an attractive model: Schizosaccharomyces pombe. Methods Mol Biol. 2007;372:91–105.
Article CAS PubMed Google Scholar
Turk EM, Das V, Seibert RD, Andrulis ED. The mitochondrial RNA landscape of Saccharomyces cerevisiae. PLoS One. 2013;8:e78105.
Article PubMed Central CAS PubMed Google Scholar
Foury F, Roganti T, Lecrenier N, Purnelle B. The complete sequence of the mitochondrial genome of Saccharomyces cerevisiae. FEBS Lett. 1998;440:325–31.
Article CAS PubMed Google Scholar
Pramateftaki PV, Kouvelis VN, Lanaridis P, Typas MA. Complete mitochondrial genome sequence of the wine yeast Candida zemplinina: intraspecies distribution of a novel group-IIB1 intron with eubacterial affiliations. FEMS Yeast Res. 2008;8:311–27.
Article CAS PubMed Google Scholar
Nosek J, Novotna M, Hlavatovicova Z, Ussery DW, Fajkus J, Tomaska L. Complete DNA sequence of the linear mitochondrial genome of the pathogenic yeast Candida parapsilosis. Mol Genet Genomics. 2004;272:173–80.
Article CAS PubMed Google Scholar
Gaillardin C, Neuvéglise C, Kerscher S, Nicaud J-M. Mitochondrial genomes of yeasts of the Yarrowia clade. FEMS Yeast Res. 2012;12:317–31.
Article CAS PubMed Google Scholar
Bouchier C, Ma L, Créno S, Dujon B, Fairhead C. Complete mitochondrial genome sequences of three Nakaseomyces species reveal invasion by palindromic GC clusters and considerable size expansion. FEMS Yeast Res. 2009;9:1283–92.
Article CAS PubMed Google Scholar
Rycovska A, Valach M, Tomaska L, Bolotin-Fukuhara M, Nosek J. Linear versus circular mitochondrial genomes: intraspecies variability of mitochondrial genome architecture in Candida parapsilosis. Microbiology (Reading, Engl). 2004;150:1571–80.
Article CAS Google Scholar
Matsuoka M, Matsubara M, Inoue J, Kakehi M, Imanaka T. Organization and transcription of the mitochondrial ATP synthase genes in the yeast Yarrowia lipolytica. Curr Genet. 1994;26:382–9.
Article CAS PubMed Google Scholar
Lang BF, Jakubkova M, Hegedusova E, Daoud R, Forget L, Brejova B, et al. Massive programmed translational jumping in mitochondria. Proc Natl Acad Sci U S A. 2014;111:5926–31.
Article PubMed Central CAS PubMed Google Scholar
Kosa P, Valach M, Tomaska L, Wolfe KH, Nosek J. Complete DNA sequences of the mitochondrial genomes of the pathogenic yeasts Candida orthopsilosis and Candida metapsilosis: insight into the evolution of linear DNA genomes from mitochondrial telomere mutants. Nucleic Acids Res. 2006;34:2472–81.
Article PubMed Central CAS PubMed Google Scholar
Miyakawa I, Okamuro A, Kinsky S, Visacka K, Tomaska L, Nosek J. Mitochondrial nucleoids from the yeast Candida parapsilosis: expansion of the repertoire of proteins associated with mitochondrial DNA. Microbiology (Reading, Engl). 2009;155:1558–68.
Article CAS Google Scholar
Fricova D, Valach M, Farkas Z, Pfeiffer I, Kucsera J, Tomaska L, et al. The mitochondrial genome of the pathogenic yeast Candida subhashii: GC-rich linear DNA with a protein covalently attached to the 5' termini. Microbiology (Reading, Engl). 2010;156:2153–63.
Article CAS Google Scholar
Visacka K, Gerhold JM, Petrovicova J, Kinsky S, Jõers P, Nosek J, et al. Novel subfamily of mitochondrial HMG box-containing proteins: functional analysis of Gcf1p from Candida albicans. Microbiology (Reading, Engl). 2009;155:1226–40.
Article CAS Google Scholar
Gerhold JM, Sedman T, Visacka K, Slezakova J, Tomaska L, Nosek J, et al. Replication intermediates of the linear mitochondrial DNA of Candida parapsilosis suggest a common recombination based mechanism for yeast mitochondria. J Biol Chem. 2014;289:22659–70.
Article PubMed Central CAS PubMed Google Scholar
Gerhold JM, Aun A, Sedman T, Jõers P, Sedman J. Strand invasion structures in the inverted repeat of Candida albicans mitochondrial DNA reveal a role for homologous recombination in replication. Mol Cell. 2010;39:851–61.
Article CAS PubMed Google Scholar
Bendich AJ. The end of the circle for yeast mitochondrial DNA. Mol Cell. 2010;39:831–2.
Article CAS PubMed Google Scholar
Ohama T, Suzuki T, Mori M, Osawa S, Ueda T, Watanabe K, et al. Non-universal decoding of the leucine codon CUG in several Candida species. Nucleic Acids Res. 1993;21:4039–45.
Article PubMed Central CAS PubMed Google Scholar
Kawaguchi Y, Honda H, Taniguchi-Morimura J, Iwasaki S. The codon CUG is read as serine in an asporogenic yeast Candida cylindracea. Nature. 1989;341:164–6.
Article CAS PubMed Google Scholar
Fitzpatrick DA, Logue ME, Stajich JE, Butler G. A fungal phylogeny based on 42 complete genomes derived from supertree and combined gene analysis. BMC Evol Biol. 2006;6:99.
Article PubMed Central PubMed CAS Google Scholar
Hernday AD, Noble SM, Mitrovich QM, Johnson AD. Genetics and molecular biology in Candida albicans. Meth Enzymol. 2010;470:737–58.
Article CAS PubMed Google Scholar
Noble SM, Johnson AD. Genetics of Candida albicans, a diploid human fungal pathogen. Annu Rev Genet. 2007;41:193–211.
Article CAS PubMed Google Scholar
De Backer MD, Magee PT, Pla J. Recent developments in molecular genetics of Candida albicans. Annu Rev Microbiol. 2000;54:463–98.
Article PubMed Google Scholar
Scherer S, Magee PT. Genetics of Candida albicans. Microbiol Rev. 1990;54:226–41.
PubMed Central CAS PubMed Google Scholar
Shepherd MG, Poulter RT, Sullivan PA. Candida albicans: biology, genetics, and pathogenicity. Annu Rev Microbiol. 1985;39:579–614.
Article CAS PubMed Google Scholar
Askew C, Sellam A, Epp E, Hogues H, Mullick A, Nantel A, et al. Transcriptional regulation of carbohydrate metabolism in the human pathogen Candida albicans. PLoS Pathog. 2009;5:e1000612.
Article PubMed Central PubMed CAS Google Scholar
Noble SM, Johnson AD. Strains and strategies for large-scale gene deletion studies of the diploid human fungal pathogen Candida albicans. Eukaryot Cell. 2005;4:298–309.
Article PubMed Central CAS PubMed Google Scholar
Noble SM, French S, Kohn LA, Chen V, Johnson AD. Systematic screens of a Candida albicans homozygous deletion library decouple morphogenetic switching and pathogenicity. Nat Genet. 2010;42:590–8.
Article PubMed Central CAS PubMed Google Scholar
Wilson RB, Davis D, Mitchell AP. Rapid hypothesis testing with Candida albicans through gene disruption with short homology regions. J Bacteriol. 1999;181:1868–74.
PubMed Central CAS PubMed Google Scholar
Massey SE, Moura G, Beltrão P, Almeida R, Garey JR, Tuite MF, et al. Comparative evolutionary genomics unveils the molecular mechanism of reassignment of the CTG codon in Candida spp. Genome Res. 2003;13:544–57.
Article PubMed Central CAS PubMed Google Scholar
Jõers P, Gerhold JM, Sedman T, Kuusk S, Sedman J. The helicase CaHmi1p is required for wild‐type mitochondrial DNA organization in Candida albicans. FEMS Yeast Res. 2007;7:118–30.
Article PubMed CAS Google Scholar
Hewitt VL, Heinz E, Shingu-Vazquez M, Qu Y, Jelicic B, Lo TL, et al. A model system for mitochondrial biogenesis reveals evolutionary rewiring of protein import and membrane assembly pathways. Proc Natl Acad Sci U S A. 2012;109:E3358–66.
Article PubMed Central CAS PubMed Google Scholar
Bartelli TF, Ferreira RC, Colombo AL, Briones MRS. Intraspecific comparative genomics of Candida albicans mitochondria reveals non-coding regions under neutral evolution. Infect Genet Evol. 2013;14:302–12.
Article CAS PubMed Google Scholar
Gojković Z, Knecht W, Zameitat E, Warneboldt J, Coutelis JB, Pynyaha Y, et al. Horizontal gene transfer promoted evolution of the ability to propagate under anaerobic conditions in yeasts. Mol Genet Genomics. 2004;271:387–93.
Article PubMed CAS Google Scholar
Piskur J, Rozpedowska E, Poláková S, Merico A, Compagno C. How did Saccharomyces evolve to become a good brewer? Trends Genet. 2006;22:183–6.
Article CAS PubMed Google Scholar
Li D, Chen H, Florentino A, Alex D, Sikorski P, Fonzi WA, et al. Enzymatic dysfunction of mitochondrial complex I of the Candida albicans goa1 mutant is associated with increased reactive oxidants and cell death. Eukaryot Cell. 2011;10:672–82.
Article PubMed Central CAS PubMed Google Scholar
Khamooshi K, Sikorski P, Sun N, Calderone R, Li D. The Rbf1, Hfl1 and Dbp4 of Candida albicans regulate common as well as transcription factor-specific mitochondrial and other cell activities. BMC Genomics. 2014;15:56.
Article PubMed Central PubMed Google Scholar
Richard ML, Nobile CJ, Bruno VM, Mitchell AP. Candida albicans biofilm-defective mutants. Eukaryot Cell. 2005;4:1493–502.
Article PubMed Central CAS PubMed Google Scholar
Chen XJ, Clark-Walker GD. The petite mutation in yeasts: 50 years on. Int Rev Cytol. 2000;194:197–238.
Article CAS PubMed Google Scholar
Anderson JB, Wickens C, Khan M, Cowen LE, Federspiel N, Jones T, et al. Infrequent genetic exchange and recombination in the mitochondrial genome of Candida albicans. J Bacteriol. 2001;183:865–72.
Article PubMed Central CAS PubMed Google Scholar
Bambach A, Fernandes MP, Ghosh A, Kruppa M, Alex D, Li D, et al. Goa1p of Candida albicans localizes to the mitochondria during stress and is required for mitochondrial function and virulence. Eukaryot Cell. 2009;8:1706–20.
Article PubMed Central CAS PubMed Google Scholar
Cheng S, Clancy CJ, Zhang Z, Hao B, Wang W, Iczkowski KA, et al. Uncoupling of oxidative phosphorylation enables Candida albicans to resist killing by phagocytes and persist in tissue. Cell Microbiol. 2007;9:492–501.
Article PubMed CAS Google Scholar
Chabrier-Roselló Y, Giesselman BR, De Jesús-Andino FJ, Foster TH, Mitra S, Haidaris CG. Inhibition of electron transport chain assembly and function promotes photodynamic killing of Candida. J Photochem Photobiol B Biol. 2010;99:117–25.
Article CAS Google Scholar
Bensing BA, Meyer BJ, Dunny GM. Sensitive detection of bacterial transcription initiation sites and differentiation from RNA processing sites in the pheromone-induced plasmid transfer system of Enterococcus faecalis. Proc Natl Acad Sci U S A. 1996;93:7794–9.
Article PubMed Central CAS PubMed Google Scholar
Kühn K, Weihe A, Börner T. Multiple promoters are a common feature of mitochondrial genes in Arabidopsis. Nucleic Acids Res. 2005;33:337–46.
Article PubMed Central PubMed CAS Google Scholar
Wills JW, Troutman WB, Riggsby WS. Circular mitochondrial genome of Candida albicans contains a large inverted duplication. J Bacteriol. 1985;164:7–13.
PubMed Central CAS PubMed Google Scholar
Shaw JA, Troutman WB, Lasker BA, Mason MM, Riggsby WS. Characterization of the inverted duplication in the mitochondrial DNA of Candida albicans. J Bacteriol. 1989;171:6353–6.
PubMed Central CAS PubMed Google Scholar
Mercer TR, Neph S, Dinger ME, Crawford J, Smith MA, Shearwood A-MJ, et al. The human mitochondrial transcriptome. Cell. 2011;146:645–58.
Article PubMed Central CAS PubMed Google Scholar
Tarassov I, Kamenski P, Kolesnikova O, Karicheva O, Martin RP, Krasheninnikov IA, et al. Import of nuclear DNA-encoded RNAs into mitochondria and mitochondrial translation. Cell Cycle. 2007;6:2473–7.
Article CAS PubMed Google Scholar
Christianson TW, Rabinowitz M. Identification of multiple transcriptional initiation sites on the yeast mitochondrial genome by in vitro capping with guanylyltransferase. J Biol Chem. 1983;258:14025–33.
CAS PubMed Google Scholar
Christianson TW, Edwards J, Levens D, Locker J, Rabinowitz M. Transcriptional initiation and processing of the small ribosomal RNA of yeast mitochondria. J Biol Chem. 1982;257:6494–500.
CAS PubMed Google Scholar
Osinga KA, Tabak HF. Initiation of transcription of genes for mitochondrial ribosomal RNA in yeast: comparison of the nucleotide sequence around the 5′-ends of both genes reveals a homologous stretch of 17 nudeotides. Nucleic Acids Res. 1982;10:3617–26.
Article PubMed Central CAS PubMed Google Scholar
Kerscher S, Durstewitz G, Casaregola S, Gaillardin C, Brandt U. The complete mitochondrial genome of Yarrowia lipolytica. Comp Funct Genomics. 2001;2:80–90.
Article PubMed Central CAS PubMed Google Scholar
Drissi R, Sor F, Fukuhara H. DNA sequences coding for the ribosomal small subunit RNA and valyl tRNA from the linear mitochondrial genome of the yeast Williopsis mrakii. Nucleic Acids Res. 1993;21:2947.
Article PubMed Central CAS PubMed Google Scholar
Drissi R, Sor F, Nosek J, Fukuhara H. Genes of the linear mitochondrial DNA of Williopsis mrakii: coding sequences for a maturase-like protein, a ribosomal protein VAR1 homologue, cytochrome oxidase subunit 2 and methionyl tRNA. Yeast. 1994;10:391–8.
Article CAS PubMed Google Scholar
Koszul R, Malpertuy A, Frangeul L, Bouchier C, Wincker P, Thierry A, et al. The complete mitochondrial genome sequence of the pathogenic yeast Candida (Torulopsis) glabrata. FEBS Lett. 2003;534:39–48.
Article CAS PubMed Google Scholar
Ragnini A, Frontali L. Ordered processing of the polygenic transcripts from a mitochondrial tRNA gene cluster in K. lactis. Curr Genet. 1994;25:342–9.
Article CAS PubMed Google Scholar
McAllister WT. Structure and function of the bacteriophage T7 RNA polymerase (or, the virtues of simplicity). Cell Mol Biol Res. 1993;39:385–91.
CAS PubMed Google Scholar
Romilly C, Caldelari I, Parmentier D, Lioliou E, Romby P, Fechter P. Current knowledge on regulatory RNAs and their machineries in Staphylococcus aureus. RNA Biol. 2012;9:402–13.
Article CAS PubMed Google Scholar
Bobrovskyy M, Vanderpool CK. Regulation of bacterial metabolism by small RNAs using diverse mechanisms. Annu Rev Genet. 2013;47:209–32.
Article CAS PubMed Google Scholar
Georg J, Hess WR. cis-antisense RNA, another level of gene regulation in bacteria. Microbiol Mol Biol Rev. 2011;75:286–300.
Article PubMed Central CAS PubMed Google Scholar
Kubelik AR, Kennell JC, Akins RA, Lambowitz AM. Identification of Neurospora mitochondrial promoters and analysis of synthesis of the mitochondrial small rRNA in wild-type and the promoter mutant [poky]. J Biol Chem. 1990;265:4515–26.
CAS PubMed Google Scholar
Breitenberger CA, Browning KS, Alzner-DeWeerd B, RajBhandary UL. RNA processing in Neurospora crassa mitochondria: use of transfer RNA sequences as signals. EMBO J. 1985;4:185–95.
PubMed Central CAS PubMed Google Scholar
Burger G, Helmer Citterich M, Nelson MA, Werner S, Macino G. RNA processing in Neurospora crassa mitochondria: transfer RNAs punctuate a large precursor transcript. EMBO J. 1985;4:197–204.
PubMed Central CAS PubMed Google Scholar
Dyson NJ, Brown TA, Ray JA, Waring RB, Scazzocchio C, Davies RW. Processing of mitochondrial RNA in Aspergillus nidulans. J Mol Biol. 1989;208:587–99.
Article CAS PubMed Google Scholar
Tracy RL, Stern DB. Mitochondrial transcription initiation: promoter structures and RNA polymerases. Curr Genet. 1995;28:205–16.
Article CAS PubMed Google Scholar
Piechota J, Tomecki R, Gewartowski K, Szczesny R, Dmochowska A, Kudła M, et al. Differential stability of mitochondrial mRNA in HeLa cells. Acta Biochim Pol. 2006;53:157–68.
CAS PubMed Google Scholar
Krause K, Dieckmann CL. Analysis of transcription asymmetries along the tRNAE-COB operon: evidence for transcription attenuation and rapid RNA degradation between coding sequences. Nucleic Acids Res. 2004;32:6276–83.
Article PubMed Central CAS PubMed Google Scholar
Betat H, Long Y, Jackman JE, Mörl M. From End to End: tRNA Editing at 5'- and 3-' Terminal Positions. Int J Mol Sci. 2014;15:23975–98.
Article PubMed Central PubMed CAS Google Scholar
Osawa S, Jukes TH, Watanabe K, Muto A. Recent evidence for evolution of the genetic code. Microbiol Rev. 1992;56:229–64.
PubMed Central CAS PubMed Google Scholar
Bonitz SG, Berlani R, Coruzzi G, Li M, Macino G, Nobrega FG, et al. Codon recognition rules in yeast mitochondria. Proc Natl Acad Sci U S A. 1980;77:3167–70.
Article PubMed Central CAS PubMed Google Scholar
Gobert A, Gutmann B, Taschner A, Gössringer M, Holzmann J, Hartmann RK, et al. A single Arabidopsis organellar protein has RNase P activity. Nat Struct Mol Biol. 2010;17:740–4.
Article CAS PubMed Google Scholar
Rossmanith W. Of P and Z: mitochondrial tRNA processing enzymes. Biochim Biophys Acta. 1819;2012:1017–26.
Google Scholar
Walker SC, Engelke DR. A protein-only RNase P in human mitochondria. Cell. 2008;135:412–4.
Article PubMed Central CAS PubMed Google Scholar
Inglis DO, Arnaud MB, Binkley J, Shah P, Skrzypek MS, Wymore F, et al. The Candida genome database incorporates multiple Candida species: multispecies search and analysis tools with curated gene and protein information for Candida albicans and Candida glabrata. Nucleic Acids Res. 2012;40(Database issue):D667–74.
Article PubMed Central CAS PubMed Google Scholar
Lazowska J, Jacq C, Slonimski PP. Sequence of introns and flanking exons in wild-type and box3 mutants of cytochrome b reveals an interlaced splicing protein coded by an intron. Cell. 1980;22:333–48.
Article CAS PubMed Google Scholar
Lazowska J, Claisse M, Gargouri A, Kotylak Z, Spyridakis A, Slonimski PP. Protein encoded by the third intron of cytochrome b gene in Saccharomyces cerevisiae is an mRNA maturase. Analysis of mitochondrial mutants, RNA transcripts proteins and evolutionary relationships. J Mol Biol. 1989;205:275–89.
Article CAS PubMed Google Scholar
Saldanha R, Mohr G, Belfort M, Lambowitz AM. Group I and group II introns. FASEB J. 1993;7:15–24.
CAS PubMed Google Scholar
Lambowitz AM, Belfort M. Introns as mobile genetic elements. Annu Rev Biochem. 1993;62:587–622.
Article CAS PubMed Google Scholar
Belfort M. An expanding universe of introns. Science. 1993;262:1009–10.
Article CAS PubMed Google Scholar
Belfort M, Reaban ME, Coetzee T, Dalgaard JZ. Prokaryotic introns and inteins: a panoply of form and function. J Bacteriol. 1995;177:3897–903.
PubMed Central CAS PubMed Google Scholar
Belfort M, Perlman PS. Mechanisms of intron mobility. J Biol Chem. 1995;270:30237–40.
Article CAS PubMed Google Scholar
Belfort M, Roberts RJ. Homing endonucleases: keeping the house in order. Nucleic Acids Res. 1997;25:3379–88.
Article PubMed Central CAS PubMed Google Scholar
Dujon B. Group I introns as mobile genetic elements: facts and mechanistic speculations--a review. Gene. 1989;82:91–114.
Article CAS PubMed Google Scholar
Marchler-Bauer A, Derbyshire MK, Gonzales NR, Lu S, Chitsaz F, Geer LY, et al. CDD: NCBI's conserved domain database. Nucleic Acids Res. 2015;43(Database issue):D222–6.
Article PubMed Central PubMed Google Scholar
Chevalier BS, Stoddard BL. Homing endonucleases: structural and functional insight into the catalysts of intron/intein mobility. Nucleic Acids Res. 2001;29:3757–74.
Article PubMed Central CAS PubMed Google Scholar
Szczepanek T, Gora M, Monteilhet C, Wysocka M, Lazowska J, Golik P. In vivo analysis of the relationships between the splicing and homing activities of a group I intron-encoded I-ScaI/bi2-maturase of Saccharomyces capensis produced in the yeast cytoplasm. FEMS Yeast Res. 2006;6:823–35.
Article CAS PubMed Google Scholar
Szczepanek T, Lazowska J. Replacement of two non-adjacent amino acids in the S.cerevisiae bi2 intron-encoded RNA maturase is sufficient to gain a homing-endonuclease activity. EMBO J. 1996;15:3758–67.
PubMed Central CAS PubMed Google Scholar
Szczepanek T, Jamoussi K, Lazowska J. Critical base substitutions that affect the splicing and/or homing activities of the group I intron bi2 of yeast mitochondria. Mol Gen Genet. 2000;264:137–44.
Article CAS PubMed Google Scholar
Dunin-Horkawicz S, Feder M, Bujnicki JM. Phylogenomic analysis of the GIY-YIG nuclease superfamily. BMC Genomics. 2006;7:98.
Article PubMed Central PubMed CAS Google Scholar
Weiller G, Schueller CM, Schweyen RJ. Putative target sites for mobile G + C rich clusters in yeast mitochondrial DNA: single elements and tandem arrays. Mol Gen Genet. 1989;218:272–83.
Article CAS PubMed Google Scholar
Paquin B, Laforest MJ, Lang BF. Double-hairpin elements in the mitochondrial DNA of Allomyces: evidence for mobility. Mol Biol Evol. 2000;17:1760–8.
Article CAS PubMed Google Scholar
Bullerwell CE, Leigh J, Forget L, Lang BF. A comparison of three fission yeast mitochondrial genomes. Nucleic Acids Res. 2003;31:759–68.
Article PubMed Central CAS PubMed Google Scholar
Paquin B, Laforest MJ, Forget L, Roewer I, Wang Z, Longcore J, et al. The fungal mitochondrial genome project: evolution of fungal mitochondrial genomes and their gene expression. Curr Genet. 1997;31:380–95.
Article CAS PubMed Google Scholar
Seif E, Leigh J, Liu Y, Roewer I, Forget L, Lang BF. Comparative mitochondrial genomics in zygomycetes: bacteria-like RNase P RNAs, mobile elements and a close source of the group I intron invasion in angiosperms. Nucleic Acids Res. 2005;33:734–44.
Article PubMed Central CAS PubMed Google Scholar
Mühlhausen S, Kollmar M. Molecular phylogeny of sequenced saccharomycetes reveals polyphyly of the alternative yeast codon usage. Genome Biol Evol. 2014;6:3222–37.
Article PubMed CAS Google Scholar
Kurtzman CP. Phylogeny of the ascomycetous yeasts and the renaming of Pichia anomala to Wickerhamomyces anomalus. Antonie Van Leeuwenhoek. 2011;99:13–23.
Article PubMed Google Scholar
Burger G, Gray MW, Forget L, Lang BF. Strikingly bacteria-like and gene-rich mitochondrial genomes throughout jakobid protists. Genome Biol Evol. 2013;5:418–38.
Article PubMed Central PubMed CAS Google Scholar
Lang BF, Burger G, O'Kelly CJ, Cedergren R, Golding GB, Lemieux C, et al. An ancestral mitochondrial DNA resembling a eubacterial genome in miniature. Nature. 1997;387:493–7.
Article CAS PubMed Google Scholar
Herrmann JM, Woellhaf MW, Bonnefoy N. Control of protein synthesis in yeast mitochondria: the concept of translational activators. Biochim Biophys Acta. 1833;2013:286–94.
Google Scholar
Aphasizhev R, Aphasizheva I. Emerging roles of PPR proteins in trypanosomes. RNA Biol. 2013;10:1495–500.
Article PubMed Central CAS PubMed Google Scholar
Lightowlers RN, Chrzanowska-Lightowlers ZM. Human pentatricopeptide proteins. RNA Biol. 2013;10:1433–8.
Article PubMed Central CAS PubMed Google Scholar
Filipovska A, Rackham O. Pentatricopeptide repeats. RNA Biol. 2013;10:1426–32.
Article PubMed Central CAS PubMed Google Scholar
Giegé P. Pentatricopeptide repeat proteins. RNA Biol. 2013;10:1417–8.
Article PubMed Central PubMed CAS Google Scholar
Herbert CJ, Golik P, Bonnefoy N. Yeast PPR proteins, watchdogs of mitochondrial gene expression. RNA Biol. 2013;10:1477–94.
Article PubMed Central CAS PubMed Google Scholar
Schmitt ME, Brown TA, Trumpower BL. A rapid and simple method for preparation of RNA from Saccharomyces cerevisiae. Nucleic Acids Res. 1990;18:3091–2.
Article PubMed Central CAS PubMed Google Scholar
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5:621–8.
Article CAS PubMed Google Scholar
Muzzey D, Schwartz K, Weissman JS, Sherlock G. Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure. Genome Biol. 2013;14:R97.
Article PubMed Central PubMed CAS Google Scholar
Malecki M, Jedrzejczak R, Puchta O, Stepien PP, Golik P. In vivo and in vitro approaches for studying the yeast mitochondrial RNA degradosome complex. Meth Enzymol. 2008;447:463–88.
Article CAS PubMed Google Scholar
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–7.
Article PubMed Central CAS PubMed Google Scholar
Lartillot N, Rodrigue N, Stubbs D, Richer J. PhyloBayes MPI: phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment. Syst Biol. 2013;62:611–5.
Article CAS PubMed Google Scholar
Lartillot N, Lepage T, Blanquart S. PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating. Bioinformatics. 2009;25:2286–8.
Article CAS PubMed Google Scholar
Crooks GE, Hon G, Chandonia J-M, Brenner SE. WebLogo: a sequence logo generator. Genome Res. 2004;14:1188–90.
Article PubMed Central CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the EU Operational Programme Innovative Economy via the Foundation for Polish Science grant TEAM/2010-6/6. The authors wish to thank Dr. Michal Koper and Dr. Michal Krzyszton for their expert advice on RNA techniques, and Prof. Ewa Bartnik for the critical reading of the manuscript.

Author information

Authors and Affiliations

Institute of Genetics and Biotechnology, Faculty of Biology, University of Warsaw, Pawinskiego 5a, 02-106, Warsaw, Poland
Adam Kolondra, Karolina Labedzka-Dmoch, Joanna M. Wenda, Katarzyna Drzewicka & Pawel Golik
Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5a, 02-106, Warsaw, Poland
Pawel Golik

Authors

Adam Kolondra
View author publications
You can also search for this author in PubMed Google Scholar
Karolina Labedzka-Dmoch
View author publications
You can also search for this author in PubMed Google Scholar
Joanna M. Wenda
View author publications
You can also search for this author in PubMed Google Scholar
Katarzyna Drzewicka
View author publications
You can also search for this author in PubMed Google Scholar
Pawel Golik
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pawel Golik.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

AK performed RNA-seq experiments and data analysis, and the evolutionary studies, KL-D performed RT-PCR and 5’-RACE experiments, JMW and KD performed Northern blot and RT-PCR experiments, PG conceived and supervised the experiments, and participated in data and evolutionary analysis. PG wrote the manuscript, with contributions from AK, KL-D, and JMW. All the authors have read and approved the final version of the manuscript.

Additional files

Additional file 1:

Supplementary Results and Discussion (PDF). Includes Table S1. and Figures S1 to S5. (PDF 1524 kb)

Additional file 2: Table S2.

Promoters and promoter-like sequences in C. albicans mtDNA. Promoters of the primary transcription units are highlighted. In the “support” column “none” means that a consensus nonanucleotide is present, but no transcriptional activity can be reliably attributed to the promoter, “reads” means that there is an increased number of reads mapping downstream of the promoter, “reads and 5’-RACE” means promoters confirmed independently (Fig. 4). The second copy of inverted repeat (IRb) is omitted, as in Fig. 2b. (XLSX 9 kb)

Additional file 3: Table S3.

RPKM values of C. albicans mitochondrial transcripts. RNA preparations from purified mitochondria were used, reads were mapped to the reference genome with the second repeat (IRb) removed to ensure unambiguous mapping of transcripts originating in repeat regions. (XLSX 9 kb)

Additional file 4: Table S4.

GC-rich elements in C. albicans mtDNA. OpenDocument (ISO/IEC 26300) spreadsheet format. Colors denote different hairpin structure types, as in Fig. 8a. The second copy of inverted repeat (IRb) is omitted. (XLSX 9 kb)

Additional file 5: Table S5.

Primers, oligonucleotide probes and cloned PCR product probes used in this work. (XLSX 14 kb)

Additional file 6: Table S6.

Mitochondrial genome sequences used in the evolutionary comparisons. Genome size, accession numbers, number of introns, and references are provided. (XLSX 14 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Kolondra, A., Labedzka-Dmoch, K., Wenda, J.M. et al. The transcriptome of Candida albicans mitochondria and the evolution of organellar transcription units in yeasts. BMC Genomics 16, 827 (2015). https://doi.org/10.1186/s12864-015-2078-z

Download citation

Received: 02 April 2015
Accepted: 13 October 2015
Published: 21 October 2015
DOI: https://doi.org/10.1186/s12864-015-2078-z

The transcriptome of Candida albicans mitochondria and the evolution of organellar transcription units in yeasts

Abstract

Background

Methods

Results

Conclusions

Background

Results and discussion

RNA-seq analysis of Candida albicans mitochondrial transcriptome

Sample collection, library preparation and sequencing

Reference sequence and read mapping

Promoters and transcription units

Identification of putative transcription units

Identification of the primary transcription start sites by 5’-RACE

The mature transcripts, introns, and RNA processing

Differences in the expression of mitochondrial genes transcribed as polycistronic primary transcripts

Mature monocistronic and bicistronic transcripts are generated by tRNA punctuation

Introns in the mitochondrial genes of C. albicans

GC-rich elements in the C. albicans mitochondrial genome and transcriptome

Transcripts of unknown function originating from inverted repeat regions

Evolutionary dynamics of transcriptional organization in yeast mitochondria

Conclusions

Methods

Strains and media

Mitochondria isolation and RNA preparation

Library preparation and RNA-seq

RT-PCR

Northern blot analysis

5’-RACE analysis of transcription start sites

Phylogenetic analysis

Availability of supporting data

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Additional files

Additional file 1:

Additional file 2: Table S2.

Additional file 3: Table S3.

Additional file 4: Table S4.

Additional file 5: Table S5.

Additional file 6: Table S6.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Genomics

Contact us