- Research article
- Open Access
The genome of newly classified Ochroconis mirabilis: Insights into fungal adaptation to different living conditions
BMC Genomics volume 17, Article number: 91 (2016)
Ochroconis mirabilis, a recently introduced water-borne dematiaceous fungus, is occasionally isolated from human skin lesions and nails. We identified an isolate of O. mirabilis from a skin scraping with morphological and molecular studies. Its genome was then sequenced and analysed for genetic features related to classification and biological characteristics.
UM 578 was identified as O. mirabilis based on morphology and internal transcribed spacer (ITS)-based phylogeny. The 34.61 Mb assembled genome with 13,435 predicted genes showed less efficiency of this isolate in plant cell wall degradation. Results from the peptidase comparison analysis with reported keratin-degrading peptidases from dermatophytes suggest that UM 578 is very unlikely to be utilising these peptidases to survive in the host. Nevertheless, we have identified peptidases from M10A, M12A and S33 families that may allow UM 578 to invade its host via extracellular matrix and collagen degradation. Furthermore, the lipases in UM 578 may have a role in supporting the fungus in host invasion. This fungus has the potential ability to synthesise melanin via the 1,8-dihydroxynaphthalene (DHN)-melanin pathway and to produce mycotoxins. The mating ability of this fungus was also inspected in this study and a mating type gene containing alpha domain was identified. This fungus is likely to produce taurine that is required in osmoregulation. The expanded gene family encoding the taurine catabolism dioxygenase TauD/TdfA domain suggests the utilisation of taurine under sulfate starvation. The expanded glutathione-S-transferase domains and RTA1-like protein families indicate the selection of genes in UM 578 towards adaptation in hostile environments.
The genomic analysis of O. mirabilis UM 578 provides a better understanding of fungal survival tactics in different habitats.
The genus Ochroconis comprises oligotrophic species found in litter, soil and moist surfaces that have also been associated with occasional opportunistic infections in humans and animals . Macroscopically, these fungi are characterised by red-brown exudates in the culture medium. Under microscopic examination, rhexolytically liberated conidia with open denticles and frills remaining on the conidial bases are commonly observed . Their taxonomic classification has been problematic. The fungi have been suggested to be grouped within Chaetothyriales  or as unclassified anamorphic ascomycetes . Machouart et al. , however, used multigene (mtSSU, nuLSU, nuSSU, RPB2 region 5–7 and RPB2 region 7–11) phylogenetic analysis to classify the fungi in the family Sympoventuriaceae of class Dothideomycetes. From the original four species, O. gallopava, O. constricta, O. humicola and O. tshawytschae , the genus has expanded to include at least 13 species . Following intensive revision based on molecular, morphological and ecological comparisons, a new genus Verruconis was proposed for the thermophilic oligotrophs O. gallopava, O. verruculosa and O. calidifluminalis . The remaining Ochroconis species are mesophilic saprobes, causing infections mainly in cold-blooded animals such as fish and frogs, but also occasionally in warm-blooded vertebrates including humans . Two of the species, O. lascauxensis and O. anomala had been isolated from Lascaux Cave, causing black stains on cave sediments, walls and paintings . A strain of O. constricta from soil was reported to have keratinolytic activity .
O. mirabilis has been previously identified as O. constricta, a species that has been isolated mostly from water (aquatic vertebrates, sea sponge and sea fan), domestic environments (bathrooms and balconies), and human skin and nails [4, 5]. We previously isolated five strains of O. constricta (UM 314, UM 324, UM 326, UM 329 and UM 578) [8, 9]. The genome of UM 578 was sequenced, and preliminary observations on its genome were reported . In this study, UM 578 was re-identified as O. mirabilis based on phylogenetic and phylogenomic analyses. Moreover, we analysed its genome content in-depth. Additional studies on carbohydrate enzymes, lipases, secondary metabolite backbone genes, mating type genes, comparative gene families and protein families expansion and contraction are presented in this study. These genomic features are possibly associated with the ability of the species to thrive in different environments.
Results and discussion
Morphological and molecular identification
Microscopic examination revealed rhexolytic liberation of conidia, unbranched, cylindrical to acicular conidiophores; smooth-walled to verruculose, subhyaline to pale brown coloured conidia that were constricted at the septum; and the presence of anastomosing hyphae as described by Samerpitak et al.  (Fig. 1). The colony of UM 578 was grey brown and raised in the middle with red-brown colour in the circumference and the medium surrounding the colony. The reverse side of the colony was dark brown in colour and did not grow into the agar.
UM 578 was previously identified as O. constricta. Here, we re-examined its identity using an ITS-based phylogenetic tree. Phylogenetic analysis showed UM 578 clustering with UM 314 within the O. mirabilis cluster (Fig. 2).
We constructed a phylogenomic tree with protein sequences from 17 fungal genomes from different classes, including two Leotiomycetes, two Sodariomycetes, five Eurotiomycetes, six Dothideomycetes and two Saccharomycetes as outgroups (Fig. 3; Additional file 1: Table S1). A total of 181,384 proteins were clustered into 18,666 orthologous clusters with 917 single-copy orthologues identified. The phylogenomic tree is consistent with that in a previous study  which showed O. mirabilis UM 578 within class Dothideomycetes.
Genome assembly, gene models and transposable elements
The 500-bp and 5-kb insert libraries generated 22,277,778 reads and 11,222,224 reads , respectively using Illumina HiSeq 2000. The sequencing coverage for the combined sequenced reads was 80-fold (Table 1). The reads were assembled into 603 contigs, with 544 contigs having size ≥200 bp. The contigs were then scaffolded into 163 scaffolds based on the paired-end information from both libraries. The assembly size of UM 578 was ~34.61 Mb, with a scaffold N50 of 1,170,353 bp. The assembled genome had a GC content of 51.84 %. A total of 13,435 genes were predicted using GeneMark-ES version 2.3e , with an average gene length of 1411 bp. A total of 14 rRNAs and 71 tRNAs were predicted in the genome.
There were 285 class I retrotransposons and 10 class II DNA transposons encompassing 1.09 % of the assembled genome (Table 2). Gypsy and Copia from the class I retrotransposons and Tc1-Mariner from the class II DNA transposons are reported to be the most abundant transposable elements in fungal genomes . As seen in Table 2, Gypsy type forms the highest number of transposable elements in UM 578, followed by DDE_1 and Ty1_Copia while helitronORF type forms the largest number in class II transposons. Although the repetitive elements described here may not represent all transposable elements in the genome of this isolate owing to the limitation of Illumina technology , they provide an idea of the type of transposable elements identified in this genome. The composition and organisation of repetitive elements in the genome enable the delineation of the best strategy for sequencing the whole genome .
In the functional categorisation using EuKaryotic Orthologous Group (KOG), 6078 predicted genes were redundantly assigned into KOG classifications (Fig. 4a), of which, 1535 genes were annotated as poorly characterised proteins, 1177 assignments were in the category of Information Storage and Processing, 1173 in the Cellular Processes and Signalling category and 2424 in the Metabolism category. Of the eight KOG classifications in the Metabolism category, 470 genes were annotated to the Secondary Metabolites Biosynthesis, Transport and Catabolism [Q], 431 genes annotated to Lipid Transport and Metabolism [I] and 403 genes annotated to Energy Production and Conversion [C], composing the top three classifications in this category. In class Q, there were 57 genes annotated as flavin-containing monooxygenase. Flavin-containing monooxygenases are widely found in many organisms and have multiple biological functions. These enzymes in UM 578 might play a role in the biodegradation of environmental aromatic compounds, detoxification of drugs and antibiotics, and siderophore biosynthesis [14, 15], processes which provide a survival advantage in the adverse environment.
Of the 1012 predicted genes annotated in the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway maps, the xenobiotics biodegradation and metabolism is the fourth highest metabolic pathway mapped with predicted genes from this species (288 genes), after carbohydrate metabolism (578 genes), amino acid metabolism (507 genes) and lipid metabolism (311 genes) (Fig. 4b). From the styrene degradation map (Additional file 2: Figure S1), three enzymes were mapped in the degradation of Z-phenylacetaldoxime. The enzymes were nitrilase (EC 22.214.171.124), nitrile hydratase (EC 126.96.36.199) and amidase (EC 188.8.131.52) but no phenylacetaldoxime dehydratase (EC 184.108.40.206). To further inspect the capability of UM 578 in Z-phenylacetaldoxime degradation, a gene annotated as phenylacetaldoxime dehydratase was identified. The gene, UM578_4049 encodes an enzyme belonging to the haem-containing dehydratase family (IPR025702) that has 31 % similarity to the Bacillus sp. OxB-1 phenylacetaldoxime dehydratase Oxd [GenBank: P82604]  (Additional file 2: Figure S2). Interestingly, a nitrilase (UM578_4050) encoding gene is located adjacent to the oxd gene in the genome as observed previously in Fusarium graminearum , Bacillus sp. OxB-1  and Pseudomonas syringae  (Additional file 2: Figure S3). However, the gene arrangement in UM 578 is different from that in those reported species, and the predicted regulatory protein is missing in UM 578.
Furthermore, we found that UM 578 might be able to degrade cyanamide in the atrazine degradation pathway (Additional file 2: Figure S4). Cyanamide is a reactive substance that seldom occurs in nature and remains undegradable for a long time in an abiotic medium. It is used as a nitrogen fertiliser in the form of calcium salt and hydrogen cyanamide. Cyanamide was also reported to be phytotoxic, bactericidal and fungicidal [19, 20]. In this genome, two genes were identified to encode cyanamide hydratase (UM578_11171) and urease (UM578_4352), respectively and are shown to be expressed (Additional file 2: Figure S5). Cyanamide hydratase catalyses the hydrolysis of cyanamide to urea while urease converts the urea to ammonium. The cyanamide hydratase from the fungus Myrothecium verrucaria is well characterised and used in transgenic plants to act as a biocontrol of phytopathogens . Moreover, the end product, urea, is a useful compound that acts as a plant fertiliser to facilitate plant growth . The presence of genes involved in biodegradation shows the adaptation of the fungus towards the occurrence of non-natural compounds in the environment.
In the functional classification of UM 578 genome based on gene ontology (GO), 6829 of 13,435 predicted genes were given a GO assignment. Of these genes, 17,695 were redundantly assigned into Cellular Component Ontology, 14,584 into Molecular Function Ontology and 33,100 into Biological Process Ontology (Fig. 4c). Most of the genes were annotated to Cell (5874 genes) and Organelle (4656 genes) in the Cellular Component Ontology, Binding Activity (4230 genes) and Catalytic Activity (3743 genes) in the Molecular Function Ontology, and Cellular Process (5215 genes) and Metabolic Process (4662 genes) in the Biological Process Ontology. As these are the fundamental components and processes for the viability of an organism, it is not surprising that these genes encompass a large portion of the genome.
Heterotrophic fungi harbour Carbohydrate Active enZymes (CAZymes) to degrade complex carbohydrates from organic matters for nutrient supply. We identified 590 CAZyme catalytic domains in UM 578. The modules comprise 88 domains belonging to auxiliary activities, 43 to carbohydrate-binding modules (CBM), 149 to carbohydrate esterases (CE), 204 to glycoside hydrolases (GH), 101 to glycosyltransferases (GT) and five to polysaccharide lyases (PL) (Additional file 1: Table S2). Based on substrates specificity, UM 578 has a very small number of CAZymes involved in cellulose degradation but a high number of CAZymes involved in hemicellulose degradation (Additional file 1: Table S3), indicating a possible least preference towards cellulosic materials compared to compounds high in hemicellulose. It also harbours a larger number of CAZyme modules as compared to some necrotrophic, hemibiotrophic and saprophytic fungi (Fig. 5). However, it contains less CAZyme modules involved in plant cell wall degradation (Fig. 6). The large number of enzymes involved in hemicellulose degradation present in UM 578 indicates a possible preference of this fungus towards soft plant tissues such as fruits . Some modules such as cellobiohydrolase (GH6 and GH7) involved in cellulose degradation, endo-β-1,4-xylanase (GH11), arabinofuranosidase (GH62) and β-1,4-galactanase (GH53) involved in xylan degradation and, pectin methylesterase (CE8) and rhamnogalacturonan lyase (PL4) involved in pectin degradation were absent in this isolate. This might lead to least efficiency in the degradation of plant cell walls. Among the 31 isolates identified as O. mirabilis in the study by Samerpitak et al. , only a few were isolated from plants.
Using MEROPS analysis, we identified 186 peptidases in UM 578, of which, 47 were secreted enzymes. The highest numbers were from the metallopeptidase family (70 peptidases), and the serine peptidase family (56 peptidases) (Additional file 1: Table S4).
As O. mirabilis are isolated from human skin lesions and nails , we looked for genes encoding keratin degradation proteases and identified seven that are secreted proteases similar to the keratin-associated degradation proteases of the dermatophyte Trichophyton rubrum. These peptidases belong to the families M14A, S10, M28 and S9. One gene (UM578_1644) encoding a secreted metallocarboxypeptidase from the family M14A, shows 42 and 41 % identity to M14A peptidase of T. rubrum [GenBank: ABW79919]  and Metarhizium anisopliae [GenBank: AAB68600] , respectively. This peptidase contains conserved zinc-binding, substrate binding, catalytic sites and cysteine residues (Additional file 2: Figure S5). A total of six secreted carboxypeptidase S10 family were predicted. Of the putative S10 peptidases, UM578_13449 has the best match to the characterised T. rubrum carboxypeptidase SCPA [GenBank: AAS76667]  (43 % identity). Multiple sequence alignment showed the conserved active sites in UM 578_13449 (Ser228, Asp439 and His497) with T. rubrum SCPA and Aspergillus fumigatus Cp1 [GenBank: AAR91697], ortholog of SCPA (Additional file 2: Figure S6). In addition, another S10 peptidase, UM578_7889 was identified similar to T. rubrum carboxypeptidase Y (SCPC) [GenBank: AAS76668]  with 66 % identity.
Furthermore, two genes encoding leucine aminopeptidase (LAP) from the M28 family, UM578_5513 and UM578_7056 exhibited 51 and 54 % identity to T. rubrum LAP1 [GenBank: AAS76670]  and LAP2 [GenBank: AAS76669] , respectively. Multiple sequence alignment was conducted with LAP1 and LAP2 from T. rubrum and A. fumigatus that have been reported to have the same hydrolytic activity  (Additional file 2: Figures S7 and S8). Both genes have the same catalytic sites as reported. The consensus binding sites for the first and second Zn2+ ion in UM578_5513 are His252 and Asp326 and, Glu297 and His424 respectively. Asp264 is the predicted residue bridging the two Zn2+ ions. The UM578_7056 first Zn2+ ion binding site was predicted at His180 and Asp265, and the second Zn2+ ion binding site was predicted at Glu238 and His347. The bridging residue is at Asp199. Lastly, we also identified two genes encoding putative dipeptidyl peptidase IV (DPPIV) and DPPV from family S9. UM578_9285 shares 50 and 53 % identity to DPPIV secreted by T. rubrum [GenBank: AAS76665]  and A. fumigatus [GenBank: AAC34310] , respectively. Another gene, UM578_9264 shares 47 and 52 % identity to DPPV of the T. rubrum [GenBank: AAN03632]  and A. fumigatus [GenBank: AAB67282] , respectively. Both genes have the same conserved catalytic sites as previously reported  (Additional file 2: Figures S9 and S10). The two predicted peptidases have the catalytic triad Ser 619, Asp696, His731 and Ser 566, Asp647, His679.
In addition, we identified a gene (UM578_9214) encoding a putative sulphite efflux pump, Ssu1 (53.17 % identical to Arthroderma otae Ssu1 [GenBank: C5G0E3]). SSU1 is involved in the excretion of sulphite to digest keratin by reducing the disulphide bridges in the cornified cell layers . UM578_9214 has ten membrane-spanning helixes and hydrophilic N- and C- termini, which is consistent with the previous report by Léchenne et al.  (Additional file 2: Figure S11).
It has been hypothesised that the presence of multiple endoproteases of subtilisins (S8 family) and fungalysins (M36 family) enable dermatophytes to invade hosts as they share similar sets of peptidases with non-dermatophytes [28–31]. We compared the abundance of peptidases found in keratinophilic dermatophytes and in UM 578 (Additional file 1: Table S5). However, we could not find any similar abundance pattern of peptidase families between UM 578 and the dermatophytes. From the gene families analysis conducted with 17 fungal genomes (Additional file 1: Table S6), we identified shared genes among T. rubrum, T. verrucosum and UM 578, that are not involved in keratin degradation. Most of the shared genes encode hypothetical proteins while some genes encode the ATPase family associated with various cellular activities, methyltransferase, beta-lactamase, alpha/beta hydrolase, glyoxalase-like, NAD-dependent epimerase and DNA binding proteins. Thus, the secreted proteases in UM 578 do not seem to play a role in the survival of O. mirabilis in human skin and nails.
Peptidases from families M10A, M12A and S33 were found to be involved in the degradation of collagen and extracellular matrix [32–36]. In the UM 578 genome, we predicted four peptidases belonging to the M10A family (matrix metallopeptidases), nine zinc metallopeptidases from the astacin family (M12A) and 12 prolyl aminopeptidases (S33). Among the four M10A peptidases, three (UM578_7033, UM578_12210 and UM578_12211) are similar to myroilysin that was reported to have elastinolytic activity and synergistic effect in collagen degradation . The matrix metallopeptidase from Candida albicans was reported to degrade fibronectin and type I collagen completely as well as partially degrade laminin and type IV collagen [33, 34]. Astacin family peptidases (family M12A) have diverse functions ranging from digestion of food to the processing of extracellular matrix components . Prolyl aminopeptidases from family S33 free the N-terminal residue from a peptide with a preference towards proline, thus, providing an advantage for the fungus to utilise proline-rich substrates such as collagen . As the domestic environment has been reported as a reservoir for fungi causing human infections  and the infection was suggested to occur via moisturised human skin after a shower , these peptidases are likely to provide the nutrient source for O. mirabilis causing skin infections as described for fungi inhabiting moist indoor reservoirs.
Lipases belong to many different protein families that do not show sequence similarity but have the same architecture. Some of these enzymes share similar fold and catalytic machinery [38, 39]. The ubiquitous skin-inhabiting fungus, Malassezia globosa is unable to synthesise fatty acids but has multiple lipases that enable it to assimilate lipids from the skin of its human host . We compared putative lipases in UM 578 with those in several skin-inhabiting and non-skin inhabiting fungi (Additional file 1: Table S7) and found the percentage of predicted lipase genes in UM 578 (0.32 %) to be comparable to that predicted in Candida albicans (0.30 %), an opportunistic skin coloniser (Table 3). The result suggests the possibility of UM 578 employing these lipases for survival on the skin.
To the best of our knowledge, there have been no reports on secondary metabolites produced by Ochroconis. In UM 578, we identified a total of 14 secondary metabolite backbone genes comprising five polyketide synthases (PKS), one PKS-like, three nonribosomal peptide synthases (NRPS), four NRPS-like and four dimethylallyl tryptophan synthases (DMAT). Polyketides are secondary metabolites that are formed from small carbon precursor acids whose successive condensation is catalysed by PKS  and comprise diverse natural products including antibiotics, pigments and mycotoxins.
Melanin plays a significant role in resistance to abiotic stress and pathogenicity in dematiaceous fungi . We found a potential PKS (UM578_2557) responsible for DHN-melanin biosynthesis. The gene is highly similar to the C. heterostrophus PKS18 (70.26 %) [GenBank: AAR90272]  and the Bipolaris oryzae PKS1 (70.11 %) [GenBank: BAD22832] . UM578_2557 contains five conserved domains comprised of ketosynthase (KS), acyltransferase (AT), acyl carrier protein (ACP) and thioesterase (TE) domains in the order KS-AT-ACP-ACP-TE. The arrangement of the domains in the gene is the same as that in the PKS18 and PKS1 genes [43, 44]. Further inspection of the genes near UM578_2557 revealed two genes annotated as transcription factor Cmr1 (UM578_2558) and tetrahydroxynaphthalene reductase (UM578_2559), located downstream to the PKS gene (Additional file 2: Figure S12). This strengthens our postulation that UM578_2557 encodes a PKS to synthesise DHN-melanin precursor as most of the genes involved in secondary metabolism are found in the clusters. Moreover, the gene organisation and orientation of the UM 578 melanin gene cluster is similar to those in C. heterostrophus and Alternaria brassicicola . Another two enzymes that are essential in the DHN-melanin synthesis, scytalone dehydratase (UM578_4032) and trihydroxynaphthalene reductase (UM578_5506), were also found in the genome with 70.63 and 47.58 % identity to Colletotrichum obiculare scytalone dehydratase [GenBank: Q00455] and trihydroxynaphthalene reductase [GenBank: P87025], respectively.
On the other hand, putative aflatoxin (AF)/sterigmatocystin (ST) biosynthesis genes were found in the UM 578 genome (Additional file 1: Table S8). Production of the toxic and carcinogenic AF which was previously reported to be limited to Aspergillus spp. has recently been reported in Fusarium kyushuense . ST, the precursor of AF is produced by diverse fungi . Genetic and biochemical studies suggested that the production of AF, ST together with another toxin dothistromin share common biosynthetic pathways [12, 47]. Dosthistromin is a red toxin that was first isolated from Dothistroma septosporum. The disease caused by this toxin is known as Dothistroma needle blight with broad spectrum toxicity against bacteria, fungi, plant and animal cells . Few fungi from the class Dothideomycetes are also known to produce this toxin . However, it should be noted that homologues of AF/ST biosynthetic genes are known to be involved in functions other than the production of red toxin. At this stage of knowledge, the exact role of these putative genes remains unknown.
Trichothecenes are a family of mycotoxins consisting of more than 200 structurally related sesquiterpenoid metabolites. The toxins are potent protein synthesis inhibitors and apoptosis inducers in eukaryotic cells. Trichothecenes are usually encountered as contaminants of food and animal feeds. The biosynthesis of trichothecenes involves a series of oxygenation, cyclisation and esterification reactions . Trichothecene-producing fungi are found in the order Hypocreales, including Stachybotrys, Tricothecium, Myrothecium, Cephalosporium, Fusarium and Trichoderma. Species from the genus Stachybotrys have been reported as a significant contaminant of the indoor environment and have been associated with damp building-related illness, and the production of satratoxins, roridins and verrucarins types of trichothecenes [50, 51]. The gene families analysis showed a significant number of fungal trichothecene efflux pump (TRI12) domains in the UM 578 genome (Fig. 7a). The function of TRI12 has been postulated to be mainly responsible for self-protection of the fungus by exporting trichothecene outside the cells . We identified the trichodiene synthase encoding gene (UM578_3030) with 48.85 % identity to the Fusarium asiaticum trichodiene synthase [GenBank: Q8NIH6]. Other genes encoding enzymes required for trichothecene biosynthesis were located in other regions of the genome (Additional file 1: Table S9). Furthermore, the genes downstream to the putative trichodiene synthase are two proteins containing the cytochrome P450 monooxygenase domain and one protein containing the TRI12 domain (Additional file 2: Figure S13). Based on these results, we hypothesised that the region of genomic DNA that spans 8 kb from UM578_3030 to UM578_3033 is probably a trichothecene biosynthesis cluster for O. mirabilis. Overall, these findings show that UM 578 is likely to produce trichothecene.
Sexual reproduction enables the exchange of genetic material in eukaryotes to produce recombinants that are better adapted to the environment. In the genus Ochroconis, the sexual-morph of the species is not well known, and the only species with known teleomorph is O. sexualis . In this study, we investigated the involvement of O. mirabilis in sexual reproduction by looking for the presence of genes in mating and meiosis. We managed to identify several potential genes participating in the mating process, signalling, fruiting body development and meiosis (Additional file 1: Table S10). An alpha-box domain containing protein (UM578_3656) with 43 % identity to Fusarium oxysporum MAT-1 gene [GenBank: O59851]  involved in activation of alpha-specific genes was identified. UM578_3656 was located adjacent to the genes encoding DNA lyase APN2 (UM578_3657) and cytochrome C oxidase Vla Cox13 (UM578_3658) (Additional file 2: Figure S14). The presence of these two genes in the mating type gene organisation has been reported in Aspergillus, Coccidioides, Histoplasma and dermatophytes [28, 54]. As only MAT-1 gene was identified in UM 578, this strain could be a heterothallic fungus (Additional file 2: Figure S14).
The homeodomain proteins (HD1 and HD2) and, the alpha-box and HMG domain proteins are the two classes of proteins previously found in the mating type loci and are hypothesised as the ancestral fungal sex determinants. These two classes of sex determinants have undergone gene lost and acquisition in different lineages which resulted in the absence of homeodomain proteins in euascomycetes . A hypothetical protein containing homeodomain was identified upstream of the UM578_3656 gene (Additional file 2: Figure S14). This would be the first euascomycete reported having a putative homeodomain protein in such proximity to the alpha-box. However, further functional validation is required to characterise these proteins. It should also be noted that only pheromone receptors but no pheromone genes were identified in the genome (Additional file 1: Table S10). As pheromone-receptor systems are essential in sexual reproduction , there is a possibility that this fungal strain is not able to mate.
Gene families analysis
The selection of fungal genomes for comparative analysis is based on different lifestyles of the fungal candidates and the characteristics of O. mirabilis [4, 5] (Additional file 1: Table S1). A total of 18,666 gene families were identified using OrthoMCL with 1341 families conserved in all 17 fungal genomes. There are 51 families conserved in Dothideomycetes and 286 UM 578 specific gene families in this study set. Among those UM 578 specific gene families, the F-box domain containing genes formed the largest family (14 families), followed by the genes containing heterokaryon incompatibility domain (7 families) (Additional file 1: Table S11).
In the Pfam family expansion and contraction analysis, 61 families were shown to have undergone changes (P-value ≤ 0.01 for the whole tree), with 50 families significantly expanded and 11 families contracted in UM 578 (Additional file 1: Table S12). The domains enriched in UM 578 can be functionally categorised into proteins involved in transport functions, protein-protein interactions, transcriptional regulation, oxidoreductase activity and hydrolysis functions. In contrast, the contracted families are domains encoding for PKS backbone enzymes, glycoside hydrolase family 28, and LysM domain (Additional file 1: Table S12). Some of the expanded families are also present in gene families only observed in UM 578 such as taurine catabolism dioxygenase TauD/TdfA, trichothecene efflux pump (TRI12), BTB/POZ domain, antibiotic biosynthesis monooxygenase, and alpha/beta hydrolase fold.
Expansion of genes containing the taurine catabolism dioxygenase TauD/TdfA domain was found with an increase from 15 to 22 copies (Fig. 7b). Four TauD/TdfA genes were found in a UM 578 specific gene family. These genes encode alpha-ketoglutarate-dependent dioxygenase function in the catalysis of taurine to sulfite and aminoacetyldehyde . Taurine is a sulfur-containing amino acid present in high concentrations in mammals, marine invertebrates, fish and marine algae. Taurine plays a role in physiological functions in these organisms such as antioxidation, cell cytotoxicity reduction, osmoregulation and membrane stabilisation [57, 58]. Some microorganisms utilise taurine as a sulfur source under sulfate starvation  and as a source for growth . The high number of TauD genes identified suggests the utilisation of taurine as a nutrient source by O. mirabilis.
Recently, a Dothideomycetes, Acidomyces richmondensis was found able to synthesise and degrade taurine in a biofilm study. Taurine was then suggested to act as a compatible solute protecting the microbes from osmotic stress . KEGG annotations showed that UM 578 might produce taurine (Fig. 8). Two genes (UM578_322 and UM578_7794) were mapped to the glutamate decarboxylase (EC 220.127.116.11) in the taurine metabolism pathway. In addition, we managed to identify a gene (UM578_7116) annotated as cysteine lyase that is 53.81 % identical to Saccharomyces pombe cysteine lyase [GenBank: O94350]. This completes the taurine metabolism pathway. Thus, O. mirabilis that is frequently isolated from immensely low water availability environments such as coastal hypersaline and bathroom surfaces , might also acquire taurine in osmoregulation.
Among other enriched Pfam domains, the glutathione-S-transferase domains encode enzymes well-known to be responsible for detoxification by catalysing the conjugation of glutathione to xenobiotics, pesticides and drugs . The Pfam families encoding the N-terminal and C-terminal of glutathione-S-transferase increased from 15 to 30 and 11 to 20 copies respectively (Fig. 7c). RTA1-like protein that is also involved in detoxification has expanded from 12 to 21 copies (Fig. 7d). The RTA1-like protein plays a role in resistance to 7-aminocholesterol and prevents toxicity by binding to toxic substances . The inflation of these Pfam families might contribute to the survival of O. mirabilis in domestic environments that are rich in xenobiotics and compounds toxic to the fungus.
Our in silico genome analysis of O. mirabilis UM 578 revealed potential genes that enable the fungus to thrive in hostile environments and the involvement of mycotoxin production. Our analysis indicated that plant materials may not be the primary source of nutrient for this fungus. Occasional disease in humans may be related to the presence of several putative peptidases involved in the extracellular matrix and collagen degradation together with the action of lipases. The isolate might be heterothallic, and the mating activity remains to be elucidated. The expansion of genes involved in the degradation of taurine and detoxification enables the fungus to survive in the man-made hostile environment. This in-depth analysis of UM 578 genome provides a platform for more targeted functional studies in the future.
Fungal isolate sampling
O. mirabilis UM 578 was isolated from the skin scraping of a patient in the University of Malaya Medical Centre (UMMC), Malaysia. Morphological identification of the isolate was conducted as previously described . The isolate was sub-cultured on Sabouraud Dextrose Agar (SDA, 10 g/L Mycological peptone, 40 g/L glucose and 15 g/L agar; Oxoid, UK). The 14 day-old culture was incubated at 30 °C and the fungal colony was observed. Slide culture was carried out to study the microscopic characteristics.
Molecular identification was conducted accordingly with DNA extraction, amplification of the internal transcribed spacer (ITS) region followed by DNA sequencing . The identity of the fungal isolate was determined via BLASTn search against NCBI-nucleotide database. A total of 30 ITS sequences of all Ochroconis species together with a representative strain from a previous study (UM 314)  and two outgroup strains, Scolecobasidium excentricum and Sympoventuria capensis were obtained from GenBank to construct a phylogenetic tree. Multiple sequence alignment of the ITS sequences was performed using M-Coffee  and all aligned sequences were concatenated into a unique final alignment using T-Coffee. Bayesian tree analysis was conducted using MrBayes . The analysis was carried out using reversible jump Markov Chain Monte Carlo (MCMC) averaging over general time reversible (GTR) rate model space and gamma-distributed rate heterogeneity for all subsets of partitioned scheme. The stationary state frequencies were fixed to be equal. A total of 500,000 generations were run with a sampling frequency of 100, and diagnostics were calculated for every 1000 generations. The first 1250 trees were discarded based on the burn-in setting of 25 %, and convergence was assessed according to the Draft MrBayes version 3.2.1 Manual . Standard deviation of split frequencies below 0.01, potential scale reduction factor (PSRF) reasonably close to 1.0 for all parameters and, no obvious trend for the plot of the generation versus the log probability of the data (the log likelihood values) were observed.
Genomic DNA extraction, sequencing and de novo assembly
A large-scale DNA extraction was conducted based on a modified method as described in Kuan et al. . The sequencing and assembly of UM 578 genome were carried out as described previously . The library was prepared using TruSeq v3 Reagent Kits (Illumina). The 5-kb Illumina sequenced read was then combined with the 500-bp Illumina sequenced read for further processing. Both sets of sequenced reads were pre-processed by trimming two and four bases from the 5’ end of 500-bp and 5-kb reads respectively. Bases with a Phred quality below Qv20 were trimmed from the 3’ end of the reads. The trimmed reads shorter than 50 bp and reads with 40 % bases having Qv ≤ 20 were filtered out using FASTX-Toolkit (http://hannonlab.cshl.edu/fastx_toolkit/). Substitution error correction of pre-processed sequencing reads was performed using Quake version 0.3.5  with 16-mer setting. The error corrected reads were assembled using Velvet version 1.2.07  with k-mer setting = 67, scaffolding = no, insert_length = 500, ins_length_sd = 10, insert_length2 = 5000, ins_length2_sd = 1400 and shortMatePaired2 = yes. Contig sequences assembled from the Velvet were further scaffolded using SSPACE Basic version 2.0 (parameters: −z = 100, −k = 5, −a = 0.3 and -n = 30)  and the GapFiller version 1.10 (parameters: −m = 60, −o = 15, −r = 0.8, −n = 30 and -T = 40) was used to perform gap filling by utilising paired-end information from both libraries .
Gene prediction and annotation
Interspersed repetitive elements and low complexity DNA sequences were masked using RepeatMasker version open-3.3.0 with the Repbase fungal library version rm-20120418, followed by masking off the RNA sequences. The rRNAs were identified using RNAmmer version 1.2  while tRNAs were detected by tRNAscan-SE version 1.3.1 . Prediction of genes was carried out using GeneMark-ES version 2.3e . The function of putative coding sequences (CDSs) was annotated via local BLAST searches against NCBI nr and SwissProt databases. Local BLAST2GO was also conducted to annotate GO and KEGG metabolic pathways . KOG annotation were performed  and Interpro protein domain families match to Pfam database was performed using InterProScan 5 . The GO annotations were plotted using WEGO . Putative transposable elements were identified via PSI-TBLASTN search of the genome with a collection of (retro-) transposon ORF homology profiles from Transposon-PSI (http://transposonpsi.sourceforge.net).
Predicted protein models were submitted to dbCAN  for annotation of Carbohydrate-Active enZymes (CAZymes). A batch blast of UM 578 protein models against MEROPS database  was conducted for peptidases identification. The prediction of secreted proteins was carried out using the method of Ohm et al. . SignalP version 4.1  was used to predict the cleavage sites and signal peptide/non-signal peptide. The transmembrane (TM) domains were identified using TMHMM version 2.0 . Secreted proteins were selected based on the presence of 40 amino acids at N-terminal as TMM and proteins without TM domains. Lipases were predicted by BLASTP search against the Lipase Engineering Database (LED) as previously described  together with six other fungi (Additional file 1: Table S7). Secondary metabolite backbone genes and associated genes for secondary metabolite biosynthesis cluster were predicted using web-based SMURF analysis tool . The organisation of putative gene clusters were retrieved from the genome using sequence viewer Artemis version 12.0 .
Orthologous genes and genome comparative analysis
The predicted proteomes of 17 publicly available fungal genomes were retrieved from several databases (Additional file 1: Table S1). Orthologues in UM 578 were determined by employing OrthoMCL version 2.0.9 . Protein sequences ≥33 amino acids from all the genomes were clustered via all-against-all BLASTP searches. Orthologues were identified as protein sequences with reciprocal best blast hits from distinct genomes. OrthoMCL applies Markov Cluster algorithm  with 1e-5 BLASTP e-value cut-off and 1.5 inflation parameter.
A phylogenomic tree was constructed using predicted proteome clusters generated from the comparative analysis (Additional file 1: Table S1). ClustalW version 2.0  was used to compile individual multiple sequence alignments for 917 single-copy orthologous genes. Spurious sequences or poorly aligned regions were removed using trimAL (with the automated option). A super-alignment with 357,792 characters was concatenated from all individually filtered alignments. Bayesian MCMC analysis was run with a burn-in setting of 25 % and sampling frequency of 100 for 100,000 generations. A mixed amino acid model with gamma-distributed rate variation across sites and a proportion of invariable sites were selected for the phylogenetic analysis.
Gene families expansion and contraction analysis
The protein families of the 17 selected fungi were identified by Pfam analysis using pfam scan.pl search against the Pfam database. The database and tools were downloaded from Sanger Centre FTP site (ftp://ftp.sanger.ac.uk/pub/databases/Pfam/current release/ for database and ftp://ftp.sanger.ac.uk/pub/databases/Pfam/Tools/ for tools). Analysis of Pfam domain expansion and contraction was performed with CAFE software using a stochastic birth and death model . The ultrametric phylogenomic tree and pfam protein domain families were used as input.
Fungal cultures are part of the routine management of infected patients in the Medical Centre and isolates are made anonymous before they are used for studies. As we were not involved in specimen collection and had no data traceable to the identity of the infected patient from whom UM578 was derived, ethical clearance for this study was exempted from the UMMC Medical Ethics Committee (http://umresearch.um.edu.my/doc/File/UMREC/6_CODE%20OF%20RESEARCH%20ETHICS%20%20IN%20UNIVERSITY%20OF%20MALAYA.pdf).
Availability of data and material
The data sets supporting the results of this article are included within the article and its additional files. The nucleotide sequence of O. mirabilis UM 578 ITS region reported in this paper is available at DDBJ/EMBL/GenBank with accession number KP639587. The nucleotide sequence of O. mirabilis UM 578 genome reported in this manuscript is also available at DDBJ/EMBL/GenBank with accession number AZYM00000000. The version described in this paper is version AZYM01000000. The gene models reported can be accessed via in-house database, DemaDb (fungaldb.um.edu.my). The phylogenetic data of ITS-based phylogenetic and phylogenomic trees have been deposited in TreeBase (study number: S18646).
Acyl carrier protein
Carbohydrate active enZymes
Dimethylallyl tryptophan synthases
Internal transcribed spacer
Kyoto encyclopedia of genes and genomes
EuKaryotic Orthologous Group
Markov chain Monte Carlo
Nonribosomal peptide synthases
Sabouraud Dextrose Agar
Machouart M, Samerpitak K, de Hoog GS, Gueidan C. A multigene phylogeny reveals that Ochroconis belongs to the family Sympoventuriaceae (Venturiales, Dothideomycetes). Fungal Divers. 2013;65:77–88.
De Hoog GS, Guarro J, Gene J, Figueras MJ. Atlas of clinical fungi. 2nd ed. Utrecht: Centraalbureau voor Schimmekulture; 2000.
Revankar SG, Sutton DA. Melanized fungi in human disease. Clin Microbiol Rev. 2010;23:884–928.
Lian X, de Hoog GS. Indoor wet cells harbour melanized agents of cutaneous infection. Med Mycol. 2010;48:622–8.
Samerpitak K, Van der Linde E, Choi H-J, Gerrits van den Ende AHG, Machouart M, Gueidan C, et al. Taxonomy of Ochroconis, genus including opportunistic pathogens on humans and animals. Fungal Divers. 2013;65:89–126.
Martin-Sanchez PM, Nováková A, Bastian F, Alabouvette C, Saiz-Jimenez C. Two new species of the genus Ochroconis, O. lascauxensis and O. anomala isolated from black stains in Lascaux Cave, France. Fungal Biol. 2012;116:574–89.
Pakshir K, Ghiasi MR, Zomorodian K, Gharavi AR. Isolation and molecular identification of keratinophilic fungi from public parks soil in Shiraz, Iran. Biomed Res Int. 2013;2013:619576.
Yew SM, Chan CL, Lee KW, Na SL, Tan R, Hoh C-C, et al. A five-year survey of dematiaceous fungi in a tropical hospital reveals potential opportunistic species. PLoS ONE. 2014;9:e104352.
Chan CL, Yew SM, Na SL, Tan Y-C, Lee KW, Yee W-Y, et al. Draft genome sequence of Ochroconis constricta UM 578, isolated from human skin scraping. Genome Announc. 2014;2:2013-4e00074-14.
Lomsadze A, Ter-Hovhannisyan V, Chernoff YO, Borodovsky M. Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Res. 2005;33:6494–506.
Santana MF, Queiroz MV. Transposable elements in fungi: A genomic approach. Scientific J Genetics Gen Ther. 2015;1:12–6.
Ohm RA, Feau N, Henrissat B, Schoch CL, Horwitz BA, Barry KW, et al. Diverse lifestyles and strategies of plant pathogenesis encoded in the genomes of eighteen Dothideomycetes fungi. PLoS Pathog. 2012;8, e1003037.
Li S-F, Gao W-J, Zhao X-P, Dong T-Y, Deng C-L, Lu L-D. Analysis of transposable elements in the genome of Asparagus officinalis from high coverage sequence data. PLoS ONE. 2014;9, e97189.
Ballou DP, Entsch B, Cole LJ. Dynamics involved in catalysis by single-component and two-component flavin-dependent aromatic hydroxylases. Biochem Biophys Res Commun. 2005;338:590–8.
Van Berkel WJH, Kamerbeek NM, Fraaije MW. Flavoprotein monooxygenases, a diverse class of oxidative biocatalysts. J Biotechnol. 2006;124:670–89.
Kato Y, Nakamura K, Sakiyama H, Mayhew SG, Asano Y. Novel heme-containing lyase, phenylacetaldoxime dehydratase from Bacillus sp. strain OxB-1: Purification, characterization, and molecular cloning of the gene. Biochemistry. 2000;39:800–9.
Kato Y, Asano Y. Purification and characterization of aldoxime dehydratase of the head blight fungus. Fusarium graminearum Biosci Biotechnol Biochem. 2005;69:2254–7.
Kato Y, Asano Y. Molecular and enzymatic analysis of the “aldoxime-nitrile pathway” in the glutaronitrile degrader Pseudomonas sp. K-9. Appl Microbiol Biotechnol. 2006;70:92–101.
Amberger A. Cyanamide in plant metabolism. Int J Plant Physiol Biochem. 2013;5:1–10.
Kirubakaran SI, Sakthivel N. Site-directed mutagenesis, heterologous expression of cyanamide hydratase gene and antimicrobial activity of cyanamide. Curr Microbiol. 2008;56:42–7.
Amselem J, Cuomo CA, van Kan JAL, Viaud M, Benito EP, Couloux A, et al. Genomic analysis of the necrotrophic fungal pathogens Sclerotinia sclerotiorum and Botrytis cinerea. PLoS Genet. 2011;7, e1002230.
Zaugg C, Jousson O, Léchenne B, Staib P, Monod M. Trichophyton rubrum secreted and membrane-associated carboxypeptidases. Int J Med Microbiol. 2008;298:669–82.
Joshi L, St. Leger RJ. Cloning, expression, and substrate specificity of MeCPA, a zinc carboxypeptidase that is secreted into infected tissues by the fungal entomopathogen Metarhizium anisopliae. J Biol Chem. 1999;274:9803–11.
Monod M, Léchenne B, Jousson O, Grand D, Zaugg C, Stöcklin R, et al. Aminopeptidases and dipeptidyl-peptidases secreted by the dermatophyte Trichophyton rubrum. Microbiology. 2005;151:145–55.
Beauvais A, Monod M, Wyniger J, Debeaupuis JP, Grouzmann E, Brakch N, et al. Dipeptidyl-peptidase IV secreted by Aspergillus fumigatus, a fungus pathogenic to humans. Infect Immun. 1997;65:3042–7.
Beauvais A, Monod M, Debeaupuis J-P, Diaquin M, Kobayashi H, Latgé J-P. Biochemical and antigenic characterization of a new dipeptidyl-peptidase isolated from Aspergillus fumigatus. J Biol Chem. 1997;272:6238–44.
Léchenne B, Reichard U, Zaugg C, Fratti M, Kunert J, Boulat O, et al. Sulphite efflux pumps in Aspergillus fumigatus and dermatophytes. Microbiology. 2007;153:905–13.
Burmester A, Shelest E, Glöckner G, Heddergott C, Schindler S, Staib P, et al. Comparative and functional genomics provide insights into the pathogenicity of dermatophytic fungi. Genome Biol. 2011;12:R7.
Monod M. Secreted proteases from dermatophytes. Mycopathologia. 2008;166:285–94.
Jousson O, Léchenne B, Bontems O, Capoccia S, Mignon B, Barblan J, et al. Multiplication of an ancestral gene encoding secreted fungalysin preceded species differentiation in the dermatophytes Trichophyton and Microsporum. Microbiology. 2004;150:301–10.
Jousson O, Léchenne B, Bontems O, Mignon B, Reichard U, Barblan J, et al. Secreted subtilisin gene family in Trichophyton rubrum. Gene. 2004;339:79–88.
Chen X-L, Xie B-B, Bian F, Zhao G-Y, Zhao H-L, He H-L, et al. Ecological function of myroilysin, a novel bacterial M12 metalloprotease with elastinolytic activity and a synergistic role in collagen hydrolysis, in biodegradation of deep-sea high-molecular-weight organic nitrogen. Appl Environ Microbiol. 2009;75:1838–44.
Rodier MH, el Moudni B, Kauffmann-Lacroix C, Daniault G, Jacquemin JL. A Candida albicans metallopeptidase degrades constitutive proteins of extracellular matrix. FEMS Microbiol Lett. 1999;177:205–10.
Imbert C, Kauffmann-Lacroix C, Daniault G, Jacquemin L, Rodier MH. Effect of matrix metalloprotease inhibitors on the 95 kDa metallopeptidase of Candida albicans. J Antimicrob Chemother. 2002;49:1007–10.
Bond JS, Beynon RJ. The astacin family of metalloendopeptidases. Protein Sci. 1995;4:1247–61.
Shoulders MD, Raines RT. Collagen structure and stability. Annu Rev Biochem. 2009;78:929–58.
Hilmarsdottir I, Haraldsson H, Sigurdardottir A, Sigurgeirsson B. Dermatophytes in a swimming pool facility: difference in dermatophyte load in men’s and women's dressing rooms. Acta Derm Venereol. 2005;85:267–8.
Fischer M, Pleiss J. The Lipase Engineering Database: a navigation and analysis tool for protein families. Nucleic Acids Res. 2003;31:319–21.
Widmann M, Juhl PB, Pleiss J. Structural classification by the Lipase Engineering Database: a case study of Candida antarctica lipase A. BMC Genomics. 2010;11:123.
Xu J, Saunders CW, Hu P, Grant RA, Boekhout T, Kuramae EE, et al. Dandruff-associated Malassezia genomes reveal convergent and divergent virulence traits shared with plant and human fungal pathogens. Proc Natl Acad Sci. 2007;104:18730–5.
Metz JG, Roessler P, Facciotti D, Levering C, Dittrich F, Lassner M, et al. Production of polyunsaturated fatty acids by polyketide synthases in both prokaryotes and eukaryotes. Science. 2001;293:290–3.
Jacobson ES. Pathogenic roles for fungal melanins. Clin Microbiol Rev. 2000;13:708–17.
Eliahu N, Igbaria A, Rose MS, Horwitz BA, Lev S. Melanin biosynthesis in the maize pathogen Cochliobolus heterostrophus depends on two mitogen-activated protein kinases, Chk1 and Mps1, and the transcription factor Cmr1. Eukaryot Cell. 2007;6:421–9.
Moriwaki A, Kihara J, Kobayashi T, Tokunaga T, Arase S, Honda Y. Insertional mutagenesis and characterization of a polyketide synthase gene (PKS1) required for melanin biosynthesis in Bipolaris oryzae. FEMS Microbiol Lett. 2004;238:1–8.
Schmidt-Heydt M, Häckel S, Rüfer CE, Geisen R. A strain of Fusarium kyushuense is able to produce aflatoxin B1 and G1. Mycotoxin Res. 2009;25:141–7.
Varga J, Frisvad JC, Samson RA. A reappraisal of fungi producing aflatoxins. World Mycotoxin J. 2009;2:263–77.
Schwelm A, Bradshaw RE. Genetics of dothistromin biosynthesis of Dothistroma septosporum: an update. Toxins. 2010;2:2680–98.
Barnes I, Crous PW, Wingfield BD, Wingfield MJ. Multigene phylogenies reveal that red band needle light of Pinus is caused by two distinct species of Dothistroma, D. septosporum and D. pini. Stud Mycol. 2004;50:551–65.
McCormick SP, Stanley AM, Stover NA, Alexander NJ. Trichothecenes: From simple to complex mycotoxins. Toxins. 2011;3:802–14.
Kimura M, Tokai T, Takahashi-Ando N, Ohsato S, Fujimura M. Molecular and genetic studies of Fusarium trichothecene biosynthesis: pathways, genes, and evolution. Biosci Biotechnol Biochem. 2007;71:2105–23.
Pestka JJ, Yike I, Dearborn DG, Ward MDW, Harkema JR. Stachybotrys chartarum, trichothecene mycotoxins, and damp building-related illness: new insights into a public health enigma. Toxicol Sci. 2008;104:4–26.
Alexander NJ, McCormick SP, Hohn TM. TRI12, a trichothecene efflux pump from Fusarium sporotrichioides: gene isolation and expression in yeast. Mol Gen Genet. 1999;261:977–84.
Arie T, Kaneko I, Yoshida T, Noguchi M, Nomura Y, Yamaguchi I. Mating-type genes from asexual phytopathogenic ascomycetes Fusarium oxysporum and Alternaria alternata. Mol Plant Microbe Interact. 2000;13:1330–9.
Fraser JA, Stajich JE, Tarcha EJ, Cole GT, Inglis DO, Sil A, et al. Evolution of the mating type locus: Insights gained from the dimorphic primary fungal pathogens Histoplasma capsulatum, Coccidioides immitis, and Coccidioides posadasii. Eukaryot Cell. 2007;6:622–9.
Ni M, Feretzaki M, Sun S, Wang X, Heitman J. Sex in Fungi. Annu Rev Genet. 2011;45:405–30.
Eichhorn E, van der Ploeg JR, Kertesz MA, Leisinger T. Characterization of alpha-ketoglutarate-dependent taurine dioxygenase from Escherichia coli. J Biol Chem. 1997;272:23031–6.
Huxtable RJ. Physiological actions of taurine. Physiol Rev. 1992;72:101–63.
Schuller-Levis GB, Park E. Taurine: new implications for an old amino acid. FEMS Microbiol Lett. 2003;226:195–202.
Denger K, Smits THM, Cook AM. Genome-enabled analysis of the utilization of taurine as sole source of carbon or of nitrogen by Rhodobacter sphaeroides 2.4.1. Microbiology. 2006;152:3197–206.
Mosier AC, Justice NB, Bowen BP, Baran R, Thomas BC, Northen TR, et al. Metabolites associated with adaptation of microorganisms to an acidophilic, metal-rich environment identified by stable-isotope-enabled metabolomics. MBio. 2013;4:e00484–12.
Gostinčar C, Grube M, Gunde-Cimerman N. Evolution of fungal pathogens in domestic environments? Fungal Biol. 2011;115:1008–18.
Morel M, Ngadin AA, Droux M, Jacquot J-P, Gelhaye E. The fungal glutathione S-transferase system. Evidence of new classes in the wood-degrading basidiomycete Phanerochaete chrysosporium. Cell Mol Life Sci. 2009;66:3711–25.
Soustre I, Letourneux Y, Karst F. Characterization of the Saccharomyces cerevisiae RTA1 gene involved in 7-aminocholesterol resistance. Curr Genet. 1996;30:121–5.
Moretti S, Armougom F, Wallace IM, Higgins DG, Jongeneel CV, Notredame C. The M-Coffee web server: A meta-method for computing multiple sequence alignments by combining alternative alignment methods. Nucleic Acids Res. 2007;35:W645–8.
Huelsenbeck JP, Ronquist F. MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics. 2001;17:754–5.
Ronquist F, Huelsenbeck J, Teslenko M. Draft MrBayes version 3.2 Manual : Tutorials and Model Summaries. 2011.
Kuan CS, Yew SM, Toh YF, Chan CL, Ngeow YF, Lee KW, et al. Dissecting the fungal biology of Bipolaris papendorfii: From phylogenetic to comparative genomic analysis. DNA Res. 2015;22:219–32.
Kelley DR, Schatz MC, Salzberg SL. Quake: Quality-aware detection and correction of sequencing errors. Genome Biol. 2010;11:R116.
Zerbino DR, Birney E. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18:821–9.
Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics. 2011;27:578–9.
Boetzer M, Pirovano W. Toward almost closed genomes with GapFiller. Genome Biol. 2012;13:R56.
Lagesen K, Hallin P, Rødland EA, Staerfeldt H-H, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35:3100–8.
Lowe TM, Eddy SR. tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–64.
Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: A universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21:3674–6.
Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, et al. The COG database: An updated version includes eukaryotes. BMC Bioinformatics. 2003;4:41.
Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, et al. InterProScan: protein domains identifier. Nucleic Acids Res. 2005;33:W116–20.
Ye J, Fang L, Zheng H, Zhang Y, Chen J, Zhang Z, et al. WEGO: A web tool for plotting GO annotations. Nucleic Acids Res. 2006;34:293–7.
Yin Y, Mao X, Yang J, Chen X, Mao F. dbCAN: A web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2012;40:W445–51.
Rawlings ND, Barrett AJ, Bateman A. MEROPS: The database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 2012;40:D343–50.
Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: Discriminating signal peptides from transmembrane regions. Nat Methods. 2011;8:785–6.
Krogh A, Larsson B, von Heijne G, Sonnhammer EL. Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes. J Mol Biol. 2001;305:567–80.
Khaldi N, Seifuddin FT, Turner G, Haft D, Nierman WC, Wolfe KH, et al. SMURF: Genomic mapping of fungal secondary metabolite clusters. Fungal Genet Biol. 2010;47:736–41.
Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream MA, et al. Artemis: Sequence visualization and annotation. Bioinformatics. 2000;16:944–5.
Li L, Stoeckert CJ, Roos DS. OrthoMCL: Identification of ortholog groups for eukaryotic genomes. Genome Res. 2003;13:2178–89.
Van DS. Graph clustering by flow simulation. Phd Thesis. Netherlands: University of Utrecht; 2000.
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–8.
De Bie T, Cristianini N, Demuth JP, Hahn MW. CAFE: A computational tool for the study of gene family evolution. Bioinformatics. 2006;22:1269–71.
Zhao Z, Liu H, Wang C, Xu J-R. Correction: Comparative analysis of fungal genomes reveals different plant cell wall degrading capacity in fungi. BMC Genomics. 2014;15:6.
This study was supported by High Impact Research MoE Grant UM.C/625/1/HIR/MOHE/MED/31 (Account no. H-20001-00-E000070) from the Ministry of Education Malaysia and the Postgraduate Research Grant (PPP) PV051/2012A from the University of Malaya.
The authors declare that they have no competing interests.
SMY CLC YFN KPN conceived and designed the experiments. CSK YFT SLN performed the experiments. SMY CLC KWL WYY performed data analyses and interpretation. SMY CCH KWL WYY YFN KPN drafted the manuscript. All authors read and approved the final manuscript.
List of genome sequences used in gene families analysis and phylogenomic tree construction. Table S2. Number of CAZyme modules predicted in UM 578 genome. Table S3. Plant cell wall and fungal cell wall degrading and modifying CAZyme families predicted in UM 578. Table S4. Number of peptidases predicted in UM 578. Table S5. Comparison of putative secreted proteases families which had been reported to be expanded in dermatophytes. Table S6. Gene families clusters shared among UM 578 with Trichophyton rubrum and T. verrucosum in this study. Table S7. Predicted lipases in the genomes of UM 578 with skin-inhabiting and non-skin inhabiting fungi. Table S8. Putative biosynthetic pathway genes involved in aflatoxin (AF), sterigmatocystin (ST) and dothistromin (DOT) production predicted in UM 578. Table S9. Putative biosynthetic pathway genes for trichothecene production predicted in UM 578. Table S10. Predicted gene involved in sexual reproduction in UM 578. Table S11. UM 578 specific gene families clusters compared to 16 publicly available fungal genomes. Table S12. Expansion and contraction of Pfam families from CAFE analysis. The P-value for whole family expansion/ contraction is shown as family wide P-value and the node specific for UM 578 is shown (Additional file 2: Figure S15). Only families with family wide P-value ≤0.01 are shown. (XLS 307 kb)
KEGG map of styrene degradation. Genes annotated via KEGG are shaded. Z-phenylacetaldoxime degradation by nitrilase (EC 18.104.22.168), nitrile hydratase (EC 22.214.171.124) and amidase (EC 126.96.36.199). Although the phenylacetaldoxime dehydratase (EC 188.8.131.52) was not mapped, the gene was found in the genome. Figure S2. Alignment of putative phenylacetaldoxime dehydratase of O. mirabilis UM 578 (UM578_4049) with Bacillus sp. OxB-1 (P82604). Identical and similar residues are black and gray shaded respectively. The haem-containing dehydratase region is indicated by asterisk. Figure S3. Putative aldoxime-nitrile pathway gene cluster of UM 578. The phyenylacetaldoxime dehydratase (UM578_4049) and nitrilase (UM578_5050). The direction of transcription is indicated by the arrow for each gene. Figure S4. KEGG map of atrazine degradation. Genes annotated via KEGG are shaded. Cyanamide was degraded by cyanamide hydratase (EC 184.108.40.206) and urease (EC 35.1.5). Figure S5. Alignment of predicted metallopeptidase M14A of O. mirabilis UM 578 (UM578_1644). Alignment was carried out with metallopeptidase MeCPA from Metarhizium anisopliae (AAB68600) and TruMcpA from Trichophyton rubrum (ABW79919). Identical and similar residues are black and gray shaded, respectively. The zinc-binding residues are indicated by an asterisk. The active-site residues are indicated by circles. Conserved residues involved in substrate binding are indicated by solid triangles. The conserved Cys residues forming disulfide bridges are indicated by solid rhombus. Figure S6. Alignment of predicted serine carboxypeptidase of O.mirabilis UM 578 (UM578_13449). Alignment was carried out with TruSCPA from Trichophyton rubrum (AAS76667) and AfuCp1 from Aspergillus fumigatus (AAR91697). Identical and similar residues are black and gray shaded respectively. The consensus active residues are indicated by asterisk (Ser228, Asp439 and His497). Figure S7. Alignment of predicted leucine aminopeptidase (LAP) of O. mirabilis UM 578 (UM578_7056). Alignment was carried out with TruLAP1 from Trichophyton rubrum (AAS76670) and AfuLAP1 from Aspergillus fumigatus (AAR996058). Identical and similar residues are black and gray shaded respectively. The consensus binding sites for the first and the second Zn2+ ion binding sites are indicated in triangle (His180 and Asp265) and in rhombus (Glu238 and His347) respectively. The Asp199 is the residue bridging the two Zn2+ ions is indicated in circle. The active sites (Asp182 and Glu237) are indicated by asterisk. Figure S8. Alignment of predicted leucine aminopeptidase (LAP) of O.mirabilis UM 578 (UM578_5513). Alignment was carried out with TruLAP2 from Trichophyton rubrum (AAS76669) and AfuLAP2 from Aspergillus fumigatus (AAR96059). Identical and similar residues are black and gray shaded respectively. The consensus binding sites for the first and the second Zn2+ ion binding sites are indicated in triangle (His252 and Asp326) and in rhombus (Glu297 and His424) respectively. The Asp264 is the residue bridging the two Zn2+ ions is indicated in circle. The active sites (Asp254 and Glu296) are indicated in asterisk. Figure S9. Alignment of predicted dipeptidyl peptidase IV (DPPIV) of O. mirabilis UM 578 (UM578_9285). Alignment was carried out with TruDPPIV from Trichophyton rubrum (AAS76665) and AfuDPPIV from Aspergillus fumigatus (AAC34310). Identical and similar residues are black and gray shaded respectively. The catalytic triad is indicated in asterisk (Ser619, Asp 696, His731). Figure S10. Alignment of predicted dipeptidyl peptidase V (DPPV) of O. mirabilis UM 578 (UM578_9264). Alignment was carried out with TruDPPIV from Trichophyton rubrum (AAN03632) and AfuDPPIV from Aspergillus fumigatus (AAB67282). Identical and similar residues are black and gray shaded respectively. The catalytic triad is indicated in asterisk (Ser566, Asp647, His679). Figure S11. TMpred output of putative sulphite efflux pump (ssu1) in UM 578. The putative gene, UM578_9214 has ten membrane-spanning helixes and hydrophilic N- and C- termini. Figure S12. Putative melanin biosynthesis cluster in UM 578. The organisation and orientation of genes involved in melanin biosynthesis are similar to that of the reported melanin gene cluster in C. heterostrophus [GenBank: AAR90272] and A. brassicicola [GenBank: BAD22832]. The predicted genes encode polyketide synthase (UM578_2557), transcription factor Cmr1 (UM578_2558) and tetrahydroxynaphthalene reductase (UM578_2559). The direction of transcription is indicated by the arrow for each gene. Figure S13. Putative trichothecene biosynthesis cluster in UM 578. The 6751 bp cluster encompasses the trichodiene synthase (UM578_3030) with two cytochrome P450 encoding genes (UM578_3031 and UM578_3032) and the trichothecene efflux pump (UM578_3033). The direction of transcription is indicated by the arrow for each gene. Figure S14. Putative gene organisation of mating type genes in UM 578. The neighbouring genes of alpha-domain containing gene (UM578_3656) encompass the homeodomain-containing protein (UM578_3655), DNA lyase APN2 (UM578_3657) and cytochrome C oxidase Vla Cox13 (UM578_3658). The direction of transcription is indicated by the arrow for each gene. Figure S15. Phylogenomic tree showing number of each node in expansion/ contraction analysis. The number of genes and P-value for UM 578 (node 24) and the internode (node 23) are shown in Additional file 1: Table S12. (PDF 1000 kb)
About this article
Cite this article
Yew, S.M., Chan, C.L., Kuan, C.S. et al. The genome of newly classified Ochroconis mirabilis: Insights into fungal adaptation to different living conditions. BMC Genomics 17, 91 (2016). https://doi.org/10.1186/s12864-016-2409-8
- Ochroconis mirabilis
- Genome sequence