Genome and secretome analysis of the hemibiotrophic fungal pathogen, Moniliophthora roreri, which causes frosty pod rot disease of cacao: mechanisms of the biotrophic and necrotrophic phases

Background The basidiomycete Moniliophthora roreri is the causal agent of Frosty pod rot (FPR) disease of cacao (Theobroma cacao), the source of chocolate, and FPR is one of the most destructive diseases of this important perennial crop in the Americas. This hemibiotroph infects only cacao pods and has an extended biotrophic phase lasting up to sixty days, culminating in plant necrosis and sporulation of the fungus without the formation of a basidiocarp. Results We sequenced and assembled 52.3 Mb into 3,298 contigs that represent the M. roreri genome. Of the 17,920 predicted open reading frames (OFRs), 13,760 were validated by RNA-Seq. Using read count data from RNA sequencing of cacao pods at 30 and 60 days post infection, differential gene expression was estimated for the biotrophic and necrotrophic phases of this plant-pathogen interaction. The sequencing data were used to develop a genome based secretome for the infected pods. Of the 1,535 genes encoding putative secreted proteins, 1,355 were expressed in the biotrophic and necrotrophic phases. Analysis of the data revealed secretome gene expression that correlated with infection and intercellular growth in the biotrophic phase and invasive growth and plant cellular death in the necrotrophic phase. Conclusions Genome sequencing and RNA-Seq was used to determine and validate the Moniliophthora roreri genome and secretome. High sequence identity between Moniliophthora roreri genes and Moniliophthora perniciosa genes supports the taxonomic relationship with Moniliophthora perniciosa and the relatedness of this fungus to other basidiomycetes. Analysis of RNA-Seq data from infected plant tissues revealed differentially expressed genes in the biotrophic and necrotrophic phases. The secreted protein genes that were upregulated in the biotrophic phase are primarily associated with breakdown of the intercellular matrix and modification of the fungal mycelia, possibly to mask the fungus from plant defenses. Based on the transcriptome data, the upregulated secreted proteins in the necrotrophic phase are hypothesized to be actively attacking the plant cell walls and plant cellular components resulting in necrosis. These genes are being used to develop a new understanding of how this disease interaction progresses and to identify potential targets to reduce the impact of this devastating disease.


Results:
We sequenced and assembled 52.3 Mb into 3,298 contigs that represent the M. roreri genome. Of the 17,920 predicted open reading frames (OFRs), 13,760 were validated by RNA-Seq. Using read count data from RNA sequencing of cacao pods at 30 and 60 days post infection, differential gene expression was estimated for the biotrophic and necrotrophic phases of this plant-pathogen interaction. The sequencing data were used to develop a genome based secretome for the infected pods. Of the 1,535 genes encoding putative secreted proteins, 1,355 were expressed in the biotrophic and necrotrophic phases. Analysis of the data revealed secretome gene expression that correlated with infection and intercellular growth in the biotrophic phase and invasive growth and plant cellular death in the necrotrophic phase.
Conclusions: Genome sequencing and RNA-Seq was used to determine and validate the Moniliophthora roreri genome and secretome. High sequence identity between Moniliophthora roreri genes and Moniliophthora perniciosa genes supports the taxonomic relationship with Moniliophthora perniciosa and the relatedness of this fungus to other basidiomycetes. Analysis of RNA-Seq data from infected plant tissues revealed differentially expressed genes in the biotrophic and necrotrophic phases. The secreted protein genes that were upregulated in the biotrophic phase are primarily associated with breakdown of the intercellular matrix and modification of the fungal mycelia, possibly to mask the fungus from plant defenses. Based on the transcriptome data, the upregulated secreted proteins in the necrotrophic phase are hypothesized to be actively attacking the plant cell walls and plant cellular components resulting in necrosis. These genes are being used to develop a new understanding of how this disease interaction progresses and to identify potential targets to reduce the impact of this devastating disease.

Background
Fungal plant pathogens can be classified as biotrophic, necrotrophic or hemibiotrophic pathogens based on how they interact with their host. Biotrophic pathogens cause only minor responses from the plant, particularly at initial stages of the disease. These biotrophic pathogens appear to evade plant defenses with stealthy methods [1]. Fungal biotrophs are often obligate pathogens, typically having narrow host ranges, possessing haustoria and secreting limited amounts of lytic enzymes [2]. On the other hand, infection by necrotrophic pathogens causes rapid cell death in hosts and elicit major molecular responses from the plant. Necrotrophs appear to utilize brute force and overwhelm the plant defenses. Necrotrophs are typically non-obligate pathogens, have wide host ranges and secrete copious amounts of lytic enzymes and toxins [2]. Hemibiotrophs initiate infection with a period of biotrophy, followed by a necrotrophic phase, and they possess properties of both groups. However, most of our understanding of how hemibiotrophs interact with their hosts is derived from these two extremes.
Both biotrophic and necrotrophic fungi share common elements but these may have different purposes when causing disease. During the host-interaction, the pathogens synthesize and secrete various peptides/proteins that block host responses (biotrophs) or kill the host cells (necrotrophs). Among biotrophs, the rust fungus of flax, Melampsora lini, excretes cysteine-rich avirulence elicitor proteins from the haustoria [3] and the bean rust fungus Uromyces fabae shows highly coordinated stage specific regulation of its secreted proteins [4]. The necrotrophic fungus Fusarium graminearum has 109 secreted cell wall degrading genes in its genome [5], while Alternaria species secrete non-host and host specific toxins that disrupt photosynthesis and kill plant cells [6]. Therefore, a detailed understanding of specific peptides/proteins secreted during the host-pathogen interaction is vital to elucidate the biotrophic and necrotrophic mechanisms.
Moniliophthora roreri (Cif.) H.C. Evans, Stalpers, Samson & Benny [7] causes Frosty Pod Rot (FPR), a devastating pod disease of Theobroma cacao L. (cacao), the source of cocoa powder and cocoa butter. Phylogenetically, M. roreri is related to another fungal pathogen, Moniliophthora perniciosa (Stahel) Aime and Phillips-Mora [8], which causes Witches' Broom Disease (WBD), a disease that infects all cacao meristematic tissues including flowers, shoots, and pods [8]. Together, these pathogens cause two of the most economically important diseases of Theobroma cacao in the Western Hemisphere [9,10]. While both of these fungal species are pathogenic on the plant genera Theobroma and Herrania, M. roreri is not known to have any other hosts, whereas M. perniciosa has distinct biotypes that infect different host species [11]. Historically, these Moniliophthora pathogens have spread independently to cacao producing areas across the Western Hemisphere and they have typically resulted in production losses of 75% or higher in nearly all the cacao growing regions in the Americas [10,12].
Moniliophthora roreri and M. perniciosa are both hemibiotrophic pathogens, but have distinctive lifestyles and pathogenicity strategies. Unlike most other hemibiotrophic fungi, both Moniliophthora species have protracted biotrophic stages or phases that last three to six weeks. They also have distinctly different mycelial morphologies present at the beginning and the end of the disease process [10,13]. In both Moniliophthora diseases, the infected plant tissues are asymptomatic for 14 to 21 days. After that period of time, these tissues typically begin to show some form of altered growth or swelling that continues for the remainder of the biotrophic phase, culminating with the necrosis of the host tissues, which marks the beginning of the necrotrophic phase. In other hemibiotrophic fungi such as Cladosporium fulvum, Mycosphaerella graminicola, Pyrenopeziza brassicae and Septoria tritici, the biotrophic phase is asymptomatic with undifferentiated mycelia and lasts no more than two weeks [14].
As with hemibiotrophic Colletotrichum species, both M. roreri and M. perniciosa undergo distinct metabolic changes as they transition from the biotrophic phase to the necrotrophic phase of growth [15][16][17][18][19]. These metabolic changes suggest significant shifts in gene expression and in enzymatic processes.
In this study we present the genome of M. roreri and compare it to the genome of the closely related cacao pathogen, M. perniciosa. Transcriptional analysis of the genome will focus on genes with putative signal peptides from the M. roreri genome and compare them with expressed genes, identified through RNA sequencing of infected pods, to discover potential secreted proteins expressed during the disease process. Genes expressed in the early and late stages of FPR on pods were used to identify secreted genes that are potentially important during the biotrophic or necrotrophic phase of the disease cycle. The results presented here are the first analyses of genes encoding secreted proteins derived from the M. roreri genome. These new insights into the M. roreri genome, transcriptome and differential gene regulation provide a better understanding of the plant-pathogen interactions that occur during FPR.

Genome structure
Moniliophthora roreri has a genome size of 52.3 Mbp, which is 7.7 Mbp larger than the 44.6 Mbp found in the M. perniciosa genome (Table 1). Despite the size difference, M. roreri had only 912 additional coding sequences (CDS) than M. perniciosa. 17,920 coding sequences were detected in M. roreri. The average GC% for the M. roreri genome (46.88%) is lower than for the M. perniciosa genome (47.81%); however, the average G + C for the M. roreri CDS is nearly identical to the value for M. perniciosa. Gene density for M. roreri was also found to be lower than for M. perniciosa.
An analysis of the genomes of these pathogens has revealed that retro-transposons (LTRs) contribute to about 6.6% (3.46 Mbp) of the M. roreri genome compared to about 0.74% (0.332 Mbp) in the M. perniciosa genome. In M. perniciosa, transposable elements were reported to be active elements that may contribute to genetic variability of this pathogen [20]. Each genome was scanned for repetitive sequences and low complexity DNA using the Repeat Modeler and RepeatMasker programs. M. roreri had 7,060,129 bases associated with repeats and repetitive elements while M. perniciosa had 1,737,865 bases associated with repeats and repetitive elements (Additional file 1).

Synteny
Using a bi-directional blast with an E-value of E-04 as the cut-off value for two proteins being considered a match, a whole genome comparison of the predicted proteins between the M. roreri and M. perniciosa genomes was performed. This analysis revealed that 16,713 protein encoding genes (93%) from M. roreri have sequence similarity to M. perniciosa protein encoding genes and 15,674 protein encoding genes (92%) from M. perniciosa have sequence similarity to M. roreri protein encoding genes. At lower E-values, E-25 and E 0.0, the M. roreri genome open reading frames (ORFs) are 87% and 40% similar to the M. perniciosa genome ORFs, respectively. In M. roreri there are 8,851 predicted genes or 49% that have no predicted function and are listed as hypothetical genes. Synteny between M. roreri and M. perniciosa was analyzed with the program MUMmer using the NUCmer setting that compares the contigs from both genomes at the nucleotide level. In this analysis, M. roreri contigs were queried using M. perniciosa contigs as the reference sequence. Contigs with high sequence identity resulted in a straight line graph, while translocations and inversions result in either a parallel shift in the line (up or down) or a line at a negative angle, respectively. Though both genomes are closely related at the nucleotide level, there are regions where inversions and translocations have potentially occurred. This analysis confirms that these two genomes are very similar and most probably homologous, while it also shows that some regions in both genomes are unique. Due to the poor resolution of the MUMmer plot when all genome contigs were analyzed, we compared long intact regions of the genomes. Two hundred and fifty contigs from M. roreri and 233 contigs from M. perniciosa were compared that were at least 50 kb in length. Contigs with no corresponding sequence identity were eliminated, so that only contigs of high sequence identity between the two genomes were compared ( Figure 1). The results yielded a final comparison of 222 contigs from M. roreri and 207 contigs from M. perniciosa and showed highly similar sequences with slight shifts off the diagonal line, which indicate translocations in the contigs (Figure 1 insert that shows an expanded region of the main figure).

Comparisons to other basidiomycete fungi
Bidirectional blast analysis was conducted with M. roreri, M. perniciosa, Laccaria bicolor, Coprinopsis cinerea (Coprinus cinereus), and Ustilago maydis. These other genomes were selected because all are basidiomycetes but exhibit differing environmental adaptations representing a mushroom producing ectomycorrhizae a mushroom producing saprophyte and a pathogenic smut fungus, respectively. Figure 2 presents the Venn-diagram of the analysis, and the tax plot is shown in Table 2. Each of the five genomes is represented by an oval, and the numbers indicate how many genes are included within that intersect, at an E-value of E-04. Using this analysis, genes can be identified that are exclusive to a particular genome or to a particular group, which enables identification of putative similarities such as pathogenicity genes, and genus specific genes. The number of predicted genes particular to a single genome is: 1,133 for (A) M. roreri, 1,202 for (B) M. perniciosa, 5,568 for (C) L. bicolor, 2,539 for (D) C. cinerea and 1,324 for (E) U. maydis. Only seven of the 1,133 M. roreri genes were detected by RNA-Seq analysis (described below) and only 16 of the 1,133 M. roreri genes have putative functions (Table 3). Based on the annotation, several of these genes appear to have a role in genome integrity and function (a RNA polymerase Rpb1 C-terminal repeat domain-containing protein; a putative RecQ helicase; a P-loop containing nucleoside triphosphate hydrolase protein) while another set appears to be linked to retro  elements (a Gag protein, retrovirus-related Pol polyprotein from transposon TNT 1-94 and a reverse transcriptase protein). The remaining genes have similarity to a mix of toxins (volvatoxin A2 precursor, a dipterans toxic crystal protein); heme binding (a Hemopexin domaincontaining protein); cell adhesion (a calcium-binding tyrosine phosphorylation-regulated protein); pectin degradation (a probable pectin lyase precursor); possible oxidoreductases (two FAD NAD-binding domain-containing proteins); a putative CNVH-domain-containing protein and two capsule polysaccharide biosynthesis proteins. The ABCDE intersection contains 3,048 genes (data not shown), which are common to all of these basidiomycetes, while there are 34 genes associated with ABE and 70 with ABC. The Moniliophthora intersect (AB) has 1,287 genes, of which only 106 or 8% have a putative function (Table 4). Among the AB genes there are five antibiotic biosynthesis monooxygenases; five feruloyl esterases involved in hemicellulose degradation; four cellulose binding proteins; four cell wall glycosyl hydrolases, involved in cell wall sugar degradation; four chitin binding proteins; four het domain proteins associated with fungal vegetative incompatibility and mycelial cell death; three pectate lyases involved in pectin degradation; three family 12 carbohydrate esterases, a family that includes pectin acetylesterases, rhamnogalacturonan acetylesterases and acetyl xylan esterases; three biotrophy-associated secreted proteins that are putative effectors and 2 amidohydrolases, enzymes that act on amide bonds. Most of these AB intersect genes were constitutively expressed, but RNA-Seq data also showed differential gene expression for 14 genes, which are indicated in Table 4 (Additional file 1). In the necrotrophic phase the following genes were upregulated: an antibiotic biosynthesis monooxygenase, two biotrophy-associated secreted proteins, a cellulose binding protein, a FAD binding domain    protein, an integral membrane protein, a lactam utilization protein, a LEA domain-containing protein, a thaumatinlike protein, a transferase family protein and a urea hydrolase cyanamide protein. In the biotrophic phase a cell wall glycosyl hydrolase, a family 16 glycoside hydrolase and a NAD binding phosphogluconate dehydrogenate-like protein were upregulated (Additional file 1, AB intersect sheet). The pathogenic intersect (ABE) has 30 genes with known functions and these are shown in Table 5.
The biotrophic intersect (ABC) has 17 genes with putative functions (Table 6). RNA-Seq data revealed that most of ABC and ABE intersect genes are constitutively expressed (Additional file 1) at low to mid levels (adjusted means at 2 to 50 reads = Low level; 50 to 125 reads = mid level and >125 = high level of gene expression), with the exception of two genes from the ABC intersect, (a transcriptional family alpha/beta fold family protein and one hypothetical protein) that were differentially expressed in the necrotrophic phase (Additional file 1, ABE and ABC intersect sheets).

Identification of secreted proteins
Because of the hemibiotrophic nature of M. roreri and the role that secreted proteins are expected to play in that interaction, we analyzed the genome for genes that express secreted proteins. The first screening identified all of the putative proteins with signal peptides. In M. roreri 1,752 proteins (9.7%) were found to have signal peptides, which is slightly lower than M. perniciosa which has 1,810 proteins with signal peptides. Next the proteins with signal peptides were analyzed for transmembrane domains, to remove proteins that are expected to be incorporated into the membrane of the fungus. Ninety-two of the 1,752 proteins have two or more transmembrane  domains. One hundred and twenty-five glycosylphosphatidylinositol (GPI) anchor proteins were also excluded from the list of secreted proteins (Table 7). While the GPIanchored proteins are normally removed from secreted proteins, the posttranslational modification of glypiation is the only method of attachment to the membrane for these proteins, and because the bonds are subject to phospholipase cleavage, these proteins could still be released into the extracellular space via an enzymatic release process. After removal of proteins with transmembrane and GPI moieties the total number of M. roreri secreted proteins was 1,535, which is less than the 1,596 secreted proteins predicted for M. perniciosa.

Expression of the secretome
In this study we wanted to verify the expression of the predicted genes, including the secretome identified in the genome analysis. We utilized a replicate set of infected pod tissues from a previous study to obtain transcriptomes representing the biotrophic and necrotrophic phases of the disease [21]. While our present focus is on the secreted proteins, the complete fungal transcriptome, for FPR, in different resistant plant backgrounds will be addressed in a separate treatise. The planting material used represents a segregating F1 population of progeny from highly susceptible clones (PA16 × SIC433 and PA16 × EEG29). Briefly, the pods used in this study were derived from flowers hand-pollinated 60 days prior to infection with a spore suspension of 1×10 8 M. roreri spores ml -1 .
The infected pods were frozen in liquid nitrogen and lyophilized at 30 days post infection (DPI) and at 60 DPI. Based on typical progression of FPR symptoms [21], three malformed green pods were selected for RNA-Seq and RT-qPCR from 30 DPI pods (biotrophic phase) and three necrotic sporulating pods were selected for RNA-Seq and RT-qPCR from 60 DPI pods (necrotrophic phase (1.33%) and 503,459 (0.73%), respectively. When all of the libraries were compared 13,759 genes (77% of the predicted CDS) from the M. roreri genome were found to be expressed in these infected pods. The secreted proteins were then separated from the other expressed genes. Among the 1,535 secreted protein genes identified 1,355 or 88%, were expressed at some level under the tested conditions. Using a P-value of 0.05, we found that 222 of the secreted protein genes were differentially expressed between the 30 DPI and 60 DPI pods. All of the expressed secreted protein genes are listed in Additional file 2. The data revealed phase-dependent differential gene expression, where 164 of the genes were up-regulated in the 60 DPI pods while 58 were upregulated in the 30 DPI pods. Since the hypothetical genes can provided no information regarding their putative function they were eliminated from further consideration with regards to their role in the disease process.
Genes expressed during the early stages of infection and disease development reflect gene products that function in the biotrophic phase of the disease development, which can be initially asymptomatic. Late-stage gene expression  yields gene products that function in the necrotrophic phase, which is associated with pod chlorosis and necrosis, and possibly with sporulation of the fungus on pod tissue. To validate the differential expression of genes encoding secreted proteins as suggested by the RNA-Seq analysis, RT-qPCR was conducted, with a selected number of genes comparing expression in a set of seven malformed green pods at 30 DPI and seven necrotic pods at 60 DPI. At 30 DPI, 34 genes encoding secreted proteins with putative functions were expressed, with one being expressed only at this stage (Table 8). At 60 DPI, 105 genes encoding predicted secreted proteins with putative functions were expressed (Table 9). Forty-one of the 105 60 DPI genes were detected only in the necrotrophic phase. RT-qPCR confirmed the up-regulation of the 30 DPI and 60 DPI genes (Tables 8 and 9). Although the GPI-anchored proteins were removed from the secretome we did find a number of them highly expressed in both the biotrophic and necrotrophic phases. Table 10 provides a list of the most highly expressed genes encoding GPI-anchored proteins in both phases since controlled release of these proteins from the fungal membrane is possible. Several gene families of secreted proteins were prominently upregulated in the biotrophic phase. In this phase the 34 genes with known functions were primarily associated with nutrient acquisition and with plant and fungal cell wall modification. The largest single group identified was glycoside hydrolases (GH). Eleven glycoside hydrolases genes were upregulated: two genes from GH family 5; five genes from GH family 18; and one gene each from the GH families 16, 28, 43, 92. The GH family 18 genes are homologous to chitinase genes (EC 3.2.1.14) [22] and GH family 5 genes are endo-enzymes that are capable of hydrolyzing both β-mannans and β-glucans [23] and are homologous to beta-1,3-glucanase, which includes endoglucanase, beta-mannanase, exo-1,3-glucanase, endo-1,6-glucanase, xylanase, and endoglycoceramidase. The GH [27] (Additional file 2). In addition to the GHs, other enzymes were identified such as galactose-1-epimerase, a carbonic anhydrase, a family 8 carbohydrate esterase (pectin methylesterase), and a tryptophan dimethylally transferase. The plant cell wall-modifying enzymes identified in the biotrophic phase include an endo-polygalacturonase, a galactan 1,3-beta-galactosidase, a cell wall glycosyl hydrolase, a catalase, an acetylxylan esterase and a xylanase A. Additionally, three cerato-platanin genes, two hydrophobins, three PR-1-like proteins, and an immunomodulatory protein were all induced in the biotrophic phase that could be involved in the fungal host interaction (Table 8).
Based on RNA-Seq and verified by RT-qPCR analysis, the majority of the genes within the 105 genes expressed during the necrotrophic phase are associated with fungal growth/ pathogenicity and nutrient acquisition. Several genes families were identified with multiple expressed genes such as hydrophobins (10 genes), FAD binding domain containing proteins (6 genes), aryl-alcohol oxidases (4 genes), laccases (4 genes), extracellular triacylglycerol lipases (3 genes) and cerato-platanins (3 genes). Several of these highly upregulated genes are associated with lignin breakdown. The arylalcohol oxidases (AAOs) are FAD containing extracellular enzymes in the glucose-methanol choline oxidase (GMC) family [28]. AAOs are involved in lignin degradation where the enzyme interacts with p-coumaryl alcohol, coniferyl alcohol and sinapyl alcohol polymers that make up the lignin, and O 2 yielding aromatic aldehydes and hydrogen peroxide [28]. Laccases are polyphenol oxidases that reduce phenolic compounds in lignin [29]. Glyoxal oxidases are extracellular enzymes that form hydrogen peroxide during the breakdown of lignin [30]. Eight GH families were identified among the 12 GH genes (two GH family 35; two GH family 61; two GH family 5 and one gene each for families 3, 9, 10, 16 and 18). The GH 35 s proteins are homologous to beta-galactosidases and GH 61 s proteins are homologous to endoglucanases. The GH 3 s proteins are homologous to beta-glucosidases; GH 9 s proteins are endoglucanases or cellulases; and GH 10s proteins are xylanases [31]. Numerous other enzymes and pathogenicity-related genes were identified such as carboxypeptidases, carbohydrate esterases, tripeptidyl peptidase, a necrosis inducing protein, a fungal peroxidase, cytochrome p450s, and expansins. The highest relative expressions were found for a dioxygenase, a Fbox and FNIP repeat-containing protein, a chitin synthase, a proline-rich antigen and one of the expansins (Table 9).
Several gene families were found to be differently regulated and had specific members upregulated in the biotrophic phase and others upregulated in the necrotrophic phase. Of the 41 hydrophobin genes found in the genome, 12 were differentially expressed; 10 were upregulated in the necrotrophic phase; two in the biotrophic phase. Among the eight cerato-platanin genes found in the genome six were differentially upregulated; three in each phase. The expression of genes encoding fungal-derived pathogenesis-related proteins was detected in samples from the biotrophic phase, with MrPR-1n, MrPR-1 g and MrPR-1i2 being upregulated. Among the other nine genes encoding PR-1-like proteins found in the M. roreri genome, one MrPR-1d was upregulated in the necrotrophic phase and five were constitutively expressed under the tested conditions (Additional file 2). Mean* is the normalized mean based on the size factor of the libraries.    Mean* is the normalized mean based on the size factor of the libraries.

Genome comparison
We have produced a high-quality draft genome of M. roreri using different next-generation sequencing technologies. After sequence assembly using the whole genome shotgun (WGS) strategy, a total of 3,298 contigs with a N50 value of 48.1 kb was assembled with a total genome size calculated to be 52.3 Mbp. Although larger, the M. roreri genome is structurally and organizationally similar to M. perniciosa, a sister taxa with a draft genome consisting of 3,087 contigs (N50 = 48 kb) and a genome size of 44.7 Mbp. The majority of the size difference between the genomes is associated with repetitive DNA. The M. roreri genome size is larger than the genome of M. perniciosa, but the number of genes located in each genome is similar. One factor for the size difference between M. roreri and M. perniciosa is highly correlated to the difference in the number of Long Terminal Repeat-Transposable Elements (LTR-TEs) between the genomes. About 3.46 Mbp or approximately 7% of the M. roreri genome is attributed to LTR-TEs located in the genome, while only 0.332 Mbp or about 1% of the M. perniciosa genome is attributed to LTR-TEs. Transposable Elements (TEs) contribute no more than 10 to 15% in most fungal taxa [32]. For example, the Saccharomyces cereviseae genome contains 3.1% TEs [33], the Magnaporthe oryzae genome ranges from 8.2-14% [34], while 24% of the Laccaria bicolor genome is transposon derived [35]. Within the M. perniciosa genome these repetitive elements are active and appear to contribute to the genetic variability of the species [20]. Another factor contributing to the size difference between the genomes is associated with repetitive elements and low complexity DNA. M. roreri genome has four times as many repetitive elements (7,060,129 bp) as found in the M. perniciosa genome (1,737,865 bp). The classification and biological roles of these repetitive sequences and transposable elements in M. roreri are beyond the scope of this discussion but will be explored in depth in a separate treatise.
The high degree of similarity between M. roreri and M. perniciosa genomes supports their taxonomic relatedness, while distinctions have been identified that could contribute to their distinctive functional and morphologic differences. Bidirectional blast analysis of M. roreri with four other basidiomycetes revealed a set of 1,133 genes specific to M. roreri. The majority of these genes are hypothetical with only 16 genes having known homologies and most of these relate to TEs. Expression analysis of the A gene set (genes specific to M. roreri) revealed that only seven of these genes are expressed and all at low levels. Among the genes encoding hypothetical proteins that are specific to M. roreri, 129 were constitutively expressed and only three were differentially expressed in both the biotrophic and necrotrophic phases (Additional file 1, A set sheet). Low to mid-level constitutive expression in genes from the ABE and ABC intersects (Tables 5 and 6) suggest that these genes are used in general functions common to the sampled basidiomycetes such as metabolism. The differential expression of the 14 genes shared by M. roreri and M. perniciosa (AB intersect) in the biotrophic and necrotrophic phases, as expression of these genes suggests similarities in the hemibiotrophic life cycle of these species.
Secretome targeting the plant cell RNA-Seq, was used to validate the genome CDS through transcriptome expression analysis of M. roreri infected cacao pods at 30 DPI and 60 DPI. These two sampling points compare genes expressed in the early stage or biotrophic phase and the late stage or necrotrophic phase of the FPR disease cycle. In silico analysis of genes with signal peptides, which have a high likelihood of being transported outside of the fungus, yielded 1,355 genes expressed at some level in the 30 and 60 DPI samples, as detected by RNA-Seq. Only 222 of these genes were differentially expressed between the biotrophic and necrotrophic phases, using a P-value of 0.05. The similarities in gene expression between the two phases are greater than the dissimilarities. For example, genes encoding plant cell wall-degrading cellulases were not upregulated in the biotrophic phase, but, their expression was detected. Although the functions of many of these genes are unknown their expression raises the potential that they serve critical roles in the M. roreri life cycle. Many of the known secreted protein encoding genes that were upregulated in the biotrophic phase encode for putative glycoside hydrolases (GH) that act on oligosaccharides making up the plant cell wall. Of the 34 differentially expressed secreted protein genes associated with the biotrophic phase, 11 were glycoside hydrolases from six families of hydrolases, which enables M. roreri to interact with the different biochemical components of plant cell walls. This diverse array of polysaccharide degrading enzymes could function together with other hemicelluloses-degrading enzymes, such as those that hydrolyze xylan. All of these polysaccharide degrading enzymes appear to work in concert to loosen the connections between the plant cells, releasing nutrients that can be utilized by the fungus. Pectin is another major component of the intercellular matrix of the cacao pod husk [36]. M. perniciosa, the causal agent of WBD on cacao, has been observed to express genes for pectin lyase and pectin methylesterases during infection of parthenocarpic pods [18]. In FPR, pectin is also under attack during the biotrophic phase of M. roreri. Genes encoding pectin degrading enzymes are upregulated in this phase.
The biotrophic phase of M. roreri is a complex stage requiring controlled cell wall degradation and modification, allowing the invasion of the pod while the fungus remains substantially undetected by plant defenses. All of the M. roreri genes encoding plant cell wall enzymes that are induced in the biotrophic phase are able to attack the matrix of the intercellular space. This interaction likely facilitates penetration and spread of the fungus between cells without eliciting major responses from the plant.
In contrast, the expression pattern of genes encoding secreted proteins during the necrotrophic phase of M. roreri suggest a more aggressive breakdown of plant structures and the upregulation of aryl-alcohol oxidases, laccases, and a glyoxal oxidase gene during the necrotrophic phase suggests that M. roreri, targets lignin for breakdown during the necrotrophic phase. This evidence is further supported by the fact that three of the four aryl alcohol oxidases and three of the four laccases genes are exclusively expressed during the necrotrophic phase (Table 9) [28][29][30]. There are other upregulated enzymes that may also have a role in these ligninolytic reactions such as a fungal peroxidase, a copper amine oxidase and two dioxygenases. Fungal peroxidases were induced in M. perniciosa under nutrient limited conditions [18], while intradiol dioxygenase was shown to be induced in M. roreri during necrotrophy in a prior study [21].
Similar to the biotrophic phase, eight glycoside hydrolase (GH) families were also induced in the necrotrophic phase. Although the specific details concerning the exact function of these GH families of genes are lacking, it is interesting to note the differential expression patterns between the biotrophic and necrotrophic phases. GH families shared with the biotrophic phase include families 5, 16 and 18. GH families 28, 43 and 92 are unique to the biotrophic phase, whereas families 3, 9, 10, 35, and 61 are unique to the necrotrophic phase. The GH families unique to the necrotrophic phase include enzymes involved in the degradation of plant cell walls into compounds that the fungus can consume. Complex stage-specific regulation of genes targeting plant cell walls in association with the biotroph/necrotroph shift was also observed in hemibiotrophic Colletotrichum species [37]. The differential expression of genes with seemingly related functions provides the opportunity to fine tune aspects of growth and development to the changing conditions and requirements of the two phases of FPR.
Other cell surface-modifying enzymes upregulated in the necrotrophic phase are triacylglycerol lipase precursors and a cutinase suggesting lipid breakdown and modification of the protective surfaces of the plant cells. A necrosis inducing-like protein (NLP) that could be responsible for cell death and nutrient leakage was also found to be up-regulated [38]. NLPs have been identified in many pathogenic associations such as WBD and other biotrophic interactions [39][40][41]. Two genes from another class of virulence genes, biotrophy-associated secreted proteins BAS (MrBAS1 and MrBAS2) are highly expressed in the M. roreri necrotrophic phase. These effector proteins have been shown to accumulate in the biotrophic interface region for the ascomycete Magnaporthe grisea [42]; however, their roles in the necrotrophic phase of this interaction are yet to be determined. At the same time in the necrotrophic phase, multiple extracellular peptidases are being upregulated possibly in response to the proteins and peptides that are now available from the necrotic plant cells [43]. Specific activities of M. roreri associated with the necrotrophic phase appear to target cell wall and membrane breakdown in associations with enzymes that are linked with the switch of this fungus from biotrophic to necrotrophic phase. The differentially expressed secretome of the necrotroph suggests conditions for rapid growth of the fungus, where plant lignocellulosic cell walls are being degraded and multiple plant components are being released and are available to be used by the fungus as nutrients. The need for stealth has been removed and nutrient acquisition is predominantly to support the rapid growth associated with the necrotrophic phase [21].

Secretome targeting modification of the fungal cell wall
There is no evidence that haustoria or other cell wallpenetrating structures are formed during the biotrophic phase of M. roreri. The slow-growing M. roreri mycelia of the biotrophic phase are large pleomorphic cells that fill the spaces between the plant cells [21]. This abnormal mycelial growth habit does have some similarity to haustoria-like structures, although they are outside the cell wall and, possibly provide the fungus with more surface area for cell-to-cell interactions and exchange of metabolites. The necrotrophic mycelia are profuse and grow rapidly in necrotic tissues [21]. These mycelia are thin and elongated. The sporophores and resulting spores represent a second fungal morphology associated with the necrotrophic phase. Genes encoding proteins that are functional in and on the fungal cell wall were differentially expressed between the phases of FPR. Often unique members of the same gene family were differentially expressed.

Chitin synthesis and modification
There were five differentially expressed family 18 glycoside hydrolases associated with the biotrophic phase and one associated with the necrotrophic phase. These chitinolytic enzymes could be modifying the chitin structure and the surface of the mycelia. These modifications could protect the mycelia from plant defenses such as plant chitinases [44], thus allowing the fungus to grow intercellularly. Several chitinases are highly expressed in cacao pod tissues throughout development, and can be hyper-induced in response to M. roreri infection [21]. Another possibility is that the chitinases attack chitin oligosaccharides released from the fungus, after being attacked by plant chitinases. Additional extracellular chitinases could help eliminate these chitin oligosaccharides and prevent the elicitation of plant defense responses [45].
A chitin deacetylase and a chitin synthase were highly upregulated in the necrotrophic phase. The induction of chitin synthase is consistent with the rapid growth of the fungus initiated during the necrotrophic phase. The chitin deacetylase and related enzymes like exo-beta-glucanases and family 4 carbohydrate esterases can modify chitin, either loosening the chitin polymer bonds or converting the chitin to chitosan. Like the rust fungus Uromyces viciae-faba [46], M. roreri may use chitin deacetylase and other enzymes to modify its own cell wall reducing the effectiveness of the plant enzymes. A resistant wall structure may be particularly important in the harsh environment created as plant tissues die. Although cacao defense genes were induced in the later stages of the biotrophic phase of FPR, their induction levels continued to increase once the necrotrophic phase has been initiated [21].

PR-1 related proteins
Pathogenesis-related 1 proteins (PR-1) are a class of plant genes induced by pathogens during the infection stage and they are partially responsible for systemic acquired resistance responses in those plants. It is of particular interest that plant-like defense proteins would be found in a pathogen and within two species of the same fungal genus. Of the 12 PR-1 related genes identified in the M. roreri genome, 10 have homology to PR-1 genes recently described in Moniliophthora perniciosa [47]. Three of these PR-1-like proteins are upregulated during the biotrophic phase, five are constitutively expressed and only MrPr-1d was upregulated in the necrotrophic phase. All of the MrPR-1-like proteins have the SCP/TAPS-like conserved domains (Figure 3). The MrPR-1 g and MrPR-1a match the expression of their homologs in the biotrophic phase of M. perniciosa [47]. Also of interest is the Cterminal extension of the MrPR-1g protein, which is found in both M. perniciosa and M. roreri and could have a specific protein-protein mediated function as purposed by Teixeira et al., [47]. The MrPR-1b homolog was constitutively expressed in M. roreri, which differs from the case of M. perniciosa where it was upregulated in the necrotrophic phase. While the exact function of these PR-1 proteins in fungi is unknown, these fungal-derived PR-1 like proteins may be masking the fungal cell wall from plant defense mechanisms, or they may bind to them preventing their activity. Riviere et al., [48] demonstrated that PR-1a proteins in tobacco plants regulate the extracellular enzyme activity of β-(1-3)-glucanase activity and plants with PR-1 silenced genes had higher enzyme levels along with a decrease in the deposition of β-(1-3) glucans and callose. Callose deposition in the cell-wall sheath around the plasmodesmata openings is a dynamic regulator mechanism that appears to alter the size of the plasmodesmata orifice, which in turn can modify cell to cell transport by restricting the size of transport molecules [49]. Previous work has shown that M. roreri infected cacao pods at 30 DPI have a 1.6 to 2 fold reduction in glucose, phenylalanine and asparagine levels [21], which suggests the fungus has access to these plant metabolites, despite being limited to the intercellular spaces. During the necrotrophic phase (60 DPI) most pod metabolites are severely depleted. The M. roreri mycelia, which grow between plant cells, may have access to the plasmodesmata passages and the nutrients that pass through them. The PR-1 like proteins appear to be important to virulence, and it has been suggested that they support systemic spread of the fungus, possibly by limiting susceptibility to host defenses, or they may act as effectors suppressing host defenses [50]. The phase specific expression of the different PR-1 encoding genes is interesting, particularly because of the difference in the host interactions of the biotroph and necrotroph. The PR-1 proteins expressed by M. roreri in the different phases could be acting to alter the plant cell walls or as competitive inhibitors of plant PR-1 proteins, thus minimizing the plant defense response, or they could have antimicrobial activity, which would function to prevent subsequent infection and development of competing microorganisms.

Hydrophobins
Of the 41 hydrophobins found in the genome, three members of the hydrophobin gene family are up-regulated in the biotrophic phase and 10 members are up regulated in the necrotrophic phase. The hydrophobins interact at the fungal cell surface creating a hydrophobic surface layer and are important in many morphogenetic processes such as sporulation, fruiting body formation and infection structures [51]. Hydrophobins are grouped into three classes Ia, Ib, and II, with Ia and II occurring in ascomycetes and Ib in basidiomycetes [52]. These proteins have the unique ability to self assemble once secreted and they function at the fungal wall-air interface, limiting desiccation and providing protection against both chemical and enzymatic attack [53]. It is easy to postulate the need for different hydrophobins between the two phases of FPR, considering the need to limit detection in the biotrophic phase and the need to protect the surface from enzymatic and chemical attack in the necrotrophic phase. In animal systems at least, hydrophobins are known to prevent host recognition of pathogen-associated molecular patterns, thus preventing the activation of host immune response [54]. Hydrophobins are also of interest because of their potential uses in industrial processes [52].

Cerato-platanins and related proteins
Three cerato-platanin (CP) genes are up-regulated in the biotroph and four are upregulated in the necrotroph.
Cerato-platanin domain-containing proteins such as EPL (eliciting plant response-like), Snodprot, swollenins and expansins have a double ψβ-barrel fold and a carbohydrate binding site that have been shown to induce plant defense response [55,56]. Cerato-platanins can cause nutrient leakage and cell death in cacao [57,58]. Ectopic expression of the Magnaporthe grisea CP gene, MgSM1, upregulates the expression of plant defense genes such as PR-1, PR-5 and PDF1.2 and induces local hypersensitivity reactions (HR) [59]. This class of fungal proteins is called elicitors or effector proteins. A fungal effector protein found in Colletotrichum truncatum (CtCP1) is expressed during the infection stage [60]. Unlike the MgMS1 protein, CtCP1 does not cause HR or elicit a plant defense response, [60] demonstrating that the activity of cerato-platanins can be both host and protein specific. An EPL gene from Trichoderma species produces a protein that binds to chitin and self assembles at the air/water interface like a hydrophobin [61]. When MSP1, a snodprot1 homolog protein, was knocked out in Magnaporthe oryzae, the fungus was unable to grow within the plant after normal infection suggesting this protein has a key function during the intercellular growth of this fungus [62]. Other CP-family proteins such as snodprot1 homologs are expressed in ecotomycorrhizal fungi Pisolithus microcarpus and Laccaria bicolor [63], suggesting a role for this family of genes during the plant fungal interaction.
Two expansins were upregulated in the necrotrophic phase of this study. Expansins were first discovered in plants, due to their role in developmental processes involving cell wall modifications, including cell enlargement, by "loosening" plant cell walls [64]. A functional expansin family protein was reported in the basidiomycete Bjerkandera adusta [65], and expansin-like proteins have been reported in other fungi, including plant pathogens. Brotman et al. [66] reported swollenin, an expansin family protein, in Trichoderma that is involved in colonization of plant roots and apparently elicits plant-defense responses. The expansin-like and cerato-platanin proteins may be involved in invasion and pathogenesis of plant tissues by phytopathogens that employ penetration of host/susceptible tissues. Since members of the cerato-platanin family are highly expressed in both phases of FPR, it further supports scenarios in which the fungus uses unique members of complex gene families to facilitate growth and spreading under the conditions unique to each phase.
Another indicator of the biotrophic/necrotrophic switch is the upregulation of several genes in the necrotrophic phase that are associated with fungal reproduction such as the heterokaryon incompatibility protein, a fruiting bodyspecific gene and a riboflavin aldehyde-forming enzyme. The reproductive stage of the fungus occurs after the necrotrophic phase and is a nutrient intensive process [13]. Cytochrome P450s and hydrophobins have been previously found in necrotic cacao pods infected with FPR [21] and have been shown to be expressed during fungal reproduction, specifically in the fruiting body development [53,67]. However, unlike most basidiomycetes, M. roreri has lost the ability to form a basdiocarp or mushroom and sporulates directly from the mycelial mat that it forms on the diseased pod [13].

Conclusion
The M. roreri genome and expression analyses provide insight into the molecular mechanisms of the biotrophic and necrotrophic phases of this important hemibiotrophic pathogen. RNA-Seq analysis revealed differential gene expression during the biotrophic and necrotrophic phases of M. roreri during the development of FPR disease of cacao. Focusing on the secreted proteins found in this genome, we observed specific secreted proteins putatively involved in plant cell wall degradation in the biotrophic phase, which seem to degrade and utilize the intercellular spaces. At the same time, they appear to be masking or modifying the fungal cell surface to avoid plant defenses, allowing the fungus to colonize the pod. In contrast, other secreted proteins upregulated in the necrotrophic phase suggest that the fungus continues to protect its mycelia from plant defenses, while actively releasing enzymes and toxins that have been reported to attack the plant cell wall components and utilizing plant nutrients that are released during plant cell death. These findings support the existing pathogenesis model for FPR while giving greater details regarding the identity of putative secreted proteins involved in the different stages, and information regarding the differential use of unique members of complex gene families. Due to the high number of genes that encode hypothetical proteins in this genome and that are expressed during the disease process more research is needed to fully understand the various phases of this hemibiotrophic fungus.

Biological material, libraries construction and sequencing
The M. roreri clone (MCA2977) was isolated from the state of Los Rios in Ecuador. It was grown and maintained on a yeast extract (5 gL −1 ) and glycerol (20 mL L −1 ) medium. Cultures were incubated at 27°C at 200 rpm for 7 d. Total DNA was extracted using a cetyl trimethylammonium bromide (CTAB) method [68].

Genome assembly
Moniliophthora roreri DNA sequences were obtained by a hybrid sequencing approach using GS FLX and Illumina sequencing, and the whole-genome shotgun strategy. The GS FLX sequencer (454 Life Sciences/Roche) was used to produce 3,440,399 single-end reads with a mean length of 230 bp and 417,891 reads from paired end runs with 2Kbp insert size. An Illumina Genome Analyzer was used to produce 30,826,974 read pairs (2 × 76 bp, insert size of 365 bp). The Illumina data was filtered to remove reads from mitochondrial genome and plasmids. Illumina data was error-corrected using Quake v0.3 with default parameters. In total, 24,405,509 read pairs passed Quake error correction. The final hybrid assembly of both 454 data and Illuminia data was performed by using the Newbler v.2.6 assembler (default parameters with overlap parameter > = 40 and identity to > =95%, http://www.454. com/products/analysis-software/) [69,70].
Combined gene models Ab-initio gene models prediction Genefinding prediction programs, Genemark-ES v2.3 [71] and Augustus v2.3.1 [72,73] programs optimized with the Exonerate v2.2.0 protein2genome model [74] were used to perform the ab-initio gene prediction. Genefinding prediction program Genemark.hmm [75] was executed in self-training mode and ORFs larger than 200 bp were retained. The Genemak-ES predicted proteins were aligned against the GenBank NR, using BLASTP with a e-value cutoff of 1e-10. Finally, EVidence Modeller (EVM) [76] was used to combine the predictions from Genemark-ES, Augustus and Exonerate spliced protein alignments and generate a consensus gene prediction. Select gene models were manually inspected, especially incomplete genes 5' and 3' ends of the scaffold.

Automatic annotation
The automatic annotation program AutoFACT [77] was used for functional annotation of gene models. The main contribution of AutoFACT is the capacity to conduct the annotation based on sequence similarity searches in several databases. For this, we used the BLASTp (e-value cutoff of 1e-5) to align the gene models against various proteins databases: non-redundant (NR) database at NCBI, SwissProt-databases containing only manually curated proteins [78], uniref90 and uniref100-databases containing clustered sets of proteins from UniProt, Pfam-database of proteins families [79] and KEGG-database of metabolic pathways [80]. Open reading frames were also annotated using Blast2GO (http://www.blast2go.com/b2ghome) [81].

Transposable elements
The M. roreri and M. perniciosa genomes were scanned for transposable elements. Each genome was fragmented in silico into 1000-bp fragments. These fragments were compared to a database of full length LTR retroptransposons developed by the program LTR struc [82] of LTRs associated with M. roreri and M. perniciosa using the BLAST program. Fragments with significant E-values (E-04) and better were sequestered. The rest of the fragments with no hits to LTR retrotransposons were scanned against the repbase database [83], which is a collection of sequences representing repetitive DNA from different eukaryotic species using the Blast program.
The M. roreri and M. perniciosa genomes were also scanned for repeptive DNA using the program Repeat Masker with full Repbase. However, this scan did not find all the LTR and repetitive elements. Thus, repeat Modeler was also used to find denovo repeats and used as the database for input into Repeat Masker. This analysis is shown (Additional file 1 Repeat modeler sheet) Repeat-Masker Open-3.0. <http://www.repeatmasker.org>.

Transcriptome analysis in infected pods
Briefly, the pods used in this study were derived from flowers hand pollinated 60 days prior to infection with a spore suspension of 1×10 8 spores ml -1 [21]. The plant material used was a segregating F1 population of progeny from highly susceptible clones (PA16 × SIC433 and PA16 × EEG29). Seven infected pods were frozen in liquid nitrogen at 30 days post infection (DPI) and another seven at 60 DPI. Based on the typical progression of FPR symptoms [21], malformed green pods were selected for RNA-Seq and RT-qPCR from 30 DPI pods which represents the biotrophic phase. Necrotic sporulating pods were selected for RNA-Seq and RT-qPCR from 60 DPI pods, which represents the necrotrophic phase.

RNA extraction
Each infected pod was broken up and coarsely ground under liquid nitrogen and approximately 1 cm 3 was used for processing. The pod material was ground finely and transferred to a disposable 50 mL centrifuge tube containing 15 mL of 65°C extraction buffer [84]. Additional extraction methods were conducted as in Melnick et al. [18]. The cDNA was synthesized using the Invitrogen (Carlsbad, CA) Superscript VILO kit, following the manufacturer's directions.

RNA sequencing
For the genome-wide analyses of expression patterns, cDNA was generated using a routine RNA library preparation TruSeq protocol developed by Illumina Technologies (San Deigo, CA). Using the kit, mRNA was first isolated from total RNA by performing polyA selection step, followed by construction of single end sequencing libraries with an insert size of 160 bp. Single-end sequencing was performed on six samples (three 30 DPI and three 60 DPI) using the Illumina HiSeq platform. Samples were multiplexed with unique six-mer barcodes generating 404,820,524 filtered (for Illumina adapters/primers, and PhiX contamination) 1×50 bp paired end reads.

Expression analysis
Reads from 30 DPI and 60 DPI libraries were mapped to the nucleotide sequences of predicted protein coding genes of the M. roreri genome using the short read aligner Bowtie-0.12.7 which utilizes a Burrows-Wheeler index [85]. Count data were obtained for each coding sequence. Estimation and statistical analysis of expression levels using count data of each gene with 3 replicates for each library was performed using the DEseq package [86] and R x64 2.15.2 program. (http://www.r-project.org/).
After RNA-Seq, twenty five genes were chosen for analysis by RT-qPCR across the full set of infected pods. For RT-qPCR analysis, seven replicate malformed green pods 30 DPI and seven replicate necrotic sporulating pods (60 DPI) were used. RT-qPCR analysis was conducted following Bailey et al. [21], using Brilliant III® SYBR® Green Q-PCR Master Mix (Agilent, Santa Clara, CA). Primer sources, sequences for the M. roreri genes are in Additional file 3. M. roreri ESTs were chosen based upon results of RNA-Seq analysis. RT-qPCR was conducted to determine the changes in expression of M. roreri genes between the biotrophic and necrotrophic phases of FPR. The ddCt method was used to calculate the foldchange between the 30 and 60 day DPI samples [87,88].

Statistical analysis of data
A two-way ANOVA of the RNA-Seq data was conducted using the general linear model (PROC GLM) followed by Tukey post-hoc testing (α = 0.05) using SAS 9.3 (SAS Institute Inc., Raleigh, NC, USA) to analyze pod data. For RT-qPCR, relative transcript levels were determined following Pfaffl [89].

Secreted proteins
17,920 protein coding sequences were scanned for signal peptides using signalP [90,91]. The resulting proteins containing signal peptides were scanned for transmembrane proteins using the TMHMM program (Prediction of transmembrane helices in proteins) [92]. Proteins with no more than two transmembrane domains were sequestered for further analysis. The resulting proteins were scanned for GPI-anchored proteins [93,94] using FragAnchor, which is based on the tandem use of a Neural Network predictor and a Hidden Markov Model predictor (http://navet.ics. hawaii.edu/~fraganchor/NNHMM/NNHMM.html) [95]. Proteins that were highly probably and probable of having a GPI-anchor were discarded. Thus, using bioinformatics, proteins with signal peptides, with two or less transmembrane domains and without GPI-anchors were considered for possible secretion.

Venn diagram
Using a bi-directional blast method the M. roreri, M. perniciosa, Laccaria bicolor, Coprinus cinereus, and Ustilago maydis genomes were compared and used to generate a 5-way Venn diagram. In addition, five tax plots for all five fungal species were created using each genome as a reference, respectively. Protein sequences that share an expected E-value of E-04 were considered matches.

Dot plots
Dot plots between the large contigs (50,000 bp or larger) of M. roreri and M. perniciosa were created using the whole genome aligner MUMmer3. 22. The NUCmer and MUMmerplot programs from the MUMmer suite were used [96,97].