Skip to main content

Comparative genome analysis of plant ascomycete fungal pathogens with different lifestyles reveals distinctive virulence strategies



Pathogens have evolved diverse lifestyles and adopted pivotal new roles in both natural ecosystems and human environments. However, the molecular mechanisms underlying their adaptation to new lifestyles are obscure. Comparative genomics was adopted to determine distinct strategies of plant ascomycete fungal pathogens with different lifestyles and to elucidate their distinctive virulence strategies.


We found that plant ascomycete biotrophs exhibited lower gene gain and loss events and loss of CAZyme-encoding genes involved in plant cell wall degradation and biosynthesis gene clusters for the production of secondary metabolites in the genome. Comparison with the candidate effectome detected distinctive variations between plant biotrophic pathogens and other groups (including human, necrotrophic and hemibiotrophic pathogens). The results revealed the biotroph-specific and lifestyle-conserved candidate effector families. These data have been configured in web-based genome browser applications for public display ( This resource allows researchers to profile the genome, proteome, secretome and effectome of plant fungal pathogens.


Our findings demonstrated different genome evolution strategies of plant fungal pathogens with different lifestyles and explored their lifestyle-conserved and specific candidate effectors. It will provide a new basis for discovering the novel effectors and their pathogenic mechanisms.

Peer Review reports


Plant pathogens with different lifestyles utilize distinct strategies to interact with their host plants. Necrotrophs such as the white mold fungus Sclerotinia sclerotiorum and the gray mold fungus Botrytis cinerea often secrete enzymes and toxins to kill their hosts and reproduce in dead tissue [1], while biotrophs such as powdery mildew fungi (e.g., Blumeria graminis) establish haustoria and take up nutrients within living host cells [2]. Hemibiotrophs such as Magnaporthe oryzae first adopt a biotrophic growth phase and later change into a necrotrophic infection phase [3]. The plastic genome structure of fungal species could help them evolve distinct sets of genes to successfully complete their infection lifestyle [4,5,6]. In the variable genome compartment, gene gain and loss events reflect high adaptation within pathogenic life histories [7, 8]. Effector genes are often located in genomic compartments with high mutation and/or recombination rates, such as repeat-rich regions, near telomeres, this promotes genes lose, gain, or mutation, and are among the most rapidly evolving genes in pathogen populations [9]. Even the birth-and-death model of an effector gene was proposed by Fouché, et al. [10], whether plant pathogens virulence is associations with their different lifestyles and how does  evolution change the diversity of effector proteins among these species is poorly understood.

Successful plant pathogens have to overcome physical barriers, suppress or manipulate the plant immune system, and retrieve nutrients from host tissues when infecting host plants for pathogenicity [11]. The plant cell wall is an important physical barrier that serves as the first line of plant defense that plant pathogens encounter during infection and is composed of basic carbohydrate ingredients, including cellulose, hemicellulose, pectin and lignin. Plant pathogens produce carbohydrate-active enzymes (CAZymes) that play a pivotal role in breaching the frontline of plant defense [12]. Important members of the CAZyme family are cell wall degrading enzymes (CWDEs), including glycolside hydrolases (GHs), polysaccharide lyases (PLs) and carbohydrate esterases (CEs), that cleave or modify oligo- and polysaccharides as well as other glycoconjugates and are involved in complex carbohydrate metabolism [13]. Filamentous fungi also produce a wide repertoire of bioactive secondary metabolites. Many secondary metabolites act as pathogenicity factors and have detrimental impacts on host health. For example, trichothecenes produced by Fusarium spp., T-toxin polyketides produced by Cochliobolus heterostrophus and AM-toxin cyclic peptides by Alternaria alternata are clearly involved in pathogenicity [14, 15]. Many observations found that fungi have evolved different CAZyme and secondary metabolite production strategies to adapt their own lifestyle [16,17,18]. Well known functions of melanin are protecting spores or mycelium against external environment, such as desiccation, oxygen and UV. It is also required for appressorium formation of M. grisea and allows the fungus invades the plant tissue [19].

Plant pathogens secrete diverse groups of effector proteins to gain virulence advantages. One major role of effectors is to suppress plant immunity and aid infection for pathogenicity and successful establishment of different lifestyles [20,21,22]. Most effectors have N-terminal signal sequences for secretion, and this feature helps predict the entire inventory of secreted proteins [23,24,25,26]. The other trait of effector proteins is that they are rich in cysteine (greater than 2 %), which would help in the formation of stabilizing disulfide bridges [23]. A proportion of effectors perform multifarious functions, such as host cell wall degradation, act as inhibitors of host defensive enzymes (e.g., proteases) or affect defense signaling pathways (e.g., salicylic acid biosynthesis) [27]. They function either in the plant extracellular space (apoplast) or inside plant cells after translocation from the pathogen [28]. Despite their key functions as immunity suppressors, effectors can be recognized by the plant surveillance system, leading to effector-trigger immunity (ETI) in a manner highly specific to resistant (R) proteins [29, 30].

Many known plant pathogens are mostly distributed in four classes in Ascomycota. These phytopathogenic fungi have diverse survival strategies and a high level of ecological diversity [31]. To uncover lifestyle-specific features of effector proteins, we employed a comparative genomic method to analyze 17 selected plant fungal pathogens of Ascomycota, all causing important crop diseases, including 2 biotrophs, 6 necrotrophs, and 9 hemibiotrophs (Tables S1 and S2). Two human pathogens and three biotrophs from Basidiomycota were included for comparison. We focus on the following two aspects: (i) genome structure and evolution characteristics among pathogens with different lifestyles and (ii) whether there are differences in the effector genetic diversity of these plant pathogens. The identification and analyses of the repertoire of effector variants is a prerequisite to understand pathogen-plant interactions and will provide important insights into the different infection strategies of pathogens.


Lower gene gain and loss events occurred in ascomycete plant biotrophs

The genome sizes of the 22 genomes ranged between 19.7 and 124.5 Mb (Table S1). The number of protein sequences ranged from 6,532 to 27,347 (Table 1). The ascomycete biotrophs (e.g., B. graminis and Golovinomyces cichoracearum) have relatively smaller proteomes than those of other species (Table 1). A total of 16,036 protein families were identified in 22 species, among which the number of families expanded and contracted. The two ascomycete biotrophs, B. graminis and G. cichoracearum, harbor a relatively low number of expanded and contracted families (Fig. 1 and Table S3), implying a slow rate of gene gain and loss in their evolution. Although both species live a hemibiotrophic lifestyle, Fusarium oxysporum and Ceratocystis fimbriata were in sharp contrast to all species tested. F. oxysporum had the most rapidly expanded families (1,222), while C. fimbriata contained the largest number of rapidly contracted families (243), indicating that they had undergone significant gene gains or losses in their genomes (Fig. 1 and Table S3). Even though both F. oxysporum and F. graminearum belong to the genus Fusarium, F. oxysporum revealed obvious gene family expansion (Fig. 1) because of its broad host adaptation and different reproductive strategies. Ma, et al. [32] discovered that F. oxysporum (15 chromosomes) harbors a larger genome assembly than F. graminearum (4 chromosomes), and the lineage-specific (LS) genomic regions in F. oxysporum account for more than one-quarter of the genome and are rich in transposons and genes with distinct evolutionary profiles but related to pathogenicity, indicative of horizontal acquisition.

Table 1 Number of proteins in the proteome, secretome and effectome for each fungus
Fig. 1
figure 1

Phylogenetic relationship of pathogens and gene family size changes among these species. The tree was constructed by RAxML based on single-copy gene families present in the fungal pathogens examined. The colored strips represent different lifestyles of fungi. The circles represent the numbers of expanded and contracted families or gain and loss genes of each fungus. The threshold for being defined as rapidly evolving families is set to 0.01

Lower number of CWDE gene homologs in plant biotrophs

CAZymes play critical roles in the degradation of the plant cell wall. The CE, GH, and PL superfamilies are also known as cell wall degrading enzymes (CWDEs) due to their role in the disintegration of the plant cell wall by bacterial and fungal pathogens [13]. To better understand the frequency and distribution of CAZyme-coding genes in the genome of each lifestyle, we detected the presence of CAZyme-coding genes using a homology-based approach. We identified lower numbers of three types of CWDEs (GHs, PLs and CEs) in plant biotrophs than in necrotrophs and hemibiotrophs (Fig. 2 and Table S4), consistent with previous studies showing that a lower proportion of CWDEs allows biotrophic organisms to better adapt live plant tissue and avoid plant cell death [16, 21, 33]. A highly complex set of CWDE homologs in the genomes of plant hemibiotrophic and necrotrophic pathogens indicates their significant roles in pathogenicity by being involved in the breakdown of the plant cell wall. However, the human biotrophic pathogen A. fumigatus displayed a distinct pattern with plant biotrophs that had higher CWDE gene numbers (Fig. 2 and Table S4). The enzymes for auxiliary activities (AAs) and carbohydrate-binding modules (CBMs) were also reduced in plant biotrophs. These noncatalytic proteins could increase the affinity of enzymes to substrates. Glycosyltransferases (GTs) involved in the biosynthesis of oligosaccharides, polysaccharides, and glycoconjugates are more similar to hemibiotrophic and necrotrophic pathogens.

Fig. 2
figure 2

Numbers of CAZyme-encoding genes and secondary metabolite biosynthetic gene clusters in each fungus. The colored strips represent different lifestyles of fungi. The heatmap indicates the number of genes encoding CAZymes involved in GH, GT, PL, AA and CBM activity. The numbers were normalized with the “scale” function in the R “base” package. The circles denote the number of each biosynthetic gene cluster (BGC) type, including nonribosomal peptides (NRPs), polyketides, and terpenes

Loss of secondary metabolite biosynthesis genes in plant biotrophs

Secondary metabolites (SMs) are low-molecular-weight compounds with a significant ecological, symbiotic or pathogenic role. SM biosynthetic pathways that produce polyketides, nonribosomal peptides (NRPs), ribosomal peptides, terpenes and hybrid metabolites are the most prevalent. Biosynthetic gene clusters (BGCs) are responsible for the production of these small molecules. To investigate the presence and diversity of BGCs involved in specialized metabolism pathways, the fungal genomes were processed by performing antiSMASH analyses. Plant biotrophs possess a contracted array of nonribosomal peptide, polyketide, and terpene biosynthetic gene clusters, implying that they have a weaker capacity to produce these secondary metabolites than necrotrophs and hemibiotrophs (Fig. 2), consistent with biotrophs being able to keep their host alive and take nutrients from it, while necrotrophs need to produce more diverse secondary metabolites to maintain generally broader host ranges than biotrophs [34, 35]. Human pathogens, H. capsulatum and ascomycete plant biotrophs have lower numbers of NRPs (4 gene clusters), polyketides (1 gene cluster), and terpenes (4 gene clusters), while A. fumigatus has more than several times (8 NRPs, 10 polyketides, 6 terpenes) that of H. capsulatum (Table S5), which is inconsistent with plant ascomycete fungal pathogens. Although there was a discrepancy in NRP numbers with Syme, et al. [36], both our results and theirs reflected that NRPs were overrepresented in P. teres compared to other ascomycete fungal genomes. The antiSMASH analysis also revealed that the ascomycete genome formed a larger complement of secondary metabolite gene clusters than the basidiomycetes examined in this study even the strains from basidiomycetes harbor larger genome (Fig. 2), the results are similar to a previous report [17].

Plant biotrophs contain distinctive secretomes and effectomes

The predicted secretome and effectome varied in terms of the number of proteins among 22 genomes. The number of predicted secreted proteins ranged from 344 to 2015, or 3.69–13.37 % of their respective proteomes, while the number of predicted effectors ranged from 48 to 618, accounting for 0.52–3.87 % of the proteome, respectively (Table 1). We clustered 1902 orthologous families for secreted proteins and 388 orthologous families for effectors across all species using OrthoFinder. Biotrophs were distinctively separated from the other groups in the secretome and effector with principal coordinate analysis (PCoA) (Fig. 3; ANOSIM: P= 0.001 and 0.01, respectively), while hemibiotrophic and necrotrophic pathogens apparently showed an aggregated distribution pattern (Fig. 3; ANOSIM: P > 0.1), which indicated the genetic diversity variations of effector and secreted proteins in different lifestyles.

Fig. 3
figure 3

PCoA plots of the secretome (A) and effectome (B) based on the weighted UniFrac distance metric. Af, Aspergillus fumigatus; Bc, Botrytis cinerea; Bg, Blumeria graminis; Cf, Ceratocystis fimbriata; Cg, Colletotrichum gloeosporioides; Fg, Fusarium graminearum; Fo, Fusarium oxysporum; Gc, Golovinomyces cichoracearum; Gt, Gaeumannomyces tritici; Hc, Histoplasma capsulatum; Lm, Leptosphaeria maculans; Mo, Magnaporthe oryzae; Pn, Parastagonospora nodorum; Ps, Pyrenophora seminiperda; Pt, Pyrenophora teres; Ss, Sclerotinia sclerotiorum; Vd, Verticillium dahlia; Vi, Venturia inaequalis; Zt, Zymoseptoria tritici; Pg, Puccinia graminis; Mlp, Melampsora larici-populina ; Um, Ustilago maydis

To further probe the possible mechanism that causes the distribution pattern of effectors between biotrophs and other organisms, we determined the biotroph-specific and core candidate effector families. Some biotroph-specific candidate effectors were found to be species-specific and often lacked homologs (Table S6). They were annotated to exhibit diverse functions (Table S7). Although no common effectors were detected among species, as illustrated by the flower plot (Fig. 4 A), we defined 7 core effector families from the overlapping sets among different lifestyles (Fig. 4B). Of these, five effector families were annotated as cutinase (OG0000001), endo-1,4-beta-xylanase (OG0000005), FK506-binding protein (OG0000020), cysteine-rich secretory protein (OG0000037) and endosomal P24B protein (OG0000039) (Fig. S1 and Table S8). The domains they contained included cutinase (PF01083.22), Glyco_hydro_11 (PF00457.17), FKBP_C (PF00254.28), CAP (PF00188.26) and EMP24_GP25L (PF01105.24) (Table 2 and Fig. S2). One of the remaining two core effector families contains proteins with unknown function (DUF3455; OG0000043), while the other (OG0000019) is excluded from our analysis due to the lack of observed protein domains.

Fig. 4
figure 4

Flower plot (A) and Venn diagram (B) illustrating core and specific gene families of the effectome among each species or lifestyle. Flower plots showing no overlapping effectors were detected between species. The number in the flower plot represents the total effector gene family of each species. The number in the Venn diagram indicates the core or specific gene family of each lifestyle

Table 2 Pfam domain annotations of core effector families

Establishment of the genome database of plant fungal pathogens

To make our data easily available to the research community, we developed a genome database for plant fungal pathogens and encourage researchers to use these resources ( Currently, online resources provide genomic and protein sequences for our data only, but they could be integrated in the future with additional genomic or transcriptomic data from other plant fungal pathogens. The genome hub we built offers a suite of bioinformatics tools to explore, analyze, and interpret these data. For example, we set up sequence search web services that allow users to search their sequence of interest, perform BLAST similarity searches to compare sequences against the genomic and protein sequence database and utilize the genome browsing tool for visualization of the genomic data. Compared to other plant pathogenic fungi genome database, such as FungiDB (, our database has accumulated and stored sequences of the secretome and effectome of plant fungal pathogens. The goal of the website is to establish a user-friendly platform for phytopathologists to extract and summarize plant fungal pathogen information intuitively from multiple data sources and to explore the novel virulence factors.


The plasticity of the genetic profile allows biotrophic, necrotrophic and hemibiotrophic lifestyles to form evolutionarily stable life history traits within fungi. Expansion and contraction of gene families is important to pathogen and host coevolution [37, 38]. Our results suggested that all selected pathogens have experienced gene family expansion and contraction, even though two ascomycete biotrophs showed slow gain and loss rates. Compared with biotrophs, necrotrophs and hemibiotrophs experienced relatively obvious turnover of genes as well as intensive dependence on the changes of host-life-state. Host-adapted lifestyles range from mutualistic symbiosis to pathogenic states, the millions of years of co-evolution has resulted a complex molecular dialogue between organisms [39]. Gain and loss of genomic islands can lead to rapid lifestyle transitions in plant-associated Pseudomonas [40]. Cuscuta australis also experienced remarkably high levels of gene contraction to adapt their parasitic lifestyle [38]. Hence, understanding of the co-evolutionary process in the arms race between plant pattern recognition receptors and pathogen effectors will be an important goal of future research.

Moreover, our study showed that plant biotrophs carry a lower number of CAZyme-encoding genes associated with plant cell wall degradation and apparently undergo a convergent loss of secondary metabolic gene clusters to trade off their sustainable parasitism. Spanu, et al. [41] discovered powdery mildew fungi that adapt to their specific biotrophy by increasing retrotransposons and reducing the number of enzymes involved in primary and secondary metabolism, carbohydrate-active enzymes, and transporters. Similarly, in comparison with hemibiotrophic Phytophthora species, Baxter, et al. [42] explored the oomycete Hyaloperonospora arabidopsidis (Hpa), an obligate biotroph, and reduced some genes encoding host-targeted, degradative enzymes and genes required for the synthesis of arachidonic acid and polyamine oxidases. Profiles derived from whole-genomic analysis also revealed that necrotrophic pathogens harbored an extensive set of CAZymes and secondary metabolic enzymes compared to biotrophic pathogens [35, 43]. All of these results demonstrated that biotrophs have evolved specific genetic features to accommodate their infection and survival strategy.

To address how variations in effectors have an impact on the interaction between hosts and pathogens, we investigated the genetic diversity of effector repertoires from different lifestyles. The diversity distribution analysis of effectors indicated clear separation of biotrophs from the other lifestyle groups (Fig. 3), consistent with biotrophs preferentially harboring distinct suites of effectors that have specific functions in biotrophic pathogenesis [41]. Moreover, a set of core effector families from pathogens with different lifestyles was observed in our study, which suggests that most pathogens defeat plant immunity with core conserved effectors. Even genome database-integrated sequences and annotations for dozens of fungi species have been established [44], most of them neglected effector candidate repertoire for plant pathogenic fungi. The genome database we established is complementary for the most devastating fungal plant pathogens, it allows researchers to compare the functional repertoire in the species used in this study and uncover novel effectors and their pathogenic mechanism.


In this study, we presented a comparative genomic analysis of ascomycete plant pathogens and revealed genome variations among pathogenic fungi with different lifestyles. Plant ascomycete biotrophs appear to have very similar genomic characteristics, including much lower content in expanded or contracted gene families, genes coding for CAZymes, and enzymes participating in secondary metabolite synthesis. Their effector repertoire diversity shows remarkable differences relative to necrotrophs, hemibiotrophs and human pathogens. Although expanding the population genomics of plant pathogens offers a powerful approach to analyze whole-genome features and predict extensive effector gene repertoires, experimental verification of gene function in biological processes and effectors in virulence is still needed. Moreover, the variations of pathogen virulence in space and time are driven mainly by plant-pathogen co-evolutionary dynamics. Further research is in progress to try to understand more about virulence effectors evolution in plant immune system network.


Genome data collection

The fungi selected are among the most economically important groups of pathogens of native and cultivated plants. The members include the barley pathogens B. graminis f. sp. hordei and Pyrenophora teres f. teres, the wheat pathogens F. graminearum, Gaeumannomyces tritici, Parastagonospora nodorum, Zymoseptoria tritici and Puccinia graminis f. sp. tritici, the hemibiotrophic rice and wheat pathogen M. oryzae, the biotrophic maize pathogen Ustilago maydis, generalized economic crops and grass pathogens, and two human pathogens, Aspergillus fumigatus and Histoplasma capsulatum (Table S2). Genomes for all fungi were collected from the NCBI Assembly database (for assembly accessions, see Table S1) [45]. Three species of B. graminis, Colletotrichum gloeosporioides and P. teres f. teres lack annotation information, and their genome assemblies were each annotated using the MAKER Annotation Pipeline ( [46]. We detected and masked the repeats and transposable elements based on the algorithms GeneMark-ES [47] and SNAP [48] and obtained conceptually translated protein. Protein information of the closest homology for each sequence was retrieved from UniProt based on the best BLAST hit (maximum e-value of 1e−5). HMMER was performed to identify Pfam domains against the Pfam protein database [49].

Phylogenomic analyses

These twenty-two genomes were subjected to phylogenomic analyses. The gene families with only one gene per species were identified by OrthoFinder [50]. A total of 531 single-copy gene families were produced ( For these families, we performed multiple sequence alignment and maximum likelihood (ML) tree estimation. Sequence alignment was performed in MAFFT [51] and HAlign [52, 53] with default settings. TrimAl was used to trim the alignment to eliminate poorly aligned regions [54]. The species tree was estimated in RAxML using the standard algorithm [55]. Visualization of the tree was performed with iTOL [56].

Divergence date estimation

For divergence time evaluation for the genome-based tree, phylogeny was calibrated in r8s with the penalized-likelihood algorithm, and the smoothing parameter value was set to 1 through cross-validation [57]. Three calibration points were fixed in the molecular clock analysis. Node 1 was fixed to 635 Mya for the age estimates of Dikarya [16]. Node 2 was constrained to 500 - 582 Mya for the divergence of Basidiomycota [16, 58]. Node 3 was constrained to 207-339 Mya [59].

Evolution of gene family sizes

The computational tool CAFE was used to analyze the gene families that have experienced expansion or contraction [60]. The program can estimate the evolution of gene family sizes over a phylogeny based on the stochastic birth and death model assuming that gene gain and loss are equally probable. The expansion or contraction events of gene families were estimated by the comparisons of family size differences between the most recent common ancestor (MRCA) and each of the current species with a significant P-value of 0.01. If the significance of gene family expansion in each branch was also less than 0.01 (P-value), these families were regarded as rapidly evolving families. The sequence count table of protein families from OrthoFinder and the ultrametric tree from r8s were used in the analyses.

CAZymes annotation

The hmmscan program from the HMMER software package [61] was used to annotate the CAZyme domain by comparing query protein sequences to the dbCAN CAZyme domain HMM database ( [62]. The dbCAN2 meta server grouped CAZyme functional modules into six classes: glycosyl transferases (GTs), glycoside hydrolases (GHs), polysaccharide lyases (PLs), carbohydrate esterases (CEs), enzymes for auxiliary activities (AAs) and carbohydrate-binding modules (CBMs) [63].

Prediction of gene clusters for biosynthesis of specialized metabolites

Biosynthetic gene clusters (BGCs) for secondary metabolites were analyzed using antiSMASH, which applies a rule-based cluster detection approach to identify 45 different types of secondary metabolite biosynthetic pathways via their core biosynthetic enzymes. The 45 BGC classes were condensed into five major groups: nonribosomal peptide synthetase (NRPs), polyketide, ribosomally synthesized and posttranslationally modified peptides (RiPP), and terpenes [64].

Prediction of the secretome and effectome

The SignalP 4.0 server was used initially to predict proteins with a signal peptide [65]. The TMHMM server was used to screen for predicted proteins without a predicted transmembrane domain (TMHMM server version 2.0; The putative effector proteins were predicted by EffectorP 2.0. The class probability threshold for the EffectorP classifier is set at 0.55 [66]. Secreted and effector protein sequences were clustered into orthologous families using MCL. Representative protein sequences were randomly selected from each family and used to calculate phylogenetic distance. The distance was visualized by principal coordinate plots using the “pcoa” function of R. ANOSIM analysis was used to statistically test whether there was a significant difference among groups [67]. The core and specific families were illustrated by Venn diagram and flower plot using VennDiagram and the plotrix package of R [68, 69].

Genome database establishment

The web server was built using Drupal (, which is an open source platform that allows users to design robust and flexible websites. The framework and basic functionality of Drupal is written by PHP, and it supports many extensions, such as PHP, MySQL. Tripal provides other functions to extend the content of Drupal, such as genomic data loaders and data storage. Furthermore, Tripal also offers some extensions that support loading and visualization of BLAST and popular genome browsing tools, such as JBrowse [70].

Availability of data and materials

All raw data is archived under NCBI BioProject and their accession numbers are available in Supplementary Table S1. The web-based genome browser of plant fungal pathogens is accessible at the All the sequence data used can be downloaded from links on the home page.


  1. 1.

    Amselem J, Cuomo CA, van Kan JA, Viaud M, Benito EP, Couloux A, Coutinho PM, de Vries RP, Dyer PS, Fillinger S: Genomic analysis of the necrotrophic fungal pathogens Sclerotinia sclerotiorum and Botrytis cinerea. PLoS Genetics 2011, 7(8):e1002230.

    PubMed  PubMed Central  Google Scholar 

  2. 2.

    Panstruga R: Establishing compatibility between plants and obligate biotrophic pathogens. Curr Opin Plant Biol 2003, 6(4):320–326.

    PubMed  Google Scholar 

  3. 3.

    Wang C, Liu Y, Liu L, Wang Y, Yan J, Wang C, Li C, Yang J: The biotrophy-associated secreted protein 4 (BAS4) participates in the transition of Magnaporthe oryzae from the biotrophic to the necrotrophic phase. Saudi J Biol Sci 2019, 26(4):795–807.

    PubMed  PubMed Central  Google Scholar 

  4. 4.

    Spanu PD: The genomics of obligate (and nonobligate) biotrophs. Ann Rev Phytopathol 2012, 50:91–109.

    PubMed  Google Scholar 

  5. 5.

    Hu Y, Qiu S, Cheng L: Integration of Multiple-Omics Data to Analyze the Population-Specific Differences for Coronary Artery Disease. Comput Math Methods Med 2021, 2021:7036592.

    PubMed  PubMed Central  Google Scholar 

  6. 6.

    Ohm RA, Feau N, Henrissat B, Schoch CL, Horwitz BA, Barry KW, Condon BJ, Copeland AC, Dhillon B, Glaser F: Diverse lifestyles and strategies of plant pathogenesis encoded in the genomes of eighteen Dothideomycetes fungi. PLoS Pathogens 2012, 8(12):e1003037.

    PubMed  PubMed Central  Google Scholar 

  7. 7.

    Raffaele S, Kamoun S: Genome evolution in filamentous plant pathogens: why bigger can be better. Nature Reviews Microbiol 2012, 10(6):417–430.

    PubMed  Google Scholar 

  8. 8.

    Condon BJ, Leng Y, Wu D, Bushley KE, Ohm RA, Otillar R, Martin J, Schackwitz W, Grimwood J, MohdZainudin N: Comparative genome structure, secondary metabolite, and effector coding capacity across Cochliobolus pathogens. PLoS Genetics 2013, 9(1):e1003233.

    PubMed  PubMed Central  Google Scholar 

  9. 9.

    Möller M, Stukenbrock EH: Evolution and genome architecture in fungal plant pathogens. Nature Reviews Microbiol 2017, 15(12):756.

    PubMed  Google Scholar 

  10. 10.

    Fouché S, Plissonneau C, Croll D: The birth and death of effectors in rapidly evolving filamentous pathogen genomes. Curr Opin Microbiol 2018, 46:34–42.

    PubMed  Google Scholar 

  11. 11.

    Toruno TY, Stergiopoulos I, Coaker G: Plant-Pathogen Effectors: Cellular Probes Interfering with Plant Defenses in Spatial and Temporal Manners. Annu Rev Phytopathol 2016, 54:419–441.

    PubMed  PubMed Central  Google Scholar 

  12. 12.

    Kubicek CP, Starr TL, Glass NL: Plant cell wall-degrading enzymes and their secretion in plant-pathogenic fungi. Annu Rev Phytopathol 2014, 52:427–451.

    PubMed  Google Scholar 

  13. 13.

    Ospina-Giraldo MD, Griffith JG, Laird EW, Mingora C: The CAZyome of Phytophthora spp.: a comprehensive analysis of the gene complement coding for carbohydrate-active enzymes in species of the genus Phytophthora. BMC Genomics 2010, 11(1):525.

    PubMed  PubMed Central  Google Scholar 

  14. 14.

    Proctor R, Desjardins A, McCormick S, Plattner R, Alexander N, Brown D: Genetic analysis of the role of trichothecene and fumonisin mycotoxins in the virulence of Fusarium. Eur J Plant Pathol 2002, 108(7):691–698.

    Google Scholar 

  15. 15.

    Markham JE, Hille J: Host-selective toxins as agents of cell death in plant–fungus interactions. Mol Plant Pathol 2001, 2(4):229–239.

    PubMed  Google Scholar 

  16. 16.

    Kohler A, Kuo A, Nagy LG, Morin E, Barry KW, Buscot F, Canbäck B, Choi C, Cichocki N, Clum A: Convergent losses of decay mechanisms and rapid turnover of symbiosis genes in mycorrhizal mutualists. Nature Genetics 2015, 47(4):410.

    PubMed  Google Scholar 

  17. 17.

    Collemare J, Billard A, Böhnert HU, Lebrun M-H: Biosynthesis of secondary metabolites in the rice blast fungus Magnaporthe grisea: the role of hybrid PKS-NRPS in pathogenicity. Mycological Research 2008, 112(2):207–215.

    PubMed  Google Scholar 

  18. 18.

    Liang P, Liu S, Xu F, Jiang S, Yan J, He Q, Liu W, Lin C, Zheng F, Wang X: Powdery mildews are characterized by contracted carbohydrate metabolism and diverse effectors to adapt to obligate biotrophic lifestyle. Front Microbiol 2018, 9:3160.

    PubMed  PubMed Central  Google Scholar 

  19. 19.

    Talbot NJ: On the trail of a cereal killer: exploring the biology of Magnaporthe grisea. Ann Reviews Microbiol 2003, 57(1):177–202.

    Google Scholar 

  20. 20.

    De Wit PJGM, Mehrabi R, Van den Burg HA, Stergiopoulos I: Fungal effector proteins: past, present and future. Molecular Plant Pathol 2009, 10(6):735–747.

    PubMed  Google Scholar 

  21. 21.

    Lo Presti L, Lanver D, Schweizer G, Tanaka S, Liang L, Tollot M, Zuccaro A, Reissmann S, Kahmann R: Fungal effectors and plant susceptibility. Ann Review Plant Biol 2015, 66:513–545.

    PubMed  Google Scholar 

  22. 22.

    Horbach R, Navarro-Quesada AR, Knogge W, Deising HB: When and how to kill a plant cell: infection strategies of plant pathogenic fungi. J Plant Physiol 2011, 168(1):51–62.

    PubMed  Google Scholar 

  23. 23.

    Ellis JG, Rafiqi M, Gan P, Chakrabarti A, Dodds PN: Recent progress in discovery and functional analysis of effector proteins of fungal and oomycete plant pathogens. Curr Opin Plant Biol 2009, 12(4):399–405.

    PubMed  Google Scholar 

  24. 24.

    Kong L, Zhang L, He S: Improving Multi-type Gram-negative Bacterial Secreted Protein Prediction via Protein Evolutionary Information and Feature Ranking. Current Bioinformatics 2020, 15(6):538–546.

    Google Scholar 

  25. 25.

    Li J, Wei L, Guo F, Zou Q: EP3: An ensemble predictor that accurately identifies type III secreted effectors. Briefings Bioinformatics 2021, 22(2):1918–1928.

    PubMed  Google Scholar 

  26. 26.

    Wang Y, Zhou M, Zou Q, Xu L: Machine learning for phytopathology: from the molecular scale towards the network scale. Briefings in Bioinformatics 2021, 22(5).

  27. 27.

    Rovenich H, Boshoven JC, Thomma BPJCoipb: Filamentous pathogen effector functions: of pathogens, hosts and microbiomes. Curr Opin Plant Biol 2014, 20:96–103.

    PubMed  Google Scholar 

  28. 28.

    Giraldo MC, Valent B: Filamentous plant pathogen effectors in action. Nature Reviews Microbiol 2013, 11(11):800.

    PubMed  Google Scholar 

  29. 29.

    Dou D, Zhou J-M: Phytopathogen effectors subverting host immunity: different foes, similar battleground. Cell Host Microbe 2012, 12(4):484–495.

    Google Scholar 

  30. 30.

    Presti LL, Lanver D, Lozada-Ramirez J, Schweizer G, Tanaka S, Liang L, Tollot M, Zuccaro A, Reissmann S, Kahmann R: Fungal Effectors and Plant Susceptibility. Ann Review Plant Biol 2015, 66(1):513–545.

    PubMed  Google Scholar 

  31. 31.

    Berbee ML: The phylogeny of plant and animal pathogens in the Ascomycota. Physiological Molecular Plant Pathology 2001, 59(4):165–187.

    Google Scholar 

  32. 32.

    Ma L-J, Van Der Does HC, Borkovich KA, Coleman JJ, Daboussi M-J, Di Pietro A, Dufresne M, Freitag M, Grabherr M, Henrissat B: Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium. Nature 2010, 464(7287):367–373.

    PubMed  PubMed Central  Google Scholar 

  33. 33.

    O’Connell RJ, Thon MR, Hacquard S, Amyotte SG, Kleemann J, Torres MF, Damm U, Buiate EA, Epstein L, Alkan N et al: Lifestyle transitions in plant pathogenic Colletotrichum fungi deciphered by genome and transcriptome analyses. Nature Genetics 2012, 44(9):1060–1065.

    PubMed  Google Scholar 

  34. 34.

    Howlett BJ: Secondary metabolite toxins and nutrition of plant pathogenic fungi. Curr Opin Plant Biol 2006, 9(4):371–375.

    PubMed  Google Scholar 

  35. 35.

    Pusztahelyi T, Holb IJ, Pócsi I: Secondary metabolites in fungus-plant interactions. Front Plant Sci 2015, 6:573.

    PubMed  PubMed Central  Google Scholar 

  36. 36.

    Syme RA, Martin A, Wyatt NA, Lawrence JA, Muria-Gonzalez MJ, Friesen TL, Ellwood SR: Transposable element genomic fissuring in Pyrenophora teres is associated with genome expansion and dynamics of host–pathogen genetic interactions. Front Genet 2018, 9:130.

    PubMed  PubMed Central  Google Scholar 

  37. 37.

    Hu X, Xiao G, Zheng P, Shang Y, Su Y, Zhang X, Liu X, Zhan S, Leger RJS, Wang C: Trajectory and genomic determinants of fungal-pathogen speciation and host adaptation. Proceedings of the National Academy of Sciences 2014, 111(47):16796-16801.

  38. 38.

    Sun G, Xu Y, Liu H, Sun T, Zhang J, Hettenhausen C, Shen G, Qi J, Qin Y, Li J: Large-scale gene losses underlie the genome evolution of parasitic plant Cuscuta australis. Nature Communications 2018, 9(1):2683.

    PubMed  PubMed Central  Google Scholar 

  39. 39.

    Gluck-Thaler E, Cerutti A, Perez-Quintero AL, Butchacas J, Roman-Reyna V, Madhavan VN, Shantharaj D, Merfa MV, Pesce C, Jauneau A: Repeated gain and loss of a single gene modulates the evolution of vascular plant pathogen lifestyles. Science Advances 2020, 6(46):eabc4516.

    PubMed  PubMed Central  Google Scholar 

  40. 40.

    Melnyk RA, Hossain SS, Haney CH: Convergent gain and loss of genomic islands drive lifestyle changes in plant-associated Pseudomonas. ISME J 2019, 13(6):1575–1588.

    PubMed  PubMed Central  Google Scholar 

  41. 41.

    Spanu PD, Abbott JC, Amselem J, Burgis TA, Soanes DM, Stüber K, van Themaat EVL, Brown JK, Butcher SA, Gurr SJ: Genome expansion and gene loss in powdery mildew fungi reveal tradeoffs in extreme parasitism. Science 2010, 330(6010):1543–1546.

    PubMed  Google Scholar 

  42. 42.

    Baxter L, Tripathy S, Ishaque N, Boot N, Cabral A, Kemen E, Thines M, Ah-Fong A, Anderson R, Badejoko W: Signatures of adaptation to obligate biotrophy in the Hyaloperonospora arabidopsidis genome. Science 2010, 330(6010):1549–1551.

    PubMed  PubMed Central  Google Scholar 

  43. 43.

    Ramzi AB, Me MLC, Ruslan US, Baharum SN, Muhammad NAN: Insight into plant cell wall degradation and pathogenesis of Ganoderma boninense via comparative genome analysis. PeerJ 2019, 7:e8065.

    PubMed  PubMed Central  Google Scholar 

  44. 44.

    Tang B, Wang Y, Zhu J, Zhao W: Web Resources for Model Organism Studies. Genomics Proteomics Bioinformatics 2015, 13(1):64–68.

    Google Scholar 

  45. 45.

    Kitts PA, Church DM, Thibaud-Nissen F, Choi J, Hem V, Sapojnikov V, Smith RG, Tatusova T, Xiang C, Zherikov A: Assembly: a resource for assembled genomes at NCBI. Nucleic Acids Research 2016, 44(D1):D73-D80.

    Google Scholar 

  46. 46.

    Holt C, Yandell M: MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics 2011, 12(1):491.

    PubMed  PubMed Central  Google Scholar 

  47. 47.

    Ter-Hovhannisyan V, Lomsadze A, Chernoff YO, Borodovsky M: Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training. Genome Research 2008:gr. 081612.081108.

  48. 48.

    Korf I: Gene finding in novel genomes. BMC Bioinformatics 2004, 5(1):59.

    PubMed  PubMed Central  Google Scholar 

  49. 49.

    Finn RD, Clements J, Eddy SR: HMMER web server: interactive sequence similarity searching. Nucleic Acids Research 2011, 39(suppl_2):W29-W37.

    PubMed Central  Google Scholar 

  50. 50.

    Emms DM, Kelly S: OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biology 2015, 16(1):157.

    PubMed  PubMed Central  Google Scholar 

  51. 51.

    Katoh K, Standley DM: MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular Biol Evol 2013, 30(4):772–780.

    PubMed  PubMed Central  Google Scholar 

  52. 52.

    Wan S, Zou Q: HAlign-II: efficient ultra-large multiple sequence alignment and phylogenetic tree reconstruction with distributed and parallel computing. Algorithms Mol Biol 2017, 12:25.

    PubMed  PubMed Central  Google Scholar 

  53. 53.

    Zou Q, Hu Q, Guo M, Wang G: HAlign: Fast multiple similar DNA/RNA sequence alignment based on the centre star strategy. Bioinformatics 2015, 31(15):2475–2481.

    PubMed  Google Scholar 

  54. 54.

    Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T: trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 2009, 25(15):1972–1973.

    PubMed  PubMed Central  Google Scholar 

  55. 55.

    Stamatakis A: RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30(9):1312–1313.

    PubMed  PubMed Central  Google Scholar 

  56. 56.

    Letunic I, Bork P: Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Research 2016, 44(W1):W242-W245.

    PubMed Central  Google Scholar 

  57. 57.

    Sanderson MJ: r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock. Bioinformatics 2003, 19(2):301–302.

    PubMed  Google Scholar 

  58. 58.

    Chen J-J, Cui B-K, Zhou L-W, Korhonen K, Dai Y-C: Phylogeny, divergence time estimation, and biogeography of the genus Heterobasidion (Basidiomycota, Russulales). Fungal Diversity 2015, 71(1):185–200.

    Google Scholar 

  59. 59.

    Hacquard S, Kracher B, Hiruma K, Münch PC, Garrido-Oter R, Thon MR, Weimann A, Damm U, Dallery J-F, Hainaut M: Survival trade-offs in plant roots during colonization by closely related beneficial and pathogenic fungi. Nature Communications 2016, 7(1):1–13.

    Google Scholar 

  60. 60.

    De Bie T, Cristianini N, Demuth JP, Hahn MW: CAFE: a computational tool for the study of gene family evolution. Bioinformatics 2006, 22(10):1269–1271.

    PubMed  Google Scholar 

  61. 61.

    Potter SC, Luciani A, Eddy SR, Park Y, Lopez R, Finn RD: HMMER web server: 2018 update. Nucleic Acids Research 2018, 46(W1):W200-W204.

    PubMed  PubMed Central  Google Scholar 

  62. 62.

    Yin Y, Mao X, Yang J, Chen X, Mao F, Xu Y: dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Research 2012, 40(W1):W445-W451.

    PubMed Central  Google Scholar 

  63. 63.

    Huang L, Zhang H, Wu P, Entwistle S, Li X, Yohe T, Yi H, Yang Z, Yin Y: dbCAN-seq: a database of carbohydrate-active enzyme (CAZyme) sequence and annotation. Nucleic Acids Research 2017, 46(D1):D516-D521.

    PubMed Central  Google Scholar 

  64. 64.

    Blin K, Pascal Andreu V, de los Santos ELC, Del Carratore F, Lee SY, Medema MH, Weber T: The antiSMASH database version 2: a comprehensive resource on secondary metabolite biosynthetic gene clusters. Nucleic Acids Research 2018, 47(1):625–630.

    Google Scholar 

  65. 65.

    Petersen TN, Brunak S, von Heijne G, Nielsen H: SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods 2011, 8:785. doi: 710.1038/nmeth.1701.

  66. 66.

    Sperschneider J, Dodds PN, Gardiner DM, Singh KB, Taylor JM: Improved prediction of fungal effector proteins from secretomes with EffectorP 2.0. Mol Plant Pathol 2018, 19(9):2094–2110.

    PubMed  PubMed Central  Google Scholar 

  67. 67.

    Clarke KR: Non-parametric multivariate analyses of changes in community structure. Australian J Ecol 1993, 18(1):117–143.

    Google Scholar 

  68. 68.

    Chen H, Boutros PC: VennDiagram: a package for the generation of highly-customizable Venn and Euler diagrams in R. BMC Bioinformatics 2011, 12(1):35.

    PubMed  PubMed Central  Google Scholar 

  69. 69.

    Lemon J: Plotrix: a package in the red light district of R. R-news 2006, 6(4):8–12.

  70. 70.

    Ficklin SP, Sanderson L-A, Cheng C-H, Staton ME, Lee T, Cho I-H, Jung S, Bett KE, Main D: Tripal: a construction toolkit for online genome databases. Database 2011, 2011.

Download references


I would particularly like to acknowledge group members for their wonderful collaboration and patient support. I would also like to thank Zhibin Lv from Sichuan University and Yiwen Shao from Shandong University for their wonderful collaboration throughout my studies.


The work was supported by the State Key Laboratory of Tree Genetics and Breeding, the National Natural Science Foundation of China (No. 91935302, 62102269), China Postdoctoral Science Foundation (No. 2021M690029) and Post-doctoral Foundation Project of Shenzhen Polytechnic (No. 6021330009 K).

Author information




Ming Guo and Quan Zou designed this research. Yansu Wang performed the experiment and built the database. Jiacheng Yan and Jie Wu helped collect the data and complete the data processing. Yansu Wang drafted the manuscript. Liping Hou and Lei Xu revised the manuscript. All authors revised and approved the final manuscript.

Corresponding authors

Correspondence to Liping Hou or Quan Zou.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Wang, Y., Wu, J., Yan, J. et al. Comparative genome analysis of plant ascomycete fungal pathogens with different lifestyles reveals distinctive virulence strategies. BMC Genomics 23, 34 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Plant fungal pathogens
  • Carbohydrate-active enzymes
  • Secondary metabolites
  • Effector proteins
  • Gene gain and loss