Skip to main content

Comparative transcriptomics of Gymnosporangium spp. teliospores reveals a conserved genetic program at this specific stage of the rust fungal life cycle



Gymnosporangium spp. are fungal plant pathogens causing rust disease and most of them are known to infect two different host plants (heteroecious) with four spore stages (demicyclic). In the present study, we sequenced the transcriptome of G. japonicum teliospores on its host plant Juniperus chinensis and we performed comparison to the transcriptomes of G. yamadae and G. asiaticum at the same life stage, that happens in the same host but on different organs.


Functional annotation for the three Gymnosporangium species showed the expression of a conserved genetic program with the top abundant cellular categories corresponding to energy, translation and signal transduction processes, indicating that this life stage is particularly active. Moreover, the survey of predicted secretomes in the three Gymnosporangium transcriptomes revealed shared and specific genes encoding carbohydrate active enzymes and secreted proteins of unknown function that could represent candidate pathogenesis effectors. A transcript encoding a hemicellulase of the glycoside hydrolase 26 family, previously identified in other rust fungi, was particularly highly expressed suggesting a general role in rust fungi. The comparison between the transcriptomes of the three Gymnosporangium spp. and selected Pucciniales species in different taxonomical families allowed to identify lineage-specific protein families that may relate to the biology of teliospores in rust fungi. Among clustered gene families, 205, 200 and 152 proteins were specifically identified in G. japonicum, G. yamadae and G. asiaticum, respectively, including candidate effectors expressed in teliospores.


This comprehensive comparative transcriptomics study of three Gymnosporangium spp. identified gene functions and metabolic pathways particularly expressed in teliospores, a stage of the life cycle that is mostly overlooked in rust fungi. Secreted protein encoding transcripts expressed in teliospores may reveal new candidate effectors related to pathogenesis. Although this spore stage is not involved in host plant infection but in the production of basidiospores infecting plants in the Amygdaloideae, we speculate that candidate effectors may be expressed as early as the teliospore stage for preparing further infection by basidiospores.


Asian juniper trunk rust disease caused by Gymnosporangium japonicum is a major threat to juniper (Juniperus chinensis L.) plantations throughout China as well as in the rest of Asia. J. chinensis is widely planted as a landscaping tree in parks, gardens and Chinese temples, also its wood of high density and decay resistance properties makes it broadly used in construction, furniture, and pulp industries [1]. However, the ornamental and economic significances have been severely affected by the rust disease. Formation of G. japonicum teliospores causes tree shape malformation and the trunk becomes gradually cracked [2, 3].

G. japonicum exhibits a typical complex rust life cycle with four spore types and two alternate hosts (J. chinensis for the telial stage and Photinia villosa for the aecial stage). Most Gymnosporangium are known to be heteroecious and demicyclic, and all reported telial hosts are gymnosperms, which is different from most rust fungi for which gymnosperms are aecial hosts [2, 4]. From phylogenetic studies of the Pucciniales, Gymnosporangium evidently appears as an undescribed family-level lineage rather than a sub-group of Pucciniaceae [5]. G. japonicum, Gymnosporangium asiaticum and Gymnosporangium yamadae are the three most widespread Gymnosporangium species in Asia and the two later species are the causal agents of Japanese apple rust and Japanese pear rust diseases, respectively [6]. They all share J. chinensis as a telial host, but the parasitic symptoms are formed on different organs of the tree (Fig. 1). Telia formation is a relatively well conserved feature within Gymnosporangium spp. [6], however, unlike the two other Gymnosporangium species commonly found on J. chinensis, G. yamadae and G. asiaticum, telia formed by G. japonicum are mainly found on hard tissues such as trunks and branches [2]. In spring, wedge-shaped telial horns full of teliospores extrude from the juniper trunk and arrange in rows. After a heavy rain, telia and teliospores become gelatinous with bright-yellow colour and generate airborne basidiospores which can transfer to the aecial host P. villosa [2]. Once a compatible interaction is established between the host and G. japonicum, pycnia emerge on the upper surface of the host leaves within a few days. After cross-fertilization of pycniospores, tubular aecia are formed after a few more days on the lower surface of infected leaves, which produce dikaryotic aeciospores that can infect the junipers again (see [4] or [7] for more details about the rust demicyclic life cycle). G. japonicum overwinters as mycelium under the bark of the trunk of juniper, and telia emerge the next year or several years later when environmental conditions are suitable [8, 9]. An ultrastructure analysis of Gymnosporangium telia revealed that the meiosis takes place in teliospores shortly after karyogamy and continues without interruption [10]. On the contrary, the meiosis process of rust species with overwintering telia can begin before winter and can remain in meiosis until teliospores germinate in the next spring [11, 12]. In spite of the economic importance and the biological singularity of Gymnosporangium spp., there is very limited knowledge about molecular processes controlling the life cycle of these fungi. More particularly, there is no molecular data available for G. japonicum.

Fig. 1
figure 1

Illustration of rust disease symptoms caused by three Gymnosporangium species (G. asiaticum, G. yamadae and G. japonicum) on different organs of juniper (J. chinensis L.). Conical telia of G. asiaticum are folicolous and sometimes aggregated on witches’ broom; tongue-shaped telia of G. yamadae are formed on juniper gall, causing twigs dieback; whereas edge-shaped telia of G. japonicum are formed on trunk, causing tree shape malformation and gradually cracked trunk. Telia and teliospores can absorb water and become gelatinous with bright-yellow colour after a heavy rain

The availability of genome resources has greatly supported advanced research on rust fungi by identifying conserved features shared by different Pucciniales lineages [13]. However, the number of sequenced genomes is still limited for rust fungi compared to other fungal groups, and the large genome size of rust fungi hinders rapid progress for some taxonomical families within the Pucciniales [5, 14]. Transcriptomics has granted additional support towards the basic understanding of the rust functional genome through the use of DNA-arrays and more recently RNA-seq, allowing the study of different life stages [15, 16]; the prediction of secretome and candidate pathogenesis factors [17,18,19]; and the survey of molecular interaction between rust fungi and their hosts (see review [13] for details). However, rust transcriptomics has mostly focused on urediniospores, particularly in macrocyclic rust fungi [7]. There is very limited information about other stages or about rust fungi that exhibits life cycles different than macrocyclic rust species [11, 15, 16, 20]. Only a few studies have focused on the telial stage, which is crucial in the life cycle of demicyclic species such as Gymnosporangium [11, 21, 22]. In the absence of a reference genome for Gymnosporangium spp., almost no molecular information is available for this genus. Recently, expression profiling of teliospores of G. yamadae and G. asiaticum was performed by RNA-seq [22]. The comparison of the two transcriptomes identified many highly expressed transcripts shared in the two species at this stage, as well as transcripts also represented in the genomes of reference rust species like Melampsora larici-populina or Puccinia spp. [22]. Such high-throughput transcriptome studies are invaluable to provide prospect to understand gene functions and metabolic pathways in this rust group.

In this study, we performed an RNA-seq analysis to characterize the transcriptome of G. japonicum at the telial stage (infection of J. chinensis) and we determined the functional annotation of this life cycle stage with an emphasis on the secretome. We also took advantage of previously published transcriptome data in two other Gymnosporangium species at the same life cycle stage to identify lineage-specific gene families compared to other rust fungi, including candidate secreted effector proteins and carbohydrate active enzymes (CAZymes).


Transcriptome of G. japonicum teliospores

G. japonicum teliospores were collected from infected juniper branches in the Jiangsu province (China). Telia were visible in the form of a dense fungal mass of dark orange colour breaking through the bark of the juniper branch (Fig. 1). Teliospores were recovered by gently scratching the telia with a needle and inspection by light microscopy revealed typical two-cells spores attached to their pedicels and no contamination by plant material. Three biological replicates of G. japonicum teliospores were collected for total RNA extraction and were subjected to paired-end sequencing on Illumina Hiseq 2000 platform. In total, 49,245,136; 60,133,718; and 52,751,538 reads were obtained, respectively for the three replicates. After filtering low-quality reads and adapter sequences, we obtained a total of approximately 160 million clean reads (Additional file 1: Figure S1), which served as the input for an assembly with the Trinity program [23]. The clean reads of each replicate library were then mapped against this reference assembly and read count for each unigene was obtained using the RNA-Seq by Expectation Maximization (RSEM) software [24]. The mapping rates for each library were 82.9, 84.8 and 84.7%, respectively (Additional file 1: Figure S1). The generated assembly was composed of 40,583 transcripts, with a mean length of 1059-bp and 30,243 unigenes remained after elimination of transcripts redundancy (Additional file 1: Figure S1). A total of 15,722 CDS were determined from the unigenes using ESTscan and Blast against nr and swissprot as previously described [22]. Unigenes and transcript sequences are provided in the Additional files 2 and 3: Supplementary data 1 and 2. To further clarify the gene expression abundance in each library, FPKM values (fragments per Kb per million mapped reads) were calculated for each unigene (Additional file 4: Table S1). The FPKM density distribution profiles reflected consistent expression pattern across replicates (Additional file 5: Figure S2). Average FPKM value of the three replicates was further considered for all unigenes as a representative transcriptome of G. japonicum teliospores (Additional file 4: Table S1). In order to perform comparison across the Gymnosporangium lineage, we used transcriptome data previously published for G. yamadae and G. asiaticum teliospores [22]. The collection time (stage and time of the year in the life cycle) and the stage of maturation of teliospores on the same host tree, J. chinensis, were identical for the three Gymnosporangium species. The treatment of sequencing data and the assembly of unigenes performed in the transcriptome study by Tao and collaborators in 2017 [22] and in the present study were strictly identical. The Additional file 1: Figure S1 presents the sequencing metrics, and transcripts and unigenes assemblies which are all in very close range in term of numbers and unigenes length distribution. In some of the following sections, data from the three Gymnosporangium species were used for cross-comparison. In the G. japonicum teliospores transcriptome, 30,243 unigenes were identified, from which 15,722 CDS could be defined. These data served as a basis for annotation and comparison to transcriptomes from other Gymnosporangium spp.

Functional overview of genes expressed in G. japonicum teliospores

To obtain a detailed description and a functional annotation of the G. japonicum teliospores transcriptome, six public databases including the NCBI non-redundant (nr) and nucleotide (nt) databases, protein family (Pfam), SwissProt, eukaryotic orthologous groups (KOG), gene ontology (GO), and Kyoto encyclopaedia of genes and genomes (KEGG) databases were selected for similarity searches using Basic Local Alignment Search Tools (blastx and tblastx; e-value <1e− 5). A total of 18,872 unigenes showed similarity to at least one database and 2405 unigenes were annotated in all selected databases. The Table 1 gives the number of unigenes showing similarity in the seven databases and the details for each unigene can be retrieved in the (Additional file 4: Table S1). The nr database showed the largest number of homologs (13,165; 43.5%) for the G. japonicum unigenes set, which is detailed hereafter. Among these 13,165 unigenes, 41 and 25.1% had a length ranging between 200 and 500 bp, or larger than 2 Kbp, respectively (Fig. 2a). Besides, 39 and 56% of the unigenes showed similarity levels higher than 80%, and between 50 and 80%, respectively (Fig. 2b). Although about 45% of the 13,165 unigenes showed high homology e-values (ranging between 1e-50 to 1e-200), 45.2% of the unigenes with homology in nr had an e-value ranging between 1e-50 and 1e-10 (Fig. 2c). More than half of the 13,165 unigenes (56%) showed homology to Pucciniales sequences, notably 5521 (41.9%) and 1770 (13.4%) unigenes matched with Puccinia graminis f. sp. tritici and M. larici-populina, respectively (Fig. 2d). We also determined whether transposons were abundant in the dataset by comparing unigenes to Repbase (; blastx, e-value ≤10–5) and searching for transposon-related annotations in the results of the similarity search against nr. Only 75 and 138 unigenes presented similarity to repbase and transposon-related sequences in nr, respectively, indicating that transposons were marginal in the dataset.

Table 1 Annotation of Gymnosporangium japonicum unigenes in selected public databases
Fig. 2
figure 2

Homology of Gymnosporangium japonicum unigenes in the NCBI non-redundant (nr) database (blastx, e-value ≤10−5). a: Length distribution of G. japonicum unigenes with homologs in nr. b: Proportional distribution of similarity levels of G. japonicum unigenes with homologs in nr. c. E-value distribution of G. japonicum unigenes with homologs in nr. d: Proportional distribution by species of G. japonicum unigenes with homology in nr (details in the rust fungal order Pucciniales are provided)

Homology with KOG and KEGG databases were scrutinized to determine gene functions and metabolic pathways expressed in G. japonicum teliospores (Additional file 6: Table S2). A total of 7937 unigenes (26.3% of all unigenes) were assigned to 25 KOG functional categories. Unigenes falling in the “posttranslational modification, protein turnover, chaperones” category constituted the largest proportion (3.7%), followed by the category “translation, ribosomal structure and biogenesis” (3.4%). After genes placed in the category “general function prediction only”, a total of 2 and 1.9% of the expressed genes were classified in the “energy production and conversion” and “signal transduction mechanisms” categories, respectively. Of the 5444 unigenes assigned to 32 KEGG pathways, the three most important pathways were “translation” (913 unigenes), “carbohydrate metabolism” (669 unigenes) and “signal transduction” (605 unigenes) (Additional file 6: Table S2).

The distribution of the mean expression levels of unigenes across the G. japonicum teliospores replicates revealed a large proportion of unigenes with a low expression value (e.g. 12,480 unigenes with a Log10FPKM value below 0; Additional file 7: Figure S3), which is similar to observations made in transcriptomes of other Gymnosporangium spp. [22]. On the other hand, 17,763 and 6511 unigenes showed a Log10FPKM value above 0 and 1, respectively. A total of 977 unigenes showed a high expression level above 2 Log10FPKM, among which 65 showed a very high expression level above 3 Log10FPKM (Additional file 7: Figure S3). Among these, 44 (68%) unigenes had a homolog in the nr database (including 25 and 15 with the rust fungi P. graminis f. sp. tritici and M. larici-populina, respectively), and more than half of these unigenes encode hypothetical proteins with unknown function (Table 2). Two histones (GYJ|c28242_g1, GYJ|c12350_g1) and one glycoside hydrolase 26 (GH26, GYJ|c8249_g1) were among the most highly expressed unigenes. We searched all unigenes that belonged to teliospore-associated KOG categories based on previous transcriptomes for rust fungal teliospores [11, 21]. We found 26 unigenes encoding aquaporins, 3 unigenes encoding calcium transporting ATPase, 24 unigenes coding for pleiotropic drug resistance and 17 unigenes encoding multicopper oxidase laccase like proteins expressed in G. japonicum teliospores. Also, a large number of meiosis-related genes were found at this stage, including Dmc1, Spo11, Rec8 encoding meiotic recombination proteins; Mre1, Rad50, Rad51 coding for double-strand break repair proteins; the meiotic nuclear division protein Mnd1 and the meiotic cell division protein Dom34; and the meiotic checkpoint regulator Tsg24 (Additional file 4: Table S1). A total of 18,872 G. japonicum unigenes from the teliospores transcriptome were annotated according to diverse databases, allowing to derive subsets of highly expressed unigenes in functional categories.

Table 2 65 most highly expressed unigenes (FPKM value > 1000) in G. japonicum teliospores

Comparison of Gymnosporangium teliospore transcriptomes highlights commonly and specifically expressed genes

To get a better understanding of the functions expressed in the transcriptomes of Gymnosporangium spp. teliospores, we compared the transcriptomes of G. japonicum, G. yamadae and G. asiaticum teliospores which correspond to the same life cycle stage set on a same tree, J. chinensis, but in different organs [2, 22]. Distributions of unigenes in KOG categories were very similar in the surveyed Gymnosporangium spore stage (Fig. 3). In the three species, the number of unigenes without KOG functional annotation was predominant. Then, “posttranslational modification, protein turnover, chaperones” and “translation, ribosomal structure and biogenesis” categories were the most important annotated categories in the three species, followed by the categories “general function prediction only”; “energy production and conversion”; and “signal transduction mechanisms”. Since, G. japonicum teliospores are formed on woody tissues of the host tree J. chinensis, we gave a particular attention to CAZymes annotation, which are important enzymes involved in the plant cell wall decomposition [25]. For G. japonicum, G. yamadae and G. asiaticum, a total of 281, 284 and 297 unigenes were assigned to CAZymes families in the dbCAN database, respectively (Additional file 8: Table S3 and Additional file 9: Figure S4). Most CAZymes belonged to glycosyl hydrolases (GHs) families and only a few were grouped into carbohydrate binding modules families. Overall, the composition in each CAZyme type was relatively similar in G. japonicum compared with the two others (Additional file 9: Figure S4). Several unigenes belonging to the GH26 family, one from G. japonicum (GYJ|c4903_g1), two from G. yamadae (GYY|c14266_g2; GYY|c13067_g1) and four from G. asiaticum (GYA|c34482_g1; GYA|c1703_g1; GYA|c2116_g1; GYA|c20188_g1), were retrieved and in particular, one unigene from each species (GYJ|c4903_g1; GYY|c13067_g1 and GYA|c20188_g1) showed a higher expression level in teliospores. Other GHs from families 16, 17 and 18 showed high expression levels in the three Gymnosporangium spp. (Log10FPKM > 2; Additional file 8: Table S3), with different expression levels noticeable between members of these GH families in these three rust species. This result suggests that Gymnosporangium spp. may possess a common set of enzymes for plant cell wall decomposition and that expression reflects specific activities in infected host tissues. Overall, comparison of annotations of G. japonicum, G. yamadae and G. asiaticum unigenes indicates that the most highly represented functional KOG categories were similar in teliospores transcriptomes of the three rust species. Moreover, unigenes encoding CAZymes were similarly represented between Gymnosporangium spp. with particularly high expression profiles for specific GHs.

Fig. 3
figure 3

Distribution of unigenes from G. japonicum; and from G. asiaticum and G. yamadae in eukaryotic orthologous group (KOG) categories (x-axis). The main category corresponding to unknown function was removed for legibility. The manually annotated “secreted proteins” category refers to unigenes encoding predicted secreted proteins of unknown functions. Unigenes and proteomes of G. yamadae and G. asiaticum were retrieved from [22]

Comparison of Gymnosporangium teliospores predicted proteomes and secretomes identifies commonly and specifically expressed secreted proteins

CDS were determined in G. japonicum unigenes based on similarity in databases, which cannot identify new specific proteins. TransDecoder predicted 15,049 proteins from G. japonicum unigenes (Fig. 4). In total, 10,917 of these predicted proteins showed similarity in the nr database, which is similar to the number observed with CDS (Table 1). Predicted proteomes from G. asiaticum and G. yamadae were retrieved from [22] and compared by reciprocal best hits (BLASTp; e-value ≤1e-5). We found 5947 proteins conserved in all three Gymnosporangium species (Fig. 5; Additional file 10: Table S4). Moreover, 7548 and 7322 orthologous proteins were found between G. japonicum and G. asiaticum; and G. japonicum and G. yamadae, respectively. There were 6249; 6613 and 5461 proteins specifically detected in G. japonicum, G. asiaticum and G. yamadae, respectively (Fig. 5; Additional file 10: Table S4).

Fig. 4
figure 4

Proteomes and secretomes predicted from G. japonicum, G. yamadae and G. asiaticum unigenes. Proteomes were derived from unigenes using TransDecoder. Secretomes were predicted from proteomes using a combination of secretion predictors. Functional annotations of predicted secreted proteins were performed using the dbCAN, the Merops and the Lipase Engineering databases to identify putative carbohydrate active enzymes (CAZymes), proteases and lipases in each Gymnosporangium secretome. Unigenes and proteomes of G. yamadae and G. asiaticum were retrieved from [22]

Fig. 5
figure 5

Conserved and species-specific proteins identified between Gymnosporangium spp. Proteins predicted from G. japonicum (this study) and G. yamadae and G. asiaticum [22] unigenes were compared and homologs were identified by searching reciprocal blast hits (Blastp; e-value ≤1e-5)

Rust fungi possess large secretomes, which contain candidate virulence effectors that play a key role in fungal pathogenicity [7]. These effectors are often small secreted proteins of unknown function. Secreted proteins (SPs) were identified in the transcriptome of G. japonicum using a dedicated pipeline of secretion predictors as previously described [26]. Among the 15,049 predicted proteins, 854 (5.67%) corresponded to SPs (Additional file 11: Table S5). The predicted SPs had a size varying between 99 and 1930 amino acids, with a median length of 215aa (Fig. 6). Within SPs, four are homologues of the conserved rust transferred protein RTP1 [27]. Typical secreted proteins of CAZyme, protease and lipase categories were searched in dedicated databases. In summary, 20 SPs belonged to 18 CAZymes families, 60 SPs were annotated in 24 proteases families and six SPs were part of two lipases families (Fig. 4). Among the top 65 highly expressed unigenes, 19 corresponded to predicted SPs and 16 showed similarity to M. larici-populina or P. graminis f. sp. tritici proteins, including a glycoside hydrolase and a protease (Table 2). The same secretome prediction pipeline was applied to the previously published transcriptomes of G. yamadae and G. asiaticum teliospores [22] to allow comparison within Gymnosporangium species (Additional file 11: Table S5). In total, 775 SPs were predicted for G. yamadae, with a size ranging from 99 to 1083 amino acids and a median size of 215. In G. asiaticum, 722 SPs were predicted, with a size ranging from 99 to 1981 amino acids, with a median length of 200 (Fig. 4 and Fig. 6; Additional file 11: Table S5). The size distribution for the secreted proteins of the three Gymnosporangium species is relatively uniform and small SPs account for the largest proportion (Fig. 6). Some SPs were annotated as CAZymes (114 for G. yamadae and 93 for G. asiaticum), proteases (35 for G. yamadae and 38 for G. asiaticum) and lipases (not found in G. yamadae and 4 for G. asiaticum) using dedicated databases (Fig. 4).

Fig. 6
figure 6

Length distribution of secreted proteins in the predicted secretomes of G. japonicum, G. yamadae and G. asiaticum

Based on the secretome prediction, unigenes of unknown function corresponding to SPs were classified in a manually annotated category termed “secreted protein” determined after [16]. A total of 577 G. japonicum unigenes were classified in this “secreted protein” KOG category, whereas, 480 and 504 G. asiaticum and G. yamadae unigenes were classified in the same KOG category, respectively. In the three Gymnosporangium spp., the “secreted protein” category is among the most abundant KOG categories after signal transduction, representing 1.9, 1.4 and 1.3% of the respective unigenes (Fig. 3).

To conclude, proteome and secretome predictions in the three Gymnosporangium teliospore transcriptomes identified hundreds of SPs of unknown functions that may represent putative candidate virulence effectors in each rust species. More particularly, many SPs are among highly expressed transcripts in teliospores and show similarity to other rust fungi. Interestingly, SPs of unknown function are similarly relatively abundant in the three Gymnosporangium spp.

Comparison of Gymnosporangium spp. transcriptomes with Pucciniales genomes supports taxonomic proximity to the Pucciniaceae

In order to identify lineage-specific proteins in Gymnosporangium spp. and beyond within the Pucciniales, we compared the three available Gymnosporangium transcriptomes with deduced proteomes of selected basidiomycetes. Three other rust fungi were selected as representatives of different taxonomical families within the order Pucciniales for which a reference genome was available (i.e. the Cronartiaceae, the Melampsoraceae, and the Pucciniaceae) [28,29,30]. Microbotryum lychnidis-dioicae was selected as an outgroup fungal pathogen in the Microbotryomycetes, a sister class of Pucciniomycetes within the Pucciniomycotina [31]. A total of 1042 single copy genes conserved across the selected Pucciniales genomes, the three Gymnosporangium spp. transcriptomes and the outgroup genome in Microbotryomycetes were predicted by BUSCO [32] and were used to build a maximum likelihood tree with RAxML [33] (Fig. 7). Two rust clades were evident: C. quercuum f. sp. fusiforme clustered with M. larici-populina whereas the Gymnosporangium species were grouped with Puccinia graminis f. sp. tritici; G. japonicum branching more closely to G. yamadae than to G. asiaticum.

Fig. 7
figure 7

Phylogenomic relationship of three Gymnosporangium spp. with selected species in the Pucciniomycotina and identification of conserved proteins based on Multiclustering (MCL) analysis. A Maximum likelihood tree was constructed with RAxML using concatenated protein sequences from 1042 conserved single-copy genes found in the predicted proteomes of Gymnosporangium spp. teliospore transcriptomes and of four fungi selected in the Pucciniomycotina (Puccinia graminis f. sp. tritici, Melampsora larici-populina, Cronartium quercuum f. sp. fusiforme and Microbotryum lychnidis-dioicae). Bootstraps values are shown on branches (1000 replicates). Bold numbers shown at the species, genus or order levels, indicate lineage-specific clusters identified by the MCL analysis between the proteomes from the selected species, and bold numbers in bracket indicate the number of clusters remaining as specific after similarity search in the NCBI non-redundant protein database (Blastp; e-value ≤1e-5). The total number of proteins found in corresponding MCL clusters at each taxonomical level are shown non-bold below the cluster numbers

Comparison of Gymnosporangium spp. transcriptomes with Pucciniales identifies new specific rust fungal transcripts and putative candidate effectors

FastOrtho was used to conduct a Markov Cluster Algorithm (MCL) analysis of predicted proteomes of the seven selected fungal species [34]. Then, the specificity of proteins identified at any lineage level was assessed by comparison to the NCBI nr database (blastp, e-value ≤1e-5). A total of 1514 clusters (10,117 proteins) were identified for the six Pucciniales included in this study (Fig. 7; Additional file 12: Table S6). After filtering against the nr database, 1970 proteins within 305 clusters were considered as Pucciniales-specific (Fig. 7; Additional file 12: Table S6). To pinpoint protein families specific to the genus Gymnosporangium, we selected 1905 clusters, representing a total of 6010 proteins, which were only present in the three Gymnosporangium species after the MCL analysis (Fig. 7; Additional file 12: Table S6). After filtering against the nr database, 204 clusters representing a total of 635 proteins, were deemed as specific of Gymnosporangium spp. and specifically expressed in the life cycle stage teliospores (Fig. 7; Additional file 12: Table S6). Finally, at the species level, we found 180, 169 and 174 clusters (corresponding to 479, 403 and 467 proteins, respectively) that were specific to G. japonicum, G. yamadae and G. asiaticum, respectively (Fig. 7; Additional file 12: Table S6). After comparison to the nr database, 205 orphan proteins were specific to G. japonicum, of which 14 were predicted to encode SP, representing putative specific candidate effectors for this species. Additionally, there were 152 and 200 orphan proteins found in G. yamadae and G. asiaticum, respectively (Fig. 7; Additional file 12: Table S6). A total of 11 and 12 predicted secreted proteins were found in each of the later species, indicating candidate secreted effector proteins in G. yamadae and G. asiaticum, respectively. Previous genomic studies have revealed the presence of large secreted protein gene families in rust fungi that may represent diversifying candidate effector families [7, 35]. To identify such families within the three Gymnosporangium spp. transcriptomes, we selected clusters containing one or more SPs in each species (Additional file 13: Table S7). Among clusters with SPs, specific protein families containing only 3 to 4 SP members were retrieved (Additional file 13: Table S7). To conclude, the comparison of the teliospore transcriptomes with reference rust fungal genomes identified hundreds of orphans in each Gymnosporangium spp., as well as core proteins shared at the genus level or at the Pucciniales order, which informs us about the involvement of specific rust proteins at this stage of the life cycle.


In this study, we report the transcriptome profiling of G. japonicum teliospores and take advantage of the previously published transcriptomes of G. yamadae and G. asiaticum teliospores to perform comparison within this genus [22]. G. japonicum, G. yamadae, and G. asiaticum are the three most widespread Gymnosporangium species in Asian areas [6, 36]. They share J. chinensis as their telial host but they cause rust symptoms on different organs of juniper trees, which indicate major differences in their infection modes [2, 6]. In order to also unravel specificities of this life stage compared to heteroecious and macrocyclic rust fungi, we compare the three Gymnosporangium transcriptomes with predicted genes in the genomes of C. quercuum f. sp. fusiforme, M. larici-populina and P. graminis f. sp. tritici [28,29,30]. Gymnosporangium spp. may exhibit different gene expression patterns and metabolic processes at the teliospore stage on their unique telial host (gymnosperm) than heteroecious rust species [2, 4]. So far, only a few transcriptomic studies looked at gene expression at the telial stage in a few species [11, 21, 22]. Teliospores of macrocyclic rust fungi act as survival structures throughout winter on the telial host [7]. At the contrary, teliospores of Gymnosporangium species are not overwintering structures, they are produced transiently in spring under favourable conditions and they lead to the production of basidia then basidiospores within a few days [2, 37]. Basidiospores of Gymnosporangium have the capacity to infect plants of the subfamily Amygdaloideae, on which important damages can occur. It is crucial to understand the biology of this transient and unique life stage that differ from the corresponding one in macrocyclic rust fungi. So far, our understanding of the molecular processes at play in Gymnosporangium teliospores is still limited.

Rust fungi possess large numbers of genes and a majority have no known function, illustrating the need to better describe rust biology through global approaches such as transcriptomics [5]. This is illustrated by the low proportion of CDS predicted from the unigenes by similarity in nr and swissprot. A large proportion of highly expressed Gymnosporangium genes are annotated as proteins of unknown function with no homology in databases, indicating that teliospores undergo largely unknown biological processes. The distribution of unigenes annotated in KOG categories in the three Gymnosporangium teliospores transcriptomes suggests the expression of a conserved genetic program, despite the fact that each Gymnosporangium infect a different organ of J. chinensis. The most abundant cellular categories corresponded to translation, post-translation, signal transduction and energy production which supports that Gymnosporangium teliospores are highly active in spring, likely to prepare for the production of basidiospores and later infection of the aecial host. Plant pathogens like rust fungi secrete effector molecules inside cells of their host plants to manipulate immunity and physiology [38]. Secretome prediction allow to identify SPs that are candidate effectors that may play a role in pathogenesis [39,40,41]. Unigenes encoding predicted SP of unknown function likely contain candidate rust fungal effectors [41, 42]. With a total of 513 to 854 SPs detected in the three Gymnosporangium species, this category is also one of the most represented in expressed unigenes. Teliospores are not infecting living host tissues, thus the presence of a large number of expressed SPs maybe puzzling. However, teliospores lead to the production of basidia and basidiospores within only a few days in favourable conditions [2, 37] and it is tempting to speculate that candidate effector transcripts may be already produced at the teliospore stage to facilitate rapid infection of the aecial host by basidiospores. Also, it cannot be ruled out that secreted proteins may have unknown roles unrelated to host infection.

Functional annotation of top highly expressed genes in G. japonicum revealed a hemicellulose cleaving enzyme of the GH26 family and members of the GH16, 17, 18 families. Previous studies showed that the GH26 gene family is expanded in the genomes of M. larici-populina and P. graminis f. sp. tritici [28]. Unigenes presenting homology to GH26 genes were found in the three Gymnosporangium species with high expression levels. GH26 genes were previously shown to be highly expressed as well in M. larici-populina telia, urediniospores and during poplar infection [11, 28, 43]. Similarly, genes falling in this GH family were significantly induced in early needle infection stage of C. ribicola [15] and highly expressed in the urediniospores of P. triticina [29]. GH26 is a widespread hemicellulose-cleaving enzymes family conserved in rust species that may play an essential role in modifying plant cell wall to facilitate fungal invasion. Other functions previously reported in M. larici-populina and P. triticina teliospores, like multicopper oxidase encoding genes are represented among highly expressed genes of Gymnosporangium teliospores [11, 21]. Although the exact role of multicopper oxidase has yet to be resolved, our study completes the view of the functional transcriptome of teliospores and shows that this function might be critical at this stage. Finally, the composition in the different CAzyme categories was rather similar between the three Gymnosporangium species, indicating that the capacity to infect different organs of J. chinensis may not be reflected in the fungal equipment for plant cell wall decomposition, but rather in the expression or coordination of expression of genes encoding enzymes falling in these categories. Also, it cannot be ruled out that organ specificity may be driven by specific candidate SP effectors, rather than specific enzymatic capacity. Such organ-specific effectors were indeed reported in other fungal plant pathogens [44].

In overwintering macrocyclic rust fungi, ultrastructural microscopy observations showed that meiosis is blocked in prophase I at the diplonema stage and restart after germination in spring, consistent with expression profiling of karyogamy and meiosis-related genes [11, 12]. Ultrastructural analysis of Gymnosporangium teliospore maturation and basidiospores formation showed that meiosis begins very shortly after karyogamy. Also, meiosis is not interrupted because teliospores of Gymnosporangium germinate without a period of dormancy [10]. Some early stage karyogamy-related genes reported in M. larici-populina teliospores are not detected in G. japonicum, neither in G. asiaticum or G. yamadae [22] which implies that this process already took place in collected teliospores. Meiotic genes conserved in eukaryotic species [45] are found in G. japonicum teliospores. Meiosis anaphase genes are highly expressed in G. japonicum, G. asiaticum and G. yamadae, but not detected in M. larici-populina. For instance, genes encoding the meiotic cell division protein Dom34 and the checkpoint serine/threonine-protein kinase Bub1 are expressed in Gymnosporangium teliospores. Bub1 regulates chromosome segregation [46] and Dom34 is responsible for cell division [45]. These observations likely illustrate the differences in the timing of karyogamy and meiosis processes ongoing in teliospores of macrocyclic and demicyclic rust fungi. Evidence from a larger number of species is now needed to determine to which extent the onset of these biological processes differs between rust fungi with different life cycles.

Recent phylogenetic analysis revealed that the Gymnosporangium group is not confamilial with other members of Pucciniaceae as previous thought and more likely represents a family-level lineage [5]. Phylogenomics comparison of three Gymnosporangium spp. transcriptomes with the genomes of three rust species selected in different taxonomical families of the Pucciniales using predicted proteomes shows that Gymnosporangium stands on an independent branch with more proximity to Pucciniaceae than any other rust families. This comparison also confirms that G. japonicum is closer to G. yamadae than G. asiaticum, consistent with previous phylogenetic findings [6, 47, 48]. This phylogenomic approach also reveals proteins that are specific at different taxonomical levels, although it is not possible to definitively conclude on the extent of protein families from the transcriptome that is only the expressed portion of the gene complement in a given species. The comparison to the genomes of C. quercuum f. sp. fusiforme, M. larici-populina, P. graminis f. sp. tritici [28,29,30] reveals the presence of 1514 protein clusters shared at the order level in the MCL analysis, of which 305 are specific to Pucciniales after comparison with the nr database. Most of these specific genes are of unknown function ([5]; this study), nevertheless, the detection of their expression at this particular stage indicates that they likely play a role in teliospores, which brings new information to better delineate biology of rust fungi. Information about expression in teliospores is still lacking for some of the selected rust genomes and future studies are needed to better describe the genetic program expressed at this particular spore stage. Also, differences observed between demicyclic and macrocyclic rust fungi for key cellular processes like karyogamy and meiosis ([11, 22], this study), show that temporal dissection of spore maturation process is needed in rust fungi. On the other hand, the phylogenomic comparison using the MCL approach helps to identify genes that are either specific across the Gymnosporangium genus or specific to each Gymnosporangium species. It is tempting to speculate that some of these genes may be related to the telial host that is common to the three species. Future comparison to Gymnosporangium species infecting other gymnosperms will be needed to test this hypothesis and validate host-specific genes. Among highlighted specific genes, SPs represent candidate effectors that may play a role in infection of the aecial host by basidiospores. Identification of effectors is important to support progress in resistance deployment in host plants [7], particularly in Chinese apple orchards where G. yamadae causes major damages [49]. Discrimination of SPs in each Gymnosporangium spp. helps to narrow down the list of specific candidates and will support future effector studies to better understand the biology of infection as demonstrated for other rust species [50,51,52]. The transcriptome of G. japonicum and its comparison to other Gymnosporangium species bring new valuable information to gain insights into the biology of this genus of rust fungi poorly explored at the molecular level, as well as for the biology of Pucciniales in general since it focuses on a spore stage that is so far overlooked. Rust genomes are large and highly repetitive, and can be of several hundred megabases in size [5]. For instance, the genome size of the close-relative species Gymnosporangium confusum has been estimated at 893 Mb [14]. The genome size of other Gymnosporangium species is not yet determined but might be substantially large and may require the use of single molecule sequencing technologies, such as PacBio or Oxford Nanopore, to achieve complete sequencing [5]. These transcriptomic surveys are only the first glimpse into Gymnosporangium genomics and the next stage will be the sequencing of a representative genome to support progress in understanding pathogenesis of these fungi.


We presented the transcriptome analysis of G. japonicum teliospores collected from juniper tree and we conducted a comparative transcriptomic analysis with other Gymnosporangium spp. at the same stage of the life cycle. This stage is mostly overlooked in rust fungi and the complementary information provided here suggests that it is comparable across Gymnosporangium species in term of expressed cellular categories, likely to prepare for production of the next spore stage in the life cycle. Differences in the expression profiles of karyogamy and meiosis related genes in teliospores of Gymnosporangium species and in overwintering macrocyclic rust fungi likely highlight the different timing of related biological processes in these relative species. Comparison of Gymnosporangium transcriptomes to other Pucciniales fungi unravelled lineage-specific protein families, including Gymnosporangium spp. specific candidate secreted effectors expressed in teliospores, that remain to be functionally addressed.


Sample collection

Branch samples of juniper (J. chinensis L.) infected by G. japonicum were collected from the Yat-sen Mausoleum, Nanjing, Jiangsu, China, in March 2016. Telia are visible under the form of dark brown teliospore masses under the bark (Fig. 1). The identity of the rust fungus was confirmed by microscopy and by molecular identification (using ITS and TEF1; Tao S and Liang Y-M, unpublished observations). Batches of 50 mg of mature teliospores were recovered with a needle by gently scratching from beneath the bark of the branch. Three replicate samples were collected in sterile tubes. The samples were immediately snap frozen in liquid nitrogen and then stored at − 80 °C until processed.

RNA-sequencing and de novo assembly

Total RNA was isolated from 50 mg mature teliospores by using TRIzol Reagent (Invitrogen, CA, USA) and was subjected to a DNase treatment with the DNA-free™ DNA Removal Kit (Ambion, Austin, TX, United States) according to the furnisher’s instructions. The teliospores were ground to powder with sterile mortar and pestle in liquid nitrogen. RNA was stored in DEPC-treated water. RNA quality and quantity were determined with a NanoPhotometer® spectrophotometer (IMPLEN, CA, USA) and a Qubit® RNA Assay Kit in Qubit® 2.0 Fluorometer (Life Technologies, CA, USA). RNA integrity (RIN) was assessed using the RNA Nano 6000 Assay Kit of the Agilent Bioanalyzer 2100 system (Agilent Technologies, CA, USA). Three cDNA libraries of G. japonicum (DNJGJ_1, DNJGJ_2 and DNJGJ_3) were constructed with specific 6-bp nucleotide bar-coding tags, using a NEBNext® Ultra™ RNA Library Prep Kit for Illumina (NEB, USA) following the manufacturer’s protocols. Tagged cDNA libraries were pooled in equal ratios and used for 100-bp PE sequencing on the Illumina HiSeq2000 platform (Illumina, San Diego, CA, USA) at Novogene Bioinformatics Technology Co., Ltd. (Beijing, China). Raw reads from fastq files were firstly processed through in-house perl scripts at Novogene Bioinformatics Technology Co. In this step, clean reads were obtained by removing adapters, poly-N, and low-quality reads (based on Q20, Q30, GC-content and sequence duplication level using the Trimmomatic software [53];) from the raw data. The left files (read1 files) from all three libraries were pooled into one big left.fq file, and right files (read2 files) were pooled into one big right.fq file. A reference transcriptome assembly was determined based on the left.fq and right.fq using Trinity [23] with min_kmer_cov set to 2 by default and all other parameters set to default. Following the construction of the reference transcriptome, clean data from individual replicates were mapped back onto the assembled transcriptome (unigenes) and then read count for each gene was obtained from the mapping results, and gene expression levels were estimated by the software package RSEM, with Bowtie2 as a read aligner (default parameters, mismatch: 0) [24] for each library. Transcriptome assembly; unigene determination and putative CDS prediction; read count levels for the three replicates; and proteome prediction from unigenes were performed at Novogene Bioinformatics Technology Co following procedures previously described, using the same bioinformatic programs and versions [22].

Gene functional annotation

In order to determine the functional categories of unigenes, homology searches by BLASTX were performed against public databases (NCBI blast+ 2.2.28; cut-off e-value<1e-5), including NCBI nr and nt, KOG, KEGG pathways orthology and, Swissprot and Pfam databases. During assignment to KOG category, we added a KOG category termed “Secreted proteins” that consists of secreted proteins of unknown function as described in [16].

Prediction of secreted proteins

The secretome of G. japonicum was determined from the proteome using a custom pipeline combining different bioinformatic programs, including SignalP v.4, WolfPSort,Tmhmm, TargetP and PS-Scan algorithms as reported by [26]. The presence of nuclear localization signal (NLS) was defined with PredictNLS [54]. In order to accurately compare predicted secretomes from G. japonicum (this study) and from G. yamadae and G. asiaticum (previously published by [22]), the same pipeline was strictly applied to the three species. The predicted proteomes derived from the unigenes were used to search putative CAZymes in the dbCAN v2.0 HMM-based CAZy annotation server [55]. For the three predicted secretomes, functional annotations were also performed with Merops [56] and the Lipase Engineering database [57].

Gene family construction and Phylogenomics analysis

The complete proteomes of four fungi belonging to the Pucciniomycotina with available reference genome sequences were downloaded from the Joint Genome Institute Mycocosm website (; [58]): Cronartium quercuum f. sp. fusiforme G11 version 1.0 [30], Melampsora larici-populina v2 [28], Puccinia graminis f. sp. tritici CEL 75–36–700-3 race SCCL v2 [29]. Microbotryum lychnidis-dioicae p1A1 Lamole [31]. The three proteomes predicted for Gymnosporangium species (G. japonicum, this study, and G. yamadae and G. asiaticum from [22]) were used for comparison with reference rust proteomes. Gene families were clustered with fastOrtho MCL v12.135 [34] using inflation parameters of 3 and 50% identity and coverage.

The program Benchmarking Universal Single-Copy Orthologs (BUSCO, [32]) was independently run for the three Gymnosporangium transcriptome assemblies and the three selected Pucciniales genomes and in M. lychnidis-dioicae (Microbotryales), in order to detect the single copy genes from each assembly. The amino acid sequence of families found in all species were aligned using MAFFT and the alignments was then used to construct a phylogenomic tree using RAxML with 1000 bootstraps and the PROTGAM-MAAUTO substitution model [33]. The resulting tree was rooted with M. lychnidis-dioicae and visualized in Geneious and labels were manually placed to improve legibility.

Availability of data and materials

All raw sequence data generated in this study is deposited in the NCBI Sequence Read Archive ( under the accession number SRR8906254-SRR8906256. Additional previously published transcriptome data analyzed during this study are included in [22] and its supplementary information files. All other supporting data are included as additional files.



Carbohydrate active enzymes


Fragment per kilobase per million mapped reads


Glycosyl hydrolases


Gene ontology


Kyoto encyclopaedia of genes and genomes


Eukaryotic orthologous groups


Markov Cluster Algorithm


NCBI non-redundant database


NCBI Nucleotide database


Protein family


RNA-Seq by Expectation Maximization


Rust Transferred Protein 1


Secreted proteins


Small secreted proteins


  1. Zhao WJ, Jing WM, Liu DX, Zhang XL. Research on the growth characteristics of sabina. J Anhui Agric Sci. 2011;39:5847–8.

    Google Scholar 

  2. Kern FD. A revised taxonomic account of Gymnosporangium. Pennsylvania: Pennsylvania State University Press; 1973.

    Google Scholar 

  3. Yun HY, Lee SK, Kim KH. Two newly identified Gymnosporangium species, G. japonicum and G. cornutumn, in Korea. Korean Society of Plant Pathology. 2003;19:6.

    Google Scholar 

  4. Cummins GB, Hiratsuka Y. Illustrated genera of rust fungi. 3rd ed. Minnesota: American Phytopathological Society; 2003.

    Google Scholar 

  5. Aime MC, MacTaggart AR, Mondo SJ, Duplessis S. Phylogenetics and phylogenomics of rust fungi. Adv Genet. 2017;100:267–307.

    Article  PubMed  CAS  Google Scholar 

  6. Yun HY, Hong SG, Rossman AY, Lee SK, Lee KJ, Bae KS. The rust fungus Gymnosporangium in Korea including two new species, G. monticola and G. unicorne. Mycologia. 2009;101:790–809.

    Article  PubMed  Google Scholar 

  7. Lorrain C, Gonçalves dos Santos KC, Germain H, Hecker A, Duplessis S. Advances in understanding obligate biotrophy in rust fungi. New Phytol. 2019;222(3):1190–206.

    Article  PubMed  Google Scholar 

  8. Lee SK, Kakishima M. Surface structures of peridial cells of Gymnosporangium, and Roestelia (Uredinales). Mycoscience. 1999;40(2):121–31.

    Article  Google Scholar 

  9. Zhuang JY. Fungi flora of China. Beijing: Science Press; 2012. (In Chinese)

    Google Scholar 

  10. Mims CW. Ultrastructure of teliospore formation in the cedar-apple rust fungus Gymnosporangium juniperi-virginianae. Can J Bot. 1977;55:2319–29.

    Article  Google Scholar 

  11. Hacquard S, Delaruelle C, Frey P, Tisserant E, Kohler A, Duplessis S. Transcriptome analysis of poplar rust telia reveals overwintering adaptation and tightly coordinated karyogamy and meiosis processes. Front Plant Sci. 2013;4:456.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Boehm EWA, Wenstrom JC, McLaughlin DJ, Szabo LJ, Roelfs AP, Bushnell WR. An ultrastructural pachytene karyotype for Puccinia graminis f. sp. tritici. Can. J. Bot. 1992;70:401–13.

    Article  Google Scholar 

  13. Duplessis S, Bakkeren G, Hamelin R. Advancing knowledge on biology of rust fungi through genomics. Advances in Botanic Research. 2014;70:173–209.

    Article  Google Scholar 

  14. Tavares S, Ramos AP, Pires AS, Azinheira HG, Caldeirinha P, Link T, et al. Genome size analyses of Pucciniales reveal the largest fungal genomes. Front Plant Sci. 2014;5:422.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Liu JJ, Sturrock RN, Sniezko RA, Williams H, Benton R, Zamany A. Transcriptome analysis of the white pine blister rust pathogen Cronartium ribicola: de novo assembly, expression profiling, and identification of candidate effectors. BMC Genomics. 2015;16:678.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  16. Lorrain C, Marchal C, Hacquard S, Delaruelle C, Petrowski J, Petre B, et al. The rust fungus Melampsora larici-populina expresses a conserved genetic program and distinct sets of secreted protein genes during infection of its two host plants, larch and poplar. Mol Plant-Microbe Interact. 2018;31:695–706.

    Article  CAS  PubMed  Google Scholar 

  17. Fernandez D, Tisserant E, Talhinhas P, Azinheira H, Vieira A, Petitot AS, et al. 454-pyrosequencing of Coffea arabica leaves infected by the rust fungus Hemileia vastatrix, reveals in planta -expressed pathogen-secreted proteins and plant functions in a late compatible plant–rust interaction. Mol Plant Pathol. 2012;13:17–37.

    Article  CAS  PubMed  Google Scholar 

  18. Link TI, Lang P, Scheffler BE, Duke MV, Graham MA, Cooper B, et al. The haustorial transcriptomes of Uromyces appendiculatus and Phakopsora pachyrhizi and their candidate effector families. Mol Plant Pathol. 2014;15:379–93.

    Article  CAS  PubMed  Google Scholar 

  19. Garnica DP, Upadhyaya NM, Dodds PN, Rathjen JP. Strategies for wheat stripe rust pathogenicity identified by transcriptome sequencing. PLoS One. 2013;8:e67150.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Talhinhas P, Azinheira HG, Vieira B, Loureiro A, Tavares S, Batista D, et al. Overview of the functional virulent genome of the coffee leaf rust pathogen Hemileia vastatrix with an emphasis on early stages of infection. Front Plant Sci. 2014;5:88.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Xu J, Linning R, Fellers J, Dickinson M, Zhu W, Antonov I, et al. Gene discovery in EST sequences from the wheat leaf rust fungus Puccinia triticina sexual spores, asexual spores and haustoria, compared to other rust and corn smut fungi. BMC Genomics. 2011;12:161.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Tao SQ, Cao B, Tian CM, Liang YM. Comparative transcriptome analysis and identification of candidate effectors in two related rust species (Gymnosporangium yamadae and Gymnosporangium asiaticum). BMC Genomics. 2017;18:651.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  23. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011;29:644–52.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Henrissat B, Surolia A, Stanley P. A genomic view of Glycobiology. In: Varki A, Cummings RD, Esko JD, Stanley P, Hart GW, Aebi M, et al., editors. Essentials of Glycobiology. New York: Cold Spring Harbor Laboratory Press; 2015.

    Google Scholar 

  26. Pellegrin C, Morin E, Martin FM, Veneault-Fourrey C. Comparative analysis of secretomes from ectomycorrhizal fungi with an emphasis on small-secreted proteins. Front Microbiol. 2015;6:1278.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Kemen E, Kemen AC, Rafiqi M, Hempel U, Mendgen K. Identification of a protein from rust fungi transferred from haustoria into infected plant cells. Mol Plant-Microbe Interact. 2005;18(11):1130.

    Article  CAS  PubMed  Google Scholar 

  28. Duplessis S, Cuomo CA, Lin YC, Aerts A, Tisserant E, Veneault-Fourrey C, et al. Obligate biotrophy features unravelled by the genomic analysis of rust fungi. Proc Natl Acad Sci U S A. 2011;108:9166–71.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Cuomo CA, Bakkeren G, Khalil HB, Panwar V, Joly D, Linning R, et al. Comparative analysis highlights variable genome content of wheat rusts and divergence of the mating loci. G3: genes. Genomes, Genetics. 2017;7(2):361–76.

    CAS  Google Scholar 

  30. Pendleton AL, Smith KE, Feau N, Martin FM, Grigoriev IV, Hamelin R, et al. Duplications and losses in gene families of rust pathogens highlight putative effectors. Front Plant Sci. 2014;5:299.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Perlin MH, Amselem J, Fontanillas E, Toh SS, Chen Z, Goldberg J, et al. Sex and parasites: genomic and transcriptomic analysis of Microbotryum lychnidis-dioicae, the biotrophic and plant-castrating anther smut fungus. BMC Genomics. 2015;16(1):461.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  32. Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single copy orthologs. Bioinformatics. 2015;31:3210–2.

    Article  PubMed  CAS  Google Scholar 

  33. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Wattam AR, Abraham D, Dalay O, Disz TL, Driscoll T, Grabbard JL, et al. PATRIC, the bacterial bioinformatics database and analysis resource. Nucl Acids Res. 2014;42(D1):D581–91.

    Article  CAS  PubMed  Google Scholar 

  35. Hacquard S, Joly DL, Lin YC, Tisserant E, Feau N, Delaruelle C, et al. A comprehensive analysis of genes encoding small secreted proteins identifies candidate effectors in Melampsora larici-populina (poplar leaf rust). Mol Plant-Microbe Interact. 2012;25:279–93.

    Article  CAS  PubMed  Google Scholar 

  36. Yun HY, Lee SK, Lee KJ. Identification of aecial host ranges of four Korean Gymnosporangium species based on the artificial inoculation with teliospores obtained from various forms of telia. The plant pathology journal. 2005;21(4):310–6.

    Article  Google Scholar 

  37. Dong XL, Li BH, Zhang ZF, Li BD, Xu XM. Effect of environmental conditions on germination and survival of teliospores and basidiospores of the pear rust fungus (Gymnosporangium asiaticum). Eur J Plant Pathol. 2006;115(3):341–50.

    Article  Google Scholar 

  38. Uhse S, Djamei A. Effectors of plant-colonizing fungi and beyond. PLoS Pathog. 2018;14(6):e1006992.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  39. Stergiopoulos I, de Wit PJ. Fungal effector proteins. Annu Rev Phytopathol. 2009;47:233–63.

    Article  CAS  PubMed  Google Scholar 

  40. Sperschneider J, Dodds PN, Taylor JM, Duplessis S. Computational methods for predicting effectors in rust pathogens. Methods Mol Biol. 2017;1659:73–83.

    Article  CAS  PubMed  Google Scholar 

  41. Lorrain C, Petre B, Duplessis S. Show me the way: rust effector targets in heterologous plant systems. Curr Opin Microbiol. 2018;46:19–25.

    Article  CAS  PubMed  Google Scholar 

  42. Petre B, Joly DL, Duplessis S. Effector proteins of rust fungi. Front Plant Sci. 2014;5:416.

    PubMed  PubMed Central  Google Scholar 

  43. Duplessis S, Hacquard S, Delaruelle C, Tisserant E, Frey P, Martin F, et al. Melampsora larici-populina transcript profiling during germination and timecourse infection of poplar leaves reveals dynamic expression patterns associated with virulence and biotrophy. Mol Plant-Microbe Interact. 2011;24(7):808.

    Article  CAS  PubMed  Google Scholar 

  44. Schilling L, Matei A, Redkar A, Walbot V, Doehlemann G. Virulence of the maize smut Ustilago maydis is shaped by organ-specific effectors. Mol Plant Pathol. 2014;15:780–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Malik SB, Pightling AW, Stefaniak LM, Schurko AM, Logsdon JM Jr. An expanded inventory of conserved meiotic genes provides evidence for sex in Trichomonas vaginalis. PLoS One. 2008;3:e2879.

    Article  PubMed Central  Google Scholar 

  46. Klebig C, Korinth D, Meraldi P. Bub1 regulates chromosome segregation in a kinetochore-independent manner. J Cell Biol. 2009;185(5):841–58.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Zhao P, Liu F, Li YM, Cai L. Inferring phylogeny and speciation of Gymnosporangium species, and their coevolution with host plants. Sci Rep. 2016;6:29339.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Cao B, Tian CM, Liang YM. Gymnosporangium huanglongense sp. nov. from western China. Mycotaxon. 2016;131(2):375–83.

    Article  Google Scholar 

  49. Wang SQ, Yang ZP, Yu J. The reasons that apple rust happens seriously and comprehensive prevention and control measures. China Garden Abstr. 2010;26:145–6 (In Chinese).

    CAS  Google Scholar 

  50. Petre B, Saunders DG, Sklenar J, Lorrain C, Win J, Duplessis S, et al. Candidate effector proteins of the rust pathogen Melampsora larici-populina target diverse plant cell compartments. Mol Plant-Microbe Interact. 2015;28(6):689–700.

    Article  CAS  PubMed  Google Scholar 

  51. Petre B, Saunders DG, Sklenar J, Lorrain C, Krasileva KV, Win J, et al. Heterologous expression screens in Nicotiana benthamiana identify a candidate effector of the wheat yellow rust pathogen that associates with processing bodies. PLoS One. 2016;11(2):e0149035.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  52. Qi M, Grayczyk JP, Seitz JM, Lee Y, Link TI, Choi D, et al. Suppression or activation of immune responses by predicted secreted proteins of the soybean rust pathogen Phakopsora pachyrhizi. Mol Plant-Microbe Interact. 2018;31:163–74.

    Article  CAS  PubMed  Google Scholar 

  53. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Nair R, Rost B. Better prediction of sub-cellular localization by combining evolutionary and structural information. Proteins. 2003;53:917–30.

    Article  CAS  PubMed  Google Scholar 

  55. Zhang H, Yohe T, Huang L, Entwistle S, Wu P, Yang Z, et al. dbCAN2: a meta server for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2018;46:W95–W101.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Rawlings ND, Barrett AJ, Finn R. Twenty years of the Merops database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 2016;44:D343–50.

    Article  CAS  PubMed  Google Scholar 

  57. Fischer M, Pleiss J. The lipase engineering database: a navigation and analysis tool for protein families. Nucleic Acids Res. 2003;31:319–21.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Grigoriev IV, Nikitin R, Haridas S, Kuo A, Ohm R, Otillar R, et al. MycoCosm portal: gearing up for 1000 fungal genomes. Nucleic Acids Res. 2013;42:D699–704.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

Download references


The authors would like to thank the anonymous reviewers for their thorough and constructive comments on the manuscript.


This work was financed by the National Natural Science Foundation of China (No. 31870628). SQT is supported by the China Scholarship Council for the Joint PhD Program between Beijing Forestry University and INRA-Nancy (No. 201806510009). EM and SD are supported by the French ‘Investissements d’Avenir’ program (ANR-11-LABX-0002-01, Lab of Excellence ARBRE). The funding bodies played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Authors and Affiliations



SD and YML conceived and designed the project. BC collected the samples. EM performed bioinformatics and analysed these data. SQT performed the experiments and drafted the manuscript. SD and YML revised the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Ying-Mei Liang or Sébastien Duplessis.

Ethics declarations

Ethics approval and consent to participate

All the samples of Junipers (Juniperus chinensis L.) with rust pathogens G. japonicum were obtained from Nanjing, Jiangsu, China. All the collecting procedures were carried out in accordance with the local legislation.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Figure S1.

General read sequences and mapping information (top table); transcripts, unigenes and full-length CDS description (bottom table); and unigene length distribution (bottom graph) obtained for the G. japonicum teliospores transcriptome. The same information for the species G. asiaticum and G. yamadae were extracted from [22] and added side to side with G. japonicum to allow for direct metrics comparison. (PDF 345 kb)

Additional file 2: Supplementary Data 1.

Fasta file containing 40,583 G. japonicum transcript sequences. (FASTA 45632 kb)

Additional file 3: Supplementary Data 2.

Fasta file containing 30,243 G. japonicum unigene sequences. (FASTA 24636 kb)

Additional file 4: Table S1.

FPKM values and annotation results for each assembled G. japonicum unigene. (XLSX 7706 kb)

Additional file 5: Figure S2.

Density distribution profile of FPKM values in the three G. japonicum replicate libraries (DNJGJ_1, DNJGJ_2 and DNJGJ_3). (PDF 70 kb)

Additional file 6: Table S2.

KOG and KEGG annotation and classification of G. japonicum unigenes. (XLSX 93 kb)

Additional file 7: Figure S3.

Distribution of FPKM values for G. japonicum unigenes. The numbers of unigenes more expressed than each Log10(FPKM) level are detailed. Log10(FPKM) are used as arbitrary separations of highly, moderately and less expressed gene categories. (PDF 5281 kb)

Additional file 8: Table S3.

CAZymes annotation results for Gymnosporangium unigenes in the dbCAN v2.0. AA, CAZyme with auxiliary activities; CBM, carbohydrate binding module; CE, carbohydrate esterase; GH, glycosyl hydrolase; GT, glycosyl transferase; PL, polysaccharide lyase. FPKM and Log10FPKM values are reported along CAZyme annotation. (XLSX 42 kb)

Additional file 9: Figure S4.

Distribution of G. japonicum, G. yamadae and G. asiaticum predicted proteins in CAZymes families in dbCAN v2.0. (PDF 27 kb)

Additional file 10: Table S4.

Reciprocal best Blastp hits results between Gymnosporangium spp. predicted proteomes. GYA, G. asiaticum; GYJ, G. japonicum; GYY, G. yamadae. (XLSX 627 kb)

Additional file 11: Table S5.

Secreted proteins predicted in G. japonicum, G. yamadae and G. asiaticum. CPL stands for CAZymes, proteases and lipases homology prediction; SP is for swissprot blastp homology search; perc C for the percentage of cysteine residues in the amino acid sequence and NLS stands for Nuclear Localisation Signal prediction. (XLSX 261 kb)

Additional file 12: Table S6.

Gymnosporangium spp. specific clusters identified by Markov Cluster Algorithm (MCL). Cluster IDs are provided and secretion prediction is indicated. Gymas/GYA, G. asiaticum; Gymja/GYJ, G. japonicum; Gymya/GYY, G. yamadae. (XLSX 276 kb)

Additional file 13: Table S7.

Clusters of secreted proteins for the three Gymnosporangium species. Cluster ID from the MCL analysis; prot nb, number of proteins in the given cluster; SP nb, number of predicted secreted proteins in the cluster; and detailed protein ID for secreted proteins. Novel secreted proteins with no hit in the NCBI nr database are indicated. (XLSX 13 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Tao, SQ., Cao, B., Morin, E. et al. Comparative transcriptomics of Gymnosporangium spp. teliospores reveals a conserved genetic program at this specific stage of the rust fungal life cycle. BMC Genomics 20, 723 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Gymnosporangium
  • Pucciniales
  • Secretome
  • Phylogenomics
  • Candidate effectors