Skip to main content
  • Research article
  • Open access
  • Published:

A new esophageal gland transcriptome reveals signatures of large scale de novo effector birth in the root lesion nematode Pratylenchus penetrans



The root lesion nematode Pratylenchus penetrans is a migratory plant-parasitic nematode responsible for economically important losses in a wide number of crops. Despite the importance of P. penetrans, the molecular mechanisms employed by this nematode to promote virulence remain largely unknown.


Here we generated a new and comprehensive esophageal glands-specific transcriptome library for P. penetrans. In-depth analysis of this transcriptome enabled a robust identification of a catalogue of 30 new candidate effector genes, which were experimentally validated in the esophageal glands by in situ hybridization. We further validated the expression of a multifaceted network of candidate effectors during the interaction with different plants. To advance our understanding of the “effectorome” of P. penetrans, we adopted a phylogenetic approach and compared the expanded effector repertoire of P. penetrans to the genome/transcriptome of other nematode species with similar or contrasting parasitism strategies. Our data allowed us to infer plausible evolutionary histories that shaped the effector repertoire of P. penetrans, as well as other close and distant plant-parasitic nematodes. Two remarkable trends were apparent: 1) large scale effector birth in the Pratylenchidae in general and P. penetrans in particular, and 2) large scale effector death in sedentary (endo) plant-parasitic nematodes.


Our study doubles the number of validated Pratylenchus penetrans effectors reported in the literature. The dramatic effector gene gain in P. penetrans could be related to the remarkable ability of this nematode to parasitize a large number of plants. Our data provide valuable insights into nematode parasitism and contribute towards basic understating of the adaptation of P. penetrans and other root lesion nematodes to specific host plants.


The genus Pratylenchus (Pratylenchidae) comprises more than 100 valid species, with some of them being highly important due to the damage they cause in economically important crops [1, 2]. Members of the Pratylenchus genus are known as root lesion nematodes (RLNs) and belong to the third most important group of plant-parasitic nematodes (PPNs) [3]. One of the common features of these migratory nematodes is their dynamic behavior within the host roots, i.e. nematodes do not become sedentary and are able to migrate in and out of the roots, causing extensive damage as they do so. In the absence of a host crop, most of these polyphagous nematodes are able to survive on weeds or adapt their survival mechanisms (e.g. anhydrobiosis), which make them difficult to control [4].

Pratylenchus penetrans is one of the most successful and economically devastating RLN species with a wide range distribution, associated with more than 400 hosts worldwide [4]. The infection process of P. penetrans in different plants has been comprehensively investigated (e.g. [5, 6]). Like other RLNs, it can enter the plant along the entire length of the root. Once inside of the roots, nematodes migrate and feed almost exclusively from the cortical cells, where they cause mechanical damage, browning, and necrosis of the root tissue. Infection of the roots by P. penetrans often results in the release of phenolic compounds, oxidation of which has been associated with the browning of the root tissues [4]. Complex networks of defense genes and secondary metabolites have been identified in alfalfa as part of host responses to P. penetrans infection [7]. Nevertheless, the molecular mechanisms that RLNs employ to promote virulence remain largely unknown.

An important feature of all PPNs is the presence of a repertoire of secreted proteins (known as effectors), which are critical components determining the outcome of the plant-nematode interactions. The majority of these effectors are synthesized in the esophageal glands of PPNs (one dorsal and two sub-ventral glands) and are ultimately secreted through the nematode stylet into plant tissues [8]. During infection, PPNs can deploy dozens of different effectors that are capable of manipulating and suppressing key molecular pathways of the plant in order to complete their life cycle. Although a growing number of nematode effectors have been analyzed [9], different parasitism strategies, host range, variability, and composition of the effector repertoire complicate their identification and characterization.

Over the past decade, a rapid advance of genomic and transcriptomic sequencing approaches has greatly accelerated the identification of nematode effectors. Specialized techniques for RNA extraction from single cells have been adapted for the esophageal glands of PPNs, which resulted in generation of gland-specific libraries for different nematodes and identification of a significant number of candidate effector genes [10].

The explanation for the ability of P. penetrans to parasitize a wide range of hosts has yet to be determined, but it may, at least in part, lie in its effector repertoire. The annotation of PPN effector proteins often relies on sequence similarity to known effectors, or prediction via promoter motif elements (e.g., [11, 12]). Sequence data from both sedentary and migratory plant-parasitic nematodes have provided insights into the conservation of some effectors, which presumably reveals basic function(s) required for nematode parasitism [9]. Contrary to sedentary PPNs, RLNs do not induce complex feeding sites like syncytia or giant-cells and their effector repertoires are likely to reflect this difference. In our previous work, we have generated an extensive catalogue of candidate secreted proteins for P. penetrans [13]. A significant proportion of those predicted proteins have no homologues, and their functions are unknown, indicating a distinct complement of the P. penetrans “effectorome”. More recently, we determined the spatial expression of 23 candidate effectors within the esophageal glands of P. penetrans by in situ hybridization assays [14, 15]. These proteins comprised common signatures of PPN effectors with a diversity of known functions (e.g. cell wall-degrading enzymes, CWDEs). In addition, a set of pioneer genes specific to P. penetrans, that possesses unique features has been identified [14]. A high proportion of the predicted secreted proteins of P. penetrans showed no similarities with proteins deposited in public datasets, which raises the possibility that additional effectors for this species remained unexplored.

Here, we generated a new and more comprehensive esophageal glands-specific transcriptome library of P. penetrans. An in-depth analysis of this transcriptome led to the identification of new candidate effector genes for this species, which are specifically expressed in the esophageal glands. Many of these novel genes are phylogenetically restricted to P. penetrans or the Pratylenchus genus.


A more complete roster of candidate effector genes of Pratylenchus penetrans

The current library generated from the esophageal glands of P. penetrans resulted in ~ 150 million raw reads. The raw reads were cleaned and approximately 99 million high-quality reads were mapped against the 23,715 transcripts previously generated from the same P. penetrans isolate [13], and against the genomic sequence of the recently discovered root lesion nematode virus 1, which was found to be associated with the same nematode isolate [16]. As it is common in similar datasets [12], a large proportion of the reads (~ 90 million) mapped to the nematode 18S and 28S ribosomal genes. Surprisingly, approximately two million reads mapped to the genome of root lesion nematode virus 1, thus confirming our earlier observations on the virus localization adjacent to the nematode esophageal glands [16]. The remaining 6,380,993 reads mapped to 11,514 nematode transcripts, with coverage ranging from 1 to 239,619 reads. A tenfold increase in the transcripts number observed in the present dataset as compared to the previous gland cell sequencing attempts of P. penetrans ([10]; n = 1098 versus n = 11,514), was largely due to the improved extraction procedures of a technically difficult experiment and higher sequencing depth.

Some of the key features for the identification of candidate effectors, which are expected to be secreted by the classical ER-Golgi secretory pathway, are the presence of a signal peptide and absence of a transmembrane domain. Of the 11,514 transcripts with at least one read in the gland cell library, 864 (7.5%) encode putatively secreted proteins. All transcripts were assigned to one of twelve bins (increasing in a Log2 series) based on how highly they were represented in the gland cell library (Fig. 1). As representation in the gland cell library increased, two general trends were observed: 1) the total number of transcripts decreased; and 2) the relative proportion of transcripts that encode proteins with a signal peptide increased. The proportion of transcripts encoding proteins with a predicted signal peptide was significantly enriched (hypergeometric test, p-value between 0.001 and 1e− 20) in all bins above FPKM > 8 (Fragments Per Kilobase of Transcript per Million mapped reads), with the exception of the most highly represented bin due to a low n (FPKM > 1024, p > 0.1). In some of the most highly represented bins, 80–100% of transcripts encode putatively secreted proteins (Fig. 1). Assuming representation in the library is a function of expression in the gland cells, these observations support the important secretory function of the nematode esophageal glands. Since transcripts with low coverage in the gland cell library could not be distinguished from those derived from non-specific sequenced RNA originated from tissues potentially adjacent to gland cells (due to a technically challenging gland cell extraction procedure), a highly stringent cut-off value was used as a conservative measure of bona fide gland cell expression to identify transcripts to be further studied in this work. We used the statistically significant enrichment of secretory proteins (p < 0.001) as a proxy to establish the minimum coverage bin for further analyses. A total of 230 transcripts from the highly enriched bins (FPKM > 8) encode putatively secreted proteins with no transmembrane domain (Additional file 1: Table S1).

Fig. 1
figure 1

Schematic representation of all gland-cell transcripts. White bars were assigned to one of twelve bins (increasing in a Log2 series) based on their abundance using the fragment per kilobase of transcript per million mapped reads (FPKM) values in the gland cell library. The proportion of transcripts encoding proteins with a predicted signal peptide (red bars) was significantly enriched (p-value between 0.001 and 1e− 20, yellow bars) in all bins above FPKM > 8, with the exception of the most highly represented bin due to a low number of transcripts (FPKM > 1024, p > 0.1)

Mining for new candidate effector genes

To evaluate the validity of this strategy, we first queried for the presence of genes that were previously confirmed in the esophageal glands of P. penetrans. Remarkably, 91% (21/23) of the genes validated previously were in this set of 230 transcripts with FPKM > 8 in the gland-cell library (Fig. 2a and Additional file 1: Table S1). These included transcripts encoding proteins with known annotation (e.g. CWDEs), as well as seven previously identified “pioneer effectors” [14]. Most of the 21 previously validated genes ranked among the top of this list, when ranked by representation in the gland cell library (Additional file 1: Table S1). Two previously validated gland cell expressed genes were absent from this set of 230 transcripts: a fatty acid- and retinoid-binding protein (FPKM = 1.21) and a pectin methylesterase (FPKM = 1.50) (Additional file 2: Table S2).

Fig. 2
figure 2

Characterization of the most abundant transcripts encoding putative secreted proteins collected from the esophageal gland library of Pratylenchus penetrans. a Venn diagram showing the number of experimentally validated candidate effectors within the top 230 most abundant transcripts. b Most abundant Pfam protein domains represented within the list of 230 transcripts encoding putative secreted proteins (e-value <1e− 5)

A detailed examination of the remaining 209 transcripts revealed an extensive overlap with genes previously reported to be involved in parasitism of other PPNs (Fig. 2b and Additional file 1: Table S1). For instance, transcripts encoding various proteases [17], transthyretin-like proteins [12, 18], protein disulfide isomerases [19], protease inhibitors [14], an acid phosphatase [20], and a saposin precursor [21] were identified. In addition, some transcripts were similar to putative effectors recently identified for the cyst nematode Heterodera avenae (BLAST e-value < 10− 5, [22]). It is noteworthy that the majority (133/230, 57%) of the 230 transcripts encode novel proteins lacking any functional annotations or similar sequences in the NR database (BLAST e-value < 10− 5).

Given that other candidate effectors could be present below the established cut-off (FPKM < 8), we assessed the distribution of additional known effector genes among those transcripts encoding predicted secreted proteins (Additional file 2: Table S2). This led to the identification of other relevant candidate effectors (1 < FPKM < 8), such as transcripts encoding an esophageal gland protein of the root-knot nematode (FPKM = 7.92), a chorismate mutase (FPKM = 6.7), among other candidate effectors. Taken together, these results suggested that a convincing number of previously identified effectors were highly abundant in the gland-cell library and above the stringent threshold established herein.

Identification of new candidate effector genes for Pratylenchus penetrans

From the list of 230 transcripts, 68 candidates were selected for an in-depth analysis based on the mining results (Additional file 1: Table S1). The selected transcripts included those encoding 52 novel proteins (i.e., those with no annotation) and 16 genes with predicted functional annotation. Out of the 68 candidates tested, 30 transcripts were specifically localized in the nematode esophageal glands, as demonstrated by in situ hybridization assays with the corresponding anti-sense probes (Fig. 3). Hybridization with control probes of sense orientation revealed no signal (Additional file 3: Fig. S1). No signal (not to be confused with a conflicting signal) was detected in the nematode sections for the remaining genes, which suggests that additional candidate effectors could be added to this list following probe optimization.

Fig. 3
figure 3

Detection of gene transcripts encoding putative secreted proteins by in situ hybridization. Transcripts encoding thirty different genes were localized in the nematode esophageal glands of Pratylenchus penetrans using the corresponding anti-sense DIG-labeled probes. Due to the high variability of the esophageal gland size among different specimens and nematode stages, both dorsal and subventral glands were labelled as esophageal glands. Details regarding each gene annotation and description are presented in the same order as sorted in Table 1. g: esophageal glands; m: median bulb; s: stylet. Bars = 20 μm

To determine whether transcripts above the FPKM threshold that encode proteins without a predicted signal peptide were also specific to the esophageal glands, we performed in situ hybridization for the following four transcripts: a transcript encoding an ShK domain-like protein (FPKM = 213.3), a translationally-controlled tumor protein (TCTP, FPKM = 92.4), one novel protein (FPKM = 50.4) and a 14–3-3 protein (FPKM = 25.3). A positive labeling localized in the esophageal glands area was found for transcripts encoding the ShK domain-containing protein (Fig. 4a). Although the signal associated with transcripts encoding the 14–3-3 protein (Fig. 4b) was detectable in the glands, it was also observed along the intestine region of some nematodes (Fig. 4c). No signal was observed for the two remaining genes, which was also true for all control probes of sense orientation (Additional file 4: Fig. S2).

Fig. 4
figure 4

Detection of gene transcripts encoding putative proteins without signal peptide by in situ hybridization. (a) ShK domain-like protein, and (b-c) 14–3-3 protein. g: esophageal glands; i: intestine region; m: median bulb; s: stylet. Bars = 20 μm

Following these results, we also validated the presence of the CAA [A|G|T|C] TG [T|G] C motif previously identified in the promoter region of some P. penetrans effectors [14]. This motif was found in the promoter region of 12 genes (Additional file 5: Table S3), representing 40% of the total number of new candidate effectors.

Presence of orthologues in other plant-parasitic nematodes

A BLAST search conducted with all candidate effectors localized in the gland cells identified here against the draft genome assembly of P. penetrans of the same isolate (Vieira and Nemchinov, unpublished), confirmed their nematode origin (0 = e-value < 5.14E-114; Additional file 6: Table S4). Overall, the new candidates were predicted to encode proteins ranging in size from 76 to 806 amino acids (Table 1). All protein sequences were analyzed for the presence of conserved domains using Pfam database searches. Only four candidate effectors were predicted to contain known protein domains: a papain family cysteine protease (PF00112), a pepsin inhibitor-3-like repeated domain (PF06394.1), an astacin (PF01400.21), and a domain of unknown function DUF148 (PF02520.14). For all the remaining candidates no other known functional domains were found. Previously, we have found that several pioneer effectors of P. penetrans contained a high number of proline residues. As most of the effectors found here were also novel, we quantified the levels of proline, cysteine and glycine residues. Twelve of the new candidate effectors were rich in proline, while others presented a high content of glycine residues (> 10% of the full protein sequence; Table 1).

Table 1 Characterization of candidate gene effector genes specifically localized in the esophageal glands of Pratylenchus penetrans. The list is sorted in agreement to their respective fragment per kilobase of transcript per million mapped reads (FPKM) values presented in Additional file 1: Table S1

The identification of a more complete roster of candidate effectors in P. penetrans, coupled with available genome and transcriptome sequences for many phylogenetically well-positioned species, provides an opportunity to investigate the evolutionary history of the effector repertoire. We reconstructed a robust multi-gene phylogenetic tree, based on 86 CEGMA (Core Eukaryotic Genes Mapping Approach) genes highly conserved in the following species: root lesion nematodes Pratylenchus coffeae [23,24,25], P. neglectus, and P. thornei (PRJNA512537 [26];), and the burrowing nematode Radopholus similis [27]; root-knot nematodes Meloidogyne incognita [28] and M. hapla [29]; cyst nematodes Globodera pallida [30], G. rostochiensis [11], and Heterodera glycines [31]; the false root-knot nematode Nacobbus aberrans [32], the reniform nematode Rotylenchulus reniformis [33], the migratory potato rot nematode Ditylenchus destructor [34]; the pinewood nematode Bursaphelenchus xylophilus [35]; and the free-living nematode Caenorhabditis elegans ( We then searched for the presence of putative homologs of the entire P. penetrans putative effector repertoire validated by in situ hybridization (n = 53) in all these species using BLAST (e-values < 1 × 10− 5 and coverage > 50, Fig. 5).

Fig. 5
figure 5

Comparative analyses of the full set of candidate effector genes of Pratylenchus penetrans suggest a large-scale effector birth for this species. Top panel corresponds to a schematic phylogeny of the phylum Nematoda based on 86 highly conserved CEGMA genes among plant-parasitic nematodes with different parasitism strategies and the free-living nematode Caenorhabditis elegans (Ce). Shaded squares in the lower panel indicate significant blast hits of each nematode species against the set of effectors identified for P. penetrans. The new candidate effectors identified in this study are represented in bold (n = 13) while the remaining 17 genes are represented within the set of 22 effectors found specifically for P. penetrans (marked with an asterisk). n, represents the putative number of gene births in each nematode speciation cluster. Pp: Pratylenchus penetrans; Pc: Pratylenchus coffeae; Pn: Pratylenchus neglectus; Pt: Pratylenchus thornei; Mi: Meloidogyne incognita; Mh: Meloidogyne hapla; Na: Nacobbus aberrans; Rs: Radopholus similis; Rr: Rotylenchulus reniformis; Hg: Heterodera glycines; Gr: Globodera rostochiensis; Gp: Globodera pallida; Dd: Ditylenchus destructor; Bx: Bursaphelenchus xylophilus; Ce: Caenorhabditis elegans. RKN: root-knot nematodes; CN: cyst nematodes

Two noteworthy trends were apparent: 1) large scale effector birth in the Pratylenchidae in general and P. penetrans in particular, and 2) large scale effector death in root-knot nematodes (in some cases convergent loss with cyst nematodes). As evidence of the former, and despite the inclusion of three other Pratylenchus species in the analysis and the application of a relatively relaxed similarity threshold, 41% of the putative effectors (and 65% of the pioneer effectors) were unique to P. penetrans (c.f. 0.22% of all transcripts and 0.29% of all pioneer transcripts). The most parsimonious explanation for this observation is a dramatic de novo effector gene birth (i.e. not by neofunctionalization) since the divergence from the last common ancestor in the phylogeny. Only three putative effectors were common to both P. penetrans and its three most closely related Pratylenchidae species included in the analysis (P. coffeae, P. neglectus and P. thornei). Five putative effectors were common to three of the four Pratylenchus species, possibly indicative of disparate gene loss in the Pratylenchidae and in root-knot nematodes (given the monophyletic relationship with Pratylenchidae). Notably, two pioneer effectors were generally common to root lesion nematodes, the false root-knot nematode, the burrowing nematode, and the reniform nematode, but absent in sedentary endoparasitic nematodes, such as root-knot and cyst nematodes. The simplest possible explanation could be an ancient origin of the gene, in the last common ancestor of these species and subsequent convergent loss in sedentary endoparasitic root-knot and cyst nematodes. A final pioneer gene appears to have a similarly ancient origin with disparate loss. Several other putative effectors have homologues distributed across the tree, many of which have known annotations, and together likely represent evidence for neofunctionalization (when not putatively originated from horizontal gene transference events as, for example, different families of CWDEs).

Expression of selected genes during nematode-plant interaction

To validate the results of in silico analysis, we examined the expression of 15 genes in soybean hairy roots at 1, 3, 7 and 12 days after nematode inoculation (DAI). All of the selected genes could be detected within the nematode-infected roots and their expression varied (Fig. 6a) in accordance with the patterns observed during in silico analysis. To get a broader insight into the expression of all candidate effectors validated so far in P. penetrans, we took advantage of the transcriptome data previously generated from different plants, i.e., two different cultivars of alfalfa and soybean, infected with the same isolate (Fig. 6b). Heat maps that were used to visualize the expression profile confirmed the transcription of a large set of effectors during nematode interactions with the roots. A core set of effectors was prominently induced (FPKM > 100) during infection independently of the host genotype (e.g., a catalase, an expansin-like gene, an endoglucanase, and several pioneer genes). These results suggested that a complex network of effectors is actively transcribed in the course of infection.

Fig. 6
figure 6

Expression profile of Pratylenchus penetrans candidate effector genes in planta. a Semi-quantitative reverse transcription PCR validating the expression levels of 15 gland-cell localized nematode genes on soybean hairy roots at 1, 3, 7 and 12 days after nematode infection. As a positive control, all cDNA libraries were amplified with primers derived from the 18S gene of P. penetrans or the Ubiquitin-3 (UBQ-3) gene of Glycines max. C corresponds to non-infected hairy roots. b Heat maps representing the expression profile of the full set of 53 candidate effectors identified so far for P. penetrans using the fragment per kilobase of transcript per million mapped reads (FPKM) values of the current gland-cell library (Glands), and public messenger RNA-sequencing datasets originating from total RNA extracted from soybean (BioProject ID PRJNA304159 [7]) and alfalfa plants (cultivars Baker and MNGNR-16; BioProject ID PRJNA547347 [13]) infected with the same P. penetrans isolate at 7 DAI. The new candidate effectors identified in this study are represented in bold


In this work, we conducted a comprehensive screening of potential effector proteins of the agriculturally important nematode species P. penetrans. This was accomplished using a tissue-specific transcriptome sequencing approach to identify transcripts abundantly expressed in the esophageal gland cells and encoding proteins likely secreted during nematode parasitism. A substantial number of reads were generated and an extensive catalogue of transcripts specific to the gland cells compiled. Comparative analyses of the gland cell transcriptome confirmed most of the previously described cases of gland-cell expression and, most importantly, drastically expanded the size of the roster of gland-cell expressed genes to double its previous value. Importantly, this is still likely an underestimate for P. penetrans due to the stringent cut-off used: other candidate effectors with lower levels of expression and established roles in parasitism were also present in this library.

Of the new candidates identified, only four genes had a predicted functional domain or known annotation. Among them, transcripts encoding a second member of the SXP/RAL-2 family were validated to be gland-specific. Members of this gene family are characterized by the presence of the DUF148 protein domain and can be found across different clades of nematodes [36]. In PPNs, SXP/RAL-2 transcripts showed distinct localization in the subventral esophageal glands of the root-knot nematode M. incognita [37] or in the epidermis and amphidial sheet cells of the cyst nematode G. rostochiensis [38]. The biological functions of the different members of this family are still lacking. However, their high abundance in the glands and active transcription during plant infection provides additional insight regarding their significance during nematode parasitism. In addition, two proteases and one protease inhibitor-like protein were validated specifically in the glands: M12 astacin and a papain cysteine protease and pepsin inhibitor-3-like domain-containing protein, respectively. To date, several studies have evidenced the role of different classes of proteases in other plant-pathogenic interactions [39]. In M. incognita, expression of cysteine and aspartyl proteases, potentially involved in softening of plant cell walls during migration, pre-digestion of nutrients or inactivation of plant defense proteins, was confirmed in the esophageal glands [40, 41]. In animal-parasitic nematodes, different classes of proteases and protease inhibitors have been identified among the main components of excretory/secretory products of different species [42]. For example, M12 astacins, which are involved in skin penetration and migration [43], are widely distributed in nematodes belonging to the clade IV. The secretion of these proteases has been linked to diverse functions, including host tissue penetration, modification of the host environment, destruction of plant defense proteins and digestion [43]. Protease inhibitors on the other hand may be involved in the protection against degradation by host proteases or manipulation of the host responses. The identification of new gland-specific proteases and protease inhibitors in P. penetrans, and other PPNs, supports the idea that these proteins may have an active role during plant parasitism.

The lack of recognizable Pfam domains is a hallmark of PPN effectors. The fact that a large proportion of transcripts found in this work belongs to novel genes (> 80%) highlights the uniqueness of P. penetrans. To advance our understanding of these pioneer effectors of P. penetrans, we adopted a phylogenetic approach and compared the expanded effector repertoire of P. penetrans to the genome/transcriptome of other nematode species with similar or contrasting parasitism strategies. With the caveat that the absence of evidence is not always evidence of absence, the consistency of the patterns of presence and absence allowed us to inferred plausible evolutionary histories that shaped the effector repertoires of not just P. penetrans, but also other species in the phylogeny. For example, the dominant majority of pioneer effectors were restricted to P. penetrans, thus, indicating substantial de novo gene gain (i.e., not neofunctionalization of the existing genetic pool). Whether this dramatic gene gain is linked to the broad host range of this species remains to be determined. At the opposite end of the spectrum, many non-pioneer effectors (i.e. those with recognizable domains) have homologues across the phylogeny, in some cases even in non-parasitic species. In these cases, it is likely that neofunctionalization has occurred to give rise to new effector genes in P. penetrans. However, with the currently available information does not yet allow these events to be dated. The instances between gene gain and functional diversification are similarly informative. Given the monophyly of root-knot nematodes (Meloidogynidae) with Pratylenchidae, effector genes that are generally conserved across the Pratylenchidae but absent in root-knot nematodes, can be indicative of secondary loss. These cases (five discovered in this study) are part of a larger trend of effector gene loss in root-knot nematodes, including those originated before the split from cyst nematodes (Heteroderidae). One particularly interesting example (Ppen16480_c0_seq1) appears to be a pioneer gene that is present only in species with migratory lifestyles (root lesion, burrowing and the false root-knot nematodes). Given its wide distribution among these species, the most parsimonious explanation could be an ancient origin and convergent loss in sedentary endoparasites. Although some effector candidates, for instance those similar to the arabinogalactan (GH53), have comparable distributions in other PPNs, the same conclusions can unlikely be drawn because their spread could be equally well explained by multiple horizontal transfer events.

The stringent cut-off criteria used in this work while enriching for likely secreted candidate effectors, could also provide a hint on transcripts specific to the gland cell machinery (e.g. chaperones responsible for packaging effector proteins). Therefore, transcripts encoding proteins without a canonical signal peptide should not be ignored. Moreover, there is evidence that some effectors lacking a conventional signal peptide, for instance: Mi-14-3-3 [44], Mi-GST-1 [45] and MiPFN3 [46], play important roles in parasitism of the root-knot nematode M. incognita. The cyst nematode effector HsIPT, which also lacks a signal peptide, is involved in the activation of the host cell cycle of the syncytium cells [47]. Transcripts encoding proteins without a canonical signal peptide, such as ShK domain-like proteins, were found in this work to be highly abundant and localized specifically in the nematode esophageal glands. ShK-domain containing proteins secreted by animal-parasitic nematodes have a role in immuno-modulatory activity [48]. Although the mechanisms of protein secretion into the plant cell in the absence of a genuine signal peptide are still unknown, it cannot be ruled out, at least for P. penetrans, that some of these proteins may be part of the molecular machinery of the esophageal glands rather than effectors per se. However, effector proteins of PPNs are often identified in the secretory granules within the esophageal glands [49, 50]. It remains unknown if the contents of such vesicles can be directly released into the host cells as a complex network of proteins. Validation of secretion of such types of proteins by nematodes into the host using, for example, effector-specific antibodies [41, 51, 52], will provide crucial information regarding their molecular involvement with the host.

Since P. penetrans is one of the most successful species in the genus and is able to infect a large number of plants, high diversity of its effectors may potentially create a strong molecular basis for this broad host range. One of the common features of host infection by RLNs is the massive damage induced in the root cortex as a consequence of migration and feeding activity by these nematodes. Despite a severe impact on root tissues, plants seem to be unable to mount strong defenses (e.g., cell death) to block the progression of nematodes [7]. The diversity of effectors highly abundant in the glands is likely to be related to the wide range of molecular functions that are required for penetration and invasion of host roots, detoxification, suppression of host defenses and many other unidentified functions. The RNA-seq libraries previously generated from nematode-infected roots served to support differential gene expression of all candidate effectors validated so far for P. penetrans. Plant signals may be required to trigger specific nematode responses, including secretion of effector proteins. For example, regulation of different CWDE genes of P. coffeae is host-specific [25]. Host-dependent response could also explain differential expression profiles observed for the set of P. penetrans effectors. Nevertheless, there is a possibility that these differences may result from the limited number of nematode reads recovered from nematode-infected roots. Since P. penetrans is a polyphagous species, comparative analyses of gland transcriptomes originated from nematodes parasitizing on different host plants could support the dynamics of effector expression relative to the host-specific responses. It may be that to define a truly comprehensive repertoire of effectors for a polyphagous nematode species, like P. penetrans, would require gland cell sequencing of nematodes collected from a variety of hosts.


Altogether, our data demonstrate that P. penetrans deploys a novel repertoire of effectors during its interaction with the host. Notable, 26 out of the 30 (86%) new candidate effectors identified in this study represent genes without known domains, which demonstrates the lack of knowledge about this group of nematodes and warrants further investigation. Future efforts should focus on the identification and characterization of the host targets of these effectors to determine their biological roles and provide crucial information for application of genetic engineering strategies for the control of PPNs. Furthermore, the results reported in this study may also contribute towards basic understanding of the adaptation of P. penetrans, and other RLNs, to specific host plants.


Nematode isolate

Pratylenchus penetrans isolate (NL 10p RH) collected in Beltsville (Maryland, U.S.) was maintained under sterile conditions in soybean hairy roots (Glycine max) growing on agar plates with Murashige and Skoog (MS) medium. Nematodes were transferred every 2 months to new soybean hairy roots and kept in the dark at 25 °C.

Gland-cell cDNA library of Pratylenchus penetrans

Mixed nematode life stages were collected using the Baermann funnel technique from hairy roots in vitro stocks and washed in PBS buffer. Pratylenchus penetrans gland cells were collected from a pool of 100 mixed dorsal and subventral gland cells as previously described [10]. Gland cells RNA was extracted using the Arcturus PicoPure RNA isolation kit (Arcturus Bioscience). Total RNA (≤ 1 ng) was used directly as input for the SMARTer Stranded Total RNA-Seq kit - Pico Input Mammalian (Clontech). The P. penetrans RNA-seq library was then submitted for sequencing to Admera Health (South Plainsfield, NJ) and sequenced by Illumina NextSeq 500. The paired-end library totaled 147,906,800 raw reads and the quality of the library was assessed with the program FASTQC ( The vast majority of the reads were greater than 144 bp in length. The GC content averaged 47%. The median quality value ranged from 32 to 40 over the length of the entire read with an overall per average sequence quality of 40. The reads were also free of adapter content.

Gene expression analyses

Illumina RNA-seq reads of the esophageal gland cells were initially trimmed and mapped to the 23,715 transcripts generated for the same P. penetrans isolate [13] using CLC Genomics v. 8 with default parameters. Gene expression patterns were deduced from the aligned reads and determined as Fragments Per Kilobase of Transcript per Million mapped reads (FPKM) values. The expression data were partitioned into expression bins and enrichment in secreted proteins for each expression bins was assessed using a hypergeometric test with the phyp-function available in R (v3.5.2). The FDR (False Discovery Rate) was calculated according to [53] with the p-adjust function available in R. The expression bins were considered as significantly enriched in secreted protein when FDR < 0.001.

BLASTp searches were carried out against the NR database at NCBI (e-value cutoff of 1e− 5 and bitscore > 50). Interpro were performed for the predicted proteins of the set of 230 transcripts using BLAST2GO [54] with default parameters. PFAM domain searches were performed using the Pfam dataset Pfam-A v32 obtained at [55] and run through CLC Main Workbench v.7. SIGNALP v. 4.0 was used to confirm the presence/absence of protein signal peptide in the genome of the predicted proteins [56], and transmembrane domains were predicted using TMHMM server version 2.0 ( Cysteine, glycine and proline contents were calculated for each predicted mature protein with CLC Main Workbench v.7.

In situ hybridization assays

Total RNA was extracted from a pool of mixed stages of P. penetrans using the RNeasy Plant Mini kit (QIAGEN) according to the manufacturer’s instructions. RNA was treated with RNase-free DNase (QIAGEN) before reverse transcription. The quantity and quality of the extracted RNA was assessed by a ND-1000 NanoDrop spectrophotometer (Thermo Scientific) and cDNA was synthesized using the iScript first-strand synthesis kit (Bio-Rad) following the manufacturer’s instructions. Whole mount in situ hybridizations were performed in all stages of P. penetrans following the protocol of [57]. Specific primers were designed to amplify a range of gene products (104 to 305 bp) using the cDNA library produced from the mix pool of P. penetrans stages (Additional file 7: Table S5). The resulting PCR products were then used as a template for generation of sense and antisense DIG-labeled probes using a DIG-nucleotide labeling kit (Roche). Hybridized probes within the nematode tissues were detected using an anti-DIG antibody conjugated to alkaline phosphatase and its substrate. Nematode segments were observed using a Nikon Eclipse 5i light microscope.

Promoter analyses

To identify the non-coding promoter motif previously determined for several effector of P. penetrans [14], approximately 600 nucleotides of the 5′ sequence from the start codon were manually extracted based on BLASTn coordinates against the draft genome of the same nematode isolate (Vieira and Nemchinov, unpublished). These promoter regions were screened for the presence of the motif consensus CAA [A|G|T|C] TG [T|G]C.

BLAST hit analyses against other nematode genomes/transcriptomes

Focusing on a subset of candidate effectors with verified esophageal gland-cell expression in P. penetrans, additional in silico analyses were performed. Open reading frames were used to perform BLASTn searches (e-value >1e− 10) against the draft genome of the same P. penetrans isolate (Vieira and Nemchinov, unpublished). The top hit sequences were manually examined and each transcript sequence was aligned to the respective genomic scaffold using MUSCLE [58]. Genomic and transcript sequences were submitted to FGENESH ( for gene and protein prediction using Caenorhabditis elegans as model [59].

Transcripts highly represented in the gland cell library were additionally compared using BLASTx and tBLASTx (e-value cutoff of 1e− 5 and bitscore > 50) to a set of transcriptomes and genomes of PPNs publicly available at NCBI and Wormbase ( This set comprised sequenced genomes of PPNs from different clades [60]: 1) Clade 12B: sedentary species such as root-knot nematodes Meloidogyne incognita [28] and M. hapla [29]; cyst nematodes Globodera pallida [30], G. rostochiensis [11], and Heterodera glycines [31]; the false root-knot nematode Nacobbus aberrans [32], the reniform nematode Rotylenchulus reniformis [33], and the draft genome of the burrowing nematode Radopholus similis [27]; 2) Clade 12A: the migratory potato rot nematode Ditylenchus destructor [34]; 3) Clade 10: the pinewood nematode Bursaphelenchus xylophilus [35]; and 4) Clade 9A: the free-living nematode C. elegans ( Local tBLASTn searches were performed against the transcriptomes of additional Pratylenchidae species (Clade 12B), such as P. coffeae [23,24,25], P. neglectus and P. thornei. For the last two species de novo assemblies were generated using the corresponding raw reads deposited at NCBI Sequence Read Archive (SRA) BioProject PRJNA512537 [26]. Eighty-six CEGMA genes conserved in the genome and or transcriptome resources of all 15 nematodes species described above were used for phylogenetic reconstruction (Additional file 7: Table S6). Protein sequences of individual CEGMA genes were aligned and refined using MUSCLE [58]. Alignments were concatenated and model selection for each partition was carried out using the IQtree server. A concatenated multi-gene phylogeny was generated using the ultra-fast mode and 1000 bootstraps [61].

Differential expression of Pratylenchus penetrans candidate effectors in planta

Semi-quantitative RT-PCR analyses were performed using total RNA extracted from nematode-infected soybean hairy roots at different time points following the same methodology as described by [15]. The same set of primers selected for in situ hybridization were used for nematode transcript amplification, while the following genes were used as references: Ubiquitin-3 (Ubi3) for soybean hairy roots and 18S for P. penetrans (Additional file 7: Table S5). The abundance of the full set of candidate effectors were estimated as FPKM values and presented as heatmaps using previously generated transcriptome data for P. penetrans-infected plants, i.e. soybean (SRA BioProject PRJNA304159 [13]) and two alfalfa (Medicago sativa L.) cultivars (Baker and MNGNR-16, SRA BioProject PRJNA547347 [7]).

Availability of data and materials

Raw RNAseq reads generated in this publication are available at National Center for Biotechnology Information (NCBI), under the SRA accession number PRJNA668165.



cell wall-degrading enzymes


Days after inoculation


Fragments per Kilobase per Million Mapped Reads


Non-redundant database




Plant-parasitic nematodes


root lesion nematodes


Sequencing of RNA


  1. Fosu-Nyarko J, Jones MGK. Advances in understanding the molecular mechanisms of root lesion nematode host interactions. Annu Rev Phytopathol. 2016;54:253–78.

    Article  CAS  PubMed  Google Scholar 

  2. Orlando V, Grove IG, Edwards SG, Prior T, Roberts D, Neilson R, Back M. Root-lesion nematode of potato: current status of diagnostics, pathogenicity and management. Plant Pathol. 2020;69:405–17.

    Article  Google Scholar 

  3. Jones JT, Haegeman A, Danchin EGJ, Gaur HS, Helder J, Jones MGK, Kikuchi T, Manzanilla-López R, Palomares-Rius JE, Wesemael WML, Perry RN. Top 10 plant-parasitic nematodes in molecular plant pathology. Mol Plant Pathol. 2013;14:946–61.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Castillo P, Vovlas N, editors. Pratylenchus (Nematoda: Pratylenchidae): Diagnosis, Biology, Pathogenicity and Management. Nematology Monographs and Perspectives, 6. 2007. Leiden: Brill.

  5. Zunke U. Observations on the invasion and endoparasitic behavior of the root lesion nematode Pratylenchus penetrans. J Nematol. 1990;22:309–20.

    CAS  PubMed  PubMed Central  Google Scholar 

  6. Vieira P, Mowery J, Kilcrease J, Eisenback JD, Kamo K. Characterization of Lilium longiflorum cv. ‘Nellie white’ infection with root lesion nematode Pratylenchus penetrans by bright-field and transmission electron microscopy. J Nemat. 2017;49:2–11.

    Article  CAS  Google Scholar 

  7. Vieira P, Mowery J, Eisenback JD, Shao J, Nemchinov LG. Cellular and transcriptional responses of resistant and susceptible cultivars of alfalfa to the root lesion nematode, Pratylenchus penetrans. Front Plant Sci. 2019;10:971.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Mitchum MG, Hussey RS, Baum TJ, Wang X, Elling AA, Wubben M, Davis EL. Nematode effector proteins: an emerging paradigm of parasitism. New Phytol. 2013;199:879–94.

    Article  PubMed  Google Scholar 

  9. Vieira P, Gleason C. Plant-parasitic nematode effectors - insights into their diversity and new tools for their identification. Curr Opin Plant Biol. 2019;50:37–43.

    Article  CAS  PubMed  Google Scholar 

  10. Maier TR, Hewezi T, Peng J, Baum TJ. Isolation of whole esophageal gland cells from plant-parasitic nematodes for transcriptome analyses and effector identification. Mol Plant-Microbe Interact. 2013;26:31–5.

    Article  CAS  PubMed  Google Scholar 

  11. Eves-van den Akker S, Laetsch DR, Thorpe P, Lilley CJ, Danchin EGJ, da Rocha M, Rancurel C, Holroyd NE, Cotton JA, Szitenberg A, Grenier E, Montarry J, Mimee B, Duceppe M, Boyes I, Marvin JMC, Jones LM, Yusup HB, Lafond-Lapalme J, Esquibet M, Sabeh M, Rott M, Overmars H, Finkers- Tomczak A, Smant G, Koutsovoulos G, Blok V, Mantelin S, Cock PJA, Phillips W, Henrissat B, Urwin PE, Blaxter M, Jones JT. The genome of the yellow potato cyst nematode, Globodera rostochiensis, revals insights into the basis of parasitism and virulence. Genome Biol. 2016;17:124.

  12. Espada M, Eves-van den Akker S, Maier T, Paramasivan V, Baum T, Mota M, Jones JT. STATAWAARS: a promoter motif associated with spatial expression in the major effector-producing tissues of the plant-parasitic nematode Bursaphelenchus xylophilus. BMC Genomics. 2018;19:553.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  13. Vieira P, Eves-van den Akker S, Verma R, Wantoch S, Eisenback JD, Kamo K. The Pratylenchus penetrans transcriptome as a source for the development of alternative control strategies: mining for putative genes involved in parasitism and evaluation of in planta RNAi. Plos One. 2015;10:e0144674.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  14. Vieira P, Maier T. Eves-van den Akker, Howe DK, Zasada I, Baum T, Eisenback JD, Kamo K. identification of candidate effector genes of Pratylenchus penetrans. Mol Plant Pathol. 2018;19:1887–907.

    Article  CAS  PubMed Central  Google Scholar 

  15. Vicente CSL, Nemchinov LG, Mota M, Eisenback JD, Kamo K, Vieira P. Identification and characterization of the first pectin methylesterase gene discovered in the root lesion nematode Pratylenchus penetrans. PLoS One. 2019;14:e0212540.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Vieira P, Nemchinov LG. A novel species of RNA virus associated with root lesion nematode Pratylenchus penetrans. J Gen Virol. 2019;100:704–8.

    Article  CAS  PubMed  Google Scholar 

  17. Espada M, Silva AC, Eves-van den Akker S, PJA C, Mota M, Jones JT. Identification and characterization of parasitism genes from the pinewood nematode Bursaphelenchus xylophilus reveals a multilayered detoxification strategy. Mol Plant Pathol. 2016;17:286–95.

    Article  CAS  PubMed  Google Scholar 

  18. Lin B, Zhuo K, Chen S, Hu L, Sun L, Wang X, Zhang L-H, Liao J. A novel nematode effector suppresses plant immunity by activating host reactive oxygen species-scavenging system. New Phytol. 2016;209:1159–73.

    Article  CAS  PubMed  Google Scholar 

  19. Habash SS, Sobczak M, Siddique S, Voigt B, Elashry A, Grundler FMW. Identification and characterization of a putative protein disulfide isomerase (HsPDI) as an alleged effector of Heterodera schachtii. Sci Rep. 2017;7:13536.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  20. Liu Y-K, Huang W, Long H, Peng H, He W, Peng D. Molecular characterization and functional analysis of a new acid phosphatase gene (Ha-acp1) from Heterodera avenae. J Integrat Agricul. 2014;13:1303–10.

    Article  CAS  Google Scholar 

  21. Hu L-J, Wu X-Q, Li H-Y, Zhao Q, Wang Y-C, Ye J-R. An effector, BxSapB1, induces cell death and contributes to virulence in the pine wood nematode Bursaphelenchus xylophilus. Mol Plant-Microbe Interact. 2019;32:452–63.

    Article  CAS  PubMed  Google Scholar 

  22. Chen C, Chen Y, Jian H, Yang D, Dai Y, Pan L, Shi F, Yang S, Liu Q. Large-scale identification and characterization of Heterodera avenae putative effectors suppressing or inducing cell death in Nicotiana benthamiana. Front Plant Sci. 2018;8:2062.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Haegeman A, Joseph S, Gheysen G. Analysis of the transcriptome of the root lesion nematode Pratylenchus coffeae generated by 454 sequencing technology. Mol Biochem Parasitol. 2011;178:7–14.

    Article  CAS  PubMed  Google Scholar 

  24. Burke M, Scholl EH, Bird DM, Schaff JE, Colman SD, Crowell R, Diener S, Gordon O, Graham S, Wang X, Windham E, Wright GM, Opperman CH. The plant parasite Pratylenchus coffeae carries a minimal nematode genome. Nematology. 2015;17:621–37.

    Article  CAS  Google Scholar 

  25. Bell CA, Lilley CJ, McCarthy J, Atkinson HJ, Urwin PE. Plant-parasitic nematodes respond to root exudate signals with host-specific gene expression patterns. PLoS Pathog. 2019;15:e1007503.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Okubara PA, Peetz AB, Sharpe RM. Cereal root interactions with soilborne pathogens – from trait to gene and back. Agronomy. 2019;9:188.

    Article  CAS  Google Scholar 

  27. Wram CL, Hesse CN, Wasala SK, Howe DK, Peetz AB, Denver DR, Humphreys-Pereira D, Zasada I. Genome announcement: the draft genomes of two Radopholus similis populations from Costa Rica. J Nematol. 2019;51:e2019–52.

    Article  PubMed Central  PubMed  Google Scholar 

  28. Abad P, Gouzy J, Aury J-M, Castagnone-Sereno P, Danchin EGJ, Deleury E, Perfus-Barbeoch L, Anthouard V, Artiguenave F, Blok VC, Caillaud M-C, Coutinho PM, Dasilva C, De Luca F, Deau F, Esquibet M, Flutre T, Goldstone JV, Hamamouch N, Hewezi T, Jaillon O, Jubin C, Leonetti P, Magliano M, Maier TR, Markov GV, McVeigh P, Pesole G, Poulain J, Robinson-Rechavi M, Sallet E, Segurens B, Steinbach D, Tytgat T, Ugarte E, van Ghelder C, Veronico P, Baum TJ, Blaxter M, Bleve-Zacheo T, Davis EL, Ewbank JJ, Favery B, Grenier E, Henrissat B, Jones JT, Laudet V, Maule AG, Quesneville H, Rosso M-N, Schiex T, Smant G, Weissenbach J, Wincker P. Genome sequence of the metazoan plant-parasitic nematode Meloidogyne incognita. Nat Biotech. 2008;26:909–15.

  29. Opperman CH, Bird DM, Williamson VM, Rokhsar DS, Burke M, Cohn J, Cromer J, Diener S, Gajan J, Graham S, Houfek TD, Liu Q, Mitros T, Schaff J, Schaffer R, Scholl E, Sosinski BR, Thomas VP, Windham E. (2008). Sequence and genetic map of Meloidogyne hapla: a compact nematode genome for plant parasitism. Proc Natl Acad Sci U S A. 2008;105:14802–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Cotton JA, Lilley CJ, Jones LM, Kikuchi T, Reid AJ, Thorpe P, Tsai IJ, Beasley H, Blok V, Cock PJA, Eves-van den Akker S, Holroyd N, Hunt M, Mantelin S, Naghra H, Pain A, Palomares-Rius JE, Zarowiecki M, Berriman M, Jones JT, Urwin PE. The genome and life-stage specific transcriptomes of Globodera pallida elucidate key aspects of plant parasitism by a cyst nematode. Genome Biol. 2014;15:R43.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  31. Masonbrink R, Maier TR, Muppirala U, Seetharam AS, Lord E, Juvale PS, Schmutz J, Johnson NT, Korkin D, Mitchum MG, Mimee B, Eves-van den Akker S, Hudson M, Severin AJ, Baum TJ. The genome of the soybean nematode (Heterodera glycines) reveals complex patterns of duplications involved in the evolution of parasitism genes. BMC Genomics. 2019;20:119.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Eves-van den Akker S, Lilley CJ, EGJ D, Rancurel C, PJA C, Urwin PE, Jones JT. The transcriptome of Nacobbus aberrans reveals insights into the evolution of sedentary endoparasitism in plant-parasitic nematodes. Genome Biol Evol. 2014;6:2181–94.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Eves-van den Akker S, Lilley CJ, Yusup HB, Jones JT, Urwin PE. Functional C-terminally encoded plant peptide (CEP) hormone domains evolved de novo in the plant parasite Rotylenchulus reniformis. Mol Plant Pathol. 2016;17:1265–75.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Zheng J, Peng D, Chen L, Liu H, Chen F, Xu M, Ju S, Ruan L, Sun M. The Ditylenchus destructor genome provides new insights into the evolution of plant parasitic nematodes. Proc Biol Soc. 2016;283:20160942.

    Google Scholar 

  35. Kikuchi T, Cotton JA, Dalzell JT, Hasegawa K, Kanzaki N, McVeigh P, Takanashi T, Tsai IJ, Assefa SA, Cock PJA, Dan Otto T, Hunt M, Reid AJ, Sanchez-Flores A, Tsuchihara K, Yokoi T, Larsson MC, Miwa J, Maule AG, Sahashi N, Jones JT, Berriman M. Genomic insights into the origin of parasitism in the emerging plant pathogen Bursaphelenchus xylophilus. PLoS Pathog. 2011;7:e1002219.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Rao KV, Eswaran M, Ravi V, Gnanasekhar B, Narayanan RB, Kaliraj P, Jayaman K, Marson A, Raghavan N, Scott AL. The Wuchereria bancrofti orthologue of Brugia malayi SXP1 and the diagnosis of bancroftian filariasis. Mol Biochem Parasitol. 2000;107:71–80.

    Article  CAS  PubMed  Google Scholar 

  37. Tytgat T, Vercauteren I, Vanholme B, de Meutter J, Vanhoutte I, Gheysen G, Borgonie G, Coomans A, Gheysen G. An SXP/RAL-2 protein produced by the subventral pharyngeal glands in the plant parasitic root-knot nematode Meloidogyne incognita. Parasitol Res. 2005;95:50–4.

    Article  PubMed  Google Scholar 

  38. Jones JT, Smant G, Blok VC. SXP/RAL-2 proteins of the potato cyst nematode Globodera rostochiensis: secreted proteins of the hypodermis and amphids. Nematology. 2000;2:887–93.

    Article  CAS  Google Scholar 

  39. Cheng Z, Li J-F, Niu Y, Zhang X-C, Woody OZ, Xiong Y, Djonovic S, Millet Y, Bush J, McConkey BJ, Sheen J, Ausubel FM. Pathogen-secreted proteases activate a novel plant immune pathway. Nature. 2015;521:213–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Bellafiore S, Shen Z, Rosso M-N, Abad P, Shih P, Briggs SP. Direct identification of the Meloidogyne incognita secretome reveals proteins with host cell reprogramming potential. PLoS Pathog. 2008;4:e1000192.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  41. Vieira P, Danchin EGJ, Neveu C, Crozat C, Jaubert S, Hussey RS, Engler G, Abda P, Almeida-Engler J, Castagnone-Sereno P, Rosso M-N. The plant apoplasm is an important recipient compartment for nematode secreted proteins. J Exp Bot. 2011;62:1241–53.

    Article  CAS  PubMed  Google Scholar 

  42. Abuzeid AMI, Zhou X, Huang Y, Li G. Twenty-five-year research progress in hookworm excretory-secretory products. Parasites Vectors. 2020;13:136.

    Article  PubMed  PubMed Central  Google Scholar 

  43. Hunt VL, Tsai IJ, Coghlan A, Reid AJ, Holroyd N, Foth BJ, Tracey A, Cotton JA, Stanley EJ, Beasley H, Bennett HM, Brooks K, Harsha B, Kajitanu R, Kulkarni A, Harbecke D, Nagayasu E, Nichol S, Ogura Y, Quail MA, Randle N, Xia D, Brattig NW, Soblik H, Ribeiro DM, Sanchez-Flores A, Hayashi T, Itoh T, Denver DR, Grant W, Stoltzfus JD, Lok JB, Murayama H, Wastling J, Streit A, Kikuchi T, Viney M, Berriman M. The genomic basis of parasitism in the Strongyloides clade of nematodes. Nat Genet. 2016;48:299–307.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Jaubert S, Ledger TN, Laffaire JB, Piotte C, Abad P, Rosso M-N. Direct identification of stylet secreted proteins from root-knot nematodes by a proteomic approach. Mol Biochem Parasitol. 2002;121:205–11.

    Article  CAS  PubMed  Google Scholar 

  45. Dubreuil G, Magliano M, Deleury E, Abad P, Rosso M-N. Transcriptome analysis of root-knot nematode functions induced in the early stages of parasitism. New Phytol. 2007;176:426–36.

    Article  CAS  PubMed  Google Scholar 

  46. Leelarasamee N, Zhang L, Gleason C. The root-knot nematode effector MiPFN3 disrupts plant actin filaments and promotes parasitism. PLoS Pathog. 2018;14:e1006947.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  47. Siddique S, Radakovic ZS, de la Torre C, Chronis D, Novák O, Ramireddy E, Holbein J, Matera C, Hutten M, Gutbrod P, Anjam MS, Rozanska E, Habash S, Elashry A, Sobczak M, Kakimoto T, Strnad M, Schmulling T, Mitchum MG, FMW G. A parasitic nematode releases cytokinin that controls cell division and orchestrate feeding site formation in host plants. Proc Natl Acad Sci U S A. 2015;112:12669–74.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Chhabra S, Chang SC, Nguyen HM, Huq R, Tanner MR, Londono LM, Estrada R, Dhawan V, Chauhan S, Upadhyay SK, Gindin M, Hotez PJ, Valenzeula JG, Mohanty B, Swarbrick JD, Wulff H, Iadonat SP, Gutman GA, Beeton C, Pennington MW, Norton RS, Chnady KG. Kv1.3 channel-blocking immunomodulatory peptides from parasitic worms: implications for autoimmune diseases. FASEB. 2014;28:3952–64.

    Article  CAS  Google Scholar 

  49. Hussey RS, Mims CW. Ultrastructure of esophageal glands and their secretory granules in the root-knot nematode Meloidogyne incognita. Protoplasma. 1990;156:9–18.

    Article  Google Scholar 

  50. Vieira P, Banora MY, Castagnone-Sereno P, Rosso M-N, Engler G, de Almeida Engler J. An immunocytochemical procedure for protein localization in nematode pre-parasitic and parasitic stages using methylacrylate-embedded tissues. Phytopathology. 2012;102:990–6.

    Article  CAS  PubMed  Google Scholar 

  51. Eves-van den Akker S, Lilley CJ, Jones JT, Urwin PE. Identification and characterisation of a hyper-variable apoplastic effector gene family of the potato cyst nematodes. PLoS Path. 2014;10:e1004391.

    Article  CAS  Google Scholar 

  52. Lilley CJ, Maqbool A, Wu D, Yusup HB, Jones LM, Birch PRJ, Banfield MJ, Urwin PE, Eves-van den Akker S. Effector gene birth in plant parasitic nematodes: Neofunctionalization of a housekeeping glutathione synthetase gene. PLoS Genet. 2018;14:e1007310.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  53. Benjamini Y, Yekutieli D. The control of the false discovery rate in multiple testing under dependency. Ann Stat. 2001;29:1165–88.

    Article  Google Scholar 

  54. Conesa A, Gotz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21:3674–6.

    Article  CAS  PubMed  Google Scholar 

  55. El-Gebali S, Mistry J, Bateman A, Eddy SR, Luciani A, Potter SC, Quresghi M, Richardson LJ, Salazar GA, Smart A, Sonnhammer ELL, Hirsh L, Paladin L, Piovesan D, Tosatto SCE, Finn RD. The Pfam protein families database in 2019. Nucleic Acids Res. 2019;47:D427–32.

    Article  CAS  PubMed  Google Scholar 

  56. Petersen TN, Brunak S, von Heijne G, Nielsen H. SIGNALP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods. 2011;8:785–6.

    Article  CAS  PubMed  Google Scholar 

  57. de Boer JM, Yan Y, Smant G, Davis EL, Baum TJ. In-situ hybridization to messenger RNA in Heterodera glycines. J Nematol. 1998;30:309–12.

    PubMed  PubMed Central  Google Scholar 

  58. Edgar RC. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics. 2004;5:113.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  59. Solovyev V, Kosarev P, Seledsov I, Vorobyev D. Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biol. 2006;7:1–12.

    Article  PubMed  Google Scholar 

  60. van Megen H, van den Elsen S, Holterman M, Karssen G, Mooyman P, Bongers T, Holovachov O, Bakker J, Helder J. A phylogenetic tree of nematodes based on about 1200 full-length small subunit ribosomal DNA sequences. Nematol. 2009;11:927–50.

    Article  CAS  Google Scholar 

  61. Trifinopoulos J, Nguyen L-T, von Haeseler A, Minh BQ. W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 2016;44:W232–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


We thank Dr. Inga Zasada for providing the draft genome assembly of Radopholus similis.


This research was supported by the United States Department of Agriculture, Agricultural Research Service, CRIS project 8042–21000-300-00D. Work by TRM and TJB was supported by state of Iowa and Hatch funds. Work on plant-parasitic nematodes at the University of Cambridge is supported by DEFRA licence 125034/359149/3 and funded by BBSRC grants BB/R011311/1 and BB/S006397/1. CP has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement MSCA-IF-EF-ST-882941-NEMABOX.

Author information

Authors and Affiliations



PVa planned and performed research. TRM performed the preparation of nematodes and micro-aspiration of the gland cells. PVi isolated the RNA and constructed the gland cells cDNA library for sequencing. TJB obtained funding, oversaw and co-conceived the idea of gland cell extraction work. PVa, JS, CP and SE-vdA analyzed the data and performed bioinformatic analysis. LGN obtained funding, and contributed with materials and reagents. PVa, LGN and SE-vdA wrote the manuscript, with inputs from all other authors. All authors read and approved the manuscript for publication.

Corresponding author

Correspondence to Paulo Vieira.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

Summary of BLAST hit analyses of the top 230 most abundant transcripts of Pratylenchus penetrans encoding putative secreted proteins (FPKM > 8) originated from the esophageal gland cells library. BLAST analyses were performed against the non-redundant GenBank database. Transcript abundance was determined as Fragments Per Kilobase of Transcript per Million mapped reads (FPKM) values. (XLSX 30 kb)

Additional file 2: Table S2.

List of transcripts encoding putative effector proteins with FPKM values below the established cut-off (FPKM < 8) originated from the esophageal gland cells library. (XLSX 14 kb)

Additional file 3: Fig. S1.

In situ hybridization assays using the corresponding sense DIG-labeled probes (control) for 30 candidate effector genes detected within the esophageal glands of Pratylenchus penetrans. Details regarding each gene annotation and description are presented in the same order as sorted in Table 1. Bars = 20 μm. (TIFF 5250 kb)

Additional file 4: Fig. S2.

In situ hybridization assays using the corresponding sense DIG-labeled probes (control) for transcripts encoding: (a) ShK domain-like protein, and (b) a 14–3-3 protein. Bars = 20 μm. (TIFF 656 kb)

Additional file 5: Table S3.

List of promoter sequences extracted for 30 genes with gland cell localization in Pratylenchus penetrans. The 600 nt of the upstream region of the start codon was queried for the presence of the CAA[AG|T|C]TG[T|G] C motif previously found associated with gland cell expression of other candidate effectors of P. penetrans. (XLSX 17 kb)

Additional file 6: Table S4.

BLAST hit analyses of transcripts with gland cell localization against the draft genome of Pratylenchus penetrans. (XLSX 11 kb)

Additional file 7: Table S5.

List of primers used in this study. (XLSX 14 kb)

Additional file 8: Table S6.

List of CEGMA genes used in this study. (XLSX 693 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vieira, P., Shao, J., Vijayapalani, P. et al. A new esophageal gland transcriptome reveals signatures of large scale de novo effector birth in the root lesion nematode Pratylenchus penetrans. BMC Genomics 21, 738 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: