Paleo-evolutionary plasticity of plant disease resistance genes
© Zhang et al.; licensee BioMed Central Ltd. 2014
Received: 5 September 2013
Accepted: 25 February 2014
Published: 12 March 2014
The recent access to a large set of genome sequences, combined with a robust evolutionary scenario of modern monocot (i.e. grasses) and eudicot (i.e. rosids) species from their founder ancestors, offered the opportunity to gain insights into disease resistance genes (R-genes) evolutionary plasticity.
We unravel in the current article (i) a R-genes repertoire consisting in 7883 for monocots and 15758 for eudicots, (ii) a contrasted R-genes conservation with 23.8% for monocots and 6.6% for dicots, (iii) a minimal ancestral founder pool of 384 R-genes for the monocots and 150 R-genes for the eudicots, (iv) a general pattern of organization in clusters accounting for more than 60% of mapped R-genes, (v) a biased deletion of ancestral duplicated R-genes between paralogous blocks possibly compensated by clusterization, (vi) a bias in R-genes clusterization where Leucine-Rich Repeats act as a ‘glue’ for domain association, (vii) a R-genes/miRNAs interome enriched toward duplicated R-genes.
Together, our data may suggest that R-genes family plasticity operated during plant evolution (i) at the structural level through massive duplicates loss counterbalanced by massive clusterization following polyploidization; as well as at (ii) the regulation level through microRNA/R-gene interactions acting as a possible source of functional diploidization of structurally retained R-genes duplicates. Such evolutionary shuffling events leaded to CNVs (i.e. Copy Number Variation) and PAVs (i.e. Presence Absence Variation) between related species operating in the decay of R-genes colinearity between plant species.
Pathogen attacks from fungi , viruses , nematodes  or bacteria , compelled plants to prevent damages by engaging an “arms race” with these organisms. Therefore, plants have developed a battery of defense mechanisms involving (1) PTI (PAMP-Triggered Immunity) triggered by PAMP (Pathogen-Associated Molecular Patterns) [5–7] and (2) ETI (Effector-Triggered Immunity) triggered by effectors leading to hypersensitive response (referenced as HR ). Therefore, constant evolution leading to novel mechanisms is crucial for plant defense processes as well as adaptation to biotic stresses. The most studied disease resistance proteins encoding genes (hereafter R-genes) or genes involved in disease resistance pathways are Nucleotide-Binding-Sites (NBS) [9, 10], Leucine-Rich Repeats (LRR) [9, 10], Toll-Interleukine1 Receptors (TIR), WRKY transcription factors [11, 12], Lysine Motif (LysM) families [13, 14], and Protein Kinase families (hereafter referenced as PKinase) [15, 16]. R-genes can then be functionally classified into five distinct groups consisting in CNL (genes encoding proteins with coiled-coil, nucleotide binding site, leucine-rich repeat domains, i.e. CC-NBS-LRR), TNL (genes encoding proteins with Toll-interleukin receptor-like, nucleotide binding site, leucine-rich repeat domains, i.e. TIR-NBS-LRR), RLP (genes encoding proteins with receptor serine-threonine kinase like, extracellular leucine rich repeat domains, i.e. ser/thr-LRR), RLK (genes encoding proteins with kinase, extracellular leucine-rich repeat domains, i.e. Kin-LRR), and RGA (includes all other genes conferring resistance through different molecular mechanisms) classes . LRR-RLK, LRR, LysM, LysM-kinase act as pattern-recognition receptors (PRR) involved in the PTI pathway, while NBS-LRR commonly responds in the frame of the ETI pathway [18, 19]. Finally, WRKY and protein-kinases, associated with protein domains encoded by R-genes (hereafter R-domains), can also be activated by PRRs in disease resistance pathways [18–20].
R-genes have been reported to be ancient and conserved genes that have been detected in gymnosperms, plants and animals to ensure immunity [21–23]. However, comparative genomic analyses have shown that R-genes are associated with a great structural diversity in vertebrates and plants. For example, the presence of TIR domains in conifers and mosses indicated that TIR may represent an ancestral R-gene family with shared functionality with their mammalian or insect homologues regarding innate immunity [21, 22, 24–26]. TIR genes typically expanded in eudicot genomes, while they have been reported to be absent (or at least rare) in grass genomes [27–31]. Moreover, tandem and segmental duplications have been reported as a source of structural plasticity of NBS-LRR genes in plant genomes . Furthermore, PAV (Presence/Absence Variation) polymorphisms often exist in a population or between species [33–36]. Overall, small-scale studies (i.e. few R-genes families/domains and/or few plant species investigated) have suggested R-genes as one of the most plastic gene family in plants associated with intense structural shuffling in the course of evolution leading to synteny erosion or alternatively loss . For example, evolutionary investigations of R-genes in Arabidopsis and rice have been conducted suggesting contrasted amplification of TNL and CNL families as well as clusterization of NBS-LRRs via segmental and tandem duplications or ectopic gene conversions .
Few studies have investigated the conservation of R-genes across a large set of plant species and at the whole-genome level. Genome sequences from flowering plants that are derived from a common ancestor 135 to 250 million years ago (mya) are increasingly available in the public domain for evolutionary studies. Recent paleohistorical studies demonstrated that modern grass genomes, including Panicoideae (sorghum [Sorghum bicolor],  maize [Zea mays], ), Ehrhartoideae (rice [Oryza sativa], ), and Pooideae (Brachypodium distachyon; ), were shaped from n = 5 to 12 ancestral grass karyotypes (AGKs) containing a minimal set of 6045 ordered protogenes with a minimum physical size of 33 Mb [43–45] through whole-genome duplication (WGD) and ancestral chromosome fusion events. Likewise, the recent comparison of numerous eudicot genomes (i.e. mainly eurosids), including grape (Vitis vinifera; ), poplar , Arabidopsis thaliana, soybean Glycine max; , and cacao (Theobroma cacao; ), revealed that modern eudicot genomes derived from an n = 7 ancestor that went through a paleohexaploidization event to reach a n = 21 intermediate followed by numerous lineage-specific WGDs and chromosome fusion events [46, 51]. During the last 135 to 250 million years of evolution, the protein-coding gene families have been then shaped by various gene duplication mechanisms, including WGDs (or polyploidization), segmental duplications, and tandem duplications. It is now well established that all modern diploid plant species are highly shuffled paleopolyploids [52–55].
Duplication (WGDs, segmental duplications and tandem duplications) were proposed as the major mechanisms driving R-genes family expansion or contraction from their traceable ancestral copies [56–58]. However, a systematic and detailed study of the paleohistorical evolution of R-genes across plant subfamilies including rosids species (Arabidopsis thaliana, Grape , Apple , Poplar , Soybean , Lotus , Strawberry , Cacao  and Papaya ), and grasses (Rice , Maize , Sorghum bicolor and Brachypodium distachyon) is still lacking. Particularly, how R-genes have behaved following polyploidization events is not well established. Such a precise investigation of the paleohistory of R-genes during the last 250 million years of evolution will unravel precise mechanisms that lead to the reduced conservation of R-genes observed between modern plant species.
Disease resistance gene mapping, conservation and evolutionary patterns
R-genes catalog and conservation in plant genomes
R-genes data set
Oryza sativa (rice)
Zea mays (maize)
Vitis vinifera (grape)
Populus trichocarpa (poplar)
Glycine max (soybean)
Malus x domestica (apple)
Fragaria vesca (strawberry)
R-genes were classified into seven distinct groups, according to their specific encoded protein domains (R-domains). In monocots, we identified 2398 LRR, 1162 NBS, 27 TIR, 62 LysM, 384 WRKY, 5333 Protein-kinases, and 290 RG. In eudicots, 5345 LRR, 2291 NBS, 936 TIR, 122 LysM, 658 WRKY, 9910 Protein-kinases, and 517 RG were characterized. The distribution of the R-domain repertoire excluding Pkinases in the 13 plant species investigated is illustrated as Figure 1B with a color code that illuminates the six different R-domains (i.e. for a total of 7743 LRR, 3453 NBS, 963 TIR, 171 LysM, 1042 WRKY and 807 RG). Regarding the six different R-domains investigated, LRR and NBS are more abundant in the investigated plant genomes (Figure 1B). LRR and NBS consist in, on average, more than 50% and 20% of detected R-genes in both eudicots and monocots respectively. Few WRKY domains were detected in plants (~3.98% and 3.33% in monocot and eudicot species respectively). However, the number of TIR domains appeared much abundant in eudicots than the monocot species (Additional file 1: Table S2) as previously reported [27–31], which may indicated a specific amplification of such domain during rosids paleohistory. The distribution of R-gene families structured into PTI (consisting in LRR-RLK, LRR, LysM, and LysM kinase), ETI (consisting in NBS-LRR), other Pattern Recognition Receptors (PRRs) divided into R-domains combinations (including NBS, TIR, RG, NBS-Pkinase, NBS-WRKY, TIR-NBS, TIR-NBS-Pkinase, TIR-Pkinase, hereafter ‘R-combination’) and genes involved in disease resistance pathway (including WRKYs and protein-kinases, hereafter ‘R-pathway’) is available as Additional file 1: Figure S3. The observed distribution of R-gene families in the investigated species from the more abundant is R-pathway (13189) > PTI (5844) > R-combination (2525) > ETI (2070).
In order to reconstruct the ancestral R-genes repertoire in plants, we used the recently reconstructed ancestral monocot (5 protochromosomes) and eudicot (7 protochromosomes) karyotypes to investigate the R-genes evolutionary dynamic. In Figure 1C, the evolutionary scenario of the modern grass genomes deriving from a n = 5 ancestor is illustrated . Circular distributions illuminated the conservation rate of R-genes families excluding Pkinases (Figure 1C top) and their abundances within the different species (Figure 1C bottom). In the four grasses, the distribution of R-domains appeared very similar, except for RG, more abundant in Brachypodium compared to the other grasses investigated (Fisher Exact Test P-value = 4.10E-08, 1.46E-09 and 7.38E-08 in comparison to rice, sorghum, and maize; illustrated as red star in Figure 1C bottom). This phenomenon can simply be explained by the differences in RGA annotation and functional characterization efforts in the different species investigated. Finally, WRKY appeared less abundant in rice compared with the other grasses (Fisher Exact Test P-value = 3.41E-04, 2.70E-05, 9.47E-07 in comparison to rice Brachypodium, sorghum, and maize; illustrated as blue star in Figure 1C bottom). According to the reconstructed five protochromosomes, dating back to ~50-70 mya before the speciation of the four modern species investigated (containing 1622, 732, 845, and 832 R-genes excluding Pkinase domains in rice, Brachypodium, sorghum, and maize respectively), we were able to reconstruct a minimal founder (conserved) pool of 465 ancestral R-domains consisting in 361 LRR, 54 NBS, 6 TIR, 11 LySM, 42 WRKY, and 52 RG (Figure 1C top, Additional file 3: Dataset S2). Based on the same strategy, the evolutionary scenario of the eudicots has been used to unravel a minimal founder pool of 150 R-genes (Additional file 1: Figure S4). In order to understand in more details the evolution of the major PTI/ETI families, we have reconstructed their ancestral pools. The results suggests that PTI genes content in the ancestors are significantly higher than observed in each modern species (P-value = 6.33E-04, 3.98E-04, 3.37E-04 in grasses ancestor, rice-Brachypodium ancestor and sorghum-maize ancestor respectively, Additional file 1: Figure S5), while the ETI genes content is lower in ancestors compared to modern species (P-value = 1.12E-04, 5.03E-04, 3.97E-04 in grasses ancestor, rice-Brachypodium ancestor and sorghum-maize ancestor respectively). This result may suggest an opposite evolutionary trend between PTI and ETI families that are respectively lost and gained in the course of evolution.
R-genes plasticity in response to duplication events
Such observed absence in R-genes deletion partitioning in grasses between ancestral duplicated chromosomes may be due to R-gene clusters identified as more abundant in ancestral sensitive chromosomes compared to dominant chromosomal compartment (R2 = 0.64 with P-value = 4.73E-06 for sensitive chromosomes and R2 = 0.34 with P-value = 3.70E-03 for dominant chromosomes; Figure 2B and Additional file 1: Table S5). For example, between A2 (reported as dominant) and A4 (reported as sensitive), there is no cluster located in the A2 in contrast to seven genes in three clusters on A4, while between A1 (reported as dominant) and A5 (reported as sensitive) more clusters in the A5 (16 genes in five clusters) do not reverse or reduce the reported dominance of A1 (24 genes in ten clusters). In contrast, R-genes clusters in maize did not affected the observed bias retention of duplicated R-genes between paralogous fragments excluding for A5 (m6 vs m8), A12 (m3 vs m1/10), A2 (m4 vs m5) and A4 (m2 vs m10); Figure 2A (bottom), Additional file 1: Table S6. This result may indicate that the random deletion of R-genes after WGD, not following the known subgenome dominance rule for the ancestral tetraploidization, may be a consequence of the high plasticity of such gene family evolving particularly in local tandem duplications (also referenced as clusterization in the next section) that may have compensated the ancestral biased deletion of duplicates in known sensitive subgenomes in the course of evolution. However, for the recent WGD in maize dating back to 5 mya, tandem duplications or clusters can’t offset the dominance/sensitivity effect in such short period of time. Thus, our data led to the hypothesis that R-genes, identified as diploidization sensitive genes, may have followed the subgenome dominance hypothesis that was compensate in the course of history by a reshuffling return flow consisting in local tandem duplications, enriching sensitive genomic compartments in R-genes content.
R-genes plasticity viaclusterization and transposition mechanisms
While R-genes have been reported to be clustered in grass chromosomes  based on few species or loci investigated, a large-scale investigation of this phenomenon is still lacking in monocots and dicots. The structural definition of a R-gene cluster was considered following a previous study where linked (i.e. clustered) R-genes were not interrupted by more than eight non-R-genes . In Additional file 1: Table S7, we reported that there are about 69% and 63% R-genes on average organized in clusters in monocots and eudicots respectively, suggesting that R-genes families expanded by lineage-specific tandem duplications leading to duplicated gene copy variants associated with high sequence similarities. Surprisingly, in poplar for example, we detected only 32% of R-genes organized in clusters using the same strategy, then unraveling possible specific patterns of R-gene clusterization between species. The typology of the R-genes clusters differs between species then reinforcing the concept of a recent clusterization process with most (58% on average) of them consisting in clusters of two locally duplicated R-genes, especially in maize (72%), while the largest and rare cluster, made of six R-genes, was only observed in rice (Additional file 1: Figure S6).
The Figure 3B illustrates the retention of duplicated loci with 90% of sequence similarity in cacao, one locus with one R-gene (Tc06p002810 consisting TIR-NBS-LRR domains), and the duplicated locus harboring two R-genes (i.e. Tc03p00890 with a TIR domain and Tc03p00900 with NBS-LRR domains). We located precisely TSD (Target Site Duplication) motifs and long terminal reverse duplication, suggesting Tc06p002810 as the acceptor site (Figure 3B, Dotplot horizontal axis) and the duplicated region with two genes as the donor site (Figure 3B, Dotplot vertical axis). The acceptor site is characterized by a 7 kb fragmental deletion, which is located in the intergenic region of the donor site between Tc03p00890 and Tc03p00900 (with no transposable element or repeat detected in this particular fragment). The deletion of the intergenic fragment between the neighbor genes may have led to the read through of the ORFs leading the two neighbor genes fused into a single one (Tc06p002810) in the course of evolution. These paralogous regions from cacao may then suggest duplication as a major process resulting in domain shuffling (then reducing colinearity between species) between tandem duplicated R-genes.
Such R-gene structural plasticity may also be driven by TEs as we illustrated in the Figure 3C with two tandem duplicated R-genes with about 80% of sequence similarity from the rice genome, i.e. Os06g16300 (LRR domain) and Os06g16330 (LRR-Pkinase domains). Using LTR_Finder , we identified a 240 bp ancient LTRs flanking the Os06g16300 gene (dark blue arrow) as well as 5 bp TSD motifs associated with 1.1 kb recent LTRs (light blue arrow) flanking two transposase genes, i.e. Os06g16310 and Os06g16320. We then proposed an evolutionary scenario for this locus, where Os06g16300 is the ancestral gene (conserved with the modern Brachypodium gene, Bradi1g43690) that have been partially (illustrated as red exons) duplicated in tandem as Os06g16330 and finally physically separated by the TE-based transposition of the two transposases (Os06g16310 and Os06g16320). Overall, this example of tandem duplication followed by TE-based transposition events illustrates another source of R-gene plasticity reported in the current analysis, leading to R-gene synteny erosion between closely related species.
Taking into account the previous case examples obtained from cacao and rice genomes and in order to investigate R-gene synteny erosion at the whole genome level, we aligned the non-syntenic R-genes with the total R-genes repertoire. Using the parameters CIP > = 70% and CALP > = 70%  to identify the paired non-self matches, and according to the similarity of flanked protein-coding genes between the paired R-genes using E-value < e-10 as a blast threshold, we distinguished segmental from single-gene duplications (also referenced as Small-Scale Duplication i.e. SSD) from these gene pairs (detailed in Methods section). In rice, Brachypodium, sorghum, and maize, we found 13.04% (153 out of 1173), 9.08% (74 out of 815), 12.35% (115 out of 931), and 35.63% (404 out of 1134) R-genes loci (R-genes located in the same clusters were considered as a single locus) involved in single-gene duplications (Additional file 1: Table S8 and Additional file 4: Dataset S3), which is higher than the 5% to 7% of single-gene duplication frequency reported for the total annotated protein-coding genes in grasses . Among grasses, the single-gene duplication frequency in maize is significantly higher than in rice, Brachypodium, and sorghum respectively (P-value = 6.32E-24, 2.28E-29, and 1.71E-22 in Fisher’s Exact Test respectively, cf Additional file 1: Table S9). In addition, we observed hotspots of single-gene duplications where R-loci showing higher sequence similarity with at least two other non-related R-loci was considered as hotspot (Figure 3D). In maize, 51.73% (209 out of 404) of the single-gene duplications frequency was observed, a much higher rate compared to 23.53% (36 out of 153), 37.84% (28 out of 74), and 31.30% (36 out of 115) in rice, Brachypodium, and sorghum respectively (Additional file 1: Table S8). We can then speculate that the recent WGD in maize, dating back to 5 mya, may have promoted and accelerated R-gene singleton duplication frequency compared to the other grasses.
Homologous R-genes sequences within clusters generated by tandem duplications provided the structural template to form novel R-gene informs though domain recombinations. We characterized all the different R-domains in modern clusters and observed a specific domain affinity for clusterization (Additional file 1: Figure S7A and Additional file 1: Table S2). NBS-LRR and LRR-Pkinase combinations are observed as representing the majority of domain combinations in clusters (on average 31.78% and 42.63% out of the total R-domains in clusters for NBS-LRR, 51.08% and 46.47% for LRR-Pkinase domains, respectively in monocots and eudicots), compared to rare observed combinations in clusters for LRR-NBS-PKinase-WRKY or LRR-TIR-WRKY (Additional file 1: Table S2). More interestingly, we observed a preference or affinity in domain combinations where more than 90% of them included LRR (Additional file 1: Figure S7B), with a preferential observed R-domain association with Pkinase (59% and 52% in monocots and eudicots respectively; Additional file 1: Figure S7C) and NBS (41% and 46% in monocots and eudicots respectively; Additional file 1: Figure S7C) domains. Therefore, our data confirm and largely refine previous conclusions suggesting LRR as a ‘glue’ for domain association leading to new combinations of R-gene domains observed in modern species, one major source of R-gene plasticity. This dynamic recombination of R-domains within clusters, especially enriching NBS-LRR associations, may promote the development a novel source of disease resistance in the investigated species.
R-gene plasticity mediated by miRNA/R-gene interactome
MiRNAs, as a versatile class of post-transcriptional gene regulator, are reported to be involved in a large variety of cellular processes, including development and defense responses in plants [77–79]. Small RNA cloning and high-throughput sequencing from plants infected by pathogens have shown that many microRNAs [80–82] and siRNAs  may be involved in biotic defense responses through up or down regulation of targeted gene expression. We wanted then to investigate whether the miRNA/R-gene interactome had an impact on the R-genes evolutionary plasticity as the role of miRNAs in plant immunity system has been largely reported in the literature [80–82]. To unveil if the plant paleoevolution has affected or even shaped the R-gene/miRNA interactome, we investigated miRNAs potentially targeting R-genes in the four monocots investigated (rice, Brachypodium, sorghum, and maize) as well as in nine eudicots (grape, Arabidopsis, strawberry, cacao, papaya, poplar, soybean, apple, and lotus).
miRNA repertoire targeting conserved and lineage-specific R-genes in plants
Lineage-specific WGD rounds
Finally, we investigated the R-domains/miRNA affinity and observed that NBS/TIR > LRR > WRKY/PKinase domains are preferentially targeted by miRNAs (P-value = 2.17E-03 and 8.45E-03, 1.21E-03, and 3.97E-03 with paired student t-test for NBS vs LRR, TIR vs LRR, LRR vs WRKY, LRR vs Pkinase, respectively; Figure 4C & Additional file 1: Table S11). We also observed that 67% and 63% of R-genes clusters are targeted by miRNA in contrast to 33% and 37% of singleton R-genes, respectively in eudicots and monocots (Additional file 1: Figure S8 and Additional file 1: Table S12). Overall, the interaction affinity between species-specific R-genes (in clusters and in majority involving NBS/TIR domains) and miRNAs after WGD can be considered as a major source of R-gene family plasticity in plants, as part of a possible functional diploidization of structurally retained duplicated R-genes.
Diploidization following duplication as a major source of R-genes structural plasticity
Most of the investigated rosids (grape, Arabidopsis, soybean, poplar, and cacao) species experienced up to three WGD events, whereas the investigated grasses (rice, maize, sorghum, and Brachypodium) went through one shared ancestral WGD during their evolution, except for maize which experienced a recent extra-WGD 5 mya . Biased erosion of duplicated gene redundancy between sister blocks has been characterized recently in plants defining dominant and sensitive blocks [68, 69]. In our current analysis, the identification of 23641 R-genes sequences in angiosperms established a higher R-genes conservation in grasses (on average 23.8%) compared to rosids (on average 6.6%) suggesting that successive rounds of WGDs act as a decay into R-genes conservation, as a primer source of R-gene plasticity. The evolutionary investigation of the characterized R-genes repertoire allowed the reconstruction of minimal ancestral pool of 465 and 150 founder R-genes respectively for the grasses and rosids.
Tandem duplication or clusterization played an important role in R-genes plasticity leading to structural variations such as CNV/PAV between species, which are thought to contribute to the reported tremendous R-genes diversity . Special expansion of tandem duplications especially in sensitive chromosomes, as a rapid counterbalance flow of the duplicates deletion phenomenon, may have compensate R-genes loss in such chromosomal fragments as an expected consequence of the known diploidization process. R-genes clusters may have been shaped by classically proposed shuffling mechanisms such as replication slippage, segmental duplication via homologous/non-homologous unequal crossover, transposition via ectopic recombination/TE capture . Such clusters may have been shaped by domain shuffling events , domain breakage and fusion so that R-domain combinations such as LRR-NBS-PK-TIR and LRR-NBS-PK-WRKY might be the result of local shuffling and recombination events. Altogether, R-gene clusterization, a center source of plasticity, triggered a serial of reshuffling events to make rapid copy variation leading to PAVs and CNVs between species, as a putative source for R-gene structural diversity. In the current analysis we observed up to 60% of R-genes organized in clusters in grasses and rosids. Genic and intergenic sequence repeats within R-clusters generated by duplication, transpositions and insertions provide a structural template that allows mis-pairing during recombination giving rise to unequal crossovers and interlocus gene conversions/rearrangements. The resulting R-domain combinations appeared not random with LRR as a ‘glue’ for domain association leading to new resistance gene isoforms in modern plant species.
MicroRNA/R-gene interactome as a major source of R-genes functional diploidization
Recently, new evidences have been proposed regarding miRNAs regulating NBS-LRR in plants such as miR2109/miR2118/miR1507 in Medicago, miR482/miR2118 in tomato, and miR6019/miR6020 in tobacco guiding the cleavage of transcript of NBS-LRRs, and then triggering the secondary phased of siRNA production by RNA-dependent RNA polymerase [88–90]. Thus, MiRNAs may be involved in defense immunity in regulating R-genes expression level. Our in silico analysis suggests that miRNA may target preferentially duplicated R-genes either deriving from WGDs and more interestingly from local tandem duplications (i.e. clusters). This observation may suggest that the presence of redundant duplicated R-genes copies, when retained after diploidization, may require modification or specialization in expression/regulation through possibly miRNA interaction. Moreover, a specific R-domain affinity was observed for miRNA in silico interaction toward LRR/NBS/TIR, which may indicate to some extant a domain preference for R-gene/miRNA interaction. Overall, our data may suggest miRNAs as a dosage regulator playing a possible role in R-genes functional redundancy erosion following large or local duplication events. The preferential post-transcriptional regulation of duplicated R-genes by miRNA can be proposed as part of a functional diploidization process in response to duplications to maintain a perfect dosage balance regarding the product of R-genes duplicates.
Putative model of R-genes paleohistory in plants
We reconstructed the R-genes paleohistory in plant unraveling duplications (either whole genome, small-scale clusters or single-gene based) as the major source of structural (CNV, PAV, domain recombination) or even potentially functional (enhanced miRNA regulation for R-gene clusters and specific R-domains) plasticity that may have promote the development a novel source of disease resistance in the course of the evolution of the different investigated species. The conserved role of similar R-gene families (especially TIR and NBS-LRR) in both plant and animal defense systems suggest a common and ancestral origin. The current reconstruction of the ancestral gene pool in angiosperm opens the perspective to determine the origin of innate immunity mechanism in eukaryotes.
R-gene identification and mapping
R-genes were selected from monocots Oryza sativa, Sorghum bicolor, Zea mays, Brachypodium distachyon, and eudicots Arabidopsis thaliana, Populus trichocarpa, Carica papaya, Glycine max, Lotus japonicas, Fragaria vesca, and Theobroma cacao, on the basis of functional annotations available on Phytozome (http://www.phytozome.net/), Plant GDB (http://www.plantgdb.org/). Vitis vinifera and Malus x domestica R-genes annotation were retrieved from  and  supplementary data. The R-genes identification methods is illustrated in Additional file 1: Figure S1A. PFAM domain identification – Putative R-genes were investigated using profile Hidden Markov Model. Several PFAM profiles  were used to extract putative R-gene proteins within the 13 genomes investigated: LRR: pf00560, pf07723, pf07725, pf12799, pf01463, pf08263; NB-ARC: pf00931; TIR: pf01582; LysM: pf01476; Pkinase: pf00069; WRKY: pf03106. PFAM profiles were identified within genomes using the hmmsearch algorithm (e-value cut: 1e-10) from HMMER3 (http://hmmer.janelia.org/; ). PRGdb – R-genes sequences from the Plant Resistance Gene database were downloaded (http://prgdb.cbm.fvg.it/index.php, ). Annotation, PFAM and PRGdb –based R-genes were aligned against Soybean (46194 protein sequences), Cacao (27814 protein sequences), Strawberry (34809 protein sequences), Lotus (15470 protein sequences), Papaya (19205 protein sequences), Poplar (30260 protein sequences), Apple (58979 protein sequences), Brachypodium, (32255 protein sequences) Sorghum (36338 protein sequences), and Maize (53764 protein sequences) genome data using BLASTP (PFAM and Annotation R-gene sequences) and BLASTX (PRGdb sequences). BLAST results were parsed using CIP (Cumulative Identity Percentage) and CALP (Cumulative Alignment Length Percentage) parameters (70% as minimum threshold) delivering a non-redundant list of R-genes for each species .
Orthologs/Paralogs identification and synteny relationships
Orthologous and paralogous R-genes were identified aligning Rice RefBank against Brachypodium, Sorghum and Maize using BLASTALL. BLASTP results are parsed with CIP and CALP parameters set to 60% and 70% as minimum threshold for ortholog identification and 60% and 70% as minimum threshold for paralog identification as described in . Ancestral relationship between monocot species were represented as concentric circles with the visualization tool Circos . Relationship between rosids species were investigated with the same protocol. Grape RefBank was aligned against Soybean, Cacao, Strawberry, Lotus, Papaya, Poplar and Apple genome data. BLAST results were parsed with the same parameters described previously. The synteny relations at the chromosome levels were considered using public synteny data available for both Monocotyledones and Eudicotyledones .
R-genes not located in the syntenic region were BLAST aligned against the total R-genes content. The gene pairs excluding self matches (CIP > = 70%, CIAP > = 70%) were considered as single-gene duplication and used to the further analysis. Then we selected 40 flanking genes windows surrounding R-genes pairs. If flanking pairs with E-value < = e-10 are observed these paired R-genes were then considered as part of a segmental duplication, otherwise, as single-gene duplication.
Permutation-test for R-genes partitioning between duplicated blocks
MiRNA identification associated with R-genes as targets
Mature miRNAs dataset from miRBase (http://www.mirbase.org/; Release 18) was used to predict R-genes as targets in the investigated plant genomes Targetfinder algorithm (http://carringtonlab.org/resources/targetfinder/) with score < 4 . To reduce the false positive, secondary structures of the identified mature miRNA was validated using MiReNA software . R-genes targeted by miRNA with validated secondary structure and with a mismatch score < 4, are considered as in silico targets of miRNA. The detailed pipeline is illustrated in Additional file 1: Figure S1B.
This work has been supported by grants from the Agence Nationale de la Recherche (program ANR Blanc-PAGE, ref: ANR-2011-BSV6-00801).
- Glazebrook J: Contrasting mechanisms of defense against biotrophic and necrotrophic pathogens. Annu Rev Phytopathol. 2005, 43: 205-227. 10.1146/annurev.phyto.43.040204.135923.PubMedView ArticleGoogle Scholar
- Pallas V, Garcia JA: How do plant viruses induce disease? Interactions and interference with host components. J Gen Virol. 2011, 92 (Pt 12): 2691-2705.PubMedView ArticleGoogle Scholar
- Soriano IR, Riley IT, Potter MJ, Bowers WS: Phytoecdysteroids: a novel defense against plant-parasitic nematodes. J Chem Ecol. 2004, 30 (10): 1885-1899.PubMedView ArticleGoogle Scholar
- Choy A, Roy CR: Autophagy and bacterial infection: an evolving arms race. Trends Microbiol. 2013, 21 (9): 451-456. 10.1016/j.tim.2013.06.009.PubMedView ArticleGoogle Scholar
- Chisholm ST, Coaker G, Day B, Staskawicz BJ: Host-microbe interactions: shaping the evolution of the plant immune response. Cell. 2006, 124 (4): 803-814. 10.1016/j.cell.2006.02.008.PubMedView ArticleGoogle Scholar
- Medzhitov R, Janeway CA: Innate immunity: the virtues of a nonclonal system of recognition. Cell. 1997, 91 (3): 295-298. 10.1016/S0092-8674(00)80412-2.PubMedView ArticleGoogle Scholar
- Nurnberger T, Brunner F: Innate immunity in plants and animals: emerging parallels between the recognition of general elicitors and pathogen-associated molecular patterns. Curr Opin Plant Biol. 2002, 5 (4): 318-324. 10.1016/S1369-5266(02)00265-0.PubMedView ArticleGoogle Scholar
- Mur LAJ, Kenton P, Lloyd AJ, Ougham H, Prats E: The hypersensitive response; the centenary is upon us but how much do we know?. J Exp Bot. 2008, 59 (3): 20-View ArticleGoogle Scholar
- Hammond-Kosack KE, Parker JE: Deciphering plant-pathogen communication: fresh perspectives for molecular resistance breeding. Curr Opin Biotechnol. 2003, 14 (2): 177-193. 10.1016/S0958-1669(03)00035-1.PubMedView ArticleGoogle Scholar
- Glowacki S, Macioszek VK, Kononowicz AK: R proteins as fundamentals of plant innate immunity. Cell Mol Biol Lett. 2011, 16 (1): 1-24. 10.2478/s11658-010-0024-2.PubMedView ArticleGoogle Scholar
- Qiu D, Xiao J, Ding X, Xiong M, Cai M, Cao Y, Li X, Xu C, Wang S: OsWRKY13 mediates rice disease resistance by regulating defense-related genes in salicylate- and jasmonate-dependent signaling. Mol Plant Microbe Interact. 2007, 20 (5): 492-499. 10.1094/MPMI-20-5-0492.PubMedView ArticleGoogle Scholar
- Shimono M, Koga H, Akagi A, Hayashi N, Goto S, Sawada M, Kurihara T, Matsushita A, Sugano S, Jiang CJ, Kaku H, Inoue H, Takatsuji H: Rice WRKY45 plays important roles in fungal and bacterial disease resistance. Mol Plant Pathol. 2012, 13 (1): 83-94. 10.1111/j.1364-3703.2011.00732.x.PubMedView ArticleGoogle Scholar
- Spaink HP: Specific recognition of bacteria by plant LysM domain receptor kinases. Trends Microbiol. 2004, 12 (5): 201-204. 10.1016/j.tim.2004.03.001.PubMedView ArticleGoogle Scholar
- Buist G, Steen A, Kok J, Kuipers OP: LysM, a widely distributed protein motif for binding to (peptido)glycans. Mol Microbiol. 2008, 68 (4): 838-847. 10.1111/j.1365-2958.2008.06211.x.PubMedView ArticleGoogle Scholar
- Witte CP, Keinath N, Dubiella U, Demouliere R, Seal A, Romeis T: Tobacco calcium-dependent protein kinases are differentially phosphorylated in vivo as part of a kinase cascade that regulates stress response. J Biol Chem. 2010, 285 (13): 9740-9748. 10.1074/jbc.M109.052126.PubMed CentralPubMedView ArticleGoogle Scholar
- Kurusu T, Hamada J, Nokajima H, Kitagawa Y, Kiyoduka M, Takahashi A, Hanamata S, Ohno R, Hayashi T, Okada K, Koga J, Hirochika H, Yamane H, Kuchitsu K: Regulation of microbe-associated molecular pattern-induced hypersensitive cell death, phytoalexin production, and defense gene expression by calcineurin B-like protein-interacting protein kinases, OsCIPK14/15, in rice cultured cells. Plant Physiol. 2010, 153 (2): 678-692. 10.1104/pp.109.151852.PubMed CentralPubMedView ArticleGoogle Scholar
- Bent AF: Plant disease resistance genes: function meets structure. Plant Cell. 1996, 8 (10): 1757-1771.PubMed CentralPubMedView ArticleGoogle Scholar
- Wang J, Tan S, Zhang L, Li P, Tian D: Co-variation among major classes of LRR-encoding genes in two pairs of plant species. J Mol Evol. 2011, 72 (5–6): 498-509.PubMedView ArticleGoogle Scholar
- Panstruga R, Parker JE, Schulze-Lefert P: SnapShot: plant immune response pathways. Cell. 2009, 136 (5): 978-e971-973PubMedView ArticleGoogle Scholar
- Yue JX, Meyers BC, Chen JQ, Tian D, Yang S: Tracing the origin and evolutionary history of plant nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes. New Phytol. 2012, 193 (4): 1049-1063. 10.1111/j.1469-8137.2011.04006.x.PubMedView ArticleGoogle Scholar
- Meyers BC, Dickerman AW, Michelmore RW, Sivaramakrishnan S, Sobral BW, Young ND: Plant disease resistance genes encode members of an ancient and diverse protein family within the nucleotide-binding superfamily. Plant J. 1999, 20 (3): 317-332. 10.1046/j.1365-313X.1999.t01-1-00606.x.PubMedView ArticleGoogle Scholar
- Akita M, Valkonen JP: A novel gene family in moss (Physcomitrella patens) shows sequence homology and a phylogenetic relationship with the TIR-NBS class of plant disease resistance genes. J Mol Evol. 2002, 55 (5): 595-605. 10.1007/s00239-002-2355-8.PubMedView ArticleGoogle Scholar
- Bertin J, Nir WJ, Fischer CM, Tayber OV, Errada PR, Grant JR, Keilty JJ, Gosselin ML, Robison KE, Wong GH, Glucksmann MA, DiStefano PS: Human CARD4 protein is a novel CED-4/Apaf-1 cell death family member that activates NF-kappaB. J Biol Chem. 1999, 274 (19): 12955-12958. 10.1074/jbc.274.19.12955.PubMedView ArticleGoogle Scholar
- Meyers BC, Morgante M, Michelmore RW: TIR-X and TIR-NBS proteins: two new families related to disease resistance TIR-NBS-LRR proteins encoded in Arabidopsis and other plant genomes. Plant J. 2002, 32 (1): 77-92. 10.1046/j.1365-313X.2002.01404.x.PubMedView ArticleGoogle Scholar
- Liu JJ, Ekramoddoullah AK: Isolation, genetic variation and expression of TIR-NBS-LRR resistance gene analogs from western white pine ( Pinus monticola Dougl. ex. D. Don). Mol Genet Genomics. 2003, 270 (5): 432-441.PubMedView ArticleGoogle Scholar
- Girardin SE, Sansonetti PJ, Philpott DJ: Intracellular vs extracellular recognition of pathogens–common concepts in mammals and flies. Trends Microbiol. 2002, 10 (4): 193-199. 10.1016/S0966-842X(02)02334-X.PubMedView ArticleGoogle Scholar
- Tarr DE, Alexander HM: TIR-NBS-LRR genes are rare in monocots: evidence from diverse monocot orders. BMC Res Notes. 2009, 2: 197-10.1186/1756-0500-2-197.PubMed CentralPubMedView ArticleGoogle Scholar
- Bai J, Pennill LA, Ning J, Lee SW, Ramalingam J, Webb CA, Zhao B, Sun Q, Nelson JC, Leach JE, Hulbert SH: Diversity in nucleotide binding site-leucine-rich repeat genes in cereals. Genome Res. 2002, 12 (12): 1871-1884. 10.1101/gr.454902.PubMed CentralPubMedView ArticleGoogle Scholar
- Meyers BC, Kozik A, Griego A, Kuang H, Michelmore RW: Genome-wide analysis of NBS-LRR-encoding genes in Arabidopsis. Plant Cell. 2003, 15 (4): 809-834. 10.1105/tpc.009308.PubMed CentralPubMedView ArticleGoogle Scholar
- Yang S, Zhang X, Yue JX, Tian D, Chen JQ: Recent duplications dominate NBS-encoding gene expansion in two woody species. Mol Genet Genomics. 2008, 280 (3): 187-198. 10.1007/s00438-008-0355-0.PubMedView ArticleGoogle Scholar
- Porter BW, Paidi M, Ming R, Alam M, Nishijima WT, Zhu YJ: Genome-wide analysis of Carica papaya reveals a small NBS resistance gene family. Mol Genet Genomics. 2009, 281 (6): 609-626. 10.1007/s00438-009-0434-x.PubMedView ArticleGoogle Scholar
- Leister D: Tandem and segmental gene duplication and recombination in the evolution of plant disease resistance gene. Trends Genet. 2004, 20 (3): 116-122. 10.1016/j.tig.2004.01.007.PubMedView ArticleGoogle Scholar
- Luo S, Zhang Y, Hu Q, Chen J, Li K, Lu C, Liu H, Wang W, Kuang H: Dynamic nucleotide-binding site and leucine-rich repeat-encoding genes in the grass family. Plant Physiol. 2012, 159 (1): 197-210. 10.1104/pp.111.192062.PubMed CentralPubMedView ArticleGoogle Scholar
- Guo YL, Fitz J, Schneeberger K, Ossowski S, Cao J, Weigel D: Genome-wide comparison of nucleotide-binding site-leucine-rich repeat-encoding genes in Arabidopsis. Plant Physiol. 2011, 157 (2): 757-769. 10.1104/pp.111.181990.PubMed CentralPubMedView ArticleGoogle Scholar
- Shen J, Araki H, Chen L, Chen JQ, Tian D: Unique evolutionary mechanism in R-genes under the presence/absence polymorphism in Arabidopsis thaliana. Genetics. 2006, 172 (2): 1243-1250.PubMed CentralPubMedView ArticleGoogle Scholar
- Yang S, Feng Z, Zhang X, Jiang K, Jin X, Hang Y, Chen JQ, Tian D: Genome-wide investigation on the genetic variations of rice disease resistance genes. Plant Mol Biol. 2006, 62 (1–2): 181-193.PubMedView ArticleGoogle Scholar
- Leister D, Kurth J, Laurie DA, Yano M, Sasaki T, Devos K, Graner A, Schulze-Lefert P: Rapid reorganization of resistance gene homologues in cereal genomes. Proc Natl Acad Sci U S A. 1998, 95 (1): 370-375. 10.1073/pnas.95.1.370.PubMed CentralPubMedView ArticleGoogle Scholar
- Meyers BC, Kaushik S, Nandety RS: Evolving disease resistance genes. Curr Opin Plant Biol. 2005, 8 (2): 129-134. 10.1016/j.pbi.2005.01.002.PubMedView ArticleGoogle Scholar
- Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J, Gundlach H, Haberer G, Hellsten U, Mitros T, Poliakov A, Schmutz J, Spannagl M, Tang H, Wang X, Wicker T, Bharti AK, Chapman J, Feltus FA, Gowik U, Grigoriev IV, Lyons E, Maher CA, Martis M, Narechania A, Otillar RP, Penning BW, Salamov AA, Wang Y, Zhang L, Carpita NC, et al: The Sorghum bicolor genome and the diversification of grasses. Nature. 2009, 457 (7229): 551-556. 10.1038/nature07723.PubMedView ArticleGoogle Scholar
- Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, Liang C, Zhang J, Fulton L, Graves TA, Minx P, Reily AD, Courtney L, Kruchowski SS, Tomlinson C, Strong C, Delehaunty K, Fronick C, Courtney B, Rock SM, Belter E, Du F, Kim K, Abbott RM, Cotton M, Levy A, Marchetto P, Ochoa K, Jackson SM, Gillam B, et al: The B73 maize genome: complexity, diversity, and dynamics. Science. 2009, 326 (5956): 1112-1115. 10.1126/science.1178534.PubMedView ArticleGoogle Scholar
- International Rice Genome Sequencing P: The map-based sequence of the rice genome. Nature. 2005, 436 (7052): 793-800. 10.1038/nature03895.View ArticleGoogle Scholar
- International Brachypodium I: Genome sequencing and analysis of the model grass Brachypodium distachyon. Nature. 2010, 463 (7282): 763-768. 10.1038/nature08747.View ArticleGoogle Scholar
- Salse J, Abrouk M, Bolot S, Guilhot N, Courcelle E, Faraut T, Waugh R, Close TJ, Messing J, Feuillet C: Reconstruction of monocotelydoneous proto-chromosomes reveals faster evolution in plants than in animals. Proc Natl Acad Sci U S A. 2009, 106 (35): 14908-14913. 10.1073/pnas.0902350106.PubMed CentralPubMedView ArticleGoogle Scholar
- Salse J, Abrouk M, Murat F, Quraishi UM, Feuillet C: Improved criteria and comparative genomics tool provide new insights into grass paleogenomics. Brief Bioinform. 2009, 10 (6): 619-630. 10.1093/bib/bbp037.PubMedView ArticleGoogle Scholar
- Bolot S, Abrouk M, Masood-Quraishi U, Stein N, Messing J, Feuillet C, Salse J: The ‘inner circle’ of the cereal genomes. Curr Opin Plant Biol. 2009, 12 (2): 119-125. 10.1016/j.pbi.2008.10.011.PubMedView ArticleGoogle Scholar
- Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, Vezzi A, Legeai F, Hugueney P, Dasilva C, Horner D, Mica E, Jublot D, Poulain J, Bruyère C, Billault A, Segurens B, Gouyvenoux M, Ugarte E, Cattonaro F, Anthouard V, Vico V, Del Fabbro C, Alaux M, Di Gaspero G, Dumas V, et al: The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature. 2007, 449 (7161): 463-467. 10.1038/nature06148.PubMedView ArticleGoogle Scholar
- Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, Schein J, Sterck L, Aerts A, Bhalerao RR, Bhalerao RP, Blaudez D, Boerjan W, Brun A, Brunner A, Busov V, Campbell M, Carlson J, Chalot M, Chapman J, Chen GL, Cooper D, Coutinho PM, Couturier J, Covert S, Cronk Q, et al: The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science. 2006, 313 (5793): 1596-1604. 10.1126/science.1128691.PubMedView ArticleGoogle Scholar
- Arabidopsis Genome I: Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000, 408 (6814): 796-815. 10.1038/35048692.View ArticleGoogle Scholar
- Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, Hyten DL, Song Q, Thelen JJ, Cheng J, Xu D, Hellsten U, May GD, Yu Y, Sakurai T, Umezawa T, Bhattacharyya MK, Sandhu D, Valliyodan B, Lindquist E, Peto M, Grant D, Shu S, Goodstein D, Barry K, Futrell-Griggs M, Abernathy B, Du J, Tian Z, Zhu L, et al: Genome sequence of the palaeopolyploid soybean. Nature. 2010, 463 (7278): 178-183. 10.1038/nature08670.PubMedView ArticleGoogle Scholar
- Argout X, Salse J, Aury JM, Guiltinan MJ, Droc G, Gouzy J, Allegre M, Chaparro C, Legavre T, Maximova SN, Abrouk M, Murat F, Fouet O, Poulain J, Ruiz M, Roguet Y, Rodier-Goud M, Barbosa-Neto JF, Sabot F, Kudrna D, Ammiraju JS, Schuster SC, Carlson JE, Sallet E, Schiex T, Dievart A, Kramer M, Gelley L, Shi Z, Bérard A, et al: The genome of Theobroma cacao. Nat Genet. 2011, 43 (2): 101-108. 10.1038/ng.736.PubMedView ArticleGoogle Scholar
- Abrouk M, Murat F, Pont C, Messing J, Jackson S, Faraut T, Tannier E, Plomion C, Cooke R, Feuillet C, Salse J: Palaeogenomics of plants: synteny-based modelling of extinct ancestors. Trends Plant Sci. 2010, 15 (9): 479-487. 10.1016/j.tplants.2010.06.001.PubMedView ArticleGoogle Scholar
- Paterson AH, Bowers JE, Chapman BA: Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics. Proc Natl Acad Sci U S A. 2004, 101 (26): 9903-9908. 10.1073/pnas.0307901101.PubMed CentralPubMedView ArticleGoogle Scholar
- Tang H, Bowers JE, Wang X, Ming R, Alam M, Paterson AH: Synteny and collinearity in plant genomes. Science. 2008, 320 (5875): 486-488. 10.1126/science.1153917.PubMedView ArticleGoogle Scholar
- Tang H, Wang X, Bowers JE, Ming R, Alam M, Paterson AH: Unraveling ancient hexaploidy through multiply-aligned angiosperm gene maps. Genome Res. 2008, 18 (12): 1944-1954. 10.1101/gr.080978.108.PubMed CentralPubMedView ArticleGoogle Scholar
- Van de Peer Y, Fawcett JA, Proost S, Sterck L, Vandepoele K: The flowering world: a tale of duplications. Trends Plant Sci. 2009, 14 (12): 680-688. 10.1016/j.tplants.2009.09.001.PubMedView ArticleGoogle Scholar
- Bowers JE, Chapman BA, Rong J, Paterson AH: Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature. 2003, 422 (6930): 433-438. 10.1038/nature01521.PubMedView ArticleGoogle Scholar
- Lawton-Rauh A: Evolutionary dynamics of duplicated genes in plants. Mol Phylogenet Evol. 2003, 29 (3): 396-409. 10.1016/j.ympev.2003.07.004.PubMedView ArticleGoogle Scholar
- Blanc G, Wolfe KH: Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell. 2004, 16 (7): 1667-1678. 10.1105/tpc.021345.PubMed CentralPubMedView ArticleGoogle Scholar
- Velasco R, Zharkikh A, Troggio M, Cartwright DA, Cestaro A, Pruss D, Pindo M, Fitzgerald LM, Vezzulli S, Reid J, Malacarne G, Iliev D, Coppola G, Wardell B, Micheletti D, Macalma T, Facci M, Mitchell JT, Perazzolli M, Eldredge G, Gatto P, Oyzerski R, Moretto M, Gutin N, Stefanini M, Chen Y, Segala C, Davenport C, Demattè L, Mraz A, et al: A high quality draft consensus sequence of the genome of a heterozygous grapevine variety. PLoS One. 2007, 2 (12): e1326-10.1371/journal.pone.0001326.PubMed CentralPubMedView ArticleGoogle Scholar
- Velasco R, Zharkikh A, Affourtit J, Dhingra A, Cestaro A, Kalyanaraman A, Fontana P, Bhatnagar SK, Troggio M, Pruss D, Salvi S, Pindo M, Baldi P, Castelletti S, Cavaiuolo M, Coppola G, Costa F, Cova V, Dal Ri A, Goremykin V, Komjanc M, Longhi S, Magnago P, Malacarne G, Malnoy M, Micheletti D, Moretto M, Perazzolli M, Si-Ammour A, Vezzulli S, et al: The genome of the domesticated apple (Malus x domestica Borkh). Nat Genet. 2010, 42 (10): 833-839. 10.1038/ng.654.PubMedView ArticleGoogle Scholar
- Sato S, Nakamura Y, Kaneko T, Asamizu E, Kato T, Nakao M, Sasamoto S, Watanabe A, Ono A, Kawashima K, Fujishiro T, Katoh M, Kohara M, Kishida Y, Minami C, Nakayama S, Nakazaki N, Shimizu Y, Shinpo S, Takahashi C, Wada T, Yamada M, Ohmido N, Hayashi M, Fukui K, Baba T, Nakamichi T, Mori H, Tabata S: Genome structure of the legume, Lotus japonicus. DNA Res. 2008, 15 (4): 227-239. 10.1093/dnares/dsn008.PubMed CentralPubMedView ArticleGoogle Scholar
- Shulaev V, Sargent DJ, Crowhurst RN, Mockler TC, Folkerts O, Delcher AL, Jaiswal P, Mockaitis K, Liston A, Mane SP, Burns P, Davis TM, Slovin JP, Bassil N, Hellens RP, Evans C, Harkins T, Kodira C, Desany B, Crasta OR, Jensen RV, Allan AC, Michael TP, Setubal JC, Celton JM, Rees DJ, Williams KP, Holt SH, Ruiz Rojas JJ, Chatterjee M, et al: The genome of woodland strawberry (Fragaria vesca). Nat Genet. 2011, 43 (2): 109-116. 10.1038/ng.740.PubMed CentralPubMedView ArticleGoogle Scholar
- Ming R, Hou S, Feng Y, Yu Q, Dionne-Laporte A, Saw JH, Senin P, Wang W, Ly BV, Lewis KL, Salzberg SL, Feng L, Jones MR, Skelton RL, Murray JE, Chen C, Qian W, Shen J, Du P, Eustice M, Tong E, Tang H, Lyons E, Paull RE, Michael TP, Wall K, Rice DW, Albert H, Wang ML, Zhu YJ: The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus). Nature. 2008, 452 (7190): 991-996. 10.1038/nature06856.PubMed CentralPubMedView ArticleGoogle Scholar
- Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, Heger A, Hetherington K, Holm L, Mistry J, Sonnhammer EL, Tate J, Punta M: The Pfam protein families database. Nucleic Acids Res. 2014, 42 (Database issue): D222-D230.PubMed CentralPubMedView ArticleGoogle Scholar
- Sanseverino W, Roma G, De Simone M, Faino L, Melito S, Stupka E, Frusciante L, Ercolano MR: PRGdb: a bioinformatics platform for plant resistance gene analysis. Nucleic Acids Res. 2010, 38 (Database issue): D814-D821.PubMed CentralPubMedView ArticleGoogle Scholar
- Salse J: In silico archeogenomics unveils modern plant genome organisation, regulation and evolution. Curr Opin Plant Biol. 2012, 15 (2): 122-130. 10.1016/j.pbi.2012.01.001.PubMedView ArticleGoogle Scholar
- Murat F, Xu JH, Tannier E, Abrouk M, Guilhot N, Pont C, Messing J, Salse J: Ancestral grass karyotype reconstruction unravels new mechanisms of genome shuffling as a source of plant evolution. Genome Res. 2010, 20 (11): 1545-1557. 10.1101/gr.109744.110.PubMed CentralPubMedView ArticleGoogle Scholar
- Abrouk M, Zhang R, Murat F, Li A, Pont C, Mao L, Salse J: Grass microRNA gene paleohistory unveils new insights into gene dosage balance in subgenome partitioning after whole-genome duplication. Plant Cell. 2012, 24 (5): 1776-1792. 10.1105/tpc.112.095752.PubMed CentralPubMedView ArticleGoogle Scholar
- Schnable JC, Freeling M, Lyons E: Genome-wide analysis of syntenic gene deletion in the grasses. Genome Biol Evol. 2012, 4 (3): 265-277. 10.1093/gbe/evs009.PubMed CentralPubMedView ArticleGoogle Scholar
- Thomas BC, Pedersen B, Freeling M: Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes. Genome Res. 2006, 16 (7): 934-946. 10.1101/gr.4708406.PubMed CentralPubMedView ArticleGoogle Scholar
- Sankoff D, Zheng C, Zhu Q: The collapse of gene complement following whole genome duplication. BMC Genomics. 2010, 11: 313-10.1186/1471-2164-11-313.PubMed CentralPubMedView ArticleGoogle Scholar
- Xiong Y, Liu T, Tian C, Sun S, Li J, Chen M: Transcription factors in rice: a genome-wide comparative analysis between monocots and eudicots. Plant Mol Biol. 2005, 59 (1): 191-203. 10.1007/s11103-005-6503-6.PubMedView ArticleGoogle Scholar
- Freeling M: Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental, or by transposition. Annu Rev Plant Biol. 2009, 60: 433-453. 10.1146/annurev.arplant.043008.092122.PubMedView ArticleGoogle Scholar
- Richly E, Kurth J, Leister D: Mode of amplification and reorganization of resistance genes during recent Arabidopsis thaliana evolution. Mol Biol Evol. 2002, 19 (1): 76-84. 10.1093/oxfordjournals.molbev.a003984.PubMedView ArticleGoogle Scholar
- Xu Z, Wang H: LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 2007, 35 (Web Server issue): W265-W268.PubMed CentralPubMedView ArticleGoogle Scholar
- Wicker T, Buchmann JP, Keller B: Patching gaps in plant genomes results in gene movement and erosion of colinearity. Genome Res. 2010, 20 (9): 1229-1237. 10.1101/gr.107284.110.PubMed CentralPubMedView ArticleGoogle Scholar
- Reinhart BJ, Weinstein EG, Rhoades MW, Bartel B, Bartel DP: MicroRNAs in plants. Genes Dev. 2002, 16 (13): 1616-1626. 10.1101/gad.1004402.PubMed CentralPubMedView ArticleGoogle Scholar
- Lim LP, Glasner ME, Yekta S, Burge CB, Bartel DP: Vertebrate microRNA genes. Science. 2003, 299 (5612): 1540-10.1126/science.1080372.PubMedView ArticleGoogle Scholar
- Bagga S, Pasquinelli AE: Identification and analysis of microRNAs. Genet Eng. 2006, 27: 1-20. 10.1007/0-387-25856-6_1.View ArticleGoogle Scholar
- Navarro L, Dunoyer P, Jay F, Arnold B, Dharmasiri N, Estelle M, Voinnet O, Jones JD: A plant miRNA contributes to antibacterial resistance by repressing auxin signaling. Science. 2006, 312 (5772): 436-439. 10.1126/science.1126088.PubMedView ArticleGoogle Scholar
- Li Y, Zhang Q, Zhang J, Wu L, Qi Y, Zhou JM: Identification of microRNAs involved in pathogen-associated molecular pattern-triggered plant innate immunity. Plant Physiol. 2010, 152 (4): 2222-2231. 10.1104/pp.109.151803.PubMed CentralPubMedView ArticleGoogle Scholar
- Du P, Wu J, Zhang J, Zhao S, Zheng H, Gao G, Wei L, Li Y: Viral infection induces expression of novel phased microRNAs from conserved cellular microRNA precursors. PLoS Pathog. 2011, 7 (8): e1002176-10.1371/journal.ppat.1002176.PubMed CentralPubMedView ArticleGoogle Scholar
- Katiyar-Agarwal S, Morgan R, Dahlbeck D, Borsani O, Villegas A, Zhu JK, Staskawicz BJ, Jin H: A pathogen-inducible endogenous siRNA in plant immunity. Proc Natl Acad Sci U S A. 2006, 103 (47): 18002-18007. 10.1073/pnas.0608258103.PubMed CentralPubMedView ArticleGoogle Scholar
- Allen E, Xie Z, Gustafson AM, Carrington JC: microRNA-directed phasing during trans-acting siRNA biogenesis in plants. Cell. 2005, 121 (2): 207-221. 10.1016/j.cell.2005.04.004.PubMedView ArticleGoogle Scholar
- McHale LK, Haun WJ, Xu WW, Bhaskar PB, Anderson JE, Hyten DL, Gerhardt DJ, Jeddeloh JA, Stupar RM: Structural variants in the soybean genome localize to clusters of biotic stress-response genes. Plant Physiol. 2012, 159 (4): 1295-1308. 10.1104/pp.112.194605.PubMed CentralPubMedView ArticleGoogle Scholar
- Chantret N, Salse J, Sabot F, Rahman S, Bellec A, Laubin B, Dubois I, Dossat C, Sourdille P, Joudrier P, Gautier MF, Cattolico L, Beckert M, Aubourg S, Weissenbach J, Caboche M, Bernard M, Leroy P, Chalhoub B: Molecular basis of evolutionary events that shaped the hardness locus in diploid and polyploid wheat species (Triticum and Aegilops). Plant Cell. 2005, 17 (4): 1033-1045. 10.1105/tpc.104.029181.PubMed CentralPubMedView ArticleGoogle Scholar
- Sun X, Wang GL: Genome-wide identification, characterization and phylogenetic analysis of the rice LRR-kinases. PLoS One. 2011, 6 (3): e16079-10.1371/journal.pone.0016079.PubMed CentralPubMedView ArticleGoogle Scholar
- Zhai J, Jeong DH, De Paoli E, Park S, Rosen BD, Li Y, González AJ, Yan Z, Kitto SL, Grusak MA, Jackson SA, Stacey G, Cook DR, Green PJ, Sherrier DJ, Meyers BC: MicroRNAs as master regulators of the plant NB-LRR defense gene family via the production of phased, trans-acting siRNAs. Genes Dev. 2011, 25 (23): 2540-2553. 10.1101/gad.177527.111.PubMed CentralPubMedView ArticleGoogle Scholar
- Li F, Pignatta D, Bendix C, Brunkard JO, Cohn MM, Tung J, Sun H, Kumar P, Baker B: MicroRNA regulation of plant innate immune receptors. Proc Natl Acad Sci U S A. 2012, 109 (5): 1790-1795. 10.1073/pnas.1118282109.PubMed CentralPubMedView ArticleGoogle Scholar
- Eckardt NA: A microRNA cascade in plant defense. Plant Cell. 2012, 24 (3): 840-10.1105/tpc.112.240311.PubMed CentralPubMedView ArticleGoogle Scholar
- Finn RD, Clements J, Eddy SR: HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 2011, 39 (Web Server issue): W29-W37.PubMed CentralPubMedView ArticleGoogle Scholar
- Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA: Circos: an information aesthetic for comparative genomics. Genome Res. 2009, 19 (9): 1639-1645. 10.1101/gr.092759.109.PubMed CentralPubMedView ArticleGoogle Scholar
- Mathelier A, Carbone A: MIReNA: finding microRNAs with high accuracy and no learning at genome scale and from deep sequencing data. Bioinformatics. 2010, 26 (18): 2226-2234. 10.1093/bioinformatics/btq329.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.