- Research article
- Open Access
LRR-RLK family from two Citrus species: genome-wide identification and evolutionary aspects
BMC Genomics volume 17, Article number: 623 (2016)
Leucine-rich repeat receptor-like kinases (LRR-RLKs) represent the largest subfamily of plant RLKs. The functions of most LRR-RLKs have remained undiscovered, and a few that have been experimentally characterized have been shown to have important roles in growth and development as well as in defense responses. Although RLK subfamilies have been previously studied in many plants, no comprehensive study has been performed on this gene family in Citrus species, which have high economic importance and are frequent targets for emerging pathogens. In this study, we performed in silico analysis to identify and classify LRR-RLK homologues in the predicted proteomes of Citrus clementina (clementine) and Citrus sinensis (sweet orange). In addition, we used large-scale phylogenetic approaches to elucidate the evolutionary relationships of the LRR-RLKs and further narrowed the analysis to the LRR-XII group, which contains several previously described cell surface immune receptors.
We built integrative protein signature databases for Citrus clementina and Citrus sinensis using all predicted protein sequences obtained from whole genomes. A total of 300 and 297 proteins were identified as LRR-RLKs in C. clementina and C. sinensis, respectively. Maximum-likelihood phylogenetic trees were estimated using Arabidopsis LRR-RLK as a template and they allowed us to classify Citrus LRR-RLKs into 16 groups. The LRR-XII group showed a remarkable expansion, containing approximately 150 paralogs encoded in each Citrus genome. Phylogenetic analysis also demonstrated the existence of two distinct LRR-XII clades, each one constituted mainly by RD and non-RD kinases. We identified 68 orthologous pairs from the C. clementina and C. sinensis LRR-XII genes. In addition, among the paralogs, we identified a subset of 78 and 62 clustered genes probably derived from tandem duplication events in the genomes of C. clementina and C. sinensis, respectively.
This work provided the first comprehensive evolutionary analysis of the LRR-RLKs in Citrus. A large expansion of LRR-XII in Citrus genomes suggests that it might play a key role in adaptive responses in host-pathogen co-evolution, related to the perennial life cycle and domestication of the citrus crop species.
Signaling through cell surface receptors is essential for cells to communicate and interact with the environment. Plant cells are able to perceive and transduce a wide range of signals mainly through receptor-like kinases (RLKs), which mediate cell-to-cell communication by binding to extracellular ligands or forming heteromeric complexes to mediate intracellular signaling . These modular kinase receptors belong to a large monophyletic gene family with more than 400 members identified in Arabidopsis . RLKs are typically transmembrane (TM) proteins with a variable amino-terminal extracellular domain (ECD) and a conserved cytoplasmic serine/threonine kinase domain (KD) in the carboxyl-terminal region . The ECD regions play important roles in the recognition of internal signals or environmental stimuli and, according to their features, can be used to classify RLKs . More than 21 structural classes were classified in Arabidopsis RLKs based on their ECDs, with the largest one containing leucine-rich repeats (LRRs) . Phylogenetic-based analysis of the Arabidopsis RLKs using the KDs and structural comparison of their ECDs allowed the identification of more than 40 subfamilies .
In plants, LRR-RLK proteins constitute a diverse group of transmembrane receptors involved in many biological functions usually associated with growth and development [5–9] and responses to biotic and abiotic stresses [10–13]. More than 200 LRR-RLK genes have been identified in the fully sequenced Arabidopsis genome [14–16]. Concerning plant-microbe interactions, certain LRR-RLKs play essential roles in defense responses to pathogen attacks by recognizing conserved pathogen- or microbe-associated molecular patterns (PAMPs/MAMPs) such as flagellin and elongation factor thermo unstable (EF-Tu) [17, 18]. LRR-containing proteins are suitable for pathogen recognition because their structural plasticity allows them to bind to many distinct ligands, such as proteins, peptides or lipids . Receptor proteins that are able to recognize PAMPs/MAMPs are designated pattern-recognition receptors (PRRs)  and represent an essential step for the host to perceive and defend itself against pathogens by triggering innate immune responses. Many PRRs belong to the LRR-RLK subfamily [18, 20]. The Arabidopsis FLAGELLIN SENSING 2 (FLS2) , EF-TU RECEPTOR (EFR)  and rice XA21  are among the best-studied plant PRRs and can activate immunity responses by perceiving specific bacterial proteins (or derived peptidic epitopes). These well-characterized PRRs belong to the XII group of LRR-RLKs (LRR-XII), suggesting an important role in mediating immunity responses during plant-microbe interactions.
Citrus comprise some of the most economically important crops in the world, and the species of this group produce fruits with great commercial value, such as oranges, mandarins, lemons, grapefruits and pummelos. While Citrus clementina represents one of the major species of mandarins, consumed as fresh fruit , sweet orange (C. sinensis) has the largest commercial importance, mainly due to the orange juice market . The genomes of these species were recently sequenced and even though the identity and contribution of ancestors in the composition of the domesticated citrus genome was unclear, it is suggested that these crops are hybrids derived mainly from C. maxima and C. reticulata [26, 27]. Citrus species are mostly diploid and display a basic chromosome number of x = 9. Substantial segmental synteny is observed with other eudicots and an orthology relation of one to one across oranges and plants such as grape, strawberry and cacao suggests the inexistence of recent whole genome duplication (WGD), with the exception of a triplication genome shared by all eudicots . Citrus is part of the Sapindales order, a sister order of Brassicales into the Malvidae family, which allows the performance of studies involving genomic comparisons with Arabidopsis thaliana .
The main problem that affects the citrus culture worldwide is the huge amount of phytopathogens , which cause significant damage to the citrus agribusiness. Apomictic reproduction, high juvenility and a long cultivation period are characteristics that contribute to a narrow genetic diversity in citrus crops, which increases the probability of the appearance of diseases and makes it difficult for breeding programs to obtain materials with increased resistance to pathogens . Although there is narrow genetic diversity, there are different levels of resistance among Citrus species for different diseases, such as the Citrus canker [27–29], Alternaria brown spot , Huanglongbing  and Citrus variegated chlorosis (CVC) [32, 33]. Xylella fastidiosa, for instance, causes CVC in all commercial sweet orange varieties, but not in C. clementina, despite both species being derived from the same ancestors . The comparison of defense gene families among plants with different levels of resistance is a strategy for better understanding the host defense in plant-pathogen interactions. Considering the recent sequencing of the complete genomes of C. clementina and C. sinensis and the involvement of LRR-RLKs in defense responses, we performed in silico analyses to elucidate and compare the structural organization of LRR-RLK members from the Citrus databases. The LRR-RLK subfamily has been characterized in plants such as Arabidopsis, rice, Populus, tomato, and others [35–39], but no comprehensive study was performed for Citrus species.
Results and discussion
Identification of Citrus LRR-RLKs
To identify the LRR-RLK subfamily members encoded by C. sinensis and C. clementina genomes, we used a combined computational approach. The predicted proteomes of each Citrus species and A. thaliana were used as inputs (Table 1) to build signature databases with the InterProScan tool. The resulting data were then uploaded into relational databases.
A search for protein sequences with both kinase and transmembrane signatures was initially performed for the identification of surface RLK family homologs. The catalytic KD was detected in 1,169, 1,208, and 1,034 non-redundant protein sequences from C. clementina, C. sinensis and A. thaliana, respectively. Plant protein kinases are one of the largest existing gene families and represent approximately 4 % of all coding genes in A. thaliana ; a similar number was found for C. clementina (4.9 %) and C. sinensis (4.0 %). These percentages of genes encoding kinase proteins are close to what was found in poplar and rice . Among these protein sequences, 617, 626 and 466 members of C. clementina, C. sinensis and A. thaliana, respectively, contained the KD and TM helices (Table 2). In the present study, we did not work with alternative splicing variants, and we considered only the membrane surface RLKs in our analysis, which did not include the receptor-like cytoplasmic kinases (RLCKs) because they do not have TM domains [38, 42]. For A. thaliana, approximately 620 RLK members have been reported to be present in the genome, including almost 150 RLCKs . Thus, the number of cell surface RLKs identified for Arabidopsis in this work (466) is consistent with previous reports . Cell surface RLKs displaying LRR-type ECDs were considered putative LRR-RLK subfamily members. LRR-RLKs belong to a large subfamily for which we identified 300, 297 and 236 protein sequences in the C. clementina, C. sinensis and A. thaliana genomes, respectively, which represents approximately 50 % of the total surface RLKs in each genome. We also removed the A. thaliana sequences that were the products of alternative splicing, as previously performed for C. clementina and C. sinensis. The result was compared to an A. thaliana LRR-RLK dataset [14, 15] to improve accuracy. Thus, from the 236 A. thaliana protein sequences, we considered a final dataset containing 209 LRR-RLK sequences for further analysis (Table 2; Additional file 1). The KDs from LRR-RLKs of C. clementina, C. sinensis and A. thaliana were identified by searching for diagnostic domains according to the functional annotation in the Pfam database (Pkinase_Tyr PF07714 and Pkinase PF00069). As reported by Shiu et al. , some proteins exhibited more than one catalytic KD. In these cases, we further analyzed the catalytic domains to determine whether the truncated regions should be manually merged or kept as different KDs. After another round of filtering, such as removing gap-rich regions and excluding redundant sequences, the final dataset used for the evolutionary analyses contained a total of 302 and 304 KDs from C. clementina and C. sinensis, respectively, in addition to the 209 KDs from the A. thaliana LRR-RLKs (Table 2; Additional file 2).
Evolutionary analyses and structural organization of LRR-RLKs
An identification and classification of LRR-RLK from C. clementina and C. sinensis was previously done using genome assemblies obtained from the outdated Phytozome v.7 . In this work we used currently accepted genome assemblies to identify,classify and perform a comprehensive genomic study for the LRR-RLK subfamily groups in the C. clementina and C. sinensis. The KD sequences from each Citrus dataset were independently aligned with the KDs from all A. thaliana LRR-RLK subfamily members. Maximum-likelihood (ML) phylogenetic trees allowed us to estimate the evolutionary relationships among the sequences (Fig. 1). The Citrus sequences that clustered together with known members of A. thaliana LRR-RLK were defined as part of the correspondent group. The majority of clades in each phylogeny was well-supported with confidence statistical values above 0.70 (Additional files 3 and 4), demonstrating the reliability of the generated data (Fig. 1). The robustness of our analysis was confirmed by generating another phylogenetic tree using the LRR-RLKs from both Citrus species together in the same tree (Additional file 5). Of 606 KDs, 601 (>99 %) were grouped in well-supported clades, as observed in the individual analysis of the C. clementina and C. sinensis datasets (Fig. 1 ), reinforcing the hypothesis that these sequences were evolutionarily related.
The topology of ML phylogenetic trees allowed us to distinguish, in the C. clementina and C. sinensis genomes, the same 16 groups of LRR-RLKs (I to XVI) previously described for A. thaliana  that were used for Citrus classification (Table 3 ; Additional file 6). Additionally, group XII, which was the focus of our work, presents the characteristic division in two sub-groups (Fig. 1; Additional file 5) as also reported for other plant species [15, 37].
In general, the number of LRR-RLK receptors for most of the subfamily groups among the Citrus species was similar to A. thaliana, except for two of them, i.e., LRR-I and LRR-XII. Regarding LRR-I, 38 members were reported for A. thaliana, while in Citrus, we identified only 9 and 11 members for C. clementina and C. sinensis, respectively. Despite having a smaller genome , A. thaliana had about four times more RLKs in this group compared to the Citrus species. Although A. thaliana LRR-I includes receptor proteins associated with defense responses, such as IMPAIRED OOMYCETE SUSCEPTIBILITY 1 (IOS1)  and FLG22-INDUCED RECEPTOR-LIKE KINASE 1 (FRK1) , the majority of members in this group has unknown functions. According to Fischer et al. , the last common ancestor of angiosperms (LCAA) probably had only 7 LRR-I in the genome and the expansion rate was related to ancestral expansion rather than species-specific events, suggesting that the high copy number found in A. thaliana was due to Brassicaceae expansion and subsequent retention in this species. For Citrus, in contrast, the number of LRR-I remained the same as suggested by LCAA, perhaps because no recent WGD was detected in Citrus plants .
In relation to LRR-XII, C. clementina and C. sinensis showed 148 and 140 members, respectively, while in A. thaliana there were only 10 corresponding members. However, one of them (At2g24130) was not included in our analysis because it did not show a predicted transmembrane helix domain in TMHMM v 2.0. If again we compare this with the number of receptors in LCAA LRR-XII (13 genes) , it is possible to verify a slight reduction of this group in Arabidopsis, while for Citrus species, the LRR-XII had a stronger numerical expansion. Interestingly, as observed for C. clementina and C. sinensis, the LRR-XII group of different plant species also underwent an expansion [38, 39]. The dynamic rates of duplication, retention and loss of genes occurred independently in each subgroup of LRR-RLKs, which resulted in a distinct composition between species, for example, related to LRR-I and XII (Table 4). A. thaliana LRR-XII contains two of the most studied PRRs, i.e., FLS2 and EFR, which are involved in the perception of the bacterial PAMPs flagellin and EF-Tu, respectively [21, 22]. In addition to EFR and FLS2, the LRR-RLK XII XA21 from Oryza longistaminata is another important well-studied PRR , which recognizes the bacterial sulfated protein RaxX .
The expansion or reduction in the size of gene families is a result of evolutionary events usually related to duplications, de novo creation of genes and deletions . Selective pressure to perceive changing environment signals can drive the expansion of specific RLK subfamily groups that may have important functional effects related to adaptation, for example, to fast-evolving pathogens [14, 41]. It was reported that LRR-XII is a group that keeps expanding and their members are involved in biotic stress responses . In general, we observed that in Citrus and other crop species, the number of LRR-XII is increased in relation to LCAA (Table 4), suggesting that domestication may be contributing to the expansion of this group.
Evolutionary aspects of C. clementina and C. sinensis LRR-XII
Analysis of LRR-XII orthologs
Based on the large expansion of Citrus LRR-XII and its important role in response to biotic stresses, we further focused on homology studies involving this receptor group. Initially, we searched for orthologs through integrated analyses of phylogeny, sequence similarity and chromosomal distribution in the C. clementina and C. sinensis LRR-XII subfamily.
Understanding evolutionary aspects, such as paralogy and orthology relationships between genes, is important to deduce the evolution of species . The clades of the phylogenetic tree formed by C. clementina and C. sinensis LRR-XII sequences in association with the Bidirectional Best Hits (BBH) method were used to detect the LRR-XII orthologs. A total of 68 paired sequences were identified whose similarity was confirmed through 13 well-supported clade grouping of the sequences from the reconstructed phylogenetic tree (Fig. 2; Additional file 7).
The tree topology allowed us to distinguish two monophyletic groups, one formed by two clades (light blue and light gray), related to the LRR-XIIb members and the other formed by the remaining eleven clades, which represent the LRR-XIIa (Fig. 2 ), as can also be observed in Fig. 1 . LRR-XIIa harbors seven of the nine members from the previously assigned A. thaliana LRR-XII group. LRR-XIIb harbors the two remaining members of the group, At1g35710 and At4g08850, indicating a non-monophyletic origin for the LRR-XII subfamily. This distinct grouping of LRR-XII members was also observed in phylogenetic analysis using tomato  and A. thaliana .
The two Arabidopsis members mentioned above and the correspondent Citrus members from the same clade (light gray and light blue in Fig. 2), did not share enough similarity with the other LRR-XII members. Previous work showed that these two members fell in the LRR-XI group  and they should comprise an independent group of LRR-RLKs. Based on this approach, the resulting LRR-XII group from C. clementina and C. sinensis would comprise 123 and 126 members, respectively.
LRR-XII kinase RD motif analysis
Non-arginine-aspartate (non-RD) kinases are kinases that lack the highly conserved arginine (R) that precedes the catalytic aspartate (D), which is typical of most kinases . This subclass of kinases is often found as part of pattern recognition receptors [50, 51]. Considering the high incidence of pathogens that cause diseases to Citrus and their importance in the recognition of conserved microbial patterns, it is important to identify these classes of kinase proteins in the C. clementina and C. sinensis LRR-XII groups. A total of 93 of 148 and 94 of 140 LRR-XII elements were classified as non-RD in C. clementina and C. sinensis, respectively (Additional file 8), which represents approximately 70 % in both Citrus LRR-XII groups. Usually, non-RD kinases carry the cysteine (C) or a glycine (G) amino acid residue in the substitution of the highly conserved arginine (R) in the HRD motif  and the same is observed for both Citrus species, in which non-RD carrying C or G in place of R accounts for over 95 % (Fig. 3). However, in a few cases, tryptophan (Y) or serine (S) substitutes for R (less than 2 %).
These changes can affect the charge of the molecules and potentially modify the kinase regulatory and catalytic mechanisms . Only 7 members from C. clementina and 9 members from C. sinensis have a kinase with an alternative catalytic function (ACF), which did not show either RD or non-RD motifs. The non-RD kinases identified in Citrus LRR-XII open new possibilities for further studies involving the function of these receptors in defense responses by recognition of microbial signatures.
In addition to the identification of RD and non-RD kinases in the Citrus LRR-XII group, we analyzed the LRR-XII RD and non-RD kinase motifs in A. thaliana. Only two sequences showed the RD motif, while all the other seven were non-RD kinases, as already identified by other authors [15, 50]. These two RD kinases correspond to RLK members, which were grouped in a separated clade of the phylogenetic trees (At4g08850.1 and At1g35710.1) (Fig. 2). Additionally, all the non-RD kinases were grouped in the clade that contained LRR-XIIa (Fig. 2). Curiously, and in agreement with this classification, 98 % and 100 % of the RD members from C. clementina and C. sinensis, respectively, were grouped across LRR-XIIb. For the non-RD kinases, 97 % and 99 % from C. clementina and C. sinensis, respectively, were grouped across LRR-XIIa. This separation of the Citrus RD and non-RD kinases in two distinct clades, as observed in A. thaliana, suggests a possible common phylogenetic origin for each of the RD and non-RD kinases in the LRR-XII group.
LRR-XII tandem duplication paralogs in C. clementina and C. sinensis
We analyzed the paralogous sequences in the C. clementina and C. sinensis LRR-XII group because they can evolve new functions in relation to the ancestor proteins  (Additional files 9 and 10).
We identified 78 and 62 LRR-XII tandem duplicated sequences in the genomes of C. clementina and C. sinensis, respectively. Because both species are hybrids derived from a cross of C. reticulata and C. maxima, it is difficult to make any assumptions about when, in the evolutionary history of the group, these duplications appeared, even though it is known that they have the same parents . An analysis of the chromosomal distribution of LRR-XII allowed us to detect tandem duplications of this gene family on the chromosomes from C. clementina (Fig. 4a) and C. sinensis (Fig. 4b).
The Citrus LRR-XII receptors are distributed in all chromosomes but are mainly concentrated on chromosome 3 for C. clementina and on chromosome Un (for unassigned contigs) for C. sinensis. Chromosomes 4 and Un from C. clementina and C. sinensis, respectively, displayed the majority of the tandem duplicated genes. Duplication events seem to be pronounced in domesticated plant species . Segmental duplication events must have contributed to the acquisition of novel and distinct functions in relation to the ancestor by neo-functionalization or sub-functionalization . Considering the large number of pathogens in citrus crops, the observed large expansion of Citrus LRR-XII might be regarded as a plant-specific adaptation to extracellular signal perception, for example, to recognize different PAMPs .
Identification and distribution of LRR-XII gene clusters
Gene duplication is an important strategy for adaptive evolution in plants . To identify clusters that encompass LRR-XII tandem duplicated genes, we grouped these genes in each Citrus genome into the same cluster if they were arranged in a genomic fragment with a maximum of 200 Kb. LRR-XII gene clusters are present in all chromosomes, with the exception of chromosomes 1 and 3 from C. clementina and C. sinensis, respectively (Fig. 4; Additional file 11). A distribution analysis revealed 117 of 148 LRR-XII genes of C. clementina (79 %) were found in 33 cluster regions, and for C. sinensis, 94 of 140 LRR-XII genes (67.1 %) were distributed in 30 clusters. Tandem duplications seem to be an important mechanism for expansion of the LRR-XII group, which could be confirmed by the presence of the LRR-XII gene in clusters. Approximately 70 % and 63 % of these clusters are formed by tandem duplicated paralogs in the C. clementina and C. sinensis genomes, respectively. Wang et al.  also demonstrated high clustering and the importance of duplication events in the expansion of Citrus nucleotide binding site (NBS) genes, which is a large class of intracellular immune receptor genes that also display LRR domains beyond the nucleotide-binding site domain. Clustering in NBS gene loci has been reported in many species, including Arabidopsis and rice . On the other hand for LRR-XII genes, this expansion is not widespread in plants as the NBS genes are mainly observed in rice  and citrus.
Of the 68 LRR-XII orthologous pairs identified for C. clementina and C. sinensis, 46 and 38 genes from C. clementina and C. sinensis, respectively, were located in cluster regions (Additional file 7). We identified orthologous pairs in the same clusters, which suggested high conservation and correspondence of these clusters between C. sinensis and C. clementina genomes. These data suggested blocks of elevated homology among C. clementina and C. sinensis LRR-XII sequences and chromosome regions.
Syntenic blocks in LRR-XII and Citrus genomes
The establishment of synteny relations between species is an important tool to improve the understanding of genome evolution and the conserved biological functions among species . To better understand the evolution of C. clementina and C. sinensis LRR-XII, we searched for syntenic blocks in the chromosomes. The similarity identified among LRR-XII gene sequences from one species in the genome of another species allowed us to identify conserved blocks in the C. clementina and C. sinensis chromosomes (Fig. 5). When analyzing the collinearity between both genomes, 25 syntenic blocks (SBs) were found between LRR-XII from C. sinensis and C. clementina (Fig. 5a). Of the 68 orthologous pairs previously identified by BBH and phylogeny, only 20 were also verified within these pairwise syntenic genes (Additional file 12). The different number of SBs identified resulted from a more stringent algorithm in this analysis. In addition, another analysis using the Citrus species independently found a total of 39 SBs in the chromosomes of C. clementina when evaluated with the 140 C. sinensis LRR-XII genes. A comparable number of 40 SBs was verified in the C. sinensis genome matching the 148 C. clementina LRR-XII genes. Some LRR-XII genes from both Citrus genomes matched more than one locus in the chromosomes and each locus was considered an independent SB. The number of LRR-XII genes that displayed similarity with the genome was of 26 of 148 LRR-XII genes from C. clementina and 25 of 140 LRR-XII genes for C. sinensis (Additional file 12). These numbers might be higher because we used a stringent analysis to increase the reliability of the results and avoid false positive SBs. In general, a reciprocal homology was observed in the SBs between C. sinensis and C. clementina chromosomes. We observed SBs distributed throughout almost all the chromosomes of the Citrus genomes (Fig. 5b-c). The highest number of SBs was found in chromosome 6 in both species, with 17 SBs for C. clementina and 16 for C. sinensis. The C. sinensis chromosome 2 matched the highest number of C. clementina LRR-XII, presenting homology with 6 sequences. For C. clementina, in addition to chromosome 2, chromosome 6 also exhibited the highest number of matches with C. sinensis LRR-XII, each of them displaying linkage with 6 genes in the corresponding chromosomes (Fig. 5b-c). Curiously, genes belonging to the same chromosome in one species matched SBs in different chromosomes from the other species, as in chromosome 2 from C. clementina and chromosomes 5, 8 and 10 from C. sinensis (Fig. 5b-c). In the case of chromosome 2 from C. clementina, one LRR-XII gene matched an SB in chromosome 9 from C. sinensis, while all the others matched SBs in chromosome 2. Chromosomes 10 or UN are particular because they are composed of sequences that were not assembled in the correct chromosomes. Therefore, it is an artifact from the genome assembly because the Citrus genome has only 9 chromosomes, thus the LRR-XII genes identified in this chromosome must be located in other genomic regions. On the other hand, genes in chromosomes 5 and 8 from the C. sinensis genome matched SBs in the same chromosome of C. clementina (Chr 8). This miscorrelation of some LRR-XII genes and SBs in the same chromosomes could be a result of chromosomal rearrangements in the genomes.
This work provides the first comprehensive evolutionary analysis of the LRR-RLKs in Citrus. Lineage-specific expansion was observed in the Citrus LRR-XII group that might have occurred due to tandem duplications making the number of individuals larger compared to the majority of plant species. Considering the diverse number of pathogens affecting the domesticated citrus culture, the independent expansion of a defense-related group could be associated with an adaptive process related to plant-pathogen co-evolution. Our comparative data provided valuable information concerning these RLKs, opening new perspectives in the study of their function in diverse processes, such as development and defense responses in two worldwide important economic crops, specifically, sweet oranges and clementines.
Sequence database search
Predicted proteomes from Citrus clementina (Version 1.0, https://phytozome.jgi.doe.gov/pz/portal.html#!info?alias=Org_Cclementina), Citrus sinensis (Version CsiDB201301, http://citrus.hzau.edu.cn/orange/download/data.php)  and Arabidopsis thaliana (http://arabidopsis.org) were selected and downloaded. The InterProScan 4 package software (http://www.ebi.ac.uk/interpro/download.html) was used to identify different protein signatures in each proteome dataset . To recover and analyze the dataset, we developed local relational databases for each included plant species. It allowed us to extract and interpret the large amount of data obtained in this work. In-house Perl scripts and Structured Query Language (SQL) queries were used to analyze data files during the database building and searching processes. Access to these local relational databases was implemented using DbVisualizer version 9.1.7 (http://dbvis.com/).
Domain annotation and LRR-RLK retrieval
The LRR-RLK homologues were retrieved from the relational databases by searching for protein sequences with kinase, transmembrane and leucine-rich repeat domains. To recover the identifiers with KD, we considered Pkinase (PF00069) and Pkinase_Tyr (PF07714), according to the Pfam platform , as diagnostic domains. TMs were predicted from the TMHMM website (http://www.cbs.dtu.dk/services/TMHMM/) hosted at the Center for Biological Sequence Analysis, Technical University of Denmark. The prediction of transmembrane helices in the protein sequences were conducted according to the default parameters of version 2.0, without considering the old model option (version 1).
The following LRR diagnostic domains were searched: LRR_1 - Leucine Rich Repeat (PF00560), LRRNT - Leucine rich repeat N-terminal domain (PF01462), LRV - Leucine rich repeat variant (PF01816), LRRNT_2 - Leucine rich repeat N-terminal domain (PF08263), LRR_4 - Leucine rich repeats (2 copies) (PF12799), LRR_5 - Leucine rich repeats (6 copies) (PF13306), LRR_8 - Leucine rich repeat (PF13855), LRR_9 - Leucine-rich repeat (PF14580), LRRCT - Leucine rich repeat C-terminal domain (PF01463), LRR_2 - Leucine Rich repeat (PF07723), and LRR_3 - Leucine Rich repeat (PF07725). Only proteins containing LRRs, TM and KD were then considered to be putative LRR-RLK, and for this reason, At2g24130, which did not show TM, was not included in our analyses. Alternative splicing variants were excluded from our analysis.
Kinase domain alignment and phylogenetic analysis
Sequences of conserved KDs from Arabidopsis and Citrus LRR-RLK proteins were extracted by an in-house Pearl script that consider KD coordinates annotation from the Pfam database. In addition, six human kinase protein sequences were used as an outgroup (Additional file 13). The KD sequences were aligned using MAFFT version 7 (http://mafft.cbrc.jp/alignment/software/) with G-INS-i strategy and default parameters . The aligned sequences were visualized and manually refined using Jalview version 15.0 . The proteins with a short length (<100 aa) or large inserted gap regions were removed. Gap-rich columns were further filtered using trimAl v.1.3 with the gappyout method . To optimize the datasets for evolutionary analyses, the Decrease Redundancy tool, available as a resource at ExPaSy (www.expasy.org), was used to remove identical or distantly related sequences. The Decrease Redundancy parameters were set as 99 for “% max similarity” and 30 for “% min similarity”. Phylogenetic analyses were performed using the Maximum-likelihood method, as implemented in PhyML . Twelve different evolutionary models (JTT, LG, DCMut, MtREV, MtMam, MtArt, Dayhoff, WAG, RtREV, CpREV, Blosum62 and VT) were tested using ProtTest 2.4 software . The evolutionary model best fitting the data (best fit model) was determined by comparing the likelihood of the tested models according to the Akaike Information Criterion. A discrete gamma-distribution model with four rate categories plus invariant positions was assumed with the gamma parameter and the fraction of invariant positions was estimated from the data. Tree support values were estimated using approximate likelihood ratio test (ALRT), as implemented in PhyML. The ML trees were visualized and edited using the FigTree software (tree.bio.ed.ac.uk/software/figtree). The alignments are available at FigShare (10.6084/m9.figshare.3474752).
Identification of the RD motif in the kinase domain
Identification of the Citrus RD motif in the catalytic loop from the LRR-XII kinase subdomain was performed using multiple expectation maximization for motif (MEME) suite web server using default parameters . The kinases were classified as RD or non-RD according to the presence or absence of the Arg (R) in the conserved HRD motif, respectively.
Chromosomal distribution of LRR XII
The genomic coordinates of each LRR-XII gene from C. clementina and C. sinensis were used to determine their distribution in the Citrus chromosomes. The coordinates were retrieved accessing the genome browser from each Citrus database. The MapChart graphical tool  was used to generate schematic diagrams to represent the LRR-XII gene positions in the chromosomes.
LRR-XII orthologs and tandem duplicated paralogs
The identification of orthologous pairwise sequences among Citrus species was achieved through grouping in the phylogenetic tree and the BBH method. The Blastp searches were performed using all the C. sinensis and C. clementina LRR-RLK sequence proteins from group XII. For tandem duplicated paralogs searches, the results from Blasp were analyzed together with well-supported clades from the LRR-XII phylogenetic trees. The tandem duplicated paralogs were eligible when they formed the same clade and showed proximity in their chromosomal location. The identification of the LRR-XII gene clusters was performed from the arrangement of these genes in the chromosomes of each species. The LRR-XII genes were grouped in the same cluster if the genome location between two genes was within 200 kb in the chromosomes of C. sinensis and C. clementina.
LRR-XII gene synteny identification
The synteny analyses were performed using Sibelia software. Although this tool was originally optimized to efficiently identify syntenic blocks between closely related microbial genomes , this tool was employed because the chromosome comparisons were restricted to a small gene family of Citrus species with evolutionary proximity.
The minimal nucleotide length considered in the syntenic block was adjusted to 1,000 pb. Iterative de Bruijn graphs were used to show the homology results found across the LRR-XII and the genomic regions in the chromosomes.
ACF, alternative catalytic function; ALRT, approximate likelihood ratio test; BBH, bidirectional best hits; BLAST, basic local alignment search tool; C, cysteine; CsiDB, citrus sinensis database; ECD, extracellular domain; EFR, Ef-Tu receptor; Ef-Tu, elongation factor thermo unstable; FLS2, flagellin sensing 2; FRK1, flg22-induced receptor-like kinase 1; G, glycine; HRD, histidine-arginine-aspartate; IOS1, impaired oomycete susceptibility 1; Kb, kilobase; KD, kinase domain; LRRCT, leucine-rich repeat C-terminal; LRRNT, leucine-rich repeat N-terminal; LRR-RLK, leucine-rich repeat receptor-like kinases; LRV, leucine-rich repeat variant; MAFFT, multiple alignment using fast fourier transform; MAMP, microbe-associated molecular pattern; MEME, multiple expectation maximization for motif elicitation; ML, maximum-likelihood; NBS, nucleotide binding site; PAMP, pathogen-associated molecular pattern; PRR, pattern-recognition receptor; R, arginine; RD, arginine-aspartate; RLCK, receptor-like cytoplasmic kinase; RLK, receptor-like kinase; S, serine; SB, syntenic block; SQL, structured query language; TAIR, the Arabidopsis information resource; TM, transmembrane; Un, unassigned; Y, tryptophan.
Osakabe Y, Yamaguchi-Shinozaki K, Shinozaki K, Tran L-SP. Sensing the environment: key roles of membrane-localized kinases in plant perception and response to abiotic stress. J Exp Bot. 2013;64:445–58.
Shiu SH, Bleecker AB. Receptor-like kinases from Arabidopsis form a monophyletic gene family related to animal receptor kinases. Proc Natl Acad Sci U S A. 2001;98:10763–8.
Walker JC. Structure and function of the receptor-like protein kinases of higher plants. Plant Mol Biol. 1994;26:1599–609.
Shiu SH, Bleecker AB. Plant receptor-like kinase gene family: diversity, function, and signaling. Sci STKE. 2001. doi:10.1126/stke.2001.113.re22.
Clark SE, Williams RW, Meyerowitz EM. The CLAVATA1 gene encodes a putative receptor kinase that controls shoot and floral meristem size in Arabidopsis. Cell. 1997;89:575–85.
van der Knaap E, Song WY, Ruan DL, Sauter M, Ronald PC, Kende H. Expression of a gibberellin-induced leucine-rich repeat receptor-like protein kinase in deepwater rice and its interaction with kinase-associated protein phosphatase. Plant Physiol. 1999;120:559–70.
Li J, Wen J, Lease KA, Doke JT, Tax FE, Walker JC. BAK1, an Arabidopsis LRR Receptor-like Protein Kinase, Interacts with BRI1 and Modulates Brassinosteroid Signaling. Cell. 2002;110:213–22.
Osakabe Y, Maruyama K, Seki M, Satou M, Shinozaki K, Yamaguchi-shinozaki K. Leucine-Rich Repeat Receptor-Like Kinase1 Is a Key Membrane-Bound Regulator of Abscisic Acid Early Signaling in Arabidopsis. Plant Cell. 2005;17:1105–19.
Pitorre D, Llauro C, Jobet E, Guilleminot J, Brizard J-P, Delseny M, Lasserre E. RLK7, a leucine-rich repeat receptor-like kinase, is required for proper germination speed and tolerance to oxidative stress in Arabidopsis thaliana. Planta. 2010;232:1339–53.
Ouyang SQ, Liu YF, Liu P, Lei G, He SJ, Ma B, Zhang WK, Zhang JS, Chen SY. Receptor-like kinase OsSIK1 improves drought and salt stress tolerance in rice (Oryza sativa) plants. Plant J. 2010;62:316–29.
de Lorenzo L, Merchan F, Laporte P, Thompson R, Clarke J, Sousa C, Crespi M. A novel plant leucine-rich repeat receptor kinase regulates the response of Medicago truncatula roots to salt stress. Plant Cell. 2009;21:668–80.
Zipfel C. Pattern-recognition receptors in plant innate immunity. Curr Opin Immunol. 2008;20:10–6.
Roux M, Schwessinger B, Albrecht C, Chinchilla D, Jones A, Holton N, Malinovsky FG, Tör M, de Vries S, Zipfel C. The arabidopsis leucine-rich repeat receptor-like kinases BAK1/SERK3 and BKK1/SERK4 are required for innate immunity to Hemibiotrophic and Biotrophic pathogens. Plant Cell. 2011;23:2440–55.
Lehti-Shiu MD, Zou C, Hanada K, Shiu S-H. Evolutionary history and stress regulation of plant receptor-like kinase/pelle genes. Plant Physiol. 2009;150:12–26.
Gou X, He K, Yang H, Yuan T, Lin H, Clouse SD, Li J. Genome-wide cloning and sequence analysis of leucine-rich repeat receptor-like protein kinase genes in Arabidopsis thaliana. BMC Genomics. 2010;11:19.
Shiu S, Bleecker AB. Expansion of the Receptor-Like Kinase/Pelle Gene Family and Receptor-Like Proteins in Arabidopsis. Plant Physiol. 2003;132:530–43.
Boller T, Felix G. A renaissance of elicitors: perception of microbe-associated molecular patterns and danger signals by pattern-recognition receptors. Annu Rev Plant Biol. 2009;60:379–406.
Zipfel C. Plant pattern-recognition receptors. Trends Immunol. 2014;35:345–51.
Holt III BF, Mackey D, Dangl JL. Recognition of pathogens by plants. Curr Biol. 2000;10:R5–7.
Goff KE, Ramonell KM. The role and regulation of receptor-like kinases in plant defense. Gene Regul Syst Biol. 2007;1:167–75.
Gómez-Gómez L, Boller T. FLS2: an LRR receptor-like kinase involved in the perception of the bacterial elicitor flagellin in Arabidopsis. Mol Cell. 2000;5:1003–11.
Zipfel C, Kunze G, Chinchilla D, Caniard A, Jones JDG, Boller T, Felix G. Perception of the bacterial PAMP EF-Tu by the receptor EFR restricts Agrobacterium-mediated transformation. Cell. 2006;125:749–60.
Song WY, Wang GL, Chen LL, Kim HS, Pi LY, Holsten T, Gardner J, Wang B, Zhai WX, Zhu LH, Fauquet C, Ronald P. A receptor kinase-like protein encoded by the rice disease resistance gene, Xa21. Science. 1995;270(5243):1804–6.
Talon M, Gmitter FG. Citrus genomics. Int J Plant Genomics. 2008;2008:528361.
Liu Y, Heying E, Tanumihardjo AS. History, Global Distribution, and Nutritional Importance of Citrus Fruits. Compr Rev Food Sci Food Saf. 2012;11(6):530–45.
Xu Q, Chen L-L, Ruan X, Chen D, Zhu A, Chen C, Bertrand D, Jiao W-B, Hao B-H, Lyon MP, Chen J, Gao S, Xing F, Lan H, Chang J-W, Ge X, Lei Y, Hu Q, Miao Y, Wang L, Xiao S, Biswas MK, Zeng W, Guo F, Cao H, Yang X, Xu X-W, Cheng Y-J, Xu J, Liu J-H, et al. The draft genome of sweet orange (Citrus sinensis). Nat Genet. 2013;45:59–66.
Wu GA, Prochnik S, Jenkins J, Salse J, Hellsten U, Murat F, Perrier X, Ruiz M, Scalabrin S, Terol J, Takita MA, Labadie K, Poulain J, Couloux A, Jabbari K, Cattonaro F, Del Fabbro C, Pinosio S, Zuccolo A, Chapman J, Grimwood J, Tadeo FR, Estornell LH, Muñoz-Sanz JV, Ibanez V, Herrero-Ortega A, Aleza P, Pérez-Pérez J, Ramón D, Brunel D, et al. Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication. Nat Biotechnol. 2014;32:656–62.
Pitino M, Armstrong CM, Duan Y. Rapid screening for citrus canker resistance employing pathogen-associated molecular pattern-triggered immunity responses. Hortic Res. 2015;2:15042.
Shi Q, Febres VJ, Jones JB, Moore GA. Responsiveness of different citrus genotypes to the Xanthomonas citri ssp. citri -derived pathogen-associated molecular pattern (PAMP) flg22 correlates with resistance to citrus canker. Mol Plant Pathol. 2015;16:507–20.
Cuenca J, Aleza P, Vicent A, Brunel D, Ollitrault P, Navarro L. Genetically based location from triploid populations and gene ontology of a 3.3-mb genome region linked to Alternaria brown spot resistance in citrus reveal clusters of resistance genes. PLoS One. 2013;8:e76755.
Folimonova SY, Robertson CJ, Garnsey SM, Gowda S, Dawson WO. Examination of the Responses of Different Genotypes of Citrus to Huanglongbing (Citrus Greening) Under Different Conditions. Phytopathology. 2009;99:1346–54.
Gmitter FG, Chen C, Machado MA, de Souza AA, Ollitrault P, Froehlicher Y, Shimizu T. Citrus genomics. Tree Genet Genomes. 2012;8:611–26.
Rodrigues CM, de Souza AA, Takita MA, Kishi LT, Machado MA. RNA-Seq analysis of Citrus reticulata in the early stages of Xylella fastidiosa infection reveals auxin-related genes as a defense response. BMC Genomics. 2013;14:676.
Velasco R, Licciardello C. A genealogy of the citrus family. Nat Biotech. 2014;32:640–2.
Diévart A, Gilbert N, Droc G, Attard A, Gourgues M, Guiderdoni E, Périn C. Leucine-rich repeat receptor kinases are sporadically distributed in eukaryotic genomes. BMC Evol Biol. 2011;11:367.
Zan Y, Ji Y, Zhang Y, Yang S, Song Y, Wang J. Genome-wide identification, characterization and expression analysis of populus leucine-rich repeat receptor-like protein kinase genes. BMC Genomics. 2013;14:318.
Sakamoto T, Deguchi M, Brustolini OJB, Santos AA, Silva FF, Fontes EPB. The tomato RLK superfamily: phylogeny and functional predictions about the role of the LRRII-RLK subfamily in antiviral defense. BMC Plant Biol. 2012;12:229.
Shiu S, Karlowski WM, Pan R, Tzeng Y, Mayer KFX, Li W. Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice. Plant Cell. 2004;16:1220–34.
Fischer I, Diévart A, Droc G, Dufayard J-F, Chantret N. Evolutionary dynamics of the Leucine-Rich Repeats Receptor-Like Kinase (LRR-RLK) subfamily in angiosperms. Plant Physiol. 2016;170:1595–610.
Nemoto K, Seto T, Takahashi H, Nozawa A, Seki M, Shinozaki K, Endo Y, Sawasaki T. Autophosphorylation profiling of Arabidopsis protein kinases using the cell-free system. Phytochemistry. 2011;72:1136–44.
Lehti-Shiu MD, Shiu S-H. Diversity, classification and function of the plant protein kinase superfamily. Philos Trans R Soc Lond B Biol Sci. 2012;367:2619–39.
Vij S, Giri J, Dansana PK, Kapoor S, Tyagi AK. The receptor-like cytoplasmic kinase (OsRLCK) gene family in rice: organization, phylogenetic relationship, and expression during development and stress. Mol Plant. 2008;1:732–50.
Arabidopsis Genome Initiative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000;408:796–815.
Hok S, Danchin EGJ, Allasia V, Panabières F, Attard A, Keller H. An Arabidopsis (malectin-like) leucine-rich repeat receptor-like kinase contributes to downy mildew disease. Plant Cell Environ. 2011;34:1944–57.
Asai T, Tena G, Plotnikova J, Willmann MR, Chiu W-L, Gomez-Gomez L, Boller T, Ausubel FM, Sheen J. MAP kinase signalling cascade in Arabidopsis innate immunity. Nature. 2002;415:977–83.
Pruitt RN, Schwessinger B, Joe A, Thomas N, Liu F, Albert M, Robinson MR, Chan LJG, Luu DD, Chen H, Bahar O, Daudi A, Vleesschauwer D De, Caddell D, Zhang W, Zhao X, Li X, Heazlewood JL, Ruan D, Majumder D, Chern M, Kalbacher H, Midha S, Patil PB, Sonti R V, Petzold CJ, Liu CC, Brodbelt JS, Felix G, Ronald PC. The rice immune receptor XA21 recognizes a tyrosine-sulfated protein from a Gram-negative bacterium. Sci Adv. 2015; doi: 10.1126/sciadv.1500245.
Guo YL. Gene family evolution in green plants with emphasis on the origination and evolution of Arabidopsis thaliana genes. Plant J. 2013;73:941–51.
Gabaldón T, Koonin EV. Functional and evolutionary implications of gene orthology. Nat Rev Genet. 2013;14:360–6.
Schwessinger B, Ronald PC. Plant innate immunity: perception of conserved microbial signatures. Annu Rev Plant Biol. 2012;63:451–82.
Dardick C, Ronald P. Plant and animal pathogen recognition receptors signal through non-RD kinases. PLoS Pathog. 2006;2:14–28.
Dardick C, Schwessinger B, Ronald P. Non-arginine-aspartate (non-RD) kinases are associated with innate immune receptors that recognize conserved microbial signatures. Curr Opin Plant Biol. 2012;15:358–66.
Tatusov RL, Koonin EV, Lipman DJ. A Genomic Perspective on Protein Families. Science. 1997;278:631–8.
Corbi J, Debieu M, Rousselet A, Montalent P, Le Guilloux M, Manicacci D, Tenaillon MI. Contrasted patterns of selection since maize domestication on duplicated genes encoding a starch pathway enzyme. Theor Appl Genet. 2011;122:705–22.
Rastogi S, Liberles DA. Subfunctionalization of duplicated genes as a transition state to neofunctionalization. BMC Evol Biol. 2005. doi:10.1186/1471-2148-5-28
Wendel JF, Flagel LE, Wendel JF. Gene duplication and evolutionary novelty in plants. New Phytol. 2009;183:557–64.
Wang Y, Zhou L, Li D, Dai L, Lawton-Rauh A, Srimani PK, Duan Y, Luo F. Genome-Wide Comparative Analysis Reveals Similar Types of NBS Genes in Hybrid Citrus sinensis Genome and Original Citrus clementine Genome and Provides New Insights into Non-TIR NBS Genes. PLoS One. 2015. doi:10.1371/journal.pone.0121893.
Hwang S-G, Kim DS, Jang CS. Comparative analysis of evolutionary dynamics of genes encoding leucine-rich repeat receptor-like kinase between rice and Arabidopsis. Genetica. 2011;139:1023–32.
Schmidt R. Synteny: recent advances and future prospects. Curr Opin Plant Biol. 2000;3:97–102.
Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R. InterProScan: Protein domains identifier. Nucleic Acids Res. 2005;33:116–20.
Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, Heger A, Holm L, Sonnhammer ELL, Eddy SR, Bateman A, Finn RD. The Pfam protein families database. Nucleic Acids Res. 2012;40:290–301.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–80.
Waterhouse AM, Procter JB, Martin DM, Clamp M, Barton GJ. Jalview Version 2--a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009;25:1189–91.
Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–3.
Guindon S, Gascuel O. A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood. Syst Biol. 2003;52:696–704.
Abascal F, Zardoya R, Posada D. ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005;21:2104–5.
Bailey TL, Williams N, Misleh C, Li WW. MEME: Discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 2006;34:369–73.
Voorrips RE. MapChart: software for the graphical presentation of linkage maps and QTLs. J Hered. 2001;93:77–8.
Minkin I, Patel A, Kolmogorov M, Vyahhi N, Pham S. Sibelia: A scalable and comprehensive synteny block generation tool for closely related microbial genomes. Proc 13th Workshop Algs in Bioinf (WABI’13) Lecture Notes in Comp Sci. 2013;8126:215–29.
Zhou F, Guo Y, Qiu L. Genome-wide identification and evolutionary analysis of leucine-rich repeat receptor-like protein kinase genes in soybean. BMC Plant Biol. 2016;16:1–13.
Rameneni JJ, Lee Y, Dhandapani V, Yu X, Choi SR. Genomic and Post-Translational Modification Analysis of Leucine-Rich-Repeat Receptor-Like Kinases in Brassica rapa. PLoS One. 2015;10:1–23.
Argout X, Salse J, Aury J, Guiltinan MJ, Droc G, Gouzy J, Allegre M, Chaparro C, Legavre T, Maximova SN, Abrouk M, Murat F, Fouet O, Poulain J, Ruiz M, Roguet Y, Rodier-goud M, Barbosa-neto JF, Sabot F, Kudrna D, Ammiraju JSS, Schuster SC, Carlson JE, Sallet E, Schiex T, Dievart A, Kramer M, Gelley L, Shi Z, Bérard A, et al. The genome of Theobroma cacao. Nat Genet. 2011;43:101–9.
The authors thank Nicholas Holton for critical reading of the manuscript before submission. The authors have no conflict of interest.
This work was supported by research grants from Fundação de Amparo à Pesquisa do Estado de São Paulo (2013/17485-7). D.M.M is a fellow PhD from FAPESP (2013/01412-0). A.A.S., G.C.O. and M.A.T. are recipient of research fellowships from CNPq.
Research in the Zipfel laboratory is funded by the Gatsby Charitable Foundation and the Two Blades Foundation.
Availability of data and materials
Phylogenetic data have been deposited to FigShare and are accessible via https://dx.doi.org/10.6084/m9.figshare.3474752.v1. Additional supporting data are included as Additional files.
AAS and MAT planned and supervised the study. AAS, MAT, LLSS and GCO contributed to the design of the analysis. DMM and LLSS performed the analysis of the data. NVS carried out the syntenic analysis. DMM drafted the manuscript. DMM, AAS, MAT, LLSS and CZ contributed to the interpretation of the data and provided intellectual input. AAS, MAT and CZ revised the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Additional file 1:
Amino acid sequences from Arabidopsis thaliana LRR-RLKs. Dataset containing 209 LRR-RLK sequences obtained from http://arabidopsis.org/. (XLSX 137 kb)
Additional file 2:
Amino acid sequences from C. clementina, C. sinensis and A. thaliana kinase domains. The sequences were obtained by in-house Perl scripts using kinase coordinates from pfam database. (XLSX 108 kb)
Additional file 3:
Phylogenetic tree of LRR-RLKs from Citrus clementina and Arabidopsis thaliana. The phylogenetic tree was reconstructed with amino acid sequences from kinase domains by Maximum-likelihood method. The groups of LRR-RLK subfamily (I – XVI) are separated in different colors. Tree support values (aLRT) are indicated. (PDF 68 kb)
Additional file 4:
Phylogenetic tree of LRR-RLKs from Citrus sinensis and Arabidopsis thaliana. The phylogenetic tree was reconstructed with amino acid sequences from kinase domains by Maximum-likelihood method. The groups of LRR-RLK subfamily (I – XVI) are separated in different colors. Tree support values (aLRT) are indicated. (PDF 66 kb)
Additional file 5:
Phylogenetic tree of LRR-RLKs from Citrus clementina, Citrus sinensis and Arabidopsis thaliana. The phylogenetic tree was reconstructed with amino acid sequences from kinase domains by Maximum-likelihood method. The groups of LRR-RLK subfamily (I – XVI) are separated in different colors. (PDF 69 kb)
Additional file 6:
Complete list and classification of LRR-RLKs from C. clementina, C. sinensis and A. thaliana. Citrus sequences were categorized in 16 groups of LRR-RLKs. (XLSX 55 kb)
Additional file 7:
Pairwise list of orthologues LRR-XII subfamily members of C. clementina and C. sinensis. The colors represent the gene position in the same phylogenetic group (Fig. 2 ). * represents genes in clusters. (XLSX 11 kb)
Additional file 8:
Classification of kinases from C. clementina and C. sinensis LRR-RLK as RD, non-RD and ACF. The kinases classification was done according to the presence or absence of the Arg (R) in the conserved HRD motif. (XLSX 13 kb)
Additional file 9:
Statistics of the tandem duplicated paralogues sequences from C. clementina LRR-XII. Comparative data generated by Blastp using all C. clementina LRR-XII. (XLSX 13 kb)
Additional file 10:
Statistics of the tandem duplicated paralogues from C. sinensis LRR-XII. Comparative data generated by Blastp using all C. sinensis LRR-XII. (XLSX 12 kb)
Additional file 11:
List of C. clementina and C. sinensis LRR-XII localized in cluster region. The chromosome distance of 200 kb was used as a reference to include the LRR-XII genes in the same cluster region. (XLSX 12 kb)
Additional file 12:
Syntenic blocks (SBs) of LRR-XII from C. sinensis and C. clementina. SBs were obtained from genome collinearity and independent homology analysis. (XLSX 18 kb)
Additional file 13:
Amino acid sequences of human kinase proteins. Sequences used as outgroup in the phylogenetic analysis with Citrus and Arabidopsis LRR-RLKs. (XLSX 10 kb)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Magalhães, D.M., Scholte, L.L.S., Silva, N.V. et al. LRR-RLK family from two Citrus species: genome-wide identification and evolutionary aspects. BMC Genomics 17, 623 (2016). https://doi.org/10.1186/s12864-016-2930-9
- Citrus sinensis
- Citrus clementina
- Leucine-rich repeat receptor-like kinase