- Research article
- Open Access
Genome-wide identification, characterization and expression analysis of populusleucine-rich repeat receptor-like protein kinase genes
BMC Genomics volume 14, Article number: 318 (2013)
Leucine-rich repeat receptor-like kinases (LRR-RLKs) comprise the largest group within the receptor-like kinase (RLK) superfamily in plants. This gene family plays critical and diverse roles in plant growth, development and stress response. Although the LRR-RLK families in Arabidopsis and rice have been previously analyzed, no comprehensive studies have been performed on this gene family in tree species.
In this work, 379 LRR-RLK genes were retrieved from the Populus trichocarpa genome and further grouped into 14 subfamilies based on their structural and sequence similarities. Approximately 82% (312 out of 379) of the PtLRR-RLK genes are located in segmental duplication blocks indicating the role of duplication process in the expansion of this gene family. The conservation and variation in motif composition and intron/exon arrangement among PtLRR-RLK subfamilies were analyzed to provide additional support for their phylogenetic relationship and more importantly to indicate the potential divergence in their functions. Expression profiling of PtLRR-RLKs showed that they were differentially expressed in different organs and tissues and some PtLRR-RLKs were specifically expressed in meristem tissues, which indicated their potential involvement in tissue development and differentiation. For most AtLRR-RLKs with defined functions, Populus homologues exhibiting similar expression patterns could be identified, which might indicate the functional conservation during evolution. Among 12 types of environmental cues analyzed by the genome-wide microarray data, PtLRR-RLKs showed specific responses to shoot organogenesis, wounding, low ammonium feeding, hypoxia and seasonal dormancy, but not to drought, re-watering after drought, flooding, AlCl3 treatment and bacteria or fungi treatments.
This study provides the first comprehensive genomic analysis of the Populus LRR-RLK gene family. Segmental duplication contributes significantly to the expansion of this gene family. Populus and Arabidopsis LRR-RLK homologues not only share similar genetic structures but also exhibit comparable expression patterns which point to the possible functional conservation of these LRR-RLKs in two model systems. Transcriptome profiling provides the first insight into the functional divergence among PtLRR-RLK gene subfamilies and suggests that they might take important roles in growth and adaptation of tree species.
Plant cells are able to sense and transduce signals through cell surface receptors which mediate the cell-to-cell communication by binding to the extracellular ligands and possessing protein kinase catalytic activities . In 1990, the first plant receptor-like kinase (RLK) was identified in maize  and since then, many RLKs have been identified from other plant species. According to the classification based on the extracellular domains, the major group of plant RLK is the leucine-rich repeat RLK family (LRR-RLK) . The structural features of LRR-RLKs include an extracellular receptor domain to perceive signals, a single-pass transmembrane domain to anchor the protein within the membrane and a cytoplasmic serine/threonine (ser/thr) protein kinase domain to transduce the signal downstream via autophosphorylation followed by further phosphorylation of specific substrates [4, 5].
Previous reports have classified plant LRR-RLK genes into two broad categories . First, they are important in plant growth and development including morphogenesis, organogenesis and hormone signaling. Second, many LRR-RLKs respond to abiotic and biotic stress and therefore could be defense-related. Some LRR-RLKs have been demonstrated to possess dual functions due to the cross-talk between defense and developmental pathways or due to the recognition of multiple ligands by one signal receptor . For instance, ERECTA is involved in both ovule development and resistance to bacterial wilt [7, 8]. Although important progress has been made in understanding LRR-RLK functions in recent years, open questions still remain for most LRR-RLKs. The phenotypes associated with various LRR-RLK mutants show that they play roles in diverse processes during growth and development . Meanwhile, the functional redundancy of LRR-RLK family members definitely adds to the complexity of the signaling network they mediate. For example, CLAVATA1 (CLV1) forms a receptor complex with CLV2 upon perception of the CLV3/ESR-related (CLE) peptide derived from CLV3 in the shoot apical meristem to regulate the expression pattern of the stem cell-promoting transcription factor WUSCHEL (WUS) [10–12]. In parallel with CLV1, additional receptors, namely Barely any Meristem (BAM1, BAM2, and BAM3), exhibit similar sequences as CLV1 but perform seemingly contradictory functions. While CLV1 promotes stem cell differentiation, BAM receptors are required for stem cell maintenance . It has been shown that CLV1 and BAM receptors have retained significant similarity in their biochemical function and the differences in their genetic functions appear to be largely driven by their distinctive expression patterns .
LRR-RLKs seem to have evolved to acquire novel and diverse functions through neofunctionalization and subfunctionalization by extensive gene duplication . The drastic expansion of this gene family in the land plant lineage is regarded as a plant-specific adaptation for extracellular signal sensing and propagation [15, 16]. As a forest model organism, poplar is a fast-growing diploid plant that has attracted much attention since its whole genome being sequenced . The structural features and expression profiles of LRR-RLK gene family members have been extensively described in Arabidopsis and rice, however, there has been much less information about this family in woody species including poplar. In the current study, the entire LRR-RLK gene family of Populus trichocarpa was comprehensively identified and analyzed by incorporating sequence phylogeny, gene organization, conserved motif, expression profiling, and gene adaption analysis. Our results provide a framework for further functional investigation on Populus LRR-RLKs and contribute to a better understanding of the complexity of LRR-RLK gene family in higher plants.
Results and discussion
Composition and phylogenetic analysis of LRR-RLK gene family in populus trichocarpa
To date, approximately 213 and 309 LRR-RLK genes have been identified in the fully sequenced Arabidopsis and rice genomes, respectively [18, 19]. In this work, a larger LRR-RLK gene family composed of 379 members was identified in the P. trichocarpa genome. The number of LRR-RLK genes in Populus is roughly 1.78 fold of that in Arabidopsis, which is consistent with the ratio of putative Populus homologues to each Arabidopsis gene (1.4~1.6) . The detailed information of LRR-RLK family genes in Populus including the accession numbers and the characteristics of the encoded proteins is listed in Additional file 1 and the summarized information concerning each group or subgroup is presented in Table 1. Since the diversity of extracellular domains (ECDs) represents the capability of LRR-RLKs to recognize various ligands and thus constitute the basis of their functional versatility , we first identified the ECD for each PtLRR-RLK and constructed the phylogenetic tree to determine their evolutionary relationship (Figure 1, Additional file 2). It has been shown that many events which resulted in the fusion between ECDs and kinase domains occurred early in land plant evolution, thus RLK genes with related kinase sequences tend to have similar ECDs [15, 20, 21]. In this work, the phylogenetic relationship among the PtLRR-RLKs was also examined based on their catalytic kinase domains and similar categories were obtained (Additional file 3). Since the nodes of the phylogenetic tree based on the ECDs exhibit the best confidence of support, PtLRR-RLKs were classified into 14 subfamilies (I to XIV) accordingly (Figure 1). No well-supported positions could be identified for six PtLRR-RLKs, so they were not included in the phylogeny (Additional file 1). When PtLRR-RLKs were clustered with AtLRR-RLKs (Additional file 4), the numbering for the Populus LRR-RLK subfamilies was determined based on the nomenclature of the majority of Arabidopsis homologues within the same group. The Populus subfamilies I, II, III and XIII were grouped together with Arabidopsis LRR-RLKs involved in organ/tissue development and with the ones involved in defense signaling. Group IV included only two Arabidopsis Inflorescence Meristem Receptor-like Kinase (IMK) genes which are involved in cell fate specification and proliferation. Group V included the Arabidopsis Strubbelig-receptor Family (SRF) gene family members that affect different aspects of cell wall biology [22, 23] and the SCM gene involved in root hair specification . Group VI, VII, VIII and IX had no Arabidopsis orthologs with identified functions. Group X was grouped together with Arabidopsis genes involved in brassinosternoid and peptide signaling such as BAK1-interacting Receptor1 (BIR1), BAK1-interacting Receptor-like (BIR-like), and Phytosulfokin receptor1-2 (PSKR1-2) genes . Subgroups XI-a and XI-b were represented with Arabidopsis LRR-RLKs with important roles in organ morphogenesis, cell fate specification and vascular development such as CLV1 and Phloem Intercalated with Xylem (PXY) [26–29], while in the subgroup XI-c, PEP1 receptor1-2 (AtPEPR1-2) is involved in abscisic acid signaling and defense response [30, 31]. For group XII, subgroup XII-b is Populus specific and subgroup XII-a was clustered with Flagellin-sensitive2 (FLS2) and EF-Tu receptor (EFR) which take part in innate immunity against pathogens [32, 33]. Group XIV only included Excess Microsporocytes1 (EMS1) gene, which is involved in endosperm and pollen development . The dispersal pattern of 63 Arabidopsis LRR-RLKs with well-defined roles prompted consideration of the ancestral role of distinct PtLRR-RLK subfamilies and there is a possibility that PtLRR-RLKs belonging to distinct subfamilies perform certain functions in different developmental aspects. For example, the subgroups XI-a and b are more likely to be involved in plant growth and development, while XI-c could be more likely to take roles in plant-microbe interactions. The large size of the Populus LRR-RLK gene family has been regarded as a indication of a great need for LRR-RLK genes to participate in more complicated transcriptional regulations in woody species . Meanwhile, the species-specific genes could play important roles in plant responses to a variation of biotic factors, such as the variation of the spectrum of pathogens , so it would be very attractive to investigate the functions of the poplar-specific subgroups identified in this work.
Intron–exon organizations of PtLRR-RLKs
The presence of multiple introns has been shown to be essential for ERECTA expression in Arabidopsis, so the intron–exon organizations of PtLRR-RLKs were examined for a clearer understanding of their potential functions. Additional file 5 provides the detailed illustration of the distribution and position of introns for each PtLRR-RLK genes and Figure 2 listed the representative intron/exon structures and their distributions among different gene subfamilies (Figure 2). Out of 379 Populus LRR-RLK genes, 30 had alternative mRNA splicing modes and 25 genes had no intron. One, two, three, four, and five introns was found in 153, 54, 23, 6 and 3 genes, respectively. One hundred and fifteen genes had more than five introns and 72 out of them had more than ten introns (Additional file 5). In terms of exon/intron organization, most of the closely related Populus LRR-RLK genes have roughly the same number and location of introns (Figure 2), which strongly supports their close evolutionary relationship. Populus and Arabidopsis genes belonging to the same subfamilies also exhibit similar genomic features. For example, the gene structure of the Populus subfamily XI were fairly simple and has only one or two introns over their full length sequence, except three genes in the subgroup XI-b which contain as many as 26 introns. Arabidopsis homologues of this group have been shown to play important roles in plant development and organogenesis and most of them contained less than two introns except ERECTA and ERECTA-LIKE1-2 (ERL1-2) which contained as many as 26 introns, this is the most complicated intron/exon structure of AtLRR-RLKs. Although all of the 63 AtLRR-RLKs with known functions could be matched with Populus homologues with similar intron/exon structures, the exactly same genetic structures as AtLRR-RLKs were only found in Populus group V, XI and XIII and interestingly, all of them are developmental genes responsible for cell fate specification and morphogenesis (Additional file 6). These results confirmed that the common ancestral genes of PtLRR-RLKs and AtLRR-RLKs already possess multiple intron/exon structures and probably the complicated mRNA processing modes as well. Meanwhile, it seemed possible that the development-related LRR-RLK genes are more conserved in the evolution of genetic structures than the defense-related LRR-RLKs due to their indispensible roles for plant life.
Conserved motifs of PtLRR-RLKgenes
To further reveal the diversification and functional potentials of Populus LRR-RLKs, their conserved motifs were investigated and the consistency of domain arrangement for each subfamily was determined using the Multiple EM for Motif Elicitation (MEME) motif detection software . The LRR motif is usually composed of 20–29 residues with conserved leucines  and the consensus residues within the LRR motif were thought to provide a structural skeleton for protein-protein interactions and non-consensus residues within LRRs are though to determine the specificity of such interactions . In total, 17 LRR-related motifs were identified among PtLRR-RLK family members and the basic LRR motif was concluded as LxxLxLxxNx L/f sGx I/l Pxx l/I gxLxx, which shows a good match to the plant LRR consensus LxxLxLxxNxLxGxIPxxLxxLxx and was slightly different from the basic LRR motif in rice (LxxLxLxxNx L/f xGx I/l Pxx l/i Gx L/c xx) . The most conserved amino acid residues in Populus LRR motifs were Gly at position 1, Pro at position 4, Leu at position 13, 16 and 18, and Asn at position 21. Ile at positions 3 and Leu at position 7 are often substituted by each other and Leu at position 23 is often replaced by Phe (Table 2). Some repeats contain additional conserved residues in other positions, such as a Gly at position 8 of the M23 repeat; Ser residue at position 19 of the M1, M5, M7, M9, M12 and M19 repeat. Since the repetitive structure of LRR makes it capable of the rapid generation of new variants by duplications and deletions of entire repeats , the repeat number and distribution of LRR motif have been regarded as important parameters to reflect the evolutionary history. For PtLRR-RLKs, most of the closely related members in the phylogenetic tree kept similar motifs, providing additional support for their phylogenetic relations (Additional file 7). The conserved motifs in the LRR-RLK proteins within the same subfamily may suggest their functional similarities and divergence in motif composition may indicate their functional diversity . Although no group- or subgroup-specific LRR motif was identified, members of different subfamilies did exhibit various degree of complexity in terms of the LRR motif composition (Additional file 7). The most complicated motif composition was observed for the group XI which included all 17 types of LRR motif and in contrast, the group I and II had only 3 to 7 LRR motifs. In addition to the motif composition, the similarity in terms of the arrangement of different LRR motifs also varied among subfamilies (Additional file 7). The arrangement was almost identical for members of subfamily II, III, V, IX and XII-b. The variation in LRR patterns gets more obvious among the members of other subfamilies, although after careful comparison, several clades sharing a regular motif arrangement could still be identified for each subfamily (Additional file 7). The high divergence in the alignments of LRR motif within one subfamily could reflect the functional diversity among their members. In addition to LRR motifs, non-LRR motifs were also identified in the extracellular regions of PtLRR-RLK (Additional file 8). Common motifs including M4, M14 and M17 could be identified in the N-terminal for most of PtLRR-RLKs, while M13 could be found at the C-terminal for most PtLRR-RLKs. Different from these common motifs are certain non-LRR motifs which appear to be subfamily-specific, for example, the motif M24 only appeared in most members of VIII (Additional file 8). PtLRR-RLKs sharing the same or similar motif composition and arrangement could be identified for 50 out of 63 AtLRR-RLKs with known functions (Table 3 and Additional file 9), which supports the theory that the domain organization of most RLK/Pelle subfamilies was established before the monocot–dicot split .
When the trans-membrane (TM) domains were predicted by TMHMM, in a total of 379 PtLRR-RLKs, 339 had one TM and 26 PtLRR-RLKs did not have any TM. Further analysis of the remaining 14 PtLRR-RLKs with two TMs revealed one of them is atypical. The RLK domain of most PtLRR-RLK consists of approximately 250–280 amino acid residues with a maximum of 324 and a minimum of 168. In literature, plant RLK could be divided into 12 conserved subdomains (I–XII) from N- to C-terminal . In the 2- lobed structure of the RLK domain, the smaller lobe is composed of subdomains I to IV and is involved in anchoring and orienting the nucleotide. The larger lobe is composed of subdomain VI to XI and is largely responsible for binding the peptide substrate and initiating phosphor-transfer . In the kinase part of all PtLRR-RLKs, 25 motifs are identified which are similar to those identified for rice LRR-RLKs and were named as 1 to 25 according to the frequencies of their appearance (Table 4). Although most motifs did not seem to be subfamily-specific, motif 10, 13 and 16 only appeared in the subgroup VII and motif 15, 24 and 25 only showed up in the subfamily XII. Since only these two subfamilies included a Populus-specific clade in the phylogenic tree, these specific motifs may, to some extent, attribute to the functional divergence of these subfamily members in poplar.
Kinase is commonly referred to as arginine-aspartate (RD) kinases if it is strongly activated by the phosphorylation of the activation loop and they usually contain an Arg(R) in the subdomain preceding the catalytic loop . Conversely, a smaller number of kinases are referred to as Non-arginine-aspartate (non-RD) kinases which lack the conserved R in subdomain VI [42, 43]. It has been proposed that the signal of pathogen recognition mediated by RLKs is usually through a non-RD kinase . In PtLRR-RLKs, about half are RD-kinases including all the members of subfamily VII and IX. Interestingly, no Arabidopsis homolog with known functions has been identified for these two subfamilies and the VII-b subfamily is Populus-specific. In contrast, all members in the subfamily III, IV, V and XII are non-RD kinases although the Arabidopsis LRR-RLKs grouped with them take part in both defense and development (Table 1).
Contributions of tandem and large-scale duplications to the family size of PtLRR-RLKs
The explosion of members of a gene family has generally occurred as the result of repetitive tandem duplication (TD) and segmental and/or whole genome duplication events (S/WGD). PtLRR-RLK genes were comprehensively distributed within the poplar genome and 22 genes are localized to unassembled genomic sequence scaffolds and thus could not be mapped to any particular chromosome (Figure 3). Approximately 82% (312 out of 379) PtLRR-RLK genes are located in the replicated region, which is different from rice and Arabidopsis in which the frequencies of genes generated by S/WGDs are much lower (11% in rice and 26% in Arabidopsis) . Among them, 140 genes lacked duplicates on the corresponding duplicated blocks, suggesting that dynamic rearrangement, mutation or segmental loss may have occurred following the segmental duplication. According to previous literature, a chromosome region containing two or more genes within 200 kb can be defined as a gene cluster . In poplar, 72 PtLRR-RLK genes were located in 20 tandem duplication clusters (Figure 3). The smallest tandem duplication clusters consisted of only 2 genes and the largest cluster had 8 tightly linked genes on chromosome 15 and 19. The clusters were distributed unevenly among the 14 phylogenetic groups, and Populus-specific subgroup VII-b contains 6 clusters incorporating 86.7% of the genes of this subgroup. By contrast, group II, III, IV, V, VI, IX had no clusters present (Additional file 10).
Differential expression profiles of populus LRR-RLKgenes
To gain a broader understanding of the function of LRR-RLKs, we analyzed the divergence among Populus LRR-RLK genes in spatial and temporal expression and expression in response to specific environmental signals. Probe sets were readily identifiable for 283 out of 379 PtLRR-RLKs in the PopGenExpress data set, and their distinct transcript abundance patterns were retrieved by the Populus Electronic Fluorescent Pictograph (eFP) browser . Most Populus LRR-RLK genes demonstrated distinct tissue specific expression patterns except for mature leaves, where all have low transcriptional levels (Additional file 11). Filtering was added to select genes that had at least a 2-fold higher expression in one specific tissue compared to the median expression level of all analyzed tissues. Out of the PtLRR-RLK genes for which microarray data are available, 28%, 29%, 15%, 27% and 19% showed specific transcript accumulations in young leaf, roots, female catkins, male catkins and developing xylem, respectively (Figure 4A). Identification of the genes predominantly expressed in meristem tissues provides an important clue for their functions during cell fate specification and organ formation. Therefore, the expression of PtLRR-RLKs in multiple meristem tissues was investigated which may provide a further solid basis to select meristem-specific genes for related functional validation (Figure 4B).
Out of 16 tandem duplicated gene clusters, 8 clusters exhibited similar expression patterns among genes with expression data available (Additional file 12). It has been reported that in both rice and Arabidopsis, more than 50% of duplicate LRR-RLK gene pairs that were generated by a whole genome duplication event exhibited expressional divergence [48, 49]. In poplar, among 82 pairs of LRR-RLK paralogs with expression data available, 68 (group I), 10 (group II) and 4 (group III) pairs shared >80%, 60-80% and <60% similarities over their full amino acid sequences, respectively. When expression patterns were compared, 70%, 30% and 0% pairs shared similar expression pattern in group I, II and III, respectively (Additional file 13). Thus, the expressional diversity of duplicated genes in poplar was correlated with the sequence variation which may represent a dynamic functional diversification of this gene family over evolutionary time and contribute to the adaptability of trees. Among the 63 Arabidopsis LRR-RLKs with known functions, 52 genes showed obvious tissue-specific expression instead of a whole-plant expression as illustrated by eFP and for 45 genes, their Populus homologues showed a similar spatial expression pattern (Additional file 14). This result supports that orthologous genes from different species may retain similar temporal and spatial expression patterns [50, 51].
By complete searching of the digital expression profiles from the Gene Expression Omnibus (GEO) repository at NCBI website, we also investigated the expression patterns of the PtLRR-RLK genes during shoot organogenesis and in response to various stress stimulus including drought, cold, hypoxia, nitrogen limitation, aluminum stress in roots, bacteria, fungi and mimic wounding (Additional file 15). In a total of 12 treatments, the expression profile of PtLRR-RLKs varied considerably when exposed to 7 treatments, except for infection by Marssonina pathogen and Melampsora rust fungi, drought and aluminum stress in roots. Genes responsive to various treatments were summarized as heat maps in Additional file 16. The percentages of members of each subfamily being induced or suppressed for each treatment were listed in Additional file 17 and summarized in the format of heatmap in Figure 5. It can be seen that LRR-RLKs respond to various stimulates in a temporal and spatial manner by changing the expression profiles of different gene sets. For example, in wounding experiment, 90 hours after treatment (GSE16785), 102 and 59 PtLRR-RLK genes were up-regulated in leaf LPI5 and root, respectively, and qualitative differences in the induction patterns were detected for these two types of tissue (Figure 5). When the sampling time was extended to one week (GSE16783), only 31 and 87 genes were detected as induced in LPI1 leaves and LPI5 leaves, respectively, and compared to very young leaves (LPI1), the older leaves with LPI5 were much more enriched with up-regulated LRR-RLKs, which were overrepresented by members from subfamily III, IV, V, IX and XI-b (Figure 5). In another assay, the gene expression response of Populus tremuloides cell suspension cultures to methyl jasmonate feeding was analyzed; the transcript level of 37 PtLRR-RLKs was elevated. All these data indicated that LRR-RLK gene family plays an indispensable role in wounding defense of tree species. When confronted with ammonium shortage, at a 4-week checkpoint, the induction was more dramatic in young leaves (LPI2) than the older leaves (LPI5). With the progression of the ammonium shortage, the transcript of LRR-RLKs from subfamily III, IV, IX and XI-b got obviously repressed in the older leaves (Figure 5). When the effect of hypoxia on gene expression was investigated in grey poplar, 117 genes responded by induction in leaves with only 11 genes got induced in roots. This located induction pattern may imply the localized functions of different PtLRR-RLK members. From Figure 5, it was seen that members of different LRR-RLK subfamilies act in an overlapping manner when dealing with different stimulus which indicated that cross talk and signal integration exist among different signaling pathways mediated by PtLRR-RLKs. In terms of down-regulation, several things need to be pointed out (Figure 5B). First, members of VII and XIV subfamily were highly repressed in LPI5 leaves one week after wounding. Second, only 26 PtLRR-RLK genes got transcriptionally induced in the winter survival and maintenance mechanism of P. trichocarpa, 144 genes responded with repression instead (GSE21480). Third, in the hypoxia treatment, with more than 90% of the induced genes was located in the leaf tissue, 88% down-regulated genes were found in the root tissue instead. In summary, although it is hard to assign distinct roles to different Populus LRR-RLK subfamilies based on the results of limited microarray analysis, it could be reasonable to suggest that PtLRR-RLKs are widely involved in different aspects of plant development in both normal and stressed circumstances. However, subset of Arabidopsis LRR-RLK genes have previously been shown to play crucial roles in biotic stress response (Additional file 18). Two biotic signals, Marssonina pathogen and Melampsora rust fungi, did not cause significant change of gene expression profiles in the current study, which indicates a need for more microarray experiments to better understand the roles of Populus LRR-RLK genes in biotic defense. For trees, it is unlikely to generate a collection of LRR-RLK T-DNA insertion mutants, as in Arabidopsis, to be easily applied for the analysis of other developmental aspects. The results from this study could provide insights into possible functions for some PtLRR-RLKs before future functional analyses would eventually elucidate their biological meanings.
Characterization of LRR-RLK genes in a ligneous species would facilitate a better understanding of the evolutionary processes and functions of this gene family. The current work shows that the LRR-RLKs represent a large gene family in Populus trichocarpa. Gene structures, motif composition and arrangements are considerably conserved among the (sub)groups. The distribution of genes was found to be non-random across chromosomes and a high proportion of the genes are located in segmental duplicated regions instead of tandem duplicated clusters. For most of the 63 Arabidopsis with known functions, Populus homologues always could be identified with similar genetic structure, motif character and expression profiles, providing insight into the evolutionary and functional conservation of this gene family in plant species. Expression patterns based on microarray data suggest that many PtLRR-RLK genes are expressed in a tissue-specific manner and responsive to various stresses. Data in this work may provide valuable information for future investigations to reveal the functional divergence and adaptive evolution of this gene family in tree species.
Sequence retrieval and phylogenetic reconstruction of LRR-RLK genes in poplar genome
Arabidopsis thaliana gene identifiers of different LRR-RLK super-families were downloaded from the PlantsP server v.2011 Arabidopsis 2010 project (http://plantsp.genomics.purdue.edu/html/projects/lrr/Clouse2010.htm)  for the first round Blastp search against the poplar genomic sequence database at the DOE Joint Genome Institute (JGI) website. Subsequently, each identified hit was used as a query to conduct Blastp searches in the poplar assembly genomic sequence database to minimize the risk of missing potential PtLRR-RLKs. The version 2.2 P. trichocarpa genome and protein sequences were downloaded from Phytozome (http://www.phytozome.net) . These resulted hit sequences were then analyzed with SMART (http://smart.embl-heidelberg.de)  and PFAM (http://pfam.sanger.ac.uk/)  to assure the presence of at least two LRR domains and one RLK domain. Identical and defective sequences were eliminated using manual inspection in Molecular Evolutionary Genetics Analysis (MEGA) v5.1 . After the signal sequences were deleted, ClustalX v.2.0.12  was used to generate a multiple sequence alignment of either the full length sequences, the trimmed LRR domains or kinase domains among the PtLRR-RLK protein sequences. The phylogenetic trees were constructed using the neighbor-joining method  in the MEGA package v5.1  with bootstrap values from 1000 replicates indicated at each node. Representative sequences from each Arabidopsis LRR-RLK subfamily or AtLRR-RLKs with defined functions were chosen to generate alignments with Populus LRR-RLKs.
Protein structure and conserved motif distribution
The number and position of exons and introns for individual PtLRR-RLK genes were determined by comparison of the cDNAs with their corresponding genomic DNA sequences. Information concerning PtLRR-RLK protein sequences, such as number of amino acids, molecular weights and PIs, were determined using ProtParam (http://au.expasy.org/tools/protparam.html) . Presence of the signal peptides were predicted at SignalP v.4.1 (http://www.cbs.dtu.dk/services/SignalP) . Transmembrane domains were predicted with TMHMM v. 2.0 (http://www.cbs.dtu.dk/services/TMHMM-2.0/)  and Phobius (http://phobius.binf.ku.dk/) . To exhibit the structural divergence of PtLRR-RLK genes, the conserved motifs in the encoded proteins were performed with the Multiple Expectation Maximization for Motif Elicitation (MEME) online program v.4.9.0 (http://meme.sdsc.edu/meme/intro.html)  and visualized with WebLogo (http://weblogo.berkeley.edu/logo.cgi) . Parameters were set as follows: the maximum number of motifs 30; minimum motif width 10; and maximum motif width 30; all other parameters were defaulted.
Chromosome location analysis
The chromosomal locations of the poplar LRR-RLK genes were drawn on the schematic diagram tool at PopGenIE  (http://popgenie.org/gp). Identification of homologous chromosome segments resulting from whole-genome duplication events was accomplished as described previously . Blocks with the same color represent homologous chromosome segments. Tandem gene duplications were identified as genes separated by ten or fewer gene loci in a range of 200 kb distance.
Gene expression analysis
Gene expression data mainly came from Poplar eFP Browser (http://bar.utoronto.ca/efppop/cgi-bin/efpWeb.cgi). In addition, the gene expression pattern of Populus meristem tissue series was obtained from PopGenIE . The genome-wide microarray data was obtained from the Gene Expression Omnibus database at the NCBI under the series accession numbers GSE23637 (Populus euphratica leaves subjected to drought), GSE13043 (from P. trichocarpa), GSE21480 (transcriptional regulation in the winter survival), GSE20061 (young differentiating xylem of poplar in response to a drought –rewatering cycle), GSE23726 (Populus euphratica leaves subjected to infection by Marssonina pathogen), GSE9673 (interactions with Melampsora rust fungi), GSE13109 (Effect of hypoxia on gene expression in Grey poplar), GSE14893 (Populus leaves under nitrogen limitation: clone 3200), GSE19297 (aluminum stress in roots of aspen, Populus tremula L.), GSE16773 (gene expression response of Populus tremuloides cell suspension cultures to methyl jasmonate feeding), GSE12152 (Genome scale transcriptome analysis of shoot organogenesis in Populus tremula x P. alba), GSE17223 (Molecular bases of acclimation and adaptation to water deficit in Populus anadensis) and GSE16785 (Wound-induced gene expression changes in Populus: 90 hours; clone RM5). Probe sets corresponding to the putative Populus LRR-RLKs were identified using an online Probe Match tool available at the NetAffx Analysis Center (http://www.affymetrix.com/). Genes were clustered based on the expression profiles and Hierarchical clustering of microarray data performed in MultiExperiment Viewer (MeV) v4.7.4 , using Pearson correlation and Average Linkage Clustering algorithm. Heatmaps of gene expression were generated using R (http://www.r-project.org/).
van der Geer P, Hunter T, Lindberg RA: Receptor protein-tyrosine kinases and their signal transduction pathways. Annu Rev Cell Biol. 1994, 10: 251-337. 10.1146/annurev.cb.10.110194.001343.
Walker JC, Zhang R: Relationship of a putative receptor protein kinase from maize to the S-locus glycoproteins of brassica. Nature. 1990, 345 (6277): 743-746. 10.1038/345743a0.
Dievart A, Clark SE: Using mutant alleles to determine the structure and function of leucine-rich repeat receptor-like kinases. Curr Opin Plant Biol. 2003, 6 (5): 507-516. 10.1016/S1369-5266(03)00089-X.
Shiu SH, Bleecker AB: Plant receptor-like kinase gene family: diversity, function, and signaling. Sci STKE. 2001, 2001 (113): re22-
Gou X, He K, Yang H, Yuan T, Lin H, Clouse SD, Li J: Genome-wide cloning and sequence analysis of leucine-rich repeat receptor-like protein kinase genes in arabidopsis thaliana. BMC Genomics. 2010, 11: 19-10.1186/1471-2164-11-19.
Afzal AJ, Wood AJ, Lightfoot DA: Plant receptor-like serine threonine kinases: roles in signaling and plant defense. Mol Plant Microbe Interact. 2008, 21 (5): 507-517. 10.1094/MPMI-21-5-0507.
Torii KU, Mitsukawa N, Oosumi T, Matsuura Y, Yokoyama R, Whittier RF, Komeda Y: The arabidopsis ERECTA gene encodes a putative receptor protein kinase with extracellular leucine-rich repeats. Plant Cell. 1996, 8 (4): 735-746.
Godiard L, Sauviac L, Torii KU, Grenon O, Mangin B, Grimsley NH, Marco Y: ERECTA, an LRR receptor-like kinase protein controlling development pleiotropically affects resistance to bacterial wilt. Plant J. 2003, 36 (3): 353-365. 10.1046/j.1365-313X.2003.01877.x.
Dievart A, Clark SE: LRR-containing receptors regulating plant development and defense. Development. 2004, 131 (2): 251-261.
Mayer KFX, Schoof H, Haecker A, Lenhard M, Jurgens G, Laux T: Role of WUSCHEL in regulating stem cell fate in the arabidopsis shoot meristem. Cell. 1998, 95 (6): 805-815. 10.1016/S0092-8674(00)81703-1.
Schoof H, Lenhard M, Haecker A, Mayer KFX, Jurgens G, Laux T: The stem cell population of arabidopsis shoot meristems is maintained by a regulatory loop between the CLAVATA and WUSCHEL genes. Cell. 2000, 100 (6): 635-644. 10.1016/S0092-8674(00)80700-X.
Brand U, Fletcher JC, Hobe M, Meyerowitz EM, Simon R: Dependence of stem cell fate in arabidopsis on a feedback loop regulated by CLV3 activity. Science. 2000, 289 (5479): 617-619. 10.1126/science.289.5479.617.
Deyoung BJ, Clark SE: BAM receptors regulate stem cell specification and organ development through complex interactions with CLAVATA signaling. Genetics. 2008, 180 (2): 895-904. 10.1534/genetics.108.091108.
Walsh B: Population-genetic models of the fates of duplicate genes. Genetica. 2003, 118 (2–3): 279-294.
Shiu SH, Bleecker AB: Expansion of the receptor-like kinase/pelle gene family and receptor-like proteins in arabidopsis. Plant Physiol. 2003, 132 (2): 530-543. 10.1104/pp.103.021964.
Shiu SH, Karlowski WM, Pan R, Tzeng YH, Mayer KF, Li WH: Comparative analysis of the receptor-like kinase family in arabidopsis and rice. Plant Cell. 2004, 16 (5): 1220-1234. 10.1105/tpc.020834.
Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, et al: The genome of black cottonwood, populus trichocarpa (torr. & Gray). Science. 2006, 313 (5793): 1596-1604. 10.1126/science.1128691.
Dievart A, Gilbert N, Droc G, Attard A, Gourgues M, Guiderdoni E, Perin C: Leucine-rich repeat receptor kinases are sporadically distributed in eukaryotic genomes. BMC Evol Biol. 2012, 11: 367-
Sun X, Wang GL: Genome-wide identification, characterization and phylogenetic analysis of the rice LRR-kinases. PLoS One. 2011, 6 (3): e16079-10.1371/journal.pone.0016079.
Lehti-Shiu MD, Zou C, Hanada K, Shiu SH: Evolutionary history and stress regulation of plant receptor-like kinase/pelle genes. Plant Physiol. 2009, 150 (1): 12-26. 10.1104/pp.108.134353.
Shiu SH, Li WH: Origins, lineage-specific expansions, and multiple losses of tyrosine kinases in eukaryotes. Mol Biol Evol. 2004, 21 (5): 828-840. 10.1093/molbev/msh077.
Tanaka M, Takahata Y, Nakayama H, Nakatani M, Tahara M: Altered carbohydrate metabolism in the storage roots of sweet potato plants overexpressing the SRF1 gene, which encodes a Dof zinc finger transcription factor. Planta. 2009, 230 (4): 737-746. 10.1007/s00425-009-0979-2.
Eyuboglu B, Pfister K, Haberer G, Chevalier D, Fuchs A, Mayer KF, Schneitz K: Molecular characterisation of the STRUBBELIG-RECEPTOR FAMILY of genes encoding putative leucine-rich repeat receptor-like kinases in arabidopsis thaliana. BMC Plant Biol. 2007, 7: 16-10.1186/1471-2229-7-16.
Dolan L: Positional information and mobile transcriptional regulators determine cell pattern in the arabidopsis root epidermis. J Exp Bot. 2006, 57 (1): 51-54.
Kinoshita T, Cano-Delgado A, Seto H, Hiranuma S, Fujioka S, Yoshida S, Chory J: Binding of brassinosteroids to the extracellular domain of plant receptor kinase BRI1. Nature. 2005, 433 (7022): 167-171. 10.1038/nature03227.
van Zanten M, Snoek LB, Proveniers MC, Peeters AJ: The many functions of ERECTA. Trends Plant Sci. 2009, 14 (4): 214-218. 10.1016/j.tplants.2009.01.010.
Sanchez-Rodriguez C, Estevez JM, Llorente F, Hernandez-Blanco C, Jorda L, Pagan I, Berrocal M, Marco Y, Somerville S, Molina A: The ERECTA receptor-like kinase regulates cell wall-mediated resistance to pathogens in arabidopsis thaliana. Mol Plant Microbe Interact. 2009, 22 (8): 953-963. 10.1094/MPMI-22-8-0953.
Wang Z, Meng P, Zhang X, Ren D, Yang S: BON1 Interacts with the protein kinases BIR1 and BAK1 in modulation of temperature-dependent plant growth and cell death in arabidopsis. Plant J. 2011, 67 (6): 1081-1093. 10.1111/j.1365-313X.2011.04659.x.
Lee JS, Kuroha T, Hnilova M, Khatayevich D, Kanaoka MM, McAbee JM, Sarikaya M, Tamerler C, Torii KU: Direct interaction of ligand-receptor pairs specifying stomatal patterning. Genes Dev. 2012, 26 (2): 126-136. 10.1101/gad.179895.111.
Yamaguchi Y, Huffaker A, Bryan AC, Tax FE, Ryan CA: PEPR2 Is a second receptor for the Pep1 and Pep2 peptides and contributes to defense responses in arabidopsis. Plant Cell. 2010, 22 (2): 508-522. 10.1105/tpc.109.068874.
Krol E, Mentzel T, Chinchilla D, Boller T, Felix G, Kemmerling B, Postel S, Arents M, Jeworutzki E, Al-Rasheid KA, et al: Perception of the arabidopsis danger signal peptide 1 involves the pattern recognition receptor AtPEPR1 and its close homologue AtPEPR2. J Biol Chem. 2010, 285 (18): 13471-13479. 10.1074/jbc.M109.097394.
Gomez-Gomez L, Boller T: FLS2: an LRR receptor-like kinase involved in the perception of the bacterial elicitor flagellin in arabidopsis. Mol Cell. 2000, 5 (6): 1003-1011. 10.1016/S1097-2765(00)80265-8.
Zipfel C, Kunze G, Chinchilla D, Caniard A, Jones JD, Boller T, Felix G: Perception of the bacterial PAMP EF-Tu by the receptor EFR restricts agrobacterium-mediated transformation. Cell. 2006, 125 (4): 749-760. 10.1016/j.cell.2006.03.037.
Zhao DZ, Wang GF, Speal B, Ma H: The excess microsporocytes1 gene encodes a putative leucine-rich repeat receptor protein kinase that controls somatic and reproductive cell fates in the arabidopsis anther. Genes Dev. 2002, 16 (15): 2021-2031. 10.1101/gad.997902.
Wang J, Tan S, Zhang L, Li P, Tian D: Co-variation among major classes of LRR-encoding genes in two pairs of plant species. J Mol Evol. 2011, 72 (5–6): 498-509.
Karve R, Liu W, Willet SG, Torii KU, Shpak ED: The presence of multiple introns is essential for ERECTA expression in arabidopsis. RNA. 2011, 17 (10): 1907-1921. 10.1261/rna.2825811.
Bailey TL, Williams N, Misleh C, Li WW: MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 2006, 34 (Web Server issue): W369-W373.
Padmanabhan M, Cournoyer P, Dinesh-Kumar SP: The leucine-rich repeat domain in plant innate immunity: a wealth of possibilities. Cell Microbiol. 2009, 11 (2): 191-198. 10.1111/j.1462-5822.2008.01260.x.
Kobe B, Deisenhofer J: The leucine-rich repeat: a versatile binding motif. Trends Biochem Sci. 1994, 19 (10): 415-421. 10.1016/0968-0004(94)90090-6.
Ellis J, Dodds P, Pryor T: The generation of plant disease resistance gene specificities. Trends Plant Sci. 2000, 5 (9): 373-379. 10.1016/S1360-1385(00)01694-0.
Hanks SK, Hunter T: Protein kinases 6. The eukaryotic protein kinase superfamily: kinase (catalytic) domain structure and classification. FASEB J. 1995, 9 (8): 576-596.
Adams JA: Activation loop phosphorylation and catalysis in protein kinases: is there functional evidence for the autoinhibitor model?. Biochemistry. 2003, 42 (3): 601-607. 10.1021/bi020617o.
Krupa A, Preethi G, Srinivasan N: Structural modes of stabilization of permissive phosphorylation sites in protein kinases: distinct strategies in Ser/Thr and Tyr kinases. J Mol Biol. 2004, 339 (5): 1025-1039. 10.1016/j.jmb.2004.04.043.
Dardick C, Ronald P: Plant and animal pathogen recognition receptors signal through non-RD kinases. PLoS Pathog. 2006, 2 (1): e2-10.1371/journal.ppat.0020002.
Hwang SG, Kim DS, Jang CS: Comparative analysis of evolutionary dynamics of genes encoding leucine-rich repeat receptor-like kinase between rice and arabidopsis. Genetica. 2011, 139 (8): 1023-1032. 10.1007/s10709-011-9604-y.
Holub EB: The arms race is ancient history in arabidopsis, the wildflower. Nat Rev Genet. 2001, 2 (7): 516-527. 10.1038/35080508.
Wilkins O, Nahal H, Foong J, Provart NJ, Campbell MM: Expansion and diversification of the populus R2R3-MYB family of transcription factors. Plant Physiol. 2009, 149 (2): 981-993.
Yim WC, Lee BM, Jang CS: Expression diversity and evolutionary dynamics of rice duplicate genes. Mol Genet Genomics. 2009, 281 (5): 483-493. 10.1007/s00438-009-0425-y.
Blanc G, Wolfe KH: Functional divergence of duplicated genes formed by polyploidy during arabidopsis evolution. Plant Cell. 2004, 16 (7): 1679-1691. 10.1105/tpc.021410.
Chen F, Mackey AJ, Stoeckert CJ, Roos DS: OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups. Nucleic Acids Res. 2006, 34 (Database issue): D363-D368.
Sonnhammer EL, Koonin EV: Orthology, paralogy and proposed classification for paralog subtypes. Trends Genet. 2002, 18 (12): 619-620. 10.1016/S0168-9525(02)02793-2.
Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, Mitros T, Dirks W, Hellsten U, Putnam N, et al: Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012, 40 (Database issue): D1178-D1186.
Schultz J, Copley RR, Doerks T, Ponting CP, Bork P: SMART: a web-based tool for the study of genetically mobile domains. Nucleic Acids Res. 2000, 28 (1): 231-234. 10.1093/nar/28.1.231.
Finn RD, Tate J, Mistry J, Coggill PC, Sammut SJ, Hotz HR, Ceric G, Forslund K, Eddy SR, Sonnhammer EL, et al: The pfam protein families database. Nucleic Acids Res. 2008, 36 (Database issue): D281-D288.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28 (10): 2731-2739. 10.1093/molbev/msr121.
Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, Higgins DG, Thompson JD: Multiple sequence alignment with the clustal series of programs. Nucleic Acids Res. 2003, 31 (13): 3497-3500. 10.1093/nar/gkg500.
Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4 (4): 406-425.
Wilkins MR, Gasteiger E, Bairoch A, Sanchez JC, Williams KL, Appel RD, Hochstrasser DF: Protein identification and analysis tools in the ExPASy server. Methods Mol Biol. 1999, 112: 531-552.
Petersen TN, Brunak S, von Heijne G, Nielsen H: SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods. 2011, 8 (10): 785-786. 10.1038/nmeth.1701.
Krogh A, Larsson B, von Heijne G, Sonnhammer EL: Predicting transmembrane protein topology with a hidden markov model: application to complete genomes. J Mol Biol. 2001, 305 (3): 567-580. 10.1006/jmbi.2000.4315.
Kall L, Krogh A, Sonnhammer EL: Advantages of combined transmembrane topology and signal peptide prediction--the phobius web server. Nucleic Acids Res. 2007, 35 (Web Server issue): W429-W432.
Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Res. 2004, 14 (6): 1188-1190. 10.1101/gr.849004.
Sjodin A, Street NR, Sandberg G, Gustafsson P, Jansson S: The populus genome integrative explorer (PopGenIE): a new resource for exploring the populus genome. New Phytol. 2009, 182 (4): 1013-1025. 10.1111/j.1469-8137.2009.02807.x.
Saeed AI, Sharov V, White J, Li J, Liang W, Bhagabati N, Braisted J, Klapa M, Currier T, Thiagarajan M, et al: TM4: a free, open-source system for microarray data management and analysis. Biotechniques. 2003, 34 (2): 374-378.
This study was supported by National Program on Key Basic Research Project (2012CB114500), National High Technology Research and Development Program (2011AA100200) and National Natural Science Foundation (31270644, 31070600 and 31100451) of China.
The authors declare that they have no competing interests.
YZ and YJ performed most of the data mining and data analysis. YZ participated in the illustrations of the figures and tables. SY and YS helped to retrieve data from GEO database and draw the heat-map. JW designed and coordinated the work and wrote the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: PtLRR-RLKs identified in the present study. Genomic DNA sequences are obtained from Phytozome (http://www.phytozome.net/poplar, release 2.1). Amino acid sequences are deduced from the corresponding coding sequences. (XLS 172 KB)
Additional file 2: LRR-RLK genes in Populus trichocarpa. The unrooted tree was constructed using MEGA 4.0. Numbers at nodes indicate the percentage bootstrap scores and only bootstrap values higher than 50% from 1,000 replicates are shown. (PDF 96 KB)
Additional file 3: LRR-RLK genes in Populus trichocarpa. The unrooted tree was constructed using MEGA 4.0. Numbers at nodes indicate the percentage bootstrap scores and only bootstrap values higher than 50% from 1,000 replicates are shown. (PDF 66 KB)
Additional file 4: LRR-RLK genes in Populus trichocarpa and Arabidopsis thaliana. The unrooted tree was constructed using MEGA 4.0. Numbers at nodes indicate the percentage bootstrap scores and only bootstrap values higher than 50% from 1,000 replicates are shown. (PDF 97 KB)
Additional file 5: PtLRR-RLK gene to illustrate the distribution and position of introns. Exons and introns are represented to scale by colored boxes and lines, respectively. The group number and name of PtLRR-RLK gene and its intron–exon structure pattern are indicated at the left and right sides, respectively. (XLS 7 MB)
Additional file 6: AtLRR-RLKs with known functions and Populus homologues with similar genetic structures.(XLS 432 KB)
Additional file 7: Populus LRR-RLK subfamily.(XLS 799 KB)
Additional file 11: PtLRR-RLKs with probes available in the PopGenExpress data set. The microarray-based expression data were downloaded from the Poplar eFP browser, gene-wise normalized and hierarchical clustered based on Pearson correlation. Color scale at the top of each dendrogram represents log2 expression values. Rt, roots; ML, mature leaves; YL, young leaves; FC, female catkins; MC, male catkins; DX, differentiating xylems. (XLS 26 KB)
Additional file 12: Expression patterns of tandem duplicated gene clusters. The microarray-based expression data were downloaded from the Poplar eFP browser, gene-wise normalized and hierarchical clustered based on Pearson correlation. Color scale at the top of each dendrogram represents log2 expression values. Rt, roots; ML, mature leaves; YL, young leaves; FC, female catkins; MC, male catkins; DX, differentiating xylems. (PDF 243 KB)
Additional file 13: Expression patterns of 82 pairs of PtLRR-RLK paralogs. The microarray-based expression data were downloaded from the Poplar eFP browser, gene-wise normalized and hierarchical clustered based on Pearson correlation. Color scale at the top of each dendrogram represents log2 expression values. Rt, roots; ML, mature leaves; YL, young leaves; FC, female catkins; MC, male catkins; DX, differentiating xylems. (PDF 176 KB)
Additional file 14: Arabidopsis LRR-RLKs and their respective Populus homologues.(XLS 31 KB)
Additional file 16: Populus LRR-RLK genes exhibit differential expression upon a range of treatments. The patterns of relative transcript accumulation of each PtLRR-RLK genes as determined by microarray analysis are presented as a heat map, with red indicating higher levels and blue indicating lower levels of transcript accumulation. (DOC 3 MB)
About this article
Cite this article
Zan, Y., Ji, Y., Zhang, Y. et al. Genome-wide identification, characterization and expression analysis of populusleucine-rich repeat receptor-like protein kinase genes. BMC Genomics 14, 318 (2013). https://doi.org/10.1186/1471-2164-14-318
- Populus trichocarpa
- Leucine-rich repeat receptor-like kinase (LRR-RLK)
- Phylogenetic analysis
- Motif elicitation
- Expression profiling