Genome-wide cloning and sequence analysis of leucine-rich repeat receptor-like protein kinase genes in Arabidopsis thaliana
BMC Genomics volume 11, Article number: 19 (2010)
Transmembrane receptor kinases play critical roles in both animal and plant signaling pathways regulating growth, development, differentiation, cell death, and pathogenic defense responses. In Arabidopsis thaliana, there are at least 223 Leucine-rich repeat receptor-like kinases (LRR-RLKs), representing one of the largest protein families. Although functional roles for a handful of LRR-RLKs have been revealed, the functions of the majority of members in this protein family have not been elucidated.
As a resource for the in-depth analysis of this important protein family, the complementary DNA sequences (cDNAs) of 194 LRR-RLKs were cloned into the Gateway® donor vector pDONR/ZeoR and analyzed by DNA sequencing. Among them, 157 clones showed sequences identical to the predictions in the Arabidopsis sequence resource, TAIR8. The other 37 cDNAs showed gene structures distinct from the predictions of TAIR8, which was mainly caused by alternative splicing of pre-mRNA. Most of the genes have been further cloned into Gateway® destination vectors with GFP or FLAG epitope tags and have been transformed into Arabidopsis for in planta functional analysis. All clones from this study have been submitted to the Arabidopsis Biological Resource Center (ABRC) at Ohio State University for full accessibility by the Arabidopsis research community.
Most of the Arabidopsis LRR-RLK genes have been isolated and the sequence analysis showed a number of alternatively spliced variants. The generated resources, including cDNA entry clones, expression constructs and transgenic plants, will facilitate further functional analysis of the members of this important gene family.
Multi-cellular organisms such as plants and animals use cell surface receptors to sense and transduce chemical signals for cell-to-cell communications. One of the most important groups of cell surface receptors, the receptor-like protein kinases (RLKs), has unique structural features that make them particularly suitable for cell-to-cell signaling. A typical RLK contains an extracellular receptor domain to perceive a specific signal, a single-pass transmembrane domain to anchor the protein within the membrane, and a cytoplasmic kinase domain to transduce the signal downstream via autophosphorylation followed by further phosphorylation of specific substrates. Plant receptor kinases were originally named "receptor-like" protein kinases since ligands for these receptors were largely unknown at the time when the first RLK was identified in maize . Since then, a small number of RLKs have been functionally characterized in plants and a few specific ligands have been identified. They play essential roles in plant growth, development, pathogen resistance and cell death [2–8].
In the model plant Arabidopsis, both transmembrane RLKs and receptor-like cytoplasmic kinases (RLCKs, which lack extracellular domains) belong to a large, monophyletic gene superfamily of at least 610 members, representing nearly 2.5% of the protein coding sequences within the entire genome [9, 10]. About two thirds of genes in this superfamily encode proteins with a typical N-terminal signal peptide and a hydrophobic transmembrane domain, which are consistent structural features of transmembrane RLKs. Based on their structural and sequence similarities, the RLKs are further grouped into more than 10 subfamilies. Leucine-rich repeat (LRR)-RLKs represent the largest subfamily in the Arabidopsis genome with at least 223 members.
Despite the identification of a large number of LRR-RLKs in Arabidopsis, biological functions have been defined for only about 30 proteins (Additional file 1: Table S1), which play crucial roles in a variety of different physiological processes. For instance, ERECTA (ER) regulates organ shape and inflorescence architecture; CLAVATA1 (CLV1) determines the balance between undifferentiated and differentiated shoot and floral meristem cells; BRASSINOSTEROID-INSENSITIVE 1 (BRI1) and BRI1-ASSOCIATED RECEPTOR KINASE 1 (BAK1) are a pair of RLKs involved in brassinosteroid (BR) signaling [13–15]; HAESA controls floral organ abscission; FLAGELLIN-SENSITIVE 2 (FLS2) contributes to plant defense/pathogen-recognition; VASCULAR HIGHWAY 1 (VH1) influences leaf cell patterning; and EXCESS MICROSPOROCYTES 1 (EMS1), SOMATIC EMBRYOGENESIS RECEPTOR KINASE 1 (SERK1) and SERK2 play important roles in microsporogenesis and male sterility [19–21]. Other LRR-RLKs of known function include RECEPTOR-LIKE PROTEIN KINASE 1 (RPK1), involved in abscisic acid early signaling [22, 23]; TOAD2 and its redundant homologue RPK1, both required in Arabidopsis embryonic pattern formation; PXY, responsible for maintaining vascular tissue polarity; and GASSHO1 (GSO1) and GASSHO2 (GSO2), which are essential for the normal development of the epidermal surface of Arabidopsis embryos. Recently, two LRR-RLKs, BIR1 and SOBIR1, were found to regulate cell death and innate immunity in Arabidopsis. Interestingly, several RLKs were found to possess dual or multiple roles during plant growth and development. For example, ERECTA is involved in both plant development and pathogen defense responses. BAK1 and BAK1-LIKE 1 (BKK1) regulate BR-dependent cell growth, and play an important role in cell-death control under various biotic and abiotic stresses. When plants are attacked by bacterial pathogens, BAK1 can also be recruited to the FLS2 complex, where it regulates the innate immunity response [29–32].
Reverse genetics has been used as a routine and effective approach to dissect the biological functions of genes. Isolated complementary DNA (cDNA) sequences are valuable resources in many processes in determining the functions of their corresponding genes. For example, the cDNA sequences can be used for ectopic expression, complementary experiments for gene knock out lines, site-directed mutagenesis, dominant negative analysis, gene silencing and RNA interference, subcellular localization of epitope-tagged fusion proteins, and protein-protein interaction analysis. Epitope-tagged fusion proteins can also facilitate the proteomic studies of interesting genes. For example, in vivo phosphorylation sites of BRI1 and BAK1 were identified by immunoprecipitation of epitope-tagged BRI1/BAK1 from Arabidopsis followed by liquid chromatography-tandem mass spectrometry (LC/MS/MS) and the functions of the identified phosphorylation sites were determined in planta [33, 34].
In this paper, the full-length cDNA cloning of the entire Arabidopsis LRR-RLK subfamily genes is reported. A total of 194 cDNA sequences have been successfully amplified by RT-PCR and cloned into a Gateway® donor vector pDONR/ZeoR. Sequence analysis indicated that 157 cDNAs are identical to the predicted or earlier submitted cDNA sequences in The Arabidopsis Information Resource (TAIR) database, whereas 37 other genes showed altered cDNA sequences distinct from those presented in the database, which is likely due to alternative splicing of pre-mRNA. One hundred eighty cDNA sequences with 100% sequence accuracy were further transferred, by in vitro DNA recombination, into two different destination vectors with either FLAG or GFP as the C-terminal fusion tags. Preliminary results indicated that most of the gene products can be detected by Western hybridization analysis using anti-FLAG or anti-GFP antibodies. The results and resources generated by this study will be useful tools for future functional analyses of LRR-RLKs.
Construction of Gateway®-compatible binary vectors for plant transformation
To facilitate future functional analyses of all LRR-RLKs, we generated four different Gateway®-compatible binary vectors for high-throughput cloning of LRR-RLKs (Figure 1A). The four vectors contain a Gateway® cassette for DNA recombination with plasmid DNA of entry clones to produce final expression constructs. GFP or FLAG sequences were integrated at the 3' terminus of the Gateway® cassette for the production of epitope-tagged fusion proteins that will facilitate subsequent immunoprecipitation and coimmunoprecipitation analyses.
The first vector, named pB35GWG, contains a BASTA resistance gene for selecting transgenic plants and a C-terminal GFP tag. The second vector, designated pK35GWG, uses a kanamycin resistance gene for selecting transgenic plants, also with a C-terminal GFP tag. The third vector, termed pB35GWF, uses the BASTA resistance gene for transgenic selection and FLAG as the C-terminal fusion tag. The fourth vector, labeled pK35GWF, contains a kanamycin resistance gene as the selectable marker and again has a C-terminal FLAG tag. All vectors use the CaMV 35S promoter with dual enhancers to drive expression of the gene of interest. Detailed sequence information of the junction region of the Gateway® cassette and the GFP or FLAG epitope tags is also shown (Figure 1B).
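The four vectors differ only in the plant selectable marker and the C-terminal tag, so they can be summarized as a small lookup table. The sketch below is illustrative only: the dictionary layout, the field names and the `pick_vector` helper are assumptions made for this example and are not part of the study.

```python
# Summary of the four destination vectors described in the text.
# Field names and the helper function are hypothetical, chosen for illustration.
VECTORS = {
    "pB35GWG": {"plant_selection": "BASTA", "c_terminal_tag": "GFP"},
    "pK35GWG": {"plant_selection": "kanamycin", "c_terminal_tag": "GFP"},
    "pB35GWF": {"plant_selection": "BASTA", "c_terminal_tag": "FLAG"},
    "pK35GWF": {"plant_selection": "kanamycin", "c_terminal_tag": "FLAG"},
}


def pick_vector(selection, tag):
    """Return the name of the vector with the given selectable marker and tag."""
    matches = [
        name
        for name, props in VECTORS.items()
        if props["plant_selection"] == selection and props["c_terminal_tag"] == tag
    ]
    if len(matches) != 1:
        raise ValueError(f"no unique vector for {selection!r} / {tag!r}")
    return matches[0]
```

For example, `pick_vector("BASTA", "FLAG")` returns the name of the BASTA-selectable FLAG vector.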
To examine whether the newly constructed Gateway®-compatible vectors are reliable in generating transgenic plants that overexpress LRR-RLKs, a functionally characterized gene, BAK1, was used for the test. Previous studies have shown that BAK1 is involved in the BR signal transduction pathway [14, 15, 34]. Overexpression of BAK1 can suppress the dwarf phenotype of the bri1 weak allele, bri1-5, to wildtype [14, 15, 34]. To clone BAK1 into the destination vectors, attB1- and attB2-flanked BAK1 was PCR-amplified and gel purified as described in experimental procedures. After BP and LR clonase reactions, BAK1 was transferred into the destination vectors and introduced into bri1-5 mutant plants. The resulting transgenic plants showed a typical bri1-5 suppression phenotype (Figure 2A). Western hybridization analysis using anti-FLAG or anti-GFP antibodies also indicated that both BAK1-FLAG and BAK1-GFP were overexpressed in the transgenic plants (Figure 2B). The results suggest that the generated destination vectors are fully functional and can be used for cloning and overexpression of all LRR-RLKs in Arabidopsis plants for future functional analyses.
Gateway® cloning of LRR-RLKs
A three-step protocol was used to efficiently produce attB1- and attB2-flanked LRR-RLK ORF fragments (Additional file 2A): (a) the reverse transcriptase reaction to generate single-stranded cDNA; (b) the first round of PCR with gene-specific primers to amplify the target ORF flanked with partial attB1 and attB2 adaptor sequences; and (c) the second round of PCR with universal attB1 and attB2 adaptor primers to integrate complete attB1 and attB2 sites into the ORF amplicons. TAIR8 presents 223 predicted LRR-RLKs, distributed on all five chromosomes of the Arabidopsis genome, with ORF sizes ranging from 339 bp to 3,759 bp. The coding sequences of 221 LRR-RLKs are larger than 1,500 bp. Superscript III was used to produce long cDNAs with full-length ORFs and a proof-reading polymerase (AccuPfx) was employed to amplify the predicted ORFs with high fidelity. Two rounds of PCR can produce enough DNA for Gateway® cloning even for some genes with relatively low expression. PCR products were obtained for 208 of the 223 predicted LRR-RLK genes, while 15 genes were never amplified by RT-PCR (Additional file 2B). All PCR products were agarose gel purified and introduced into pDONR/ZeoR to produce the entry clones. Plasmid DNA from entry clones was then used for LR clonase-mediated in vitro DNA recombination with appropriate destination vectors to yield FLAG and GFP epitope tagged constructs.
Sequence analysis of the isolated LRR-RLKs
A total of 194 cDNA sequences were successfully cloned into the donor vector and are summarized in Table S2 in Additional file 1. Among them, 157 (80.9%) of the clones contain cDNA sequences identical to those predicted in TAIR8 (Additional file 1: Table S3). The other 37 isolated sequences (19.1%) display gene structures that are different from their corresponding predictions in TAIR8 (Additional file 1: Table S4, S5). Based on their structural differences, they can be divided into two groups: (1) one complete ORF exists from the predicted start codon to the predicted stop codon although the coding sequence differs from the prediction (Figure 3); (2) no continuous ORF exists from the predicted start codon to the predicted stop codon because of the different coding sequences (Figure 4). The remaining 29 LRR-RLKs (Additional file 1: Table S6) were not isolated successfully, possibly because of incorrect annotation, specific and/or low expression, or bactericidal effects.
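The two-group division described above can be sketched as a simple reading-frame check. This is a minimal illustration, not the procedure used in the study: it assumes that both the isolated and the predicted sequences are given from the start codon up to, but not including, the stop codon (the cloned inserts were amplified without the stop codon), and it treats a frameshift or any in-frame stop codon as evidence that no continuous ORF exists. The function names and the toy sequences are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def has_continuous_orf(cds):
    """True if cds starts with ATG, is a whole number of codons,
    and contains no in-frame stop codon."""
    seq = cds.upper()
    if not seq.startswith("ATG") or len(seq) % 3 != 0:
        return False
    codons = (seq[i:i + 3] for i in range(0, len(seq), 3))
    return not any(codon in STOP_CODONS for codon in codons)


def classify_clone(isolated, predicted):
    """Label a clone as identical to the prediction, group 1 or group 2."""
    if isolated.upper() == predicted.upper():
        return "identical"
    return "group 1" if has_continuous_orf(isolated) else "group 2"


# Toy sequences (not real LRR-RLK sequences), stop codon omitted.
predicted = "ATGGCTGCTAAAGGT"
print(classify_clone("ATGGCTGCTAAAGGT", predicted))  # same sequence as predicted
print(classify_clone("ATGGCTAAAGGT", predicted))     # one codon shorter, frame kept
print(classify_clone("ATGGCTGTAAAGGT", predicted))   # one base shorter, frame lost
```

A real comparison would also need the genomic sequence to decide which exons and introns changed, as in the alignments of Additional files 3 and 4.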
The first group includes 23 genes (Figure 3, Additional file 1: Table S4). The detailed sequence differences are summarized in Table S7 and the alignments among isolated cDNAs, predicted ORFs and the corresponding genomic DNA sequences are shown in Additional file 3. The isolated sequence of At1g31420 is 3 bp shorter than the prediction. One clone [GenBank:AK226234] with the same sequence as the prediction was found in the database, indicating that this gene has alternatively spliced transcripts (Figure 3A). The isolated sequence of At4g26540 is 6 bp longer than the prediction, and the two Cs at positions 1,412 and 1,484 in the isolated sequence are not found in the Arabidopsis genome (Figure 3B). The isolated sequence of At5g37450 displays two unpredicted exons and one unpredicted intron relative to the predicted sequence (Figure 3C). An unpredicted exon is found in the isolated sequence of At5g45840 (Figure 3D). Two predicted introns in gene At3g56100 are eliminated in the isolated sequence and have become a part of the first exon (Figure 3E). The first predicted exons of At3g24660 and At4g20270 have one unpredicted extra intron each (Figure 3F, G). The predicted 5th exon of At5g14210 contains one extra unpredicted intron, and the intron/exon boundary is also different from that predicted (Figure 3H). Two predicted exons disappear and one unpredicted intron is shown in the predicted 10th exon of At1g51890, and a different intron/exon boundary is also observed (Figure 3I). The isolated sequence of At5g65240 is 30 bp shorter than the prediction because of the different intron/exon boundaries; and a RIKEN clone [GenBank:AY059844] without a continuous ORF from the predicted start codon to the predicted stop codon is available in the database (Figure 3J).
Isolated sequences of the other 13 genes, At1g05700, At1g07560, At1g14390, At1g34110, At1g51880, At1g53430, At2g02780, At3g21340, At4g20940, At4g29180, At5g35390, At5g59650 and At5g59680, show different intron/exon boundaries compared with the predicted sequences, resulting in different mRNA sequences (Figure 3K-W).
The second group contains 14 genes (Figure 4, Additional file 1: Table S5). The detailed sequence differences are summarized in Table S8 in Additional file 1 and the alignments among isolated cDNAs, predicted ORFs and the corresponding genomic DNA sequences are shown in Additional file 4. Unlike the genes in the first group, the isolated sequences in this group do not display continuous ORFs from the predicted start codon to the predicted stop codon that were used to design the forward and reverse PCR primers for Gateway® cloning. The isolated sequences of genes At1g06840, At1g35710, At1g51860, At1g53440, At3g46370 and At5g44700 exhibit different intron/exon boundaries compared to the predicted ORF sequences (Figure 4A-F). Different intron/exon boundaries are also found in the isolated ORF sequences of At1g53420, At1g56120, At5g07150, At1g29730 and At1g56140, with other structural differences (Figure 4G-K). The predicted 6th intron disappears in the isolated ORF sequence of At1g53420 (Figure 4G). The predicted intron 17, exon 17 and exon 18 are merged into exon 18 and the predicted exon 16 is split into exon 16 and exon 17 in the isolated sequence of At1g56120 (Figure 4H). The third predicted exon does not exist in the experimentally derived sequence of At5g07150, and the other six predicted exons are merged into two exons (Figure 4I). The first two exons and the first intron in the prediction of At1g29730 merge into the first exon in the isolated sequence (Figure 4J). The predicted exon 17 is split into exon 17 and exon 18 in the isolated sequence of At1g56140. In the database, a previously isolated sequence [GenBank:BT011697] is different from both the prediction of At1g56140 in TAIR8 and the sequence from this report, losing the sequence from exon 6 to exon 23 and part of exon 24, resulting in a much smaller protein of 184 aa compared to the predicted protein of 1,032 aa (Figure 4K).
Two predicted introns in At2g28970 do not exist in the isolated sequence (Figure 4L). At1g56130 displays an unpredicted intron (Figure 4M). One extra unpredicted intron is shown in At4g29990, and the isolated sequence is different from both the existing sequence [GenBank:X97774] and the TAIR prediction (Figure 4N).
Although the isolated sequences of genes At4g31250 and At5g01950 are the same as the current predictions in TAIR8, the previously reported coding sequences of them are different (Figure 5A, 5B). The predicted exon 1 of At4g31250 is split into exon 1 and exon 2 in sequence AK176245 [GenBank:AK176245] (Figure 5A). Gene At5g01950 has a new annotation in TAIR8. The isolated sequence contains the same ORF as the current prediction, but the first two predicted exons in TAIR5 are arranged as three exons. The existing sequence AK229912 [GenBank: AK229912] shows a different intron/exon boundary between exon 7 and intron 7, resulting in a smaller ORF of 631 amino acids (Figure 5B).
Detection of alternative splicing of LRR-RLKs
Potentially alternatively spliced variants of 38 LRR-RLKs were examined by RT-PCR with variant-specific primers according to the predicted mRNA sequences and previous reports (Figure 6). Isolated cDNA sequences from this study were not examined because they were identified by RT-PCR during the cloning procedure. Isolated cDNA of At4g26540 in this report showed a structure with slight difference from the prediction in database, which made it difficult to examine the sequence difference with variant-specific primers. This gene was not included in the RT-PCR experiment. From inflorescence, 34 variants of 33 LRR-RLKs were confirmed by RT-PCR with expected size of products (Figure 6). No RT-PCR products were obtained from At1g34110, At1g51880, At3g21340, At3g56100 and At4g31250. The previously reported cDNA sequence [GenBank: BT011697] of At1g56140 was not amplified from this study, but the predicted variant sequence [GenBank: NM_104492] of it was confirmed by RT-PCR. Both the previously reported cDNA sequence [GenBank: AY059844] and the predicted mRNA sequence [GenBank: NM_125922] of At5g65240 were confirmed in this study. From leaf, RT-PCR fragments of 35 variants of 34 LRR-RLKs were obtained (Figure 6). No RT-PCR products with expected sizes were obtained for the same genes as in inflorescence except At1g51880 that produced a larger fragment than predicted. No RT-PCR product of previously reported cDNA [GenBank: AK176245] of At4g31250 was recovered. Together, a total of 34 LRR-RLKs were confirmed with alternative splicing of pre-mRNA, including four previously reported cDNA variants of At1g31420 [GenBank: AK226234], At4g29990 [GenBank: X97774], At5g01950 [GenBank: AK229912] and At5g65240 [GenBank: AY059844].
LRR-RLKs phylogenetic analysis
Sequence analyses of isolated LRR-RLKs reported in this paper demonstrate that some of them encode protein sequences distinct from the predictions. This sequence variation and the improved annotation of the Arabidopsis genome make it necessary to re-examine the previously created phylogenies of this superfamily. The previous report suggested 15 subfamilies because the sequences clearly fell into distinct clades. Studies in this report based on the alignment of the full-length amino acid sequences result in a phylogenetic tree similar to that of the previous report, with minor adjustments (Additional file 5). (1) At1g74360, a member of the previously assigned subfamily LRR X, fell into the LRR VII subfamily; (2) two members (At1g35710 and At4g08850) of the previously assigned subfamily LRR XII, one previously ungrouped gene (At2g25790), and one member (At5g51350) of the subfamily LRR XIV fell into the LRR XI subfamily.
Epitope-tagged proteins of LRR-RLKs in transgenic Arabidopsis plants
The expression of LRR-RLKs cloned in the destination vectors pB35GWF and pK35GWG and transformed into Arabidopsis ecotype 'Columbia-0 (Col-0)' was verified by Western hybridization analysis with αFLAG and αGFP antibodies respectively (Figure 7A, B). Immunoprecipitated membrane protein was prepared and separated by SDS-PAGE for the detection of FLAG-tagged fusion proteins while total protein could be used directly to detect signals of GFP-tagged fusion proteins. The FLAG- or GFP-tagged LRR-RLKs could be detected in most of the examined transgenic lines usually as one distinct and specific protein band.
Experimentally derived sequences help to verify and expand the predicted genome annotation
The TAIR annotation release TAIR8 (April, 2008) contains 33,282 genes, including 27,235 putative protein coding genes. Among all the putative protein coding genes, 2,289 genes have not been experimentally supported by identified transcripts. Among the 223 predicted LRR-RLK genes, 30 have no EST support. EST support for 12 of them is now provided (Additional file 1: Table S9). A total of 94 LRR-RLK genes have no isolated full-length coding sequence in the existing database. This study provides 70 new LRR-RLK cDNAs with full-length coding sequences (Additional file 1: Table S10). The resources generated in this study will provide useful tools for future functional analyses of this important protein family. At the same time, phylogenetic analysis can guide researchers in creating double or higher-order mutants to overcome functional redundancy of genes in one subfamily. For example, elegant genetic studies in subfamily LRR II revealed redundant functions of SERK genes in brassinosteroid signal transduction [14, 15, 30, 35], male sporogenesis [21, 35], pathogen response [29, 31, 35] and cell death [30, 32]. Phylogenetic analysis in this study indicated that several subfamilies, such as LRR XII, whose members fell into two different subfamilies based on the phylogeny of full-length amino acid sequences, could be rearranged to guide future functional analysis of their gene members.
The sequence data generated from this project will also greatly improve genome annotation. In a previous study, 5,000 full-length gene transcripts from Arabidopsis were used to re-annotate its genome. The results indicated that the gene structures of approximately 35% of the examined genes could be improved according to the isolated full-length cDNA sequences. When examining existing EST and full-length cDNA sequences for all of the predicted LRR-RLKs, one full-length cDNA, clone RAFL25-47-F19 [GenBank:AK221400], was identified that covered two predicted loci, At1g51830 and At1g51840, with a full and complete ORF of 886 aa. According to these data, the two loci should be merged into one. As described above, from this study a total of 37 genes were identified with different variant transcripts compared to the predictions (Additional file 1: Tables S7, S8). All the data are useful for the improvement of Arabidopsis LRR-RLK annotation.
Gene functions and alternatively spliced transcripts
The TAIR8 release showed that 4,330 of the annotated 27,235 protein coding genes (15.9%) have alternatively spliced transcripts. In this report, sequence analyses show that a total of 37 LRR-RLK genes have sequences different from the TAIR8 predictions. There are two possible explanations: (1) the prediction was not correct; (2) both the predicted and the isolated sequences exist in the plant, which suggests that some LRR-RLK genes have alternatively spliced transcripts, possibly in the same tissue, in different tissues, or under different growth conditions.
The sequence analysis of isolated LRR-RLKs in this report revealed different forms of the CDS compared to TAIR8 predictions or the existing sequences in the database, including alternative intron donor and/or acceptor sites (for example, At1g05700, At4g20940, At5g44700), unpredicted introns (At3g24660, At4g20270, At1g56130, At4g29990), unpredicted exons (At5g45840), unspliced introns (At3g56100, At2g28970) and different combinations of the aforementioned changes. They form a continuous ORF or several discontinuous ORFs. The presence of the observed alternative splicing was further confirmed by RT-PCR (Figure 6). It is already known that alternative splicing can significantly increase the complexity of the transcriptome and proteome by synthesizing multiple transcripts and proteins from one gene. Several previous reports showed that approximately 20% of Arabidopsis genes are alternatively spliced and some alternatively spliced transcripts have different functions [38–42]. Serine/arginine-rich (SR) proteins form a conserved family of splicing regulators in eukaryotes. The pre-mRNAs of Arabidopsis SR genes are extensively alternatively spliced, and about 95 transcripts are produced from 15 genes. The transcriptome complexity of SR genes is increased by six-fold. Abiotic stresses regulate the alternative splicing of the pre-mRNAs of SR genes to produce different isoforms of SR proteins that are likely to have altered functions in pre-mRNA splicing . Six mRNA variants were generated by alternative splicing in the pre-mRNA of a homologue of SR protein, atSR45a. The transcript abundance and the splicing patterns of atSR45a were altered under various types of stress . The U1 small nuclear ribonucleoprotein particle (U1 snRNP) 70K protein (U1-70K) interacts with splicing factors and is involved in basic and alternative splicing of pre-mRNA [43–47]. In Arabidopsis, two distinct transcripts are produced by alternative splicing of the pre-mRNA of the U1 snRNA 70K gene. 
Only the short transcript encodes a full-length functional U1-70K, whereas the long transcript codes for a truncated U1-70K . COP1 is a negative regulator of Arabidopsis light-dependent development. COP1b is generated by alternative splicing, resulting in a 60-amino acid deletion in the WD-40 repeat domain relative to the full-length COP1, which functions as a dominant negative regulator of COP1 function . The maize MIK gene codes for a GCK-like MAP4K that can be activated by interaction with maize atypical receptor kinase (MARK) . Four different mature mRNAs of MIK are generated by alternative splicing, and the resulting polypeptides display different kinase activity and are differentially activated by interaction with the MARK receptor . Recent studies further demonstrated that alternative splicing affected regions frequently code for intrinsically disordered regions of the corresponding protein products and the association of alternative splicing and intrinsic disorder results in various isoforms to increase the functional and regulatory diversity of the gene [51–54].
LRR-RLKs are critical proteins involved in many aspects of plant growth, development and stress responses. Notably, six genes (At1g05700, At1g07560, At1g51880, At4g29180, At5g37450 and At5g44700) produce RT-PCR fragments of different sizes in inflorescence and leaf (Figure 6), which indicates that different forms of LRR-RLK protein may be required for distinct tissue development and function. Some of the alternatively spliced transcripts of LRR-RLKs will generate truncated versions of the predicted proteins. The truncated proteins may be involved in the functional regulation of these genes in different developmental stages and under different growth conditions or stresses. Future functional analyses of the alternatively spliced LRR-RLKs revealed by this study should eventually elucidate the biological meaning of the process.
Gene function and phosphorylation sites analysis
Clones reported in this paper are not only a resource for gene annotation, but will also be very useful for gene function analysis. Genes in entry clones can be transferred freely to any Gateway®-compatible destination vector and introduced into Arabidopsis. They can be used for overexpression in Arabidopsis to dissect the resulting phenotypes, which will indicate the possible related pathways and functions of the target genes. They can be used to generate different epitope-tagged fusion proteins. For example, as described above, GFP- and FLAG-tagged fusion proteins can be produced in transgenic plants with different antibiotic resistances. The subcellular localizations of the proteins of interest can be determined with the help of confocal microscopy. The homodimerization or heterodimerization between LRR-RLKs can be detected and confirmed by coexpression in planta and coimmunoprecipitation analysis. The transgenic plants can also be used to isolate protein complexes for each LRR-RLK, which can help to dissect the complicated signaling pathways that the genes participate in. The cloned genes could be mutated directly by site-directed mutagenesis in entry clones to create kinase-inactive copies. Overexpression of kinase-inactive genes in Arabidopsis will be useful to dissect the functions of the genes by dominant negative effects, especially where functional redundancy complicates their analysis.
In order to clearly understand LRR-RLK function, it is necessary to characterize cytoplasmic kinase domain phosphorylation and examine the role of receptor oligomerization in initiating signaling pathways. The primary goal of this study was to generate resources for our current Arabidopsis 2010 project that is focused on mapping LRR-RLK phosphorylation sites, assessing the functions of the identified sites in plant growth and development, and examining the in vivo interactions of numerous LRR-RLKs. A prototype for this approach has been developed for BRI1 and BAK1, two LRR-RLKs involved in BR signaling. For example, immunoprecipitated BRI1-FLAG protein was analyzed by liquid chromatography-tandem mass spectrometry (LC/MS/MS) and multiple in vivo phosphorylated Ser and Thr residues of BRI1 were identified. T-1049 and S-1044 are highly conserved activation loop residues that were shown to be essential for kinase function in vitro and BRI1 signaling in planta. The interaction of BRI1 and BAK1 was studied in detail both in vitro and in vivo, and a novel mechanism of sequential transphosphorylation was proposed, which helps explain the role of the BAK1 co-receptor in regulating BR signaling through BRI1. This approach, utilizing the resources developed here, is being expanded to examine the mechanisms of action of numerous LRR-RLKs across this important family of regulatory proteins.
This study generated four Gateway®-compatible destination vectors for plant transformation, which were shown to be functional by overexpressing BAK1 to suppress the bri1-5 mutant phenotype. Complementary DNA sequences of 194 Arabidopsis LRR-RLKs were cloned into the Gateway® donor vector pDONR/ZeoR and analyzed by DNA sequencing. A total of 37 isolated LRR-RLKs showed sequences distinct from the database predictions or previously reported sequences. Alternative RNA splicing was observed in some of them and is thought to be involved in the regulation of gene function and plant development. Experimental evidence for the annotation of these LRR-RLKs is provided in this study. The generated cDNA clones, expression constructs and transgenic plants are useful resources for the scientific community and will accelerate research in this field.
Primer design and reverse transcriptase PCR reaction for LRR-RLK cloning
Coding sequences of all the predicted LRR-RLK genes were retrieved from the database (TAIR5 release). Primer pairs for all the genes were designed according to the predicted ORF sequences. The forward primer contained the partial attB1 sequence (5'-AAAAAGCAGGCT-3'), the start codon and 18-28 gene-specific nucleotides thereafter to yield a sequence with a Tm value higher than 55°C. The reverse primer contained the partial attB2 sequence (5'-AGAAAGCTGGGT-3') and 18-28 nucleotides of 3' gene-specific sequence without the stop codon. To keep the cloned sequence in frame with the FLAG and GFP sequences in the vectors, one extra C was added before the gene-specific sequence in the reverse primer.
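The primer design rules above can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the procedure actually used: the text does not say how Tm was calculated or which part of the primer it refers to, so the sketch applies a generic GC-content approximation to the gene-specific portion only, and it takes the shortest gene-specific stretch within the 18-28 nt window that exceeds 55°C. The function names and the toy ORF are hypothetical; the partial attB sequences and the extra C are taken from the text.

```python
ATTB1_PARTIAL = "AAAAAGCAGGCT"  # partial attB1 sequence given in the text
ATTB2_PARTIAL = "AGAAAGCTGGGT"  # partial attB2 sequence given in the text
STOP_CODONS = ("TAA", "TAG", "TGA")
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def estimate_tm(seq):
    """Rough GC-content Tm estimate in degrees C (assumed formula, not from the paper)."""
    gc = seq.count("G") + seq.count("C")
    return 64.9 + 41.0 * (gc - 16.4) / len(seq)


def pick_gene_specific(template, lead=0, min_len=18, max_len=28, min_tm=55.0):
    """Return `lead` fixed nucleotides plus the shortest 18-28 nt stretch whose
    estimated Tm exceeds min_tm; fall back to the longest allowed stretch."""
    for n in range(min_len, max_len + 1):
        candidate = template[:lead + n]
        if estimate_tm(candidate) > min_tm:
            return candidate
    return template[:lead + max_len]


def design_primers(orf):
    """Return (forward, reverse) first-round primers for a predicted ORF.
    Assumes the ORF is much longer than the primers, as real ORFs are."""
    orf = orf.upper()
    if not orf.startswith("ATG") or orf[-3:] not in STOP_CODONS:
        raise ValueError("expected an ORF running from ATG to a stop codon")
    body = orf[:-3]  # drop the stop codon so the tag can be fused in frame
    # Forward: partial attB1 + start codon + 18-28 gene-specific nucleotides.
    forward = ATTB1_PARTIAL + pick_gene_specific(orf, lead=3)
    # Reverse: partial attB2 + extra C + reverse complement of the 3' end.
    reverse = ATTB2_PARTIAL + "C" + pick_gene_specific(reverse_complement(body))
    return forward, reverse


# Toy ORF for demonstration only (not a real LRR-RLK sequence).
fwd, rev = design_primers("ATGGCTGCTAAAGGTCTTGAACCAGGTTCTGATCGTGCAGAACTTGGTCCATAA")
print(fwd)
print(rev)
```

A nearest-neighbor Tm model would be more accurate than the GC-content estimate; the simpler formula is used here only to keep the sketch free of external dependencies.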
Total RNA was extracted from whole plants, inflorescences and roots of Arabidopsis using RNeasy Plant Mini Kit (Qiagen, Valencia, CA). Messenger RNA (mRNA) was isolated from the total RNA by Oligotex mRNA Mini Kit (Qiagen). Either total RNA or mRNA was reverse transcribed into single-stranded cDNA with Superscript III reverse transcriptase (Invitrogen, Carlsbad, CA) in a 40 μl volume. Two rounds of PCR reactions were performed to generate att B-flanked PCR products. The first round of PCR with gene specific primers was processed with the following program: 95°C for 2 min; 30 cycles of 95°C for 15 s, 55°C for 30 s, 68°C for 4 min; 72°C for 10 min. After the first round of PCR reaction was completed, the second round of PCR was performed using att B1 and att B2 adaptors as universal primers containing att B1 and att B2 recombinational cloning sites (att B1 adaptor: 5'-GGGGACAAGTTTGTACAAAAAAGCAGGCT-3'; att B2 adaptor: 5'-GGGGACCACTTTGTACAAGAAAGCTGGGT-3') to incorporate complete att B1 and att B2 sequences into the final PCR products.
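The two-round strategy works because the last 12 nucleotides of each adaptor are identical to the partial att B tail on the corresponding first-round primer. The following Python sketch checks that overlap and adds up the programmed hold times of the first-round cycling program. The function names are hypothetical, and the time estimate ignores ramping between temperatures.

```python
ATTB1_ADAPTOR = "GGGGACAAGTTTGTACAAAAAAGCAGGCT"  # from the text
ATTB2_ADAPTOR = "GGGGACCACTTTGTACAAGAAAGCTGGGT"  # from the text
OVERLAP = 12  # length of the partial att B tail on first-round primers


def extend_with_adaptor(first_round_primer, adaptor, overlap=OVERLAP):
    """Return the 5' end produced when the adaptor primes on the partial tail.

    Raises ValueError if the first-round primer does not start with the
    last `overlap` nucleotides of the adaptor.
    """
    tail = adaptor[-overlap:]
    if not first_round_primer.startswith(tail):
        raise ValueError("primer does not carry the expected partial att B tail")
    return adaptor + first_round_primer[overlap:]


def program_seconds(initial=120, cycles=30, steps=(15, 30, 240), final=600):
    """Sum of hold times for: 95°C 2 min; 30 x (15 s, 30 s, 4 min); 72°C 10 min."""
    return initial + cycles * sum(steps) + final
```

With the default arguments `program_seconds()` gives 9270 s, or about 2 h 35 min of hold time for the first round.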
Gel purification and in vitro DNA cloning
PCR products of all LRR-RLKs were subjected to agarose gel electrophoresis in 1 × TAE buffer. DNA products were purified from the DNA-containing gel slices using the GENECLEAN® Turbo kit (Qbiogene, Irvine, CA) and the PureLink™ Gel Extraction Kit (Invitrogen). Purified PCR products were eluted into 50 μl ddH2O. GatewayR BP clonase-directed in vitro DNA cloning (Invitrogen) was performed between purified DNA and plasmid DNA of the GatewayR donor vector pDONR/ZeoR in a 5 μl volume at room temperature for approximately 16 h. The BP clonase reactions were transformed into E. coli DH5α competent cells, which were incubated overnight at 37°C on Luria Bertani (LB) agar plates containing 50 μg/ml zeocin (Invitrogen) to select positive entry clones. Positive entry clones were picked for further analysis by colony PCR with M13 forward (5'-TGTAAAACGACGGCCAGT-3') and M13 reverse (5'-CAGGAAACAGCTATGACC-3') primers. Entry clones with a positive PCR signal and the correct molecular size were inoculated into 2.5 ml LB broth containing 50 μg/ml zeocin and incubated overnight at 37°C. Plasmid DNA of the entry clones was isolated and analyzed by restriction enzyme digestion. Clones with appropriate insert sizes were selected for further analysis by DNA sequencing.
After sequence verification, plasmid DNA of each entry clone was recombined into the destination vectors pB35GWF, pB35GWG, pK35GWF and pK35GWG (see below) using LR clonase (Invitrogen). The LR reactions were transformed into E. coli DH5α competent cells, which were incubated overnight at 37°C on LB agar plates containing 50 μg/ml kanamycin to select positive expression clones. The recombinants were inoculated into 2.5 ml LB broth containing 50 μg/ml kanamycin and incubated overnight at 37°C. Plasmid DNA of each expression clone was isolated and further analyzed by restriction enzyme digestion.
DNA sequence analysis
All coding sequences in entry clones were sent to High-Throughput Sequencing Solutions (The University of Washington, http://www.htseq.org) for sequence analysis with the M13 forward primer, the M13 reverse primer and gene-specific primers. Sequences from the same clone were manually assembled into contigs with the help of Seqtools (http://www.seqtools.dk/). Contig sequences were compared by BlastN to the Arabidopsis AGI CDS dataset to examine sequence identity and to confirm that the sequences were derived from mRNA of the target genes. The contig sequences were also compared to the AGI whole genome dataset by BlastN. The sequences of the assembled contigs, the corresponding CDS sequences and the genomic sequences were aligned and analyzed with Spidey (http://www.ncbi.nlm.nih.gov/IEB/Research/Ostell/Spidey) and GeneDoc (http://www.nrbsc.org/gfx/genedoc/index.html) to identify introns and exons in the target genes and to view the detailed sequence differences.
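The idea behind locating exons and introns by aligning a cDNA contig to its genomic sequence can be illustrated with a deliberately simple Python sketch. It is not the algorithm used by the alignment tools named above: it assumes an error-free cDNA that matches the genomic strand exactly, uses a greedy seed-and-extend search, and does not check splice-site consensus sequences. The function names and the `seed_len` parameter are hypothetical.

```python
def map_exons(cdna, genomic, seed_len=10):
    """Greedy exact-match mapping of a cDNA onto genomic DNA.

    Returns a list of 0-based, half-open (start, end) genomic coordinates,
    one per matched block (putative exon).
    """
    cdna, genomic = cdna.upper(), genomic.upper()
    exons = []
    c = 0  # current position in the cDNA
    g = 0  # search for the next block at or after this genomic position
    while c < len(cdna):
        seed = cdna[c:c + seed_len]
        start = genomic.find(seed, g)
        if start < 0:
            raise ValueError(f"cDNA position {c} not found in genomic sequence")
        length = len(seed)
        # Extend the match base by base until the sequences diverge.
        while (c + length < len(cdna)
               and start + length < len(genomic)
               and cdna[c + length] == genomic[start + length]):
            length += 1
        exons.append((start, start + length))
        c += length
        g = start + length
    return exons


def introns_between(exons):
    """Gaps between consecutive matched blocks, as (start, end) pairs."""
    return [(prev_end, next_start)
            for (_, prev_end), (next_start, _) in zip(exons, exons[1:])]
```

Comparing the blocks returned for an isolated cDNA with those of the predicted CDS shows where the two gene models differ, for example a retained intron or a shifted splice site. Real data with sequencing errors or short terminal exons would need a proper spliced aligner.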
Determination of LRR-RLK variants with reverse transcriptase PCR
Total RNA from inflorescences (including flowers and siliques) and leaves of four-week-old, soil-grown Arabidopsis Col-0 plants was prepared with the RNeasy Plant Mini Kit (Qiagen). On-column DNase I digestion of the total RNA was performed during RNA purification according to the manufacturer's instructions to eliminate genomic DNA contamination. Ten micrograms of total RNA were reverse transcribed to cDNA with PowerScript reverse transcriptase (Clontech, Mountain View, CA) in a 40 μl volume according to the manufacturer's instructions. An amount of cDNA equivalent to 100 ng of total RNA was used in each primary PCR reaction for the 38 LRR-RLKs that showed potential alternative splicing of pre-mRNA. Nested PCR reactions with variant-specific primers were conducted to increase the sensitivity and specificity with which alternative splicing was detected. The variant-specific primers were carefully designed, for example to flank the alternatively spliced sequence where possible, to further exclude possible genomic DNA contamination. The cDNA sequences generated from this study were not examined by RT-PCR. The primers used are listed in Table S11 in Additional file 1.
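The template input for the primary PCR follows from simple proportion. The short Python sketch below works through it, assuming the reverse transcription reaction is used undiluted and that the RNA input is distributed evenly through the 40 μl volume; the function name is hypothetical.

```python
def cdna_volume_ul(rna_input_ng, rt_volume_ul, rna_equivalent_ng):
    """Volume of undiluted RT reaction that corresponds to a given RNA mass."""
    ng_per_ul = rna_input_ng / rt_volume_ul  # RNA equivalents per microlitre
    return rna_equivalent_ng / ng_per_ul


# 10 micrograms (10,000 ng) of RNA in 40 microlitres, 100 ng equivalent per PCR
volume = cdna_volume_ul(10_000, 40, 100)
```

Under these assumptions the reaction contains 250 ng RNA equivalents per μl, so `volume` is 0.4 μl per primary PCR; in practice the cDNA would usually be diluted first so that a larger volume can be pipetted accurately.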
Sequence alignment and phylogenetic analysis
Full-length cDNA sequences of six previously reported LRR-RLKs (At1g51830 [GenBank:AK221400], At1g75640 [GenBank:AK226809], At3g24240 [GenBank:AJ550163], At4g29990 [GenBank:X97774], At4g39270 [GenBank:AY099851], At5g67200 [GenBank:BT003370]) were retrieved from GenBank. The predicted mRNA sequences of 37 LRR-RLKs without experimentally produced complete coding sequences were retrieved from TAIR. The mRNA sequences of the other 180 genes were from this study. The corresponding protein sequences were then imported into MEGA 4 for multiple sequence alignment by ClustalW and for phylogenetic analysis using the Neighbor-joining and bootstrap methods. The weight matrix used for the ClustalW alignment was BLOSUM, with a gap opening penalty of 10 and a gap extension penalty of 0.2. The bootstrap consensus tree was inferred from 1,000 replicates.
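The bootstrap step resamples alignment columns with replacement and rebuilds a tree from each pseudo-alignment. The Python sketch below illustrates only the resampling, under stated assumptions: the alignment is held as a dictionary of equal-length strings, and the function name and `seed` parameter are hypothetical. Tree construction for each replicate and the consensus itself were done in MEGA 4 and are not reproduced here.

```python
import random


def bootstrap_alignments(aligned, replicates=1000, seed=0):
    """Yield replicate alignments built by sampling columns with replacement.

    `aligned` maps sequence names to aligned sequences of equal length.
    """
    if not aligned:
        raise ValueError("alignment is empty")
    names = list(aligned)
    n_cols = len(aligned[names[0]])
    if any(len(aligned[name]) != n_cols for name in names):
        raise ValueError("all aligned sequences must have the same length")
    rng = random.Random(seed)  # fixed seed makes the replicates reproducible
    for _ in range(replicates):
        # Draw as many column indices as the alignment has columns.
        cols = [rng.randrange(n_cols) for _ in range(n_cols)]
        yield {name: "".join(aligned[name][i] for i in cols) for name in names}
```

Each yielded replicate has the same sequences and length as the input alignment; the fraction of replicate trees that recover a given clade is the bootstrap support for that clade.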
Construction of GatewayR-compatible binary vectors
The mannopine synthase (mas) promoter (Pmas) and the coding region of the glufosinate resistance (BAR) gene were PCR-amplified from pSKI015, and the resulting PCR products were purified and cloned into Hind III/Bam HI digested pBlueScriptSK(+) (Stratagene, La Jolla, CA). Synonymous mutations were introduced into the BAR sequence by site-directed mutagenesis to eliminate all regularly used restriction sites, including Eco RI, Xho I, Sac I, and Kpn I, resulting in pBlueScriptSK(+)-BAR. All site-directed mutagenesis reactions were carried out with PfuUltra™ High-Fidelity DNA Polymerase (Stratagene). After treatment with 10 units of Dpn I for 1 h at 37°C, 2 μl of PCR product was transformed into E. coli DH5α competent cells, and positive colonies were selected on LB agar plates containing 100 μg/ml ampicillin. PCR products of positive colonies were digested with Eco RI, Xho I, Sac I and Kpn I to select those carrying the mutations. After sequence verification, the plasmid DNA of pBlueScriptSK(+)-BAR was used as template to amplify the Pmas and BAR region flanked by Hind III and Bgl II sites. PCR products were digested with Hind III and Bgl II, and cloned into Hind III/Bam HI digested pBIB-HYG-35S. The resulting vector was named pBIB-BASTA-35S; the Bam HI restriction site in this vector was eliminated by the Bam HI/Bgl II ligation. The T-DNA region of the vector was sequenced, and the resistance of transgenic plants to herbicide was confirmed by spraying with Finale (AgrEvo, Montvale, NJ).
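Two points in this construction lend themselves to a quick check in Python: whether a sequence still contains any of the restriction sites that were to be removed, and why ligating a Bam HI end to a Bgl II end leaves a junction that neither enzyme can cut. The sketch below is an illustration only. The recognition sequences are the standard ones for these enzymes, while the function names are hypothetical and only the top strand is searched, which suffices because all six sites are palindromic.

```python
RECOGNITION_SITES = {
    "EcoRI": "GAATTC",
    "XhoI": "CTCGAG",
    "SacI": "GAGCTC",
    "KpnI": "GGTACC",
    "BamHI": "GGATCC",  # cuts G^GATCC, leaving a 5'-GATC overhang
    "BglII": "AGATCT",  # cuts A^GATCT, leaving the same 5'-GATC overhang
}


def find_sites(seq, enzymes):
    """Map each enzyme name to the 0-based start positions of its site."""
    seq = seq.upper()
    hits = {}
    for name in enzymes:
        site = RECOGNITION_SITES[name]
        hits[name] = [i for i in range(len(seq) - len(site) + 1)
                      if seq.startswith(site, i)]
    return hits


def hybrid_junctions():
    """Top-strand junctions formed when BamHI and BglII ends are ligated."""
    overhang = "GATC"
    # BamHI upstream half (G) joined to BglII downstream half (T), and
    # BglII upstream half (A) joined to BamHI downstream half (C).
    return "G" + overhang + "T", "A" + overhang + "C"
```

Running `find_sites` on a mutated BAR sequence with the four listed enzymes should return empty lists for each. The two hybrid junctions, GGATCT and AGATCC, match neither GGATCC nor AGATCT, which is consistent with the loss of the Bam HI site reported for pBIB-BASTA-35S.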
GatewayR-FLAG fragments and GatewayR-GFP fragments were amplified from pEarleyGate 302 and pEarleyGate 103  respectively by AccuPrime™ Pfx DNA Polymerase (Invitrogen). The digested and purified fragments were cloned into the Kpn I/Sac I sites of pBIB-KAN-35S  and pBIB-BASTA-35S to produce GatewayR-compatible binary vectors pK35GWF, pK35GWG, pB35GWF and pB35GWG. The T-DNA regions of the binary vectors were confirmed by DNA sequencing.
Plant materials, growth conditions, transformation and selection
Arabidopsis Col-0 plants were grown at 22°C in a long-day condition (16 h of light and 8 h of dark) in the greenhouse. The floral dip method  was used to transform wild-type Arabidopsis and bri1-5 mutant plants . Agrobacterium tumefaciens strain GV3101 containing each target construct was grown at 30°C for 30 h to the stationary phase. Cells were then harvested by centrifugation and resuspended in two volumes of water with 5% (w/v) sucrose and 0.03% (v/v) Silwet L-77 (Lehle Seeds, Round Rock, TX). Healthy and vigorously growing inflorescences of Arabidopsis were immersed in the above A. tumefaciens suspension for 30 sec for gene transformation. After treatment, plants were kept in covered flats for 1 day. All the seeds subjected to screening were treated at 4°C for 3 d before being sown on soil or agar plates. Seeds from plants dipped with constructs containing the glufosinate resistance (BAR) gene were sown directly on soil and sprayed with 1.5:1,000 (v/v) commercially available Finale (AgrEvo) in water to screen for transgenic plants with herbicide resistance. Seeds from plants dipped with constructs containing neomycin phosphotransferase II (NPTII) were grown on ½ Murashige and Skoog medium (MS) plates  with 50 μg/ml kanamycin, 0.6% (w/v) agar and 1% (w/v) sucrose to obtain transgenic plants with kanamycin resistance. After about 10 days on agar plates, the selected kanamycin resistant individuals were transplanted to soil.
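The dipping solution is specified as percentages, which convert to amounts as follows. This small Python sketch assumes the usual definitions (% w/v as grams per 100 ml, % v/v as millilitres per 100 ml); the function name and the 500 ml example volume are hypothetical and not taken from the text.

```python
def dipping_solution(volume_ml, sucrose_pct_wv=5.0, silwet_pct_vv=0.03):
    """Return (grams of sucrose, microlitres of Silwet L-77) for a volume."""
    sucrose_g = sucrose_pct_wv / 100 * volume_ml          # g per 100 ml
    silwet_ul = silwet_pct_vv / 100 * volume_ml * 1000    # ml converted to µl
    return sucrose_g, silwet_ul


sucrose_g, silwet_ul = dipping_solution(500)
```

For 500 ml this comes to about 25 g of sucrose and 150 μl of Silwet L-77.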
Western hybridization analyses
Transgenic plants harboring GFP fusion proteins were harvested after 3 weeks of growth in soil. Total proteins from leaves were prepared for Western hybridization. Membrane proteins were extracted from 11 d seedlings grown in shaking liquid culture and subjected to immunoprecipitation of FLAG-tagged fusion proteins as previously described [14, 33]. Protein samples were separated on 7.5% (w/v) SDS-PAGE gel. Western hybridization analyses with GFP or FLAG antibodies were performed as previously described [14, 33].
Sequence data from this study can be found in the GenBank database under accession numbers: FJ708625-FJ708818.
Walker JC, Zhang R: Relationship of a putative receptor protein kinase from maize to the S-locus glycoproteins of Brassica. Nature. 1990, 345 (6277): 743-746. 10.1038/345743a0.
Becraft PW: Receptor kinase signaling in plant development. Annu Rev Cell Dev Bi. 2002, 18: 163-192. 10.1146/annurev.cellbio.18.012502.083431.
Cock JM, Vanoosthuyse V, Gaude T: Receptor kinase signalling in plants and animals: distinct molecular systems with mechanistic similarities. Curr Opin Cell Biol. 2002, 14 (2): 230-236. 10.1016/S0955-0674(02)00305-8.
Morris ER, Walker JC: Receptor-like protein kinases: the keys to response. Curr Opin Plant Biol. 2003, 6 (4): 339-342. 10.1016/S1369-5266(03)00055-4.
Dievart A, Clark SE: LRR-containing receptors regulating plant development and defense. Development. 2004, 131 (2): 251-261. 10.1242/dev.00998.
Torii KU: Leucine-rich repeat receptor kinases in plants: structure, function, and signal transduction pathways. Int Rev Cytol. 2004, 234: 1-46.
Morillo SA, Tax FE: Functional analysis of receptor-like kinases in monocots and dicots. Curr Opin Plant Biol. 2006, 9 (5): 460-469. 10.1016/j.pbi.2006.07.009.
Afzal AJ, Wood AJ, Lightfoot DA: Plant receptor-like serine threonine kinases: roles in signaling and plant defense. Mol Plant Microbe In. 2008, 21 (5): 507-517. 10.1094/MPMI-21-5-0507.
Arabidopsis Genome Initiative: Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000, 408 (6814): 796-815. 10.1038/35048692.
Shiu SH, Bleecker AB: Receptor-like kinases from Arabidopsis form a monophyletic gene family related to animal receptor kinases. P Natl Acad Sci USA. 2001, 98 (19): 10763-10768. 10.1073/pnas.181141598.
Torii KU, Mitsukawa N, Oosumi T, Matsuura Y, Yokoyama R, Whittier RF, Komeda Y: The Arabidopsis ERECTA gene encodes a putative receptor protein kinase with extracellular leucine-rich repeats. Plant Cell. 1996, 8 (4): 735-746. 10.1105/tpc.8.4.735.
Clark SE, Williams RW, Meyerowitz EM: The CLAVATA1 gene encodes a putative receptor kinase that controls shoot and floral meristem size in Arabidopsis. Cell. 1997, 89 (4): 575-585. 10.1016/S0092-8674(00)80239-1.
Li J, Chory J: A putative leucine-rich repeat receptor kinase involved in brassinosteroid signal transduction. Cell. 1997, 90 (5): 929-938. 10.1016/S0092-8674(00)80357-8.
Li J, Wen J, Lease KA, Doke JT, Tax FE, Walker JC: BAK1, an Arabidopsis LRR receptor-like protein kinase, interacts with BRI1 and modulates brassinosteroid signaling. Cell. 2002, 110 (2): 213-222. 10.1016/S0092-8674(02)00812-7.
Nam KH, Li J: BRI1/BAK1, a receptor kinase pair mediating brassinosteroid signaling. Cell. 2002, 110 (2): 203-212. 10.1016/S0092-8674(02)00814-0.
Jinn TL, Stone JM, Walker JC: HAESA, an Arabidopsis leucine-rich repeat receptor kinase, controls floral organ abscission. Gene Dev. 2000, 14 (1): 108-117.
Gomez-Gomez L, Boller T: FLS2: an LRR receptor-like kinase involved in the perception of the bacterial elicitor flagellin in Arabidopsis. Mol Cell. 2000, 5 (6): 1003-1011. 10.1016/S1097-2765(00)80265-8.
Clay NK, Nelson T: VH1, a provascular cell-specific receptor kinase that influences leaf cell patterns in Arabidopsis. Plant Cell. 2002, 14 (11): 2707-2722. 10.1105/tpc.005884.
Zhao DZ, Wang GF, Speal B, Ma H: The EXCESS MICROSPOROCYTES1 gene encodes a putative leucine-rich repeat receptor protein kinase that controls somatic and reproductive cell fates in the Arabidopsis anther. Gene Dev. 2002, 16 (15): 2021-2031. 10.1101/gad.997902.
Albrecht C, Russinova E, Hecht V, Baaijens E, de Vries S: The Arabidopsis thaliana SOMATIC EMBRYOGENESIS RECEPTOR-LIKE KINASES1 and 2 control male sporogenesis. Plant Cell. 2005, 17 (12): 3337-3349. 10.1105/tpc.105.036814.
Colcombet J, Boisson-Dernier A, Ros-Palau R, Vera CE, Schroeder JI: Arabidopsis SOMATIC EMBRYOGENESIS RECEPTOR KINASES1 and 2 are essential for tapetum development and microspore maturation. Plant Cell. 2005, 17 (12): 3350-3361. 10.1105/tpc.105.036731.
Hong SW, Jon JH, Kwak JM, Nam HG: Identification of a receptor-like protein kinase gene rapidly induced by abscisic acid, dehydration, high salt, and cold treatments in Arabidopsis thaliana. Plant Physiol. 1997, 113 (4): 1203-1212. 10.1104/pp.113.4.1203.
Osakabe Y, Maruyama K, Seki M, Satou M, Shinozaki K, Yamaguchi-Shinozaki K: Leucine-rich repeat receptor-like kinase1 is a key membrane-bound regulator of abscisic acid early signaling in Arabidopsis. Plant Cell. 2005, 17 (4): 1105-1119. 10.1105/tpc.104.027474.
Nodine MD, Yadegari R, Tax FE: RPK1 and TOAD2 are two receptor-like kinases redundantly required for arabidopsis embryonic pattern formation. Dev Cell. 2007, 12 (6): 943-956. 10.1016/j.devcel.2007.04.003.
Fisher K, Turner S: PXY, a receptor-like kinase essential for maintaining polarity during plant vascular-tissue development. Curr Biol. 2007, 17 (12): 1061-1066. 10.1016/j.cub.2007.05.049.
Tsuwamoto R, Fukuoka H, Takahata Y: GASSHO1 and GASSHO2 encoding a putative leucine-rich repeat transmembrane-type receptor kinase are essential for the normal development of the epidermal surface in Arabidopsis embryos. Plant J. 2008, 54 (1): 30-42. 10.1111/j.1365-313X.2007.03395.x.
Gao M, Wang X, Wang D, Xu F, Ding X, Zhang Z, Bi D, Cheng YT, Chen S, Li X: Regulation of cell death and innate immunity by two receptor-like kinases in Arabidopsis. Cell Host Microbe. 2009, 6 (1): 34-44. 10.1016/j.chom.2009.05.019.
Godiard L, Sauviac L, Torii KU, Grenon O, Mangin B, Grimsley NH, Marco Y: ERECTA, an LRR receptor-like kinase protein controlling development pleiotropically affects resistance to bacterial wilt. Plant J. 2003, 36 (3): 353-365. 10.1046/j.1365-313X.2003.01877.x.
Chinchilla D, Zipfel C, Robatzek S, Kemmerling B, Nurnberger T, Jones JD, Felix G, Boller T: A flagellin-induced complex of the receptor FLS2 and BAK1 initiates plant defence. Nature. 2007, 448 (7152): 497-500. 10.1038/nature05999.
He K, Gou X, Yuan T, Lin H, Asami T, Yoshida S, Russell SD, Li J: BAK1 and BKK1 regulate brassinosteroid-dependent growth and brassinosteroid-independent cell-death pathways. Curr Biol. 2007, 17 (13): 1109-1115. 10.1016/j.cub.2007.05.036.
Heese A, Hann DR, Gimenez-Ibanez S, Jones AM, He K, Li J, Schroeder JI, Peck SC, Rathjen JP: The receptor-like kinase SERK3/BAK1 is a central regulator of innate immunity in plants. P Natl Acad Sci USA. 2007, 104 (29): 12217-12222. 10.1073/pnas.0705306104.
Kemmerling B, Schwedt A, Rodriguez P, Mazzotta S, Frank M, Qamar SA, Mengiste T, Betsuyaku S, Parker JE, Mussig C: The BRI1-Associated Kinase 1, BAK1, has a brassinolide-independent role in plant cell-death control. Curr Biol. 2007, 17 (13): 1116-1122. 10.1016/j.cub.2007.05.046.
Wang X, Goshe MB, Soderblom EJ, Phinney BS, Kuchar JA, Li J, Asami T, Yoshida S, Huber SC, Clouse SD: Identification and functional analysis of in vivo phosphorylation sites of the Arabidopsis BRASSINOSTEROID-INSENSITIVE1 receptor kinase. Plant Cell. 2005, 17 (6): 1685-1703. 10.1105/tpc.105.031393.
Wang X, Kota U, He K, Blackburn K, Li J, Goshe MB, Huber SC, Clouse SD: Sequential transphosphorylation of the BRI1/BAK1 receptor kinase complex impacts early events in brassinosteroid signaling. Dev Cell. 2008, 15 (2): 220-235. 10.1016/j.devcel.2008.06.011.
Albrecht C, Russinova E, Kemmerling B, Kwaaitaal M, de Vries SC: Arabidopsis SOMATIC EMBRYOGENESIS RECEPTOR KINASE proteins serve brassinosteroid-dependent and -independent signaling pathways. Plant Physiol. 2008, 148 (1): 611-619. 10.1104/pp.108.123216.
Seki M, Narusaka M, Kamiya A, Ishida J, Satou M, Sakurai T, Nakajima M, Enju A, Akiyama K, Oono Y: Functional annotation of a full-length Arabidopsis cDNA collection. Science. 2002, 296 (5565): 141-145. 10.1126/science.1071006.
Haas BJ, Volfovsky N, Town CD, Troukhan M, Alexandrov N, Feldmann KA, Flavell RB, White O, Salzberg SL: Full-length messenger RNA sequences greatly improve genome annotation. Genome Biol. 2002, 3 (6): RESEARCH0029. 10.1186/gb-2002-3-6-research0029.
Zhou DX, Kim YJ, Li YF, Carol P, Mache R: COP1b, an isoform of COP1 generated by alternative splicing, has a negative effect on COP1 function in regulating light-dependent seedling development in Arabidopsis. Mol Gen Genet. 1998, 257 (4): 387-391. 10.1007/s004380050662.
Iida K, Seki M, Sakurai T, Satou M, Akiyama K, Toyoda T, Konagaya A, Shinozaki K: Genome-wide analysis of alternative pre-mRNA splicing in Arabidopsis thaliana based on full-length cDNA sequences. Nucleic Acids Res. 2004, 32 (17): 5096-5103. 10.1093/nar/gkh845.
Wang BB, Brendel V: Genomewide comparative analysis of alternative splicing in plants. P Natl Acad Sci USA. 2006, 103 (18): 7175-7180. 10.1073/pnas.0602039103.
Palusa SG, Ali GS, Reddy AS: Alternative splicing of pre-mRNAs of Arabidopsis serine/arginine-rich proteins: regulation by hormones and stresses. Plant J. 2007, 49 (6): 1091-1107. 10.1111/j.1365-313X.2006.03020.x.
Tanabe N, Yoshimura K, Kimura A, Yabuta Y, Shigeoka S: Differential Expression of Alternatively Spliced mRNAs of Arabidopsis SR Protein Homologs, atSR30 and atSR45a, in Response to Environmental Stress. Plant Cell Physiol. 2007, 48 (7): 1036-1049. 10.1093/pcp/pcm069.
Wu JY, Maniatis T: Specific interactions between proteins implicated in splice site selection and regulated alternative splicing. Cell. 1993, 75 (6): 1061-1070. 10.1016/0092-8674(93)90316-I.
Kohtz JD, Jamison SF, Will CL, Zuo P, Luhrmann R, Garcia-Blanco MA, Manley JL: Protein-protein interactions and 5'-splice-site recognition in mammalian mRNA precursors. Nature. 1994, 368 (6467): 119-124. 10.1038/368119a0.
Manley JL, Tacke R: SR proteins and splicing control. Gene Dev. 1996, 10 (13): 1569-1579. 10.1101/gad.10.13.1569.
Golovkin M, Reddy AS: The plant U1 small nuclear ribonucleoprotein particle 70K protein interacts with two novel serine/arginine-rich proteins. Plant Cell. 1998, 10 (10): 1637-1648. 10.1105/tpc.10.10.1637.
Golovkin M, Reddy AS: An SC35-like protein and a novel serine/arginine-rich protein interact with Arabidopsis U1-70K protein. J Biol Chem. 1999, 274 (51): 36428-36438. 10.1074/jbc.274.51.36428.
Golovkin M, Reddy AS: Structure and expression of a plant U1 snRNP 70K gene: alternative splicing of U1 snRNP 70K pre-mRNAs produces two different transcripts. Plant Cell. 1996, 8 (8): 1421-1435. 10.1105/tpc.8.8.1421.
Llompart B, Castells E, Rio A, Roca R, Ferrando A, Stiefel V, Puigdomenech P, Casacuberta JM: The direct activation of MIK, a germinal center kinase (GCK)-like kinase, by MARK, a maize atypical receptor kinase, suggests a new mechanism for signaling through kinase-dead receptors. J Biol Chem. 2003, 278 (48): 48105-48111. 10.1074/jbc.M307482200.
Castells E, Puigdomenech P, Casacuberta JM: Regulation of the kinase activity of the MIK GCK-like MAP4K by alternative splicing. Plant Mol Biol. 2006, 61 (4-5): 747-756. 10.1007/s11103-006-0046-3.
Cortese MS, Uversky VN, Dunker AK: Intrinsic disorder in scaffold proteins: getting more from less. Prog Biophys Mol Biol. 2008, 98 (1): 85-106. 10.1016/j.pbiomolbio.2008.05.007.
Dunker AK, Oldfield CJ, Meng J, Romero P, Yang JY, Chen JW, Vacic V, Obradovic Z, Uversky VN: The unfoldomics decade: an update on intrinsically disordered proteins. BMC Genomics. 2008, 9 (Suppl 2): S1. 10.1186/1471-2164-9-S2-S1.
Romero PR, Zaidi S, Fang YY, Uversky VN, Radivojac P, Oldfield CJ, Cortese MS, Sickmeier M, LeGall T, Obradovic Z: Alternative splicing in concert with protein intrinsic disorder enables increased functional diversity in multicellular organisms. P Natl Acad Sci USA. 2006, 103 (22): 8390-8395. 10.1073/pnas.0507916103.
Uversky VN, Oldfield CJ, Dunker AK: Intrinsically disordered proteins in human diseases: introducing the D2 concept. Ann Rev Biophys. 2008, 37: 215-246. 10.1146/annurev.biophys.37.032807.125924.
Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24 (8): 1596-1599. 10.1093/molbev/msm092.
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R: Clustal W and Clustal X version 2.0. Bioinformatics. 2007, 23 (21): 2947-2948. 10.1093/bioinformatics/btm404.
Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4 (4): 406-425.
Felsenstein J: Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985, 39 (4): 783-791. 10.2307/2408678.
Weigel D, Ahn JH, Blazquez MA, Borevitz JO, Christensen SK, Fankhauser C, Ferrandiz C, Kardailsky I, Malancharuvil EJ, Neff MM: Activation tagging in Arabidopsis. Plant Physiol. 2000, 122 (4): 1003-1013. 10.1104/pp.122.4.1003.
Becker D: Binary vectors which allow the exchange of plant selectable markers and reporter genes. Nucleic Acids Res. 1990, 18 (1): 203. 10.1093/nar/18.1.203.
Earley KW, Haag JR, Pontes O, Opper K, Juehne T, Song K, Pikaard CS: Gateway-compatible vectors for plant functional genomics and proteomics. Plant J. 2006, 45 (4): 616-629. 10.1111/j.1365-313X.2005.02617.x.
Clough SJ, Bent AF: Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 1998, 16 (6): 735-743. 10.1046/j.1365-313x.1998.00343.x.
Noguchi T, Fujioka S, Choe S, Takatsuto S, Yoshida S, Yuan H, Feldmann KA, Tax FE: Brassinosteroid-insensitive dwarf mutants of Arabidopsis accumulate brassinosteroids. Plant Physiol. 1999, 121 (3): 743-752. 10.1104/pp.121.3.743.
Murashige T, Skoog F: A revised medium for rapid growth and bio assays with tobacco tissue cultures. Physiol Plantarum. 1962, 15 (3): 473-497. 10.1111/j.1399-3054.1962.tb08052.x.
We thank The Arabidopsis Biological Resource Center (ABRC) for providing DNA stocks (DQ446880, DQ459169) for two LRR-RLKs (At4g29450, At1g51810). We acknowledge Tina Lee and James M. Jones for their technical assistance. This study was supported by National Science Foundation (NSF) grant MCB-0419819 (to S.D. Clouse, J. Li, S. Huber, and M. Goshe).
JL supervised the project in which the experiments were carried out. XG, JL and SDC designed the experiments. XG and JL made the GatewayR-compatible destination vectors for Arabidopsis transformation. XG performed the molecular cloning and sequence analysis of LRR-RLKs. XG, KH, HY and TY extracted plasmid DNA and made glycerol stocks. Arabidopsis transformation and Western hybridization analyses were performed by XG, KH and HY. HL partially participated in the experiments. XG, JL and SDC wrote the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Supplemental tables and related references. Additional file 1 contains Tables S1-S11 and references cited in Table S1. Supplemental Table S1. Arabidopsis LRR-RLKs with known functions. Supplemental Table S2. Summary of isolated LRR-RLKs. Supplemental Table S3. Isolated LRR-RLKs with the same structure as predicted in TAIR8. Supplemental Table S4. Isolated LRR-RLKs with different coding sequences and one continuous ORF. Supplemental Table S5. Isolated LRR-RLKs with different coding sequences and no continuous ORF. Supplemental Table S6. Uncloned LRR-RLKs. Supplemental Table S7. Detailed sequence information of isolated LRR-RLKs with one continuous ORF showing sequence differences. Supplemental Table S8. Detailed sequence information of isolated LRR-RLKs without continuous ORF. Supplemental Table S9. Isolated LRR-RLKs without EST sequence in TAIR database. Supplemental Table S10. Isolated LRR-RLKs without full-length coding sequence in TAIR database. Supplemental Table S11. Primers used to detect alternative splicing of LRR-RLKs. (DOC 285 KB)
Additional file 2: Cloning strategy and results. (a) Target LRR-RLK sequences without stop codons are RT-PCR amplified, agarose gel purified and recombined with the pDONR/ZeoR vector by BP clonase to create pENTR-LRR-RLK entry clones. Final expression constructs are created by performing LR clonase-mediated DNA recombination between the pENTR-LRR-RLK clones and the destination vectors that contain GFP or FLAG epitope tags. (b) The cloning results of the predicted LRR-RLKs in Arabidopsis. (TIFF 323 KB)
Additional file 3: Sequence alignments of isolated LRR-RLKs displaying different coding sequences and containing one continuous ORF. Corresponding genomic DNA sequences, predicted mRNA sequences, previously reported cDNA sequences (if available), and isolated cDNA sequences obtained from this report for each LRR-RLK were aligned. Sequences with differences are indicated with red boxes. (RTF 1 MB)
Additional file 4: Sequence alignments of isolated LRR-RLKs showing different coding sequences but not containing one continuous ORF. Corresponding genomic DNA sequences, predicted mRNA sequences, previously reported cDNA sequences (if available), and isolated cDNA sequences obtained from this report for each LRR-RLK were aligned. Sequences with differences are indicated with red boxes. (RTF 1009 KB)