Comparative genomics analysis of WAK/WAKL family in Rosaceae identify candidate WAKs involved in the resistance to Botrytis cinerea
BMC Genomics volume 24, Article number: 337 (2023)
Wall associated kinase (WAK) and WAK-like (WAKL) are typical pattern recognition receptors act as the first sentry of plant defense. But little of WAK/WAKL family is known in Rosaceae.
In this study, 131 WAK/WAKL genes from apple, peach and strawberry were identified using a bioinformatics approach. Together with 68 RcWAK/RcWAKL in rose, we performed a comparative analysis of 199 WAK/WAKL in four Rosaceae crops. The phylogenetic analysis divided all the WAK/WAKL into five clades. Among them, the cis-elements of Clade II and Clade V promoters were enriched in jasmonic acid (JA) signaling and abiotic stress, respectively. And this can also be verified by the rose transcriptome responding to different hormone treatments. WAK/WAKL families have experienced a considerable proportion of purifying selection during evolution, but still 26 amino acid sites evolved under positive selection, which focused on extracellular conserved domains. WAK/WAKL genes presented collinearity relationship within and between crops, throughout four crops we mined four orthologous groups (OGs). The WAK/WAKL genes in OG1 and OG4 were speculated to involve in plant-Botrytis cinerea interaction, which were validated in rose via VIGS as well as strawberry by qRT-PCR.
These results not only provide genetic resources and valuable information for the evolutionary relationship of WAK/WAKL gene family, but also offer a reference for future in-depth studies of Rosaceae WAK/WAKL genes.
In the process of growth and development, plant must respond rapidly to environmental changes and various biotic and abiotic stresses . Receptor-like kinases (RLKs) are a class of protein that widely located on the surface of plant cell and usually contain extracellular signal-sensing and intracellular kinase structures. RLKs play an important role in transducing extracellular signals by perceiving changes of polysaccharides, proteins, lipids and other ligands [2, 3]. RLKs are ubiquitous in plant species, including no less than 17 subfamilies [4, 5]. Wall associated kinase (WAK) and WAK-like (WAKL) is one subfamily of RLKs .
With at least one transmembrane domain, WAK/WAKL proteins have an intracellular domain of serine/threonine kinase (PKinase) at the C-terminus and complex N-terminal extracellular domains, including calcium-binding EGF domain (EGF_CA) and galacturonic binding domain (GUB_WAK_bind). EGF_CA is composed by about 40 residues of epidermal growth factor (EGF). The N-terminal of EGF protein is often accompanied by a calcium-binding site, and calcium is required for their biological functions . The cysteine-rich GUB_WAK_bind domain is a characteristic structure of WAK/WAKL, which interacts with homogalacturonans, the pectin fragment of plant cell wall . It has been shown that WAK/WAKL family plays an important role in effective intracellular and extracellular communication, regulating root growth , stem strength  and disease resistance [11, 12].
The polyphagous necrotrophic fungi Botrytis cinerea is one of the major pathogens causing gray mold disease, with 1400 known hosts covering 586 plant genera . Global economic losses caused by B. cinerea can reach hundred-billions of dollars annually . B. cinerea has been listed as the second largest plant disease in the world , and is also the main postharvest disease of rosaceous crops (such as strawberries and roses) . Arabidopsis AtWAK1 was the first reported recognition receptor for B. cinerea .
Rosaceae crops, especially apple, peach, strawberry and rose, are extremely susceptible to B. cinerea, however the systematic comparative analysis of WAK/WAKL genes in Rosaceae has not been reported. In this study, we compared the structure, phylogenetic relationships, selection pressure and collinearity relationships of the WAK/WAKL gene family in four Rosaceae crops, including apple (Malus domestica), peach (Prunus persica), strawberry (Fragaria vesca) and rose (Rosa chinensis). Our results will provide useful information and theoretical support for further study of the WAK/WAKL biological functions in Rosaceae.
Materials and methods
Identification of the WAK/WAKL genes in Rosaceae crops
The information of Rosaceae genome were obtained from Genome Database for Rosaceae, containing gddh13 v1.1 for Apple (Malus x domestica) , Fragaria vesca v4.0.a1 for strawberry (Fragaria vesca)  and Prunus persica v2.0 for peach (Prunus persica) [20, 21]. Firstly, to identify the nonredundant WAK genes, the HMM files (EGF_ CA, PF07645.18; GUB_WAK_ bind, PF13947.9; PKinase_Tyr_ser, PF07714.20) generated from protein families (Pfam) website were used for hmmsearch with e-value < 1e− 3. All candidate WAK/WAKL members were validated transmembrane helix (TMhelix) by TMHMM-2.0 and signal peptide (SignalP) by SignalP-5.0 (https://services.healthtech.dtu.dk). Finally, we selected the WAK/WAKL members with SignalP, EGF_CA or GUB_WAK_bind, TMhelix and PKinase.
Phylogenetic analysis of the WAK/WAKL in Rosaceae
WAKs/WAKLs protein sequences were aligned by algorithm ClustalW in MGEA7 with default parameters . Then use neighbor joining (JFF + G model) with 1000 bootstrap replicates to construct the phylogenetic tree.
Domain position of Pkinase, GUB_WAK_Bind and EGF_CA was generated from conserved domain database (CDD) (https://www.ncbi.nlm.nih.gov/cdd) and Pfam (http://pfam-legacy.xfam.org) online predicting and removing redundant results before visualization, TMhelix and SignalP are directly based on the online prediction outputs of Technical University of Denmark, and gene structure is extracted from GFF/GTF files from their respect genomes. Finally, the amino acid sites of the domain were used as input files in TBtools . For amino acid sequence logo exhibition, WebLogo3 online application was performed .
Selective pressure analysis
PAML4.9j was used for selective pressure analysis by branch model and site model . The branch model considers the ω (dN/dS) value represented the adaptive evolution between every end branch in phylogenetic tree. Site model assumes various selection pressures at different codons. M0/M3 is used to detect the consistency of ω-ratios between sites. The positively selected sites were identified by M1a/M2a and M7/M8. When positive selection is detected, M2a and M8 can be used to further identify the amino acid sites via Bayes Empirical Bayes (BEB) algorithm.
Promoter region analysis of the WAK/WAK in Rosaceae
2000 bp promoter sequences of Rosaceae WAK/WAKL were extracted from DNA sequence before translation initiation codon (ATG). Cis-elements were forecasted on PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/). Conventional elements such as: short function, core promoter element around − 30 of transcription start, common cis-element in promoter and enhancer regions were deleted. Other elements in the output file were manually sorted and simplified based on previous classification rules [26,27,28] (Supplementary Table 1).
Heat map analysis
RNA-seq data were obtained from rose petals treated with a variety of plant hormones , including abscisic acid (ABA) treatment, auxin (2,4D), cytokinin (6-BA), gibberellin (GA3), jasmonic acid (JA) and salicylic acid (SA). The fragments per kilobase per million reads (FPKM) values which measure gene expression abundance were converted by min-max normalization between different hormone treatments to insulate the heat maps from extreme values.
The paralogous and orthologous analysis of WAK/WAKL in Rosaceae
Selection of paralogous genes and orthologous genes share a common set of parameters, determination of collinearity is proceeded by MCscanX  with default parameters. Selecting collinear blocks are conducted according to the following parameters: match score: 50, match size: 5, overlap window: 5, e-value: 1e and max gaps: 25. The orthologous groups are selected manually through the MCscanX output file, and the visualization is displayed by TBtools software.
Virus induced gene silencing (VIGS)
A 249 bp specific fragment of RcWAK8 was cloned from rose petal cDNA and finally inserted to TRV2 vector (TRV2-RcWAK8) . Then, take down the outer petals of rose at stage 2 of flower opening , and make a 12.5-mm diameter petal disc with a hole punch in the middle of the petals. Agrobacterium tumefaciens containing TRV1 and recombinant TRV2 vector were mixed in a ratio of 1:1, and vacuum suction was used to make agrobacterium invade plant tissues. To ensure a stable phenotype, the VIGS assay was repeated at least three times with at least 48 petals each time.
All the constructs and plant materials used in this study are freely available from the corresponding author, for research purposes. There no special permissions are necessary to collect item.
B. cinerea inoculation and qRT-PCR analysis
The method for B. cinerea cultivation has been described previously . Briefly, the B. cinerea strain B05.10 for strawberry and rose petal discs was grown on potato dextrose agar (PDA) at room temperature for two weeks. The spores were harvesting with deionized water and then suspending in potato dextrose broth (PDB) to a final concentration of 1 × 105 conidia/mL.
For strawberry (Fragaria vesca) inoculation, 5-µL drops of B. cinerea inoculum or PDB (mock) were dropped onto the ripe fruits, and the fruits were placed on petri dishes with wet filter paper to ensure 100% humidity. All the samples were harvested at 48 h post inoculation (hpi) with four biological repeats and the fruits were immediately frozen in liquid nitrogen and stored at -80 °C.
For detached rose petal discs, six days after recombinant tobacco rattle virus (TRV) vector’s infection, petal discs were inoculated with 2 µL spore suspension of B. cinerea. All the petal disc were placed on petri dishes with 0.4% agar (v/v) to ensure 100% humidity. All samples were photographed at 60 hpi, and the lesion area was measured using the ImageJ. Finally, they were immediately frozen in liquid nitrogen and stored at -80 °C.
RNA extraction was carried out with the E.Z.N.A Plant RNA Kit (OMEGA), reference plant RNA difficult sample protocol. cDNA was generated using Takara Reverse Transcriptase M-MLV, and 1 µL of the first strand cDNA was used as a template in the reaction with the KAPATM SYBRR quantitative PCR kit (Takara), which was run on a StepOnePlus Real-Time PCR System (Thermo Fisher Scientific). FvACTIN and RcUBI was used as a housekeeping gene. The primers used for determining transcript abundance are listed in Supplementary Table 2.
All the study protocols must comply with relevant institutional, national, and international guidelines and legislation.
Identification and classification of Rosaceae WAK/WAKL gene family
As previously stated, PKinase (PF07714.20), GUB_WAK_bind (PF13947.9), EGF_CA (PF07645.18) are typical domains for WAK/WAKL members, in addition signal peptide (signalP) and transmembrane helix (TMhelix) were necessary. The genes contained three typical domains were named WAK, the ones without GUB_WAK_bind or EGF_CA were named WAKL. Based on these rules, 68 RcWAK/RcWAKL family members have been identified in rose (Rosa chinensis) . In this study, a total of 131 non-redundant Rosaceae WAK/WAKL genes were identified (Fig. 1), including 36 in strawberry (14 FvWAKs, 22 FvWAKLs), 35 in apple (9 MdWAKs, 26 MdWAKLs), 60 in peach (12 PpWAKs, 48 PpWAKLs) (Supplementary Fig. 1). Their gene name, accession number, the number of introns and exons, the length of coding sequence (CDS) and amino acid sequence, as well as the chromosome location were shown in the Supplementary Table 3 ~ 5. All WAK/WAKL genes were named according to their order on the chromosomes. Among them, PpWAKL5 had the shortest CDS coding 408 aa, and FvWAK12 was the longest one coding 2266 aa. The mean of CDS length was 2341 bp and the average amino acid sequence length was 717 aa among all 131 WAK/WAKL.
In summary, a total of 199 Rosaceae WAK/WAKL were obtained from four Rosaceae crops. Comparative analysis of WAK/WAKL family members in strawberry, apple, peach and rose showed that the longest average length of the WAK/WAKL CDS was 2973 bp in strawberry, followed by apple, pear and rose. The average sequence length of WAK/WAKL in apple (2194 bp), rose (2036 bp) and peach (2047 bp) is similar, but the length in strawberry is 32.9% larger. Meanwhile, the WAK/WAKLs were different distributions on chromosomes in four Rosaceae crops. The FvWAK/FvWAKL and RcWAK/RcWAKL are distributed on all the 7 chromosomes in strawberry and rose. However, there is no MdWAK/MdWAKL gene on chromosome 3, 11 and 16 among 17 chromosomes of apple and no PpWAK/PpWAKL gene on chromosome 7 and 8 among 8 chromosomes of peach. In addition, the WAK/WAKL were unevenly distributed across each Rosaceae specie chromosomes, e.g.: the high density of PpWAKs/PpWAKLs location was observed at 19.63 ~ 19.84 Mb on chromosome 3 (PpWAKL23 ~ PpWAKL33) and at 4.59 ~ 4.77 Mb on chromosome 4 (PpWAK2 ~ PpWAK12). The unbalanced distribution of WAK/WAKL genes indicated genetic variation of Rosaceae crops during the evolutionary process.
Phylogenetic analysis of WAK/WAKL genes in four Rosaceae crops
In order to explore the evolutionary relationship among 199 WAK/WAKL in Rosaceae crops, we constructed a rootless phylogenetic tree using the Neighbor-Joining algorithm, included resistance related WAK/WAKL genes AtWAK1, AtWAK2, AtWAKL10, AtWAKL22, GhWAK7A, OsWAK14, OsWAK91, OsWAK92, OsWAK112d, OsWAK25, OsIRBB4_Xa4, SlWAK1, TaWAK6, ZmWAK, and ZmWAK-RLK1 [10, 12, 17, 35,36,37,38,39,40]. We divided all the WAK/WAKL genes into five clades [6, 26]. Clade V was the largest group contains 68 genes, accounting for 34.2% of all family genes. Followed by Clade I (54 genes), which was the only clade had no verified disease resistance genes. Clade II had the least number of genes containing only 11 genes (Figs. 1 and 2). The conserved domains and gene exon-intron structure were visualized on the basis of evolutionary analysis (Supplementary Fig. 2). Rosaceae WAKLs were intensively distributed in Clade IV and Clade V. Although we defined those genes lacking GUB_WAK_bind or EGF_CA as WAKLs, only 9 of 134 Rosaceae WAKLs actually lacked GUB_WAK_bind, while all the others were missing EGF_CA. The number of exons of Rosaceae WAK/WAKL varied greatly, from 1 to 12. There was also a big difference in Rosaceae WAK/WAKL gene length, the shortest one was RcWAKL14 (1448 bp) and the longest one was FvWAK12 (17,454 bp). Interestingly, the FvWAKLs and MdWAKLs in CladeV had more exon-intron structures, but RcWAKLs and PpWAKLs were not.
After multiple sequence alignment, we clearly found two characteristic domains of WAK/WAKL in the alignment results (Fig. 3). EGF_CA with six conserved cysteines for calcium-binding and GUB_WAK_bind located near the N-terminal end of the extracellular domain.
WAK/WAKL underwent purifying selection in four Rosaceae crops
The dN/dS (ω) nucleotide substitution ratios were used to evaluated the selection pressure among WAK/WAKL between different branches in phylogenetic tree. Generally, ω < 1 is consistent with purifying selection, while ω > 1 indicates a positive selection. With M0 model, which assumes that all branches have equal ω values, we got a ω = 0.254. With M1 model, which assumes that all branches have unequal ω values, 86.7% of ω values among different branches were < 1. All the results showed that WAK/WAKL genes undergone a strong purifying selection during their evolutionary history (Table 1).
We further use the site model to detect the sites of protein sequences which occurred positive selection (Table 2). The results showed that the M3 model with LRT P-value < 0.01 was better than the M0 model, indicating that different amino acid sites have different selection pressures in WAK/WAKL family. Further results demonstrated that a total of 26 amino acid sites were identified in the M2a and M8 models that were subject to positive selection. The positive sites were mostly located in the extracellular domains of WAK/WAKL family members, mainly concentrated on the GUB_WAK_bind domain.
Promoter cis-elements analysis of WAK/WAKL genes in Rosaceae crops
To investigate the cis-elements in promoter sequences of WAK/WAKL genes, 2 kb of DNA sequences up-stream of the initiation codon (ATG) were analyzed by Plant_CARE. We classified the identified cis-elements into three major categories, including plant development and growth, phytohormone response and abiotic response (Supplementary Table 1). We speculated that WAK/WAKL genes in different clade played a role in different biological processes. The predicted cis-elements were analyzed according to different clades in phylogenetic tree (Table 3). The cis-elements responsive to abscisic acid (ABA) and associated with abiotic stress accounted for 25.52% in Clade I. The cis-elements in Clade II and Clade III prominently responded to MeJA response, accounting for 27.43% and 24.65%, respectively. Abiotic stress-responsive cis-elements are enriched in Clade IV and Clade V.
In order to explore the expression patterns of WAK/WAKL family genes under different hormone treatments, we analyzed the transcriptome of rose flowers responding to different hormone treatments . According to the heat map (Fig. 4), Clade I members showed strong response to ABA and JA, followed by GA. while members of Clade II did not show a more significant response to a particular hormone. Clade III members tend to respond to JA and GA3. And Clade IV and Clade V showed strong response to ABA, followed by JA and GA3. The transcript expression patterns of RcWAK/RcWAKL in hormone treated RNA-seq data was consistent with promoter cis-elements enrichment analysis of Rosaceae WAK/WAKL (Fig. 4; Table 3).
The collinearity relationships of WAK/WAKL in four Rosaceae crops
To further investigated the WAK/WAKL evolutionary trajectory events, we have constructed paralogous gene pairs and orthologous gene groups in or between four Rosaceae crops, respectively. Collinearity analysis have identified 6, 2, 4 and 0 WAK/WAKL paralogous gene pairs in apple, peach, rose and strawberry, respectively (Supplementary Fig. 3). Interestingly, rose possessed four paralogous gene pairs, three of them were located on chromosome 5 (11.37 ~ 12.01 Mb), included RcWAK7, RcWAK8, RcWAK9, RcWAK11, RcWAK13, and RcWAK14. This result implied the small-scale tandem replication events within rose genome. In addition, apple with the largest genome size has the least WAK/WAKL number among four Rosaceae crops. Meanwhile, 45.7% (16/35) of MdWAK/MdWAKL were detected as paralogous gene. The results revealed that MdWAK/MdWAKL experienced high-pressure selection under evolutionary in apple, the expansion of MdWAK/MdWAKL family genes mainly depend on whole-genome duplication (WGD).
Orthologous group is used to describe a cluster of genes evolved from a single gene in the last common ancestor (LCA) among crops. Here, we constructed a syntenic map and compiled a gene list of WAK/WAKL orthologous genes across the four Rosaceae crops. 71 of 199 WAK/WAKL genes were mapped in orthologous blocks, accounting for 35.7% of all genes. They were divided into fourteen orthologous groups (Supplementary Fig. 4). There were four orthologous groups OG1, OG3, OG4 and OG6 were highly conserved across four Rosaceae crops (Fig. 5; Table 4). Although WAK/WAKL genes underwent different degrees of contraction and expansion during the divergence of the four species, the gene functions in same orthologous group may have the similar function.
The orthologous groups implied some Rosaceae WAKs playing important role in B. cinerea resistance
As the same orthologous group has shared ancestry, the genes in one group may have the similar gene function between crops. In our previous report, RcWAK8 (OG1) and RcWAK22 (OG4) were significantly up-regulated expression in rose petals during B. cinerea infection . These results indicated the genes from OG1 and OG4 may involve in plant-pathogen interaction.
To further verified the function of RcWAK8 and RcWAK22 in rose-Botrytis interaction, we attempt to knock down the expression of RcWAK8 and RcWAK22 one by one via VIGS. However, the VIGS must ensure the specificity of inserted fragments (without no consecutive 23 bp sequence is exactly the same) . Unfortunately, the coding sequence of RcWAK22 is too similar to RcWAK2 (Supplementary Fig. 5). By contrast, blast analysis suggested TRV-RcWAK8 construct do not have any off-target silencing in rose, it is specifically targeting RcWAK8. Petal discs silenced with RcWAK8 gene showed significant lesion enlargement compared with controls (Fig. 6a and b). Meanwhile, the silencing efficiency was also confirmed by qRT-PCR (Fig. 6c). All these evidences suggest that RcWAK8 plays an important role in the resistance of rose petals to gray mold.
On the other hand, to verify expression patten of FvWAK/FvWAKLs during B. cinerea infection in OG1 and OG4, real-time quantitative PCR (RT-qPCR) was performed to test the expression of FvWAK1, FvWAK2, FvWAK3, FvWAK5, FvWAK7, FvWAK8 and FvWAK9 in strawberry fruits after B. cinerea infection 48 h. As shown in Fig. 7, the expression level of FvWAK1 (OG4), FvWAK5 (OG4) and FvWAK9 (OG4) were significantly increased in B. cinerea-treated fruits comparing with mock (PDB-treated fruits). FvWAK2 and FvWAK8 have no change in B. cinerea-treated fruits. FvWAK3 and FvWAK7 were undetermined in all samples, may because these two WAKs existed tissue-specific expression.
WAK/WAKL family is one of the important receptor-like kinases (RLKs) localized on the cell membrane. Here, we use comparative genomics analysis of four Rosaceae WAK/WAKL family genes to provide novel insights in gene function and evolution. The four Rosaceae crops diverged around the 59 million years ago (MYA), corresponding to the Cretaceous-Paleogene (K-Pg) mass extinction event, with a large number of genome ploidy events occurring [42,43,44] (Fig. 1). It is implied that whole-genome duplication (WGD) events may have helped crops adapt to the harsh climatic conditions of the time, allowing them to survive the extinction event . However, the number of WAK/WAKL family genes was insignificantly related with WGD event or the genome size (Supplementary Table 6). For example, apple with a genome of 742 Mb had a neoteric WGD event at about 30–45 MYA, but only 35 WAK/WAKL genes were identified, a relatively fewer number among crops (Fig. 1, Supplementary Table 6). MdWAK/MdWAKL family seems no expansion event happened during the WGD. It might be apple (‘Golden Delicious’) experienced a long period of artificial domestication selection after WGD, genes with redundant functions are streamlined.
The phylogenetic evolutionary tree of WAK/WAKL showed that RcWAK/RcWAKL and FvWAK/FvWAKL were classified on the closest terminal branch, as well as WAK/WAKL of peach and apple (Fig. 2), supporting the relationship of crops divergence (Fig. 1). Meanwhile, we clearly found two characteristic domains EGF_CA and GUB_WAK_bind (Fig. 3). The EGF_CA domain is a conserved domain of about forty amino-acid residues found in epidermal growth factor (EGF). This domain is required for calcium-binding which may be crucial for numerous protein-protein interactions [46, 47]. The cysteine-rich GUB_WAK_bind is a unique domain of WAKs, it is responsible for transmitting signals (like pectin fragments) outside cell walls to cells and activating corresponding physiological processes .
Gene duplication plays an important role in plant evolution, different models of duplicated gene evolution have been proposed to evaluate selective pressure . Clearly, the WAK/WAKL gene has undergone duplicate during evolution, the branch model showed that WAK/WAKL underwent a strong purifying selection (Table 1). However, the site model revealed that almost all of the positive selection sites are located in N-terminal within first 200aa region, corresponding to the typical domain GUB_WAK_bind (Fig. 3; Table 2). The similar results have been reported in another subfamily of RLK, leucine-rich repeat receptor-like kinases (LRR-RLK) in angiosperms. Four main targets of positive selection amino acids were found in the N-terminal extracellular domain (the leucine-rich repeat domains) . These positive selection sites in LRR have been inferred for genes involved in biotic stress: Xa21 and Xa4, which confer resistance to the bacterial blight disease, was found to have evolved under positive selection in rice [10, 51]; FLS2 in Arabidopsis, involved in responses to multiple biotic stress, shows a signature of rapid fixation of an adaptive allele . GUB_WAK_bind domain plays an important function in recognizing to extracellular signals, mainly oligogalacturonides . Previous studies showed pathogens absorbed nutrients from plants by degrading plant cell walls which will produced oligogalacturonides. At the same time, oligogalacturonides act as a danger signal to induce plant defense response [48, 54, 55]. Interestingly, oligogalacturonide’s biological activity varies with their degree of polymerization (DP) and concentration. For example, oligogalacturonides with 12 DP and 2 ~ 6 DP had the highest activity in soybean  and wheat , respectively. In general, the dynamics and plasticity of this region in the WAK/WAKL genes implied Rosaceae has a broad tool set to respond to variously environmental challenges, thus undergoes positive selection.
Enrichment of promoter cis-element classification of WAK/WAKL revealed that Clade II prominently responds to MeJA, one of the major hormones for plant defense response (Table 3). This result was consistent with that 4 of 11 Clade II WAK/WAKL genes were reported involved in plant defense (Fig. 2). Furthermore, compared to other clades, Clade IV & V were tended to response to abiotic stress. Through transcriptome analysis of rose petals in response to different hormone treatments, it was found that the response intensity of each subfamily to different hormones was indeed different. For example, it is evident that Clade V has a strong response to ABA as well as Clade I to JA. This may be slightly different from the analysis of promoters, after all the rule of rose does not necessarily represent the universal conclusion of rosaceae crops, but the heat map (Fig. 4) does provide a good resource for study of rose. As WAK/WAKL plays a wide range of biological functions in growth , development  and interaction with stress factors , in further studies of WAK/WAKL genes, we can predict their functions in advance via different clades.
Orthologous group is a group of genes across different crops, such genes originated from a same gene or a same syntenic block from the last common ancestor before the crops diverging . Those orthologous genes have been selectively retained during the respective evolutionary processes, not only because the similarity of amino acid sequences, but also the collinearity relationships formed by the nearby genes in the genome. Therefore, a group of genes with orthologous relationships will perform similar biological function . Here, we identified four orthologous groups throughout four Rosaceae crops. Based on rose-B. cinerea interaction study , and transient silencing of RcWAK8 in rose petals exhibited increased susceptibility to B. cinerea. We speculated OG1 and OG4 which contained B. cinerea induced RcWAK8 and RcWAK22 may play a role in resistance to B. cinerea, and this hypothesis was verified by qRT-PCR. It is proved that it is feasible to infer the function of orthologous genes in other crops by orthologous group, which provides theoretical reference and basis for exploring the function of WAK/WAKL gene in other Rosaceae plants.
In conclusion, this study identified 131 WAK/WAKL family members through Rosaceae crops, apples, peaches and strawberries. The number and structure of WAK/WAL family in different crops were analyzed in detail. By comparing the phylogenetic evolution, selective pressure, collinearity relationship of WAK/WAKL throughout four Rosaceae crops, including roses. Finally, one RcWAK (RcWAK8) was verified and three FvWAK genes (FvWAK1, FvWAK5 and FvWAK9) were speculated, which were excavated through the orthologous gene group, were involved to gray mold resistance. This study offers a new idea to infer gene function with the help of the orthogonal group, also provides gene resources for the research of WAK/WAKL gene family, and help understand the function and evolution of Rosaceae WAK/WAKL.
The datasets used and/or analyzed during the current study has been downloaded from Genome Database for Rosaceae (http://www.rosaceae.org/). The genome data was also available in the following links: gddh13 v1.1 for Apple (Malus x domestica) (https://iris.angers.inra.fr/gddh13/), Fragaria vesca v4.0.a1 for strawberry (Fragaria vesca) (https://github.com/wurmlab/flo), Prunus persica v2.0 for peach (Prunus persica) (https://phytozome-next.jgi.doe.gov/info/Ppersica_v2_1). The plant materials are available from the corresponding author on reasonable request.
Afzal AJ, Wood AJ, Lightfoot DA. Plant receptor-like serine threonine kinases: roles in signaling and plant defense. Mol Plant Microbe Interact. 2008;21(5):507–17.
Cheng W, Wang Z, Xu F, Ahmad W, Lu G, Su Y, Xu L. Genome-wide identification of LRR-RLK family in saccharum and expression analysis in response to biotic and abiotic stress. Curr Issues Mol Biol. 2021;43(3):1632–51.
Shiu S-H, Bleecker AB. Receptor-like kinases from Arabidopsis form a monophyletic gene family related to animal receptor kinases. Proceedings of the National Academy of Sciences 2001, 98(19):10763–10768.
Gao L-L, Xue H-W. Global analysis of expression profiles of rice receptor-like kinase genes. Mol Plant. 2012;5(1):143–53.
Shiu S-H, Bleecker AB. Plant receptor-like kinase gene family: diversity, function, and signaling. Science’s STKE 2001, 2001(113):re22-re22.
Tripathi RK, Aguirre JA, Singh J. Genome-wide analysis of wall associated kinase (WAK) gene family in barley. Genomics. 2021;113(1):523–30.
Davis CG. The many faces of epidermal growth factor repeats. The New Biologist. 1990;2(5):410–9.
Decreux A, Thomas A, Spies B, Brasseur R, Van Cutsem P, Messiaen J. In vitro characterization of the homogalacturonan-binding domain of the wall-associated kinase WAK1 using site-directed mutagenesis. Phytochemistry. 2006;67(11):1068–79.
Kaur R, Singh K, Singh J. A root-specific wall-associated kinase gene, HvWAK1, regulates root growth and is highly divergent in barley and other cereals. Funct Integr Genom. 2013;13:167–77.
Hu K, Cao J, Zhang J, Xia F, Ke Y, Zhang H, Xie W, Liu H, Cui Y, Cao Y. Improvement of multiple agronomic traits by a disease resistance gene via cell wall reinforcement. Nat plants. 2017;3(3):1–9.
Li H, Zhou S-Y, Zhao W-S, Su S-C, Peng Y-L. A novel wall-associated receptor-like protein kinase gene, OsWAK1, plays important roles in rice blast disease resistance. Plant Mol Biol. 2009;69:337–46.
Zuo W, Chao Q, Zhang N, Ye J, Tan G, Li B, Xing Y, Zhang B, Liu H, Fengler KA. A maize wall-associated kinase confers quantitative resistance to head smut. Nat Genet. 2015;47(2):151–7.
Mercier A, Carpentier F, Duplaix C, Auger A, Pradier JM, Viaud M, Gladieux P, Walker AS. The polyphagous plant pathogenic fungus Botrytis cinerea encompasses host-specialized and generalist populations. Environ Microbiol. 2019;21(12):4808–21.
Poveda J, Barquero M, González-Andrés F. Insight into the microbiological control strategies against Botrytis cinerea using systemic plant resistance activation. Agronomy. 2020;10(11):1822.
Dean R, Van Kan JA, Pretorius ZA, Hammond-Kosack KE, Di Pietro A, Spanu PD, Rudd JJ, Dickman M, Kahmann R, Ellis J. The top 10 fungal pathogens in molecular plant pathology. Mol Plant Pathol. 2012;13(4):414–30.
Hua L, Yong C, Zhanquan Z, Boqiang L, Guozheng Q, Shiping T. Pathogenic mechanisms and control strategies of Botrytis cinerea causing post-harvest decay in fruits and vegetables. Food Qual Saf. 2018;2(3):111–9.
Brutus A, Sicilia F, Macone A, Cervone F, De Lorenzo G. A domain swap approach reveals a role of the plant wall-associated kinase 1 (WAK1) as a receptor of oligogalacturonides. Proc Natl Acad Sci. 2010;107(20):9452–7.
Daccord N, Celton J-M, Linsmith G, Becker C, Choisne N, Schijlen E, Van de Geest H, Bianco L, Micheletti D, Velasco R. High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development. Nat Genet. 2017;49(7):1099–106.
Edger PP, VanBuren R, Colle M, Poorten TJ, Wai CM, Niederhuth CE, Alger EI, Ou S, Acharya CB, Wang J. Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity. Gigascience. 2018;7(2):gix124.
Initiative IPG, Verde I, Abbott AG, Scalabrin S, Jung S, Shu S, Marroni F, Zhebentyayeva T, Dettori MT, Grimwood J. The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution. Nat Genet. 2013;45(5):487–94.
Verde I, Jenkins J, Dondini L, Micali S, Pagliarani G, Vendramin E, Paris R, Aramini V, Gazza L, Rossini L. The Peach v2. 0 release: high-resolution linkage mapping and deep resequencing improve chromosome-scale assembly and contiguity. BMC Genomics. 2017;18(1):1–18.
Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33(7):1870–4.
Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, He Y, Xia R. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol Plant. 2020;13(8):1194–202.
Crooks GE, Hon G, Chandonia J-M, Brenner SE. WebLogo: a sequence logo generator. Genome Res. 2004;14(6):1188–90.
Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24(8):1586–91.
Li M, Ma J, Liu H, Ou M, Ye H, Zhao P. Identification and characterization of wall-associated kinase (WAK) and wak-like (WAKL) gene family in Juglans regia and its wild related species Juglans mandshurica. Genes. 2022;13(1):134.
Sun Z, Song Y, Chen D, Zang Y, Zhang Q, Yi Y, Qu G. Genome-wide identification, classification, characterization, and expression analysis of the wall-associated kinase family during fruit development and under wound stress in tomato (Solanum lycopersicum L). Genes. 2020;11(10):1186.
Zhang Z, Ma W, Ren Z, Wang X, Zhao J, Pei X, Liu Y, He K, Zhang F, Huo W. Characterization and expression analysis of wall-associated kinase (WAK) and WAK-like family in cotton. Int J Biol Macromol. 2021;187:867–79.
Liu X, Wu J, Ji F, Cao X, Zhao Q, Cheng C, Ma N, Zhou X, Zhang Z. Transcriptomic profiling of rose flower under treatment of various phytohormones and plant growth regulators. Sci Data. 2022;9(1):669.
Wang Y, Tang H, Debarry JD, Tan X, Li J, Wang X, Lee T-h, Jin H, Marler B, Guo H. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40(7):e49–9.
Liu Y, Schiff M, Dinesh-Kumar SP. Virus-induced gene silencing in tomato. Plant J. 2002;31(6):777–86.
Ma N, Cai L, Lu W, Tan H, Gao J. Exogenous ethylene influences flower opening of cut roses (Rosa hybrida) by regulating the genes encoding ethylene biosynthesis enzymes. Sci China C Life Sci. 2005;48(5):434–44.
Cao X, Yan H, Liu X, Li D, Sui M, Wu J, Yu H, Zhang Z. A detached petal disc assay and virus-induced gene silencing facilitate the study of Botrytis cinerea resistance in rose flowers. Hortic Res 2019, 6.
Liu X, Wang Z, Tian Y, Zhang S, Li D, Dong W, Zhang C, Zhang Z. Characterization of wall-associated kinase/wall-associated kinase-like (WAK/WAKL) family in rose (Rosa chinensis) reveals the role of RcWAK4 in Botrytis resistance. BMC Plant Biol. 2021;21:1–12.
Diener AC, Ausubel FM. RESISTANCE TO FUSARIUM OXYSPORUM 1, a dominant Arabidopsis disease-resistance gene, is not race specific. Genetics. 2005;171(1):305–21.
Dmochowska-Boguta M, Kloc Y, Zielezinski A, Werecki P, Nadolska-Orczyk A, Karlowski WM, Orczyk W. TaWAK6 encoding wall-associated kinase is involved in wheat resistance to leaf rust similar to adult plant resistance. PLoS ONE. 2020;15(1):e0227713.
Harkenrider M, Sharma R, De Vleesschauwer D, Tsao L, Zhang X, Chern M, Canlas P, Zuo S, Ronald PC. Overexpression of rice wall-associated kinase 25 (OsWAK25) alters resistance to bacterial and fungal pathogens. PLoS ONE. 2016;11(1):e0147310.
Hurni S, Scheuermann D, Krattinger SG, Kessel B, Wicker T, Herren G, Fitze MN, Breen J, Presterl T, Ouzunova M. The maize disease resistance gene Htn1 against northern corn leaf blight encodes a wall-associated receptor-like kinase. Proceedings of the National Academy of Sciences 2015, 112(28):8780–8785.
Kohorn BD, Kohorn SL, Todorova T, Baptiste G, Stansky K, McCullough M. A dominant allele of Arabidopsis pectin-binding wall-associated kinase induces a stress response suppressed by MPK6 but not MPK3 mutations. Mol Plant. 2012;5(4):841–51.
Wang P, Zhou L, Jamieson P, Zhang L, Zhao Z, Babilonia K, Shao W, Wu L, Mustafa R, Amin I. The cotton wall-associated kinase GhWAK7A mediates responses to fungal wilt pathogens by complexing with the chitin sensory receptors. Plant Cell. 2020;32(12):3978–4001.
Burch-Smith TM, Anderson JC, Martin GB, Dinesh-Kumar SP. Applications and advantages of virus-induced gene silencing for gene function studies in plants. Plant J. 2004;39(5):734–46.
Foster CS, Sauquet H, Van der Merwe M, McPherson H, Rossetto M, Ho SY. Evaluating the impact of genomic data and priors on bayesian estimates of the angiosperm evolutionary timescale. Syst Biol. 2017;66(3):338–51.
Kumar S, Suleski M, Craig JM, Kasprowicz AE, Sanderford M, Li M, Stecher G, Hedges SB. TimeTree 5: an expanded resource for species divergence times. Mol Biol Evol. 2022;39(8):msac174.
Zhang L, Chen F, Zhang X, Li Z, Zhao Y, Lohaus R, Chang X, Dong W, Ho SY, Liu X. The water lily genome and the early evolution of flowering plants. Nature. 2020;577(7788):79–84.
Wu S, Han B, Jiao Y. Genetic contribution of paleopolyploidy to adaptive evolution in angiosperms. Mol Plant. 2020;13(1):59–71.
Selander-Sunnerhagen M, Ullner M, Persson E, Teleman O, Stenflo J, Drakenberg T. How an epidermal growth factor (EGF)-like domain binds calcium. High resolution NMR structure of the calcium form of the NH2-terminal EGF-like domain in coagulation factor X. J Biol Chem. 1992;267(27):19642–9.
Doolittle RF, Fei Feng D, Johnson MS. Computer-based characterization of epidermal growth factor precursor. Nature. 1984;307:558–60.
Côté F, Ham K-S, Hahn MG, Bergmann CW. Oligosaccharide elicitors in host-pathogen interactions: generation, perception, and signal transduction. Plant-Microbe Interact 1998:385–432.
Force A, Lynch M, Pickett FB, Amores A, Yan Y-l, Postlethwait J. Preservation of duplicate genes by complementary, degenerative mutations. Genetics. 1999;151(4):1531–45.
Fischer I, Diévart A, Droc G, Dufayard J-F, Chantret N. Evolutionary dynamics of the leucine-rich repeat receptor-like kinase (LRR-RLK) subfamily in angiosperms. Plant Physiol. 2016;170(3):1595–610.
Tan S, Wang D, Ding J, Tian D, Zhang X, Yang S. Adaptive evolution of Xa21 homologs in Gramineae. Genetica. 2011;139:1465–75.
Vetter MM, Kronholm I, He F, Häweker H, Reymond M, Bergelson J, Robatzek S, De Meaux J. Flagellin perception varies quantitatively in Arabidopsis thaliana and its relatives. Mol Biol Evol. 2012;29(6):1655–67.
He Z-H, Cheeseman I, He D, Kohorn BD. A cluster of five cell wall-associated receptor kinase genes, Wak1–5, are expressed in specific organs of Arabidopsis. Plant Mol Biol. 1999;39:1189–96.
Davis KR, Hahlbrock K. Induction of defense responses in cultured parsley cells by plant cell wall fragments. Plant Physiol. 1987;84(4):1286–90.
Liners F, Thibault J-F, Van Cutsem P. Influence of the degree of polymerization of oligogalacturonates and of esterification pattern of pectin on their recognition by monoclonal antibodies. Plant Physiol. 1992;99(3):1099–104.
Reymond P, Grünberger S, Paul K, Müller M, Farmer EE. Oligogalacturonide defense signals in plants: large fragments interact with the plasma membrane in vitro. Proceedings of the National Academy of Sciences 1995, 92(10):4145–4149.
Randoux B, Renard-Merlier D, Mulard G, Rossard S, Duyme F, Sanssené J, Courtois J, Durand R, Reignault P. Distinct defenses induced in wheat against powdery mildew by acetylated and nonacetylated oligogalacturonides. Phytopathology. 2010;100(12):1352–63.
Lally D, Ingmire P, Tong H-Y, He Z-H. Antisense expression of a cell wall–associated protein kinase, WAK4, inhibits cell elongation and alters morphology. Plant Cell. 2001;13(6):1317–32.
Wagner TA, Kohorn BD. Wall-associated kinases are expressed throughout plant development and are required for cell expansion. Plant Cell. 2001;13(2):303–18.
Sivaguru M, Ezaki B, He Z-H, Tong H, Osawa H, Baluška Fe, Volkmann D, Matsumoto H. Aluminum-induced gene expression and protein localization of a cell wall-associated receptor kinase in Arabidopsis. Plant Physiol. 2003;132(4):2256–66.
Setubal JC, Stadler PF. Gene phylogenies and orthologous groups. Comp Genomics: Methods Protocols 2018:1–28.
Xu Y, Zhang H, Zhong Y, Jiang N, Zhong X, Zhang Q, Chai S, Li H, Zhang Z. Comparative genomics analysis of bHLH genes in cucurbits identifies a novel gene regulating cucurbitacin biosynthesis. Hortic Res 2022, 9.
This work is supported by the National Natural Science Foundation of China (32202526) and the China Postdoctoral Science Foundation (2022M713391) to XL. And further supported by National Natural Science Foundation of China (grants numbers 31772344 and 31972444), The National Key Research and Development Program of China National Key Research and Development Project (2020YFD1000403) and The Construction of Beijing Science and Technology Innovation and Service Capacity in Top Subjects(CEFF-PXM2019_014207_000032)to ZZ.
The authors declare no competing interests.
Ethics approval and consent to participate
Consent for publication
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Cis-element categories based on biological process. Supplementary Table 2 Primers used in the experiment. Supplementary Table 3 Predicted WAK/WAKL family members in strawberrySupplementary Table 4 Predicted WAK/WAKL family members in apple. Supplementary Table 5 Predicted WAK/WAKL family members in peach. Supplementary Table 6 Number of WAK/WAKL genes in different species. Supplementary Figure 1 Venn diagrams of Genes containing different domains. (a) to (c) represents the quantitative relationships of apple, peach and strawberry respectively. GUB_ WAK_ Bind, galacturonan binding domain, EGF_ CA, calcium binding EGF domain, PKinase, serine/threonine kinase. SignalP&TM, signal peptide and transmembrane helix. Supplementary Figure 2 DNA structures and conserved domains of the WAK/WAKL gene family. Supplementary Figure 3 Paralogous genes pairs in four Rosaceae crops. (a) to (d) represent microsyntenic analysis of apple, peach, rose and strawberry, respectively. The grey lines represent pairs of genes that has syntenic relationship around the genome, and the highlighted red line indicating paralogous pairs ofWAK/WAKL family. Supplementary Figure 4 Syntenic map of orthologous genes running through Rosaceae species. The different coloured bars indicate their chromosomes, the grey lines indicate pairs of genes that are covalently related, and the highlighted red lines mean both of the genes from the WAK/WAKL family. Supplementary Figure 5 Comparison of CDS (coding sequence) of RcWAK2 and RcWAK22. Sequence alignment was performed by DNAMAN with default parameters in pairwise alignment.
About this article
Cite this article
Wang, Z., Ma, Y., Chen, M. et al. Comparative genomics analysis of WAK/WAKL family in Rosaceae identify candidate WAKs involved in the resistance to Botrytis cinerea. BMC Genomics 24, 337 (2023). https://doi.org/10.1186/s12864-023-09371-9