Genome-wide and molecular evolution analysis of the subtilase gene family in Vitis vinifera
© Cao et al.; licensee BioMed Central. 2014
Received: 11 April 2014
Accepted: 11 December 2014
Published: 16 December 2014
Vitis vinifera (grape) is one of the most economically significant fruit crops in the world. The availability of the recently released grape genome sequence offers an opportunity to identify and analyze some important gene families in this species. Subtilases are a group of subtilisin-like serine proteases that are involved in many biological processes in plants. However, no comprehensive study incorporating phylogeny, chromosomal location and gene duplication, gene organization, functional divergence, selective pressure and expression profiling has been reported so far for the grape.
In the present study, a comprehensive analysis of the subtilase gene family in V. vinifera was performed. Eighty subtilase genes were identified. Phylogenetic analyses indicated that these subtilase genes comprised eight groups. The gene organization is considerably conserved among the groups. Distribution of the subtilase genes is non-random across the chromosomes. A high proportion of these genes are preferentially clustered, indicating that tandem duplications may have contributed significantly to the expansion of the subtilase gene family. Analyses of divergence and adaptive evolution show that while purifying selection may have been the main force driving the evolution of grape subtilases, some of the critical sites responsible for the divergence may have been under positive selection. Further analyses of real-time PCR data suggested that many subtilase genes might be important in the stress response and functional development of plants.
Tandem duplications as well as purifying and positive selections have contributed to the functional divergence of subtilase genes in V. vinifera. The data may contribute to a better understanding of the grape subtilase gene family.
KeywordsVitis vinifera Subtilase Gene family Evolution Positive selection Differential expression
Subtilases are a very diverse family of subtilisin-like serine proteases found in all three domains of life (bacteria, archaea and eukaryotes). They are characterized by a catalytic triad of Asp, His, and Ser residues or a conserved catalytic residue Asn in an arrangement shared with subtilisins from Bacillus[1–3], all of which are located in the N-terminal domains of the mature enzymes. Most subtilases have a multi-domain structure, comprising a signal peptide, a pro-peptide, a protease domain, and frequently one or more additional domains [1, 4]. In prokaryotes, subtilases are generally secreted outside the cell during nutrition and play a role in host invasion. Subtilases lacking a signal peptide should remain inside the cell and most likely play a role in intracellular maturation of other proteins and peptides .
The first subtilase identified in eukaryotes was kexin . Since then, nine subtilases have been discovered in mammals, among which seven are related to kexin and the remaining two (S1P and PCSK9) belong to the proteinase K and pyrolysin subfamilies of subtilases, respectively . They are involved in the maturation of growth factors, neuropeptides, peptide hormones, receptor proteins, enzymes and viral surface glycoproteins in animals [7, 8]. The first subtilase cloned from higher plants was cucumisin, an extracellular protease highly abundant in melon fruit . After that, other subtilase cDNAs have been cloned from Alnus glutinosa, Arabidopsis thaliana and Lilium multiflorum[10, 11]. The most striking characteristic of plant subtilases is the presence of long insertions of up to 169 amino acids in the central region of the catalytic domain, resulting in a shift of the Ser of the catalytic triad towards the C-terminus . Moreover, the completion of genome projects for some model species revealed that large subtilase gene families exist throughout the plant kingdom, ranging from 23 genes in the moss Physcomitrella patens, 56 genes in Arabidopsis[12, 13], and 63 genes in rice  to 90 members in Populus trichocarpa. Clearly, plants possess many more of these subtilases than animals, suggesting important roles of subtilases in plant biology. Plant subtilases share many properties with their bacterial and mamalian homologs, but have some unique biochemical and structural features (e.g. Ca2+ independence and the inserted PA_subtilisin_like domain) that distinguish them from those in other organisms [15–18]. Expansion of the subtilase family in plants is also accompanied by functional diversification. It seems that most plant subtilases have gained some plant-specific functions during their evolution. For instance plant subtilases are involved in xylem differentiation , fruit ripening , seed development [21, 22], formation of lateral roots  and pathogen interactions [24–26]. In addition, this gene family is also related the processing of peptide growth factors  and programmed cell death in plants [28, 29].
Structural features and expression profiles of some subtilase homologs have been partially described in Arabidopsis[12–14] and rice . However, there is much less information about this gene family in Vitis vinifera. In the present study, we performed a genome-wide identification of the subtilase gene family in V. vinifera. Detailed analyses, including the molecular phylogeny, structural organization, functional divergence, adaptive evolution and expression profiling, were performed. Such an in-depth investigation is expected to provide insights into the underlying evolutionary mechanisms of the subtilase gene family in V. vinifera.
Results and discussion
Identification and characterization of the subtilase gene family in V. vinifera
Subtilases possess a conserved protease-associated (PA) subtilisin-like domain (PA_subtilisin_like domain). Based on this, we used the amino acid sequence of the PA_subtilisin_like domain (cd02120) as a query to search for homologs encoded by the grape genome. Subsequently, all identified candidate subtilase sequences were analyzed to determine whether they contain PA_subtilisin_like domains using the Conserved Domain Database (CDD) . As a result, we identified 80 subtilase proteins in V. vinifera (Additional file 1: Table S1). This number is higher than those reported in other plant species, such as 56 in Arabidopsis thaliana, 63 in rice, and 15 in tomato . These data likely suggest significant physiological functions of the subtilase gene family in V. vinifera. The numbers of subtilase genes in all plant species analyzed to date have been higher than in humans, suggesting their potentially more diverse function or different evolutionary mechanisms in plants.
Predictions of the subcellular localization of a gene product can provide additional information for its functional involvement. In this study, TargetP and PredoTar (http://urgi.versailles.inra.fr/predotar/predotar.html) were used for primary structural analyses of grape subtilases . The results indicated that most of the 80 grape subtilases possess signal sequences for targeting to the secretory pathway. In mammalian cells, most subtilases, such as proprotein convertases family (PCs), act as secretory enzymes and are targeted to the endoplasmic reticulum (ER) by virtue of their N-terminal signal peptides . This result indicated a common feature among plant species, such as grape and mammalian cells. However, we found that 32 members of the subtilase genes in V. vinifera do not contain any known protein-targeting motif. It is predicted that two members (LOC100250428 and LOC100265894) are targeted to chloroplasts and one (LOC100255614) to mitochondria (Additional file 1: Table S1), suggesting potential chloroplast and mitochondrial functions. Different subcellular localizations of plant subtilases have been found to correlate with their different physiological functions. For example, the subtilase-like serine protease SDD1 in Arabidopsis is located at the cell plasma membrane, where it mediates cell-to-cell signaling and controls stomatal distribution and density during leaf development . Another subtilisin-like protease, is stimulated in the presence of calcium ion . ALE1 encodes a subtilisin-like serine protease, which is localized in the endosperm cells surrounding the embryo, and is required for epidermal surface formation in Arabidopsis embryos and juvenile plants . In some cases, subtilases that share high sequence identities may have differential functions and are localized in specific tissues during different developmental stages. For example, tomato subtilase-like protease genes P79A, P69B, P79C and P69D share 79-88% identities; however, they exhibit different developmental and tissue-specific expression profiles, suggesting plant subtilases evolved various strategies to control their activities .
Phylogenetic analyses of grape subtilases
Inference of duplication time in paralogous pairs
Data (million years ago)
Conserved and diverged domains, motifs and gene structures
The modular structure of subtilase proteins has been studied thoroughly in Arabidopsis. This detailed information allowed us to analyze comparable domains for the 80 subtilases identified from the grape genome (Additional file 2: Table S2). We used CDD to identify major domains of subtilases in grape. Our results showed that four conserved domains (inhibitor_I9, peptidases_S8_3, PA_subtilisin_like, and peptidases_S8_S53) are present in the majority of grape subtilases (Additional file 2: Table S2). Compared with their mammalian homologs, plant subtilases share an insertion of 120–160 amino acids, within the catalytic domain (PA domain). The PA domain was originally identified as a region of homology between human transferrin and plant vacuolar sorting receptors. It is associated with different families of peptidases and has been implicated in protein–protein interactions and substrate recognition . For most plant subtilases, such as tomato SlSBT3, their activation is stimulated by PA-domain-mediated homo-dimerization. In our analysis, most grape subtilases contained the PA domain, suggesting the potential function of the PA domain in grape subtilase protein dimerization , and that the ability to form homodimers through the PA domain is likely a common feature of plant subtilases. The PA domain is also important for determining optimum substrate length in soybean , suggesting a possible role of the PA domain in grape subtilases in substrate selection. Here, we also identified a novel domain, inhibitor_I9. This domain (sometimes referred to as an activation peptide) is responsible for modulating the folding and activity of the peptidase pro-enzyme. In many cases, it is synthesized as part of a large precursor protein as an N-terminal domain associated with an inactive peptidase. This domain prevents access of the substrate to the active site. Once the N-terminal inhibitory domain is removed, either by interaction with a secondary peptidase or by autocatalytic cleavage, the activity of subtilase is stimulated . It seems that autocatalytic cleavage of the inhibitor-I9 domain contributes to the precise regulation of grape subtilase enzymes’ activities. A similar regulatory mechanism is reported in other plants. For example, the tomato (Solanum lycopersicum) SISBT3 possesses a potentially auto-inhibitory beta-hairpin domain that may obstruct the active site of the monomeric enzyme. Upon homo-dimerization mediated by the PA domain, this hairpin is immobilized by binding to the PA domain and its auto-inhibitory activity is relieved, stimulating the subtilase activity . Peptidases_S8_3 and peptidases_S8_S53 domains might play a role in digesting the specific substrates for grape subtilases. However, the following exceptions were observed: in addition to the four conserved domains, LOC100253594, LOC100265894, LOC100265217, LOC100242573, LOC100245233, LOC100250404 and LOC100251210 also contain other domains. For example, the DUF1034 domain exists in LOC100245233 and LOC100251210. This domain functions in sugar hydrolysis in other organisms such as fungi . In addition, the co-occurrence of a proteinase K domain and a P450 domain was reported in Magnaporthe grisea. We also found that two subtilases (LOC100265129 and LOC100266876) do not contain the inhibitor_I9 domain. Three copies of the peptidases_S8_S53 domains were present in LOC100250428 (Additional file 2: Table S2), suggesting possible domain duplication events during this gene’s evolution.
Gene structural diversification may play an important role in the evolution of multigene families [43, 44]. To gain further insights into the structural diversity of subtilases, we compared the exon-intron organization of the coding sequences of individual subtilase genes in grape. Detailed illustrations of the exon-intron structures are shown in Figure 1. In general, most closely related members in the same group shared a similar exon-intron structure. Interestingly, we also found that the number of introns varies considerably between different groups of grape subtilases, and most members of groups 1, 2, 6 and 7 do not contain introns. This can be explained by differences in the rates of intron gain and loss . Similar to grape subtilases, papaya subtilases also contain introns ; however, intronless subtilase genes have been reported in Arabidopsis and tomato . It has been suggested that introns not only increase the fitness of an organism by increasing intragenic recombination , but also are related to the evolutional rate of genes. For instance, some genes that rarely contain introns (F-box gene family, pentatricopeptide repeat containing gene family, DEAD box RNA helicases, early auxin-responsive SAUR) often experienced positive selection in their evolution [47–49]. Introns are unequally distributed in some gene families [50, 51] because of the ongoing intron gain and loss. Whether the large number of intron losses in groups 1, 2, 6 and 7 of grape subtilases have similar effects to those described above remains to be further experimentally examined.
Chromosomal distribution and gene duplications of the grape subtilase genes
Analysis of functional divergence
Estimated functional divergence among grape subtilase paralogs
Group 1/Group 2
Group 1/Group 3
Group 1/Group 4
Group 1/Group 5
Group 1/Group 6
Group 1/Group 7
Group 1/Group 8
Group 2/Group 3
Group 2/Group 4
Group 2/Group 5
Group 2/Group 6
Group 2/Group 7
Group 2/Group 8
Group 3/Group 4
Group 3/Group 5
Group 3/Group 6
Group 3/Group 7
Group 3/Group 8
Group 4/Group 5
Group 4/Group 6
Group 4/Group 7
Group 4/Group 8
Group 5/Group 6
Group 5/Group 7
Group 5/Group 8
Group 6/Group 7
Group 6/Group 8
Group 7/Group 8
Site-specific selective pressure analysis
Predicted numbers and locations of codons under positive selection within different subtilase groups
K a /K s
Positive selection sites
Integrative selection analysis
Evidence for positive selection in subtilase coding sequences
Null model log-likelihoods
Alternative mode log-likelihoods
Evidence for positive selection
Differential expression profiles of grape subtilase genes
We further selected four fruit growth phases to investigate these genes’ expressions during the fruit maturing process; these four phases were green hard berry, green soft berry, pink soft berry and red soft berry. As shown in Figure 6, different expression levels of subtilase genes were found in these four growth phases. Some genes, such as LOC100251507 and LOC100251409, showed a higher transcript level in fruit, but lower in other tissues. Furthermore, LOC100260464, LOC100243364, LOC100265217, LOC100243546 and LOC100250404 showed lower transcript levels in fruit, but higher levels in leaves, roots and floral buds. The subtilase gene SBT1.1 is specifically expressed in the endosperm of Medicago truncatula and Pisum sativum seeds to control seed size . In this study, we found that LOC 100253001, an ortholog of SBT1.1, was also transcribed at a high level during grape fruit development.
Environmental stress might regulate subtilase gene expression differentially . Therefore, we tested the differences in expressions of grape subtilase mRNAs under various environmental stresses, including salt, cold, heat and drought. These environmental stresses are frequently confronted during grape growth. Several genes, such as LOC100260739, LOC100249001, LOC100255668, LOC100257393, LOC100243634, and LOC100251409, were obviously induced after different stress treatments. Other genes, including LOC100255614, LOC100259937, LOC100254828, LOC100266528, and LOC100258241, were obviously suppressed by these environmental stresses. Meanwhile, most of these genes demonstrated a comparable expression profile when subjected to the various environmental stresses, except LOC100267603 and LOC100248833, which were expressed at a higher level after salt treatment, but a lower level after heat and drought stresses. Previous studies reported that PvSLP2 transcription was not induced by drought stress; however, PvSLP2 activity can be stimulated by drought stress, suggesting that plant subtilase activities may be regulated at the post-transcriptional level . Thus, we could not exclude the possibility that some grape subtilases are involved in environmental stress responses at the post-transcriptional level, even though we did not detect their transcriptional differences.
In summary, we identified and annotated 80 subtilases comprising eight subgroups in the V. vinifera genome. The analyses of gene structures, duplications, and selection provided valuable information on the evolution of grape subtilases. In particular, we found that tandem duplications have played an important role in the expansion of the subtilase gene family. Selection analysis revealed that purifying selection has been the main force during the evolution of the subtilase, while some of the critical sites have been subjected to positive selection. Moreover, analyses of their expression profiles provided functional information for members of the subtilase gene family in grape at different development stages. Further, investigations on the response patterns of the subtilase genes to salinity, cold, heat and drought conditions identified candidate stress-responsive genes in grape. Our results contribute valuable information for future functional investigations of this gene family.
Sequence retrieval and identification
To identify potential members of the subtilase gene family in grape, we performed multiple database searches. The amino acid sequence of the PA_subtilisin_like domain (cd02120) was retrieved and used as a query in BLAST searches against the grape genomes at the National Center for Biotechnology Information (NCBI, http://www.ncbi.nlm.nih.gov). TargetP and PredoTar (http://urgi.versailles.inra.fr/predotar/predotar.html) were used for primary structure analyses of the grape subtilase members .
Phylogenetic analyses of the grape subtilase gene family
Multiple sequence alignments of the full-length protein sequences were performed using MUSCLE 3.52, followed by manual comparisons and refinement . Phylogenetic analyses of the subtilase protein family, based on amino acid sequences, were performed with a maximum likelihood method using PhyML 3.0 and by a distance method using PHYLIP . ModelGenerator was used to select the optimal model of protein substitution and rate heterogeneity that best fitted the data set . Bootstrap support values were estimated using 100 pseudo-replicates.
Chromosomal location and gene structure of the subtilase genes
The chromosomal locations of the subtilase genes were determined using the grape genome browser (http://www.genoscope.cns.fr/spip/Vitis-vinifera-e.html). Gene intron/exon structure information was collected from the genome annotations of grape from the NCBI and Phytozome (http://www.phytozome.net) databases.
Inference of duplication time
Pairwise alignment of nucleotide sequences of the subtilase paralogs was performed using MEGA 5 . Alignments were performed using ClustalW (codons). The K a and K s values of the paralogous genes were estimated by the program K-Estimator 6.0 . To better explain the patterns of macroevolution, estimates of the evolutionary rates were considered extremely useful. Assuming a molecular clock, the synonymous substitution rates (K s ) of the paralogous genes are expected to be similar over time. Thus, K s could be used as the proxy for time to estimate the dates of segmental duplication events. The K s value was calculated for each of the gene pairs and then used to calculate the approximate date of the duplication event (T = K s /2λ), assuming clock-like rates (λ) of synonymous substitution of 6.5 × 10- 9 for grape .
Conserved motifs analyses
The program MEME (http://meme.sdsc.edu) was used to identify motifs in the candidate grape subtilase protein sequences . MEME was run locally, with the following parameters: number of repetitions = any, maximum number of motifs = 25, and with optimum motif widths constrained to between six and 50 residues.
Functional divergence analyses
To estimate the level of functional divergence and predict the amino acid residues responsible for functional differences in the subtilase subfamilies, coefficients of Type-I functional divergence were calculated using the method suggested by Gu . The analyses were carried out with DINERGE (version 2.0). The method is based on maximum likelihood procedures to estimate significant changes in the site-specific shift of the evolutionary rate or the site-specific shift of amino acid properties after the emergence of two paralogous sequences. The advantage of this method is that it uses amino acid sequences and, therefore, is not sensitive to saturation of synonymous sites. Type-I functional divergence designates amino acid configurations that are highly conserved in gene 1 but highly variable in gene 2, or vice versa, implying that these residues have experienced altered functional constraints . Coefficients of functional divergence that are significantly greater than 0 indicate site-specific altered selective constraints or radical shifts of amino acid physiochemical properties after gene duplication. Site-specific posterior analysis was used to predict amino acid residues that were crucial for functional divergence.
Site-specific selection assessment and testing
In the study, SLAC, REL and FEL were employed to select individual codons using the default settings of the Datamonkey web interface [56, 57, 67]. SLAC fits a nucleotide substitution model to the data and calculates a global Ka/Ks ratio. Then, ancestral sequences at each codon are reconstructed using maximum likelihood. Finally, expected and observed numbers of synonymous and nonsynonymous substitutions are calculated to infer selection at each codon site. Significance was assessed using a P value derived from a two-tailed binomial distribution. SLAC calculates the expected and observed numbers of synonymous and nonsynonymous substitutions to infer selection. REL is an extension of the site-by-site positive selection analyses implemented in PAML . Notably, it allows the synonymous and nonsynonymous substitution rates to vary among codon sites, and uses Bayes factors >50 to determine a site as selected [56, 67]. FEL directly estimates Ka and Ks based on a codon-substitution model; a likelihood ratio test is used to assess significance at a level of 0.1. Finally, we applied the "integrative selection analysis" to determine the total number of positively selected codons, which were detected by at least one of the three methods [56, 67]. PARRIS can allow tree topologies and branch lengths to change across detected recombination breakpoints ; therefore, we used it to test for the signatures of selection.
RNA extraction and real-time qRT-PCR
Total RNA after different stress treatments or from different tissues was isolated using an RNeasy Kit (Qiagen) from plant samples that had been ground in liquid nitrogen and then converted into first-strand cDNA using SuperScriptII reverse transcriptase (Invitrogen) with an oligo(dT) primer. The cDNA templates were amplified using a CFX384 Real-time PCR detection system (Bio-Rad) with SYBR premix Ex Taq (Takara). The primer sequences are given in Additional file 4: Table S4. The thermal program was 5 min at 95°C, followed by 60 cycles of 10 s at 95°C, 10 s at 55°C and 10 s at 72°C. The specificity of the reactions was confirmed by the machine standard melt curve method. The grape tubulin gene was used as the reference gene. The quantified data were analyzed by hierarchical clustering using the cluster 3.0 and Treeview software (http://bonsai.ims.u-tokyo.ac.jp/~mdehoon/software/cluster). The different colors correspond to the log-transformed values of protein change-fold ratios shown in the bar at the bottom of Figure 6.
This work was supported by the 100 Talents Program of CAS (to XH) and grants from the National Science Foundation of China (Nos. 30871704, 30971452 and 31170256 to XH) JH is supported by a NSF Assembling the Tree of Life grant (DEB 0830024), an NSFC Oversea, Hong Kong, Macao collaborative grant (31328003), and the CAS/SAFEA International Partnership Program for Creative Research Teams.
- Siezen RJ, Leunissen JA: Subtilases: the superfamily of subtilisin-like serine proteases. Protein Sci. 1997, 6: 501-523.PubMed CentralPubMedView ArticleGoogle Scholar
- Antao CM, Malcata FX: Plant serine proteases: biochemical, physiological and molecular features. Plant Physiol Biochem. 2005, 43: 637-650. 10.1016/j.plaphy.2005.05.001.PubMedView ArticleGoogle Scholar
- Dodson G, Wlodawer A: Catalytic triads and their relatives. Trends Biochem Sci. 1998, 23: 347-352. 10.1016/S0968-0004(98)01254-7.PubMedView ArticleGoogle Scholar
- Siezen RJ, Renckens B, Boekhorst J: Evolution of prokaryotic subtilases: genome-wide analysis reveals novel subfamilies with different catalytic residues. Proteins. 2007, 67 (3): 681-994. 10.1002/prot.21290.PubMedView ArticleGoogle Scholar
- Fuller RS, Brake A, Thorner J: Yeast prohormone processing enzyme (KEX2 gene product) is a Ca2_-dependent serine protease. Proc Natl Acad Sci U S A. 1989, 86: 1434-1438. 10.1073/pnas.86.5.1434.PubMed CentralPubMedView ArticleGoogle Scholar
- Seidah NGKA, Prat A: The proprotein convertases and their implication in sterol and/or lipid metabolism. Biol Chem. 2006, 387: 871-877.PubMedView ArticleGoogle Scholar
- Schaller A, Stintzi A, Graff L: The family of subtilisin/kexin like pro-protein and pro-hormone convertases: Divergent or shared functions. Biochimie. 1994, 76: 197-209. 10.1016/0300-9084(94)90147-3.View ArticleGoogle Scholar
- Seidah NG, Chrétien M: Proprotein and prohormone convertases: a family of subtilases generating diverse bioactive polypeptides. Brain Res. 1999, 848 (1–2): 45-62.PubMedView ArticleGoogle Scholar
- Yamagata H, Masuzawa T, Nagaoka Y, Ohnishi T, Iwasaki T: cucumisin, a serine protease from melon fruits, shares structural homology with subtilisin and is generated from a large precursor. J Biol Chem. 1994, 269: 32725-32731.PubMedGoogle Scholar
- Ribeiro A, Akkermans ADL, van Kammen A, Bisseling T, Pawlowski K: A nodule-specific gene encoding a subtilisin like protease is expressed in early stages of actinorhizal nodule development. Plant Cell. 1995, 7: 785-794. 10.1105/tpc.7.6.785.PubMed CentralPubMedView ArticleGoogle Scholar
- Taylor AA, Horsch A, Rzepczyk A, Hasenkampf CA, Riggs CD: Maturation and secretion of a serine proteinase is associated with events of late microsporogenesis. Plant J. 1997, 12: 1261-1271. 10.1046/j.1365-313x.1997.12061261.x.PubMedView ArticleGoogle Scholar
- Beers EP, Jones AM, Dickerman AW: The S8 serine, C1A cysteine and A1 aspartic protease families in Arabidopsis. Phytochemistry. 2004, 65: 43-58. 10.1016/j.phytochem.2003.09.005.PubMedView ArticleGoogle Scholar
- Rautengarten C, Steinhauser D, Bussis D, Stintzi A, Schaller A, Kopka J, Altmann T: Inferring hypotheses on functional relationships of genes: Analysis of the Arabidopsis thaliana subtilase gene family. PLoS Comput Biol. 2005, 1: e40-10.1371/journal.pcbi.0010040.PubMed CentralPubMedView ArticleGoogle Scholar
- Tripathi L, Sowdhamini R: Cross genome comparisons of serine proteases in Arabidopsis and rice. BMC Genomics. 2006, 7: 200-10.1186/1471-2164-7-200.PubMed CentralPubMedView ArticleGoogle Scholar
- Schaller A, Stintzi A, Graff L: Subtilases-versatile tools for protein turnover, plant development, and interactions with the environment. Physiol Plant. 2012, 145 (1): 52-66. 10.1111/j.1399-3054.2011.01529.x.PubMedView ArticleGoogle Scholar
- Mahon P, Bateman A: The PA domain: A protease-associated domain. Protein Sci. 2000, 9: 1930-1934. 10.1110/ps.9.10.1930.PubMed CentralPubMedView ArticleGoogle Scholar
- Luo X, Hofmann K: The protease-associated domain: A homology domain associated with multiple classes of proteases. Trends Biochem Sci. 2001, 26: 147-148. 10.1016/S0968-0004(00)01768-0.PubMedView ArticleGoogle Scholar
- Ottmann C, Rose R, Huttenlocher F, Cedzich A, Hauske P, Kaiser M, Huber R, Schaller A: Structural basis for Ca2 + -independence and activation by homodimerization of tomato subtilase 3. Proc Natl Acad Sci U S A. 2009, 106: 17223-17228. 10.1073/pnas.0907587106.PubMed CentralPubMedView ArticleGoogle Scholar
- Zhao C, Johnson BJ, Kositsup B, Beers EP: Exploiting secondary growth in Arabidopsis. Construction of xylem and bark cDNA libraries and cloning of three xylem endopeptidases. Plant Physiol. 2000, 123: 1185-1196. 10.1104/pp.123.3.1185.PubMed CentralPubMedView ArticleGoogle Scholar
- Othman R, Nuraziyan A: Fruit-specific expression of papaya subtilase gene. J Plant Physiol. 2010, 167 (2): 131-137. 10.1016/j.jplph.2009.07.015.PubMedView ArticleGoogle Scholar
- Beilinson V, Moskalenko OV, Livingstone DS, Reverdatto SV, Jung R, Nielsen NC: Two subtilisin-like proteases from soybean. Physiol Plant. 2002, 115: 585-597. 10.1034/j.1399-3054.2002.1150413.x.PubMedView ArticleGoogle Scholar
- D'Erfurth I, Le Signor C, Aubert G, Sanchez M, Vernoud V, Darchy B, Lherminier J, Bourion V, Bouteiller N, Bendahmane A, Buitink J, Prosperi JM, Thompson R, Burstin J, Gallardo K: A role for an endosperm-localized subtilase in the control of seed size in legumes. New Phytol. 2012, 196 (3): 738-751. 10.1111/j.1469-8137.2012.04296.x.PubMedView ArticleGoogle Scholar
- Neuteboom LW, Veth-Tello LM, Clijdesdale OR, Hooykaas PJ, van derZaal BJ: A novel subtilisin-like protease gene from Arabidopsis thaliana is expressed at sites of lateral root emergence. DNA Res. 1999, 6: 13-19. 10.1093/dnares/6.1.13.PubMedView ArticleGoogle Scholar
- Chichkova NV, Kim SH, Titova ES, Kalkum M, Morozov VS, Rubtsov YP, Kalinina NO, Taliansky ME, Vartapetian AB: A plant-caspase-like protease activated during hypersensitive response. Plant Cell. 2004, 16: 157-171. 10.1105/tpc.017889.PubMed CentralPubMedView ArticleGoogle Scholar
- Tian M, Kamoun S: A two disulfide bridge Kazal domain from Phytophthora exhibits stable inhibitory activity against serine proteases of the subtilisin family. BMC Biochem. 2005, 6: 15-10.1186/1471-2091-6-15.PubMed CentralPubMedView ArticleGoogle Scholar
- Tian MY, Benedetti B, Kamoun S: A second kazal-like protease inhibitor from Phytophthora infestans inhibits and interacts with the apoplastic pathogenesis-related protease P69B of tomato. Plant Physiol. 2005, 138: 1785-1793. 10.1104/pp.105.061226.PubMed CentralPubMedView ArticleGoogle Scholar
- Srivastava R, Liu L, Howell SH: Proteolytic processing of a precursor protein for a growth promoting peptide by a subtilisin serine protease in Arabidopsis. Plant J. 2008, 56: 219-227. 10.1111/j.1365-313X.2008.03598.x.PubMed CentralPubMedView ArticleGoogle Scholar
- Coffeen WC, Wolpert TJ: Purification and characterization of serine proteases that exhibit caspase-like activity and are associated with programmed cell death in Avena sativa. Plant Cell. 2004, 16: 857-873. 10.1105/tpc.017947.PubMed CentralPubMedView ArticleGoogle Scholar
- Vartapetian AB, Tuzhikov AL, Chichkova NV, Taliansky M, Wolpert TJ: A plant alternative to animal caspases: subtilisin-like proteases. Cell Death Differ. 2011, 18: 1289-1297. 10.1038/cdd.2011.49.PubMed CentralPubMedView ArticleGoogle Scholar
- Marchler-Bauer A, Lu SN, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, Fong JH, Geer LY, Geer RC, Gonzales NR, Gwads M, Hurwitz DI, KE Z, Jackson JD, Lanczycki CJ, Lu F, Marchler GH, Mullokandov M, Omelchenko MV, Robertson CL, Song JS, Thanki N, Yamashita RA, Zhang N, Sheng C, Bryant SH: CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res. 2011, 39 (Database issue): D225-D229.PubMed CentralPubMedView ArticleGoogle Scholar
- Rose R, Schaller A, Ottmann C: Structural features of plant subtilases. Plant Signal Behav. 2010, 5 (2): 180-183. 10.4161/psb.5.2.11069.PubMed CentralPubMedView ArticleGoogle Scholar
- Emanuelsson O, Nielsen H, Brunak S, von Heijne G: Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J Mol Biol. 2000, 300: 1005-1016. 10.1006/jmbi.2000.3903.PubMedView ArticleGoogle Scholar
- von Groll U, Berger D, Altmann T: The subtilisin-like serine protease SDD1 mediates cell-to-cell signaling during Arabidopsis stomatal development. Plant Cell. 2002, 14: 1527-1539. 10.1105/tpc.001016.PubMed CentralPubMedView ArticleGoogle Scholar
- Hamilton JM, Simpson DJ, Hyman SC, Ndimba BK, Slabas AR: Ara12 subtilisin-like protease from Arabidopsis thaliana: purification, substrate specificity and tissue localization. Biochem J. 2003, 370 (Pt 1): 57-67.PubMed CentralPubMedView ArticleGoogle Scholar
- Tanaka H, Onouchi H, Kondo M, Hara-Nishimura I, Nishimura M, Machida C, Machida Y: A subtilisin-like serine protease is required for epidermal surface formation in Arabidopsis embryos and juvenile plants. Development. 2001, 128 (23): 4681-4689.PubMedGoogle Scholar
- Jorda L, Coego A, Conejero V, Vera P: A genomic cluster containing four differentially regulated subtilisin-like processing protease genes is in tomato plants. J Biol Chem. 1999, 274 (4): 2360-2365. 10.1074/jbc.274.4.2360.PubMedView ArticleGoogle Scholar
- Meichtry J, Amrhein N, Schaller A: Characterization of the subtilase gene family in tomato (Lycopersicon esculentum Mill.). Plant Mol Biol. 1999, 39 (4): 749-760. 10.1023/A:1006193414434.PubMedView ArticleGoogle Scholar
- Tan-Wilson A, Bandak B, Prabu-Jeyabalan M: The PA domain is crucial for determining optimum substrate length for soybean protease C1: Structure and kinetics correlate with molecular function. Plant Physiol Biochem. 2012, 53: 27-32.PubMedView ArticleGoogle Scholar
- Bergeron F, Leduc R, Day R: Subtilase-like pro-protein convertases: from molecular specificity to therapeutic applications. J Mol Endocrinol. 2000, 24 (1): 1-22. 10.1677/jme.0.0240001.PubMedView ArticleGoogle Scholar
- Siezen RJ: Subtilases: subtilisin-like serine proteases. Adv Exp Med Biol. 1996, 379: 75-93. 10.1007/978-1-4613-0319-0_9.PubMedView ArticleGoogle Scholar
- Muszewska A, Taylor JW, Szczesny P, Grynberg M: Independent subtilases expansions in fungi associated with animals. Mol Biol Evol. 2011, 28 (12): 3395-3404. 10.1093/molbev/msr176.PubMed CentralPubMedView ArticleGoogle Scholar
- Bailey TL, Williams N, Misleh C, Li WW: MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 2006, 34 (Web Server issue): W369-W373.PubMed CentralPubMedView ArticleGoogle Scholar
- Cao J, Shi F, Liu X, Huang G, Zhou M: Phylogenetic analysis and evolution of aromatic amino acid hydroxylase. FEBS Lett. 2010, 584 (23): 4775-4782. 10.1016/j.febslet.2010.11.005.PubMedView ArticleGoogle Scholar
- Cao J, Shi F: Dynamics of arginase gene evolution in metazoans. J Biomol Struct Dyn. 2012, 30: 407-418. 10.1080/07391102.2012.682207.PubMedView ArticleGoogle Scholar
- Roy SW, Gilbert W: The evolution of spliceosomal introns: patterns, puzzles and progress. Nat Rev Genet. 2006, 7 (3): 211-221.PubMedGoogle Scholar
- Pearce G, Yamaguchi Y, Barona G, Ryan CA: A subtilisin-like protein from soybean contains an embedded, cryptic signal that activates defense-related genes. Proc Proc Natl Acad Sci USA. 2010, 107 (33): 14921-14925. 10.1073/pnas.1007568107.View ArticleGoogle Scholar
- Aubourg S, Kreis M, Lecharny A: The DEAD box RNA helicase family in Arabidopsis thaliana. Nucleic Acids Res. 1999, 27 (2): 628-636. 10.1093/nar/27.2.628.PubMed CentralPubMedView ArticleGoogle Scholar
- Gagne JM, Downes BP, Shiu SH, Durski AM, Vierstra RD: The F-box subunit of the SCF E3 complex is encoded by a diverse superfamily of genes in Arabidopsis. Proc Natl Acad Sci U S A. 2002, 99 (17): 11519-11524. 10.1073/pnas.162339999.PubMed CentralPubMedView ArticleGoogle Scholar
- Chen Y, Hao X, Cao J: Small auxin upregulated RNA (SAUR) gene family in maize: Identification, evolution, and its phylogenetic comparison with Arabidopsis, rice, and sorghum. J Integr Plant Biol. 2014, 56: 133-150. 10.1111/jipb.12127.PubMedView ArticleGoogle Scholar
- Cao J, Huang J, Yang Y, Hu X: Analyses of the oligopeptide transporter gene family in poplar and grape. BMC Genomics. 2011, 12: 465-10.1186/1471-2164-12-465.PubMed CentralPubMedView ArticleGoogle Scholar
- Chen Y, Cao J: Comparative genomic analysis of the Sm gene family in rice and maize. Gene. 2014, 539: 238-249. 10.1016/j.gene.2014.02.006.PubMedView ArticleGoogle Scholar
- Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, Vezzi A, Legeai F, Hugueney P, Dasilva C, Horner D, Mica E, Jublot D, Poulain J, Bruyère C, Billault A, Segurens B, Gouyvenoux M, Ugarte E, Cattonaro F, Anthouard V, Vico V, Del Fabbro C, Alaux M, Di Gaspero G, Dumas V, et al: The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature. 2007, 449 (7161): 463-467. 10.1038/nature06148.PubMedView ArticleGoogle Scholar
- Ma LJ, Ibrahim AS, Skory C, Grabherr MG, Burger G, Butler M, Elias M, Idnurm A, Lang BF, Sone T, Abe A, Calvo SE, Corrochano LM, Engels R, Fu J, Hansberg W, Kim JM, Kodira CD, Koehrsen MJ, Liu B, Miranda-Saavedra D, O’Leary S, Ortiz-Castellanos L, Poulter R, Rodriguez-Romero J, Ruiz-Herrera J, Shen YQ, Zeng Q, Galagan J, Birren BW, et al: Genomic analysis of the basal lineage fungus Rhizopus oryzae reveals a whole-genome duplication. PLoS Genet. 2009, 5 (7): e1000549-10.1371/journal.pgen.1000549.PubMed CentralPubMedView ArticleGoogle Scholar
- Gu X: Maximum-likelihood approach for gene family evolution under functional divergence. Mol Biol Evol. 2001, 18 (4): 453-464. 10.1093/oxfordjournals.molbev.a003824.PubMedView ArticleGoogle Scholar
- Rawlings ND, Barrett AJ: Families of serine peptidases. Meth Enzymol. 1994, 244: 19-61.PubMedView ArticleGoogle Scholar
- Pond SL, Frost SD: Datamonkey: rapid detection of selective pressure on individual sites of codon alignments. Bioinformatics. 2005, 21 (10): 2531-2533. 10.1093/bioinformatics/bti320.PubMedView ArticleGoogle Scholar
- Kosakovsky Pond SLFS: Not so different after all: a comparison of methods for detecting amino acid sites under selection. Mol Biol Evol. 2005, 22 (5): 1208-1222. 10.1093/molbev/msi105.PubMedView ArticleGoogle Scholar
- Schaller A, Stintzi A, Graff L: Robust inference of positive selection from recombining coding sequences. Bioinformatics. 2006, 22 (20): 2493-2499. 10.1093/bioinformatics/btl427.View ArticleGoogle Scholar
- Subbian E, Yabuta Y, Shinde U: Positive selection dictates the choice between kinetic and thermodynamic protein folding and stability in subtilases. Biochemistry. 2004, 43 (45): 14348-14360. 10.1021/bi048397x.PubMedView ArticleGoogle Scholar
- Neuteboom LW, Veth-Tello LM, Clijdesdale OR, Hooykaas PJ, van der Zaal BJ: A novel subtilisin-like protease gene from Arabidopsis thaliana is expressed at sites of lateral root emergence. DNA Res. 1999, 6 (1): 13-19. 10.1093/dnares/6.1.13.PubMedView ArticleGoogle Scholar
- Budic M, Sabotic J, Meglic V, Kos J, Kidric M: Characterization of two novel subtilases from common bean (Phaseolus vulgaris L.) and their responses to drought. Plant Physiol Biochem. 2013, 62: 79-87.PubMedView ArticleGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.PubMed CentralPubMedView ArticleGoogle Scholar
- Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O: New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010, 59 (3): 307-321. 10.1093/sysbio/syq010.PubMedView ArticleGoogle Scholar
- Keane TM, Creevey CJ, Pentony MM, Naughton TJ, McInerney JO: Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified. BMC Evol Biol. 2006, 6: 29-10.1186/1471-2148-6-29.PubMed CentralPubMedView ArticleGoogle Scholar
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.PubMed CentralPubMedView ArticleGoogle Scholar
- Gaut BS, Morton BR, McCaig BC, Clegg MT: Substitution rate comparisons between grasses and palms: synonymous rate differences at the nuclear gene Adh parallel rate differences at the plastid gene rbcL. Proc Natl Acad Sci U S A. 1996, 93 (19): 10274-10279. 10.1073/pnas.93.19.10274.PubMed CentralPubMedView ArticleGoogle Scholar
- Delport W, Poon AF, Frost SD, Kosakovsky Pond SL: Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics. 2010, 26 (19): 2455-2457. 10.1093/bioinformatics/btq429.PubMed CentralPubMedView ArticleGoogle Scholar
- Yang Z: PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci. 1997, 13 (5): 555-556.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.