Cysteine peptidases and their inhibitors in Tetranychus urticae: a comparative genomic approach
© Santamaría et al.;licensee BioMed Central Ltd. 2012
Received: 9 April 2012
Accepted: 11 July 2012
Published: 11 July 2012
Cysteine peptidases in the two-spotted spider mite Tetranychus urticae are involved in essential physiological processes, including proteolytic digestion. Cystatins and thyropins are inhibitors of cysteine peptidases that modulate their activity, although their function in this species has yet to be investigated. Comparative genomic analyses are powerful tools to obtain advanced knowledge into the presence and evolution of both, peptidases and their inhibitors, and could aid to elucidate issues concerning the function of these proteins.
We have performed a genomic comparative analysis of cysteine peptidases and their inhibitors in T. urticae and representative species of different arthropod taxonomic groups. The results indicate: i) clade-specific proliferations are common to C1A papain-like peptidases and for the I25B cystatin family of inhibitors, whereas the C1A inhibitors thyropins are evolutionarily more conserved among arthropod clades; ii) an unprecedented extensive expansion for C13 legumain-like peptidases is found in T. urticae; iii) a sequence-structure analysis of the spider mite cystatins suggests that diversification may be related to an expansion of their inhibitory range; and iv) an in silico transcriptomic analysis shows that most cathepsin B and L cysteine peptidases, legumains and several members of the cystatin family are expressed at a higher rate in T. urticae feeding stages than in embryos.
Comparative genomics has provided valuable insights on the spider mite cysteine peptidases and their inhibitors. Mite-specific proliferations of C1A and C13 peptidase and I25 cystatin families and their over-expression in feeding stages of mites fit with a putative role in mite’s feeding and could have a key role in its broad host feeding range.
KeywordsComparative genomics Cystatins Cysteine peptidases Feeding Tetranychus urticae Thyropins
The two-spotted spider mite, Tetranychus urticae, is one of the main pests of agricultural crops due to its broad host range. This polyphagous species feeds on more than 1,100 plant species, from which about 150 are of great economic value. Thus, it represents a very important pest for field and greenhouse crops, ornamentals, annual and perennial plants all over the world. T. urticae has one of the smallest known animal genomes of about 90Mbp with over 18,000 genes identified. Genome features, such as large expansion of gene families associated with digestion and detoxification of plant secondary compounds are consistent with the spider mite’s wide host feeding range .
Regarding mite digestive physiology, mites use both extracellular and intracellular digestion, with the latter occurring in gut wall-derived epithelial cells that ingest and digest food particles that can be free floating [2, 3]. The midgut is the site for synthesis and secretion of digestive enzymes and absorption of nutrients. Processed food and epithelial cells pass into the posterior midgut, are subsequently compacted in the hindgut and excreted as faecal pellets . Mite species that feed on plants rely mostly on cysteine peptidase activities for the digestion of dietary proteins [4, 5].
Cysteine peptidases are enzymes that hydrolyse peptide bonds using a catalytic cysteine. The MEROPS database contains all the existing peptidases grouped in clans . Clans represent one or more families that show evidence of their evolutionary relationship by their similar tertiary structures, or when structures are not available, by the order of catalytic-site residues in the polypeptide chain and often by common sequence motifs around the catalytic residues. At present, among the 72 families of cysteine peptidases identified, 43 are included in 8 clans exclusively formed by cysteine peptidases, 13 are distributed in three clans that comprise peptidases with different catalytic mechanisms, and 16 are not enclosed in any determined clan. Most of the cysteine peptidases characterized in arthropods belong to the papain-like family (C1A), although members of the legumain (C13), calpain (C2), caspase (C14) or separase (C50) families have also been reported.
Arthropod papain-like cysteine peptidases are homologous to mammalian cathepsins, which are present in lysosomes, but can also be localized in extracellular spaces . In mites and insect species belonging to the orders Coleoptera, Hemiptera and Homoptera most C1A peptidases, particularly cathepsins L and B-like, are involved in the digestion process [8–10]. Besides, they are implicated in other physiological processes in arthropods, such as embryogenesis or metamorphosis [11–14]. For T. urticae in vitro assays determined that major protease activity in extracts relies on papain-like cysteine type protease activity , which match up with the proliferation of this gene family in the spider mite genome . In addition, a multigene family of legumain genes was also found in the spider mite genome , which could have a role in feeding similar to that observed for a legumain peptidase related to the digestive process of the hard tick Ixodes ricinus.
The most known inhibitors of C1A and C13 peptidases are the members of the I25 cystatin superfamily. In arthropods, the two ancestral eukaryotic lineages of the cystatin superfamily, stefins (I25A) and cystatins (I25B), have been reported . Stefins are single copy intracellular inhibitors without disulphide bridges. Cystatins undergo frequent duplication events during evolution, and have a signal peptide and one or two disulphide bridges. The inhibitory consensus motif for C1A peptidases is formed by a conserved glycine residue in the N-terminal region, a QxVxG motif in the central region of the polypeptide and a tryptophan in the C-terminal region , although proteins with variants in these motives are also active as inhibitors, such as the sialostatins from ticks that lack the conserved tryptophan residue . The inhibitory activity against C13 peptidases is achieved by an asparagine included in the consensus motif S/T-N-D/S-M/I/L based on vertebrate and plant cystatins [19, 20], although the ability to inhibit legumains by variants of this motif has not been tested. Some studies have suggested that cystatins from arthropods have functions related to the control of endogenous proteolysis, the balance of host–vector immune relationships, innate immunity, or antimicrobial defence [21–24]. In addition, two other types of cysteine peptidase inhibitors, the propeptide regions of C1A peptidases (I29) and the thyropins (I31) have also been reported in arthropods [25, 26]. However, with the exception of the cystatins that contributes to blood feeding in ticks by suppressing host immune response , their potential role in the feeding process of arthropods has yet to be investigated.
Comparative genomic analyses provide valuable insights into the conservation and evolution of protein families, which could aid to elucidate issues concerning their function . Thus, we have performed a comparative study of the gene families of cysteine peptidases and their inhibitors in representative species of different taxonomic groups belonging to Arthropoda. This analysis has been focused in those cysteine peptidases potentially involved in T. urticae feeding . Results indicate mite-specific proliferations of C1A and C13 cysteine peptidases and their inhibitors I25B cystatins, as well as a correlation between the in silico expression of specific cystatins with putative digestive cysteine peptidases, providing evidences for their involvement in the feeding process of the spider mite.
C1A and C13 cysteine peptidases and their inhibitors in fully sequenced arthropods
As previously described, a proliferation of both C1A and C13 cysteine peptidases was detected in the genome of the two-spotted spider mite . To obtain further insights on the genomic content of proteins putatively related to the proteolityc digestive process of T. urticae, we extended the search of members from these peptidase families and their inhibitors to other species of arthropods, whose genomes have been completely sequenced and annotated, and drafts of these sequences are available on the web. The selected species were: two acari, T. urticae and Ixodes scapularis (black legged tick); one crustacean, Daphnia pulex (common water flea); and ten insect species, the dipterans Drosophila melanogaster (fruit fly) and Anopheles gambiae (African malaria mosquito), the lepidopteran Bombyx mori (domestic silkworm), the coleopteran Tribolium castaneum (red flour beetle), the hymenopterans Camponotus floridanus (carpenter ant), Apis mellifera (honey bee) and Nasonia vitripennis (jewel wasp), the hemipterans Rhodnius prolixus (kissing bug) and Acyrthosiphon pisum (pea aphid), and the phtirapteran Pediculus humanus corporis (human body louse).
Gene content evolution of I25 cystatins in arthropods
To understand how the cystatin lineage has evolved in the different arthropod clades, the individual I25B domains were aligned by MUSCLE (see Additional file 1A). Extensive amino acid differences avoid the construction of a robust phylogenetic tree using all the cystatin sequences. Thus, sequences contributing to extensive gaps in the conserved regions of the alignment were discarded and a phylogenetic tree constructed by the maximum likelihood PhyML method (see Additional file 2A). The corresponding schematic cladogram is shown in Figure 2B. As highlighted, clado-specific proliferations are detected, supported by approximate likelihood-ratio test values (aLRT) higher than 80%. This cladogram suggests that the evolution of the cystatin family in arthropods is the result of extensive duplications from the ancestral genes probably determined for specific features in each clade.
Gene content evolution of I31 thyropins in arthropods
Structural features of T. urticae cystatins
To determine how amino acid differences can influence the tri-dimensional structure of the cystatins, the structures of the TuCPI-1, -3, -7, and −13 proteins, representatives of each group, were modelled using the crystallographic structure of the cystatins from chicken egg (1YVBI), soft tick (3L0R), and human cystatin F (2CH9) (Figure 4B). TuCPI-1 and −7 aligned to chicken egg cystatin at sequence identities of 30% and 23%, and with Q-MEAN Z-scores of −3.36 and −2.87, respectively. TuCPI-3 and −13 aligned to soft tick cystatin and human cystatin F at sequence identities of 28% and 22%, and with Q-MEAN Z-scores of −2.32 and −1.76, respectively. These results imply relatively accurate models for T. urticae cystatins. From models, strong differences in the putative region for C1A peptidase inhibition were observed. The consensus amino acid residues in TuCPI-1 fit with a canonical interaction with C1A peptidases. On the contrary, the conserved tryptophan in TuCPI-3 and −7 located at the end of the third β-sheet could be involved in a distinct interaction with these peptidases leading to changes in their inhibitory specificity. In the case of TuCPI-13, the lack of conserved residues in the domain responsible to C1A peptidase inhibition could mean a lack of inhibition to these peptidases. TuCPI-1 and −3 also have an asparagine in the loop after the conserved α-helix that could be involved in legumain inhibition. This asparagine was absent in TuCPI-7 and −13. However, four group cystatins have at least two different conserved motifs (YNK and SKPY) at the spatial region where the C13 inhibitory activity is achieved. A role for the asparagine of the YNK motif in legumain inhibition could be hypothesized, although the conserved motifs could be alternatively related to a different function.
Expression profiling of cysteine peptidases and inhibitors
Expression of cysteine peptidases and inhibitors in adults against embryos
Adult> > Embryoa
Adult = Embryo
Embryo> > Adulta
A similar analysis was conducted for the four groups of spider mite cystatins. The different groups have specific expression patterns (Figure 5C, D). TuCPI-1, the only protein in group 1, was highly expressed in all developmental stages (data not shown) with its maximum expression in nymphs. Members of group 2 showed the highest expression values and were most abundant in adults and nymphs. Conversely, group 3 cystatins presented the lower level of expression, with one gene mostly expressed in embryos and the other in larvae. Finally, the members of group 4 had, in general, low expression values, but had an interesting developmental pattern, with a weaker expression in adults or embryos than in nymphs.
Cysteine peptidases are crucial in different arthropod physiological processes [11–14], including the digestion of dietary proteins in mites and some insect species [8–10]. The dominant cysteine peptidase activity detected in body extracts of T. urticae points out their main role in proteolytic digestion after feeding [4, 5]. In addition, the proliferation of C1A papain and C13 legumain families in the recently annotated genome of T. urticae and changes in expression detected in these peptidase genes after host change suggest their implication in the feeding process and, most probably, on the ability of mites to feed on the large number of plant hosts .
The broad multigene family of papain-like cysteine peptidases and the unusual proliferation of legumain peptidases in T. urticae were found to have no counterparts after extending the analysis to the annotated genomes of arthropods available so far, including ten insect species belonging to different orders, the crustacean D. pulex, and the acari I. scapularis. Gene expansions in C1A cysteine peptidases were also found, though in a lower extent, in insects such as the aphid A. pisum and the beetle T. castaneum, which also rely on cysteine peptidases for proteolytic digestion [29, 30]. On the contrary, none of the insects analyzed belonging to the orders Diptera (D. melanogaster and A. gambiae), Lepidoptera (B. mori), Hymenoptera (C. floridanus A. mellifera and N. vitripennis) and Phthiraptera (P. humanus), with a digestive system mostly based on serine peptidases , presented this extensive proliferation of cysteine peptidases. Hemipterans (R. prolixus) and ticks (I. scapularis), that in addition to cysteine peptidases use serine and/or aspartyl peptidases for proteolytic digestion , did not also have an expansion of their cysteine peptidases. In the case of the crustacean D. pulex, in which digestion commonly relies on trypsins and chymotrypsins , proliferation of cysteine peptidase genes could be associated to specific adaptations to the lifestyle of a planktonic filter feeder in a highly variable aquatic environment. Thus, we could conclude that, in general, extensive C1A cysteine peptidase duplications in arthropods are correlated to their corresponding diet and could be related to nutritional functions.
Large-scaled gene amplification for legumains is a striking feature restricted to T. urticae. Legumains, also called asparaginyl endopeptidases, have been involved in the degradation of host hemoglobin in ticks [9, 33]. A legumain from Ixodes ricinus is located intracellularly in the vacuoles of gut epithelial cells where digestion occur throughout the whole duration of feeding , and contributes directly to the cleavage of hemoglobin and/or the processing of other gut peptidase zymogens . In most acariform mites, digestion is considered to be composed of an extracellular phase achieved by enzymes secreted to the lumen of the midgut, followed by an intracellular one that can be performed in free floating cells [2, 3, 35]. Legumains could be involved in the intracellular phase by processing some other peptidases or by a direct action on the plant feed proteins. Large expansion of this family in T. urticae could be related to the broad range of C1A cysteine peptidases that need to be activated for digestion or to their direct role on host selectivity by processing plant toxic proteins.
Massive gene expression data have been previously used to detect different peptidases in the gut of insects [30, 36–38] and ticks . Since digestive proteolytic enzymes should be abundantly generated to deal with the higher volume of food that must be hydrolysed in free living developmental stages, genes involved in the proteolytic digestion are expected to be expressed at higher rate in free leaving stages comparing to embryos. Transcriptomic data analysis in T. urticae indicates that most cathepsin B and L peptidases and legumains are expressed at a higher rate in larvae, nymphs and adults, confirming their putative role in proteolytic digestion. On the contrary, other gene cysteine peptidase families like caspases, separases, calpains and GPI:protein transamidases have similar levels of expression in the four developmental stages analysed, supporting a role for these genes in some other endogenous processes across the whole life of the spider mite.
The activity of this extended number of peptidases must be regulated in the acari. Among cysteine peptidase inhibitors, the cystatin, thyropin and C1A cysteine peptidase propeptide families have been previously described in arthropods. Cysteine peptidase propeptides are inhibitory domains that are included in all the C1A peptidases analysed in this study. They have a conserved role in the control of C1A peptidase activity before the inhibitory domain is released and the peptidase becomes active . The T. urticae genome does not contain small propeptide-like genes similar to those present in Bombyx.
Remarkably, the expansion in peptidase genes of the families C1A and C13 in T. urticae is accompanied by a proliferation of cystatin inhibitors putatively targeting both enzyme families. The physiological functions of this extended number of I25 cystatins in T. urticae may be related to their inhibitory activity on specific C1A and C13 peptidases. In insects, cystatins have been associated to processes related to the regulation of endogenous protease activity, such as insect morphogenesis and development [23, 41], and/or in the inhibition of heterologous cysteine peptidases, e.g. during insect immune response and plant feeding [21, 42]. Moreover, cystatins have been previously related to blood-feeding in ticks [27, 43, 44], where they are expressed in salivary glands and the midgut contributing to suppress host immune response. The fact that cystatins have evolved differentially in arthropod clades implies a specific function for the members of the proliferated groups in each clade. Thus, besides their potential implications in some other physiological processes, spider mite cystatins could have a role in mite feeding by regulating their own digestive cysteine peptidases, by inhibiting cysteine peptidases of the host after feeding, and/or by contributing to counteract host defence mechanisms. Several results support the putative involvement of cystatins in mite feeding: i) transcriptomic data analysis indicate that group two and four cystatins are consistently detected at greater levels in larvae, nymphs and adults relative to embryos; ii) the amino acid sequences for several spider mite cystatin groups have diverged deeply from consensus cystatin motifs related to cysteine peptidase inhibition. In fact, group 4 cystatin genes code for a new type of cystatins with no known paralogs in living organisms that putatively contain C13 peptidase binding domains, but do not contain canonical C1A peptidase binding domains. These results support an evolutionary scenario in which cystatin family may have evolved in the spider mite to control the proliferation of divergent C1A cysteine peptidases and C13 legumains involved in protein digestion. Alternatively, cystatins in T. urticae may have evolved as specific adaptations to the lifestyle of an extremely polyphagous species to deal with highly variable resources.
On the contrary, limited data on the physiological roles for thyropins are available. Thyropins are thyroglobulin domains capable of exhibiting inhibitory activity, which has been reported mainly against C1A cysteine peptidases . No arthropod thyropins have been characterized to date, although some members are present in the sialotranscriptome of several ticks [46, 47]. The possible implication of thyropins in the control of peptidases involved in feeding comes from the reduction of cysteine peptidase activity and deleterious effects on larval growth when the thyropin equistatin isolated from sea anemone Actinia equine was introduced in the diet of the insects T. castaneum and Leptinotarsa decemlineata[48, 49]. However, the parallel evolution of thyropins, with complex architectures shared by different arthropod clades, and their transcriptional pattern, having most of them a similar level of expression in adults and embryos, suggest a common conserved role that could be in some cases related to feeding, but no specific for the spider mite nutritional aspects.
Comparative genomic analyses have provided valuable insights into the conservation and evolution of cysteine peptidases and their inhibitors in T. urticae. A phylogenetic analysis of these gene families in representative species of different arthropod taxonomic groups has allowed us to state that clade-specific proliferations are common to C1A papain-like and C13 legumain-like peptidases, as well as to the I25 cystatins, whereas the I31 thyropins are evolutionarily more conserved among arthropod clades. Extensive duplications and transcriptomic data for spider mite C1A and C13 peptidases support their role in proteolytic digestion. The expansion of the I25 cystatin family of inhibitors and their highest expression in feeding stages suggest a role for some cystatin members in mite feeding by regulating endogenous or exogenous peptidases and/or by contributing to counteract host defence mechanisms. In conclusion, mite-specific proliferations of both peptidases and their inhibitors are in accordance with mite’s feeding features and support a key role for these proteins in allowing the broad plant host feeding range described for the two-spotted spider mite.
Blast searches for cystatins, thyropins and cysteine peptidases were performed in publicly available genome databases. Sequences for Tetranychus urticae were obtained at the BOGAS (Bioinformatics Online Genome Annotation System) website . Sequences for other arthopods were identified by searching the current genome releases at: the ant Fourmidable database ; the AphidBase ; the invertebrate vectors for human pathogens VectorBase ; the Daphnia wFleaBase ; the FlyBase ; the Bombyx mori SilkDB ; the BeeBase ; the wasp NasoniaBase ; and the BeetleBase . Blast searches were made in a recurrent way. First, a complete amino acid arthropod sequence from data banks corresponding to a protein of the family was used. Then, the protein sequences of each arthropod species were used to search in the species. Finally, after an alignment of the proteins found in arthropods, the conserved region surrounding the catalytic sites from the species most related was used to a final search in each arthropod species.
Information about gene models for all these proteins is compiled in Additional file 5.
Domain architecture prediction
Amino acid sequences for arthropod proteins putatively including at least one cystatin or thyropin domain were subjected to a sequence search in the Pfam database v 26.0  to know the combination of domains within each protein. From these results, the domain architecture of each protein was manually schematized.
Protein alignments and Phylogenetic trees
Alignments of the amino acid sequences were performed using the default parameters of MUSCLE version 3.8 . Sequences with extensive gaps were manually excluded from phylogenetic analysis using the multiple alignment editor Jalview version 2.7 . Phylogenetic and molecular evolutionary analyses were conducted using the programs PhyML 3.0 and MEGA version 5.0 [63, 64]. The program PROTTEST (2.4) was employed for selecting the model of protein evolution that fits better to each alignment according to the corrected Akaike Information Criterion . The parameters of the selected models were employed to reconstruct the displayed clan CD cysteine peptidases trees by means of a maximum likelihood PhyML method using a BIONJ starting tree. The approximate likelihood-ratio test (aLRT) based on a Shimodaira-Hasegawa-like procedure was used as statistical test for non-parametric branch support . All families were also analysed with the Maximum parsimony and the Neighbour-Joining algorithms, and with different gap penalties. No significant differences in the tree topologies were detected.
Molecular modelling of T. urticae cystatins
The three-dimensional structures of the T. urticae cystatins were modelled using the standard automated routine of SWISS-MODEL program . The known crystal structures of the cystatins from chicken egg (PDB identifier 1YVBI), soft tick (3L0R), and human cystatin F (2CH9) were used to construct the homology-based models. The template structures were selected on the basis of highest sequence similarities. Models were evaluated with the QMEAN Z-score for predicting the absolute quality of a model . The Swiss-PdbViewer program  was used to generate the single images of protein models.
In silico transcriptome expression
The transcriptomic information available at the BOGAS T. urticae website  was used to the developmental expression analyses. The protocol to normalized read counts of RNA-seq Illumina reads has been previously described . To determine significant differences in the levels of gene expression between spider mite embryos and adults, we defined as differentially expressed genes that for which the false discovery rate (FDR) corrected p-value was ≤ 0.05 and for which the fold change was ≥ 2 (either up- or down-regulated).
The financial support from the Ministerio de Educación y Ciencia (AGL2011-23650) and Government of Canada through Genome Canada and the Ontario Genomics Institute (OGI-046) is gratefully acknowledged.
- Grbic M, Van Leeuwen T, Clark RM, Rombauts S, Rouze P, Grbic V, Osborne EJ, Dermauw W, Ngoc PC, Ortego F, et al: The genome of Tetranychus urticae reveals herbivorous pest adaptations. Nature. 2011, 479 (7374): 487-492. 10.1038/nature10640.View ArticlePubMedGoogle Scholar
- Filimonova SA: The ultrastructural investigation of the midgut in the quill mite Syringophilopsis fringilla (Acari, Trombidiformes: Syringophilidae). Arthropod Struct Dev. 2009, 38 (4): 303-313. 10.1016/j.asd.2009.01.002.View ArticlePubMedGoogle Scholar
- Hamilton KA, Nisbet AJ, Lehane MJ, Taylor MA, Billingsley PF: A physiological and biochemical model for digestion in the ectoparasitic mite, Psoroptes ovis (Acari: Psoroptidae). Int J Parasitol. 2003, 33 (8): 773-785. 10.1016/S0020-7519(03)00089-4.View ArticlePubMedGoogle Scholar
- Carrillo L, Martinez M, Ramessar K, Cambra I, Castanera P, Ortego F, Diaz I: Expression of a barley cystatin gene in maize enhances resistance against phytophagous mites by altering their cysteine-proteases. Plant Cell Rep. 2011, 30 (1): 101-112. 10.1007/s00299-010-0948-z.View ArticlePubMedGoogle Scholar
- Nisbet AJ, Billingsley PF: A comparative survey of the hydrolytic enzymes of ectoparasitic and free-living mites. Int J Parasitol. 2000, 30 (1): 19-27. 10.1016/S0020-7519(99)00169-1.View ArticlePubMedGoogle Scholar
- Rawlings ND, Barrett AJ, Bateman A: MEROPS: the database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 2012, 40 (Database issue): D343-350.PubMed CentralView ArticlePubMedGoogle Scholar
- Turk V, Stoka V, Vasiljeva O, Renko M, Sun T, Turk B, Turk D: Cysteine cathepsins: From structure, function and regulation to new frontiers. Biochim Biophys Acta. 2012, 1824 (1): 68-88. 10.1016/j.bbapap.2011.10.002.View ArticlePubMedGoogle Scholar
- Cristofoletti PT, Ribeiro AF, Terra WR: The cathepsin L-like proteinases from the midgut of Tenebrio molitor larvae: sequence, properties, immunocytochemical localization and function. Insect Biochem Mol Biol. 2005, 35 (8): 883-901. 10.1016/j.ibmb.2005.03.006.View ArticlePubMedGoogle Scholar
- Horn M, Nussbaumerova M, Sanda M, Kovarova Z, Srba J, Franta Z, Sojka D, Bogyo M, Caffrey CR, Kopacek P, et al: Hemoglobin digestion in blood-feeding ticks: mapping a multipeptidase pathway by functional proteomics. Chem Biol. 2009, 16 (10): 1053-1063. 10.1016/j.chembiol.2009.09.009.PubMed CentralView ArticlePubMedGoogle Scholar
- Soares-Costa A, Dias AB, Dellamano M, de Paula FF, Carmona AK, Terra WR, Henrique-Silva F: Digestive physiology and characterization of digestive cathepsin L-like proteinase from the sugarcane weevil Sphenophorus levis. J Insect Physiol. 2011, 57 (4): 462-468. 10.1016/j.jinsphys.2011.01.006.View ArticlePubMedGoogle Scholar
- Cho WL, Tsao SM, Hays AR, Walter R, Chen JS, Snigirevskaya ES, Raikhel AS: Mosquito cathepsin B-like protease involved in embryonic degradation of vitellin is produced as a latent extraovarian precursor. J Biol Chem. 1999, 274 (19): 13311-13321. 10.1074/jbc.274.19.13311.View ArticlePubMedGoogle Scholar
- Liu J, Shi GP, Zhang WQ, Zhang GR, Xu WH: Cathepsin L function in insect moulting: molecular cloning and functional analysis in cotton bollworm, Helicoverpa armigera. Insect Mol Biol. 2006, 15 (6): 823-834. 10.1111/j.1365-2583.2006.00686.x.View ArticlePubMedGoogle Scholar
- Seixas A, Dos Santos PC, Velloso FF, Da Silva Vaz I, Masuda A, Horn F, Termignoni C: A Boophilus microplus vitellin-degrading cysteine endopeptidase. Parasitology. 2003, 126 (Pt 2): 155-163.View ArticlePubMedGoogle Scholar
- Uchida K, Ohmori D, Ueno T, Nishizuka M, Eshita Y, Fukunaga A, Kominami E: Preoviposition activation of cathepsin-like proteinases in degenerating ovarian follicles of the mosquito Culex pipiens pallens. Dev Biol. 2001, 237 (1): 68-78. 10.1006/dbio.2001.0357.View ArticlePubMedGoogle Scholar
- Franta Z, Frantova H, Konvickova J, Horn M, Sojka D, Mares M, Kopacek P: Dynamics of digestive proteolytic system during blood feeding of the hard tick Ixodes ricinus. Parasit Vectors. 2010, 3: 119-10.1186/1756-3305-3-119.PubMed CentralView ArticlePubMedGoogle Scholar
- Kordis D, Turk V: Phylogenomic analysis of the cystatin superfamily in eukaryotes and prokaryotes. BMC Evol Biol. 2009, 9: 266-10.1186/1471-2148-9-266.PubMed CentralView ArticlePubMedGoogle Scholar
- Turk V, Bode W: The cystatins: protein inhibitors of cysteine proteinases. FEBS Lett. 1991, 285 (2): 213-219. 10.1016/0014-5793(91)80804-C.View ArticlePubMedGoogle Scholar
- Kotsyfakis M, Horka H, Salat J, Andersen JF: The crystal structures of two salivary cystatins from the tick Ixodes scapularis and the effect of these inhibitors on the establishment of Borrelia burgdorferi infection in a murine model. Mol Microbiol. 2010, 77 (2): 456-470. 10.1111/j.1365-2958.2010.07220.x.PubMed CentralView ArticlePubMedGoogle Scholar
- Alvarez-Fernandez M, Barrett AJ, Gerhartz B, Dando PM, Ni J, Abrahamson M: Inhibition of mammalian legumain by some cystatins is due to a novel second reactive site. J Biol Chem. 1999, 274 (27): 19195-19203. 10.1074/jbc.274.27.19195.View ArticlePubMedGoogle Scholar
- Martinez M, Diaz-Mendoza M, Carrillo L, Diaz I: Carboxy terminal extended phytocystatins are bifunctional inhibitors of papain and legumain cysteine proteinases. FEBS Lett. 2007, 581 (16): 2914-2918. 10.1016/j.febslet.2007.05.042.View ArticlePubMedGoogle Scholar
- Buarque DS, Spindola LM, Martins RM, Braz GR, Tanaka AS: Tigutcystatin, a cysteine protease inhibitor from Triatoma infestans midgut expressed in response to Trypanosoma cruzi. Biochem Biophys Res Commun. 2011, 413 (2): 241-247. 10.1016/j.bbrc.2011.08.078.View ArticlePubMedGoogle Scholar
- Miyaji T, Murayama S, Kouzuma Y, Kimura N, Kanost MR, Kramer KJ, Yonekura M: Molecular cloning of a multidomain cysteine protease and protease inhibitor precursor gene from the tobacco hornworm (Manduca sexta) and functional expression of the cathepsin F-like cysteine protease domain. Insect Biochem Mol Biol. 2010, 40 (12): 835-846. 10.1016/j.ibmb.2010.08.003.View ArticlePubMedGoogle Scholar
- Saito H, Suzuki T, Ueno K, Kubo T, Natori S: Molecular cloning of cDNA for sarcocystatin A and analysis of the expression of the sarcocystatin A gene during development of Sarcophaga peregrina. Biochemistry. 1989, 28 (4): 1749-1755. 10.1021/bi00430a049.View ArticlePubMedGoogle Scholar
- Agarwala KL, Kawabata S, Hirata M, Miyagi M, Tsunasawa S, Iwanaga S: A cysteine protease inhibitor stored in the large granules of horseshoe crab hemocytes: purification, characterization, cDNA cloning and tissue localization. J Biochem. 1996, 119 (1): 85-94. 10.1093/oxfordjournals.jbchem.a021220.View ArticlePubMedGoogle Scholar
- Novinec M, Kordis D, Turk V, Lenarcic B: Diversity and evolution of the thyroglobulin type-1 domain superfamily. Mol Biol Evol. 2006, 23 (4): 744-755. 10.1093/molbev/msj082.View ArticlePubMedGoogle Scholar
- Yamamoto Y, Watabe S, Kageyama T, Takahashi SY: A novel inhibitor protein for Bombyx cysteine proteinase is homologous to propeptide regions of cysteine proteinases. FEBS Lett. 1999, 448 (2–3): 257-260.View ArticlePubMedGoogle Scholar
- Kotsyfakis M, Karim S, Andersen JF, Mather TN, Ribeiro JM: Selective cysteine protease inhibition contributes to blood-feeding success of the tick Ixodes scapularis. J Biol Chem. 2007, 282 (40): 29256-29263. 10.1074/jbc.M703143200.View ArticlePubMedGoogle Scholar
- Martinez M: Plant protein-coding gene families: emerging bioinformatics approaches. Trends Plant Sci. 2011, 16 (10): 558-567. 10.1016/j.tplants.2011.06.003.View ArticlePubMedGoogle Scholar
- Rispe C, Kutsukake M, Doublet V, Hudaverdian S, Legeai F, Simon JC, Tagu D, Fukatsu T: Large gene family expansion and variable selective pressures for cathepsin B in aphids. Mol Biol Evol. 2008, 25 (1): 5-17.View ArticlePubMedGoogle Scholar
- Morris K, Lorenzen MD, Hiromasa Y, Tomich JM, Oppert C, Elpidina EN, Vinokurov K, Jurat-Fuentes JL, Fabrick J, Oppert B: Tribolium castaneum larval gut transcriptome and proteome: A resource for the study of the coleopteran gut. J Proteome Res. 2009, 8 (8): 3889-3898. 10.1021/pr900168z.View ArticlePubMedGoogle Scholar
- Terra WR, Ferreira C: 4.5 - Biochemistry of Digestion. Comprehensive Molecular Insect Science. Edited by: Lawrence IG, Kostas I, Sarjeet SG. 2005, Elsevier, Amsterdam, 171-224.View ArticleGoogle Scholar
- Schwerin S, Zeis B, Lamkemeyer T, Paul RJ, Koch M, Madlung J, Fladerer C, Pirow R: Acclimatory responses of the Daphnia pulex proteome to environmental changes. II. Chronic exposure to different temperatures (10 and 20 degrees C) mainly affects protein metabolism. BMC Physiol. 2009, 9: 8-10.1186/1472-6793-9-8.PubMed CentralView ArticlePubMedGoogle Scholar
- Alim MA, Tsuji N, Miyoshi T, Islam MK, Hatta T, Yamaji K, Fujisaki K: Developmental stage- and organ-specific expression profiles of asparaginyl endopeptidases/legumains in the ixodid tick Haemaphysalis longicornis. J Vet Med Sci. 2008, 70 (12): 1363-1366. 10.1292/jvms.70.1363.View ArticlePubMedGoogle Scholar
- Sojka D, Hajdusek O, Dvorak J, Sajid M, Franta Z, Schneider EL, Craik CS, Vancova M, Buresova V, Bogyo M, et al: IrAE: an asparaginyl endopeptidase (legumain) in the gut of the hard tick Ixodes ricinus. Int J Parasitol. 2007, 37 (7): 713-724. 10.1016/j.ijpara.2006.12.020.PubMed CentralView ArticlePubMedGoogle Scholar
- Mothes-Wagner U: Fine structure of the ‘hindgut’ of the two-spotted spider mite, Tetranychus urticae, with special reference to origin and function. Exp Appl Acarol. 1985, 1 (3): 253-272. 10.1007/BF01198522.View ArticleGoogle Scholar
- Dostalova A, Votypka J, Favreau AJ, Barbian KD, Volf P, Valenzuela JG, Jochim RC: The midgut transcriptome of Phlebotomus (Larroussius) perniciosus, a vector of Leishmania infantum: comparison of sugar fed and blood fed sand flies. BMC Genomics. 2011, 12: 223-10.1186/1471-2164-12-223.PubMed CentralView ArticlePubMedGoogle Scholar
- Jochim RC, Teixeira CR, Laughinghouse A, Mu J, Oliveira F, Gomes RB, Elnaiem DE, Valenzuela JG: The midgut transcriptome of Lutzomyia longipalpis: comparative analysis of cDNA libraries from sugar-fed, blood-fed, post-digested and Leishmania infantum chagasi-infected sand flies. BMC Genomics. 2008, 9: 15-10.1186/1471-2164-9-15.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhang S, Shukle R, Mittapalli O, Zhu YC, Reese JC, Wang H, Hua BZ, Chen MS: The gut transcriptome of a gall midge, Mayetiola destructor. J Insect Physiol. 2010, 56 (9): 1198-1206. 10.1016/j.jinsphys.2010.03.021.View ArticlePubMedGoogle Scholar
- Anderson JM, Sonenshine DE, Valenzuela JG: Exploring the mialome of ticks: an annotated catalogue of midgut transcripts from the hard tick, Dermacentor variabilis (Acari: Ixodidae). BMC Genomics. 2008, 9: 552-10.1186/1471-2164-9-552.PubMed CentralView ArticlePubMedGoogle Scholar
- Wiederanders B, Kaulmann G, Schilling K: Functions of propeptide parts in cysteine proteases. Curr Protein Pept Sci. 2003, 4 (5): 309-326. 10.2174/1389203033487081.View ArticlePubMedGoogle Scholar
- Goto SG, Denlinger DL: Genes encoding two cystatins in the flesh fly Sarcophaga crassipalpis and their distinct expression patterns in relation to pupal diapause. Gene. 2002, 292 (1–2): 121-127.View ArticlePubMedGoogle Scholar
- Francischetti IM, Lopes AH, Dias FA, Pham VM, Ribeiro JM: An insight into the sialotranscriptome of the seed-feeding bug, Oncopeltus fasciatus. Insect Biochem Mol Biol. 2007, 37 (9): 903-910. 10.1016/j.ibmb.2007.04.007.PubMed CentralView ArticlePubMedGoogle Scholar
- Grunclova L, Horn M, Vancova M, Sojka D, Franta Z, Mares M, Kopacek P: Two secreted cystatins of the soft tick Ornithodoros moubata: differential expression pattern and inhibitory specificity. Biol Chem. 2006, 387 (12): 1635-1644.View ArticlePubMedGoogle Scholar
- Zhou J, Liao M, Ueda M, Gong H, Xuan X, Fujisaki K: Characterization of an intracellular cystatin homolog from the tick Haemaphysalis longicornis. Vet Parasitol. 2009, 160 (1–2): 180-183.View ArticlePubMedGoogle Scholar
- Mihelic M, Turk D: Two decades of thyroglobulin type-1 domain research. Biol Chem. 2007, 388 (11): 1123-1130.View ArticlePubMedGoogle Scholar
- Anatriello E, Ribeiro JM, de Miranda-Santos IK, Brandao LG, Anderson JM, Valenzuela JG, Maruyama SR, Silva JS, Ferreira BR: An insight into the sialotranscriptome of the brown dog tick Rhipicephalus sanguineus. RBMC Genomics. 2010, 11: 450-10.1186/1471-2164-11-450.View ArticleGoogle Scholar
- Francischetti IM, Sa-Nunes A, Mans BJ, Santos IM, Ribeiro JM: The role of saliva in tick feeding. Front Biosci. 2009, 14: 2051-2088.View ArticleGoogle Scholar
- Gruden K, Strukelj B, Popovic T, Lenarcic B, Bevec T, Brzin J, Kregar I, Herzog-Velikonja J, Stiekema WJ, Bosch D, et al: The cysteine protease activity of Colorado potato beetle (Leptinotarsa decemlineata Say) guts, which is insensitive to potato protease inhibitors, is inhibited by thyroglobulin type-1 domain inhibitors. Insect Biochem Mol Biol. 1998, 28 (8): 549-560. 10.1016/S0965-1748(98)00051-4.View ArticlePubMedGoogle Scholar
- Oppert B, Morgan TD, Hartzer K, Lenarcic B, Galesa K, Brzin J, Turk V, Yoza K, Ohtsubo K, Kramer KJ: Effects of proteinase inhibitors on digestive proteinases and growth of the red flour beetle, Tribolium castaneum (Herbst) (Coleoptera: Tenebrionidae). Comp Biochem Physiol C Toxicol Pharmacol. 2003, 134 (4): 481-490. 10.1016/S1532-0456(03)00042-5.View ArticlePubMedGoogle Scholar
- Tetranychus urticae website: BOGAS (Bioinformatics Online Genome Annotation System) http://bioinformaticspsbugentbe/webtools/bogas/overview/Tetur
- The ant Fourmidable database. http://antgenomesorg/
- The AphidBase. http://wwwaphidbasecom/aphidbase/
- The invertebrate vectors for human pathogens VectorBase. http://wwwvectorbaseorg/
- The Daphnia wFleaBase. http://wfleabaseorg/
- The FlyBase. http://flybaseorg/
- The Bombyx mori SilkDB. http://silkwormgenomicsorgcn/
- The BeeBase. http://hymenopteragenomeorg/beebase/
- The wasp NasoniaBase. http://hymenopteragenomeorg/nasonia/
- The BeetleBase. http://wwwbeetlebaseorg/
- Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, et al: The Pfam protein families database. Nucleic Acids Res. 2012, 40 (Database issue): D290-301.PubMed CentralView ArticlePubMedGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.PubMed CentralView ArticlePubMedGoogle Scholar
- Waterhouse AM, Procter JB, Martin DM, Clamp M, Barton GJ: Jalview Version 2–a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009, 25 (9): 1189-1191. 10.1093/bioinformatics/btp033.PubMed CentralView ArticlePubMedGoogle Scholar
- Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O: New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010, 59 (3): 307-321. 10.1093/sysbio/syq010.View ArticlePubMedGoogle Scholar
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28 (10): 2731-2739. 10.1093/molbev/msr121.PubMed CentralView ArticlePubMedGoogle Scholar
- Abascal F, Zardoya R, Posada D: ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005, 21 (9): 2104-2105. 10.1093/bioinformatics/bti263.View ArticlePubMedGoogle Scholar
- Anisimova M, Gil M, Dufayard JF, Dessimoz C, Gascuel O: Survey of branch support methods demonstrates accuracy, power, and robustness of fast likelihood-based approximation schemes. Syst Biol. 2011, 60 (5): 685-699. 10.1093/sysbio/syr041.PubMed CentralView ArticlePubMedGoogle Scholar
- Bordoli L, Kiefer F, Arnold K, Benkert P, Battey J, Schwede T: Protein structure homology modeling using SWISS-MODEL workspace. Nat Protoc. 2009, 4 (1): 1-13.View ArticlePubMedGoogle Scholar
- Benkert P, Biasini M, Schwede T: Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics. 2011, 27 (3): 343-350. 10.1093/bioinformatics/btq662.PubMed CentralView ArticlePubMedGoogle Scholar
- Guex N, Diemand A, Peitsch MC: Protein modelling for all. Trends Biochem Sci. 1999, 24 (9): 364-367. 10.1016/S0968-0004(99)01427-9.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.