Phylogenetic tree of cysteine cathepsins. An analysis of predicted proteins from cysteine cathepsin genes annotated in the draft genome of H. halys was performed using MEGA7. The tree with the highest log likelihood is shown (-9596.54). The percentage of trees in which the associated taxa clustered together is indicated beside the branches. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. Cysteine cathepsins annotated in H. halys include those that are conserved in the cathepsin L-like subfamily (Hh CatF, Hh CatO, Hh CatI, Hh CatLl) in green and species-specific (Hh Cat ss.uLX.x) in yellow; cathepsin B ortholog (Hh CatB) in blue and species-specific cathepsin B-like (Hh Cat ss.uLX.x) in pink; and human cathepsins, which are marked according to UniProt IDs: L (CATL1_HUMAN, P07711), V (CATL2_HUMAN, O60911), F (CATF_HUMAN, Q9UBX1), O (CATO_HUMAN, P43234), H (CATH_HUMAN, P09668), K (CATK_HUMAN, P43235), S (CATS_HUMAN, P25774), W (CATW_HUMAN, P56202), Z (CATZ_HUMAN, Q9UBR2), B (CATB_HUMAN, P07858), C (CATC_HUMAN, P53634) and TINAL-like protein (TINAL_HUMAN, Q9GZM7). Correspondences between leaf node identifiers and NCBI protein sequences are indicated in Additional file 1: Table S14.