Skip to main content

Table 5 Distribution of top 40 protein sequences containing at least two copies of EPIYA motif

From: Effector prediction in host-pathogen interaction based on a Markov model of a ubiquitous EPIYA motif

Protein Name Number of proteins (number of genuses)
  Total Archaea Viruses Bacteria Protista Fungi Metazoa Viridiplantae
CagA 1015(1)    1015(1)     
hypothetical protein 689(186) 15(10) 10(6) 242(88) 162(19) 78(28) 127(25) 55(11)
ATP* 81(21) 2(2)   68(11) 6(4) 4(3)   1(1)
Ankryin 55(7)    51(3)    4(4)  
DNA* 52(34) 3(3)   40(25) 7(4) 1(1)   1(1)
Kinase 43(28) 5(2)   23(15) 4(3)   11(8)  
zinc finger protein 43(11)       43(11)  
TPR repeat protein 33(15)    33(15)     
Polyprotein 24(2)   24(2)      
SecA 23(14)    23(14)     
Peptidase 19(12) 1(1)   16(9) 1(1)   1(1)  
dynein heavy chain 17(13)     4(2) 1(1) 11(9) 1(1)
elongation factor 2 15(7)     10(2) 1(1) 4(4)  
Palmdelphin 14(9)       14(9)  
tRNA* 14(11) 2(1)   10(8) 1(1) 1(1)   
glycogen synthase 13(1)    13(1)     
GTP-binding 13(3)    12(2) 1(1)    
transcriptional regulator 13(8) 1(1)   9(4)   3(3)   
unc-119 homolog 13(6)       13(6)  
FAT tumor suppressor homolog 3 12(9)       12(9)  
nuclear ribonucleoprotein 12(9)     1(1) 10(7)   1(1)
4-alpha-glucanotransferase 9(1)    9(1)     
paternally expressed 3 8(6)       8(6)  
Striatin 8(7)       8(7)  
Tarp 8(1)    8(1)     
nuclear autoantigen 7(6)       7(6)  
putative mannosyltransferase 7(1)    7(1)     
Ubiquitin 7(6)     5(4) 2(2)   
26S proteasome regulatory subunit 6(3)        6(3)
cell division protein 6(4) 3(3)   1(1)     
centaurin, delta 3 6(5)       6(5)  
fat tumor suppressor homolog 2 6(5)       6(5)  
glycosyl transferase 6(5)    6(5)     
guanine nucleotide exchange factor 6(6)       6(6)  
cytochrome c oxidase subunit VI 5(4)     5(4)    
PEG3 5(5)       5(5)  
polyketide synthase 5(4)    3(3) 2(1)    
polysaccharide biosynthesis protein 5(3)    5(3)     
TatD-related deoxyribonuclease 5(1)    5(1)     
translation initiation factor 5(4)    2(2) 2(1) 1(1)   
  1. ATP* includes ATPase, ABC transporter, ATP-binding protein, and ATP-dependent helicase; DNA* includes DNA photolyase, DNA primase, DNA repair protein, DNA-binding protein, and DNA mismatch repair protein; kinases* includes histidine kinase, protein kinase, hexokinase, serine kinase, and fyn-related kinase; tRNA* includes tRNA synthetase, tRNA formyltransferase, and tRNA ligase.