Table 2 The average sequence similarity between proteins in training datasets and those in test datasets

From: A generalized approach to predicting protein-protein interactions between virus and host

  

| Proteins in training datasets | Target proteins in test datasets | Average sequence similarity |
|---|---|---|
| 766 virus proteins in TR1, TR3 | 11 H1N1 virus proteins in TS1 | 9.6% |
| 774 virus proteins in TR2, TR4 | 3 Ebola virus proteins in TS2 | 10.9% |
| 3,924 human proteins in TR5 | 368 non-human animal proteins in TS5.1 | 10.7% |
| 3,924 human proteins in TR5 | 13 plant proteins in TS5.2 | 10.6% |
| 3,924 human proteins in TR5 | 106 bacteria proteins in TS5.3 | 10.4% |