Table 1 Datasets for training and testing the predicting model

From: A context-free encoding scheme of protein sequences for predicting antigenicity of diverse influenza A viruses

| Subtype | Number of sequences | T | D/T | HA1 lengths |
|---|---|---|---|---|
| H1N1 | 68 | 355 | 0.50 | 327 |
| H3N2 | 621 | 791 | 0.47 | 329 |
| H5N1 | 148 | 293 | 0.57 | 320 |
| H9N2 | 29 | 118 | 0.68 | 317 |
| Combined | 866 | 1557 | 0.52 | 340 |

  1. T: Total number of viral pairs;
  2. D: The number of antigenically distinct viral pairs;
  3. Combined: The combined dataset of H1N1, H3N2, H5N1 and H9N2