Skip to main content

Table 1 Datasets for training and testing the predicting model

From: A context-free encoding scheme of protein sequences for predicting antigenicity of diverse influenza A viruses

Subtype Number of sequences T D/T HA1 lengths
H1N1 68 355 0.50 327
H3N2 621 791 0.47 329
H5N1 148 293 0.57 320
H9N2 29 118 0.68 317
Combined 866 1557 0.52 340
  1. 1T: Total number of viral pairs;
  2. 2D: The number of antigenic distinct viral pairs;
  3. 3Combined: The combined dataset of H1N1, H3N2, H5N1 and H9N2