Skip to main content
Fig. 1 | BMC Genomics

Fig. 1

From: A context-free encoding scheme of protein sequences for predicting antigenicity of diverse influenza A viruses

Fig. 1

A pipeline for machine learning projects and illustration for CFreeEnS. a Encoding a non-numeric dataset into equal-length numeric vectors is necessary for both traditional machine learning models and deep neural networks. b CFreeEnS encodes m aligned protein sequence pairs of length l with k substitution matrices, resulting in a numeric feature matrix X with dimension m×k×l

Back to article page