Table 1 Genome-wide target class label assignment to each protein-coding gene as a data point for Dickeya dadantii 3937 and Pectobacterium carotovorum WPP14

From: Identification of host-microbe interaction factors in the genomes of soft rot-associated pathogens Dickeya dadantii 3937 and Pectobacterium carotovorum WPP14 with supervised machine learning

 

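| Genome | Total # CDS* | IF** | CF** | Training data set | Testing data set | Pseudogene |
|--------|--------------|------|------|-------------------|------------------|------------|
| Dd3937 | 4520         | 267  | 1264 | 1531              | 2989             | 28         |
| WPP14  | 4590         | 233  | 1111 | 1344              | 3246             | 174        |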

  1. *Only protein-coding genes are counted; pseudogenes are not included.
  2. **IF stands for host-microbe interaction factor; CF stands for genes involved in core biological processes.
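
The counts in Table 1 appear to be related by simple arithmetic: for each genome, the training data set equals IF + CF, and the total number of CDS equals training + testing. The source does not state this relationship; it is inferred from the numbers. The minimal Python sketch below checks it, with counts copied from the table. The dictionary layout and the `check_row` helper are illustrative names, not part of the original work.

```python
# Counts copied from Table 1. The structure and key names are illustrative
# assumptions made for this sketch.
TABLE_1 = {
    "Dd3937": {"total_cds": 4520, "IF": 267, "CF": 1264,
               "training": 1531, "testing": 2989, "pseudogene": 28},
    "WPP14": {"total_cds": 4590, "IF": 233, "CF": 1111,
              "training": 1344, "testing": 3246, "pseudogene": 174},
}


def check_row(row):
    """Return True if IF + CF equals the training set size and
    training + testing equals the total CDS count."""
    labelled = row["IF"] + row["CF"]
    return (labelled == row["training"]
            and row["training"] + row["testing"] == row["total_cds"])


# Check every genome in the table.
results = {genome: check_row(row) for genome, row in TABLE_1.items()}
print(results)
```

If the inferred relationship holds, genes labelled IF or CF make up the training data, and the remaining unlabelled protein-coding genes make up the testing data. Pseudogenes are outside both sets, consistent with the first footnote.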