Table 2 Composition of the datasets used in this work

From: Avoidance of recognition sites of restriction-modification systems is a widespread but not universal anti-restriction strategy of prokaryotic viruses

| Dataset | Number of different sites | Number of different genomes | Number of (site, genome) pairs |
|---|---|---|---|
| Experimental dataset | 494ᵃ | 2861ᵇ | 66,704 |
| Control dataset 1 | 899 | 3407 | 3,062,893 |
| Control dataset 2 | 899 | 4021 | 3,614,879 |

ᵃ R-M systems encoded in the genomes of the known phage hosts recognize 494 of the 899 known recognition sites (RS).
ᵇ Only 2861 of the 3407 phages have known host species with available data on the encoded R-M systems.
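
The pair counts in the table can be checked against the site and genome counts with simple arithmetic. The following is a minimal sketch that only uses the numbers from Table 2; the helper name `pair_coverage` is hypothetical and does not come from the paper.

```python
# Counts copied from Table 2: (sites, genomes, reported (site, genome) pairs).
TABLE_2 = {
    "Experimental dataset": (494, 2861, 66_704),
    "Control dataset 1": (899, 3407, 3_062_893),
    "Control dataset 2": (899, 4021, 3_614_879),
}


def pair_coverage(sites, genomes, pairs):
    """Return (size of the full sites x genomes product, fraction reported)."""
    full = sites * genomes
    return full, pairs / full


for name, (sites, genomes, pairs) in TABLE_2.items():
    full, fraction = pair_coverage(sites, genomes, pairs)
    print(f"{name}: {pairs:,} of {full:,} possible pairs ({fraction:.1%})")
```

For both control datasets the reported pair count equals sites × genomes exactly (899 × 3407 = 3,062,893 and 899 × 4021 = 3,614,879), which suggests that every site was paired with every genome. The experimental dataset covers only about 4.7% of the 494 × 2861 = 1,413,334 possible pairs. One plausible reading, inferred from the footnotes rather than stated in the table, is that each phage is paired only with sites recognized by R-M systems of its own hosts.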