
Table 2 Percentage of underrepresented sites of different types in the different datasets

From: Lifespan of restriction-modification systems critically affects avoidance of their recognition sites in host genomes

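| R-M system site type | Actual pairs dataset | Experimentally proven dataset | Prokaryotic control dataset | Viral control dataset |
|---|---|---|---|---|
| Type I | 0.0 % (0/100) | 0.0 % (0/14) | 0.1 % (238/357501) | 0.1 % (21/18859) |
| Type III | 0.0 % (0/76) | 0.0 % (0/7) | 0.3 % (213/82065) | 0.2 % (57/31571) |
| Type IIC/G | 0.0 % (0/107) | 0.0 % (0/47) | 0.1 % (171/218322) | 0.2 % (66/27699) |
| Type II orthodox | 47.9 % (850/1774) | 45.3 % (58/128) | 3.9 % (21380/542911) | 1.7 % (2720/158921) |
| Type IIM | 70.4 % (38/54)^a | 14.3 % (1/7) | 0.6 % (125/21128) | 0.3 % (79/29070) |
| Type IV | 0.0 % (0/13) | 0.0 % (0/3) | 1.0 % (64/6342) | 0.2 % (25/10116) |

Each cell gives the percentage of underrepresented sites, followed in parentheses by the number of underrepresented sites over the total number of sites.

^a All 38 underrepresented sites are GATC.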
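
The percentages in Table 2 follow directly from the underrepresented/total counts. As a minimal sketch (the names `COUNTS`, `DATASETS`, and `percent` are illustrative and not from the article, and rounding to one decimal place is an assumption inferred from the printed values), they can be recomputed like this:

```python
# Counts copied from Table 2: (underrepresented sites, total sites) per dataset.
# Dataset order: actual pairs, experimentally proven, prokaryotic control, viral control.
DATASETS = ["Actual pairs", "Experimentally proven", "Prokaryotic control", "Viral control"]

COUNTS = {
    "Type I":           [(0, 100), (0, 14), (238, 357501), (21, 18859)],
    "Type III":         [(0, 76), (0, 7), (213, 82065), (57, 31571)],
    "Type IIC/G":       [(0, 107), (0, 47), (171, 218322), (66, 27699)],
    "Type II orthodox": [(850, 1774), (58, 128), (21380, 542911), (2720, 158921)],
    "Type IIM":         [(38, 54), (1, 7), (125, 21128), (79, 29070)],
    "Type IV":          [(0, 13), (0, 3), (64, 6342), (25, 10116)],
}


def percent(underrepresented: int, total: int) -> float:
    """Share of underrepresented sites, in percent, rounded to one decimal place."""
    return round(100.0 * underrepresented / total, 1)


if __name__ == "__main__":
    for site_type, pairs in COUNTS.items():
        cells = ", ".join(
            f"{name}: {percent(u, t):.1f} % ({u}/{t})"
            for name, (u, t) in zip(DATASETS, pairs)
        )
        print(f"{site_type}: {cells}")
```

Keeping the raw counts next to the percentages matters here because the denominators differ by several orders of magnitude between the curated datasets and the controls (for example, 7 sites versus 21128 sites for Type IIM).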