Skip to main content

Table 1 Specificity of the different null models

From: Decreasing the number of false positives in sequence classification

null model

WAM low

WAM med

WAM high

CM low

CM med

CM high

5%GC

2431 (70%)

2377 (68%)

2173 (63%)

2439 (77%)

3775 (75%)

3567 (71%)

25%GC

1118 (32%)

1517 (44%)

1500 (43%)

1681 (53%)

2509 (50%)

2243 (44%)

50%GC

863 (25%)

79 ( 2%)

700 (20%)

674 (21%)

58 ( 1%)

0 ( 0%)

75%GC

1443 (42%)

1534 (44%)

738 (21%)

1644 (52%)

2521 (50%)

2197 (44%)

95%GC

2114 (61%)

2332 (68%)

2529 (73%)

2297 (73%)

3642 (72%)

3625 (72%)

target

45 ( 1%)

18 ( 0%)

25 ( 0%)

3 ( 0%)

0 ( 0%)

0 ( 0%)

  1. Number (and percentage) of positively scored sequences for each null model. WAM low , WAM med and WAM high designate the WAM models generated by the training set with low (36%), medium (48%) and high (65%) GC content, respectively. CM low , CM med and CM high designate the CM models generated by the training set with low (5.6%), medium (49.2%) and high (71.4%) GC content, respectively.