Skip to main content

Table 2 Percentage of residues in the different datasets.

From: Parameterization of disorder predictors for large-scale applications requiring high specificity by using an extended benchmark dataset

Dataset

Number of residues with disorder annotation (%)

Number of residues with order annotation (%)

Non-annotated residues (%)

Total number of residues in dataset

DisProt r4.5

(Jul 2008)

24.7

1.2

74.1

239120

(in 520 proteins)

Remark 465

7.2

53.7

39.1

164793

(in 364 proteins)

SL

26.3

33.0

40.7

239120

(in 520 proteins)

  1. The SL dataset comprises the DisProt release 4.5 data in addition to residues in the same proteins annotated as having an ordered 3D structure found by similarity searches among sequences of known tertiary structure. Since some of these structures contain unidentified segments (Remark 465 regions), the number of residues with disorder annotation in SL is slightly larger than in DisProt