Skip to main content

Table 4 The number of patterns that have cysteine-cysteine and proline-proline dipeptides in NEP

From: Characterization of oligopeptide patterns in large protein sets

 

Dipeptide

Dataset

CC

PP

NEP-A-Genome

0

2

NEP-A-Swissprot

0

0

NEP-B-Genome

0

12

NEP-B-Swissprot

0

14

NEP-E-Genome

12

1

NEP-E-Swissprot

1

1

All-A-Genome

4%

35%

All-A-Swissprot

3%

34%

All-B-Genome

4%

38%

All-B-Swissprot

3%

38%

All-E-Genome

18%

57%

All-E-Swissprot

18%

54%

All: all proteins from the kingdom.

  1. Dipeptides of cysteine and proline are suspected to be structurally unfavored. Several of the peptides in bacterial NEPs do have diprolines but no dicysteines. Eukaryotic NEPs from the genome set have several dicysteines. However, for NEPs, the fraction of proteins having these dipeptides are considerably lower than in there their respective kingdom section of the full data set.