Skip to main content

Table 1 Size of proteomic, simulated proteogenomic, and real proteogenomic databases for yeast

From: Evaluating the effect of database inflation in proteogenomic search on sensitive and reliable peptide identification

Database (target + decoy)

# Target (AA)

# Decoy (AA)

Proteomic

1Ty + 1Dy

3,062,279

3,062,279

Simulated proteogenomic

1T1Dy + 2Dy

6,124,558

6,124,558

1T2Dy + 3Dy

9,186,837

9,186,837

1T5Dy + 6Dy

18,373,674

18,373,674

Real proteogenomic

6FTTy + 6FTDy

9,654,965

9,654,965

  1. Database sizes are measured by total length (AA) of contained peptides. 1Ty: yeast reference protein database. nDy: decoy database of which size is n times of 1Ty. 6FTTy: proteogenomic database constructed by 6-frame translation of yeast genome. 6FTDy: decoy database for 6FTTy