Table 6 Subcellular locations and the number of sequences in each location, for both the unbalanced and the balanced datasets.

From: Supervised learning method for the prediction of subcellular localization of proteins using amino acid and amino acid pair composition

Subcellular Location     Unbalanced Data (No. of Sequences)   Balanced Data (No. of Sequences)
Chloroplast              98                                   980
Cytoplasm                436                                  900
Cytoskeleton             37                                   991
Endoplasmic Reticulum    39                                   989
Extracellular            164                                  995
Golgi                    34                                   997
Lysosome                 37                                   323
Mitochondrion            76                                   993
Nucleus                  216                                  992
Peroxisome               23                                   603
Plasma Membrane          557                                  996
Vacuole                  24                                   184
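
As a reading aid, the short Python sketch below summarizes the counts in Table 6: the total number of sequences in each dataset, how much each location grew between the two columns, and the ratio of the largest to the smallest class. The counts are copied from the table; the variable names and the imbalance_ratio helper are illustrative assumptions, and the sketch does not reproduce or assume the procedure the authors used to build the balanced dataset.

```python
# Counts copied from Table 6: location -> (unbalanced, balanced).
counts = {
    "Chloroplast": (98, 980),
    "Cytoplasm": (436, 900),
    "Cytoskeleton": (37, 991),
    "Endoplasmic Reticulum": (39, 989),
    "Extracellular": (164, 995),
    "Golgi": (34, 997),
    "Lysosome": (37, 323),
    "Mitochondrion": (76, 993),
    "Nucleus": (216, 992),
    "Peroxisome": (23, 603),
    "Plasma Membrane": (557, 996),
    "Vacuole": (24, 184),
}


def imbalance_ratio(sizes):
    """Hypothetical helper: largest class size divided by smallest class size."""
    sizes = list(sizes)
    return max(sizes) / min(sizes)


unbalanced = {loc: u for loc, (u, _) in counts.items()}
balanced = {loc: b for loc, (_, b) in counts.items()}

# Total number of sequences in each dataset.
total_unbalanced = sum(unbalanced.values())
total_balanced = sum(balanced.values())

# Per-location growth factor from the unbalanced to the balanced column.
growth = {loc: b / u for loc, (u, b) in counts.items()}

print(f"Total sequences: {total_unbalanced} unbalanced, {total_balanced} balanced")
print(f"Max/min class ratio, unbalanced: {imbalance_ratio(unbalanced.values()):.2f}")
print(f"Max/min class ratio, balanced:   {imbalance_ratio(balanced.values()):.2f}")
for loc, factor in sorted(growth.items(), key=lambda item: item[1]):
    print(f"{loc:<22} x{factor:.2f}")
```

The max/min ratio is only one rough indicator of class imbalance; it shows that the balanced column is far more even than the unbalanced one, although Lysosome, Peroxisome and Vacuole remain smaller than the other classes.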