Figure 1From: C-terminal motif prediction in eukaryotic proteomes using comparative genomics and statistical over-representation across protein familiesFlowchart of the SOCT pipeline. A combination of filters and pre-processing was performed against individual proteomes to obtain a comprehensive set of z-statistics for each possible tripeptide at all positions from the C-terminal end to 100 residues in from the C-terminus. Programs and scripts for data analysis are represented as barred boxes, while resulting datasets are depicted as polygons.Back to article page