Fig. 2 | BMC Genomics

Fig. 2

From: Small open reading frames: a comparative genetics approach to validation

Selection of high-confidence smORFs. For selected RefSeq genes with varying pLI scores and amino acid length < 150 (left facets) and smORFs (right facets), violin and box plots show (A) N/S ratios, (B) LoF/S ratios, (C) MOEUF scores, and (D) GERP scores. The MOEUF and GERP thresholds used to filter putative smORFs are shown by a dashed red line. Selected RefSeq genes are grouped by pLI score into low (n = 400), moderate (n = 400) and high (n = 400) categories, plus genes encoding proteins of fewer than 150 amino acids (n = 400). smORF subsets include known smORFs (n = 28), putative smORFs unique to the Chen et al. (n = 4,030) and Martinez et al. (n = 1,244) datasets, putative smORFs with exact matches reported by both Chen et al. and Martinez et al. (n = 515), and smORFs in both datasets with imperfect overlap (n = 739). Box plots display the first quartile, median and third quartile. Abbreviations: N, nonsynonymous; S, synonymous; LoF, loss-of-function; pLI, probability of being loss-of-function intolerant
