Skip to main content

Table 10 Descriptive words for both the unidirectional and bidirectional promoter sets. The top 25 words that are correlated in the two promoter sets, according to their overrepresentation scores. The Words had to be overrepresented according to SlnSES with at least a score of 1.5. Shown are the words with a distance between -0.11 and 0.11.

From: Word-based characterization of promoters involved in human DNA repair pathways

Word Unidirectional Bidirectional Distance
CTTTGGCC 2.08857 2.23024 -0.100175818
AGGCAGGA 1.51526 1.64780 -0.093719933
CTCAGGAT 1.58527 1.71375 -0.090849079
GGGGGGAC 1.61803 1.70814 -0.063717392
CTTGCGGA 1.65530 1.73350 -0.055295750
CTGAGCAG 1.99183 2.05890 -0.047425652
GCCTGAGG 1.99183 2.04796 -0.039689904
TGAAGTGG 1.61803 1.66175 -0.030914708
GCCATCCG 1.86393 1.89589 -0.022599133
AGGTTGCA 2.20477 2.23024 -0.018010010
TCTGTGCC 1.84096 1.85915 -0.012862272
TACCACTA 1.86393 1.88037 -0.011624835
CAAAGAAT 1.61803 1.61872 -0.000487904
ACCGCTCA 1.61803 1.61872 -0.000487904
TATCTTAG 1.61803 1.61872 -0.000487904
AGAGTTCC 1.62605 1.61872 0.005183093
GTCGGCTT 1.90512 1.88037 0.017500893
CGCGCGCA 1.94164 1.90263 0.027584236
CAGGCCAG 1.95383 1.86972 0.059474751
ACAGAAAG 2.79686 2.70295 0.066404398
GTCAGGAG 2.40520 2.25776 0.104255824
GGAAGTGA 1.96108 1.81095 0.106157941
TAGAGAGC 1.99183 1.84125 0.106476139
TGCCAGGG 1.75813 1.60511 0.108201480
GCACAAGC 1.95383 1.80053 0.108399470
TTCACTTA 2.15055 1.99725 0.108399470