Skip to main content

Table 2 Top 25 words not part of promoter sets. The top 25 words that were not discovered as being part of the bidirectional (a) and unidirectional (b) promoter set of DNA-repair pathways. The words are sorted in descending order by the expected sequence occurrence (ES).

From: Word-based characterization of promoters involved in human DNA repair pathways

(a) Bidirectional (b) Unidirectional
Word ES Word ES
GCGGCCCG 3.34859 CGCCCCTG 4.12035
GGAGGCGC 2.94738 GGCGGAGG 3.91749
GCCTCTCC 2.84694 AAAGGGGC 3.15484
GCTGAGGA 2.59894 CTGGTCTC 3.14943
GCCGGGGC 2.56699 GCCTGGGC 2.75165
GCGCCTCC 2.56699 GTTTGAAA 2.47933
GCGAGGCG 2.54354 GCGCGAGG 2.25604
AGTGGGGG 2.46473 TTCTTTTC 2.23192
CTGGAGGC 2.45191 ATTCTGGA 2.21123
CGGGGGTG 2.41485 CAGGCAGG 2.17759
GAGGGGAG 2.41485 ATTTTGTT 2.15141
TGCCCGCC 2.39066 CAAAAAAA 2.13045
GCACCCCC 2.23699 AAACCTCA 2.11329
GCCTCTGG 2.23699 TCCCGCCT 2.11329
TGCCTGCG 2.23699 CCCCGCCG 2.05605
GGGCTCGC 2.21328 GAGGAGGC 2.05268
GGCAGGGC 2.18091 AGCACTGG 2.02023
CAGCAAGG 2.1341 TTATCTGC 2.02023
CGAGGCCT 2.12325 CCGCCCCA 1.99873
GAGGGAAG 2.12325 CCCGCCCT 1.94132
GGAGCTGA 2.11348 CTCTTTCT 1.94132
CCTGTCCT 2.10187 GAGAGAGC 1.94132
TCCAGGAC 2.0706 GGCCCAAC 1.94132
CCAGGCCG 2.06039 GTCTGGGC 1.94132
CGCCTGTC 2.06039 TAGGGGGC 1.94132