Skip to main content

Table 9 Unique and interesting words for the promoter sets. The words for the unidirectional and bidirectional promoter set which exhibit a significant score-based distance to the other data set.

From: Word-based characterization of promoters involved in human DNA repair pathways

(a) Unidirectional (b) Bidirectional
Word Unidirectional Bidirectional Distance Word Unidirectional Bidirectional Distance
ACCCGCCT 6.87826 -0.0263597 4.882303411 TCCCGGGA -0.0850495 5.60208 -4.021407835
GGGGCTGC 4.49226 -1.0872000 3.945274001 GGCCCGCC 0 5.36123 -3.790962089
CGGCGGCG 4.01229 -1.3139900 3.766248706 CGCGGCCG -0.3641650 4.91487 -3.732841447
AGGAAACA 4.92885 0.1254760 3.396498328 TCCCGGCT 0 5.14921 -3.641041309
CTTCTTTC 5.19624 0.4219750 3.375915157 CAGGGGCC 0 5.13315 -3.629685174
TCCGCCGG 3.76460 -0.8986470 3.297413576 AGGGCCGT 0 5.10145 -3.607269889
TCTTCTTC 4.48225 0 3.169429370 TCTGAGGA 0 4.99234 -3.530117468
ATTAAAAT 4.29023 0 3.033650726 CGTGGGGG 0.0180292 4.92572 -3.470261445
GGGGAGTA 4.44222 0.3737000 2.876878081 TCTGGGAT 0 4.81380 -3.403870623
CGCCCCTT 3.90482 -0.1463740 2.864626749 AGGGAGGG 0 4.72230 -3.339170353
TTTTTTGA 4.01229 0 2.837117467 AGAAAAGA 0 4.66976 -3.302018963
TTTCTCCA 3.96242 0 2.801854052 GGGCCAGG 0 4.62990 -3.273833686
AGCCGGCT 3.94551 0 2.789896876 ACTCCAGC 0 4.53045 -3.203511917
TTGGTCTC 3.65176 -0.2608830 2.766656398 CCCCAGCT -0.9904730 3.48143 -3.162112936
GCGCCGCG 3.81433 0 2.697138609 CGGGCCGA 0 4.45426 -3.149637451
ATTCCCAG 3.80733 0 2.692188861 TCCGCCGC -0.8886350 3.55395 -3.141381979
GCAGGGCG 4.66535 0.8645290 2.687586303 TGCCCGCG -0.3137370 4.10844 -3.126951344
GAGGGGCG 3.03108 -0.7557900 2.677721456 TGCGGAAT 0 4.41371 -3.120964271
CCCCCGCG 3.55664 -0.1908410 2.649869227 GCCGGCGA 0 4.33335 -3.064141170
AGGGGAGC 3.15866 -0.5635770 2.632019024 CAGCAGCC -0.0679120 4.10418 -2.950114545
TGCGCCGA 3.68519 0 2.605822839 CGAGGAGA 0 4.09415 -2.895001228
CCGCGCCC 2.25420 -1.4189300 2.597295131 CGCAGGCG -0.2779570 3.74626 -2.845551130
GTGCGTTT 3.66247 0 2.589757373 TTCCACCC 0 4.02098 -2.843262225
CTGGGAGC 3.36673 -0.2940760 2.588580747 TCGCCCCA 0 3.94598 -2.790229216
TGCCTCCC 3.34992 -0.2629130 2.554658714 GGGGCCGG 0.8548330 4.76672 -2.766121825