Skip to main content

Table 8 Word co-occurrence. The top 25 word pairs for the bidirectional (a) and unidirectional (b) promoter set. The word pairs are sorted in descending order by S*ln(S/ES) score.

From: Word-based characterization of promoters involved in human DNA repair pathways

(a) Bidirectional
(a) Bidirectional (b) Unidirectional
Word 1 Word 2 S ES Sln(S/ES) Word 1 Word 2 S ES Sln(S/ES)
TCTGAGGA TCGCGCCA 3 0.0529 12.1158 GTTCATTC TCCGCCGG 2 0.0073 11.2184
ACTCCAGC TCGCGCCA 3 0.0580 11.8387 CTGTGTGC TGCGCCGA 2 0.0074 11.1966
GCCCAGCC TCCGCCGC 3 0.0722 11.1827 TGACGCGA CTCCCGCT 2 0.0082 10.9997
GCCCAGCC CGGAGCGC 2 0.0087 10.8711 AGCCGGCT GGGGAGTA 2 0.0131 10.0590
TGCCCGCG TCCCGGGA 4 0.2729 10.7404 ATTGCAGG ATTCTCTC 2 0.0169 9.5459
GGCAGGGA GGGCCAGG 4 0.3400 9.8609 GGGGAGTA AGGAAACA 2 0.0190 9.3177
TCCCGGGA TCGCGCCA 3 0.1140 9.8112 CTGGGAGC GTTCATTC 2 0.0218 9.0337
AGCCTGTC TCCCGGGA 3 0.1158 9.7646 CCTTCCGA CTGGGAGC 2 0.0240 8.8439
GGAGGCTG TCGCGCCA 3 0.1173 9.7250 TGGGCGGA ACCCGCCT 2 0.0247 8.7895
TCCGCCGC GCCCCTCC 4 0.3554 9.6830 TTTCTCCA CGGAAACC 2 0.0265 8.6446
AGAAAAGA TCGCGCCA 2 0.0182 9.4042 CCCCCGCG ACCCGCCT 2 0.0280 8.5339
GCCCAGCC GCCCCTCC 3 0.1360 9.2808 TCCGCCGG GGGGCTGC 2 0.0415 7.7522
TGCCAAAA GCCGGCGA 2 0.0195 9.2604 AGCTGGCT CCAGGCTG 2 0.0422 7.7192
CAGCAGCC TGCGGAAT 2 0.0208 9.1297 TTGGTCTC AGGAAACA 2 0.0446 7.6068
AGGGCCGT TCCCGGCT 3 0.1433 9.1249 CTGGGAGC TCCGCCGG 2 0.0519 7.3020
CCTCCAGA TTCCACCC 2 0.0216 9.0521 CTTTTCTC GCGCCGCG 2 0.0545 7.2046
CGAGGAGA TCGCGCCA 2 0.0220 9.0204 ATTGCAGG ATTAAAAT 2 0.0585 7.0639
TCCGCCGC CGGAGCGC 2 0.0228 8.9501 TGGAACCC GCAGGGCG 2 0.0645 6.8693
ACCCTCGT AGGGAGGG 2 0.0253 8.7380 GGGCAGGC AGCTGGCT 2 0.0657 6.8326
GCCCAGCC TCCACTGT 2 0.0254 8.7315 TTGGTCTC CTTCTTTC 2 0.0676 6.7745
CAGCAGCC AGGGCCGT 3 0.1705 8.6024 CTTTTTCA CGCCCCTT 2 0.0684 6.7522
TGCCCGCG TCCCGGCT 3 0.1747 8.5291 GCAGGGCG AGGAAACA 2 0.0766 6.5251
CCCAGGAC AGAGAGCT 2 0.0291 8.4590 GGGCAGGC TTTCTCCA 2 0.0939 6.1181
TCTGGGAT GGCCCGCC 2 0.0329 8.2123 CTGGGAGC TCTCCCCT 2 0.0947 6.0996
AGCCGGGC AGAAAAGA 2 0.0333 8.1930 AGCAGGGC GGCTTTTA 2 0.0956 6.0805