Skip to main content

Table 8 Word co-occurrence. The top 25 word pairs for the bidirectional (a) and unidirectional (b) promoter set. The word pairs are sorted in descending order by S*ln(S/ES) score.

From: Word-based characterization of promoters involved in human DNA repair pathways

(a) Bidirectional

(a) Bidirectional

(b) Unidirectional

Word 1

Word 2

S

ES

Sln(S/ES)

Word 1

Word 2

S

ES

Sln(S/ES)

TCTGAGGA

TCGCGCCA

3

0.0529

12.1158

GTTCATTC

TCCGCCGG

2

0.0073

11.2184

ACTCCAGC

TCGCGCCA

3

0.0580

11.8387

CTGTGTGC

TGCGCCGA

2

0.0074

11.1966

GCCCAGCC

TCCGCCGC

3

0.0722

11.1827

TGACGCGA

CTCCCGCT

2

0.0082

10.9997

GCCCAGCC

CGGAGCGC

2

0.0087

10.8711

AGCCGGCT

GGGGAGTA

2

0.0131

10.0590

TGCCCGCG

TCCCGGGA

4

0.2729

10.7404

ATTGCAGG

ATTCTCTC

2

0.0169

9.5459

GGCAGGGA

GGGCCAGG

4

0.3400

9.8609

GGGGAGTA

AGGAAACA

2

0.0190

9.3177

TCCCGGGA

TCGCGCCA

3

0.1140

9.8112

CTGGGAGC

GTTCATTC

2

0.0218

9.0337

AGCCTGTC

TCCCGGGA

3

0.1158

9.7646

CCTTCCGA

CTGGGAGC

2

0.0240

8.8439

GGAGGCTG

TCGCGCCA

3

0.1173

9.7250

TGGGCGGA

ACCCGCCT

2

0.0247

8.7895

TCCGCCGC

GCCCCTCC

4

0.3554

9.6830

TTTCTCCA

CGGAAACC

2

0.0265

8.6446

AGAAAAGA

TCGCGCCA

2

0.0182

9.4042

CCCCCGCG

ACCCGCCT

2

0.0280

8.5339

GCCCAGCC

GCCCCTCC

3

0.1360

9.2808

TCCGCCGG

GGGGCTGC

2

0.0415

7.7522

TGCCAAAA

GCCGGCGA

2

0.0195

9.2604

AGCTGGCT

CCAGGCTG

2

0.0422

7.7192

CAGCAGCC

TGCGGAAT

2

0.0208

9.1297

TTGGTCTC

AGGAAACA

2

0.0446

7.6068

AGGGCCGT

TCCCGGCT

3

0.1433

9.1249

CTGGGAGC

TCCGCCGG

2

0.0519

7.3020

CCTCCAGA

TTCCACCC

2

0.0216

9.0521

CTTTTCTC

GCGCCGCG

2

0.0545

7.2046

CGAGGAGA

TCGCGCCA

2

0.0220

9.0204

ATTGCAGG

ATTAAAAT

2

0.0585

7.0639

TCCGCCGC

CGGAGCGC

2

0.0228

8.9501

TGGAACCC

GCAGGGCG

2

0.0645

6.8693

ACCCTCGT

AGGGAGGG

2

0.0253

8.7380

GGGCAGGC

AGCTGGCT

2

0.0657

6.8326

GCCCAGCC

TCCACTGT

2

0.0254

8.7315

TTGGTCTC

CTTCTTTC

2

0.0676

6.7745

CAGCAGCC

AGGGCCGT

3

0.1705

8.6024

CTTTTTCA

CGCCCCTT

2

0.0684

6.7522

TGCCCGCG

TCCCGGCT

3

0.1747

8.5291

GCAGGGCG

AGGAAACA

2

0.0766

6.5251

CCCAGGAC

AGAGAGCT

2

0.0291

8.4590

GGGCAGGC

TTTCTCCA

2

0.0939

6.1181

TCTGGGAT

GGCCCGCC

2

0.0329

8.2123

CTGGGAGC

TCTCCCCT

2

0.0947

6.0996

AGCCGGGC

AGAAAAGA

2

0.0333

8.1930

AGCAGGGC

GGCTTTTA

2

0.0956

6.0805