
Table 10 Descriptive words for both the unidirectional and bidirectional promoter sets. The top 25 words that are correlated in the two promoter sets, according to their overrepresentation scores. The words had to be overrepresented according to SlnSES, with a score of at least 1.5. Only words with a distance between -0.11 and 0.11 are shown.

From: Word-based characterization of promoters involved in human DNA repair pathways

| Word | Unidirectional | Bidirectional | Distance |
|----------|---------|---------|--------------|
| CTTTGGCC | 2.08857 | 2.23024 | -0.100175818 |
| AGGCAGGA | 1.51526 | 1.64780 | -0.093719933 |
| CTCAGGAT | 1.58527 | 1.71375 | -0.090849079 |
| GGGGGGAC | 1.61803 | 1.70814 | -0.063717392 |
| CTTGCGGA | 1.65530 | 1.73350 | -0.055295750 |
| CTGAGCAG | 1.99183 | 2.05890 | -0.047425652 |
| GCCTGAGG | 1.99183 | 2.04796 | -0.039689904 |
| TGAAGTGG | 1.61803 | 1.66175 | -0.030914708 |
| GCCATCCG | 1.86393 | 1.89589 | -0.022599133 |
| AGGTTGCA | 2.20477 | 2.23024 | -0.018010010 |
| TCTGTGCC | 1.84096 | 1.85915 | -0.012862272 |
| TACCACTA | 1.86393 | 1.88037 | -0.011624835 |
| CAAAGAAT | 1.61803 | 1.61872 | -0.000487904 |
| ACCGCTCA | 1.61803 | 1.61872 | -0.000487904 |
| TATCTTAG | 1.61803 | 1.61872 | -0.000487904 |
| AGAGTTCC | 1.62605 | 1.61872 | 0.005183093 |
| GTCGGCTT | 1.90512 | 1.88037 | 0.017500893 |
| CGCGCGCA | 1.94164 | 1.90263 | 0.027584236 |
| CAGGCCAG | 1.95383 | 1.86972 | 0.059474751 |
| ACAGAAAG | 2.79686 | 2.70295 | 0.066404398 |
| GTCAGGAG | 2.40520 | 2.25776 | 0.104255824 |
| GGAAGTGA | 1.96108 | 1.81095 | 0.106157941 |
| TAGAGAGC | 1.99183 | 1.84125 | 0.106476139 |
| TGCCAGGG | 1.75813 | 1.60511 | 0.108201480 |
| GCACAAGC | 1.95383 | 1.80053 | 0.108399470 |
| TTCACTTA | 2.15055 | 1.99725 | 0.108399470 |
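
The caption does not say how the distance is computed. The tabulated values are, however, numerically consistent with the signed perpendicular distance of the point (unidirectional score, bidirectional score) from the diagonal y = x, that is, (unidirectional - bidirectional) / sqrt(2). The sketch below checks this reading against three rows copied from the table. The formula is an inference from the numbers, not something stated in the source, and the names `diagonal_distance` and `within_band` are hypothetical.

```python
import math

def diagonal_distance(unidirectional: float, bidirectional: float) -> float:
    """Signed perpendicular distance of (unidirectional, bidirectional) from y = x.

    Assumption: this formula is inferred from the table values; the caption
    does not define the distance explicitly.
    """
    return (unidirectional - bidirectional) / math.sqrt(2)

def within_band(distance: float, limit: float = 0.11) -> bool:
    """True if the distance lies in the band named in the caption (-0.11 to 0.11)."""
    return -limit <= distance <= limit

# (word, unidirectional score, bidirectional score, reported distance),
# copied from the first, thirteenth, and last rows of the table.
rows = [
    ("CTTTGGCC", 2.08857, 2.23024, -0.100175818),
    ("CAAAGAAT", 1.61803, 1.61872, -0.000487904),
    ("TTCACTTA", 2.15055, 1.99725, 0.108399470),
]

for word, uni, bi, reported in rows:
    computed = diagonal_distance(uni, bi)
    print(f"{word}  computed={computed:.9f}  reported={reported:.9f}  "
          f"in_band={within_band(computed)}")
```

Under this reading, a negative distance marks a word that scores higher in the bidirectional set and a positive distance one that scores higher in the unidirectional set. Values near zero indicate nearly equal overrepresentation in both sets.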