Skip to main content

Advertisement

Table 13 Co-occurrence in 3'UTRs

From: The word landscape of the non-coding segments of the Arabidopsis thaliana genome

Word1 Word2 S ES S*ln(S/ES)
TTCTTTTT TTTTTCTT 322 238.5802 96.5504
TGTTTTTT TTTTTCTT 283 217.7183 74.2154
TTCTTTTT TTTTTTCT 260 197.5705 71.3925
TTTTTCTT TTTTTGTT 326 273.0848 57.7395
TCTTTTTT TTTTTCTT 270 218.9471 56.5898
TTTTCTTT TTTTTTCT 278 226.8886 56.479
TTTTTTGG TTTTTGTT 161 116.5969 51.9517
TTATTTTT TTTTTCTT 211 166.8299 49.5604
TTCTTTTT TTTTTGTT 290 248.3755 44.9324
TGTTTTTT TTCTTTTT 239 198.0677 44.8973
TTTTCTTT TCTTTTTT 270 228.7449 44.7699
TCTTTTTC TTTTTTCT 112 76.7939 42.2658
TGTTTTTT TTTTTTGG 129 93.1111 42.0564
TTTTTTGG TTTTTCTT 148 112.0287 41.2117
TTTTTTCT TTTTTTGG 128 92.8787 41.0542
TTTTCTTT TGTTTTTT 265 227.4605 40.4796
TTTGTTTT TTTTTTGG 170 134.4256 39.9138
TTCTTTTT TTTTTTGG 136 101.9687 39.1665
TCTTTTTT TTTTTTGG 127 93.6332 38.7099
TTTTCTTT TTCTTTTT 285 249.2674 38.1794
TTTTTATT TTATTTTT 137 103.7794 38.0467
TGTTTTTT TTTTTTCT 215 180.3272 37.8109
TCTTTTTT TTTTTTCT 216 181.3431 37.7758
TTTTTGGT TTTTTGTT 161 127.4072 37.6766
ATTTTTTA TTTTTCTT 82 53.2457 35.4078
  1. Overrepresented non-overlapping word-pairs detected in the 3'Untranslated Regions of Arabidopsis thaliana. A word-pair is characterized through the two nucleotide sequences associated with it (Word1 and Word2), the number of sequences the pair occurs in (S) as well as the expected number of sequences (ES) and a statistical score symbolizing the overrepresentation of the word-pair in the specific sequence set (S*ln(S/ES)).