Skip to main content

Table 12 Conservation analysis. The results for conservation analysis of the top 10 word pairs in the bidirectional (a) and unidirectional (b) promoter set. For each word pair, the occurrence location of the pair is given, as well as an identifier for the conservation of the sites, and a PhastCons score for the quality of the conservation across 28 organisms. Conservation can be categorized as: none (no word was conserved), partial (one word was conserved) and complete (all words were conserved).

From: Word-based characterization of promoters involved in human DNA repair pathways

(a) Bidirectional
Word 1 Word 2 Location Conservation Hit Score
TCTGAGGA TCGCGCCA chr19:53365272–53366372 None   
   chr19:48776246–48777346 None   
   chr19:7600339–7601439 Partial TCGCGCCA 385
ACTCCAGC TCGCGCCA chr4:57538069–57539168 None   
   chr19:48776246–48777346 None   
   chr19:7600339–7601439 Partial TCGCGCCA 385
GCCCAGCC TCCGCCGC chr3:185561446–185562546 Partial TCCGCCGC 310
   chr14:19992129–19993229 None   
   chr11:832429–833529 None   
GCCCAGCC CGGAGCGC chr3:185561446–185562546 None   
   chr14:19992129–19993229 None   
TGCCCGCG TCCCGGGA chr19:53365272–53366372 Partial TCCCGGGA 390
   chr13:107668425–107669525 None   
   chr20:5055168–5056268 None   
   chr11:832429–833529 None   
GGCAGGGA GGGCCAGG chr19:53365272–53366372 Partial GGGCCAGG 390
   chr22:40346240–40347340 Complete GGCAGGGA 325
     GGGCCAGG 522
   chr5:60276548–60277648 None   
   chr12:131773918–131775018 None   
TCCCGGGA TCGCGCCA chr19:53365272–53366372 Partial TCCCGGGA 390
   chr4:57538069–57539168 None   
   chr19:7600339–7601439 Partial TCGCGCCA 385
AGCCTGTC TCCCGGGA chr17:38530557–38531657 None   
   chr13:107668425–107669525 Partial AGCCTGTC 244
   chr4:57538069–57539168 None   
GGAGGCTG TCGCGCCA chr4:57538069–57539168 None   
   chr19:48776246–48777346 None   
   chr19:7600339–7601439 Partial TCGCGCCA 385
TCCGCCGC GCCCCTCC chr3:185561446–185562546 Partial TCCGCCGC 310
   chr14:19992129–19993229 None   
   chr1:11674165–11675265 Partial GCCCCTCC 360
   chr11:832429–833529 None   
(b) Unidirectional
Word 1 Word 2 Location Conservation Hit Score
GTTCATTC TCCGCCGG chr7:73306574–73307674 None   
   chr12:52868924–52870024 Partial TCCGCCGG 325
CTGTGTGC TGCGCCGA chr10:131154509–131155609 None   
   chr19:1046236–1047336 None   
TGACGCGA CTCCCGCT chr12:116937892–116938992 None   
   chr17:30330654–30331754 None   
AGCCGGCT GGGGAGTA chr6:30982955–30984055 None   
   chr16:13920523–13921623 None   
ATTGCAGG ATTCTCTC chr5:86744492–86745592 None   
   chr17:30330654–30331754 None   
GGGGAGTA AGGAAACA chr16:13920523–13921623 None   
   chr8:101231014–101232114 None   
CTGGGAGC GTTCATTC chr7:73306574–73307674 None   
   chr12:52868924–52870024 None   
CCTTCCGA CTGGGAGC chr5:68890824–68891924 None   
   chr7:73306574–73307674 None   
TGGGCGGA ACCCGCCT chr6:30982955–30984055 None   
   chr9:99499360–99500460 None   
TTTCTCCA CGGAAACC chr8:55097461–55098561 None   
   chr11:118471287–118472387 None