
Table 5 Repeats of length 30 with each of the six possible protein translations

From: Database of exact tandem repeats in the Zebrafish genome

| GP | Repeat Base | All possible protein translations | # |
|---|---|---|---|
| 1 | AGCCCCTGAGCGCCCTCCAGTGTCGGCTCC | APAPERPPVS, DTGGRSGAGA, RHWRALRGWS, RLQPLSALQC, GSSP*APSSV, TLEGAQGLEP | 32 |
| 1 | AGAGCGCCCGCCAGTGTCGGCTCCAGCCCC | APAPERPPVS, DTGGRSGAGA, RHWRALWGWS, RLQPQSARQC, GSSPRAPASV, TLAGALGLEP | 12 |
| 1 | ACACTGGAGGGCGCTCAGGGGCTGGAGCCG | APAPERPPVS, DTGGRSGAGA, RHWRALRGWS, RLQPLSALQC, GSSP*APSSV, TLEGAQGLEP | 16 |
| 1 | ACACTGGCGGGCGCTCTGGGGCTGGAGCCG | APAPERPPVS, DTGGRSGAGA, RHWRALWGWS, RLQPQSARQC, GSSPRAPASV, TLAGALGLEP | 12 |
| 1 | ACACTGGAGGGCGCTCTGGGGCTGGAGCCG | APAPERPPVS, DTGGRSGAGA, RHWRALWGWS, RLQPQSALQC, GSSPRAPSSV, TLEGALGLEP | 15 |
| 1 | AGAGCGCCCTCCAGTGTCGGCTCCAGCCCC | APAPERPPVS, DTGGRSGAGA, RHWRALWGWS, RLQPQSALQC, GSSPRAPSSV, TLEGALGLEP | 14 |
| 1 | ACACTGGCGGGCGCTCGGGGGCTGGAGCCG | APAPERPPVS, DTGGRSGAGA, RHWRALGGWS, RLQPPSARQC, GSSPRAPASV, TLAGARGLEP | 2 |
| 2 | ACTGGAGGCAGCTGGACTGGAGCCGGCGGG | APVQLPPVPP, AGGTGGSWTG, PAGLEAAGLE, RRDWRQLDWS, LQSSCLQSRR, SSPAASSPAG | 2 |
| 2 | ACAGGAGGCAGCTGGGCTGGAGCCGGCGGG | APAQLPPVPP, AGGTGGSWAG, PAGQEAAGLE, RRDRRQLGWS, LQPSCLLSRR, SSPAASCPAG | 1 |
| 2 | AGCTGCCTCCAGTCCCGCCGGCTCCTGCCC | APAQLPPVPP, AGGTGGSWAG, PAGLEAAGQE, RRDWRQLGRS, LLPSCLQSRR, SSPAGSCPAA | 1 |
| 2 | ACTGGAGGCAGCTGGGCAGGAGCCGGCGGG | APAQLPPVPP, AGGTGGSWAG, PAGLEAAGQE, RRDWRQLGRS, LLPSCLQSRR, SSPAGSCPAA | 1 |
| 3 | AAGATGGCCGACTCCAGTCCTCCAGCTCAC | QLTRWPTPVL, SSQDGRLQSS, AHKMADSSP, WRTGVGHLVS, GGLESAIL*A, EDWSRPSCEL | 7 |
| 3 | ACTGGAGTCGGCCATCTTGTGAGCTGGAGG | QLTRWPTPVL, SSQDGRLQSS, AHKMADSSP, WRTGVGHLVS, GGLESAIL*A, EDWSRPSCEL | 1 |
| 4 | AAACTGCCGCAAGGCTCCAAATACTTCTCC | KLPQGSKYFS, NCRKAPNTSP, TAARLQILLQ, LEKYLEPCGS, WRSIWSLAAV, GEVFGALRQF | 1 |

  1. GP: grouping; #: number of instances found in the Zv8 assembly. Note that the repeat base has been reordered lexicographically.
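
To make the "six possible protein translations" concrete, the following is a minimal Python sketch of a six-frame translation of a 30-base repeat. It is not the authors' code. It assumes the standard genetic code and treats the repeat as circular, since in a tandem array every reading frame wraps around the 10-codon unit. The function names (`six_frame_translations`, `is_rotation`, and so on) are hypothetical. The table does not state which rotation of each peptide is reported, so the sketch compares its output with the table only up to rotation.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def six_frame_translations(repeat: str) -> list[str]:
    """Translate a repeat unit in three forward and three reverse frames.

    Assumption: the unit is read circularly (as in a tandem array), so each
    frame is a rotation of the unit by 0, 1, or 2 bases.
    """
    if len(repeat) % 3 != 0:
        raise ValueError("repeat length must be a multiple of 3")
    frames = []
    for strand in (repeat, reverse_complement(repeat)):
        for offset in range(3):
            rotated = strand[offset:] + strand[:offset]
            frames.append(translate(rotated))
    return frames


def is_rotation(a: str, b: str) -> bool:
    """True if b is a cyclic rotation of a."""
    return len(a) == len(b) and b in a + a


if __name__ == "__main__":
    # First repeat base of group 1 in the table.
    for peptide in six_frame_translations("AGCCCCTGAGCGCCCTCCAGTGTCGGCTCC"):
        print(peptide)
```

For the first repeat base in the table, the first forward frame comes out as SP*APSSVGS, which is a rotation of the listed GSSP*APSSV. The other five frames likewise match the listed peptides up to rotation.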