Skip to main content

Table 4 Repeats of length 27 with each of the six possible protein translations

From: Database of exact tandem repeats in the Zebrafish genome

GP

Repeat Base

All Possible Protein Translations

#

1

AACACAGCTGATAAGAACTCGCGCGAC

VLISCVVAR

RATTQLIRT

SRDNTADKN

FLSAVLSRE

SSYQLCCRA

LARQHS**E

2

1

AACACAGCTGATCAGTACGCGCGCGAC

VLISCVVAR

RATTQLIST

ARDNTADQY

Y*SAVLSRA

RTDQLCCRA

RARQHS*SV

1

1

AGCTGTGTTGTCGCGCGCGTTATGATC

VMISCVVAR

RATTQLIIT

ARDNTADHN

L*SAVLSRA

RYDQLCCRA

RARQHS*S*

2

1

AGCTGTGTTGTCGCGCGCGTTCTGATC

VLISCVVAR

RATTQLIRT

ARDNTADQN

F*SAVLSRA

RSDQLCCRA

RARQHS*SE

1

1

AATAGCGGTGTTGTCGCGCGCGTTCTG

VLNSGVVAR

RATTPLFRT

ARDNTAIQN

F*IAVLSRA

RSE*RCCRA

RARQHRYSE

1

1

AACACCGCTCTTCAGTACGCGCGCGAC

VLKSGVVAR

RATTPLFST

ARDNTALQY

Y*RAVLSRA

RTEERCCRA

RARQHRSSV

1

1

AGCTGTGTTGTCGCGCGAGTTCTTATC

VLISCVVAR

RATTQLIRT

SRDNTADKN

FLSAVLSRE

SSYQLCCRA

LARQHS**E

3

1

AACACAGCTGATAAGAACGCGCGCGAC

VLISCVVAR

RATTQLIRT

ARDNTADKN

FLSAVLSRA

RSYQLCCRA

RARQHS**E

1

1

AACACCGCTACTCAGTACGCGCGCGAC

VLSSGVVAR

RATTPLLST

ARDNTATQY

Y*VAVLSRA

RTE*RCCRA

RARQHRYSV

1

1

AACACAGCTGATCAGAACGCGCGCGAC

VLISCVVAR

RATTQLIRT

ARDNTADQN

VLSRAF*SA

RSDQLCCRA

RARQHS*SE

1

1

AACACAGCTGACAAGAACTCGCGCGAC

VLVSCVVAR

RATTQLTRT

SRDNTADKN

FLSAVLSRE

SSCQLCCRA

LARQHS*QE

1

2

ACCCAGGCTCCTCGCCCTGCCGGCGCC

LALPAPPRL

SPCRRHPGS

RPAGATQAP

RSLGGAGRA

GAWVAPAGR

EPGWRRQGE

1

2

ACCCAGACGTCTCGCCCTGCCGGCGCC

LALPAPPRR

SPCRRHPDV

RPAGATQTS

RRLGGAGRA

DVWVAPAGR

TSGWRRQGE

1

2

ACGTCTGGGTGGCGCCGGCAGGGCGAG

LALPAPPRR

SPCRRHPDV

RPAGATQTS

RRLGGAGRA

DVWVAPAGR

TSGWRRQGE

1

2

ACGTCTGGGTGGCGCCGGCTGGGCGAG

LAQPAPPRR

SPSRRHPDV

RPAGATQTS

RRLGGAGWA

DVWVAPAGR

TSGWRRLGE

1

2

ACAGGCCTCCAGCCCAGCCGGCTCCCC

PAQPAPHRP

GSRLGWRPV

SPAGSPQAS

GGLWGAGWA

LGWRPVGSR

PAGLEACGE

1

3

AATGGCCGCCGCCTCCTGAGCTTCCTG

LPEWPPPPE

SSGGGGHSG

LRRRRPFRK

S*MAAAS*A

AQEAAAIQE

FLNGRRLLS

1

3

AAGCTCAGGAGGCGGCGGCCATTCAGG

LPEWPPPPE

SSGGGGHSG

LRRRRPFRK

S*MAAAS*A

AQEAAAIQE

FLNGRRLLS

2

3

AGCTCAGGCGGCGGCGGCCATTCAGGG

LPEWPPPPE

SSGGGGHSG

LRRRRPFRE

P*MAAAA*A

AQAAAAIQG

SLNGRRRLS

1

3

AGCGAGCTCGGGAGGCGGCGGCCATTC

LAEWPPPPE

SSGGGGHSA

LGRRRPFSE

R*MAAASRA

AREAAAIQR

SLNGRRLPS

1

3

AATGGCCGCCGCCGCCTGAGCTTCCTG

LPEWPPPPE

SSGGGGHSG

LRRRRPFRK

S*MAAAA*A

AQAAAAIQE

FLNGRRRLS

3

3

AAGCTCAGGAGGCGGCGGCCGTTCAGG

LPERPPPPE

SSGGGGRSG

LRRRRPFRK

S*TAAAS*A

AQEAAAVQE

FLNGRRLLS

3

3

AATGGCCGCCGCCGCCTGAGCTCCCTG

LPEWPPPPE

SSGGGGHSG

LRRRRPFRE

P*MAAAA*A

AQAAAAIQG

LNGRRRLSS

1

3

AACGGCCGCCGCCTCCTGAACTCCCTG

LPERPPPPE

SSGGGGRSG

FRRRRPFRE

P*TAAAS*T

VQEAAAVQG

LNGRRLLNS

2

4

AAGACCAGAGGGGAGCCGGCGGGGCTG

G*RPEGSRR

AGGAEDQRG

LKTRGEPAG

PRRLPSGLQ

PAGSPLVFS

PPAPLWSSA

1

4

CCCCTCTGGTCTCCTGCCCTGCCGGCT

GRRPEGSRQ

AGRAGDQRG

QETRGEPAG

PCRLPSGLL

PAGSPLVSC

LPAPLWSPA

1

5

AACTCTATTGAGTGTCAGGTCATCTCC

SSPTLLSVR

HLQLY*VSG

ISNSIECQV

DLTNRVGD

T*HSIELEM

PDTQ*SWR*

1

6

AAAGCAACAACTCCACCAACATCAGCT

QLHQHQLKQ

NSTNIS*SN

TPPTSAKAT

CCFS*CWWS

VALADVGGV

LL*LMLVEL

1

7

AACAGCAGAGCCATTCACTGACCCCAG

H*PQNSRAI

TDPRTAEPF

LTPEQQSHS

*MALLFWGQ

EWLCCSGVS

NGSAVLGSV

1

8

AAGGAGCCGGGCTGGGGCCGGCAGGGC

AGRARSRAG

PAGQGAGLG

RQGKEPGWG

APARLLALP

PQPGSLPCR

PSPAPCPAG

1

  1. GP: grouping; #: number of instances found in the Zv8 assembly. Note the repeat base has been reordered by lexicographical order.