Table 2 Translated transcripts containing putative toxin sequences.

From: Characterization of the Conus bullatus genome and its venom-duct transcriptome

O-superfamily: C-C-CC-C-C
  1. MKLTCVAIVAVLLLTACQLITAEDSRGTQLHRALRKTTKLSVS TR C KGPGAK C LKTMYD CC KYS C SRGR C
  2. MKLTCVLIIAVLFLTAITADDSRDKQVYRAVGLIDKMRR IR ASEG C RKKGDR C GTHL CC PGLR C GSGRAGGA C RPPYN
  3. MKLMCVLIVSVLVLTACQLSTADDTRDKQKDRLVRLFRKKRDSSDSGLL PR T C VMFGSM C DKEEHSI CC YE C DYKKGI C V
  4. MKLTCVVIVAVLLLTACQLIIAEDSRGTQLHRALRKATKLSVS TR T C VMFGSM C DKEEHSI CC YE C DYKKGI C V
  5. MKLTCVLIVAVLFLTACQLATAENSREEQGYSAVRSSDQIQDSDLKL TK S C TDDFEP C EAGFEN CC SKS C FEFEDVYV C* GVSIDYYDSR
  6. MKLICVFIVAVLLLTACQLNAADDSRDTQKHRALRSTTKLSMS KK DS C VPDGDS C LFSRIP CC GT C SSRSKS C V*G
  7. MKLTCMMIVTVLFLTAWTFVTADDSTYGLKNLLPKARHEMMNPEAPKLNKK DE C SAPGAF C LIRPGL CC SEF C FFA C F[67]
  8. AEDSRGTQLHRALRKATKLSES TR C KRKGSS C RRTSYD CC TGS C RNGK C* G
  9. AVLLLTACQLITAEDSRDTQKHRALRSDTKLSMLT LR C ATYGKP C GIQND CC NI C DPARRT C T
  10. DSRGTQLHRALRKATILSVS AR C KLSGYR C KRPKQ CC NLS C GNYM C* G
  11.    ACQLITAEDSRGTQLHRALRSTSKVSK STS C VEAGSY C RPNVKL CC GF C SPYSKI C MNFPKN
  12. TAEDSRGTQLHRALRKATKLPVS TR C ITPGTR C KVPSQ CC RGP C KNGR C TPSPSEW
  13. AEDSRGTQLHRALRKTTKLSLS IR C KGPGAS C IRIAYN CC KYS C RNGK C S
  14. AACQLGTAASFARDKQDYPAVRSDGRQDSKDSTLDRIA KR C SEGGDF C SKNSE CC DKK C QDEGEGRGV C LIVPQNVILLH
M-superfamily: CC-C-C-CC
  15. MLKMGVLLFTFLVLFPLATLQLDADQPVERYADNKQDLNPD ER MIFLFGG CC RMSS C QPPPV C N CC AKQDLNPDER
  16. DQPADRPAERMQDDISSEQNPLLEKR VGER CC KNGKRG C GRW C RDHSR CC* GRR[17]
  17. GLY CC QPKPNGQMM C NRW C EINSR CC* GRR
A-superfamily: CC-C-C; CC-C-C-C-C
  18. MGMRMMFTVFLLIVLATTVVSFSTDDESDGSNEEPSADQTARSSMNR APG CC NNPA C VKHR C* G[68]
  19. MGMRMVFTVFLLVVLATTVVSFTSDRASDGRNAAANDKASDLAALA VR G CC HDIF C KHNNPDI C* G
  20. MGMRMRMMFTVFLLVVLANTVVSFPSDRDSDGADAEASDEPVEFER DENG CC WNPS C PRPR C T*GRR[68]
  21. DGANAEATDNKPGVFER DE KK CC WNRA C TRLVP C SK
  22. SDRASDGRNAAANDRASDLVALT VR G CC TYPP C AVLSPL C D
  23. MGMRMMVTVFLLGVLATTVVSLRSNRASDGRRGIVNK LNDLVPQYWTECC GRIGPH C SR C I C PEVV C PKN*G
  24. MGMRMMVTVFLLVVLATTVVSLRSNRASDGRRGIVNKLNDLVPK YWTECC GRIGPH C SR C I C PEVA C PKN*G
  25. MGMRMMVTVFPLVVLATTVVSLRSNRASDGRRGIVNKLNDLVPK YWTECC GRIGPH C SR C I C PGVV C PKR*G
  26.    LVVLATTVVSFRSNRASDGRKIAVNKRRRELVVPPG K LRE CC GRVGPM C PK C M C PPRR C
  27. ASDGRNAVVH ER APELVVTATTT CC GYDPMTI C PP C M C THS C PPKRKP*GRRND
J-superfamily
  28.    MTSVQSATCCCLLWLVLCVQLVTPDSPATAQLSRHLTAR VPVGPALAYA C SVM C AKGYDTVV C T C TRRRG*VVSSSI
Contryphan
  29.    MGKLTILVLVAAVLLSTQVMGQGDRDQPAARNAVPRDDNPGGASAKLMNLLHRSKCPWSPWC*G
Conkunitzin
  30. MEGRRFAAVLILPICMLAPGAVAS KR WTRPSV C NLPAESGTGTQSLKRFYYNSDKMQ C RTFIYKGNGGNDNNFPRTYD C QKK C LYRP*G
Cysteine motifs are shown next to the superfamilies. The underlined residues indicate the presumed propeptide cleavage site, ascertained by analogy to previously isolated toxins; * indicates probable amidation of the C-terminal residue after cleavage of the following G residue. For peptides 23, 24, 25 and 26, where the propeptide cleavage site is uncertain, we have indicated the cleavage site at the basic residue (K) proximal to the presumed toxin sequence. The peptides Bu 7, 16, 18 and 20 have been previously characterized.
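
The cysteine motifs listed next to each superfamily can be recomputed from the sequences as a quick consistency check. The following is a minimal sketch, not part of the original analysis: the function name `cysteine_framework` is hypothetical, and it assumes the input is only the predicted toxin region (the part after the cleavage site), because signal and propeptide regions also contain cysteines that would otherwise be counted.

```python
import re

def cysteine_framework(mature_region: str) -> str:
    """Collapse a peptide sequence into its cysteine pattern, e.g. 'C-C-CC-C-C'."""
    # Keep only residue letters: drops spaces, '*', and bracketed reference numbers.
    residues = re.sub(r"[^A-Z]", "", mature_region.upper())
    # Each run of adjacent cysteines becomes one block; blocks are joined by '-'.
    return "-".join(re.findall(r"C+", residues))

# Row 1 (after the marked cleavage site) and row 17 (whole entry), as printed in the table.
print(cysteine_framework("C KGPGAK C LKTMYD CC KYS C SRGR C"))      # C-C-CC-C-C
print(cysteine_framework("GLY CC QPKPNGQMM C NRW C EINSR CC* GRR"))  # CC-C-C-CC
```

The two results match the O-superfamily and M-superfamily motifs given in the table headings. Passing a full precursor instead of the toxin region would add extra blocks from upstream cysteines.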