
Table 3 Ascidian and human HEAT repeats mapped on the protein sequence of the corresponding species.

From: Huntingtin gene evolution in Chordata and its peculiar features in the ascidian Ciona genus

| Species | HEAT name | REP E-value | Htt region | Location | Sequence |
|---|---|---|---|---|---|
| C. intestinalis | A1 | 0.0005 | N-term | 58–96 | PGLLAVSVETLLQSCADDNADVRLNANECLNRLIKGLYE |
| C. intestinalis | A2 | 5.96E-06 | N-term | 139–177 | RPYILNLLPCLCRISQREEDGVQETLGLSLVKIFKILGP |
| C. intestinalis | A3 | 1.35E-06 | N-term | 181–219 | ESEIQGLLASFLKNLSHKSATMRRTACVCLHSVILNCRK |
| C. intestinalis | B4 | 6.19E-06 | N-term | 682–720 | QSLSHQALSIALKCLCDDDLRLRKTAAATIVTMPTSFPT |
| C. intestinalis | c | 2.30E-06 | Central | 867–905 | SQQQFGILPFVMSLLHSAWLPLDVTAHSDALVLAGNLVA |
| C. intestinalis | E1 | 1.26E-06 | Central | 1341–1378 | QGSASHVIPAMQPIIHDI.YVVRASSKNEPPEVTTQREV |
| C. intestinalis | g1 | 9.05E-06 | C-term | 2771–2809 | ARVMSKVLPSMLDDFFPAQDIMNKIIAEFISTLQPFPAS |
| C. intestinalis | g2 | 1.46E-06 | C-term | 2864–2904 | NRWISSMVPLIISRVHDPTLDVDWTCFCKAAVDFYTCQLSE |
| C. savignyi | A1 | 2.92E-07 | N-term | 58–96 | PGLLAVSVETLLQSCADENADVRLNSNECLNRVIKGLYD |
| C. savignyi | A2 | 0.0001 | N-term | 139–177 | RPYILNLLPCLCRISQREEDAVQEVLSSSLAKIFIVLGA |
| C. savignyi | A3 | 2.52E-06 | N-term | 181–219 | ESEIQGLLASFLKNLSHKSPTVRRTACICLHSILTNSRK |
| C. savignyi | B4 | 1.53E-06 | N-term | 692–730 | KSIAQKALSIALECLCDEDTRLRKTSSAAIVSMATSYPT |
| C. savignyi | c | 1.46E-06 | Central | 876–914 | AQQQFGILPIVMSLLRSAWLPLDVTAHSDALVLAGNLIA |
| C. savignyi | E1 | - | Central | 1352–1389 | QGSASHVIPAMQPITHDI.FVVRGSLKNEPPEVTTQREV |
| C. savignyi | g1 | 1.27E-06 | C-term | 2770–2808 | ARVMSKILPSMLDDFFPAQEIMNKIIAEFISTLQPFPGS |
| C. savignyi | g2 | - | C-term | 2864–2903 | RWISSMVPLIISRSHDPSLDRNWTCFCKSAVDFYTCQLSE |
| Homo sapiens | A1 | 4.75E-07 | N-term | 124–162 | QKLLGIAMELFLLCSDDAESDVRMVADECLNKVIKALMD |
| Homo sapiens | A2 | 0.0001 | N-term | 205–243 | RPYLVNLLPCLTRTSKRPEESVQETLAAAVPKIMASFGN |
| Homo sapiens | A3 | 5.48E-07 | N-term | 247–285 | DNEIKVLLKAFIANLKSSSPTIRRTAAGSAVSICQHSRR |
| Homo sapiens | a4 | * | N-term | 291–329 | SWLLNVLLGLLVPVEDEHSTLLILGVLLTLRYLVPLLQQ |
| Homo sapiens | a5 | 7.77E-06 | N-term | 318–362 | LTLRYLVPLLQQQVKDTSLKGSFGVTRKEMEVSPSAEQLVQVYEL |
| Homo sapiens | b1 | * | N-term | 745–783 | EYPEEQYVSDILNYIDHGDPQVRGATAILCGTLICSILS |
| Homo sapiens | b2 | 1.04E-06 | N-term | 803–841 | TFSLADCIPLLRKTLKDESSVTSKLACTAVRNCVMSLCS |
| Homo sapiens | b3 | * | N-term | 842–880 | SSYSELGLQLIIDVLTLRNSSYWLVRTELLETLAEIDFR |
| Homo sapiens | B4 | 6.69E-08 | N-term | 904–942 | KLQERVLNNVVIHLLGDEDPRVRHVAAASLIRLVPKLFY |
| Homo sapiens | b5 | 9.05E-06 | N-term | 984–1025 | RIYRGYNLLPSITDVTMENNLSRVIAAVSHELITSTTRALTF |
| Homo sapiens | d | 5.62E-06 | Central | 1425–1463 | RLFEPLVIKALKQYTTTTCVQLQKQVLDLLAQLVQLRVN |
| Homo sapiens | E1 | * | Central | 1534–1575 | RKAVTHAIPALQPIVHDLFVLRGTNKADAGKELETQKEVVVS |
| Homo sapiens | e2 | * | Central | 1610–1648 | RQIADIILPMLAKQQMHIDSHEALGVLNTLFEILAPSSL |
| Homo sapiens | e3 | * | Central | 1670–1710 | TVQLWISGILAILRVLISQSTEDIVLSRIQELSFSPYLISC |
| Homo sapiens | f | 3.51E-06 | C-term | 2798–2836 | DDTAKQLIPVISDYLLSNLKGIAHCVNIHSQQHVLVMCA |
HEAT repeats are named according to their relative position along the aligned chordate sequences, with the same letter assigned to repeats closer than 45 amino acids to each other. Orthologous HEAT repeats conserved in ascidians and human share the same name and are reported in upper case. The expectation values (E-values) were calculated by the REP program [62]. Htt regions are defined as in Methods. The absolute position of each HEAT repeat in the corresponding protein sequence is reported in the "Location" column. Dash: REP E-value not statistically significant. Asterisk: HEAT repeat originally described in Andrade and Bork [18] but not identified as statistically significant by the REP program [62].
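The letter-sharing rule in the footnote can be illustrated with a minimal Python sketch. It is not the authors' procedure: it assumes that "closer than 45 amino acids" means the start of a repeat lies fewer than 45 positions after the end of the previous one, and it uses the human protein coordinates from the "Location" column rather than the chordate alignment positions on which the names were actually assigned. The names `HUMAN_REPEATS`, `MAX_GAP`, `cluster_repeats` and `cluster_names` are hypothetical.

```python
# Human HEAT repeats from the table: (name, start, end) in protein coordinates.
HUMAN_REPEATS = [
    ("A1", 124, 162), ("A2", 205, 243), ("A3", 247, 285),
    ("a4", 291, 329), ("a5", 318, 362),
    ("b1", 745, 783), ("b2", 803, 841), ("b3", 842, 880),
    ("B4", 904, 942), ("b5", 984, 1025),
    ("d", 1425, 1463),
    ("E1", 1534, 1575), ("e2", 1610, 1648), ("e3", 1670, 1710),
    ("f", 2798, 2836),
]

# Assumption: distance is measured from the end of one repeat
# to the start of the next, and must be strictly below 45.
MAX_GAP = 45


def cluster_repeats(repeats, max_gap=MAX_GAP):
    """Group repeats so that neighbours closer than max_gap share a cluster."""
    clusters = []
    for name, start, end in sorted(repeats, key=lambda r: r[1]):
        previous_end = clusters[-1][-1][2] if clusters else None
        if previous_end is not None and start - previous_end < max_gap:
            clusters[-1].append((name, start, end))
        else:
            clusters.append([(name, start, end)])
    return clusters


def cluster_names(repeats, max_gap=MAX_GAP):
    """Return only the repeat names of each cluster, in sequence order."""
    return [[name for name, _, _ in c] for c in cluster_repeats(repeats, max_gap)]


if __name__ == "__main__":
    for group in cluster_names(HUMAN_REPEATS):
        print(" ".join(group))
```

Under these assumptions, each cluster of human repeats corresponds to one letter in the table (ignoring case). The same check on ascidian protein coordinates would not be expected to match everywhere: in C. intestinalis, g2 starts 55 residues after the end of g1, which is consistent with the names having been assigned on alignment positions rather than on raw protein positions.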