Table 2 Comparison of the HRP isoenzyme sequences between GenBank, UniProt, transcriptome and verified genome sequences

From: Peroxidase gene discovery from the horseradish transcriptome

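| HRP | Sanger nt | Sanger aa (exon)/intron | Transcriptome nt | Transcriptome aa | GenBank nt | GenBank aa (exon)/intron | UniProt aa |
|---|---|---|---|---|---|---|---|
| C1A | TA 109-110 | Y37 | - | - | AT 109-110 | I37 | Y37 |
| | C1159 | intron | - | intron | G1159 | intron | - |
| C1B | T/C253 | intron | - | intron | T253 | intron | - |
| | T/C859 | intron | - | intron | C859 | intron | - |
| C1C | ss nt1-60 | ss aa1-20 | ss nt1-60 | ss aa1-20 | * | * | * |
| | C178 | R60 | C178 | R60 | A118 | S40 | S40 |
| | A/T1335 | intron | - | intron | - | - | - |
| | A/G1888 | T/A165 | A/G493 | T/A165 | G433 | A145 | A145 |
| | C1889 | A165 | C/T1889 | A/V165 | | | |
| | C/G1921 | Q/E176 | C/G526 | Q/E176 | G466 | E156 | E156 |
| C2 | CT 1250-1251 | intron | - | intron | * | intron | - |
| | A1334 | intron | - | intron | * | intron | - |
| C3 | G/T1294 | intron | - | intron | G1294 | intron | - |
| | A/T1323 | intron | - | intron | A1323 | intron | - |
| | T/C1484 | L231 | - | - | T1484 | L231 | L231 |
| | C/T1541 | F250 | - | - | C1541 | F250 | F250 |
| A2 | ss nt1-93 | ss aa1-31 | ss nt1-93 | ss aa1-31 | - | - | * |
| | AAT 231-234 | N78 | AAT 231-234 | N47 | - | - | D47 |
| | GGA 996-998 | G220 | GGA 661-663 | G220 | - | - | N189 |
| | AAT 999-1001 | N221 | AAT 664-666 | N221 | - | - | G190 |
| | ACG 1185-1187 | T284 | ACG 850-852 | T284 | - | - | L253 |
| | G/A1203 | A/T290 | G868 | A290 | - | - | A259 |
| | AAT 1335-1337 | N334 | AAT 999-1002 | N334 | - | - | D303 |
| E5 | ss nt1-81 | ss aa1-27 | ss nt1-81 | ss aa1-27 | - | - | * |
| | T419 | L82 | C/T246 | L82 | - | - | L55 |
| | C422 | D83 | T/C249 | D83 | - | - | D56 |
| | C545 | C124 | T/C372 | C124 | - | - | C97 |
| 01805 | None | None | None | None | - | - | - |
| 22684 | G1611 | R337 | A1010 | K337 | - | - | - |
| | TGA 1627-1629 | D343 | CGG 1026-1028 | G343 | - | - | - |
| 01350 | None | None | None | None | - | - | - |
| 02021 | None | None | None | None | - | - | - |
| 03523 | None | None | None | None | - | - | - |
| 06117 | T30 | V10 | C/T30 | V10 | - | - | - |
| | C1088 | I269 | T807 | I269 | - | - | - |
| 17517 | T190 | Y64 | C190 | H64 | - | - | - |
| | C1157 | G282 | T846 | G282 | - | - | - |
| | A1232 | K307 | G921 | K307 | - | - | - |
| 08562.1 | None | None | None | None | - | - | - |
| 08562.4 | None | None | None | None | - | - | - |
| 23190 | T1345 | S109 | G1345 | S109 | - | - | - |
| | C1423 | G135 | T1423 | G135 | - | - | - |
| | T1842 | S222 | T/C1842 | S/P222 | | | |
| | C1850 | T224 | A/C1850 | T224 | - | - | - |
| | A2221 | E348 | T/A2221 | V/E348 | - | - | - |
| 04663 | None | None | None | None | - | - | - |
| 06351 | None | None | None | None | - | - | - |
| 05508 | G/A346 | A/T116 | G/A346 | A/T116 | - | - | - |
| 22489 | - | - | G/A597 | T199 | - | - | - |
| | . | . | G/T715 | A/S239 | - | - | - |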
  1. All nucleotide (nt) and amino acid (aa) positions were calculated from the start ATG. Differences between nt positions in the transcriptome sequence and those in the other sequence sources are due either to deletions or to intronic sequences in the other sources. “-” indicates that no sequence information is available from the respective source. “ss” indicates a putative signal sequence. Deletions or missing sequences are marked with “*”. “N/NX” indicates a variation at position X. “None” indicates that no polymorphisms or differences between transcriptome and genome sequence were found. The isoenzymes C1A and C3 were detected in the transcriptome raw reads with only partial or low coverage (0-2x), so no consensus sequence was formed.
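
The notation described in the footnote can be read mechanically. The following is a minimal sketch, not the authors' tooling: the names `Variant`, `parse_variant` and `codon_number` are hypothetical and introduced only for illustration. It splits a cell such as `A/G493` into its alleles and position, and converts a nucleotide position counted from the start ATG into a codon number. The conversion assumes an intron-free coding sequence, so it should not be expected to hold for positions numbered on a sequence that includes introns.

```python
import re
from typing import NamedTuple, Optional, Tuple

# One or more alleles separated by "/", followed by a position, e.g. "C178" or "A/G493".
# The same pattern also fits amino acid cells such as "T/A165".
_VARIANT = re.compile(r"^([A-Z]+(?:/[A-Z]+)*)(\d+)$")


class Variant(NamedTuple):
    alleles: Tuple[str, ...]  # ("A", "G") for "A/G493"; a single entry if no variation
    position: int             # 1-based position counted from the start ATG


def parse_variant(cell: str) -> Optional[Variant]:
    """Parse a table cell; return None for markers such as "-", "*", "ss", "intron", "None"."""
    match = _VARIANT.match(cell.strip())
    if match is None:
        return None
    return Variant(tuple(match.group(1).split("/")), int(match.group(2)))


def codon_number(nt_position: int) -> int:
    """Codon (aa) number for a 1-based nt position in an intron-free coding sequence."""
    return (nt_position + 2) // 3


if __name__ == "__main__":
    variant = parse_variant("A/G493")
    print(variant, codon_number(variant.position))
```

For the C1C transcriptome entry `A/G493`, this gives codon 165, which agrees with the `T/A165` listed in the adjacent aa column.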