Skip to main content

Table 3 64 taxa analyzed by 16S rRNA conventional blast analysis and by the new peptide based approach.

From: Peptide markers of aminoacyl tRNA synthetases facilitate taxa counting in metagenomic data

Kegg Taxon Name Blast 16S Peptides S61 Specific Peptide
ban Bacillus anthracis Ames 100% 100% ISRQLWWGH
bar Bacillus anthracis Ames 0581    ISRQLWWGH
cgb Corynebacterium glutamicum ATCC 13032 Bielefeld 100% 100% ISRQLWWGH
cgl Corynebacterium glutamicum ATCC 13032 Kyowa Hakko    ISRQLWWGH
sar Staphylococcus aureus MRSA252 100% > 99% ISRQLWWGH
sas Staphylococcus aureus MSSA476    ISRQLWWGH
sbl Shewanella baltica OS155 100% > 99% ISRQLWWGH
sbm Shewanella baltica OS185    ISRQLWWGH
ypa Yersinia pestis Antiqua 100% 100% ISRQLWWGH
ypg Yersinia pestis Angola    ISRQLWWGH
ypp Yersinia pestis Pestoides    ISRQLWWGH
bfr Bacteroides fragilis YCH46 > 99% 100% ISRQLWWGH
bfs Bacteroides fragilis NCTC9343    ISRQLWWGH
cbf Clostridium botulinum F > 99% 100% ISRQLWWGH
cbo Clostridium botulinum A    ISRQLWWGH
cpa Chlamydophila pneumoniae AR39 > 99% 100% ISRQLWWGH
cpj Chlamydophila pneumoniae J138    ISRQLWWGH
cta Chlamydia trachomatis serovar A > 99% 100% ISRQLWWGH
ctr Chlamydia trachomatis serovar D    ISRQLWWGH
eci Escherichia coli UTI89 UPEC > 99% > 99% ISRQLWWGH
eco Escherichia coli K-12 MG1655    ISRQLWWGH
llc Lactococcus lactis subsp cremoris SK11 > 99% > 99% ISRQLWWGH
llm Lactococcus lactis subsp cremoris MG1363    ISRQLWWGH
mtc Mycobacterium tuberculosis CDC1551 > 99% 100% ISRQLWWGH
mtf Mycobacterium tuberculosis F11    ISRQLWWGH
sag Streptococcus agalactiae 2603 serotype V > 99% > 99% ISRQLWWGH
san Streptococcus agalactiae NEM316 serotype III    ISRQLWWGH
stc Streptococcus thermophilus CNRZ1066 > 99% > 99% ISRQLWWGH
ste Streptococcus thermophilus LMD-9    ISRQLWWGH
syd Synechococcus sp CC9605 > 99% V ISRQLWWGH
sye Synechococcus sp CC9902   V ISRQLWWGH
bbr Bordetella bronchiseptica > 99% V ISRQLWWGH
bpe Bordetella pertussis   V ISRQLWWGH
pmf Prochlorococcus marinus MIT 9303 V V ISRQLWWGH
pmh Prochlorococcus marinus MIT 9215 V V ISRQLWWGH
tle Thermotoga lettingae V V ISRQLWWGH
tma Thermotoga maritime V V ISRQLWWGH
bth Bacteroides thetaiotaomicron V V ISRQLWWGH
cau Chloroflexus aurantiacus V V ISRQLWWGH
cdi Corynebacterium diphtheria V V ISRQLWWGH
cef Corynebacterium efficiens V V ISRQLWWGH
cha Campylobacter hominis ATCC BAA-381 V V ISRQLWWGH
cje Campylobacter jejuni NCTC11168 V V ISRQLWWGH
cmu Chlamydia muridarum V V ISRQLWWGH
cph Chlorobium phaeobacteroides V V ISRQLWWGH
det Dehalococcoides ethenogenes V V ISRQLWWGH
dge Deinococcus geothermalis V V ISRQLWWGH
dra Deinococcus radiodurans V V ISRQLWWGH
gfo Gramella forsetii V V ISRQLWWGH
rpd Rhodopseudomonas palustris BisB5 V V ISRQLWWGH
aav Acidovorax avenae V V ISRQLWWGH
abu Arcobacter butzleri V V ISRQLWWGH
ade Anaeromyxobacter dehalogenans V V ISRQLWWGH
atu Agrobacterium tumefaciens C58 UWashDupont V V ISRQLWWGH
rpa Rhodopseudomonas palustris CGA009 V V ISRQLWWGH
aae Aquifex aeolicus V V FFWVARMIM
cch Chlorobium chlorochromatii V V FFWVARMIM
chu Cytophaga hutchinsonii V V FFWVARMIM
fnu Fusobacterium nucleatum V V FFWVARMIM
rba Rhodopirellula baltica V V DTWFSSALWP
gme Geobacter metallireducens V V DTWFSSALWP
aau Arthrobacter aurescens V V DDNGLPTER
mga Mycoplasma gallisepticum V V DTWFSSALWP
mpe Mycoplasma penetrans V V ISRQLWWGH
  1. Kegg code and taxa names are given in the first 2 columns. Last three columns display the results of the two methods. Blast 16S analysis (3rd column): strains of the same species that were both fully matched (100%) to the same 16S rRNA query hence cannot be distinguished. Additional taxa that cannot be distinguished at an identity threshold of 99% are also indicated. 'V' represents taxa distinguished from each other with an identity level > 99%. S61 Peptide analysis (4th column): strains of the same species that were fully matched, i.e. no difference between the corresponding two sequences exist within the tested range (100%). Additional strains that cannot be distinguished if up to 4 amino-acid differences are allowed between the sequences (99%). 'V' represents taxa distinguished from each other with an identity level > 99%. Last column: SP identification of the sequence.