Skip to main content

Table 4 Top ten organisms having the highest number of CDS specific to Genome Reviews (GR) database

From: Matching curated genome databases: a non trivial task

rank

organism

Total

matches

without sequence or specific to

  

RS

GR

total

inexact

by location

RS

GR

GR1

Mycobacterium leprae TN

1605

2723

1605

77

1

0

1118

GR2

Orientia tsutsugamushi str. Boryong (Seoul National University)

1182

2143

1182

3

0

0

961

GR3

Orientia tsutsugamushi str. Boryong (Kitasato University)

1562

2085

1562

6

0

0

523

GR4/RS3

Xanthomonas oryzae pv. oryzae KACC10331

4144

4540

4030

497

2

114

510

GR5

Acinetobacter baumannii ATCC 17978

3368

3807

3368

77

0

0

439

GR6/RS5

Shewanella oneidensis MR-1

4467

4779

4364

34

1

103

415

GR7/RS1

Pyrococcus horikoshii OT3

1955

2076

1806

124

0

149

270

GR8/RS6

Escherichia coli O157:H7 str. Sakai

5318

5461

5227

391

2

87

232

GR9

Prochlorococcus marinus subsp. pastoris str. CCMP1986

1717

1935

1714

4

2

3

221

GR10

Prochlorococcus marinus str. MIT 9312

1810

1962

1810

10

0

0

152

  1. The organisms are sorted by their respective rank that is computed as the number of CDS that are found only in Genome Reviews database (Release 94.0). The organism names standing in the top ten list of both databases (Tables 3 and 4) are in bold.