Skip to main content

Table 1 Number and distribution of redundant (R) and non-redundant (NR) reported DENV protein sequences in 2007 and 2018

From: Identification of highly conserved, serotype-specific dengue virus sequences: implications for vaccine design

Protein / Serotype

DENV1

DENV2

DENV3

DENV4

Total

2018R

2018NR

2018R

2018NR

2018R

2018NR

2018R

2018NR

2007R

2018R

2018NR

Increase (#|%)a

Reduction (#|%)b

C

3566

322

2061

312

1736

293

454

114

1278

7817

1041

6539 | 511%

6776 | 86.68%

prM

2651

364

2376

329

1787

168

659

89

1530

7473

950

5943 | 388%

6523 | 87.29%

E

2329

1074

5269

1533

2950

933

1724

543

3845

12,272

4083

8427 | 219%

8189 | 66.73%

NS1

2470

491

2190

488

1314

306

397

114

1784

6371

1399

4587 | 257%

4972 | 78.04%

NS2a

1982

411

1535

349

1012

207

334

97

705

4863

1064

4158 | 589%

3799 | 78.12%

NS2b

1978

155

1537

126

1019

87

259

38

614

4793

406

4179 | 680%

4387 | 91.53%

NS3

1976

404

1578

384

1204

309

276

92

695

5034

1189

4339 | 624%

3845 | 76.38%

NS4a

1949

141

1519

114

993

84

241

40

523

4702

379

4179 | 799%

4323 | 91.94%

NS4b

1952

193

1524

261

999

97

319

77

602

4794

628

4192 | 696%

4166 | 86.90%

NS5

2021

742

1995

749

1334

494

421

149

828

5771

2134

4943 | 596%

3637 | 63.02%

Total

22,874

4297

21,584

5020

14,348

2978

5084

1353

12,404

63,890

13,648

51,486 | 415%

50,242 | 78.64%

  1. RNumber of redundant sequences collected from the National Center for Biotechnology Information (NCBI) Taxonomy database in December 2007 [35] and April 2018
  2. NRNumber of non-redundant sequences after removal of duplicate sequences (full length and partial)
  3. aNumber and percentage of redundant sequences increase from 2007 [35] and 2018
  4. bNumber and percentage of sequence reduction for the 2018 dataset as a result of the removal of duplicate sequences; rounded to two decimal places