Skip to main content

Table 1 The edge analysis of overlap graph before and after Edge Adjustment

From: A de novo next generation genomic sequence assembler based on string graph and MapReduce cloud computing framework

Simulated

E. coli Dataset

Edge Type

# of edges before

Edge Adjustment

# of edges after

Edge Adjustment

100 × 36 bp

0.5% error

dataset

Class I

92829732

92754696

[99.92%]

 

Class II

14519426

322510

[2.22%]

 

Class III

252762

118542

[46.90%]

 

Class IV

377856

294110

[77.84%]

100 × 36 bp

1% error

dataset

Class I

76439532

76364264

[99.90%]

 

Class II

24836446

749900

[3.02%]

 

Class III

358432

76162

[21.25%]

 

Class IV

132412

92834

[70.11%]

200 × 150 bp

0.5% error

dataset

Class I

115230002

115163888

[99.94%]

 

Class II

74214420

438274

[0.59%]

 

Class III

1347100

51988

[3.86%]

 

Class IV

403836

322746

[79.92%]

200 × 150 bp

1% error

dataset

Class I

32604042

32580388

[99.93%]

 

Class II

53758272

554020

[1.03%]

 

Class III

1422472

57494

[4.04%]

 

Class IV

256952

225124

[87.61%]