Skip to main content

Table 1 The edge analysis of overlap graph before and after Edge Adjustment

From: A de novo next generation genomic sequence assembler based on string graph and MapReduce cloud computing framework

Simulated
E. coli Dataset
Edge Type # of edges before
Edge Adjustment
# of edges after
Edge Adjustment
100 × 36 bp
0.5% error
dataset
Class I 92829732 92754696 [99.92%]
  Class II 14519426 322510 [2.22%]
  Class III 252762 118542 [46.90%]
  Class IV 377856 294110 [77.84%]
100 × 36 bp
1% error
dataset
Class I 76439532 76364264 [99.90%]
  Class II 24836446 749900 [3.02%]
  Class III 358432 76162 [21.25%]
  Class IV 132412 92834 [70.11%]
200 × 150 bp
0.5% error
dataset
Class I 115230002 115163888 [99.94%]
  Class II 74214420 438274 [0.59%]
  Class III 1347100 51988 [3.86%]
  Class IV 403836 322746 [79.92%]
200 × 150 bp
1% error
dataset
Class I 32604042 32580388 [99.93%]
  Class II 53758272 554020 [1.03%]
  Class III 1422472 57494 [4.04%]
  Class IV 256952 225124 [87.61%]