Skip to main content

Table 1 The data sets that are used for evaluating the performance of error correction models

From: Mining statistically-solid k-mers for accurate NGS error correction

Data set Genome name Genome size (bp) Error rate (%) Read length (bp) Coverage Number of reads Insert length Is sythetic
R1 S. aueus 2,821,361 1.28 101 46.3 × 1,294,104 180 No
R2 R. sphaeroides 4,603,110 1.08 101 45.0 × 2,050,868 180 No
R3 H. chromosome 14 88,218,286 0.52 101 41.8 × 36,504,800 155 No
R4 B. impatiens 249,185,056 0.86 124 150.8 × 303,118,594 400 No
S1 H. chromosome 14 88,218,286 0.97 101 41.8 × 36,504,800 180 Yes
S2 B. impatiens 249,185,056 0.98 124 150.8 × 303,118,594 400 Yes