Skip to main content

Table 2 Compression ratios on H. sapiens dataset-1152 (3128 GB)

From: Sketch distance-based clustering of chromosomes for large genome database compression

Reference

Compression ratio with algorithm

 

HiRGC

iDoComp

GDC2

HG00096

991.77

485.35

2919.58

NA18856

889.32

437.05

2805.19

GCA_000004845

784.84

53.94

2901.44

GCA_000252825

504.41

114.40

2897.76

GCA_000365445

13.07

/

/

hg19

1046.84

68.36

/

hg38

826.31

52.03

/

Result of ECC

1137.21

576.84

3033.84

Ratio gain*

7.95%

15.86%

3.77%

  1. ’/’ indicates a running time longer than 500 h. Bold text indicates the highest compression ratio of an algorithm, italic text indicates the best case of fixed single reference compression result.
  2. *The ratio gain of ECC against the best case