Skip to main content

Table 2 Compression ratios on H. sapiens dataset-1152 (3128 GB)

From: Sketch distance-based clustering of chromosomes for large genome database compression

ReferenceCompression ratio with algorithm
 HiRGCiDoCompGDC2
HG00096991.77485.352919.58
NA18856889.32437.052805.19
GCA_000004845784.8453.942901.44
GCA_000252825504.41114.402897.76
GCA_00036544513.07//
hg191046.8468.36/
hg38826.3152.03/
Result of ECC1137.21576.843033.84
Ratio gain*7.95%15.86%3.77%
  1. ’/’ indicates a running time longer than 500 h. Bold text indicates the highest compression ratio of an algorithm, italic text indicates the best case of fixed single reference compression result.
  2. *The ratio gain of ECC against the best case