From: VariantSpark: population scale clustering of genotype information
Tool | Pre-processing | Clustering | Accuracy | ||||
---|---|---|---|---|---|---|---|
Threads | Memory | Time | Threads | Memory | Time | ||
VariantSpark | 8 | 32 | 2 min 58 sec | 8 | 32 | 1 min 20 sec | 0.84 |
ADAM | 8 | 32 | 12 min 48 sec | 8 | 32 | 1 min 52 sec | 0.84 |
Hadoop | 8 | 32 | 14 min 22 sec | 8 | 32 | 14 min 23 sec | 0.84 |
R | 1 | 32 | 34 min 30 sec | 8 | 32 | 7 min 25 sec | 0.84 |
Python | 1 | 32 | 34 min 15 sec | 8 | 32 | 11 min 29 sec | 0.84 |
Admixture | 1 | 32 | 10 min 08 sec | 8 | 32 | 8 min 19 sec | 0.25 |