Skip to main content

Table 3 Properties of the five benchmark datasets

From: Significant distinct branches of hierarchical trees: a framework for statistical analysis and applications to biological data

Dataset

Origin

Number of leaves

Number of variables

True number of classes

Simulated6

Simulation of gene expression

60

600

6

Leukemia

mRNA levels from microarray analysis

38

999

3

T10

DNA copy number analysis, sequencing

100

354

4

Organelles

Proteomic analysis, using mass spectrometry

24

4768

4

Chondrosarcoma

Flow cytometry analysis of surface markers from fluorescence intensity

32

11

4