Skip to main content
Figure 1 | BMC Genomics

Figure 1

From: Pooling breast cancer datasets has a synergetic effect on classification performance and improves signature stability

Figure 1

Result of the repeated random resampling procedure on the Veer et al.[1] data. The histogram shows the frequencies of genes being among the top 200 genes over 500 resamplings. Below the histogram, two lanes containing light-blue bars indicate the genes that are part of the published signatures. The red line indicates the frequency threshold corresponding to the expected value of the frequency under the null hypothesis (no information) given the number of genes N that is selected in each of the R resamplings.

Back to article page