Skip to main content
Figure 10 | BMC Genomics

Figure 10

From: Pooling breast cancer datasets has a synergetic effect on classification performance and improves signature stability

Figure 10

Chart listing the 127 genes selected in the classifier trained on all six datasets. For each gene, we list the rank, Entrez id, and Gene symbol. Green cell shading indicates the genes that are part of the signature from the six pooled datasets, which are not part of any of the signatures from the single datasets. Yellow cell shading indicates the seven microtubule associated genes. The succeeding columns indicate the rank position of a particular gene in each of the six separate rankings. An orange cell shading indicates the genes that were part of the individual signatures. The purple cell shading indicates the overlap to a group of existing breast cancer signatures (Wang et al. [2], Vijver et al. [3], Naderi et al. [19], Teschendorff et al. [20]), and a group of breast cancer associated genes (Pujana et al. [29]). P-values indicating the significance of the overlap (hypergeometric test) of these signatures is given at the bottom of the columns.

Back to article page