

Figure 3 | BMC Genomics

Figure 3

From: Scoring relevancy of features based on combinatorial analysis of Lasso with application to lymphoma diagnosis


Variation of the area under the ROC curve (AUC) as the number of features used changes. The features are ranked by applying FeaLect to 20 random training samples. The training samples and the highest-scoring features are then used to build linear classifiers with lars. The best AUC is reported, measured on a set of validation samples disjoint from the training set. For both the lymphoma and colon datasets, the performance of the optimum classifier decreases when all features are provided to lars. In practice, this observation shows the advantage of using a limited number of highly scored features over pure lars.
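
The evaluation loop described in the caption can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' code: the data are synthetic, the feature ranking is hypothetical (it is not produced by FeaLect), and a simple difference-of-class-means linear scorer stands in for lars. The function names `auc`, `fit_weights`, `auc_by_feature_count`, and `make_data` are invented for this sketch.

```python
import random

def auc(scores, labels):
    """AUC via the rank-sum identity: fraction of (positive, negative)
    pairs where the positive scores higher, counting ties as half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def fit_weights(X, y, cols):
    """Stand-in for lars (assumption): the weight of each selected
    feature is mean(class 1) - mean(class 0) on the training set."""
    n1 = sum(y)
    n0 = len(y) - n1
    weights = {}
    for j in cols:
        m1 = sum(row[j] for row, lab in zip(X, y) if lab == 1) / n1
        m0 = sum(row[j] for row, lab in zip(X, y) if lab == 0) / n0
        weights[j] = m1 - m0
    return weights

def auc_by_feature_count(X_tr, y_tr, X_val, y_val, ranked, ks):
    """For each k, train on the k top-ranked features only and
    measure AUC on the disjoint validation set."""
    result = {}
    for k in ks:
        w = fit_weights(X_tr, y_tr, ranked[:k])
        scores = [sum(w[j] * row[j] for j in w) for row in X_val]
        result[k] = auc(scores, y_val)
    return result

def make_data(rng, n, n_features, n_informative):
    """Synthetic two-class data: only the first n_informative
    features differ between the classes."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [rng.gauss(label if j < n_informative else 0.0, 1.0)
               for j in range(n_features)]
        X.append(row)
        y.append(label)
    return X, y

rng = random.Random(0)
X_tr, y_tr = make_data(rng, 20, 50, 3)      # small training set
X_val, y_val = make_data(rng, 40, 50, 3)    # disjoint validation set
ranked = list(range(50))  # hypothetical ranking, informative features first
curve = auc_by_feature_count(X_tr, y_tr, X_val, y_val, ranked, [1, 3, 10, 50])
for k, a in curve.items():
    print(k, round(a, 3))
```

Plotting the values in `curve` against `k` gives a curve of the same kind as the one in the figure; the numbers from this synthetic setup say nothing about the lymphoma or colon results.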
