A critical evaluation of network and pathway based classifiers for outcome prediction in breast cancer
C. Staiger, S. Cadot, R. Kooter, M. Dittrich, T. Mueller, G. W. Klau,, L. F. A. Wessels

TL;DR
This study critically evaluates network and pathway-based classifiers for breast cancer outcome prediction, revealing they do not outperform simpler single gene classifiers when assessed on a large, unbiased dataset.
Contribution
It provides an unbiased comparison of composite feature classifiers and single gene classifiers using a large breast cancer dataset, challenging previous claims of superiority.
Findings
Composite classifiers do not outperform single gene classifiers.
Randomization of secondary data sources does not reduce classifier performance.
Proper correction for gene set size equalizes stability between classifier types.
Abstract
Recently, several classifiers that combine primary tumor data, like gene expression data, and secondary data sources, such as protein-protein interaction networks, have been proposed for predicting outcome in breast cancer. In these approaches, new composite features are typically constructed by aggregating the expression levels of several genes. The secondary data sources are employed to guide this aggregation. Although many studies claim that these approaches improve classification performance over single gene classifiers, the gain in performance is difficult to assess. This stems mainly from the fact that different breast cancer data sets and validation procedures are employed to assess the performance. Here we address these issues by employing a large cohort of six breast cancer data sets as benchmark set and by performing an unbiased evaluation of the classification accuracies of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
