ROOFS: RObust biOmarker Feature Selection
Anastasiia Bakhmach, Paul Dufoss\'e, Andrea Vaglio, Florence Monville, Laurent Greillier, Fabrice Barl\'esi, S\'ebastien Benzekry

TL;DR
ROOFS is a Python package that benchmarks feature selection methods for biomedical data, helping researchers choose the most suitable approach to improve reproducibility and clinical translation.
Contribution
The paper introduces ROOFS, a comprehensive benchmarking tool for feature selection methods tailored to biomedical data challenges, with demonstrated utility on lung cancer immunotherapy data.
Findings
A filter based on FDR-adjusted p-values outperformed LASSO.
ROOFS effectively evaluates stability and predictive performance of FS methods.
Benchmarking improves reproducibility and translational potential of clinical models.
Abstract
Feature selection (FS) is essential for biomarker discovery and clinical predictive modeling. Over the past decades, methodological literature on FS has become rich and mature, offering a wide spectrum of algorithmic approaches. However, much of this methodological progress has not fully translated into applied biomedical research. Moreover, challenges inherent in biomedical data, such as high-dimensional feature space, low sample size, multicollinearity, and missing values, make FS non-trivial. To help bridge this gap between methodological development and practical application, we propose ROOFS (RObust biOmarker Feature Selection), a Python package available at https://gitlab.inria.fr/compo/roofs, designed to help researchers in the choice of FS method adapted to their problem. ROOFS benchmarks multiple FS methods on the user's data and generates reports summarizing a comprehensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer Immunotherapy and Biomarkers · Radiomics and Machine Learning in Medical Imaging · vaccines and immunoinformatics approaches
