Variable selection in high-dimensional logistic regression models using a whitening approach
Wencan Zhu, C\'eline L\'evy-Leduc, Nils Tern\`es

TL;DR
This paper introduces WLogit, a novel feature selection method for high-dimensional logistic regression that effectively handles correlated biomarkers, improving biomarker identification and classification accuracy in bioinformatics.
Contribution
The paper proposes WLogit, a whitening-based feature selection method that outperforms existing approaches in identifying active biomarkers in highly correlated data.
Findings
WLogit accurately identifies active biomarkers in synthetic data with high correlation.
WLogit achieves higher classification accuracy on lymphoma subtype data.
WLogit outperforms other methods in biomarker selection and classification tasks.
Abstract
In bioinformatics, the rapid development of sequencing technology has enabled us to collect an increasing amount of omics data. Classification based on omics data is one of the central problems in biomedical research. However, omics data usually has a limited sample size but high feature dimensions, and it is assumed that only a few features (biomarkers) are active, i.e. informative to discriminate between different categories (cancer subtypes, responder/non-responder to treatment, for example). Identifying active biomarkers for classification has therefore become fundamental for omics data analysis. Focusing on binary classification, we propose an innovative feature selection method aiming at dealing with the high correlations between the biomarkers. Various research has shown the notorious influence of correlated biomarkers and the difficulty of accurately identifying active ones. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification
