
TL;DR
The paper introduces Accept-Reject Lasso (ARL), a novel method that improves feature selection stability in highly correlated data by distinguishing true from spurious correlations through subset analysis.
Contribution
ARL is the first approach to simultaneously address both false negatives and false positives in correlated feature selection using a subset-based accept-reject framework.
Findings
ARL effectively differentiates true and spurious correlations.
ARL improves feature selection stability in simulations.
ARL demonstrates superior performance on real datasets.
Abstract
The Lasso method is known to exhibit instability in the presence of highly correlated features, often leading to an arbitrary selection of predictors. This issue manifests itself in two primary error types: the erroneous omission of features that lack a true substitutable relationship (falsely redundant features) and the inclusion of features with a true substitutable relationship (truly redundant features). Although most existing methods address only one of these challenges, we introduce the Accept-Reject Lasso (ARL), a novel approach that resolves this dilemma. ARL operationalizes an Accept-Reject framework through a fine-grained analysis of feature selection across data subsets. This framework is designed to partition the output of an ensemble method into beneficial and detrimental components through fine-grained analysis. The fundamental challenge for Lasso is that inter-variable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
