Correlated Feature Selection with Extended Exclusive Group Lasso
Yuxin Sun, Benny Chain, Samuel Kaski, John Shawe-Taylor

TL;DR
This paper introduces a fast algorithm for the exclusive group Lasso, improving feature selection in correlated biological data and addressing unknown group structures, outperforming traditional Lasso.
Contribution
The paper proposes a novel, efficient algorithm for exclusive group Lasso and a method to handle unknown group structures, enhancing feature selection in correlated high-dimensional data.
Findings
Outperforms Lasso in selecting informative features
Effective in biological datasets with correlated features
Handles unknown group structures using stability selection
Abstract
In many high dimensional classification or regression problems set in a biological context, the complete identification of the set of informative features is often as important as predictive accuracy, since this can provide mechanistic insight and conceptual understanding. Lasso and related algorithms have been widely used since their sparse solutions naturally identify a set of informative features. However, Lasso performs erratically when features are correlated. This limits the use of such algorithms in biological problems, where features such as genes often work together in pathways, leading to sets of highly correlated features. In this paper, we examine the performance of a Lasso derivative, the exclusive group Lasso, in this setting. We propose fast algorithms to solve the exclusive group Lasso, and introduce a solution to the case when the underlying group structure is unknown.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Gene expression and cancer classification · Ferroptosis and cancer prognosis
