TL;DR
This paper introduces fast algorithms for $L_0$-regularized sparse learning problems, combining coordinate descent and local combinatorial optimization, outperforming existing methods in speed and accuracy across various statistical regimes.
Contribution
It develops a new hierarchy of optimality conditions and algorithms for $L_0$-regularized problems with convex penalties, significantly improving computational efficiency and statistical performance.
Findings
Algorithms outperform state-of-the-art in prediction and variable selection.
Open-source toolkit L0Learn achieves up to 3x speedup.
Methods excel in high signal strength and challenging settings.
Abstract
The -regularized least squares problem (a.k.a. best subsets) is central to sparse statistical learning and has attracted significant attention across the wider statistics, machine learning, and optimization communities. Recent work has shown that modern mixed integer optimization (MIO) solvers can be used to address small to moderate instances of this problem. In spite of the usefulness of -based estimators and generic MIO solvers, there is a steep computational price to pay when compared to popular sparse learning algorithms (e.g., based on regularization). In this paper, we aim to push the frontiers of computation for a family of -regularized problems with additional convex penalties. We propose a new hierarchy of necessary optimality conditions for these problems. We develop fast algorithms, based on coordinate descent and local combinatorial optimization, that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
