Improved error rates for sparse (group) learning with Lipschitz loss functions
Antoine Dedieu

TL;DR
This paper develops a unified theoretical framework for high-dimensional sparse and group-sparse estimators using Lipschitz loss functions, providing new bounds and an efficient algorithm for large-scale problems.
Contribution
It introduces a new theoretical approach that derives high-dimensional estimation bounds for L1, Slope, and Group L1-L2 regularizations with Lipschitz losses, improving existing results.
Findings
Bounds match the optimal minimax rate for L1 and Slope regularizations.
Group L1-L2 bounds outperform the least-squares case when the signal is strongly group-sparse.
An accelerated proximal algorithm efficiently computes estimators for very large variable sets.
Abstract
We study a family of sparse estimators defined as minimizers of some empirical Lipschitz loss function -- which include the hinge loss, the logistic loss and the quantile regression loss -- with a convex, sparse or group-sparse regularization. In particular, we consider the L1 norm on the coefficients, its sorted Slope version, and the Group L1-L2 extension. We propose a new theoretical framework that uses common assumptions in the literature to simultaneously derive new high-dimensional L2 estimation upper bounds for all three regularization schemes. %, and to improve over existing results. For L1 and Slope regularizations, our bounds scale as -- is the size of the design matrix and the dimension of the theoretical loss minimizer -- and match the optimal minimax rate achieved for the least-squares case. For Group L1-L2…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
