Improved error rates for sparse (group) learning with Lipschitz loss   functions

Antoine Dedieu

arXiv:1910.08880·stat.ML·September 23, 2021

Improved error rates for sparse (group) learning with Lipschitz loss functions

Antoine Dedieu

PDF

Open Access

TL;DR

This paper develops a unified theoretical framework for high-dimensional sparse and group-sparse estimators using Lipschitz loss functions, providing new bounds and an efficient algorithm for large-scale problems.

Contribution

It introduces a new theoretical approach that derives high-dimensional estimation bounds for L1, Slope, and Group L1-L2 regularizations with Lipschitz losses, improving existing results.

Findings

01

Bounds match the optimal minimax rate for L1 and Slope regularizations.

02

Group L1-L2 bounds outperform the least-squares case when the signal is strongly group-sparse.

03

An accelerated proximal algorithm efficiently computes estimators for very large variable sets.

Abstract

We study a family of sparse estimators defined as minimizers of some empirical Lipschitz loss function -- which include the hinge loss, the logistic loss and the quantile regression loss -- with a convex, sparse or group-sparse regularization. In particular, we consider the L1 norm on the coefficients, its sorted Slope version, and the Group L1-L2 extension. We propose a new theoretical framework that uses common assumptions in the literature to simultaneously derive new high-dimensional L2 estimation upper bounds for all three regularization schemes. %, and to improve over existing results. For L1 and Slope regularizations, our bounds scale as $(k^{*} / n) lo g (p / k^{*})$ -- $n \times p$ is the size of the design matrix and $k^{*}$ the dimension of the theoretical loss minimizer $\B β^{*}$ -- and match the optimal minimax rate achieved for the least-squares case. For Group L1-L2…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Statistical Methods and Bayesian Inference