Derivatives and residual distribution of regularized M-estimators with   application to adaptive tuning

Pierre C Bellec; Yiwei Shen

arXiv:2107.05143·math.ST·January 3, 2025·COLT·5 cites

Derivatives and residual distribution of regularized M-estimators with application to adaptive tuning

Pierre C Bellec, Yiwei Shen

PDF

Open Access

TL;DR

This paper derives formulas for derivatives of regularized M-estimators in linear models, characterizes residual distributions in high dimensions, and proposes an adaptive tuning criterion that approximates out-of-sample error without prior noise or covariance knowledge.

Contribution

It provides a unified differentiability structure for convex regularized M-estimators, characterizes residual distributions in high-dimensional settings, and introduces a practical adaptive tuning method.

Findings

01

Derived general formulas for derivatives of regularized M-estimators.

02

Characterized residual distribution in high-dimensional regimes.

03

Proposed an adaptive criterion for tuning parameter selection.

Abstract

This paper studies M-estimators with gradient-Lipschitz loss function regularized with convex penalty in linear models with Gaussian design matrix and arbitrary noise distribution. A practical example is the robust M-estimator constructed with the Huber loss and the Elastic-Net penalty and the noise distribution has heavy-tails. Our main contributions are three-fold. (i) We provide general formulae for the derivatives of regularized M-estimators $\hat{β} (y, X)$ where differentiation is taken with respect to both $y$ and $X$ ; this reveals a simple differentiability structure shared by all convex regularized M-estimators. (ii) Using these derivatives, we characterize the distribution of the residual $r_{i} = y_{i} - x_{i}^{⊤} \hat{β}$ in the intermediate high-dimensional regime where dimension and sample size are of the same order. (iii) Motivated by the distribution of the residuals, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Direction-of-Arrival Estimation Techniques · Sparse and Compressive Sensing Techniques

MethodsHuber loss