Derivatives and residual distribution of regularized M-estimators with application to adaptive tuning
Pierre C Bellec, Yiwei Shen

TL;DR
This paper derives formulas for derivatives of regularized M-estimators in linear models, characterizes residual distributions in high dimensions, and proposes an adaptive tuning criterion that approximates out-of-sample error without prior noise or covariance knowledge.
Contribution
It provides a unified differentiability structure for convex regularized M-estimators, characterizes residual distributions in high-dimensional settings, and introduces a practical adaptive tuning method.
Findings
Derived general formulas for derivatives of regularized M-estimators.
Characterized residual distribution in high-dimensional regimes.
Proposed an adaptive criterion for tuning parameter selection.
Abstract
This paper studies M-estimators with gradient-Lipschitz loss function regularized with convex penalty in linear models with Gaussian design matrix and arbitrary noise distribution. A practical example is the robust M-estimator constructed with the Huber loss and the Elastic-Net penalty and the noise distribution has heavy-tails. Our main contributions are three-fold. (i) We provide general formulae for the derivatives of regularized M-estimators where differentiation is taken with respect to both and ; this reveals a simple differentiability structure shared by all convex regularized M-estimators. (ii) Using these derivatives, we characterize the distribution of the residual in the intermediate high-dimensional regime where dimension and sample size are of the same order. (iii) Motivated by the distribution of the residuals, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Direction-of-Arrival Estimation Techniques · Sparse and Compressive Sensing Techniques
MethodsHuber loss
