On Lasso refitting strategies
Evgenii Chzhen, Mohamed Hebiri, Joseph Salmon

TL;DR
This paper formalizes refitting strategies for Lasso, analyzing their theoretical properties and proposing methods that improve interpretability and prediction accuracy, supported by extensive numerical experiments.
Contribution
It introduces a formal framework for Lasso refitting, defines sign consistent refitting, and analyzes Bregman and Boosted Lasso strategies with theoretical guarantees.
Findings
Sign consistent refitting preserves original signs.
Bregman Lasso converges to Sign-Least-Squares refitting.
Boosted Lasso achieves better oracle prediction rates.
Abstract
A well-know drawback of l_1-penalized estimators is the systematic shrinkage of the large coefficients towards zero. A simple remedy is to treat Lasso as a model-selection procedure and to perform a second refitting step on the selected support. In this work we formalize the notion of refitting and provide oracle bounds for arbitrary refitting procedures of the Lasso solution. One of the most widely used refitting techniques which is based on Least-Squares may bring a problem of interpretability, since the signs of the refitted estimator might be flipped with respect to the original estimator. This problem arises from the fact that the Least-Squares refitting considers only the support of the Lasso solution, avoiding any information about signs or amplitudes. To this end we define a sign consistent refitting as an arbitrary refitting procedure, preserving the signs of the first step…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
