Assessment of Case Influence in the Lasso with a Case-weight Adjusted Solution Path
Zhenbang Jiao, Yoonkyung Lee

TL;DR
This paper develops a method to assess the influence of individual data points in Lasso regression by creating a solution path that accounts for case weights, enabling influence analysis without refitting models.
Contribution
It introduces a piecewise linear solution path for case influence in Lasso, incorporating a weight parameter to evaluate influence across different penalty levels.
Findings
Solution path is piecewise linear with respect to the weight parameter.
Case influence varies with the penalty parameter and variable selection.
Influence graphs reveal different patterns in underfitting and overfitting regimes.
Abstract
We study case influence in the Lasso regression using Cook's distance which measures overall change in the fitted values when one observation is deleted. Unlike in ordinary least squares regression, the estimated coefficients in the Lasso do not have a closed form due to the nondifferentiability of the penalty, and neither does Cook's distance. To find the case-deleted Lasso solution without refitting the model, we approach it from the full data solution by introducing a weight parameter ranging from 1 to 0 and generating a solution path indexed by this parameter. We show that the solution path is piecewise linear with respect to a simple function of the weight parameter under a fixed penalty. The resulting case influence is a function of the penalty and weight, and it becomes Cook's distance when the weight is 0. As the penalty parameter changes, selected variables change, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference
