Post model-fitting exploration via a "Next-Door" analysis
Leying Guan, Robert Tibshirani

TL;DR
The paper introduces a 'Next-Door' analysis method for evaluating the importance of predictors in models like lasso, by examining nearby models that omit each predictor, providing insights into variable significance and alternative models.
Contribution
It proposes a novel approach to assess predictor importance post-model fitting, including p-values and a model score, applicable to various regression methods and implemented in R.
Findings
Identifies indispensable predictors based on predictive power deterioration.
Provides a statistical framework for comparing base and nearby models.
Applicable to Gaussian, generalized linear models, and other supervised learning problems.
Abstract
We propose a simple method for evaluating the model that has been chosen by an adaptive regression procedure, our main focus being the lasso. This procedure deletes each chosen predictor and refits the lasso to get a set of models that are "close" to the one chosen, referred to as "base model". If the deletion of a predictor leads to significant deterioration in the model's predictive power, the predictor is called indispensable; otherwise, the nearby model is called acceptable and can serve as a good alternative to the base model. This provides both an assessment of the predictive contribution of each variable and a set of alternative models that may be used in place of the chosen model. In this paper, we will focus on the cross-validation (CV) setting and a model's predictive power is measured by its CV error, with base model tuned by cross-validation. We propose a method for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Control Systems and Identification · Fault Detection and Control Systems
