Lazy Estimation of Variable Importance for Large Neural Networks
Yue Gao, Abby Stevens, Rebecca Willet, Garvesh Raskutti

TL;DR
This paper introduces a fast, linearization-based method for estimating variable importance in large neural networks, providing theoretical guarantees and confidence bounds, and demonstrating efficiency and accuracy in simulations and climate data.
Contribution
It proposes a novel linearization approach with inferential guarantees for variable importance estimation, reducing computational costs compared to traditional methods.
Findings
Method is faster than retraining models
Estimates are asymptotically normal with confidence bounds
Effective in simulations and climate forecasting
Abstract
As opaque predictive models increasingly impact many areas of modern life, interest in quantifying the importance of a given input variable for making a specific prediction has grown. Recently, there has been a proliferation of model-agnostic methods to measure variable importance (VI) that analyze the difference in predictive power between a full model trained on all variables and a reduced model that excludes the variable(s) of interest. A bottleneck common to these methods is the estimation of the reduced model for each variable (or subset of variables), which is an expensive process that often does not come with theoretical guarantees. In this work, we propose a fast and flexible method for approximating the reduced model with important inferential guarantees. We replace the need for fully retraining a wide neural network by a linearization initialized at the full model parameters.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
