Variable Importance in Generalized Linear Models -- A Unifying View Using Shapley Values
Sinan Acemoglu, Christian Kleiber, J\"org Urban

TL;DR
This paper unifies the concept of variable importance in generalized linear models using Shapley values and introduces a Kullback-Leibler based pseudo-R^2 for consistent importance assessment.
Contribution
It proposes a unifying framework for variable importance in GLMs using Shapley values and introduces a new pseudo-R^2 measure based on Kullback-Leibler divergence.
Findings
Kullback-Leibler R^2 enables consistent importance measurement in GLMs.
The framework applies to both linear and nonlinear models.
Examples demonstrate practical utility in health and insurance data.
Abstract
Variable importance in regression analyses is of considerable interest in a variety of fields. There is no unique method for assessing variable importance. However, a substantial share of the available literature employs Shapley values, either explicitly or implicitly, to decompose a suitable goodness-of-fit measure, in the linear regression model typically the classical . Beyond linear regression, there is no generally accepted goodness-of-fit measure, only a variety of pseudo-s. We formulate and discuss the desirable properties of goodness-of-fit measures that enable Shapley values to be interpreted in terms of relative, and even absolute, importance. We suggest to use a pseudo- based on the Kullback-Leibler divergence, the Kullback-Leibler , which has a convenient form for generalized linear models and permits to unify and extend previous work on variable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Bayesian Inference · Statistical Methods and Inference
