A Generalized Variable Importance Metric and Estimator for Black Box   Machine Learning Models

Mohammad Kaviul Anam Khan; Olli Saarela; Rafal Kustra

arXiv:2212.09931·stat.CO·December 27, 2023·1 cites

A Generalized Variable Importance Metric and Estimator for Black Box Machine Learning Models

Mohammad Kaviul Anam Khan, Olli Saarela, Rafal Kustra

PDF

Open Access

TL;DR

This paper introduces a new population parameter called GVIM for measuring predictor importance in black box models, providing a causal interpretation and demonstrating its effectiveness through extensive simulations.

Contribution

It defines GVIM for black box models, linking it to CATE for causal interpretation, and validates its estimator with complex simulation studies.

Findings

01

GVIM can be expressed as a function of CATE.

02

The proposed estimator performs well in complex simulation scenarios.

03

GVIM offers a causal perspective on variable importance in black box models.

Abstract

In this paper we define a population parameter, ``Generalized Variable Importance Metric (GVIM)'', to measure importance of predictors for black box machine learning methods, where the importance is not represented by model-based parameter. GVIM is defined for each input variable, using the true conditional expectation function, and it measures the variable's importance in affecting a continuous or a binary response. We extend previously published results to show that the defined GVIM can be represented as a function of the Conditional Average Treatment Effect (CATE) for any kind of a predictor, which gives it a causal interpretation and further justification as an alternative to classical measures of significance that are only available in simple parametric models. Extensive set of simulations using realistically complex relationships between covariates and outcomes and number of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Statistical Methods and Inference · Bayesian Modeling and Causal Inference