Unrestricted Permutation forces Extrapolation: Variable Importance Requires at least One More Model, or There Is No Free Variable Importance

Giles Hooker; Lucas Mentch; Siyu Zhou

arXiv:1905.03151 · stat.ME · October 11, 2021 · 38 cites

TL;DR

This paper critically reviews permutation-based variable importance methods, highlighting their limitations due to feature dependence, and advocates for alternative approaches involving additional modeling to improve interpretability.

Contribution

It provides a comprehensive critique of permute-and-predict methods and proposes using performance-based measures with supplementary models as a more reliable alternative.

Findings

01. Permutation methods can overemphasize correlated features.

02. Breaking feature dependencies leads to misleading interpretations.

03. Alternative performance-based measures are more robust.

Abstract

This paper reviews and advocates against the use of permute-and-predict (PaP) methods for interpreting black box functions. Methods such as the variable importance measures proposed for random forests, partial dependence plots, and individual conditional expectation plots remain popular because they are both model-agnostic and depend only on the pre-trained model output, making them computationally efficient and widely available in software. However, numerous studies have found that these tools can produce diagnostics that are highly misleading, particularly when there is strong dependence among features. The purpose of our work here is to (i) review this growing body of literature, (ii) provide further demonstrations of these drawbacks along with a detailed explanation as to why they occur, and (iii) advocate for alternative measures that involve additional modeling. In particular, we…
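The extrapolation problem named in the title can be illustrated with a minimal sketch. It is not taken from the paper or its repository: the helper names, the noise level of 0.05, the sample size, and the seeds are all assumptions chosen for illustration. Two features are generated so that they are strongly dependent, one of them is permuted independently of the other, and the code counts how many permuted rows land farther from the diagonal than any original row did.

```python
import random


def make_correlated_features(n, noise=0.05, seed=0):
    """Hypothetical data generator: x2 tracks x1 closely (strong dependence)."""
    rng = random.Random(seed)
    x1 = [rng.uniform(0.0, 1.0) for _ in range(n)]
    x2 = [a + rng.gauss(0.0, noise) for a in x1]
    return x1, x2


def permute_column(column, seed=1):
    """Return a shuffled copy of one feature column, as an unrestricted
    permutation step would, ignoring its dependence on other features."""
    rng = random.Random(seed)
    shuffled = list(column)
    rng.shuffle(shuffled)
    return shuffled


def fraction_outside_support(x1, x2, max_gap):
    """Share of rows whose |x1 - x2| exceeds the largest gap in the original data."""
    outside = sum(1 for a, b in zip(x1, x2) if abs(a - b) > max_gap)
    return outside / len(x1)


x1, x2 = make_correlated_features(n=1000)

# Largest |x1 - x2| seen in the original data: a crude proxy for the
# region in which a fitted model would have had any training points.
max_gap = max(abs(a - b) for a, b in zip(x1, x2))

x2_permuted = permute_column(x2)

frac_original = fraction_outside_support(x1, x2, max_gap)
frac_permuted = fraction_outside_support(x1, x2_permuted, max_gap)

print(f"rows outside observed region, original data: {frac_original:.2f}")
print(f"rows outside observed region, after permuting x2: {frac_permuted:.2f}")
```

Under these assumptions no original row falls outside the observed region by construction, while a large share of the permuted rows does. Any model evaluated on the permuted rows is therefore being queried where it saw no data, which is the situation the abstract describes as misleading when features are strongly dependent.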

Code & Models

Repositories

antonFJohansson/Please-Stop-Permuting-Features

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Statistical Methods and Inference · Statistical Methods in Clinical Trials