A Guide to Feature Importance Methods for Scientific Inference

Fiona Katharina Ewald; Ludwig Bothmann; Marvin N. Wright; Bernd; Bischl; Giuseppe Casalicchio; Gunnar K\"onig

arXiv:2404.12862·stat.ML·August 30, 2024·6 cites

A Guide to Feature Importance Methods for Scientific Inference

Fiona Katharina Ewald, Ludwig Bothmann, Marvin N. Wright, Bernd, Bischl, Giuseppe Casalicchio, Gunnar K\"onig

PDF

Open Access 1 Repo

TL;DR

This paper provides a comprehensive review and interpretation guide for feature importance methods in machine learning, aiding scientific inference by clarifying their use, limitations, and future research directions.

Contribution

It offers an extensive review, new proofs, and concrete recommendations for interpreting feature importance methods in scientific research.

Findings

01

Clarifies different interpretations of FI methods

02

Provides new proofs for FI interpretation

03

Recommends best practices for scientific inference

Abstract

While machine learning (ML) models are increasingly used due to their high predictive power, their use in understanding the data-generating process (DGP) is limited. Understanding the DGP requires insights into feature-target associations, which many ML models cannot directly provide due to their opaque internal mechanisms. Feature importance (FI) methods provide useful insights into the DGP under certain conditions. Since the results of different FI methods have different interpretations, selecting the correct FI method for a concrete use case is crucial and still requires expert knowledge. This paper serves as a comprehensive guide to help understand the different interpretations of global FI methods. Through an extensive review of FI methods and providing new proofs regarding their interpretation, we facilitate a thorough understanding of these methods and formulate concrete…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

slds-lmu/paper_2024_guide_fi
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Data Analysis with R