Statistical Significance of Feature Importance Rankings

Jeremy Goldwasser; Giles Hooker

arXiv:2401.15800 · stat.ML · July 8, 2025 · 2 cites



TL;DR

This paper introduces statistically rigorous methods to verify and identify the most important features in machine learning models, ensuring high-probability correctness despite sampling variability.

Contribution

It proposes hypothesis testing-based techniques and sampling algorithms to reliably determine feature importance rankings with theoretical guarantees.

Findings

01. Algorithms accurately identify top features with high probability
02. Methods validated empirically on SHAP and LIME importance scores
03. Provides stability assessment for feature importance rankings

Abstract

Feature importance scores are ubiquitous tools for understanding the predictions of machine learning models. However, many popular attribution methods suffer from high instability due to random sampling. Leveraging novel ideas from hypothesis testing, we devise techniques that ensure the most important features are correct with high-probability guarantees. These assess the set of $K$ top-ranked features, as well as the order of its elements. Given a set of local or global importance scores, we demonstrate how to retrospectively verify the stability of the highest ranks. We then introduce two efficient sampling algorithms that identify the $K$ most important features, perhaps in order, with probability exceeding $1 - \alpha$. The theoretical justification for these procedures is validated empirically on SHAP and LIME.
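To make the idea of retrospectively verifying the top $K$ features concrete, here is a minimal sketch. It is a simplified illustration and not the authors' procedure: it assumes each feature comes with independent, repeated noisy importance estimates, that larger scores mean more important, and that a large-sample normal approximation with a Bonferroni correction is adequate. The function name `top_k_is_stable` and the feature names are hypothetical.

```python
import math
import random
from statistics import NormalDist, mean, variance


def top_k_is_stable(samples, k, alpha=0.05):
    """Check whether the observed top-k set is statistically separated.

    samples: dict mapping feature name -> list of independent noisy
             importance estimates (assumed setup, at least 2 per feature).
    Returns (top_k_features, stable_flag).
    """
    means = {f: mean(v) for f, v in samples.items()}
    ranked = sorted(means, key=means.get, reverse=True)
    top, rest = ranked[:k], ranked[k:]

    n_tests = len(top) * len(rest)
    if n_tests == 0:
        return top, True
    threshold = alpha / n_tests  # Bonferroni correction over all pairs

    for i in top:
        for j in rest:
            # Standard error of the difference in sample means.
            se = math.sqrt(
                variance(samples[i]) / len(samples[i])
                + variance(samples[j]) / len(samples[j])
            )
            if se == 0.0:
                p = 0.0 if means[i] > means[j] else 1.0
            else:
                # One-sided z-test of H0: feature i is not more important than j.
                z = (means[i] - means[j]) / se
                p = 1.0 - NormalDist().cdf(z)
            if p > threshold:
                return top, False  # at least one pair is not separated
    return top, True


# Usage with simulated scores (made-up numbers, for illustration only).
rng = random.Random(0)
true_scores = {"age": 3.0, "income": 2.0, "zip": 0.1, "height": 0.0}
samples = {f: [rng.gauss(mu, 1.0) for _ in range(200)]
           for f, mu in true_scores.items()}
top, stable = top_k_is_stable(samples, k=2)
print(top, stable)
```

If every top-$K$ feature beats every remaining feature at the corrected level, the observed top-$K$ set is declared stable at level $\alpha$. Otherwise, more samples would be needed before the ranking can be trusted.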



Taxonomy

Topics: Machine Learning and Data Classification · Anomaly Detection Techniques and Applications · Advanced Image and Video Retrieval Techniques

Methods: Shapley Additive Explanations · Local Interpretable Model-Agnostic Explanations