Scrutinizing XAI using linear ground-truth data with suppressor variables
Rick Wilming, C\'eline Budding, Klaus-Robert M\"uller, Stefan Haufe

TL;DR
This paper critically evaluates explainable AI methods using a linear ground-truth dataset to identify their ability to correctly recognize important features and suppressor variables, revealing many methods' limitations.
Contribution
It introduces a linear ground-truth dataset as a benchmark to assess XAI methods' capacity to distinguish true important features from suppressors.
Findings
Most explanation methods fail to identify suppressor variables correctly.
Many popular XAI techniques cannot reliably differentiate between important features and suppressors.
The study highlights the need for improved validation of explanation methods.
Abstract
Machine learning (ML) is increasingly often used to inform high-stakes decisions. As complex ML models (e.g., deep neural networks) are often considered black boxes, a wealth of procedures has been developed to shed light on their inner workings and the ways in which their predictions come about, defining the field of 'explainable AI' (XAI). Saliency methods rank input features according to some measure of 'importance'. Such methods are difficult to validate since a formal definition of feature importance is, thus far, lacking. It has been demonstrated that some saliency methods can highlight features that have no statistical association with the prediction target (suppressor variables). To avoid misinterpretations due to such behavior, we propose the actual presence of such an association as a necessary condition and objective preliminary definition for feature importance. We carefully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Bayesian Modeling and Causal Inference
MethodsShapley Additive Explanations · Local Interpretable Model-Agnostic Explanations
