Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks
Stefan Bl\"ucher, Johanna Vielhaben, Nils Strodthoff

TL;DR
This paper introduces a new framework for evaluating occlusion-based XAI methods by measuring their reliability with the R-OMS score and combining influence measures into the SRG metric, ensuring consistent rankings across strategies.
Contribution
It proposes the R-OMS score for assessing occlusion strategy reliability and the SRG measure to unify influence rankings, resolving inconsistencies in pixel flipping benchmarks.
Findings
R-OMS score effectively compares occlusion strategies.
SRG measure provides consistent feature importance rankings.
Validated on 40 different occlusion strategies.
Abstract
Feature removal is a central building block for eXplainable AI (XAI), both for occlusion-based explanations (Shapley values) as well as their evaluation (pixel flipping, PF). However, occlusion strategies can vary significantly from simple mean replacement up to inpainting with state-of-the-art diffusion models. This ambiguity limits the usefulness of occlusion-based approaches. For example, PF benchmarks lead to contradicting rankings. This is amplified by competing PF measures: Features are either removed starting with most influential first (MIF) or least influential first (LIF). This study proposes two complementary perspectives to resolve this disagreement problem. Firstly, we address the common criticism of occlusion-based XAI, that artificial samples lead to unreliable model evaluations. We propose to measure the reliability by the R(eference)-Out-of-Model-Scope (OMS) score. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Radiomics and Machine Learning in Medical Imaging
MethodsSparse Evolutionary Training · Inpainting · Diffusion
