What Makes for a Good Saliency Map? Comparing Strategies for Evaluating Saliency Maps in Explainable AI (XAI)
Felix Kares, Timo Speith, Hanwei Zhang, Markus Langer

TL;DR
This study compares different saliency map methods for neural network explanations across subjective, objective, and mathematical evaluation metrics, revealing inconsistencies and insights into their effectiveness and correlation.
Contribution
First comprehensive comparison of saliency map evaluation methods across subjective, objective, and mathematical metrics in XAI.
Findings
Saliency maps do not agree across different evaluation methods.
Grad-CAM improves user understanding most effectively.
Mathematical metrics sometimes correlate with user understanding, often counterintuitively.
Abstract
Saliency maps are a popular approach for explaining classifications of (convolutional) neural networks. However, it remains an open question as to how best to evaluate salience maps, with three families of evaluation methods commonly being used: subjective user measures, objective user measures, and mathematical metrics. We examine three of the most popular saliency map approaches (viz., LIME, Grad-CAM, and Guided Backpropagation) in a between subject study (N=166) across these families of evaluation methods. We test 1) for subjective measures, if the maps differ with respect to user trust and satisfaction; 2) for objective measures, if the maps increase users' abilities and thus understanding of a model; 3) for mathematical metrics, which map achieves the best ratings across metrics; and 4) whether the mathematical metrics can be associated with objective user measures. To our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
MethodsLocal Interpretable Model-Agnostic Explanations
