Sanity Checks for Saliency Metrics
Richard Tomsett, Dan Harborne, Supriyo Chakraborty, Prudhvi Gurram,, Alun Preece

TL;DR
This paper critically examines the reliability and consistency of metrics used to evaluate saliency maps in image classification, revealing significant inconsistencies and unreliability that question their trustworthiness.
Contribution
It highlights the lack of standardization in saliency metric calculations and applies psychometric reliability measures to assess their consistency.
Findings
Saliency metrics are often inconsistent across studies.
Saliency metrics can be statistically unreliable.
Comparative evaluations of saliency methods may be untrustworthy.
Abstract
Saliency maps are a popular approach to creating post-hoc explanations of image classifier outputs. These methods produce estimates of the relevance of each pixel to the classification output score, which can be displayed as a saliency map that highlights important pixels. Despite a proliferation of such methods, little effort has been made to quantify how good these saliency maps are at capturing the true relevance of the pixels to the classifier output (i.e. their "fidelity"). We therefore investigate existing metrics for evaluating the fidelity of saliency methods (i.e. saliency metrics). We find that there is little consistency in the literature in how such metrics are calculated, and show that such inconsistencies can have a significant effect on the measured fidelity. Further, we apply measures of reliability developed in the psychometric testing literature to assess the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
