Evaluating the overall sensitivity of saliency-based explanation methods

Harshinee Sriram; Cristina Conati

arXiv:2306.13682·cs.LG·June 27, 2023

Evaluating the overall sensitivity of saliency-based explanation methods

Harshinee Sriram, Cristina Conati

PDF

Open Access

TL;DR

This paper proposes a formal, model-agnostic methodology to evaluate the sensitivity of saliency-based explanation methods for deep learning models, aiming to improve faithfulness assessments across domains.

Contribution

It extends an existing sensitivity test with formal thresholds and criteria, enabling comprehensive comparison of explanation methods for CNNs and beyond.

Findings

01

Extended the sensitivity test with formal thresholds

02

Compared multiple explanation methods for CNNs

03

Discussed the relationship between sensitivity and faithfulness

Abstract

We address the need to generate faithful explanations of "black box" Deep Learning models. Several tests have been proposed to determine aspects of faithfulness of explanation methods, but they lack cross-domain applicability and a rigorous methodology. Hence, we select an existing test that is model agnostic and is well-suited for comparing one aspect of faithfulness (i.e., sensitivity) of multiple explanation methods, and extend it by specifying formal thresh-olds and building criteria to determine the over-all sensitivity of the explanation method. We present examples of how multiple explanation methods for Convolutional Neural Networks can be compared using this extended methodology. Finally, we discuss the relationship between sensitivity and faithfulness and consider how the test can be adapted to assess different explanation methods in other domains.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Materials Science