How can we trust opaque systems? Criteria for robust explanations in XAI
Florian J. Boge, Annika Schuster

TL;DR
This paper emphasizes the importance of robustness in explanations for trustworthy AI, proposing criteria for explanation and method robustness to ensure reliable interpretability of deep learning models.
Contribution
It introduces formal criteria for explanation and method robustness in XAI, providing a framework to improve trustworthiness of explanations in deep learning.
Findings
Explanatory robustness (ER) ensures consistent explanations across methods.
Method robustness (EMR) is a prerequisite for trustworthy explanations.
Robustness of individual methods alone is insufficient for trust.
Abstract
Deep learning (DL) algorithms are becoming ubiquitous in everyday life and in scientific research. However, the price we pay for their impressively accurate predictions is significant: their inner workings are notoriously opaque - it is unknown to laypeople and researchers alike what features of the data a DL system focuses on and how it ultimately succeeds in predicting correct outputs. A necessary criterion for trustworthy explanations is that they should reflect the relevant processes the algorithms' predictions are based on. The field of eXplainable Artificial Intelligence (XAI) presents promising methods to create such explanations. But recent reviews about their performance offer reasons for skepticism. As we will argue, a good criterion for trustworthiness is explanatory robustness: different XAI methods produce the same explanations in comparable contexts. However, in some…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management
