Uncertainty Quantification as a Principled Foundation for Explainable Artificial Intelligence: A Case Study of Counterfactual Explanations
Kacper Sokol, Santo M.A.R. Thies, Eyke H\"ullermeier

TL;DR
This paper proposes a novel uncertainty-based framework for counterfactual explanations in AI, demonstrating its effectiveness and advocating for foundational AI concepts in transparency research.
Contribution
It introduces a principled uncertainty quantification approach to counterfactual explainability, unifying core properties and showing competitive performance with simple design.
Findings
Uncertainty-based explainer achieves state-of-the-art performance.
Framework is simple yet highly effective.
Integrating AI fundamentals enhances transparency and robustness.
Abstract
In this paper we argue that, to its detriment, transparency research overlooks many foundational concepts of artificial intelligence. As an illustrating example we focus on uncertainty quantification in the context of counterfactual explainability, demonstrating that its broader adoption could address key challenges in the field. To this end, we show how uncertainty can provide a principled unifying framework for counterfactual explainability by expressing the core counterfactual properties in terms of uncertainty, allowing us to build two variants of an explainer upon them -- one based solely on uncertainty estimates and another pairing them with distance measured in the feature space. Our comprehensive experiments illustrate highly competitive performance of our framework when compared to many state-of-the-art methods despite its radically simple design. More broadly, the paper…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
MethodsCounterfactuals Explanations · Focus
