Attribution of Predictive Uncertainties in Classification Models
Iker Perez, Piotr Skalski, Alec Barns-Graham, Jason Wong, David Sutton

TL;DR
This paper introduces a new framework for attributing predictive uncertainties in classification models, combining path integrals, counterfactual explanations, and generative models to improve interpretability and reduce artefacts.
Contribution
It proposes a novel attribution method that outperforms existing approaches by integrating path integrals, counterfactual explanations, and generative models for better uncertainty interpretation.
Findings
Outperforms existing attribution methods in quantitative benchmarks
Produces attributions with fewer artefacts and noise
Effective across various datasets and complexity levels
Abstract
Predictive uncertainties in classification tasks are often a consequence of model inadequacy or insufficient training data. In popular applications, such as image processing, we are often required to scrutinise these uncertainties by meaningfully attributing them to input features. This helps to improve interpretability assessments. However, there exist few effective frameworks for this purpose. Vanilla forms of popular methods for the provision of saliency masks, such as SHAP or integrated gradients, adapt poorly to target measures of uncertainty. Thus, state-of-the-art tools instead proceed by creating counterfactual or adversarial feature vectors, and assign attributions by direct comparison to original images. In this paper, we present a novel framework that combines path integrals, counterfactual explanations and generative models, in order to procure attributions that contain few…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Cell Image Analysis Techniques
MethodsShapley Additive Explanations
