Attribution of Predictive Uncertainties in Classification Models

Iker Perez; Piotr Skalski; Alec Barns-Graham; Jason Wong; David Sutton

arXiv:2107.08756·cs.LG·November 10, 2022

TL;DR

This paper introduces a new framework for attributing predictive uncertainties in classification models, combining path integrals, counterfactual explanations, and generative models to improve interpretability and reduce artefacts.

Contribution

The paper proposes an attribution method for predictive uncertainty that integrates path integrals, counterfactual explanations, and generative models, and reports that it outperforms existing approaches.

Findings

01 Outperforms existing attribution methods in quantitative benchmarks

02 Produces attributions with fewer artefacts and less noise

03 Effective across various datasets and complexity levels

Abstract

Predictive uncertainties in classification tasks are often a consequence of model inadequacy or insufficient training data. In popular applications, such as image processing, we are often required to scrutinise these uncertainties by meaningfully attributing them to input features. This helps to improve interpretability assessments. However, there exist few effective frameworks for this purpose. Vanilla forms of popular methods for the provision of saliency masks, such as SHAP or integrated gradients, adapt poorly to target measures of uncertainty. Thus, state-of-the-art tools instead proceed by creating counterfactual or adversarial feature vectors, and assign attributions by direct comparison to original images. In this paper, we present a novel framework that combines path integrals, counterfactual explanations and generative models, in order to procure attributions that contain few…
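To make the "vanilla" baseline mentioned in the abstract concrete, the sketch below applies plain integrated gradients to predictive entropy. It is not the paper's framework: it uses a straight-line path in input space rather than a path built from a generative model and a counterfactual. The linear softmax classifier, the all-zeros baseline, and the names `entropy`, `entropy_grad` and `integrated_gradients` are assumptions made for illustration only.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over a 1-D logit vector.
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def entropy(x, W, b):
    # Predictive entropy of a hypothetical linear softmax classifier.
    p = softmax(W @ x + b)
    return float(-(p * np.log(p + 1e-12)).sum())

def entropy_grad(x, W, b):
    # Analytic gradient of the entropy with respect to the input x.
    # For logits z: dH/dz_k = -p_k * (log p_k + H), then chain rule through W.
    p = softmax(W @ x + b)
    logp = np.log(p + 1e-12)
    H = -(p * logp).sum()
    return W.T @ (-p * (logp + H))

def integrated_gradients(x, baseline, W, b, steps=200):
    # Midpoint-rule approximation of the path integral along the
    # straight line from `baseline` to `x` (an assumed, vanilla path).
    alphas = (np.arange(steps) + 0.5) / steps
    grads = np.stack([
        entropy_grad(baseline + a * (x - baseline), W, b) for a in alphas
    ])
    return (x - baseline) * grads.mean(axis=0)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W, b = rng.normal(size=(3, 4)), rng.normal(size=3)  # toy model weights
    x, baseline = rng.normal(size=4), np.zeros(4)       # toy input and baseline
    attr = integrated_gradients(x, baseline, W, b)
    # Completeness: attributions sum to the change in entropy along the path.
    print(attr.sum(), entropy(x, W, b) - entropy(baseline, W, b))
```

The attributions sum (up to discretisation error) to the difference in entropy between the input and the baseline, so the choice of baseline and path determines what is being explained. The abstract's proposal to combine path integrals with counterfactual explanations and generative models addresses exactly that choice.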

Code & Models

Repositories

Featurespace/uncertainty-attribution · tf · Official

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Cell Image Analysis Techniques

Methods: Shapley Additive Explanations