VOICE: Variance of Induced Contrastive Explanations to quantify Uncertainty in Neural Network Interpretability
Mohit Prabhushankar, Ghassan AlRegib

TL;DR
This paper introduces VOICE, a method to visualize and quantify the predictive uncertainty of gradient-based explanations for neural networks, revealing insights into explanation trustworthiness and uncertainty behavior.
Contribution
It provides a theoretical analysis of existing evaluation strategies and proposes a plug-in approach to measure residual uncertainty in explanations.
Findings
Uncertainty varies across images, networks, and explanations.
Incorrect predictions often have high explanation uncertainty.
Uncertainty metrics align with epistemic uncertainty.
Abstract
In this paper, we visualize and quantify the predictive uncertainty of gradient-based post hoc visual explanations for neural networks. Predictive uncertainty refers to the variability in the network predictions under perturbations to the input. Visual post hoc explainability techniques highlight features within an image to justify a network's prediction. We theoretically show that existing evaluation strategies of visual explanatory techniques partially reduce the predictive uncertainty of neural networks. This analysis allows us to construct a plug in approach to visualize and quantify the remaining predictive uncertainty of any gradient-based explanatory technique. We show that every image, network, prediction, and explanatory technique has a unique uncertainty. The proposed uncertainty visualization and quantification yields two key observations. Firstly, oftentimes under incorrect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Adversarial Robustness in Machine Learning
MethodsHigh-Order Consensuses
