Not Just How Much, But Where: Decomposing Epistemic Uncertainty into Per-Class Contributions
Mame Diarra Toure, David A. Stephens

TL;DR
This paper introduces a method to decompose Bayesian epistemic uncertainty into per-class contributions, enabling more nuanced understanding of model ignorance, especially in safety-critical applications.
Contribution
It proposes a novel per-class decomposition of mutual information, improving interpretability and detection of class-specific uncertainties in Bayesian deep learning.
Findings
Per-class contributions improve selective prediction accuracy.
Decomposition enhances out-of-distribution detection performance.
Less sensitivity to label noise compared to traditional mutual information.
Abstract
In safety-critical classification, the cost of failure is often asymmetric, yet Bayesian deep learning summarises epistemic uncertainty with a single scalar, mutual information (MI), that cannot distinguish whether a model's ignorance involves a benign or safety-critical class. We decompose MI into a per-class vector , with and across posterior samples. The decomposition follows from a second-order Taylor expansion of the entropy; the weighting corrects boundary suppression and makes comparable across rare and common classes. By construction , and a companion skewness diagnostic flags inputs where the approximation degrades. After characterising the axiomatic properties of , we validate it on three tasks: (i) selective prediction for diabetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Domain Adaptation and Few-Shot Learning
