Knowledge from Uncertainty in Evidential Deep Learning
Cai Davies, Marc Roig Vilamala, Alun D. Preece, Federico Cerutti,, Lance M. Kaplan, Supriyo Chakraborty

TL;DR
This paper investigates the evidential signal in Evidential Deep Learning (EDL), revealing its connection to misclassification bias and how it differs from other Dirichlet-based methods, especially in large language models.
Contribution
It provides empirical and theoretical analysis of EDL's evidential signal, highlighting its origins and differences from related approaches, and discusses the impact of training with out-of-distribution samples.
Findings
EDL's evidential signal correlates with misclassification bias.
Differences in loss functions affect uncertainty coupling.
Training with out-of-distribution samples influences EDL's uncertainty estimates.
Abstract
This work reveals an evidential signal that emerges from the uncertainty value in Evidential Deep Learning (EDL). EDL is one example of a class of uncertainty-aware deep learning approaches designed to provide confidence (or epistemic uncertainty) about the current test sample. In particular for computer vision and bidirectional encoder large language models, the `evidential signal' arising from the Dirichlet strength in EDL can, in some cases, discriminate between classes, which is particularly strong when using large language models. We hypothesise that the KL regularisation term causes EDL to couple aleatoric and epistemic uncertainty. In this paper, we empirically investigate the correlations between misclassification and evaluated uncertainty, and show that EDL's `evidential signal' is due to misclassification bias. We critically evaluate EDL with other Dirichlet-based approaches,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
