Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?
Maohao Shen, J. Jon Ryu, Soumya Ghosh, Yuheng Bu, Prasanna Sattigeri,, Subhro Das, Gregory W. Wornell

TL;DR
This paper critically examines evidential deep learning (EDL), revealing that despite empirical success, EDL's uncertainty estimates are unreliable and better interpreted as out-of-distribution detectors, questioning its core effectiveness.
Contribution
The paper provides a unified analysis of EDL's asymptotic behavior, interprets EDL as an out-of-distribution detection method, and conducts extensive empirical evaluations.
Findings
EDL's epistemic uncertainties do not vanish with more data.
EDL functions can be viewed as energy-based out-of-distribution detectors.
Empirical effectiveness of EDL occurs despite poor uncertainty quantification.
Abstract
This paper questions the effectiveness of a modern predictive uncertainty quantification approach, called \emph{evidential deep learning} (EDL), in which a single neural network model is trained to learn a meta distribution over the predictive distribution by minimizing a specific objective function. Despite their perceived strong empirical performance on downstream tasks, a line of recent studies by Bengs et al. identify limitations of the existing methods to conclude their learned epistemic uncertainties are unreliable, e.g., in that they are non-vanishing even with infinite data. Building on and sharpening such analysis, we 1) provide a sharper understanding of the asymptotic behavior of a wide class of EDL methods by unifying various objective functions; 2) reveal that the EDL methods can be better interpreted as an out-of-distribution detection algorithm based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsText and Document Classification Technologies · Face and Expression Recognition · Generative Adversarial Networks and Image Synthesis
