Do Bayesian Variational Autoencoders Know What They Don't Know?
Misha Glazunov, Apostolis Zarras

TL;DR
This paper explores Bayesian inference methods applied to Variational Autoencoders to improve Out-of-Distribution detection, addressing over-confidence issues in deep generative models through empirical evaluation of several Bayesian approaches.
Contribution
It compares three Bayesian inference techniques for VAEs and introduces two simple scores that achieve state-of-the-art OoD detection performance.
Findings
Bayesian approaches reduce over-confidence in OoD detection.
Stochastic Weight Averaging-Gaussian performs best among tested methods.
Proposed scores outperform existing benchmarks in OoD detection.
Abstract
The problem of detecting the Out-of-Distribution (OoD) inputs is of paramount importance for Deep Neural Networks. It has been previously shown that even Deep Generative Models that allow estimating the density of the inputs may not be reliable and often tend to make over-confident predictions for OoDs, assigning to them a higher density than to the in-distribution data. This over-confidence in a single model can be potentially mitigated with Bayesian inference over the model parameters that take into account epistemic uncertainty. This paper investigates three approaches to Bayesian inference: stochastic gradient Markov chain Monte Carlo, Bayes by Backpropagation, and Stochastic Weight Averaging-Gaussian. The inference is implemented over the weights of the deep neural networks that parameterize the likelihood of the Variational Autoencoder. We empirically evaluate the approaches…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Gaussian Processes and Bayesian Inference
