Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian neural networks
Tristan Cinquin, Robert Bamler

TL;DR
This paper introduces a novel regularized KL divergence approach for function-space variational inference in Bayesian neural networks, ensuring well-defined objectives and improved uncertainty estimation.
Contribution
It proposes the first well-defined variational objective for function-space inference in BNNs with GP priors, addressing fundamental issues in prior work.
Findings
Method effectively incorporates GP prior properties
Provides competitive uncertainty estimates in various tasks
Ensures a well-defined variational objective for BNNs
Abstract
Bayesian neural networks (BNN) promise to combine the predictive performance of neural networks with principled uncertainty modeling important for safety-critical systems and decision making. However, posterior uncertainty estimates depend on the choice of prior, and finding informative priors in weight-space has proven difficult. This has motivated variational inference (VI) methods that pose priors directly on the function generated by the BNN rather than on weights. In this paper, we address a fundamental issue with such function-space VI approaches pointed out by Burt et al. (2020), who showed that the objective function (ELBO) is negative infinite for most priors of interest. Our solution builds on generalized VI (Knoblauch et al., 2019) with the regularized KL divergence (Quang, 2019) and is, to the best of our knowledge, the first well-defined variational objective for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Fault Detection and Control Systems
MethodsGaussian Process · Variational Inference
