A Rigorous Link between Deep Ensembles and (Variational) Bayesian Methods
Veit David Wild, Sahra Ghalebikesabi, Dino Sejdinovic, Jeremias, Knoblauch

TL;DR
This paper establishes a rigorous mathematical connection between deep ensembles, Bayesian, and variational Bayesian methods, providing a unified framework for uncertainty quantification in deep learning.
Contribution
It reformulates non-convex deep learning optimization as convex in probability measure space and introduces a unified theory using Wasserstein gradient flows.
Findings
Unified theory linking ensemble and Bayesian methods
Proposed new ensembling schemes with convergence guarantees
Proved convergence of interacting deep ensembles to global minimiser
Abstract
We establish the first mathematically rigorous link between Bayesian, variational Bayesian, and ensemble methods. A key step towards this it to reformulate the non-convex optimisation problem typically encountered in deep learning as a convex optimisation in the space of probability measures. On a technical level, our contribution amounts to studying generalised variational inference through the lense of Wasserstein gradient flows. The result is a unified theory of various seemingly disconnected approaches that are commonly used for uncertainty quantification in deep learning -- including deep ensembles and (variational) Bayesian methods. This offers a fresh perspective on the reasons behind the success of deep ensembles over procedures based on parameterised variational inference, and allows the derivation of new ensembling schemes with convergence guarantees. We showcase this by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Gaussian Processes and Bayesian Inference · Model Reduction and Neural Networks
MethodsVariational Inference · Deep Ensembles
