Greedy Bayesian Posterior Approximation with Deep Ensembles
Aleksei Tiulpin, Matthew B. Blaschko

TL;DR
This paper introduces a new principled method for constructing neural network ensembles that better approximate the true posterior distribution, improving uncertainty estimation in deep learning, especially for out-of-distribution detection.
Contribution
It proposes a novel greedy ensemble construction method based on minimizing an $f$-divergence, with a new diversity term derived from a combinatorial analysis, enhancing posterior approximation.
Findings
Improved out-of-distribution detection performance.
The method is effective across various architectures and datasets.
Source code is publicly available for reproducibility.
Abstract
Ensembles of independently trained neural networks are a state-of-the-art approach to estimate predictive uncertainty in Deep Learning, and can be interpreted as an approximation of the posterior distribution via a mixture of delta functions. The training of ensembles relies on non-convexity of the loss landscape and random initialization of their individual members, making the resulting posterior approximation uncontrolled. This paper proposes a novel and principled method to tackle this limitation, minimizing an -divergence between the true posterior and a kernel density estimator (KDE) in a function space. We analyze this objective from a combinatorial point of view, and show that it is submodular with respect to mixture components for any . Subsequently, we consider the problem of greedy ensemble construction. From the marginal gain on the negative -divergence, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsDeep Ensembles
