Deep Ensembles Secretly Perform Empirical Bayes
Gabriel Loaiza-Ganem, Valentin Villecroze, Yixin Wang

TL;DR
This paper reveals that deep ensembles perform exact Bayesian averaging with a data-dependent prior, providing a theoretical justification for their empirical success and bridging the gap between ensembles and Bayesian neural networks.
Contribution
It demonstrates that deep ensembles are equivalent to empirical Bayes procedures with a learned data-dependent prior, offering a new theoretical understanding.
Findings
Deep ensembles perform exact Bayesian averaging.
The learned prior is a mixture of point masses.
This perspective explains the strong empirical performance of ensembles.
Abstract
Quantifying uncertainty in neural networks is a highly relevant problem which is essential to many applications. The two predominant paradigms to tackle this task are Bayesian neural networks (BNNs) and deep ensembles. Despite some similarities between these two approaches, they are typically surmised to lack a formal connection and are thus understood as fundamentally different. BNNs are often touted as more principled due to their reliance on the Bayesian paradigm, whereas ensembles are perceived as more ad-hoc; yet, deep ensembles tend to empirically outperform BNNs, with no satisfying explanation as to why this is the case. In this work we bridge this gap by showing that deep ensembles perform exact Bayesian averaging with a posterior obtained with an implicitly learned data-dependent prior. In other words deep ensembles are Bayesian, or more specifically, they implement an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Generative Adversarial Networks and Image Synthesis · Anomaly Detection Techniques and Applications
MethodsDeep Ensembles
