Towards E-Value Based Stopping Rules for Bayesian Deep Ensembles
Emanuel Sommer, Rickmer Schulte, Sarah Deubner, Julius Kobialka, David R\"ugamer

TL;DR
This paper introduces an E-value based stopping rule for Bayesian Deep Ensembles, enabling efficient early termination of sampling by statistically assessing improvements over the baseline.
Contribution
It formulates a sequential hypothesis test using E-values to determine when to stop sampling in Bayesian Deep Ensembles, reducing computational costs.
Findings
The proposed stopping rule effectively identifies when additional sampling yields negligible improvements.
Empirical results show that only a fraction of the full sampling budget is often needed.
The method provides a principled, statistically sound approach to early stopping in Bayesian neural network sampling.
Abstract
Bayesian Deep Ensembles (BDEs) represent a powerful approach for uncertainty quantification in deep learning, combining the robustness of Deep Ensembles (DEs) with flexible multi-chain MCMC. While DEs are affordable in most deep learning settings, (long) sampling of Bayesian neural networks can be prohibitively costly. Yet, adding sampling after optimizing the DEs has been shown to yield significant improvements. This leaves a critical practical question: How long should the sequential sampling process continue to yield significant improvements over the initial optimized DE baseline? To tackle this question, we propose a stopping rule based on E-values. We formulate the ensemble construction as a sequential anytime-valid hypothesis test, providing a principled way to decide whether or not to reject the null hypothesis that MCMC offers no improvement over a strong baseline, to early stop…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
