Bayesian Lottery Ticket Hypothesis
Nicholas Kuhn, Arvid Weyrauch, Lars Heyen, Achim Streit, Markus G\"otz, Charlotte Debus

TL;DR
This paper investigates the existence and properties of sparse subnetworks, called Bayesian lottery tickets, within Bayesian neural networks, showing they can match or surpass dense network performance with implications for efficient training.
Contribution
It extends the Lottery Ticket Hypothesis to Bayesian neural networks, analyzing characteristics of sparse subnetworks and proposing a transplantation method linking BNNs with deterministic lottery tickets.
Findings
LTH holds in BNNs with sparse subnetworks.
Winning tickets are consistent across model sizes.
Pruning should prioritize magnitude and standard deviation.
Abstract
Bayesian neural networks (BNNs) are a useful tool for uncertainty quantification, but require substantially more computational resources than conventional neural networks. For non-Bayesian networks, the Lottery Ticket Hypothesis (LTH) posits the existence of sparse subnetworks that can train to the same or even surpassing accuracy as the original dense network. Such sparse networks can lower the demand for computational resources at inference, and during training. The existence of the LTH and corresponding sparse subnetworks in BNNs could motivate the development of sparse training algorithms and provide valuable insights into the underlying training process. Towards this end, we translate the LTH experiments to a Bayesian setting using common computer vision models. We investigate the defining characteristics of Bayesian lottery tickets, and extend our study towards a transplantation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
