Function-Space MCMC for Bayesian Wide Neural Networks
Lucia Pezzetti, Stefano Favaro, Stefano Peluchetti

TL;DR
This paper explores scalable Bayesian neural network sampling methods, demonstrating that the preconditioned Crank-Nicolson algorithm becomes more effective as network width increases, with higher acceptance rates and better mixing properties.
Contribution
It introduces the use of the preconditioned Crank-Nicolson algorithm for wide neural networks and proves its acceptance probability approaches 1 as width grows, showing improved efficiency.
Findings
Acceptance probabilities approach 1 with increasing width
Preconditioned Crank-Nicolson outperforms other samplers in efficiency
Higher effective sample size in wide network configurations
Abstract
Bayesian Neural Networks represent a fascinating confluence of deep learning and probabilistic reasoning, offering a compelling framework for understanding uncertainty in complex predictive models. In this paper, we investigate the use of the preconditioned Crank-Nicolson algorithm and its Langevin version to sample from a reparametrised posterior distribution of the neural network's weights, as the widths grow larger. In addition to being robust in the infinite-dimensional setting, we prove that the acceptance probabilities of the proposed algorithms approach 1 as the width of the network increases, independently of any stepsize tuning. Moreover, we examine and compare how the mixing speeds of the underdamped Langevin Monte Carlo, the preconditioned Crank-Nicolson and the preconditioned Crank-Nicolson Langevin samplers are influenced by changes in the network width in some real-world…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNuclear Physics and Applications · Model Reduction and Neural Networks · Neural Networks and Applications
