Functional Stochastic Gradient MCMC for Bayesian Neural Networks
Mengjing Wu, Junyu Xuan, Jie Lu

TL;DR
This paper introduces novel functional stochastic gradient MCMC methods for Bayesian neural networks, effectively addressing prior issues and improving inference quality by operating directly in function space.
Contribution
It proposes new functional MCMC schemes with diffusion dynamics that incorporate functional priors, extending stochastic gradient MCMC to function space.
Findings
Improved predictive accuracy over parameter-space MCMC.
Enhanced uncertainty quantification in Bayesian neural networks.
Theoretical proof of the stationary measure being the true posterior over functions.
Abstract
Classical parameter-space Bayesian inference for Bayesian neural networks (BNNs) suffers from several unresolved prior issues, such as knowledge encoding intractability and pathological behaviours in deep networks, which can lead to improper posterior inference. To address these issues, functional Bayesian inference has recently been proposed leveraging functional priors, such as the emerging functional variational inference. In addition to variational methods, stochastic gradient Markov Chain Monte Carlo (MCMC) is another scalable and effective inference method for BNNs to asymptotically generate samples from the true posterior by simulating continuous dynamics. However, existing MCMC methods perform solely in parameter space and inherit the unresolved prior issues, while extending these dynamics to function space is a non-trivial undertaking. In this paper, we introduce novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Neural Networks and Applications · Nuclear Physics and Applications
MethodsDiffusion · Variational Inference
