Functional Stochastic Gradient MCMC for Bayesian Neural Networks

Mengjing Wu; Junyu Xuan; Jie Lu

arXiv:2409.16632·cs.LG·October 11, 2024

Functional Stochastic Gradient MCMC for Bayesian Neural Networks

Mengjing Wu, Junyu Xuan, Jie Lu

PDF

Open Access

TL;DR

This paper introduces novel functional stochastic gradient MCMC methods for Bayesian neural networks, effectively addressing prior issues and improving inference quality by operating directly in function space.

Contribution

It proposes new functional MCMC schemes with diffusion dynamics that incorporate functional priors, extending stochastic gradient MCMC to function space.

Findings

01

Improved predictive accuracy over parameter-space MCMC.

02

Enhanced uncertainty quantification in Bayesian neural networks.

03

Theoretical proof of the stationary measure being the true posterior over functions.

Abstract

Classical parameter-space Bayesian inference for Bayesian neural networks (BNNs) suffers from several unresolved prior issues, such as knowledge encoding intractability and pathological behaviours in deep networks, which can lead to improper posterior inference. To address these issues, functional Bayesian inference has recently been proposed leveraging functional priors, such as the emerging functional variational inference. In addition to variational methods, stochastic gradient Markov Chain Monte Carlo (MCMC) is another scalable and effective inference method for BNNs to asymptotically generate samples from the true posterior by simulating continuous dynamics. However, existing MCMC methods perform solely in parameter space and inherit the unresolved prior issues, while extending these dynamics to function space is a non-trivial undertaking. In this paper, we introduce novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Neural Networks and Applications · Nuclear Physics and Applications

MethodsDiffusion · Variational Inference