Utilising Gradient-Based Proposals Within Sequential Monte Carlo Samplers for Training of Partial Bayesian Neural Networks

Andrew Millard; Joshua Murphy; Simon Maskell; Zheng Zhao

arXiv:2505.03797·cs.LG·January 30, 2026

Utilising Gradient-Based Proposals Within Sequential Monte Carlo Samplers for Training of Partial Bayesian Neural Networks

Andrew Millard, Joshua Murphy, Simon Maskell, Zheng Zhao

PDF

Open Access

TL;DR

This paper introduces a new gradient-based SMC training method for partial Bayesian neural networks, improving scalability, predictive accuracy, and training efficiency over existing approaches.

Contribution

A novel SMC-based training approach for pBNNs that uses guided proposals and gradient-based Markov kernels, enhancing scalability and performance.

Findings

01

Outperforms state-of-the-art in predictive accuracy and loss

02

Scales well with larger batch sizes, reducing training time

03

Achieves better performance with high-dimensional problems

Abstract

Partial Bayesian neural networks (pBNNs) have been shown to perform competitively with fully Bayesian neural networks while only having a subset of the parameters be stochastic. Using sequential Monte Carlo (SMC) samplers as the inference method for pBNNs gives a non-parametric probabilistic estimation of the stochastic parameters, and has shown improved performance over parametric methods. In this paper we introduce a new SMC-based training method for pBNNs by utilising a guided proposal and incorporating gradient-based Markov kernels, which gives us better scalability on high dimensional problems. We show that our new method outperforms the state-of-the-art in terms of predictive performance and optimal loss. We also show that pBNNs scale well with larger batch sizes, resulting in significantly reduced training times and often better performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis