Martingale Posterior Neural Networks for Fast Sequential Decision Making

Gerardo Duran-Martin; Leandro S\'anchez-Betancourt; \'Alvaro Cartea; Kevin Murphy

arXiv:2506.11898·cs.LG·October 10, 2025

Martingale Posterior Neural Networks for Fast Sequential Decision Making

Gerardo Duran-Martin, Leandro S\'anchez-Betancourt, \'Alvaro Cartea, Kevin Murphy

PDF

Open Access

TL;DR

This paper presents scalable, online algorithms for Bayesian decision making using martingale posteriors, enabling fast, uncertainty-aware neural network updates suitable for non-stationary environments.

Contribution

It introduces a predictive-first approach with neural network parameterization and Kalman-filter-like updates, decoupling decision-making from parameter inference.

Findings

01

Achieves 10-100x faster inference than classical methods.

02

Maintains competitive decision performance in bandits and Bayesian optimization.

03

Operates fully online without replay, providing efficient uncertainty quantification.

Abstract

We introduce scalable algorithms for online learning of neural network parameters and Bayesian sequential decision making. Unlike classical Bayesian neural networks, which induce predictive uncertainty through a posterior over model parameters, our methods adopt a predictive-first perspective based on martingale posteriors. In particular, we work directly with the one-step-ahead posterior predictive, which we parameterize with a neural network and update sequentially with incoming observations. This decouples Bayesian decision-making from parameter-space inference: we sample from the posterior predictive for decision making, and update the parameters of the posterior predictive via fast, frequentist Kalman-filter-like recursions. Our algorithms operate in a fully online, replay-free setting, providing principled uncertainty quantification without costly posterior sampling. Empirically,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Adversarial Robustness in Machine Learning