Rao-Blackwellised Reparameterisation Gradients

Kevin H. Lam; Thang D. Bui; George Deligiannidis; Yee Whye Teh

arXiv:2506.07687·stat.ML·October 21, 2025

Rao-Blackwellised Reparameterisation Gradients

Kevin H. Lam, Thang D. Bui, George Deligiannidis, Yee Whye Teh

PDF

Open Access 1 Video

TL;DR

This paper introduces the R2-G2 estimator, a Rao-Blackwellised reparameterisation gradient method, which improves gradient estimation for models with latent Gaussian variables, leading to better training performance.

Contribution

The paper proposes the R2-G2 estimator as a Rao-Blackwellised version of the reparameterisation gradient, extending its benefits to various probabilistic models.

Findings

01

R2-G2 outperforms standard estimators in training models with reparameterisation.

02

The local reparameterisation gradient for Bayesian MLPs is an instance of R2-G2.

03

Initial training with R2-G2 yields better model performance.

Abstract

Latent Gaussian variables have been popularised in probabilistic machine learning. In turn, gradient estimators are the machinery that facilitates gradient-based optimisation for models with latent Gaussian variables. The reparameterisation trick is often used as the default estimator as it is simple to implement and yields low-variance gradients for variational inference. In this work, we propose the R2-G2 estimator as the Rao-Blackwellisation of the reparameterisation gradient estimator. Interestingly, we show that the local reparameterisation gradient estimator for Bayesian MLPs is an instance of the R2-G2 estimator and Rao-Blackwellisation. This lets us extend benefits of Rao-Blackwellised gradients to a suite of probabilistic models. We show that initial training with R2-G2 consistently yields better performance in models with multiple applications of the reparameterisation trick.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Rao-Blackwellised Reparameterisation Gradients· slideslive

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning