Gradient Estimation via Differentiable Metropolis-Hastings

Gaurav Arya; Moritz Schauer; Ruben Seyer

arXiv:2406.14451·math.ST·June 21, 2024

Gradient Estimation via Differentiable Metropolis-Hastings

Gaurav Arya, Moritz Schauer, Ruben Seyer

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method to differentiate through Metropolis-Hastings algorithms, enabling gradient estimation for intractable expectations in Bayesian inference, with proven consistency and practical demonstrations.

Contribution

It provides the first consistent gradient estimator for Metropolis-Hastings, using chain recoupling and latent variable augmentation, with theoretical guarantees.

Findings

01

Estimator is strongly consistent.

02

Method applies to Bayesian sensitivity analysis.

03

Effective in optimizing Metropolis proposals.

Abstract

Metropolis-Hastings estimates intractable expectations - can differentiating the algorithm estimate their gradients? The challenge is that Metropolis-Hastings trajectories are not conventionally differentiable due to the discrete accept/reject steps. Using a technique based on recoupling chains, our method differentiates through the Metropolis-Hastings sampler itself, allowing us to estimate gradients with respect to a parameter of otherwise intractable expectations. Our main contribution is a proof of strong consistency and a central limit theorem for our estimator under assumptions that hold in common Bayesian inference problems. The proofs augment the sampler chain with latent information, and formulate the estimator as a stopping tail functional of this augmented chain. We demonstrate our method on examples of Bayesian sensitivity analysis and optimizing a random walk Metropolis…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gaurav-arya/differentiable_mh
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Medical Imaging Techniques and Applications