Gradient Estimation via Differentiable Metropolis-Hastings
Gaurav Arya, Moritz Schauer, Ruben Seyer

TL;DR
This paper introduces a novel method to differentiate through Metropolis-Hastings algorithms, enabling gradient estimation for intractable expectations in Bayesian inference, with proven consistency and practical demonstrations.
Contribution
It provides the first consistent gradient estimator for Metropolis-Hastings, using chain recoupling and latent variable augmentation, with theoretical guarantees.
Findings
Estimator is strongly consistent.
Method applies to Bayesian sensitivity analysis.
Effective in optimizing Metropolis proposals.
Abstract
Metropolis-Hastings estimates intractable expectations - can differentiating the algorithm estimate their gradients? The challenge is that Metropolis-Hastings trajectories are not conventionally differentiable due to the discrete accept/reject steps. Using a technique based on recoupling chains, our method differentiates through the Metropolis-Hastings sampler itself, allowing us to estimate gradients with respect to a parameter of otherwise intractable expectations. Our main contribution is a proof of strong consistency and a central limit theorem for our estimator under assumptions that hold in common Bayesian inference problems. The proofs augment the sampler chain with latent information, and formulate the estimator as a stopping tail functional of this augmented chain. We demonstrate our method on examples of Bayesian sensitivity analysis and optimizing a random walk Metropolis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Medical Imaging Techniques and Applications
