Differentiating Metropolis-Hastings to Optimize Intractable Densities

Gaurav Arya; Ruben Seyer; Frank Sch\"afer; Kartik Chandra; Alexander; K. Lew; Mathieu Huot; Vikash K. Mansinghka; Jonathan Ragan-Kelley,; Christopher Rackauckas; Moritz Schauer

arXiv:2306.07961·stat.ML·July 4, 2023·1 cites

Differentiating Metropolis-Hastings to Optimize Intractable Densities

Gaurav Arya, Ruben Seyer, Frank Sch\"afer, Kartik Chandra, Alexander, K. Lew, Mathieu Huot, Vikash K. Mansinghka, Jonathan Ragan-Kelley,, Christopher Rackauckas, Moritz Schauer

PDF

Open Access 1 Repo

TL;DR

This paper introduces an automatic differentiation method for Metropolis-Hastings algorithms, enabling gradient-based optimization of models with intractable densities, including those with discrete components.

Contribution

It combines stochastic automatic differentiation with Markov chain coupling to produce unbiased, low-variance gradients for intractable probabilistic models.

Findings

01

Successfully identified ambiguous observations in Gaussian mixture models

02

Maximized specific heat in Ising models using gradient-based methods

03

Demonstrated applicability to models with discrete components

Abstract

We develop an algorithm for automatic differentiation of Metropolis-Hastings samplers, allowing us to differentiate through probabilistic inference, even if the model has discrete components within it. Our approach fuses recent advances in stochastic automatic differentiation with traditional Markov chain coupling schemes, providing an unbiased and low-variance gradient estimator. This allows us to apply gradient-based optimization to objectives expressed as expectations over intractable target densities. We demonstrate our approach by finding an ambiguous observation in a Gaussian mixture model and by maximizing the specific heat in an Ising model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gaurav-arya/differentiable_mh
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Bayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference