Differentiating Metropolis-Hastings to Optimize Intractable Densities
Gaurav Arya, Ruben Seyer, Frank Sch\"afer, Kartik Chandra, Alexander, K. Lew, Mathieu Huot, Vikash K. Mansinghka, Jonathan Ragan-Kelley,, Christopher Rackauckas, Moritz Schauer

TL;DR
This paper introduces an automatic differentiation method for Metropolis-Hastings algorithms, enabling gradient-based optimization of models with intractable densities, including those with discrete components.
Contribution
It combines stochastic automatic differentiation with Markov chain coupling to produce unbiased, low-variance gradients for intractable probabilistic models.
Findings
Successfully identified ambiguous observations in Gaussian mixture models
Maximized specific heat in Ising models using gradient-based methods
Demonstrated applicability to models with discrete components
Abstract
We develop an algorithm for automatic differentiation of Metropolis-Hastings samplers, allowing us to differentiate through probabilistic inference, even if the model has discrete components within it. Our approach fuses recent advances in stochastic automatic differentiation with traditional Markov chain coupling schemes, providing an unbiased and low-variance gradient estimator. This allows us to apply gradient-based optimization to objectives expressed as expectations over intractable target densities. We demonstrate our approach by finding an ambiguous observation in a Gaussian mixture model and by maximizing the specific heat in an Ising model.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Bayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference
