Policy Gradients for Optimal Parallel Tempering MCMC

Daniel Zhao; Natesh S. Pillai

arXiv:2409.01574·stat.CO·December 30, 2024

Policy Gradients for Optimal Parallel Tempering MCMC

Daniel Zhao, Natesh S. Pillai

PDF

Open Access

TL;DR

This paper introduces an adaptive temperature selection algorithm for parallel tempering MCMC using policy gradients, improving sampling efficiency in complex distributions.

Contribution

It presents a novel policy gradient-based method for dynamically adjusting temperatures in parallel tempering, enhancing mixing and reducing autocorrelation.

Findings

01

Lower integrated autocorrelation times compared to traditional methods

02

Effective in sampling from multi-modal distributions

03

Outperforms geometric and uniform temperature schemes

Abstract

Parallel tempering is a meta-algorithm for Markov Chain Monte Carlo that uses multiple chains to sample from tempered versions of the target distribution, enhancing mixing in multi-modal distributions that are challenging for traditional methods. The effectiveness of parallel tempering is heavily influenced by the selection of chain temperatures. Here, we present an adaptive temperature selection algorithm that dynamically adjusts temperatures during sampling using a policy gradient approach. Experiments demonstrate that our method can achieve lower integrated autocorrelation times compared to traditional geometrically spaced temperatures and uniform acceptance rate schemes on benchmark distributions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Advanced Data Storage Technologies