Parallel Tempering With a Variational Reference
Nikola Surjanovic, Saifuddin Syed, Alexandre Bouchard-C\^ot\'e, Trevor, Campbell

TL;DR
This paper introduces an adaptive parallel tempering method that uses a variational reference distribution to improve sampling efficiency in complex Bayesian models, especially with large data.
Contribution
It proposes a novel adaptive annealing path connecting the posterior to a variational reference, enhancing traditional PT methods for difficult distributions.
Findings
Improves sampling efficiency in complex Bayesian models.
Outperforms traditional PT in large-data limits.
Demonstrates significant empirical gains across various scenarios.
Abstract
Sampling from complex target distributions is a challenging task fundamental to Bayesian inference. Parallel tempering (PT) addresses this problem by constructing a Markov chain on the expanded state space of a sequence of distributions interpolating between the posterior distribution and a fixed reference distribution, which is typically chosen to be the prior. However, in the typical case where the prior and posterior are nearly mutually singular, PT methods are computationally prohibitive. In this work we address this challenge by constructing a generalized annealing path connecting the posterior to an adaptively tuned variational reference. The reference distribution is tuned to minimize the forward (inclusive) KL divergence to the posterior distribution using a simple, gradient-free moment-matching procedure. We show that our adaptive procedure converges to the forward KL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models · Markov Chains and Monte Carlo Methods
