Tail-Sensitive KL and R\'enyi Convergence of Unadjusted Hamiltonian Monte Carlo via One-Shot Couplings
Nawaf Bou-Rabee, Siddharth Mitra, Andre Wibisono

TL;DR
This paper develops a framework to analyze the convergence of unadjusted Hamiltonian Monte Carlo in tail-sensitive divergences like KL and Rényi, extending Wasserstein convergence results using one-shot couplings.
Contribution
It introduces a novel approach to upgrade Wasserstein convergence guarantees of uHMC to KL and Rényi divergences through one-shot couplings and regularization properties.
Findings
Provides quantitative bounds on density mismatch in high dimensions.
Clarifies the impact of discretization bias on divergence convergence.
Offers guarantees applicable to both unadjusted and Metropolis-adjusted HMC.
Abstract
Hamiltonian Monte Carlo (HMC) algorithms are among the most widely used sampling methods in high dimensional settings, yet their convergence properties are poorly understood in divergences that quantify relative density mismatch, such as Kullback-Leibler (KL) and R\'enyi divergences. These divergences naturally govern acceptance probabilities and warm-start requirements for Metropolis-adjusted Markov chains. In this work, we develop a framework for upgrading Wasserstein convergence guarantees for unadjusted Hamiltonian Monte Carlo (uHMC) to guarantees in tail-sensitive KL and R\'enyi divergences. Our approach is based on one-shot couplings, which we use to establish a regularization property of the uHMC transition kernel. This regularization allows Wasserstein-2 mixing-time and asymptotic bias bounds to be lifted to KL divergence, and analogous Orlicz-Wasserstein bounds to be lifted to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Generative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques
