Accelerated Markov Chain Monte Carlo Algorithms on Discrete States
Bohan Zhou, Shu Liu, Xinzhe Zuo, Wuchen Li

TL;DR
This paper introduces an accelerated Markov Chain Monte Carlo algorithm for discrete states, leveraging Nesterov's method and Wasserstein geometry to improve sampling efficiency and estimate score functions without normalization.
Contribution
It extends classical MCMC with a momentum-based acceleration framework on discrete probability simplices, incorporating Wasserstein geometry and Hamiltonian flows.
Findings
Demonstrates improved sampling efficiency on lattice and hypercube distributions.
Shows effectiveness in estimating discrete score functions without normalization.
Validates the approach with numerical experiments on Gaussian mixtures.
Abstract
We propose a class of discrete state sampling algorithms based on Nesterov's accelerated gradient method, which extends the classical Metropolis-Hastings (MH) algorithm. The evolution of the discrete states probability distribution governed by MH can be interpreted as a gradient descent direction of the Kullback--Leibler (KL) divergence, via a mobility function and a score function. Specifically, this gradient is defined on a probability simplex equipped with a discrete Wasserstein-2 metric with a mobility function. This motivates us to study a momentum-based acceleration framework using damped Hamiltonian flows on the simplex set, whose stationary distribution matches the discrete target distribution. Furthermore, we design an interacting particle system to approximate the proposed accelerated sampling dynamics. The extension of the algorithm with a general choice of potentials and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Statistical Mechanics and Entropy · Target Tracking and Data Fusion in Sensor Networks
