Geometry and factorization of multivariate Markov chains with applications to MCMC acceleration and approximate inference
Michael C.H. Choi, Youjia Wang, Geoffrey Wolfer

TL;DR
This paper explores the geometry and factorization of multivariate Markov chains, introducing projection-based algorithms that improve mixing times in MCMC and scalable approximate filtering for high-dimensional problems.
Contribution
It presents a novel geometric perspective on Markov chain factorizability, leading to new inequalities and practical algorithms for faster mixing and scalable inference.
Findings
Projection samplers accelerate mixing times significantly.
Swapping algorithm with projection resampling improves convergence.
Factored filtering scales linearly with dimension, maintaining accuracy.
Abstract
This paper analyzes the factorizability and geometry of transition matrices of multivariate Markov chains. Specifically, we demonstrate that the induced chains on factors of a product space can be regarded as information projections with respect to the Kullback-Leibler divergence. This perspective yields Han-Shearer type inequalities and submodularity of the entropy rate of Markov chains, as well as applications in the context of large deviations and mixing time comparison. As concrete algorithmic applications in Markov chain Monte Carlo (MCMC) and approximate inference, we provide three illustrations based on lifted MCMC, swapping algorithm and factored filtering to demonstrate projection samplers improve mixing over the original samplers. The projection sampler based on the swapping algorithm resamples the highest-temperature coordinate at stationarity at each step, and we prove that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods
