Perturbation theory for Markov chains via Wasserstein distance

Daniel Rudolf; Nikolaus Schweizer

arXiv:1503.04123·stat.CO·February 27, 2017

Perturbation theory for Markov chains via Wasserstein distance

Daniel Rudolf, Nikolaus Schweizer

PDF

TL;DR

This paper develops bounds on how small changes in Markov chain transitions affect their distributions, especially for Wasserstein ergodic chains, with applications to approximate MCMC methods in big data analysis.

Contribution

It introduces new bounds for Markov chain perturbations using Wasserstein distance, applicable to ergodic chains and approximate MCMC algorithms, under weak conditions.

Findings

01

Bounds are tight for autoregressive models.

02

Quantitative estimates are provided for approximate Metropolis-Hastings.

03

The approach applies Lyapunov functions to weakly ergodic chains.

Abstract

Perturbation theory for Markov chains addresses the question how small differences in the transitions of Markov chains are reflected in differences between their distributions. We prove powerful and flexible bounds on the distance of the $n$ th step distributions of two Markov chains when one of them satisfies a Wasserstein ergodicity condition. Our work is motivated by the recent interest in approximate Markov chain Monte Carlo (MCMC) methods in the analysis of big data sets. By using an approach based on Lyapunov functions, we provide estimates for geometrically ergodic Markov chains under weak assumptions. In an autoregressive model, our bounds cannot be improved in general. We illustrate our theory by showing quantitative estimates for approximate versions of two prominent MCMC algorithms, the Metropolis-Hastings and stochastic Langevin algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.