Entropy Contractions in Markov Chains: Half-Step, Full-Step and   Continuous-Time

Pietro Caputo; Zongchen Chen; Yuzhou Gu; Yury Polyanskiy

arXiv:2409.07689·math.PR·September 13, 2024

Entropy Contractions in Markov Chains: Half-Step, Full-Step and Continuous-Time

Pietro Caputo, Zongchen Chen, Yuzhou Gu, Yury Polyanskiy

PDF

Open Access

TL;DR

This paper investigates the relationships between different entropy contraction measures in Markov chains, providing counterexamples that challenge previous conjectures and offering tools for analyzing these contractions.

Contribution

It disproves the conjecture that various entropy contraction coefficients are within a constant factor of each other, and introduces new counterexamples and analysis methods.

Findings

01

Continuous-time processes can contract faster than discrete-time counterparts.

02

Higher powers of a kernel can contract better than lower powers.

03

Standard inequalities comparing entropy and variance contraction are generally not improvable.

Abstract

This paper considers the speed of convergence (mixing) of a finite Markov kernel $P$ with respect to the Kullback-Leibler divergence (entropy). Given a Markov kernel one defines either a discrete-time Markov chain (with the $n$ -step transition kernel given by the matrix power $P^{n}$ ) or a continuous-time Markov process (with the time- $t$ transition kernel given by $e^{t (P - Id)}$ ). The contraction of entropy for $n = 1$ or $t = 0 +$ are characterized by the famous functional inequalities, the strong data processing inequality (SDPI) and the modified log-Sobolev inequality (MLSI), respectively. When $P = K K^{*}$ is written as the product of a kernel and its adjoint, one could also consider the ``half-step'' contraction, which is the SDPI for $K$ , while the ``full-step'' contraction refers to the SDPI for $P$ . The work [DMLM03] claimed that these contraction coefficients (half-step,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods