Minimax redundancy for Markov chains with large state space

Kedar Shriram Tatwawadi; Jiantao Jiao; Tsachy Weissman

arXiv:1805.01355 · cs.IT · May 8, 2018

TL;DR

This paper analyzes how fast the redundancy of universal compression vanishes for Markov sources with large state spaces. Under a condition on the relaxation time, the number of samples needed for vanishing redundancy undergoes a phase transition at $\Theta(k^{2})$, where $k$ is the state space size.

Contribution

It establishes a precise phase transition point for the sample size needed to achieve vanishing redundancy in Markov sources with large state spaces.

Findings

01

Redundancy vanishes at a rate depending on alphabet size and mixing time.

02

Identifies a phase transition at a sample size proportional to the square of the state space size.

03

Provides bounds on the sample complexity for near-optimal compression.

Abstract

For any Markov source, there exist universal codes whose normalized codelength approaches the Shannon limit asymptotically as the number of samples goes to infinity. This paper investigates how fast the gap between the normalized codelength of the "best" universal compressor and the Shannon limit (i.e., the compression redundancy) vanishes non-asymptotically in terms of the alphabet size and mixing time of the Markov source. We show that, for Markov sources whose relaxation time is at least $1 + \frac{(2 + c)}{k}$, where $k$ is the state space size (and $c > 0$ is a constant), the phase transition for the number of samples required to achieve vanishing compression redundancy is precisely $\Theta(k^{2})$.
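
To make the quantities in the abstract concrete, the following is a minimal sketch in Python. It computes the Shannon limit (entropy rate) of a small first-order Markov chain and compares it with the per-symbol codelength of a simple sequential add-one (Laplace) estimator. The add-one code is only an illustrative stand-in for a universal compressor and is not claimed to be the scheme analyzed in the paper. The function names and the transition matrix `P` are hypothetical and chosen for illustration.

```python
import math
import random


def stationary_distribution(P, iters=10_000):
    """Approximate the stationary distribution of P by power iteration."""
    k = len(P)
    pi = [1.0 / k] * k
    for _ in range(iters):
        pi = [sum(pi[i] * P[i][j] for i in range(k)) for j in range(k)]
    return pi


def entropy_rate(P):
    """Entropy rate in bits per symbol: sum_i pi_i * H(P[i])."""
    pi = stationary_distribution(P)
    return -sum(
        pi[i] * p * math.log2(p)
        for i in range(len(P))
        for p in P[i]
        if p > 0
    )


def sample_chain(P, n, rng):
    """Draw n symbols from the chain, starting from a uniformly random state."""
    k = len(P)
    x = [rng.randrange(k)]
    for _ in range(n - 1):
        x.append(rng.choices(range(k), weights=P[x[-1]])[0])
    return x


def add_one_codelength(x, k):
    """Ideal codelength (bits) of a sequential add-one estimator.

    Each transition prev -> cur is coded with probability
    (count[prev][cur] + 1) / (total[prev] + k), conditioned on the first symbol.
    This estimator is an assumption made for illustration only.
    """
    counts = [[0] * k for _ in range(k)]
    bits = 0.0
    for prev, cur in zip(x, x[1:]):
        row = counts[prev]
        bits -= math.log2((row[cur] + 1) / (sum(row) + k))
        row[cur] += 1
    return bits


# Hypothetical two-state chain (k = 2), used only as an example.
P = [[0.9, 0.1], [0.2, 0.8]]
rng = random.Random(0)
n = 20_000
x = sample_chain(P, n, rng)
per_symbol = add_one_codelength(x, len(P)) / (n - 1)
print("entropy rate (bits/symbol):", entropy_rate(P))
print("add-one codelength (bits/symbol):", per_symbol)
print("empirical gap (bits/symbol):", per_symbol - entropy_rate(P))
```

The printed gap is a single-sample proxy for the redundancy, which is defined in expectation. With $k = 2$ and many samples it should be small. Increasing $k$ while holding $n$ fixed is one way to see why the sample size must grow with the state space size for the gap to vanish.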
