Minimax redundancy for Markov chains with large state space
Kedar Shriram Tatwawadi, Jiantao Jiao, Tsachy Weissman

TL;DR
This paper analyzes the rate at which universal coding approaches the Shannon limit for large-state Markov sources, revealing a phase transition in sample complexity related to the alphabet size and mixing time.
Contribution
It establishes a precise phase transition point for the sample size needed to achieve vanishing redundancy in Markov sources with large state spaces.
Findings
Redundancy vanishes at a rate depending on alphabet size and mixing time.
Identifies a phase transition at sample size proportional to the square of the state space.
Provides bounds on the sample complexity for near-optimal compression.
Abstract
For any Markov source, there exist universal codes whose normalized codelength approaches the Shannon limit asymptotically as the number of samples goes to infinity. This paper investigates how fast the gap between the normalized codelength of the "best" universal compressor and the Shannon limit (i.e. the compression redundancy) vanishes non-asymptotically in terms of the alphabet size and mixing time of the Markov source. We show that, for Markov sources whose relaxation time is at least , where is the state space size (and is a constant), the phase transition for the number of samples required to achieve vanishing compression redundancy is precisely .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
