Redundancy of unbounded memory Markov classes with continuity conditions
Changlong Wu, Maryam Hosseini, Narayana Santhanam

TL;DR
This paper investigates the limits of universal data compression for binary Markov sources with unbounded memory, focusing on sources that satisfy a continuity condition linking finite-memory estimates to the true infinite-memory distribution.
Contribution
It establishes asymptotically matching bounds on redundancy for Markov sources with continuity conditions, revealing which sources dominate redundancy considerations.
Findings
Derived tight upper and lower bounds on redundancy.
Identified key sources influencing redundancy.
Connected compression performance to source continuity properties.
Abstract
We study the redundancy of universally compressing strings generated by a binary Markov source without any bound on the memory. To better understand the connection between compression and estimation in the Markov regime, we consider a class of Markov sources restricted by a continuity condition. In the absence of an upper bound on memory, the continuity condition implies that gets closer to the true probability as increases, rather than vary around arbitrarily. For such sources, we prove asymptotically matching upper and lower bounds on the redundancy. In the process, we identify what sources in the class matter the most from a redundancy perspective.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · DNA and Biological Computing · Cellular Automata and Applications
