Pragmatic lossless compression: Fundamental limits and universality
Andreas Theocharous, Lampros Gavalakis, Ioannis Kontoyiannis

TL;DR
This paper establishes fundamental limits and universal bounds for lossless data compression, providing precise nonasymptotic characterizations and highlighting the practical importance of pragmatic rates over entropy in short-blocklength scenarios.
Contribution
It derives sharp bounds and explicit nonasymptotic expansions for the optimal compression rate, including universal results with a quantifiable penalty for unknown sources.
Findings
Bounds are more accurate than normal approximation or error exponents in small excess-rate regimes.
Universal achievability results include explicit 'price for universality' terms.
Pragmatic rates are more relevant than entropy for short blocklengths with strict guarantees.
Abstract
The problem of variable-rate lossless data compression is considered, for codes with and without prefix constraints. Sharp bounds are derived for the best achievable compression rate of memoryless sources, when the excess-rate probability is required to be exponentially small in the blocklength. Accurate nonasymptotic expansions with explicit constants are obtained for the optimal rate, using tools from large deviations and Gaussian approximation. When the source distribution is unknown, a universal achievability result is obtained with an explicit ''price for universality'' term. This is based on a fine combinatorial estimate on the number of sequences with small empirical entropy, which might be of independent interest. Examples are shown indicating that, in the small excess-rate-probability regime, the approximation to the fundamental limit of the compression rate suggested by these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques
