A Constrained-Dictionary version of LZ78 asymptotically achieves the Finite-State Compressibility for any Individual Sequence with a Distortion measure
Jacob Ziv

TL;DR
This paper proves that a constrained-dictionary version of LZ78, using a least recently utilized approach, asymptotically achieves finite-state compressibility for individual sequences, extending the theoretical understanding of universal data compression.
Contribution
It demonstrates that LZ78(LRU) with increasing dictionary size asymptotically attains the finite-state compressibility, providing a theoretical optimality result for this popular compression method.
Findings
LZ78 asymptotically achieves FS compressibility.
LZ78(LRU) with large dictionary size also achieves FS compressibility.
A new approach using vector quantization is asymptotically optimal.
Abstract
The unrestricted LZ78 universal data-compression algorithm (as well as the LZ77 and LZW versions) achieves asymptotically, as the block-length tends to infinity, the FS compressibility, namely the best compression-ratio that may be achieved by any Information-lossless(IL) block-to-variable finite-state(FS) algorithm, for any infinitely-long individual sequence. The encoder parses the sequence into distinct phrases where each newly generated phrase is a past phrase which is already stored in a dictionary, extended by one letter. The newly generated phrase is then added to the updated, ever-growing dictionary. One heuristic approach is the "Least Recently Utilized" (LRU) deletion approach, where only the most recent D entries are kept in the dictionary, thus yielding a constrained-dictionary version of LZ78 denoted by LZ78(LRU). In this note, for the sake of completeness, it is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Wireless Communication Techniques · Error Correcting Code Techniques
