Optimal Lempel-Ziv based lossy compression for memoryless data: how to make the right mistakes
Narayana Santhanam, Dharmendra Modha

TL;DR
This paper proves the optimality of a Lempel-Ziv inspired algorithm, Codelet Parsing, for lossy compression of memoryless binary sequences, extending lossless methods to achieve near-minimal representation length under a distortion constraint.
Contribution
It introduces and proves the optimality of Codelet Parsing, a quasi-linear time algorithm that extends Lempel-Ziv to lossy compression for iid binary data with Hamming distortion.
Findings
Codelet Parsing achieves the rate distortion limit for iid binary sequences.
The algorithm constructs codebooks logarithmic in sequence length.
Codelet lengths are approximately (log n)/R_d, optimizing compression under distortion.
Abstract
Compression refers to encoding data using bits, so that the representation uses as few bits as possible. Compression could be lossless: i.e. encoded data can be recovered exactly from its representation) or lossy where the data is compressed more than the lossless case, but can still be recovered to within prespecified distortion metric. In this paper, we prove the optimality of Codelet Parsing, a quasi-linear time algorithm for lossy compression of sequences of bits that are independently and identically distributed (\iid) and Hamming distortion. Codelet Parsing extends the lossless Lempel Ziv algorithm to the lossy case---a task that has been a focus of the source coding literature for better part of two decades now. Given \iid sequences , the expected length of the shortest lossy representation such that can be reconstructed to within distortion is given by the rate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Bandit Algorithms Research · Error Correcting Code Techniques
