Source Coding, Large Deviations, and Approximate Pattern Matching
A. Dembo, I. Kontoyiannis

TL;DR
This paper develops a lossy version of the Asymptotic Equipartition Property (AEP) to advance rate-distortion theory and analyze pattern matching in lossy data compression, paralleling lossless methods and providing new theoretical insights.
Contribution
It introduces a generalized lossy AEP, proves strengthened coding theorems, and analyzes pattern matching and waiting times for lossy compression, extending classical information theory results.
Findings
Generalized lossy AEP established for lossy data compression.
Second order coding theorems proved for lossy sources.
Characterization of pattern matching performance and waiting times.
Abstract
We present a development of parts of rate-distortion theory and pattern- matching algorithms for lossy data compression, centered around a lossy version of the Asymptotic Equipartition Property (AEP). This treatment closely parallels the corresponding development in lossless compression, a point of view that was advanced in an important paper of Wyner and Ziv in 1989. In the lossless case we review how the AEP underlies the analysis of the Lempel-Ziv algorithm by viewing it as a random code and reducing it to the idealized Shannon code. This also provides information about the redundancy of the Lempel-Ziv algorithm and about the asymptotic behavior of several relevant quantities. In the lossy case we give various versions of the statement of the generalized AEP and we outline the general methodology of its proof via large deviations. Its relationship with Barron's generalized AEP is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Cellular Automata and Applications · DNA and Biological Computing
