Approximating LZ77 in Small Space
Travis Gagie

TL;DR
This paper presents a method to approximate the LZ77 compression of a string efficiently in small space, balancing time, I/O, and space complexity with a tunable parameter.
Contribution
It introduces an algorithm that constructs an LZ77-like parse with a controlled number of phrases using sub-quadratic space and near-linear time.
Findings
Achieves an LZ77-like parse with O(z/ε) phrases
Uses O(n^{1+ε}) time and I/Os for approximation
Operates within O(z/ε) space
Abstract
Given a positive \(\epsilon \leq 1\) and read-only access to a string \(S [1..n]\) whose LZ77 parse consists of phrases, with high probability we can build an LZ77-like parse of that consists of phrases using time, I/Os (where is the size of a disk block) and space.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · semigroups and automata theory · Natural Language Processing Techniques
