Lempel-Ziv Computation In Compressed Space (LZ-CICS)
Dominik K\"oppl, Kunihiko Sadakane

TL;DR
This paper presents efficient algorithms for computing Lempel-Ziv factorizations of texts in compressed space, achieving near-linear time with minimal working space, and leveraging compressed suffix tree representations for further optimization.
Contribution
It introduces algorithms for Lempel-Ziv 77- and 78-factorizations that operate in compressed space and near-linear time, improving efficiency over previous methods.
Findings
Lempel-Ziv factorizations can be computed in $O(n \, \lg \lg \sigma)$ time.
Using compressed suffix trees, factorizations can be computed in $O(n)$ time.
Algorithms use $O(n \lg \sigma)$ bits of space, optimizing memory usage.
Abstract
We show that both the Lempel Ziv 77- and the 78-factorization of a text of length on an integer alphabet of size can be computed in time (linear time if we allow randomization) using bits of working space. Given that a compressed representation of the suffix tree is loaded into RAM, we can compute both factorizations in time using bits of space, where is the number of factors.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
