Range Predecessor and Lempel-Ziv Parsing
Djamal Belazzougui, Simon J. Puglisi

TL;DR
This paper presents faster algorithms for computing Lempel-Ziv parsing and its rightmost variant, achieving near-linear time with space proportional to the input size, improving over previous methods.
Contribution
The authors develop new algorithms for Lempel-Ziv parsing that are faster and use less space, including a solution for the rightmost variant and an improved 2D range reporting method.
Findings
Lempel-Ziv parsing can be computed in $O(n\log\log \sigma)$ time with $O(n\log \sigma)$ bits.
Rightmost parsing can be achieved in $O(n(1 + (\log \sigma/\sqrt{\log n})))$ time.
Provided a faster construction method for 2D orthogonal range reporting.
Abstract
The Lempel-Ziv parsing of a string (LZ77 for short) is one of the most important and widely-used algorithmic tools in data compression and string processing. We show that the Lempel-Ziv parsing of a string of length on an alphabet of size can be computed in time ( time if we allow randomization) using bits of working space; that is, using space proportional to that of the input string in bits. The previous fastest algorithm using space takes time. We also consider the important rightmost variant of the problem, where the goal is to associate with each phrase of the parsing its most recent occurrence in the input string. We solve this problem in time, using the same working space as above. The previous best solution for rightmost parsing uses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Natural Language Processing Techniques · Speech Recognition and Synthesis
