Decompressing Lempel-Ziv Compressed Text

Philip Bille; Mikko Berggren Ettienne; Travis Gagie; Inge Li G{\o}rtz,; Nicola Prezza

arXiv:1802.10347·cs.DS·November 5, 2019

Decompressing Lempel-Ziv Compressed Text

Philip Bille, Mikko Berggren Ettienne, Travis Gagie, Inge Li G{\o}rtz,, Nicola Prezza

PDF

Open Access

TL;DR

This paper presents new algorithms for decompressing Lempel-Ziv 77 compressed text efficiently, achieving near-optimal space and time complexities, and improves pattern matching on compressed data.

Contribution

It introduces algorithms that decompress LZ77 compressed text in linear time with minimal working space, surpassing previous folklore solutions, especially for general alphabets.

Findings

01

Achieves $O(n)$ time and $O(z)$ space for constant alphabets.

02

Provides a trade-off between time and space for large alphabets.

03

Improves pattern matching efficiency on LZ77-compressed text.

Abstract

We consider the problem of decompressing the Lempel--Ziv 77 representation of a string $S$ of length $n$ using a working space as close as possible to the size $z$ of the input. The folklore solution for the problem runs in $O (n)$ time but requires random access to the whole decompressed text. Another folklore solution is to convert LZ77 into a grammar of size $O (z lo g (n / z))$ and then stream $S$ in linear time. In this paper, we show that $O (n)$ time and $O (z)$ working space can be achieved for constant-size alphabets. On general alphabets of size $σ$ , we describe (i) a trade-off achieving $O (n lo g^{δ} σ)$ time and $O (z lo g^{1 - δ} σ)$ space for any $0 \leq δ \leq 1$ , and (ii) a solution achieving $O (n)$ time and $O (z lo g lo g (n / z))$ space. The latter solution, in particular, dominates both folklore algorithms for the problem. Our solutions can, more generally,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · DNA and Biological Computing · Network Packet Processing and Optimization