Space-Efficient Construction of Compressed Indexes in Deterministic Linear Time
J. Ian Munro, Gonzalo Navarro, Yakov Nekrich

TL;DR
This paper presents a method to construct compressed suffix arrays and trees in linear time using space proportional to the input size, improving efficiency over previous algorithms that were slower or used more space.
Contribution
It introduces a deterministic linear-time algorithm for building compressed suffix structures with optimal space complexity, advancing the state of the art.
Findings
Constructed compressed suffix array and tree in O(n) time
Achieved linear-time LZ77 and LZ78 parsing algorithms
Reduced space usage to O(n log σ) bits
Abstract
We show that the compressed suffix array and the compressed suffix tree of a string can be built in deterministic time using bits of space, where is the string length and is the alphabet size. Previously described deterministic algorithms either run in time that depends on the alphabet size or need bits of working space. Our result has immediate applications to other problems, such as yielding the first linear-time LZ77 and LZ78 parsing algorithms that use bits.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Natural Language Processing Techniques · Network Packet Processing and Optimization
