Low Space External Memory Construction of the Succinct Permuted Longest Common Prefix Array
German Tischler

TL;DR
This paper introduces a space-efficient external memory algorithm for constructing succinct permuted LCP arrays, significantly reducing space requirements in practical scenarios with small alphabet sizes.
Contribution
It presents the first external memory algorithm for succinct permuted LCP arrays that needs only $O(n \log \sigma)$ bits of external space, rather than the $O(n)$ words required by previous methods, a saving that is largest for small alphabets.
Findings
Uses $O(n \log \sigma)$ bits of external space
Applicable to circular strings
Efficient for small alphabet sizes
Abstract
The longest common prefix (LCP) array is a versatile auxiliary data structure in indexed string matching. It can be used to speed up searching using the suffix array (SA) and provides an implicit representation of the topology of an underlying suffix tree. The LCP array of a string of length $n$ can be represented as an array of length $n$ words, or, in the presence of the SA, as a bit vector of $2n$ bits plus asymptotically negligible support data structures. External memory construction algorithms for the LCP array have been proposed, but those proposed so far have a space requirement of $O(n)$ words (i.e. $O(n \log n)$ bits) in external memory. This space requirement is in some practical cases prohibitively expensive. We present an external memory algorithm for constructing the $2n$ bit version of the LCP array which uses $O(n \log \sigma)$ bits of additional space in external memory…
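To make the $2n$-bit representation mentioned in the abstract concrete, here is a minimal in-memory sketch in Python. It is not the paper's external memory algorithm: it builds the suffix array naively, computes the permuted LCP (PLCP) array in text order, and encodes it with the standard trick of setting bit $\mathrm{PLCP}[i] + 2i$ for each text position $i$. All function names are hypothetical and chosen for illustration, and the select operation is a linear scan rather than a succinct support structure.

```python
def suffix_array(s):
    # Naive construction by sorting suffixes; only suitable for toy inputs.
    return sorted(range(len(s)), key=lambda i: s[i:])


def plcp_array(s, sa):
    """PLCP[i] = LCP of suffix i with its lexicographic predecessor (0 if none)."""
    n = len(s)
    phi = [-1] * n  # phi[i] = start of the suffix preceding suffix i in SA order
    for r in range(1, n):
        phi[sa[r]] = sa[r - 1]
    plcp = [0] * n
    l = 0
    for i in range(n):
        j = phi[i]
        if j < 0:
            l = 0
        else:
            # Extend the match; l carries over because PLCP[i+1] >= PLCP[i] - 1.
            while i + l < n and j + l < n and s[i + l] == s[j + l]:
                l += 1
        plcp[i] = l
        l = max(l - 1, 0)
    return plcp


def encode_succinct(plcp):
    """Encode PLCP as 2n bits: the i-th one bit sits at position PLCP[i] + 2i."""
    n = len(plcp)
    bits = [0] * (2 * n)
    for i, v in enumerate(plcp):
        # Positions are strictly increasing and at most 2n - 2.
        bits[v + 2 * i] = 1
    return bits


def decode_entry(bits, i):
    """Recover PLCP[i] as select1(i) - 2i, using a linear-scan select."""
    seen = -1
    for pos, b in enumerate(bits):
        if b:
            seen += 1
            if seen == i:
                return pos - 2 * i
    raise IndexError(i)


def lcp_from_plcp(bits, sa):
    """LCP[r] = PLCP[SA[r]], which is why the SA must be present."""
    return [decode_entry(bits, i) for i in sa]


text = "banana"
sa = suffix_array(text)
plcp = plcp_array(text, sa)
bits = encode_succinct(plcp)
lcp = lcp_from_plcp(bits, sa)
```

The encoding works because consecutive PLCP values can drop by at most one, so the positions $\mathrm{PLCP}[i] + 2i$ are strictly increasing and fit in $2n$ bits. Reading the LCP array in suffix array order then requires the SA (or an equivalent way to map ranks to text positions), which matches the abstract's "in the presence of the SA" qualifier.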
Taxonomy
Topics: Algorithms and Data Compression · Network Packet Processing and Optimization · DNA and Biological Computing
