Explaining the Inherent Tradeoffs for Suffix Array Functionality: Equivalences between String Problems and Prefix Range Queries
Dominik Kempa, Tomasz Kociumaka

TL;DR
This paper establishes a fundamental equivalence between suffix array queries and prefix-select queries, unifying and extending prior approaches to analyze and improve string data structures efficiently.
Contribution
It introduces the first bidirectional reduction showing suffix array queries are essentially equivalent to prefix-select queries, unifying various string query problems.
Findings
Suffix array queries are equivalent to prefix-select queries up to an additive logarithmic term.
The framework unifies prior approaches and problem pairs in string processing.
Identifies six core problem pairs connecting string and prefix query models.
Abstract
We study the fundamental question of how efficiently suffix array entries can be accessed when the array cannot be stored explicitly. The suffix array of a text of length encodes the lexicographic order of its suffixes and underlies numerous applications in pattern matching, data compression, and bioinformatics. Previous work established one-way reductions showing how suffix array queries can be answered using, for example, rank queries on the Burrows-Wheeler Transform. More recently, a new class of prefix queries was introduced, together with reductions that, among others, transform a simple tradeoff for prefix-select queries into a suffix array tradeoff matching state-of-the-art space and query-time bounds, while achieving sublinear construction time. For binary texts, the resulting data structure achieves space bits, preprocessing time $O(n / \sqrt{\log…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Machine Learning and Algorithms · Network Packet Processing and Optimization
