Optimal Random Access and Conditional Lower Bounds for 2D Compressed Strings
Rajat De, Dominik Kempa

TL;DR
This paper develops the first optimal-time random access data structure for 2D grammar-compressed strings, establishes conditional lower bounds for pattern matching, and links 2D indexing complexity to open problems in 1D compression.
Contribution
It introduces a novel 2D compressed indexing scheme with optimal query time, proves lower bounds under the Orthogonal Vectors Conjecture, and connects 2D indexing challenges to fundamental 1D compression problems.
Findings
Supports optimal random access in 2D compressed strings
Establishes conditional lower bounds for 2D pattern matching
Shows complexity connections between 2D and 1D compressed indexing
Abstract
Compressed indexing is a powerful technique that enables efficient querying over data stored in compressed form, significantly reducing memory usage and often accelerating computation. While extensive progress has been made for one-dimensional strings, many real-world datasets (such as images, maps, and adjacency matrices) are inherently two-dimensional and highly compressible. Unfortunately, naively applying 1D techniques to 2D data leads to suboptimal results, as fundamental structural repetition is lost during linearization. This motivates the development of native 2D compressed indexing schemes that preserve both compression and query efficiency. We present three main contributions that advance the theory of compressed indexing for 2D strings: (1) We design the first data structure that supports optimal-time random access to a 2D string compressed by a 2D grammar. Specifically,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Graph Theory and Algorithms · DNA and Biological Computing
