Grammar-Based Construction of Indexes for Binary Jumbled Pattern Matching
Travis Gagie

TL;DR
This paper introduces a grammar-based indexing method for binary strings that enables efficient pattern matching queries related to substring composition, significantly improving query times with a linear-space index.
Contribution
It presents a novel construction of indexes from straight-line programs for binary strings, achieving fast query times for substring composition problems.
Findings
Index construction in $O(g^{2/3} n^{4/3})$ time
Constant-time queries for substring presence with specific composition
Linear-space index allows listing substrings efficiently
Abstract
We show how, given a straight-line program with rules for a binary string of length , in time we can build a linear-space index such that, given and , in O(1) time we can determine whether there is a substring of with length containing exactly copies of 1. If we use space for the index, then we can list all such substrings using time per substring.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Natural Language Processing Techniques · semigroups and automata theory
