Breaking a Barrier in Constructing Compact Indexes for Parameterized Pattern Matching
Kento Iseri, Tomohiro I, Diptarama Hendrian, Dominik K\"oppl, Ryo, Yoshinaka, Ayumi Shinohara

TL;DR
This paper presents an improved method for constructing a compact index for parameterized pattern matching, enabling efficient online construction and backward search support, addressing a longstanding open problem in the field.
Contribution
It significantly reduces the construction time complexity of the pBWT index to remove the dependence on the size of p-symbol alphabet, and supports online, space-efficient index construction with backward search.
Findings
Achieved $O(n rac{ ext{lg} | ext{Sigma}_p| ext{lg} n}{ ext{lg} ext{lg} n})$ construction time.
First compact index for p-matching supporting online construction and backward search.
Addresses a decade-old open problem in compact index construction for parameterized pattern matching.
Abstract
A parameterized string (p-string) is a string over an alphabet , where and are disjoint alphabets for static symbols (s-symbols) and for parameter symbols (p-symbols), respectively. Two p-strings and are said to parameterized match (p-match) if and only if can be transformed into by applying a bijection on to every occurrence of p-symbols in . The indexing problem for p-matching is to preprocess a p-string of length so that we can efficiently find the occurrences of substrings of that p-match with a given pattern. Extending the Burrows-Wheeler Transform (BWT) based index for exact string pattern matching, Ganguly et al. [SODA 2017] proposed the first compact index (named pBWT) for p-matching, and posed an open problem on how to construct it in compact space, i.e., in $O(n \lg |\Sigma_{s}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
