Dynamic Grammar-Compressed Self-Index in $\delta$-Optimal Space
Takaaki Nishimoto, Yasuo Tabei

TL;DR
This paper introduces the first dynamic self-index that is both space-efficient for highly repetitive data and supports fast updates and queries, advancing the state of the art in self-indexing.
Contribution
It presents the dynamic RR-index, achieving delta-optimal space and efficient query and update times, overcoming limitations of previous dynamic self-indexes.
Findings
Occupies expected delta-optimal space in practice.
Answers locate queries efficiently with no dependence on LCP.
Supports insertions and deletions with expected amortized time.
Abstract
A compressed self-index stores a string in compressed form while supporting locate queries without decompression. For highly repetitive strings (arising in web crawls, versioned documents, and genomic collections), static self-indexes can match the -optimal lower bound of bits up to constant factors, where is the string length, is the alphabet size, and is the substring complexity. Their dynamic counterparts, however, remain scarce: every existing dynamic self-index either fails to attain -optimal space, pays at least time per reported occurrence during locate, or exposes the longest common prefix (LCP) of the text inside its update time. We present the dynamic RR-index, a dynamic grammar-compressed self-index built on the restricted recompression run-length straight-line…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
