Compressed Data Structures for Dynamic Sequences
J. Ian Munro, Yakov Nekrich

TL;DR
This paper introduces a fully-dynamic compressed data structure for large alphabets that supports efficient updates and queries, achieving optimal time complexities and minimal space usage based on the string's entropy.
Contribution
It presents the first fully-dynamic structure with optimal query times and worst-case update guarantees for large alphabets, using space close to the entropy bound.
Findings
Supports insertions and deletions with worst-case guarantees
Achieves optimal query times for access, rank, and select
Uses space close to the entropy of the string
Abstract
We consider the problem of storing a dynamic string over an alphabet in compressed form. Our representation supports insertions and deletions of symbols and answers three fundamental queries: returns the -th symbol in , counts how many times a symbol occurs among the first positions in , and finds the position where a symbol occurs for the -th time. We present the first fully-dynamic data structure for arbitrarily large alphabets that achieves optimal query times for all three operations and supports updates with worst-case time guarantees. Ours is also the first fully-dynamic data structure that needs only bits, where is the -th order entropy and is the string length. Moreover our representation supports extraction of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · DNA and Biological Computing · semigroups and automata theory
