Dynamic Relative Compression, Dynamic Partial Sums, and Substring Concatenation
Philip Bille, Patrick Hagge Cording, Inge Li G{\o}rtz and, Frederik Rye Skjoldjensen, Hjalte Wedel Vildh{\o}j, S{\o}ren Vind

TL;DR
This paper introduces dynamic data structures for relative compression that efficiently handle edits and support fast random access, advancing the compression of highly-repetitive data like genomes and web data.
Contribution
It develops optimal or near-optimal dynamic data structures for relative compression, dynamic partial sums, and substring concatenation, improving update and query times.
Findings
Achieves optimal update and query times for dynamic relative compression.
Provides new bounds for dynamic partial sums and substring concatenation problems.
Enhances string indexing and pattern matching capabilities with wildcards.
Abstract
Given a static reference string and a source string , a relative compression of with respect to is an encoding of as a sequence of references to substrings of . Relative compression schemes are a classic model of compression and have recently proved very successful for compressing highly-repetitive massive data sets such as genomes and web-data. We initiate the study of relative compression in a dynamic setting where the compressed source string is subject to edit operations. The goal is to maintain the compressed representation compactly, while supporting edits and allowing efficient random access to the (uncompressed) source string. We present new data structures that achieve optimal time for updates and queries while using space linear in the size of the optimal relative compression, for nearly all combinations of parameters. We also present solutions for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
