Balancing Run-Length Straight-Line Programs
Gonzalo Navarro (1, 2), Francisco Olivares (1, 2), Cristian Urbina (1, 2) ((1) University of Chile, (2) CeBiB)

TL;DR
This paper proves that balanced run-length straight-line programs (RLSLPs) can be constructed efficiently, enabling faster substring extraction and query operations like RMQ and fingerprinting with space proportional to the smallest RLSLP size.
Contribution
It extends the linear-time balancing result from SLPs to RLSLPs, simplifying algorithms and reducing space for substring queries and operations.
Findings
Balanced RLSLPs can be constructed in linear time.
Substring extraction from RLSLPs is simplified.
RMQs and Karp-Rabin fingerprints on substrings can be answered in logarithmic time, using space proportional to the size of the smallest RLSLP.
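To illustrate why run-length rules are compatible with fingerprinting, the following is a minimal sketch, not the paper's construction. It assumes a polynomial Karp-Rabin fingerprint with a hypothetical base `BASE` and modulus `MOD`, and shows that the fingerprint of a repeated string can be obtained from the fingerprint of one copy with O(log t) concatenation steps. All function names here are illustrative.

```python
MOD = (1 << 61) - 1   # assumed prime modulus (illustrative choice)
BASE = 911382323      # assumed base (illustrative choice)

def direct_fp(s: str) -> int:
    """Fingerprint computed symbol by symbol, used only as a reference."""
    h = 0
    for ch in s:
        h = (h * BASE + ord(ch)) % MOD
    return h

def concat_fp(u, v):
    """Combine (fingerprint, length) pairs of u and v into the pair of uv."""
    fp_u, len_u = u
    fp_v, len_v = v
    return ((fp_u * pow(BASE, len_v, MOD) + fp_v) % MOD, len_u + len_v)

def power_fp(u, t: int):
    """Pair for u repeated t times, by repeated doubling (O(log t) concatenations)."""
    result = (0, 0)          # pair of the empty string
    block = u                # invariant: block is u repeated 2^k times
    while t > 0:
        if t & 1:
            result = concat_fp(result, block)
        block = concat_fp(block, block)
        t >>= 1
    return result

# A rule deriving "ab" repeated 5 times, fingerprinted from one copy of "ab".
run_pair = power_fp((direct_fp("ab"), 2), 5)
```

The doubling loop works because every intermediate block is a power of the same string, so the order in which blocks are concatenated does not matter.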
Abstract
It was recently proved that any SLP generating a given string can be transformed in linear time into an equivalent balanced SLP of the same asymptotic size. We show that this result also holds for RLSLPs, which are SLPs extended with run-length rules of the form $A \rightarrow B^t$ for $t > 2$, deriving $\texttt{exp}(A) = \texttt{exp}(B)^t$. An immediate consequence is the simplification of the algorithm for extracting substrings of an RLSLP-compressed string. We also show that several problems like answering RMQs and computing Karp-Rabin fingerprints on substrings can be solved in $\mathcal{O}(g_{rl})$ space and $\mathcal{O}(\log n)$ time, $g_{rl}$ being the size of the smallest RLSLP generating the string, of length $n$. We extend the result to solving more general operations on string ranges, in $\mathcal{O}(g_{rl})$ space and $\mathcal{O}(\log n)$ applications of the operation. In…
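As a concrete picture of the objects the abstract refers to, here is a minimal sketch of an RLSLP and of substring extraction by top-down descent. It is a toy illustration under assumed names (`Terminal`, `Concat`, `Run`, `extract`), not the paper's algorithm, and it does not balance the grammar. The number of rules visited on each root-to-leaf path is bounded by the grammar's height, which is what balancing keeps logarithmic in the string length.

```python
from dataclasses import dataclass
from typing import Dict, Union

@dataclass(frozen=True)
class Terminal:          # A -> a
    char: str

@dataclass(frozen=True)
class Concat:            # A -> B C
    left: str
    right: str

@dataclass(frozen=True)
class Run:               # A -> B^t (run-length rule)
    child: str
    times: int

Rule = Union[Terminal, Concat, Run]

def expansion_lengths(grammar: Dict[str, Rule]) -> Dict[str, int]:
    """Length of the expansion of every nonterminal, memoized."""
    memo: Dict[str, int] = {}
    def length(name: str) -> int:
        if name not in memo:
            rule = grammar[name]
            if isinstance(rule, Terminal):
                memo[name] = 1
            elif isinstance(rule, Concat):
                memo[name] = length(rule.left) + length(rule.right)
            else:
                memo[name] = rule.times * length(rule.child)
        return memo[name]
    for name in grammar:
        length(name)
    return memo

def extract(grammar, lens, name: str, start: int, end: int) -> str:
    """Return the expansion of `name` restricted to [start, end), 0-based."""
    if start >= end:
        return ""
    rule = grammar[name]
    if isinstance(rule, Terminal):
        return rule.char
    if isinstance(rule, Concat):
        mid = lens[rule.left]
        return (extract(grammar, lens, rule.left, start, min(end, mid))
                + extract(grammar, lens, rule.right, max(start, mid) - mid, end - mid))
    # Run rule: locate the first and last copies of the child that are touched.
    block = lens[rule.child]
    first, last = start // block, (end - 1) // block
    if first == last:
        return extract(grammar, lens, rule.child, start - first * block, end - first * block)
    prefix = extract(grammar, lens, rule.child, start - first * block, block)
    suffix = extract(grammar, lens, rule.child, 0, end - last * block)
    middle = last - first - 1  # copies of the child covered in full
    whole = extract(grammar, lens, rule.child, 0, block) if middle > 0 else ""
    return prefix + whole * middle + suffix

# Toy grammar: S -> W X, W -> Z^3, Z -> X Y, X -> a, Y -> b; S derives "abababa".
grammar = {
    "X": Terminal("a"), "Y": Terminal("b"),
    "Z": Concat("X", "Y"), "W": Run("Z", 3), "S": Concat("W", "X"),
}
lens = expansion_lengths(grammar)
substring = extract(grammar, lens, "S", 1, 6)
```

In the run-length case only the two partially covered copies need a recursive descent; the fully covered copies in between are repetitions of the same child expansion.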
Taxonomy
Topics: Algorithms and Data Compression · Natural Language Processing Techniques
