
TL;DR
This paper introduces a new space-efficient compressed data structure for representing sets with gap encoding, supporting fast rank and select queries, and adapts to highly compressible data based on entropy measures.
Contribution
It proposes a novel zero-order compressed representation of sets that is space-efficient and supports efficient rank/select operations, improving over previous methods.
Findings
Achieves space close to the entropy of the gap stream
Supports rank and select in logarithmic time
Adapts well to highly compressible data
Abstract
In this paper, we consider the problem of efficiently representing a set of items out of a universe while supporting a number of operations on it. Let be the gap stream associated with , its bit-size when encoded with \emph{gap-encoding}, and its empirical zero-order entropy. We prove that (1) if is highly compressible, and (2) . Let be the number of \emph{distinct} gap lengths between elements in . We firstly propose a new space-efficient zero-order compressed representation of taking bits of space. Then, we describe a fully-indexable dictionary that supports \emph{rank} and \emph{select} queries in time while requiring asymptotically the same space as the proposed compressed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Data Storage Technologies · Cellular Automata and Applications
