TL;DR
SPIDER is a new succinct data structure that reduces space overhead to 3.82% while significantly improving the speed of rank and select queries on large bit vectors, bridging the performance gap with less space-efficient methods.
Contribution
The paper introduces SPIDER, a novel succinct rank/select data structure that achieves state-of-the-art query times with minimal space overhead, using innovative cache-efficient and prediction-based techniques.
Findings
SPIDER uses 3.82% extra space, the lowest among similar structures.
It outperforms existing structures in rank query speed for large datasets.
It significantly improves select query performance, narrowing the gap with less space-efficient methods.
Abstract
Rank and select data structures seek to preprocess a bit vector to quickly answer two kinds of queries: rank(i) gives the number of 1 bits in slots 0 through i, and select(j) gives the first slot s with rank(s) = j. A succinct data structure can answer these queries while using space much smaller than the size of the original bit vector. State of the art succinct rank and select data structures use as little as 4% extra space while answering rank and select queries quickly. Rank queries can be answered using only a handful of array accesses. Select queries can be answered by starting with similar array accesses, followed by a linear scan. Despite these strong results, a tradeoff remains: data structures that use under 4% space are significantly slower at answering rank and select queries than less-space-efficient data structures (using, say, > 20% extra space). In this paper we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
