SPIDER: Improved Succinct Rank and Select Performance

Matthew D. Laws; Jocelyn Bliven; Kit Conklin; Elyes Laalai; Samuel; McCauley; Zach S. Sturdevant

arXiv:2405.05214·cs.DS·May 9, 2024

SPIDER: Improved Succinct Rank and Select Performance

Matthew D. Laws, Jocelyn Bliven, Kit Conklin, Elyes Laalai, Samuel, McCauley, Zach S. Sturdevant

PDF

1 Repo

TL;DR

SPIDER is a new succinct data structure that reduces space overhead to 3.82% while significantly improving the speed of rank and select queries on large bit vectors, bridging the performance gap with less space-efficient methods.

Contribution

The paper introduces SPIDER, a novel succinct rank/select data structure that achieves state-of-the-art query times with minimal space overhead, using innovative cache-efficient and prediction-based techniques.

Findings

01

SPIDER uses 3.82% extra space, the lowest among similar structures.

02

It outperforms existing structures in rank query speed for large datasets.

03

It significantly improves select query performance, narrowing the gap with less space-efficient methods.

Abstract

Rank and select data structures seek to preprocess a bit vector to quickly answer two kinds of queries: rank(i) gives the number of 1 bits in slots 0 through i, and select(j) gives the first slot s with rank(s) = j. A succinct data structure can answer these queries while using space much smaller than the size of the original bit vector. State of the art succinct rank and select data structures use as little as 4% extra space while answering rank and select queries quickly. Rank queries can be answered using only a handful of array accesses. Select queries can be answered by starting with similar array accesses, followed by a linear scan. Despite these strong results, a tradeoff remains: data structures that use under 4% space are significantly slower at answering rank and select queries than less-space-efficient data structures (using, say, > 20% extra space). In this paper we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

williams-cs/spider
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.