TL;DR
This paper introduces BuRR, a practical succinct retrieval data structure that achieves near-optimal space efficiency and high speed, outperforming traditional Bloom filters and previous retrieval methods in both space and time.
Contribution
The paper presents the first practical succinct retrieval data structure, BuRR, with extremely low space overhead and superior speed, improving upon prior theoretical and practical solutions.
Findings
BuRR achieves space overheads below 1% in experiments.
BuRR outperforms traditional Bloom filters in speed and space efficiency.
Homogeneous ribbon filter AMQs are simpler and faster with slightly larger space overhead.
Abstract
A retrieval data structure for a static function supports queries that return for any . Retrieval data structures can be used to implement a static approximate membership query data structure (AMQ), i.e., a Bloom filter alternative, with false positive rate . The information-theoretic lower bound for both tasks is bits. While succinct theoretical constructions using bits were known, these could not achieve very small overheads in practice because they have an unfavorable space--time tradeoff hidden in the asymptotic costs or because small overheads would only be reached for physically impossible input sizes. With bumped ribbon retrieval (BuRR), we present the first practical succinct retrieval data structure. In an extensive experimental evaluation BuRR achieves space overheads well below 1\,\% while being faster…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
