Memento Filter: A Fast, Dynamic, and Robust Range Filter

Navid Eslami; Niv Dayan

arXiv:2408.05625·cs.DS·October 29, 2024

Memento Filter: A Fast, Dynamic, and Robust Range Filter

Navid Eslami, Niv Dayan

PDF

1 Repo

TL;DR

The paper introduces Memento filter, a novel range filter that supports dynamic datasets, fast operations, and low false positive rates, making it suitable for real-time applications like B-Trees.

Contribution

Memento filter is the first range filter to combine dynamicity, efficiency, and robust false positive guarantees for any workload.

Findings

01

Achieves competitive false positive rates and performance.

02

Supports inserts, deletes, and dataset expansion.

03

Doubles range query throughput in B-Tree-based key-value store.

Abstract

Range filters are probabilistic data structures that answer approximate range emptiness queries. They aid in avoiding processing empty range queries and have use cases in many application domains such as key-value stores and social web analytics. However, current range filter designs do not support dynamically changing and growing datasets. Moreover, several of these designs also exhibit impractically high false positive rates under correlated workloads, which are common in practice. These impediments restrict the applicability of range filters across a wide range of use cases. We introduce Memento filter, the first range filter to offer dynamicity, fast operations, and a robust false positive rate guarantee for any workload. Memento filter partitions the key universe and clusters its keys according to this partitioning. For each cluster, it stores a fingerprint and a list of key…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

n3slami/Memento_Filter
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.