Bloom Filters, Adaptivity, and the Dictionary Problem

Michael A. Bender; Martin Farach-Colton; Mayank Goswami; Rob Johnson,; Samuel McCauley; and Shikha Singh

arXiv:1711.01616·cs.DS·August 28, 2018

Bloom Filters, Adaptivity, and the Dictionary Problem

Michael A. Bender, Martin Farach-Colton, Mayank Goswami, Rob Johnson,, Samuel McCauley, and Shikha Singh

PDF

1 Repo

TL;DR

This paper investigates the limitations and possibilities of adaptive approximate membership query data structures, demonstrating the impossibility of small adaptive AMQs and proposing an optimal, adaptive solution with efficient operations.

Contribution

It proves the impossibility of small adaptive AMQs and introduces a new adaptive AMQ design that is space-efficient and supports constant-time queries and updates.

Findings

01

Impossibility of small adaptive AMQs proved

02

Proposed adaptive AMQ with optimal space usage

03

Supports worst-case constant-time queries and updates

Abstract

The Bloom filter---or, more generally, an approximate membership query data structure (AMQ)---maintains a compact, probabilistic representation of a set S of keys from a universe U. An AMQ supports lookups, inserts, and (for some AMQs) deletes. A query for an x in S is guaranteed to return "present." A query for x not in S returns "absent" with probability at least 1-epsilon, where epsilon is a tunable false positive probability. If a query returns "present," but x is not in S, then x is a false positive of the AMQ. Because AMQs have a nonzero probability of false-positives, they require far less space than explicit set representations. AMQs are widely used to speed up dictionaries that are stored remotely (e.g., on disk/across a network). Most AMQs offer weak guarantees on the number of false positives they will return on a sequence of queries. The false-positive probability of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

djslzx/telescoping-filter
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.