Fingerprint Filters Are Optimal
William Kuszmaul, Jingxun Liang, Renfei Zhou

TL;DR
This paper proves a tight lower bound on the space complexity of dynamic filters with false-positive rate , showing that fingerprinting-based filters are optimal in space usage regardless of their operation time.
Contribution
It establishes a sharp lower bound matching the space used by fingerprint filters, resolving a decades-old open question about their optimality.
Findings
Proves a tight lower bound of n log ^{-1} + n log e - o(n) bits for dynamic filters.
Shows fingerprint filters are space-optimal among all dynamic filters.
Addresses a central open problem in the design of approximate membership data structures.
Abstract
Dynamic filters are data structures supporting approximate membership queries to a dynamic set of keys, allowing a small false-positive error rate , under insertions and deletions to the set . Essentially all known constructions for dynamic filters use a technique known as fingerprinting. This technique, which was first introduced by Carter et al. in 1978, inherently requires bits of space when . Whether or not this bound is optimal for all dynamic filters (rather than just for fingerprint filters) has remained for decades as one of the central open questions in the area. We resolve this question by proving a sharp lower bound of bits for , regardless of operation time.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Machine Learning and Algorithms · Advanced Steganography and Watermarking Techniques
