Extending the Applicability of Bloom Filters by Relaxing their Parameter Constraints
Paul Walther, Wejdene Mansour, Johann Maximilian Zollner, Martin Werner

TL;DR
This paper introduces Rational Bloom filters that relax traditional parameter constraints, enabling more flexible and optimized set membership queries in key-value stores.
Contribution
It proposes Rational Bloom filters allowing non-integer hash functions and Variably-Sized Block Bloom filters for flexible, efficient large-scale data storage.
Findings
Rational Bloom filters optimize false positive rates with non-integer hash functions.
Variably-Sized Block Bloom filters enable flexible filter sizes for large datasets.
The methods improve efficiency in key-value store applications.
Abstract
These days, Key-Value Stores are widely used for scalable data storage. In this environment, Bloom filter (BF) serves as an efficient probabilistic data structure for representing sets of keys. They allow for set membership queries with no false negatives and with the right choice of the main parameters - length of the BF, number of hash functions used to map an element to the array's indices, and the number of elements inserted - the false positive rate is optimized. However, the number of hash functions is constrained to integer values, and the length of a BF is usually chosen to be a power of two to allow for efficient modulo operations using binary arithmetic. In this paper, we relax these constraints by proposing the Rational Bloom filter, which allows for non-integer numbers of hash functions. This results in optimized fraction-of-zero values for a known number of elements to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Energy Efficient Wireless Sensor Networks · Machine Learning and ELM
