Rateless Bloom Filters: Set Reconciliation for Divergent Replicas with Variable-Sized Elements
Pedro Silva Gomes, Carlos Baquero

TL;DR
This paper introduces Rateless Bloom Filters, a dynamic set reconciliation method that efficiently handles variable-sized elements and large differences without prior parameter knowledge, reducing communication costs.
Contribution
The paper presents the Rateless Bloom Filter, a novel adaptive filter that optimally adjusts to set differences without prior knowledge, improving efficiency in set reconciliation.
Findings
Reduces communication cost by over 20% for Jaccard indices below 85%.
Effectively handles variable-sized elements and large set differences.
Outperforms state-of-the-art protocols in diverse scenarios.
Abstract
Set reconciliation protocols typically make two critical assumptions: they are designed for fixed-sized elements and they are optimized for when the difference cardinality, d, is very small. When adapting to variable-sized elements, the current practice is to synchronize fixed-size element digests. However, when the number of differences is considerable, such as after a network partition, this approach can be inefficient. Our solution is a two-stage hybrid protocol that introduces a preliminary Bloom filter step, specifically designed for this regime. The novelty of this approach, however, is in solving a core technical challenge: determining the optimal Bloom filter size without knowing d. Our solution is the Rateless Bloom Filter (RBF), a dynamic filter that naturally adapts to arbitrary symmetric differences, closely matching the communication complexity of an optimally configured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Caching and Content Delivery · Software System Performance and Reliability
