Balanced Allocations and Double Hashing
Michael Mitzenmacher

TL;DR
This paper investigates the effectiveness of double hashing in balanced allocation schemes, showing empirically that it performs nearly as well as fully random hashing and providing theoretical explanations for this behavior.
Contribution
It offers the first empirical and theoretical analysis demonstrating that double hashing is nearly as effective as fully random hashing in balanced allocation problems.
Findings
Double hashing performs similarly to fully random hashing in balanced allocation.
Empirical results show negligible performance difference between the two methods.
Theoretical analysis explains why double hashing is effective in this context.
Abstract
Double hashing has recently found more common usage in schemes that use multiple hash functions. In double hashing, for an item , one generates two hash values and , and then uses combinations for to generate multiple hash values from the initial two. We first perform an empirical study showing that, surprisingly, the performance difference between double hashing and fully random hashing appears negligible in the standard balanced allocation paradigm, where each item is placed in the least loaded of choices, as well as several related variants. We then provide theoretical results that explain the behavior of double hashing in this context.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Caching and Content Delivery · Advanced Image and Video Retrieval Techniques
