The Power of Two Choices with Simple Tabulation
Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Eva Rotenberg, and Mikkel Thorup

TL;DR
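This paper proves that simple tabulation hashing, an efficient constant-time hash function, achieves an O(lg lg n) maximum load with high probability in the two choices paradigm. Its expected maximum load matches that of fully random hash functions, but the sharper lg lg n + O(1) high-probability bound known for fully random hashing does not carry over.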
Contribution
It establishes that simple tabulation hashing provides high-probability bounds on maximum load in the two choices paradigm, matching fully random hashing in expectation.
Findings
Maximum load is O(lg lg n) with high probability using simple tabulation.
There is a concrete case where, unlike with fully random hashing, the maximum load with simple tabulation is not bounded by lg lg n + O(1) with high probability.
Expected maximum load remains lg lg n + O(1) with simple tabulation, as with fully random hashing.
Abstract
The power of two choices is a classic paradigm for load balancing when assigning m balls to n bins. When placing a ball, we pick two bins according to two hash functions h_0 and h_1, and place the ball in the least loaded bin. Assuming fully random hash functions, when m = O(n), Azar et al. [STOC'94] proved that the maximum load is lg lg n + O(1) with high probability. In this paper, we investigate the power of two choices when the hash functions h_0 and h_1 are implemented with simple tabulation, which is a very efficient hash function evaluated in constant time. Following their analysis of Cuckoo hashing [J.ACM'12], Pătraşcu and Thorup claimed that the expected maximum load with simple tabulation is O(lg lg n). This did not include any high probability guarantee, so the load balancing was not yet to be trusted. Here, we show that with simple tabulation,…
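The scheme described in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation. The names `make_simple_tabulation` and `two_choice_max_load`, the 32-bit key split into four 8-bit characters, the reduction to a bin index with `%`, and the tie-breaking rule are all assumptions chosen for the example. Simple tabulation here means: split the key into characters, look each character up in its own table of random values, and XOR the results.

```python
import random


def make_simple_tabulation(num_chars=4, char_bits=8, out_bits=32, rng=None):
    """Build one simple tabulation hash function.

    Keys are treated as num_chars characters of char_bits bits each
    (assumed layout: 4 x 8 bits = 32-bit keys).
    """
    rng = rng or random.Random()
    # One independent table of random out_bits-bit values per character position.
    tables = [
        [rng.getrandbits(out_bits) for _ in range(1 << char_bits)]
        for _ in range(num_chars)
    ]
    mask = (1 << char_bits) - 1

    def h(key):
        value = 0
        for table in tables:
            value ^= table[key & mask]  # look up the current character
            key >>= char_bits           # move on to the next character
        return value

    return h


def two_choice_max_load(keys, num_bins, h0, h1):
    """Place each key in the less loaded of its two candidate bins."""
    loads = [0] * num_bins
    for key in keys:
        # Assumption: reduce hash values to bin indices with a modulus.
        b0 = h0(key) % num_bins
        b1 = h1(key) % num_bins
        # Assumption: ties go to the bin chosen by h0.
        target = b0 if loads[b0] <= loads[b1] else b1
        loads[target] += 1
    return max(loads), loads


# Example run with m = n = 1024 balls and bins (hypothetical parameters).
rng = random.Random(2016)
h0 = make_simple_tabulation(rng=rng)
h1 = make_simple_tabulation(rng=rng)
max_load, _ = two_choice_max_load(range(1024), 1024, h0, h1)
print("maximum load:", max_load)
```

Each hash evaluation costs one table lookup and one XOR per character, which is the constant-time evaluation the abstract refers to. A single run like this only shows one outcome and says nothing about the high-probability bounds themselves.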
Taxonomy
Topics: Algorithms and Data Compression · graph theory and CDMA systems · Coding theory and cryptography
