Power of $d$ Choices with Simple Tabulation
Anders Aamand, Mathias B{\ae}k Tejs Knudsen, Mikkel Thorup

TL;DR
This paper demonstrates that simple tabulation hashing maintains the optimal maximum load bounds in the $d$-choice paradigm, matching fully random hash functions, and introduces a simpler proof technique for related hashing analyses.
Contribution
It extends the analysis of simple tabulation hashing to the $d$-choice setting and provides a shorter, more accessible proof that generalizes previous results.
Findings
Maximum load is $O( ext{lg lg } n)$ whp with simple tabulation.
Expected maximum load matches fully random bounds, at $rac{ ext{lg lg } n}{ ext{lg } d}+O(1)$.
Tie-breaking reduces expected maximum load further to $rac{ ext{lg lg } n}{d ext{ lg } _d}+O(1)$.
Abstract
Suppose that we are to place balls into bins sequentially using the -choice paradigm: For each ball we are given a choice of bins, according to hash functions and we place the ball in the least loaded of these bins breaking ties arbitrarily. Our interest is in the number of balls in the fullest bin after all balls have been placed. Azar et al. [STOC'94] proved that when and when the hash functions are fully random the maximum load is at most whp (i.e. with probability for any choice of ). In this paper we suppose that the are simple tabulation hash functions. Generalising a result by Dahlgaard et al [SODA'16] we show that for an arbitrary constant the maximum load is whp, and that expected maximum load is at most $\frac{\lg \lg n}{\lg…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
