Power of $d$ Choices with Simple Tabulation

Anders Aamand; Mathias B{\ae}k Tejs Knudsen; Mikkel Thorup

arXiv:1804.09684·cs.DS·April 26, 2018

Power of $d$ Choices with Simple Tabulation

Anders Aamand, Mathias B{\ae}k Tejs Knudsen, Mikkel Thorup

PDF

TL;DR

This paper demonstrates that simple tabulation hashing maintains the optimal maximum load bounds in the $d$-choice paradigm, matching fully random hash functions, and introduces a simpler proof technique for related hashing analyses.

Contribution

It extends the analysis of simple tabulation hashing to the $d$-choice setting and provides a shorter, more accessible proof that generalizes previous results.

Findings

01

Maximum load is $O( ext{lg lg } n)$ whp with simple tabulation.

02

Expected maximum load matches fully random bounds, at $rac{ ext{lg lg } n}{ ext{lg } d}+O(1)$.

03

Tie-breaking reduces expected maximum load further to $rac{ ext{lg lg } n}{d ext{ lg } _d}+O(1)$.

Abstract

Suppose that we are to place $m$ balls into $n$ bins sequentially using the $d$ -choice paradigm: For each ball we are given a choice of $d$ bins, according to $d$ hash functions $h_{1}, \dots, h_{d}$ and we place the ball in the least loaded of these bins breaking ties arbitrarily. Our interest is in the number of balls in the fullest bin after all $m$ balls have been placed. Azar et al. [STOC'94] proved that when $m = O (n)$ and when the hash functions are fully random the maximum load is at most $\frac{l g l g n}{l g d} + O (1)$ whp (i.e. with probability $1 - O (n^{- γ})$ for any choice of $γ$ ). In this paper we suppose that the $h_{1}, \dots, h_{d}$ are simple tabulation hash functions. Generalising a result by Dahlgaard et al [SODA'16] we show that for an arbitrary constant $d \geq 2$ the maximum load is $O (l g l g n)$ whp, and that expected maximum load is at most $\frac{\lg \lg n}{\lg…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.