Robin Hood Hashing really has constant average search cost and variance in full tables
Patricio V. Poblete, Alfredo Viola

TL;DR
This paper proves that Robin Hood hashing maintains a bounded, small variance in search cost even at high load factors, ensuring expected constant-time searches in full tables through a novel differential equation approach.
Contribution
Introduces a differential equation method to analyze Robin Hood hashing, proving bounded variance and expected constant search time near full table capacity.
Findings
Variance of search cost is bounded by a small constant near full load.
Expected search time remains constant even with deletions and high load factors.
The approach applies to various collision resolution strategies, including FCFS and LCFS.
Abstract
Thirty years ago, the Robin Hood collision resolution strategy was introduced for open addressing hash tables, and a recurrence equation was found for the distribution of its search cost. Although this recurrence could not be solved analytically, it allowed for numerical computations that, remarkably, suggested that the variance of the search cost approached a value of when the table was full. Furthermore, by using a non-standard mean-centered search algorithm, this would imply that searches could be performed in expected constant time even in a full table. In spite of the time elapsed since these observations were made, no progress has been made in proving them. In this paper we introduce a technique to work around the intractability of the recurrence equation by solving instead an associated differential equation. While this does not provide an exact solution, it is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Caching and Content Delivery · Optimization and Search Problems
