Modern Minimal Perfect Hashing: A Survey
Hans-Peter Lehmann, Thomas Mueller, Rasmus Pagh, Giulio Ermanno Pibiri, Peter Sanders, Sebastiano Vigna, Stefan Walzer

TL;DR
This survey reviews recent advances in minimal perfect hashing, highlighting improvements in space efficiency, construction and query speed, and practical applications across various fields.
Contribution
It provides a comprehensive overview of modern minimal perfect hash functions, including recent developments, experimental evaluations, and practical guidance for application selection.
Findings
Modern perfect hash functions are extremely fast and space-efficient.
Recent methods achieve near-optimal space usage within 0.1% of the lower bound.
Experimental results guide practical selection of hash functions.
Abstract
Given a set of keys, a perfect hash function for maps the keys in to the first integers without collisions. It may return an arbitrary result for any key not in and is called minimal if . The most important parameters are its space consumption, construction time, and query time. Years of research now enable modern perfect hash functions to be extremely fast to query, very space-efficient, and scale to billions of keys. Different approaches give different trade-offs between these aspects. For example, the smallest constructions get within 0.1% of the space lower bound of bits per key. Others are particularly fast to query, requiring only one memory access. Perfect hashing has many applications, for example to avoid collision resolution in static hash tables, and is used in databases, bioinformatics, and stringology. Since the last…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Cryptographic Implementations and Security · Advanced Image and Video Retrieval Techniques
