Quicksort, Largest Bucket, and Min-Wise Hashing with Limited Independence
Mathias B{\ae}k Tejs Knudsen, Morten St\"ockel

TL;DR
This paper analyzes the performance of classic algorithms like quicksort and hashing under limited independence, providing new bounds and tighter analyses for their expected behavior with fewer randomness assumptions.
Contribution
It improves bounds for randomized quicksort, establishes tight bounds for min-wise hashing, and generalizes largest bucket size analysis under limited independence.
Findings
Quicksort with 4-independence achieves O(n log n) expected time.
Tight bounds for min-wise hashing with 3- and 4-independence.
Largest bucket size is Ω(n^{1/k}) for k-independent hash functions.
Abstract
Randomized algorithms and data structures are often analyzed under the assumption of access to a perfect source of randomness. The most fundamental metric used to measure how "random" a hash function or a random number generator is, is its independence: a sequence of random variables is said to be -independent if every variable is uniform and every size subset is independent. In this paper we consider three classic algorithms under limited independence. We provide new bounds for randomized quicksort, min-wise hashing and largest bucket size under limited independence. Our results can be summarized as follows. -Randomized quicksort. When pivot elements are computed using a -independent hash function, Karloff and Raghavan, J.ACM'93 showed expected worst-case running time for a special version of quicksort. We improve upon this, showing that the same running…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Optimization and Search Problems · Advanced Image and Video Retrieval Techniques
