On the Problem of $p_1^{-1}$ in Locality-Sensitive Hashing
Thomas Dybdahl Ahle

TL;DR
This paper improves the efficiency of locality-sensitive hashing algorithms by reducing query time and space complexity, ensuring sublinear performance even when collision probabilities are very low, which is common in high-dimensional data.
Contribution
The authors propose a modification to the Indyk-Motwani LSH algorithm that guarantees sublinear query time for any collision probability above 1/n, significantly enhancing performance in high-dimensional nearest neighbor searches.
Findings
Achieves sublinear query time for collision probabilities > 1/n
Reduces query time and space complexity by up to a factor of n
Easily implementable modification to existing LSH algorithms
Abstract
A Locality-Sensitive Hash (LSH) function is called -sensitive, if two data-points with a distance less than collide with probability at least while data points with a distance greater than collide with probability at most . These functions form the basis of the successful Indyk-Motwani algorithm (STOC 1998) for nearest neighbour problems. In particular one may build a -approximate nearest neighbour data structure with query time where . That is, sub-linear time, as long as is not too small. This is significant since most high dimensional nearest neighbour problems suffer from the curse of dimensionality, and can't be solved exact, faster than a brute force linear-time scan of the database. Unfortunately, the best LSH functions tend to have very low collision probabilities,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
