DB-LSH: Locality-Sensitive Hashing with Query-based Dynamic Bucketing
Yao Tian, Xi Zhao, Xiaofang Zhou

TL;DR
DB-LSH introduces a dynamic, query-based bucketing approach in locality-sensitive hashing, significantly reducing space and improving efficiency for high-dimensional approximate nearest neighbor search.
Contribution
It proposes a novel LSH scheme using multi-dimensional indexes and query-based hypercubic buckets, reducing space costs and achieving smaller query complexity.
Findings
Outperforms state-of-the-art methods in efficiency and accuracy
Achieves smaller query cost with theoretical guarantees
Effectively handles large high-dimensional datasets
Abstract
Among many solutions to the high-dimensional approximate nearest neighbor (ANN) search problem, locality sensitive hashing (LSH) is known for its sub-linear query time and robust theoretical guarantee on query accuracy. Traditional LSH methods can generate a small number of candidates quickly from hash tables but suffer from large index sizes and hash boundary problems. Recent studies to address these issues often incur extra overhead to identify eligible candidates or remove false positives, making query time no longer sub-linear. To address this dilemma, in this paper we propose a novel LSH scheme called DB-LSH which supports efficient ANN search for large high-dimensional datasets. It organizes the projected spaces with multi-dimensional indexes rather than using fixed-width hash buckets. Our approach can significantly reduce the space cost as by avoiding the need to maintain many…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Advanced Neural Network Applications
