Effective and General Distance Computation for Approximate Nearest Neighbor Search
Mingyu Yang, Wentao Li, Jiabao Jin, Xiaoyao Zhong, Xiangyu Wang,, Zhitao Shen, Wei Jia, Wei Wang

TL;DR
This paper introduces a novel data-driven distance computation method for approximate nearest neighbor search that outperforms existing approaches in speed and accuracy by leveraging data distribution and orthogonal projection.
Contribution
The paper proposes a new distance computation technique that improves effectiveness and generality over ADSampling by using data distribution and decoupled correction.
Findings
Achieves 1.6 to 2.1 times speedup over ADSampling
Provides higher accuracy in real-world datasets
Demonstrates superior performance through extensive experiments
Abstract
Approximate K Nearest Neighbor (AKNN) search in high-dimensional spaces is a critical yet challenging problem. In AKNN search, distance computation is the core task that dominates the runtime. Existing approaches typically use approximate distances to improve computational efficiency, often at the cost of reduced search accuracy. To address this issue, the state-of-the-art method, ADSampling, employs random projections to estimate approximate distances and introduces an additional distance correction process to mitigate accuracy loss. However, ADSampling has limitations in both effectiveness and generality, primarily due to its reliance on random projections for distance approximation and correction. To address the effectiveness limitations of ADSampling, we leverage data distribution to improve distance computation via orthogonal projection. Furthermore, to overcome the generality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Face and Expression Recognition · Advanced Image and Video Retrieval Techniques
