Dynamically Detect and Fix Hardness for Efficient Approximate Nearest Neighbor Search
Zhiyuan Hua, Qiji Mo, Zebin Yao, Lixiao Cui, Xiaoguang Liu, Gang Wang, Zijing Wei, Xinyu Liu, Tianxiao Tang, Shaozhi Liu, Lin Qu

TL;DR
This paper introduces a dynamic method to detect and fix graph-based structures in Approximate Nearest Neighbor Search, significantly improving accuracy and speed, especially for out-of-distribution queries.
Contribution
It proposes Escape Hardness metric and a two-stage graph fixing approach to enhance ANNS performance and adaptability over existing methods.
Findings
Achieves up to 2.25x faster search for OOD queries at 99% recall.
Speeds up index construction by 2.35-9.02x compared to RoarGraph.
Outperforms state-of-the-art methods on real-world datasets.
Abstract
Approximate Nearest Neighbor Search (ANNS) has become a fundamental component in many real-world applications. Among various ANNS algorithms, graph-based methods are state-of-the-art. However, ANNS often suffers from a significant drop in accuracy for certain queries, especially in Out-of-Distribution (OOD) scenarios. To address this issue, a recent approach named RoarGraph constructs a bipartite graph between the base data and historical queries to bridge the gap between two different distributions. However, it suffers from some limitations: (1) Building a bipartite graph between two distributions lacks theoretical support, resulting in the query distribution not being effectively utilized by the graph index. (2) Requires a sufficient number of historical queries before graph construction and suffers from high construction times. (3) When the query workload changes, it requires…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
