Multiple Index Merge for Approximate Nearest Neighbor Search
Liuchang Jing, Mingyu Yang, Lei Li, Jianbin Qin, Wei Wang

TL;DR
This paper introduces an efficient method for merging multiple approximate nearest neighbor indexes, significantly speeding up search and merging processes while maintaining high accuracy in large-scale high-dimensional data.
Contribution
We propose a novel reverse neighbor sliding merge (RNSM) technique and merge order selection (MOS) to improve multi-index AKNN search efficiency and scalability.
Findings
Up to 5.48× speedup over existing merge methods
Achieves 9.92× faster index reconstruction
Scales efficiently to 100 million vectors
Abstract
Approximate nearest neighbor (AKNN) search in high-dimensional space is a foundational problem in vector databases with widespread applications. Among the numerous AKNN indexes, Proximity Graph-based indexes achieve state-of-the-art search efficiency across various benchmarks. However, their extensive distance computations of high-dimensional vectors lead to slow construction and substantial memory overhead. The limited memory capacity often prevents building the entire index at once when handling large-scale datasets. A common practice is to build multiple sub-indexes separately. However, directly searching on these separated indexes severely compromises search efficiency, as queries cannot leverage cross-graph connections. Therefore, efficient graph index merging is crucial for multi-index searching. In this paper, we focus on efficient two-index merging and the merge order of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Image and Video Retrieval Techniques · Graph Theory and Algorithms
