FGIM: a Fast Graph-based Indexes Merging Framework for Approximate Nearest Neighbor Search
Zekai Wu, Jiabao Jin, Peng Cheng, Xiaoyao Zhong, Lei Chen, Yongxin Tong, Zhitao Shen, Jingkuan Song, Heng Tao Shen, Xuemin Lin

TL;DR
This paper introduces FGIM, a framework for efficiently merging multiple graph-based indexes in high-dimensional approximate nearest neighbor search, significantly speeding up index construction while maintaining search accuracy.
Contribution
The paper presents a novel FGIM framework that effectively merges existing graph-based indexes, enhancing speed and scalability in high-dimensional ANNS applications.
Findings
Achieves up to 3.5× speedup over HNSW's incremental construction.
Attains an average of 7.9× speedup for non-incremental methods.
Maintains comparable or better search performance.
Abstract
As the state-of-the-art methods for high-dimensional data retrieval, Approximate Nearest Neighbor Search (ANNS) approaches with graph-based indexes have attracted increasing attention and play a crucial role in many real-world applications, e.g., retrieval-augmented generation (RAG) and recommendation systems. Unlike the extensive works focused on designing efficient graph-based ANNS methods, this paper delves into merging multiple existing graph-based indexes into a single one, which is also crucial in many real-world scenarios (e.g., cluster consolidation in distributed systems and read-write contention in real-time vector databases). We propose a Fast Graph-based Indexes Merging (FGIM) framework with three core techniques: (1) Proximity Graphs (PGs) to Nearest Neighbor Graph (-NNG) transformation used to extract potential candidate neighbors from input graph-based indexes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Image and Video Retrieval Techniques · Data Management and Algorithms
