Unleashing Graph Partitioning for Large-Scale Nearest Neighbor Search
Lars Gottesb\"uren, Laxman Dhulipala, Rajesh Jayaram, Jakub Lacki

TL;DR
This paper introduces efficient, modular routing algorithms for large-scale approximate nearest neighbor search that leverage balanced graph partitioning, significantly improving query throughput and recall on billion-scale datasets.
Contribution
It presents the first routing methods compatible with balanced graph partitioning, enabling high-quality, fast, and scalable distributed ANNS systems.
Findings
Achieves up to 2.14x higher QPS at 90% recall@10
Supports any partitioning method due to modular routing algorithms
Outperforms existing methods on billion-scale datasets
Abstract
We consider the fundamental problem of decomposing a large-scale approximate nearest neighbor search (ANNS) problem into smaller sub-problems. The goal is to partition the input points into neighborhood-preserving shards, so that the nearest neighbors of any point are contained in only a few shards. When a query arrives, a routing algorithm is used to identify the shards which should be searched for its nearest neighbors. This approach forms the backbone of distributed ANNS, where the dataset is so large that it must be split across multiple machines. In this paper, we design simple and highly efficient routing methods, and prove strong theoretical guarantees on their performance. A crucial characteristic of our routing algorithms is that they are inherently modular, and can be used with any partitioning method. This addresses a key drawback of prior approaches, where the routing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Graph Theory and Algorithms · Optimization and Search Problems
