Efficiently Constructing Sparse Navigable Graphs
Alex Conway, Laxman Dhulipala, Martin Farach-Colton, Rob Johnson, Ben Landrum, Christopher Musco, Yarin Shechter, Torsten Suel, Richard Wen

TL;DR
This paper introduces a fast, provably efficient algorithm for constructing sparse navigable graphs used in nearest neighbor search, improving over heuristics with theoretical guarantees and near-optimal runtime.
Contribution
It presents the first provably efficient algorithms for constructing sparse navigable graphs with near-linear time complexity and approximation guarantees, leveraging set cover techniques and problem-specific optimizations.
Findings
Achieves $ ilde{O}(n^2)$ time for near-optimal sparse graph construction.
Proves that better than $O( ext{log } n)$ approximation is NP-hard.
Extends approach to construct $ au$-monotonic and $ au$-shortcut graphs efficiently.
Abstract
Graph-based nearest neighbor search methods have seen a surge of popularity in recent years, offering state-of-the-art performance across a wide variety of applications. Central to these methods is the task of constructing a sparse navigable search graph for a given dataset endowed with a distance function. Unfortunately, doing so is computationally expensive, so heuristics are universally used in practice. In this work, we initiate the study of fast algorithms with provable guarantees for search graph construction. For a dataset with data points, the problem of constructing an optimally sparse navigable graph can be framed as separate but highly correlated minimum set cover instances. This yields a naive time greedy algorithm that returns a navigable graph whose sparsity is at most higher than optimal. We improve significantly on this baseline, taking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Geographic Information Systems Studies · Constraint Satisfaction and Optimization
