Sparse Navigable Graphs for Nearest Neighbor Search: Algorithms and Hardness
Sanjeev Khanna, Ashwin Padaki, Erik Waingarten

TL;DR
This paper studies the construction of sparse navigable graphs for efficient nearest neighbor search, revealing computational hardness, establishing approximation algorithms, and providing lower bounds on query complexity.
Contribution
It introduces approximation algorithms for sparse navigable graphs, proves their near-optimality, and establishes hardness results linking the problem to Set Cover.
Findings
DiskANN can produce solutions much less sparse than optimal.
An $O(n^3)$-time $( ext{ln} n + 1)$-approximation algorithm is developed.
Any $o(n)$-approximation requires examining $ ext{Omega}(n^2)$ distances.
Abstract
We initiate the study of approximation algorithms and computational barriers for constructing sparse -navigable graphs [IX23, DGM+24], a core primitive underlying recent advances in graph-based nearest neighbor search. Given an -point dataset with an associated metric and a parameter , the goal is to efficiently build the sparsest graph that is -navigable: for every distinct , there exists an edge with . We consider two natural sparsity objectives: minimizing the maximum out-degree and minimizing the total size. We first show a strong negative result: the slow-preprocessing version of DiskANN (analyzed in [IX23] for low-doubling metrics) can yield solutions whose sparsity is times larger than optimal, even on Euclidean instances.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Complexity and Algorithms in Graphs · Data Management and Algorithms
