Graph-based time-space trade-offs for approximate near neighbors

Thijs Laarhoven

arXiv:1712.03158·cs.DS·October 4, 2019

TL;DR

This paper takes a first step towards a rigorous asymptotic analysis of graph-based approximate nearest neighbor search. It shows that for small approximation factors and near-linear memory, greedy graph-based search is competitive with hash-based methods, and it studies how the trade-offs scale for large data sets.

Contribution

The paper gives a formal complexity analysis of (randomized) greedy walks on the approximate near neighbor graph, derives a trade-off between query time and space, and compares it with hash-based approaches.
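To make the object of the analysis concrete, here is a minimal sketch of a randomized greedy walk on a near neighbor graph over points on the unit sphere. It is an illustration under stated assumptions, not the paper's exact procedure: the function names, the inner-product threshold `min_dot`, the brute-force graph construction, and the restart strategy are all hypothetical choices made for this example.

```python
import math
import random


def random_unit_vector(d, rng):
    """Sample a point uniformly from the d-dimensional unit sphere."""
    v = [rng.gauss(0.0, 1.0) for _ in range(d)]
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def build_near_neighbor_graph(points, min_dot):
    """Connect every pair with inner product >= min_dot (assumed threshold).

    Brute force, quadratic in the number of points; for illustration only.
    """
    graph = [[] for _ in points]
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if dot(points[i], points[j]) >= min_dot:
                graph[i].append(j)
                graph[j].append(i)
    return graph


def greedy_walk(points, graph, query, start):
    """Step to the neighbor closest to the query until none improves.

    On the unit sphere a larger inner product means a smaller distance.
    """
    current, current_dot = start, dot(points[start], query)
    while True:
        best, best_dot = current, current_dot
        for nb in graph[current]:
            nb_dot = dot(points[nb], query)
            if nb_dot > best_dot:
                best, best_dot = nb, nb_dot
        if best == current:
            return current  # local optimum: no neighbor is closer
        current, current_dot = best, best_dot


def randomized_greedy_search(points, graph, query, restarts, rng):
    """Run greedy walks from random start vertices and keep the best end point."""
    best, best_dot = None, -math.inf
    for _ in range(restarts):
        end = greedy_walk(points, graph, query, rng.randrange(len(points)))
        end_dot = dot(points[end], query)
        if end_dot > best_dot:
            best, best_dot = end, end_dot
    return best
```

Each walk terminates because the inner product with the query strictly increases at every step. The restarts are one simple way to escape poor local optima; how many are needed, and how dense the graph must be, is the kind of question the asymptotic analysis addresses.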

Findings

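01. For small approximation factors $c$ and near-linear memory, greedy graph-based search is competitive with hash-based trade-offs.

02. Query time $n^{\rho_q + o(1)}$ and space $n^{1 + \rho_s + o(1)}$ are linked by a trade-off between $\rho_q$ and $\rho_s$ that depends on the approximation factor $c$.

03. Scalability is analyzed for data sets of size exponential in the dimension.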

Abstract

We take a first step towards a rigorous asymptotic analysis of graph-based approaches for finding (approximate) nearest neighbors in high-dimensional spaces, by analyzing the complexity of (randomized) greedy walks on the approximate near neighbor graph. For random data sets of size $n = 2^{o(d)}$ on the $d$-dimensional Euclidean unit sphere, using near neighbor graphs we can provably solve the approximate nearest neighbor problem with approximation factor $c > 1$ in query time $n^{\rho_q + o(1)}$ and space $n^{1 + \rho_s + o(1)}$, for arbitrary $\rho_q, \rho_s \geq 0$ satisfying
\begin{align} (2c^2 - 1) \rho_q + 2 c^2 (c^2 - 1) \sqrt{\rho_s (1 - \rho_s)} \geq c^4. \end{align}
Graph-based near neighbor searching is especially competitive with hash-based methods for small $c$ and near-linear memory, and in this regime the asymptotic scaling of a greedy graph-based search matches the…
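The trade-off inequality can be evaluated numerically. The sketch below checks whether a pair $(\rho_q, \rho_s)$ satisfies the inequality exactly as printed in the abstract, and solves it for the boundary value of $\rho_q$ at a given $\rho_s$. The function names are hypothetical, and restricting $\rho_s$ to $[0, 1]$ is an assumption made here only so that the square root is real; the code makes no claim about the range in which the bound is meaningful.

```python
import math


def tradeoff_lhs(c, rho_q, rho_s):
    """Left-hand side of the inequality as stated in the abstract."""
    return (2 * c**2 - 1) * rho_q + 2 * c**2 * (c**2 - 1) * math.sqrt(
        rho_s * (1 - rho_s)
    )


def satisfies_tradeoff(c, rho_q, rho_s):
    """True if (rho_q, rho_s) satisfies the stated inequality for factor c."""
    if c <= 1:
        raise ValueError("approximation factor c must exceed 1")
    if rho_q < 0 or not 0 <= rho_s <= 1:
        raise ValueError("need rho_q >= 0 and 0 <= rho_s <= 1 (assumed range)")
    return tradeoff_lhs(c, rho_q, rho_s) >= c**4


def min_query_exponent(c, rho_s):
    """Smallest rho_q >= 0 on the boundary of the stated inequality."""
    if c <= 1:
        raise ValueError("approximation factor c must exceed 1")
    if not 0 <= rho_s <= 1:
        raise ValueError("need 0 <= rho_s <= 1 (assumed range)")
    # Solve (2c^2 - 1) * rho_q = c^4 - 2c^2 (c^2 - 1) sqrt(rho_s (1 - rho_s)).
    remainder = c**4 - 2 * c**2 * (c**2 - 1) * math.sqrt(rho_s * (1 - rho_s))
    return max(0.0, remainder / (2 * c**2 - 1))
```

For example, `min_query_exponent(c, rho_s)` can be tabulated over a grid of `rho_s` values to trace the boundary curve for a fixed `c`.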
