Approximate Nearest Neighbors in Limited Space

Piotr Indyk; Tal Wagner

arXiv:1807.00112·cs.DS·July 3, 2018·1 cites

Approximate Nearest Neighbors in Limited Space

Piotr Indyk, Tal Wagner

PDF

Open Access

TL;DR

This paper introduces a space-efficient data structure for approximate nearest neighbor search in high-dimensional integer spaces, significantly reducing memory usage while maintaining accuracy.

Contribution

It presents a novel data structure that uses substantially less space than previous methods for approximate nearest neighbor search in bounded integer spaces.

Findings

01

Achieves $O( ext{epsilon}^{-2} n ext{log}(n) ext{log}(1/ ext{epsilon}))$ bits of space

02

Improves upon the previous space bound of $O( ext{epsilon}^{-2} n ext{log}(n)^2)$

03

Provides bounds for the problem of estimating all distances from query points to data points

Abstract

We consider the $(1 + ϵ)$ -approximate nearest neighbor search problem: given a set $X$ of $n$ points in a $d$ -dimensional space, build a data structure that, given any query point $y$ , finds a point $x \in X$ whose distance to $y$ is at most $(1 + ϵ) min_{x \in X} ∥ x - y ∥$ for an accuracy parameter $ϵ \in (0, 1)$ . Our main result is a data structure that occupies only $O (ϵ^{- 2} n lo g (n) lo g (1/ ϵ))$ bits of space, assuming all point coordinates are integers in the range ${- n^{O (1)} \dots n^{O (1)}}$ , i.e., the coordinates have $O (lo g n)$ bits of precision. This improves over the best previously known space bound of $O (ϵ^{- 2} n lo g (n)^{2})$ , obtained via the randomized dimensionality reduction method of Johnson and Lindenstrauss (1984). We also consider the more general problem of estimating all distances from a collection of query points to all…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Optimization and Search Problems · Complexity and Algorithms in Graphs