Approximate Nearest Neighbors in Limited Space
Piotr Indyk, Tal Wagner

TL;DR
This paper introduces a space-efficient data structure for approximate nearest neighbor search in high-dimensional integer spaces, significantly reducing memory usage while maintaining accuracy.
Contribution
It presents a novel data structure that uses substantially less space than previous methods for approximate nearest neighbor search in bounded integer spaces.
Findings
Achieves $O( ext{epsilon}^{-2} n ext{log}(n) ext{log}(1/ ext{epsilon}))$ bits of space
Improves upon the previous space bound of $O( ext{epsilon}^{-2} n ext{log}(n)^2)$
Provides bounds for the problem of estimating all distances from query points to data points
Abstract
We consider the -approximate nearest neighbor search problem: given a set of points in a -dimensional space, build a data structure that, given any query point , finds a point whose distance to is at most for an accuracy parameter . Our main result is a data structure that occupies only bits of space, assuming all point coordinates are integers in the range , i.e., the coordinates have bits of precision. This improves over the best previously known space bound of , obtained via the randomized dimensionality reduction method of Johnson and Lindenstrauss (1984). We also consider the more general problem of estimating all distances from a collection of query points to all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Optimization and Search Problems · Complexity and Algorithms in Graphs
