TL;DR
This paper introduces a bandit-based Monte Carlo optimization algorithm for high-dimensional k-nearest neighbors, achieving logarithmic complexity in data dimension and outperforming existing methods in experiments.
Contribution
The paper develops a novel adaptive sampling algorithm for exact nearest neighbors with proven high-probability guarantees and improved complexity bounds.
Findings
Algorithm identifies exact neighbors with high probability.
Complexity scales logarithmically with data dimension.
Outperforms existing algorithms like kGraph, NGT, and LSH in experiments.
Abstract
The celebrated Monte Carlo method estimates an expensive-to-compute quantity by random sampling. Bandit-based Monte Carlo optimization is a general technique for computing the minimum of many such expensive-to-compute quantities by adaptive random sampling. The technique converts an optimization problem into a statistical estimation problem which is then solved via multi-armed bandits. We apply this technique to solve the problem of high-dimensional -nearest neighbors, developing an algorithm which we prove is able to identify exact nearest neighbors with high probability. We show that under regularity assumptions on a dataset of points in -dimensional space, the complexity of our algorithm scales logarithmically with the dimension of the data as for error probability , rather than linearly as in exact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
