From Average Embeddings To Nearest Neighbor Search
Alexandr Andoni, David Cheikhi

TL;DR
This paper demonstrates how average embeddings can be used to develop efficient approximate nearest neighbor search algorithms, extending classic embedding methods with a data-dependent hashing approach.
Contribution
It introduces a novel approach leveraging average embeddings to improve approximate nearest neighbor search, strengthening traditional bi-Lipschitz embedding techniques.
Findings
Embedding metric spaces into on average enables efficient approximate nearest neighbor search.
Existence of efficient average embeddings implies a polynomial approximation algorithm.
The approach enhances data-dependent hashing methods for similarity search.
Abstract
In this note, we show that one can use average embeddings, introduced recently in [Naor'20, arXiv:1905.01280], to obtain efficient algorithms for approximate nearest neighbor search. In particular, a metric embeds into on average, with distortion , if, for any distribution on , the embedding is Lipschitz and the (square of) distance does not decrease on average (wrt ). In particular existence of such an embedding (assuming it is efficient) implies a approximate nearest neighbor search under . This can be seen as a strengthening of the classic (bi-Lipschitz) embedding approach to nearest neighbor search, and is another application of data-dependent hashing paradigm.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Machine Learning and Algorithms · Optimization and Search Problems
