On Adaptive Distance Estimation
Yeshwanth Cherapanamjeri, Jelani Nelson

TL;DR
This paper introduces a new randomized data structure for distance estimation in high-dimensional spaces that guarantees accuracy even under adaptively chosen queries, enabling faster approximate nearest neighbor searches.
Contribution
It presents the first data structure supporting adaptive queries for distance estimation with high probability guarantees, low memory, and fast query times.
Findings
Supports $(1+psilon)$-approximate distance queries with high probability
Memory usage is near-linear in data size and dimension
Query time is significantly faster than naive linear scan
Abstract
We provide a static data structure for distance estimation which supports {\it adaptive} queries. Concretely, given a dataset of points in and , we construct a randomized data structure with low memory consumption and query time which, when later given any query point , outputs a -approximation of with high probability for all . The main novelty is our data structure's correctness guarantee holds even when the sequence of queries can be chosen adaptively: an adversary is allowed to choose the th query point in a way that depends on the answers reported by the data structure for . Previous randomized Monte Carlo methods do not provide error guarantees in the setting of adaptively chosen queries. Our memory consumption is $\tilde…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Optimization and Search Problems · Advanced Image and Video Retrieval Techniques
