On Adaptive Distance Estimation

Yeshwanth Cherapanamjeri; Jelani Nelson

arXiv:2010.11252·cs.DS·December 17, 2020·1 cites

TL;DR

This paper introduces a new randomized data structure for distance estimation in high-dimensional spaces that guarantees accuracy even under adaptively chosen queries, enabling faster approximate nearest neighbor searches.

Contribution

It presents the first data structure supporting adaptive queries for distance estimation with high probability guarantees, low memory, and fast query times.

Findings

01

Supports $(1 + \epsilon)$-approximate distance queries with high probability

02

Memory usage is near-linear in data size and dimension

03

Query time is significantly faster than naive linear scan

Abstract

We provide a static data structure for distance estimation which supports adaptive queries. Concretely, given a dataset $X = \{x_{i}\}_{i = 1}^{n}$ of $n$ points in $\mathbb{R}^{d}$ and $0 < p \leq 2$, we construct a randomized data structure with low memory consumption and query time which, when later given any query point $q \in \mathbb{R}^{d}$, outputs a $(1 + \epsilon)$-approximation of $\| q - x_{i} \|_{p}$ with high probability for all $i \in [n]$. The main novelty is that our data structure's correctness guarantee holds even when the sequence of queries can be chosen adaptively: an adversary is allowed to choose the $j$th query point $q_{j}$ in a way that depends on the answers reported by the data structure for $q_{1}, \dots, q_{j - 1}$. Previous randomized Monte Carlo methods do not provide error guarantees in the setting of adaptively chosen queries. Our memory consumption is $\tilde…
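To make the problem statement concrete, here is a minimal Python sketch of the setting for $p = 2$. It is not the paper's construction: it shows only the exact brute-force answer and a classical non-adaptive Gaussian random projection baseline, which is the kind of randomized Monte Carlo method the abstract contrasts against. The class name `GaussianSketch`, the helper `exact_distances`, and the sizes `n`, `d`, `k` are illustrative assumptions, not names or parameters from the paper.

```python
import math
import random


def exact_distances(points, q):
    """Brute-force Euclidean distances from q to every point: O(n * d) per query."""
    return [math.dist(x, q) for x in points]


class GaussianSketch:
    """Classical non-adaptive baseline for p = 2 (NOT the paper's data structure).

    Stores a k-dimensional random projection of each point. The accuracy
    guarantee assumes each query is chosen independently of the matrix S.
    """

    def __init__(self, points, k, rng):
        d = len(points[0])
        scale = 1.0 / math.sqrt(k)
        # k x d matrix of i.i.d. N(0, 1) entries, scaled by 1 / sqrt(k).
        self.S = [[rng.gauss(0.0, 1.0) * scale for _ in range(d)] for _ in range(k)]
        self.sketches = [self._project(x) for x in points]

    def _project(self, v):
        return [sum(s * vi for s, vi in zip(row, v)) for row in self.S]

    def query(self, q):
        """Estimate ||q - x_i||_2 for all i from the k-dimensional sketches."""
        sq = self._project(q)
        return [math.dist(sx, sq) for sx in self.sketches]


# Toy sizes, chosen only for illustration.
rng = random.Random(0)
n, d, k = 5, 200, 100
points = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
q = [rng.gauss(0.0, 1.0) for _ in range(d)]

sketch = GaussianSketch(points, k, rng)
estimates = sketch.query(q)
exact = exact_distances(points, q)
rel_errors = [abs(e - t) / t for e, t in zip(estimates, exact)]
print(max(rel_errors))
```

For a query drawn independently of `S`, the relative errors concentrate around zero as `k` grows. The baseline offers no such guarantee once a later query is allowed to depend on earlier answers, since those answers leak information about `S`; that gap is what the abstract's adaptive guarantee addresses.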

Videos

On Adaptive Distance Estimation · SlidesLive

Taxonomy

Topics: Machine Learning and Algorithms · Optimization and Search Problems · Advanced Image and Video Retrieval Techniques