Fast Distance Oracles for Any Symmetric Norm
Yichuan Deng, Zhao Song, Omri Weinstein, Ruizhe Zhang

TL;DR
This paper introduces a fast $(1+\varepsilon)$-approximate distance oracle for any symmetric norm, significantly improving efficiency for similarity search in high-dimensional spaces across various norms.
Contribution
It presents a novel data structure that provides efficient approximate distance queries for all symmetric norms, extending beyond $\ell_p$ norms to many practical norms.
Findings
Preprocessing time and space are $\tilde{O}(n(d + \mathrm{mmc}(l)^2))$.
Query time is $\tilde{O}(d + |S| \cdot \mathrm{mmc}(l)^2)$.
Matches state-of-the-art for $\ell_p$ norms when $l=\ell_p$.
Abstract
In the Distance Oracle problem, the goal is to preprocess vectors in a -dimensional metric space into a cheap data structure, so that given a query vector and a subset of the input data points, all distances for can be quickly approximated (faster than the trivial query time). This primitive is a basic subroutine in machine learning, data mining and similarity search applications. In the case of norms, the problem is well understood, and optimal data structures are known for most values of . Our main contribution is a fast distance oracle for any symmetric norm . This class includes norms and Orlicz norms as special cases, as well as other norms used in practice, e.g. top- norms, max-mixture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsData Management and Algorithms · Complexity and Algorithms in Graphs · Automated Road and Building Extraction
