Approximate Euclidean lengths and distances beyond Johnson-Lindenstrauss
Aleksandros Sobczyk, Mathieu Luisier

TL;DR
This paper introduces a new algorithm that improves the estimation of Euclidean lengths and distances in high-dimensional data, reducing the dependency on the error parameter from quadratic to linear, especially for matrices with decaying spectra.
Contribution
It presents an algorithm inspired by Hutch++, that achieves better accuracy with fewer queries for length and distance estimation, extending to leverage scores, surpassing standard Johnson-Lindenstrauss bounds.
Findings
Achieves $oldsymbol{ ext{O}(rac{1}{ ext{epsilon}})}$ query complexity for length estimation.
Provides element-wise probabilistic bounds comparable or better than JL in worst-case.
Extends results to distance and leverage score estimation with similar improvements.
Abstract
A classical result of Johnson and Lindenstrauss states that a set of high dimensional data points can be projected down to dimensions such that the square of their pairwise distances is preserved up to a small distortion . It has been proved that the JL lemma is optimal for the general case, therefore, improvements can only be explored for special cases. This work aims to improve the dependency based on techniques inspired by the Hutch++ Algorithm, which reduces to for the related problem of implicit matrix trace estimation. We first present an algorithm to estimate the Euclidean lengths of the rows of a matrix. We prove for it element-wise probabilistic bounds that are at least as good as standard JL approximations in the worst-case, but are asymptotically better for matrices with decaying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Distributed Sensor Networks and Detection Algorithms · Statistical Methods and Inference
