Near-Optimal (Euclidean) Metric Compression
Piotr Indyk, Tal Wagner

TL;DR
This paper improves the bounds on the size of sketches needed for near-optimal metric compression in Euclidean and general metrics, significantly reducing the amount of data required to approximate distances.
Contribution
The authors present tighter bounds for metric sketching, improving upon Johnson-Lindenstrauss bounds for Euclidean metrics and establishing optimal bounds for general metrics.
Findings
Euclidean metric sketch size reduced to $O(rac{1}{ ext{epsilon}^2} ext{log}(1/ ext{epsilon}) ext{log} n + ext{loglog} ext{Phi})$ bits per point.
Established that the new bounds are tight up to a $ ext{log}(1/ ext{epsilon})$ factor.
Provided an optimal sketch size of $O(n ext{log}(1/ ext{epsilon}) + ext{loglog} ext{Phi})$ bits per point for general metrics.
Abstract
The metric sketching problem is defined as follows. Given a metric on points, and , we wish to produce a small size data structure (sketch) that, given any pair of point indices, recovers the distance between the points up to a distortion. In this paper we consider metrics induced by and norms whose spread (the ratio of the diameter to the closest pair distance) is bounded by . A well-known dimensionality reduction theorem due to Johnson and Lindenstrauss yields a sketch of size , i.e., bits per point. We show that this bound is not optimal, and can be substantially improved to bits per point. Furthermore, we show that our bound is tight up to a factor of . We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Geometry and Mesh Generation · Digital Image Processing Techniques · Sparse and Compressive Sensing Techniques
