On constant factor approximation for earth mover distance over doubling metrics
Shi Li

TL;DR
This paper presents efficient algorithms and sketching schemes for approximating earth mover distance in metric spaces with bounded doubling dimension, achieving near-linear query times and controlled approximation factors.
Contribution
It introduces a data structure and encoding scheme for fast, approximate earth mover distance computations over doubling metrics, with provable efficiency and approximation guarantees.
Findings
Preprocessing time is near-quadratic in the size of the metric space.
Query time for EMD approximation is nearly linear in the size of the space.
The methods achieve an approximation factor proportional to the doubling dimension.
Abstract
Given a metric space , the earth mover distance between two distributions over is defined as the minimum cost of a bipartite matching between the two distributions. The doubling dimension of a metric is the smallest value such that every ball in can be covered by ball of half the radius. We study efficient algorithms for approximating earth mover distance over metrics with bounded doubling dimension. Given a metric , with , we can use preprocessing time to create a data structure of size , such that subsequently queried EMDs can be -approximated in time. We also show a weaker form of sketching scheme, which we call "encoding scheme". Given , by using preprocessing time, every subsequent distribution over can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Computational Geometry and Mesh Generation · Data Management and Algorithms
