New Streaming Algorithms for High Dimensional EMD and MST
Xi Chen, Rajesh Jayaram, Amit Levi, Erik Waingarten

TL;DR
This paper introduces efficient one- and two-pass streaming algorithms with polylogarithmic space for approximating the MST cost and Earth Mover Distance in high-dimensional data, improving previous approximation bounds.
Contribution
It presents new streaming algorithms with near-optimal approximation factors for high-dimensional MST and EMD, utilizing an improved Quadtree analysis.
Findings
Achieves $ ilde{O}( ext{log } n)$ approximation with polylogarithmic space.
Single-pass EMD algorithm with small additive error.
Proves lower bounds matching the approximation guarantees.
Abstract
We study streaming algorithms for two fundamental geometric problems: computing the cost of a Minimum Spanning Tree (MST) of an -point set , and computing the Earth Mover Distance (EMD) between two multi-sets of size . We consider the turnstile model, where points can be added and removed. We give a one-pass streaming algorithm for MST and a two-pass streaming algorithm for EMD, both achieving an approximation factor of and using polylog-space only. Furthermore, our algorithm for EMD can be compressed to a single pass with a small additive error. Previously, the best known sublinear-space streaming algorithms for either problem achieved an approximation of [Andoni-Indyk-Krauthgamer '08, Backurs-Dong-Indyk-Razenshteyn-Wagner '20].…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Geometry and Mesh Generation · Advanced Graph Theory Research · Data Management and Algorithms
