Round-efficient Fully-scalable MPC algorithms for k-Means
Shaofeng H.-C. Jiang, Yaonan Jin, Jianing Lou, Weicheng Wang

TL;DR
This paper presents a new fully-scalable MPC algorithm for Euclidean k-Means that achieves a near-optimal approximation in constant rounds, improving efficiency and breaking previous barriers.
Contribution
Introduces a novel variant of the MP algorithm with LMP guarantees under arbitrary distance distortions, enabling efficient MPC algorithms for clustering.
Findings
Achieves an O((log n / log log n)^2)-approximation in O(1) rounds.
Provides an O(log n / log log n)-approximation for k-Median, improving previous results.
Develops a new MP algorithm variant with LMP property under distance distortions.
Abstract
We study Euclidean -Means under the Massively Parallel Computation (MPC) model, focusing on the \emph{fully-scalable} setting. Our main result is a fully-scalable -approximation in rounds. Previously, fully-scalable algorithms for -Means either run in super-constant rounds, albeit with a better -approximation [Cohen-Addad et al., SODA'26], or suffer from bicriteria guarantees [Bhaskara and Wijewardena, ICML'18; Czumaj et al., ICALP'24]. Our algorithm also gives an -approximation for -Median, which improves a recent -approximation [Goranci et al., SODA'26], and this ratio breaks the fundamental barrier of tree embedding methods used therein. Our main technical contribution is a new variant of the MP algorithm [Mettu and Plaxton, SICOMP'03] that works for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
