An Efficient Massively Parallel Constant-Factor Approximation Algorithm for the $k$-Means Problem
Vincent Cohen-Addad, Fabian Kuhn, Zahra Parsaeian

TL;DR
This paper introduces a massively parallel algorithm that efficiently approximates the $k$-means problem within a constant factor in fewer than logarithmic rounds, using nearly linear global memory, advancing parallel clustering methods.
Contribution
It presents the first constant-factor approximation algorithm for the general $k$-means problem in o(log n) MPC rounds with nearly linear memory, building on facility location approximations with LMP properties.
Findings
Achieves $O(rac{ ext{loglog n} imes ext{logloglog n}})$ round complexity.
Uses $O(n^{ ext{small } ext{constant}})$ bits per machine, with nearly linear total memory.
Provides a constant-factor approximation with LMP property for facility location.
Abstract
In this paper, we present an efficient massively parallel approximation algorithm for the -means problem. Specifically, we provide an MPC algorithm that computes a constant-factor approximation to an arbitrary -means instance in rounds. The algorithm uses bits of memory per machine, where is a constant that can be made arbitrarily small. The global memory usage is bits for an arbitrarily small constant , and is thus only slightly superlinear. Recently, Czumaj, Gao, Jiang, Krauthgamer, and Vesel\'{y} showed that a constant-factor bicriteria approximation can be computed in rounds in the MPC model. However, our algorithm is the first constant-factor approximation for the general -means problem that runs in rounds in the MPC model. Our approach builds upon…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFacility Location and Emergency Management · Complexity and Algorithms in Graphs · Computational Geometry and Mesh Generation
