Near-Optimal Clustering in the $k$-machine model
Sayan Bandyapadhyay, Tanmay Inamdar, Shreyas Pai, Sriram V., Pemmaraju

TL;DR
This paper introduces near-optimal algorithms for three clustering problems in the $k$-machine model, achieving constant-factor approximations with optimal round complexity, and establishes matching lower bounds, marking the first such results in this model.
Contribution
It presents the first constant-factor approximation algorithms for clustering in the $k$-machine model with optimal round complexity, and proves matching lower bounds, advancing large-scale distributed clustering theory.
Findings
Algorithms achieve $O(1)$-factor approximation in $ ilde{O}(n/k)$ rounds.
Lower bounds show $ ilde{ ext{Omega}}(n/k)$ rounds are necessary for polynomial-factor approximations.
Main technical contribution: learning a small portion of the input metric suffices for approximation.
Abstract
The clustering problem, in its many variants, has numerous applications in operations research and computer science (e.g., in applications in bioinformatics, image processing, social network analysis, etc.). As sizes of data sets have grown rapidly, researchers have focused on designing algorithms for clustering problems in models of computation suited for large-scale computation such as MapReduce, Pregel, and streaming models. The -machine model (Klauck et al., SODA 2015) is a simple, message-passing model for large-scale distributed graph processing. This paper considers three of the most prominent examples of clustering problems: the uncapacitated facility location problem, the -median problem, and the -center problem and presents -factor approximation algorithms for these problems running in rounds in the -machine model. These algorithms are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
