Near-Optimal Bounds for Parameterized Euclidean k-means
Vincent Cohen-Addad, Karthik C. S., David Saulpic, and Chris Schwiegelshohn

TL;DR
This paper establishes tight computational bounds for approximating Euclidean k-means clustering, assuming a new hypothesis, and shows the optimality of existing algorithms under this assumption.
Contribution
It introduces the Exponential Time for Expanders Hypothesis (XXH) and proves tight lower bounds for approximation algorithms for Euclidean k-means, matching known upper bounds.
Findings
No faster approximation algorithms under XXH hypothesis.
The known algorithm by Feldman et al. is optimal assuming XXH.
The exact algorithm by Inaba et al. is optimal for small k under XXH.
Abstract
The -means problem is a classic objective for modeling clustering in a metric space. Given a set of points in a metric space, the goal is to find representative points so as to minimize the sum of the squared distances from each point to its closest representative. In this work, we study the approximability of -means in Euclidean spaces parameterized by the number of clusters, . In seminal works, de la Vega, Karpinski, Kenyon, and Rabani [STOC'03] and Kumar, Sabharwal, and Sen [JACM'10] showed how to obtain a -approximation for high-dimensional Euclidean -means in time . In this work, we introduce a new fine-grained hypothesis called Exponential Time for Expanders Hypothesis (XXH) which roughly asserts that there are no non-trivial exponential time approximation algorithms for the vertex cover problem on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
