The Hardness of Approximation of Euclidean k-means
Pranjal Awasthi, Moses Charikar, Ravishankar Krishnaswamy, Ali Kemal, Sinop

TL;DR
This paper establishes the first NP-hardness of approximation for the Euclidean k-means problem, showing it cannot be approximated within a certain factor unless P=NP, via a reduction from vertex cover on triangle-free graphs.
Contribution
It introduces the first hardness of approximation results for Euclidean k-means, connecting it to vertex cover hardness through novel graph product spectral analysis.
Findings
NP-hard to approximate k-means within (1+ε) factor
Reduction from vertex cover on triangle-free graphs
Spectral analysis of graph products preserves independence number
Abstract
The Euclidean -means problem is a classical problem that has been extensively studied in the theoretical computer science, machine learning and the computational geometry communities. In this problem, we are given a set of points in Euclidean space , and the goal is to choose centers in so that the sum of squared distances of each point to its nearest center is minimized. The best approximation algorithms for this problem include a polynomial time constant factor approximation for general and a -approximation which runs in time . At the other extreme, the only known computational complexity result for this problem is NP-hardness [ADHP'09]. The main difficulty in obtaining hardness results stems from the Euclidean nature of the problem, and the fact that any point in can be a potential center. This gap in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
