The Hardness of Approximation of Euclidean k-means

Pranjal Awasthi; Moses Charikar; Ravishankar Krishnaswamy; Ali Kemal Sinop

arXiv:1502.03316·cs.CC·February 12, 2015

TL;DR

This paper establishes the first hardness of approximation result for the Euclidean k-means problem: for some constant ε > 0, it is NP-hard to approximate the k-means objective within a factor of (1+ε). The proof is a reduction from vertex cover on triangle-free graphs.

Contribution

It gives the first hardness of approximation result for Euclidean k-means, tying it to the hardness of vertex cover on triangle-free graphs, which is in turn established through a spectral analysis of graph products.

Findings

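01. It is NP-hard to approximate Euclidean k-means within a factor of (1+ε) for some constant ε > 0

02. The hardness follows from a reduction from vertex cover on triangle-free graphs

03. A spectral analysis shows that a suitable graph product nearly preserves the normalized independence number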

Abstract

The Euclidean $k$-means problem is a classical problem that has been extensively studied in the theoretical computer science, machine learning and the computational geometry communities. In this problem, we are given a set of $n$ points in Euclidean space $\mathbb{R}^{d}$, and the goal is to choose $k$ centers in $\mathbb{R}^{d}$ so that the sum of squared distances of each point to its nearest center is minimized. The best approximation algorithms for this problem include a polynomial time constant factor approximation for general $k$ and a $(1 + \epsilon)$-approximation which runs in time $\mathrm{poly}(n) \, 2^{O(k/\epsilon)}$. At the other extreme, the only known computational complexity result for this problem is NP-hardness [ADHP'09]. The main difficulty in obtaining hardness results stems from the Euclidean nature of the problem, and the fact that any point in $\mathbb{R}^{d}$ can be a potential center. This gap in…
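The objective described in the abstract can be made concrete with a minimal sketch. The function names `squared_distance`, `kmeans_cost`, and `centroid` and the four sample points are illustrative assumptions, not taken from the paper; the code only evaluates the cost of a given set of centers and does not attempt to find optimal ones.

```python
from typing import Sequence

Point = Sequence[float]


def squared_distance(p: Point, q: Point) -> float:
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans_cost(points: Sequence[Point], centers: Sequence[Point]) -> float:
    """Sum over all points of the squared distance to the nearest center."""
    return sum(min(squared_distance(p, c) for c in centers) for p in points)


def centroid(cluster: Sequence[Point]) -> tuple:
    """Coordinate-wise mean of a non-empty cluster."""
    n = len(cluster)
    return tuple(sum(coords) / n for coords in zip(*cluster))


# Hypothetical toy instance: four points on a line, k = 2.
points = [(0.0, 0.0), (2.0, 0.0), (10.0, 0.0), (12.0, 0.0)]

# Centers may be arbitrary points of R^d, not only input points.
# Here each center is the centroid of one pair of nearby points.
centers = [centroid(points[:2]), centroid(points[2:])]

two_center_cost = kmeans_cost(points, centers)
one_center_cost = kmeans_cost(points, [centroid(points)])
print(two_center_cost, one_center_cost)
```

In this toy instance neither center coincides with an input point, which illustrates the remark that any point in $\mathbb{R}^{d}$ can serve as a center.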
