Local Search Yields a PTAS for k-Means in Doubling Metrics
Zachary Friggstad, Mohsen Rezapour, and Mohammad R. Salavatipour

TL;DR
This paper proves that a simple local search algorithm provides a Polynomial Time Approximation Scheme (PTAS) for the $k$-means problem in fixed-dimensional Euclidean spaces, resolving a long-standing open problem.
Contribution
It introduces a local search algorithm that achieves a PTAS for $k$-means in fixed-dimensional Euclidean spaces, and extends this result to uncapacitated facility location and $k$-median in doubling metrics.
Findings
Local search yields a PTAS for $k$-means in fixed-dimensional Euclidean space.
The algorithm considers swaps of up to $ ho=d^{O(d)} imes ext{poly}(rac{1}{ extepsilon})$ centers.
First demonstration that local search provides a PTAS for facility location and $k$-median in doubling metrics.
Abstract
The most well known and ubiquitous clustering problem encountered in nearly every branch of science is undoubtedly -means: given a set of data points and a parameter , select centres and partition the data points into clusters around these centres so that the sum of squares of distances of the points to their cluster centre is minimized. Typically these data points lie for some . -means and the first algorithms for it were introduced in the 1950's. Since then, hundreds of papers have studied this problem and many algorithms have been proposed for it. The most commonly used algorithm is known as Lloyd-Forgy, which is also referred to as "the" -means algorithm, and various extensions of it often work very well in practice. However, they may produce solutions whose cost is arbitrarily large compared to the optimum solution. Kanungo et al.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
