A Fast Approximation Scheme for Low-Dimensional $k$-Means
Vincent Cohen-Addad

TL;DR
This paper presents a faster approximation algorithm for low-dimensional $k$-means, achieving near state-of-the-art practical running times while maintaining theoretical guarantees, by innovatively combining local search with randomized dissections.
Contribution
It introduces a $(1+ ext{epsilon})$-approximation algorithm for low-dimensional $k$-means with improved running time matching practical heuristics, using randomized dissections to speed up local search.
Findings
Achieves $(1+\epsilon)$-approximation in near-linear time for low-dimensional $k$-means.
Uses randomized dissections to efficiently identify optimal local moves.
Matches the practical running time of $k$-means++ and $D^2$-sampling heuristics.
Abstract
We consider the popular -means problem in -dimensional Euclidean space. Recently Friggstad, Rezapour, Salavatipour [FOCS'16] and Cohen-Addad, Klein, Mathieu [FOCS'16] showed that the standard local search algorithm yields a -approximation in time , giving the first polynomial-time approximation scheme for the problem in low-dimensional Euclidean space. While local search achieves optimal approximation guarantees, it is not competitive with the state-of-the-art heuristics such as the famous -means++ and -sampling algorithms. In this paper, we aim at bridging the gap between theory and practice by giving a -approximation algorithm for low-dimensional -means running in time , and so matching the running time of the -means++ and -sampling heuristics up…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Geometry and Mesh Generation · Complexity and Algorithms in Graphs · Algorithms and Data Compression
