Almost-Optimal Upper and Lower Bounds for Clustering in Low Dimensional Euclidean Spaces
Vincent Cohen-Addad, Karthik C. S., David Saulpic, and Chris Schwiegelshohn

TL;DR
This paper improves the computational efficiency of near-optimal clustering algorithms in low-dimensional Euclidean spaces and establishes tight lower bounds, advancing theoretical understanding of clustering complexity.
Contribution
It presents a faster algorithm for $(1+ ext{epsilon})$-approximate $k$-median and $k$-means clustering in low dimensions and proves a matching lower bound under the Gap Exponential Time Hypothesis.
Findings
Improved running time to $2^{ ilde{O}(1/ ext{epsilon})^{d-1}} imes n imes ext{polylog}(n)$.
Established a near-matching lower bound assuming Gap ETH.
Enhanced understanding of the computational complexity of clustering in low-dimensional spaces.
Abstract
The -median and -means clustering objectives are classic objectives for modeling clustering in a metric space. Given a set of points in a metric space, the goal of the -median (resp. -means) problem is to find representative points so as to minimize the sum of the distances (resp. sum of squared distances) from each point to its closest representative. Cohen-Addad, Feldmann, and Saulpic [JACM'21] showed how to obtain a -factor approximation in low-dimensional Euclidean metric for both the -median and -means problems in near-linear time (where is the dimension and is the number of input points). We improve this running time to , and show an almost matching lower bound: under the Gap Exponential Time Hypothesis for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFacility Location and Emergency Management · Computational Geometry and Mesh Generation · Advanced Clustering Algorithms Research
