Clustering under Perturbation Stability in Near-Linear Time
Pankaj K. Agarwal, Hsien-Chih Chang, Kamesh Munagala, Erin, Taylor, Emo Welzl

TL;DR
This paper introduces near-linear time algorithms for clustering in low-dimensional Euclidean spaces under perturbation stability, providing efficient solutions for k-center, k-means, and k-median problems when the stability parameter exceeds certain thresholds.
Contribution
It presents the first efficient exact algorithms for perturbation-stable clustering instances with running times nearly linear in data size for specific stability parameters.
Findings
Algorithms run in near-linear time for stable instances with α ≥ 2 + √3.
Polynomial dependence on the number of clusters k for certain stability levels.
Simple algorithms using local search or dynamic programming achieve these results.
Abstract
We consider the problem of center-based clustering in low-dimensional Euclidean spaces under the perturbation stability assumption. An instance is -stable if the underlying optimal clustering continues to remain optimal even when all pairwise distances are arbitrarily perturbed by a factor of at most . Our main contribution is in presenting efficient exact algorithms for -stable clustering instances whose running times depend near-linearly on the size of the data set when . For -center and -means problems, our algorithms also achieve polynomial dependence on the number of clusters, , when for any constant in any fixed dimension. For -median, our algorithms have polynomial dependence on for in any fixed dimension; and for in two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
