Fully Dynamic k-Means Coreset in Near-Optimal Update Time
Max Dupr\'e la Tour, Monika Henzinger, David Saulpic

TL;DR
This paper introduces a near-optimal dynamic algorithm for maintaining coresets in k-means clustering, enabling efficient updates and queries in high-dimensional and general metric spaces.
Contribution
It presents a novel fully dynamic coreset construction with near-optimal update time, improving efficiency for k-means and k-median clustering in various metric spaces.
Findings
Near-optimal update time of O(k) in general metrics
Efficient algorithms for Euclidean space with O(d) update time
Reduced query time for approximate solutions
Abstract
We study in this paper the problem of maintaining a solution to -median and -means clustering in a fully dynamic setting. To do so, we present an algorithm to efficiently maintain a coreset, a compressed version of the dataset, that allows easy computation of a clustering solution at query time. Our coreset algorithm has near-optimal update time of in general metric spaces, which reduces to in the Euclidean space . The query time is in general metrics, and in . To maintain a constant-factor approximation for -median and -means clustering in Euclidean space, this directly leads to an algorithm update time , and query time . To maintain a -approximation, the query time is reduced to .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
