Consistent k-Clustering for General Metrics
Hendrik Fichtenberger, Silvio Lattanzi, Ashkan Norouzi-Fard, Ola, Svensson

TL;DR
This paper presents an algorithm for streaming k-median clustering in metric spaces that minimizes center swaps, matching offline performance with near-optimal swaps, advancing the efficiency of dynamic clustering.
Contribution
The authors develop a streaming algorithm for k-median clustering that achieves near-optimal center swaps, closing the gap between online and offline clustering guarantees.
Findings
Matches offline guarantees with ew swaps
Uses new structural properties of k-median clustering
Achieves actor approximation with ew swaps
Abstract
Given a stream of points in a metric space, is it possible to maintain a constant approximate clustering by changing the cluster centers only a small number of times during the entire execution of the algorithm? This question received attention in recent years in the machine learning literature and, before our work, the best known algorithm performs center swaps (the notation hides polylogarithmic factors in the number of points and the aspect ratio of the input instance). This is a quadratic increase compared to the offline case -- the whole stream is known in advance and one is interested in keeping a constant approximation at any point in time -- for which swaps are known to be sufficient and simple examples show that swaps are necessary. We close this gap by developing an algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutomated Road and Building Extraction · Facility Location and Emergency Management · Data Management and Algorithms
