Fast Exact k-Means, k-Medians and Bregman Divergence Clustering in 1D
Allan Gr{\o}nlund, Kasper Green Larsen, Alexander Mathiasen and, Jesper Sindahl Nielsen, Stefan Schneider, Mingzhou Song

TL;DR
This paper reviews and improves exact polynomial-time algorithms for 1D k-Means and related clustering problems, extending them to Bregman divergences and optimizing space and reporting efficiency.
Contribution
It uncovers overlooked historical algorithms, compares their theoretical properties, and introduces space-efficient data structures for fast exact clustering in 1D.
Findings
Existing algorithms can be optimized for space and speed.
Algorithms generalize to Bregman divergences and absolute distances.
Practical experiments demonstrate performance improvements.
Abstract
The -Means clustering problem on points is NP-Hard for any dimension , however, for the 1D case there exists exact polynomial time algorithms. Previous literature reported an time dynamic programming algorithm that uses space. It turns out that the problem has been considered under a different name more than twenty years ago. We present all the existing work that had been overlooked and compare the various solutions theoretically. Moreover, we show how to reduce the space usage for some of them, as well as generalize them to data structures that can quickly report an optimal -Means clustering for any . Finally we also generalize all the algorithms to work for the absolute distance and to work for any Bregman Divergence. We complement our theoretical contributions by experiments that compare the practical performance of the various algorithms.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutomated Road and Building Extraction · Data Management and Algorithms · Video Surveillance and Tracking Methods
