A balanced k-means algorithm for weighted point sets
Steffen Borgwardt, Andreas Brieden, Peter Gritzmann

TL;DR
This paper introduces a weight-balanced k-means algorithm capable of handling weighted points and size constraints, providing polynomial bounds on iterations and extending classical k-means to more complex, real-world scenarios.
Contribution
It presents a novel generalization of k-means that manages weighted points with prescribed cluster size bounds, including theoretical analysis of its convergence properties.
Findings
Bounded the number of iterations by n^{O(dk)}
Algorithm handles weighted points with size constraints
Polynomial iteration bounds for fixed k and d
Abstract
The classical -means algorithm for partitioning points in into clusters is one of the most popular and widely spread clustering methods. The need to respect prescribed lower bounds on the cluster sizes has been observed in many scientific and business applications. In this paper, we present and analyze a generalization of -means that is capable of handling weighted point sets and prescribed lower and upper bounds on the cluster sizes. We call it weight-balanced -means. The key difference to existing models lies in the ability to handle the combination of weighted point sets with prescribed bounds on the cluster sizes. This imposes the need to perform partial membership clustering, and leads to significant differences. For example, while finite termination of all -means variants for unweighted point sets is a simple consequence of the existence of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Computational Geometry and Mesh Generation · Face and Expression Recognition
