Multi-Prototypes Convex Merging Based K-Means Clustering Algorithm
Dong Li, Shuisheng Zhou, Tieyong Zeng, and Raymond H. Chan

TL;DR
This paper introduces MCKM, a novel clustering algorithm that combines multi-prototypes sampling and convex merging to automatically determine the number of clusters and avoid local minima in K-Means clustering.
Contribution
The paper proposes a new method integrating multi-prototypes sampling and convex merging to improve K-Means, automatically estimate cluster number, and escape local minima.
Findings
Effective in synthetic data clustering
Verified on real-world datasets
Outperforms traditional K-Means
Abstract
K-Means algorithm is a popular clustering method. However, it has two limitations: 1) it gets stuck easily in spurious local minima, and 2) the number of clusters k has to be given a priori. To solve these two issues, a multi-prototypes convex merging based K-Means clustering algorithm (MCKM) is presented. First, based on the structure of the spurious local minima of the K-Means problem, a multi-prototypes sampling (MPS) is designed to select the appropriate number of multi-prototypes for data with arbitrary shapes. A theoretical proof is given to guarantee that the multi-prototypes selected by MPS can achieve a constant factor approximation to the optimal cost of the K-Means problem. Then, a merging technique, called convex merging (CM), merges the multi-prototypes to get a better local minima without k being given a priori. Specifically, CM can obtain the optimal merging and estimate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Computing and Algorithms · Advanced Clustering Algorithms Research · Face and Expression Recognition
Methodsk-Means Clustering
