Multi-Prototypes Convex Merging Based K-Means Clustering Algorithm

Dong Li; Shuisheng Zhou; Tieyong Zeng; and Raymond H. Chan

arXiv:2302.07045·cs.LG·February 15, 2023

Multi-Prototypes Convex Merging Based K-Means Clustering Algorithm

Dong Li, Shuisheng Zhou, Tieyong Zeng, and Raymond H. Chan

PDF

Open Access

TL;DR

This paper introduces MCKM, a novel clustering algorithm that combines multi-prototypes sampling and convex merging to automatically determine the number of clusters and avoid local minima in K-Means clustering.

Contribution

The paper proposes a new method integrating multi-prototypes sampling and convex merging to improve K-Means, automatically estimate cluster number, and escape local minima.

Findings

01

Effective in synthetic data clustering

02

Verified on real-world datasets

03

Outperforms traditional K-Means

Abstract

K-Means algorithm is a popular clustering method. However, it has two limitations: 1) it gets stuck easily in spurious local minima, and 2) the number of clusters k has to be given a priori. To solve these two issues, a multi-prototypes convex merging based K-Means clustering algorithm (MCKM) is presented. First, based on the structure of the spurious local minima of the K-Means problem, a multi-prototypes sampling (MPS) is designed to select the appropriate number of multi-prototypes for data with arbitrary shapes. A theoretical proof is given to guarantee that the multi-prototypes selected by MPS can achieve a constant factor approximation to the optimal cost of the K-Means problem. Then, a merging technique, called convex merging (CM), merges the multi-prototypes to get a better local minima without k being given a priori. Specifically, CM can obtain the optimal merging and estimate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Computing and Algorithms · Advanced Clustering Algorithms Research · Face and Expression Recognition

Methodsk-Means Clustering