Careful Seeding for k-Medois Clustering with Incremental k-Means++ Initialization
Difei Cheng, Yunfeng Zhang, Ruinan Jin

TL;DR
This paper introduces a novel incremental k-medoids clustering algorithm, INCKPP, which improves initialization, handles imbalanced datasets better, and is further optimized for efficiency with INCKPPsample, outperforming existing methods.
Contribution
The paper proposes the INCKPP algorithm with incremental k-means++ initialization, overcoming hyperparameter issues and improving clustering on imbalanced datasets, along with an efficient INCKPPsample variant.
Findings
INCKPP outperforms existing algorithms on synthetic and real datasets.
The incremental initialization improves clustering accuracy and robustness.
INCKPPsample enhances efficiency without sacrificing performance.
Abstract
K-medoids clustering is a popular variant of k-means clustering and widely used in pattern recognition and machine learning. A main drawback of k-medoids clustering is that an improper initialization can cause it to get trapped in local optima. An improved k-medoids clustering algorithm, called INCKM algorithm, which is the first to apply incremental initialization to k-medoids clustering, was recently proposed to overcome this drawback. The INCKM algorithm requires the construction of a subset of candidate medoids determined by one hyperparameter for initialization, and meanwhile, it always fails when dealing with imbalanced datasets with an incorrect hyperparameter selection. In this paper, we propose a novel k-medoids clustering algorithm, called incremental k-means++ (INCKPP) algorithm, which initializes with a novel incremental manner, attempting to optimally add one new cluster…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Artificial Intelligence in Healthcare · Data Mining Algorithms and Applications
