Global $k$-means$++$: an effective relaxation of the global $k$-means clustering algorithm
Georgios Vardakas, Aristidis Likas

TL;DR
This paper introduces global $k$-means$++$, a new clustering algorithm that achieves high-quality results similar to global $k$-means but with significantly reduced computational cost by leveraging $k$-means++ style center selection.
Contribution
The paper proposes the global $k$-means$++$ algorithm, combining the deterministic nature of global $k$-means with the probabilistic center selection of $k$-means++, reducing computational load while maintaining clustering quality.
Findings
Achieves comparable clustering quality to global $k$-means.
Reduces computational time significantly.
Performs well on various benchmark datasets.
Abstract
The -means algorithm is a prevalent clustering method due to its simplicity, effectiveness, and speed. However, its main disadvantage is its high sensitivity to the initial positions of the cluster centers. The global -means is a deterministic algorithm proposed to tackle the random initialization problem of k-means but its well-known that requires high computational cost. It partitions the data to clusters by solving all -means sub-problems incrementally for all . For each cluster problem, the method executes the -means algorithm times, where is the number of datapoints. In this paper, we propose the \emph{global -means\texttt{++}} clustering algorithm, which is an effective way of acquiring quality clustering solutions akin to those of global -means with a reduced computational load. This is achieved by exploiting the center selection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Data Management and Algorithms · Data Mining Algorithms and Applications
