An enhanced method of initial cluster center selection for K-means algorithm
Zillur Rahman, Md. Sabir Hossain, Mohammad Hasan, Ahmed Imteaj

TL;DR
This paper introduces a new initialization method for K-means clustering that improves cluster separation and reduces errors, outperforming traditional random initialization especially with more than two clusters.
Contribution
A novel initialization approach combining convex hull and nearest neighbor techniques to enhance K-means clustering performance.
Findings
Achieved lower clustering errors on Iris, Letter, and Ruspini datasets.
Outperformed conventional K-means in computation speed for multiple clusters.
Demonstrated robustness across various real-world datasets.
Abstract
Clustering is one of the widely used techniques to find out patterns from a dataset that can be applied in different applications or analyses. K-means, the most popular and simple clustering algorithm, might get trapped into local minima if not properly initialized and the initialization of this algorithm is done randomly. In this paper, we propose a novel approach to improve initial cluster selection for K-means algorithm. This algorithm is based on the fact that the initial centroids must be well separated from each other since the final clusters are separated groups in feature space. The Convex Hull algorithm facilitates the computing of the first two centroids and the remaining ones are selected according to the distance from previously selected centers. To ensure the selection of one center per cluster, we use the nearest neighbor technique. To check the robustness of our proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
