Hybridization of K-means with improved firefly algorithm for automatic clustering in high dimension
Afroj Alam

TL;DR
This paper proposes a hybrid clustering approach combining K-means, improved firefly algorithm, and PCA to automatically determine optimal clusters in high-dimensional data, addressing convergence and local minima issues.
Contribution
It introduces a novel hybrid method integrating an enhanced firefly algorithm with K-means and PCA for improved automatic clustering in high-dimensional datasets.
Findings
Effective determination of optimal cluster number using Silhouette and Elbow methods.
Enhanced firefly algorithm improves convergence speed and avoids local minima.
Hybrid approach outperforms traditional methods in high-dimensional clustering tasks.
Abstract
K-means Clustering is the most well-known partitioning algorithm among all clustering, by which we can partition the data objects very easily in to more than one clusters. However, for K-means to choose an appropriate number of clusters without any prior domain knowledge about the dataset is challenging, especially in high-dimensional data objects. Hence, we have implemented the Silhouette and Elbow methods with PCA to find an optimal number of clusters. Also, previously, so many meta-heuristic swarm intelligence algorithms inspired by nature have been employed to handle the automatic data clustering problem. Firefly is efficient and robust for automatic clustering. However, in the Firefly algorithm, the entire population is automatically subdivided into sub-populations that decrease the convergence rate speed and trapping to local minima in high-dimensional optimization problems. Thus,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Data Mining and Machine Learning Applications · Customer churn and segmentation
