Farthest-Point Heuristic based Initialization Methods for K-Modes Clustering
Zengyou He

TL;DR
This paper investigates a farthest-point heuristic for initializing k-modes clustering, demonstrating that it improves clustering accuracy over random initialization through experimental validation.
Contribution
It introduces a novel farthest-point heuristic initialization method for k-modes clustering and empirically shows its effectiveness.
Findings
Farthest-point heuristic improves clustering accuracy.
The method outperforms random initialization.
Experimental results validate the approach.
Abstract
The k-modes algorithm has become a popular technique in solving categorical data clustering problems in different application domains. However, the algorithm requires random selection of initial points for the clusters. Different initial points often lead to considerable distinct clustering results. In this paper we present an experimental study on applying a farthest-point heuristic based initialization method to k-modes clustering to improve its performance. Experiments show that new initialization method leads to better clustering accuracy than random selection initialization method for k-modes clustering.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Data Management and Algorithms · Face and Expression Recognition
