An Analytical Study on Behavior of Clusters Using K Means, EM and K*   Means Algorithm

G. Nathiya; S. C. Punitha; M. Punithavalli

arXiv:1004.1743·cs.LG·April 13, 2010·19 cites

An Analytical Study on Behavior of Clusters Using K Means, EM and K* Means Algorithm

G. Nathiya, S. C. Punitha, M. Punithavalli

PDF

Open Access

TL;DR

This paper compares the performance of K-means, EM, and K* Means clustering algorithms on a heart dataset, demonstrating that EM produces higher quality clusters based on various metrics.

Contribution

It provides an experimental analysis of three clustering algorithms on a specific dataset, highlighting EM's superior clustering quality over K-means and K* Means.

Findings

01

EM outperforms K-means and K* Means in cluster quality

02

EM achieves better purity and entropy scores

03

EM has lower CPU time for the dataset

Abstract

Clustering is an unsupervised learning method that constitutes a cornerstone of an intelligent data analysis process. It is used for the exploration of inter-relationships among a collection of patterns, by organizing them into homogeneous clusters. Clustering has been dynamically applied to a variety of tasks in the field of Information Retrieval (IR). Clustering has become one of the most active area of research and the development. Clustering attempts to discover the set of consequential groups where those within each group are more closely related to one another than the others assigned to different groups. The resultant clusters can provide a structure for organizing large bodies of text for efficient browsing and searching. There exists a wide variety of clustering algorithms that has been intensively studied in the clustering problem. Among the algorithms that remain the most…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Clustering Algorithms Research · Text and Document Classification Technologies · Face and Expression Recognition