An agglomerative hierarchical clustering method by optimizing the average silhouette width
Fatima Batool

TL;DR
This paper introduces HOSil, an agglomerative hierarchical clustering method that optimizes the average silhouette width, demonstrating superior clustering quality and accurate cluster number estimation across diverse data structures.
Contribution
The paper presents a novel linkage metric for AHC, called HOSil, which improves clustering quality and cluster number estimation compared to existing methods.
Findings
HOSil outperforms other clustering methods in quality across various data types.
HOSil accurately identifies the number of clusters in diverse data structures.
HOSil can detect clusters of various shapes and distributions.
Abstract
An agglomerative hierarchical clustering (AHC) framework and algorithm named HOSil based on a new linkage metric optimized by the average silhouette width (ASW) index is proposed. A conscientious investigation of various clustering methods and estimation indices is conducted across a diverse verities of data structures for three aims: a) clustering quality, b) clustering recovery, and c) estimation of number of clusters. HOSil has shown better clustering quality for a range of artificial and real world data structures as compared to k-means, PAM, single, complete, average, Ward, McQuitty, spectral, model-based, and several estimation methods. It can identify clusters of various shapes including spherical, elongated, relatively small sized clusters, clusters coming from different distributions including uniform, t, gamma and others. HOSil has shown good recovery for correct determination…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Face and Expression Recognition · Data Management and Algorithms
