kFuse: A novel density based agglomerative clustering
Huan Yan, Junjie Hu

TL;DR
kFuse is a new density-based agglomerative clustering method that reduces parameter dependence and improves stability and accuracy by considering density, boundary connectivity, and natural neighbors, validated on synthetic and real datasets.
Contribution
This paper introduces kFuse, a novel agglomerative clustering algorithm that simplifies parameter tuning and enhances clustering stability and accuracy through a density-based merging strategy.
Findings
kFuse outperforms existing methods in accuracy on synthetic datasets.
kFuse demonstrates robustness across various real-world datasets.
The method requires only the final number of clusters as input.
Abstract
Agglomerative clustering has emerged as a vital tool in data analysis due to its intuitive and flexible characteristics. However, existing agglomerative clustering methods often involve additional parameters for sub-cluster partitioning and inter-cluster similarity assessment. This necessitates different parameter settings across various datasets, which is undoubtedly challenging in the absence of prior knowledge. Moreover, existing agglomerative clustering techniques are constrained by the calculation method of connection distance, leading to unstable clustering results. To address these issues, this paper introduces a novel density-based agglomerative clustering method, termed kFuse. kFuse comprises four key components: (1) sub-cluster partitioning based on natural neighbors; (2) determination of boundary connectivity between sub-clusters through the computation of adjacent samples…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Bayesian Methods and Mixture Models · Complex Network Analysis Techniques
