An Information-theoretic Perspective of Hierarchical Clustering
Yicheng Pan, Feng Zheng, Bingchen Fan

TL;DR
This paper introduces an information-theoretic approach to hierarchical clustering, proposing a new objective function and algorithm that automatically determines the number of hierarchy levels, demonstrating superior performance on synthetic and real datasets.
Contribution
It formulates a novel information-theoretic objective for hierarchical clustering and develops HCSE, an algorithm that automatically determines hierarchy levels without hyper-parameters.
Findings
HCSE effectively finds intrinsic hierarchy levels in synthetic data.
HCSE achieves competitive costs compared to LOUVAIN and HLP on real datasets.
The proposed approach offers an interpretable and parameter-free clustering method.
Abstract
A combinatorial cost function for hierarchical clustering was introduced by Dasgupta \cite{dasgupta2016cost}. It has been generalized by Cohen-Addad et al. \cite{cohen2019hierarchical} to a general form named admissible function. In this paper, we investigate hierarchical clustering from the \emph{information-theoretic} perspective and formulate a new objective function. We also establish the relationship between these two perspectives. In algorithmic aspect, we get rid of the traditional top-down and bottom-up frameworks, and propose a new one to stratify the \emph{sparsest} level of a cluster tree recursively in guide with our objective function. For practical use, our resulting cluster tree is not binary. Our algorithm called HCSE outputs a -level cluster tree by a novel and interpretable mechanism to choose automatically without any hyper-parameter. Our experimental results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Complex Network Analysis Techniques · Bayesian Methods and Mixture Models
