Nearly-Optimal Hierarchical Clustering for Well-Clustered Graphs
Steinar Laenen, Bogdan-Adrian Manghiuc, He Sun

TL;DR
This paper introduces two nearly-linear time hierarchical clustering algorithms for well-structured graphs that achieve constant-factor approximation of Dasgupta's cost, outperforming previous methods on synthetic and real datasets.
Contribution
The paper proposes efficient HC algorithms with theoretical guarantees and demonstrates superior empirical performance on diverse datasets.
Findings
Algorithms run in nearly-linear time.
Achieve O(1) approximation of Dasgupta's cost.
Produce comparable or better HC trees faster than prior methods.
Abstract
This paper presents two efficient hierarchical clustering (HC) algorithms with respect to Dasgupta's cost function. For any input graph with a clear cluster-structure, our designed algorithms run in nearly-linear time in the input size of , and return an -approximate HC tree with respect to Dasgupta's cost function. We compare the performance of our algorithm against the previous state-of-the-art on synthetic and real-world datasets and show that our designed algorithm produces comparable or better HC trees with much lower running time.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Complex Network Analysis Techniques · Face and Expression Recognition
