Hierarchical Clusterings of Unweighted Graphs
Svein H{\o}gemo, Christophe Paul, Jan Arne Telle

TL;DR
This paper investigates the complexity of hierarchical clustering in unweighted graphs, proving NP-completeness generally, but identifying specific graph classes like co-bipartite and cycle on 6 vertices as min-well-behaved, enabling efficient clustering.
Contribution
Introduces the normalization procedure for improving clusterings and characterizes min-well-behaved graphs, including co-bipartite and certain cycles, for hierarchical clustering.
Findings
Hierarchical clustering problem is NP-complete in general.
Normalization procedure can iteratively improve clusterings.
Co-bipartite graphs and cycle on 6 vertices are min-well-behaved.
Abstract
We study the complexity of finding an optimal hierarchical clustering of an unweighted similarity graph under the recently introduced Dasgupta objective function. We introduce a proof technique, called the normalization procedure, that takes any such clustering of a graph and iteratively improves it until a desired target clustering of G is reached. We use this technique to show both a negative and a positive complexity result. Firstly, we show that in general the problem is NP-complete. Secondly, we consider min-well-behaved graphs, which are graphs having the property that for any the graph being the join of copies of has an optimal hierarchical clustering that splits each copy of in the same optimal way. To optimally cluster such a graph we thus only need to optimally cluster the smaller graph . Co-bipartite graphs are min-well-behaved, but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
