Hierarchical Clustering via Local Search
Hossein Jowhari

TL;DR
This paper presents a local search algorithm for hierarchical clustering using tree re-arrangements called interchanges, providing theoretical guarantees on revenue and demonstrating practical effectiveness through empirical results.
Contribution
It introduces a local search method with provable revenue bounds for hierarchical clustering and connects it to the average link algorithm, offering new insights and practical implementation.
Findings
Any locally optimal tree guarantees a revenue of at least (n-2)/3 times the total similarity.
Average link trees are shown to be locally optimal under the interchange operation.
Empirical results show the method effectively improves hierarchical clustering quality.
Abstract
In this paper, we introduce a local search algorithm for hierarchical clustering. For the local step, we consider a tree re-arrangement operation, known as the {\em interchange}, which involves swapping two closely positioned sub-trees within a tree hierarchy. The interchange operation has been previously used in the context of phylogenetic trees. As the objective function for evaluating the resulting hierarchies, we utilize the revenue function proposed by Moseley and Wang (NIPS 2017.) In our main result, we show that any locally optimal tree guarantees a revenue of at least where is the number of objects and is the associated similarity function. This finding echoes the previously established bound for the average link algorithm as analyzed by Moseley and Wang. We demonstrate that this alignment is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Data Management and Algorithms · Text and Document Classification Technologies
