Hierarchical Clustering: New Bounds and Objective
Mirmahdi Rahgoshay, Mohammad R. Salavatipour

TL;DR
This paper advances hierarchical clustering by introducing new bounds and objectives, including improved approximation algorithms for dissimilarity-based clustering and a novel cost function, enhancing the theoretical understanding and practical algorithms.
Contribution
It provides a more refined approximation algorithm for the Rev(T) objective and introduces a new dissimilarity-based clustering objective with a 1.3977-approximation.
Findings
Achieved a 0.71604 approximation for Rev(T)
Developed a 1.3977-approximation for the new cost function
Enhanced theoretical bounds for hierarchical clustering algorithms
Abstract
Hierarchical Clustering has been studied and used extensively as a method for analysis of data. More recently, Dasgupta [2016] defined a precise objective function. Given a set of data points with a weight function for each two items and denoting their similarity/dis-similarity, the goal is to build a recursive (tree like) partitioning of the data points (items) into successively smaller clusters. He defined a cost function for a tree to be where is the subtree rooted at the least common ancestor of and and presented the first approximation algorithm for such clustering. Then Moseley and Wang [2017] considered the dual of Dasgupta's objective function for similarity-based weights and showed that both random partitioning and average linkage have approximation ratio which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Face and Expression Recognition · Text and Document Classification Technologies
