A Simple and Efficient Method to Compute a Single Linkage Dendrogram
Huanbiao Zhu, Werner Stuetzle

TL;DR
This paper presents a simple, efficient recursive method to compute a single linkage dendrogram using Prim's algorithm for MST, avoiding the need to store the full distance matrix.
Contribution
It introduces a recursive partitioning algorithm that efficiently computes the dendrogram directly from Prim's MST without extra computational cost.
Findings
Uses Prim's algorithm to compute MST efficiently
Avoids storing the full distance matrix
Provides a straightforward recursive splitting method
Abstract
We address the problem of computing a single linkage dendrogram. A possible approach is to: (i) Form an edge weighted graph over the data, with edge weights reflecting dissimilarities. (ii) Calculate the MST of . (iii) Break the longest edge of thereby splitting it into subtrees , . (iv) Apply the splitting process recursively to the subtrees. This approach has the attractive feature that Prim's algorithm for MST construction calculates distances as needed, and hence there is no need to ever store the inter-point distance matrix. The recursive partitioning algorithm requires us to determine the vertices (and edges) of and . We show how this can be done easily and efficiently using information generated by Prim's algorithm without any additional computational cost.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Graph Theory and Algorithms · Advanced Clustering Algorithms Research
