Fully-Dynamic Parallel Algorithms for Single-Linkage Clustering
Quinten De Man, Laxman Dhulipala, Kishen N Gowda

TL;DR
This paper introduces fully-dynamic parallel algorithms for maintaining the single-linkage dendrogram efficiently under edge insertions and deletions, outperforming static algorithms especially when the dendrogram height is low.
Contribution
It presents novel algorithms for updating the single-linkage dendrogram dynamically with improved asymptotic and parallel efficiency, a first in the field.
Findings
Insertion algorithm runs in O(h) time
Deletion algorithm runs in O(h log(1+n/h)) time
Parallel algorithms are work-efficient with polylogarithmic depth
Abstract
Single-linkage clustering is a popular form of hierarchical agglomerative clustering (HAC) where the distance between two clusters is defined as the minimum distance between any pair of points across the two clusters. In single-linkage HAC, the output is typically the single-linkage dendrogram (SLD), which is the binary tree representing the hierarchy of clusters formed by iteratively contracting the two closest clusters. In the dynamic setting, prior work has only studied maintaining a minimum spanning forest over the data since single-linkage HAC reduces to computing the SLD on the minimum spanning forest of the data. In this paper, we study the problem of maintaining the SLD in the fully-dynamic setting. We assume the input is a dynamic forest (representing the minimum spanning forest of the data) which receives a sequence of edge insertions and edge deletions. To our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
