cuSLINK: Single-linkage Agglomerative Clustering on the GPU
Corey J. Nolet, Divye Gala, Alex Fender, Mahesh Doijade, Joe Eaton,, Edward Raff, John Zedlewski, Brad Rees, Tim Oates

TL;DR
cuSLINK introduces a GPU-optimized single-linkage clustering algorithm that significantly reduces space complexity and accelerates clustering tasks across various data mining applications.
Contribution
The paper presents cuSLINK, a novel GPU reformulation of SLINK with reduced space requirements and reusable computational primitives for efficient clustering.
Findings
Achieves state-of-the-art GPU clustering performance
Reduces space complexity to O(Nk)
Enables scalable clustering for large datasets
Abstract
In this paper, we propose cuSLINK, a novel and state-of-the-art reformulation of the SLINK algorithm on the GPU which requires only space and uses a parameter to trade off space and time. We also propose a set of novel and reusable building blocks that compose cuSLINK. These building blocks include highly optimized computational patterns for -NN graph construction, spanning trees, and dendrogram cluster extraction. We show how we used our primitives to implement cuSLINK end-to-end on the GPU, further enabling a wide range of real-world data mining and machine learning applications that were once intractable. In addition to being a primary computational bottleneck in the popular HDBSCAN algorithm, the impact of our end-to-end cuSLINK algorithm spans a large range of important applications, including cluster analysis in social and computer networks, natural language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Complex Network Analysis Techniques · Data Management and Algorithms
