SHADE: Deep Density-based Clustering
Anna Beer, Pascal Weber, Lukas Miklautz, Collin Leiber, Walid Durani,, Christian B\"ohm, Claudia Plant

TL;DR
SHADE is a deep clustering algorithm that effectively detects arbitrarily shaped, density-connected clusters in high-dimensional noisy data, automatically identifying noise and providing interpretable visualizations.
Contribution
It introduces a novel loss function that incorporates density-connectivity into deep clustering, improving detection of complex cluster shapes without user input.
Findings
Outperforms existing methods in clustering quality on complex data
Automatically detects noise points without user intervention
Preserves cluster shapes for visualization and interpretation
Abstract
Detecting arbitrarily shaped clusters in high-dimensional noisy data is challenging for current clustering methods. We introduce SHADE (Structure-preserving High-dimensional Analysis with Density-based Exploration), the first deep clustering algorithm that incorporates density-connectivity into its loss function. Similar to existing deep clustering algorithms, SHADE supports high-dimensional and large data sets with the expressive power of a deep autoencoder. In contrast to most existing deep clustering methods that rely on a centroid-based clustering objective, SHADE incorporates a novel loss function that captures density-connectivity. SHADE thereby learns a representation that enhances the separation of density-connected clusters. SHADE detects a stable clustering and noise points fully automatically without any user input. It outperforms existing methods in clustering quality,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research
