SHADE: Deep Density-based Clustering

Anna Beer; Pascal Weber; Lukas Miklautz; Collin Leiber; Walid Durani,; Christian B\"ohm; Claudia Plant

arXiv:2410.06265·cs.LG·October 10, 2024

SHADE: Deep Density-based Clustering

Anna Beer, Pascal Weber, Lukas Miklautz, Collin Leiber, Walid Durani,, Christian B\"ohm, Claudia Plant

PDF

Open Access

TL;DR

SHADE is a deep clustering algorithm that effectively detects arbitrarily shaped, density-connected clusters in high-dimensional noisy data, automatically identifying noise and providing interpretable visualizations.

Contribution

It introduces a novel loss function that incorporates density-connectivity into deep clustering, improving detection of complex cluster shapes without user input.

Findings

01

Outperforms existing methods in clustering quality on complex data

02

Automatically detects noise points without user intervention

03

Preserves cluster shapes for visualization and interpretation

Abstract

Detecting arbitrarily shaped clusters in high-dimensional noisy data is challenging for current clustering methods. We introduce SHADE (Structure-preserving High-dimensional Analysis with Density-based Exploration), the first deep clustering algorithm that incorporates density-connectivity into its loss function. Similar to existing deep clustering algorithms, SHADE supports high-dimensional and large data sets with the expressive power of a deep autoencoder. In contrast to most existing deep clustering methods that rely on a centroid-based clustering objective, SHADE incorporates a novel loss function that captures density-connectivity. SHADE thereby learns a representation that enhances the separation of density-connected clusters. SHADE detects a stable clustering and noise points fully automatically without any user input. It outperforms existing methods in clustering quality,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Clustering Algorithms Research