TL;DR
This paper introduces a domain-adaptive density clustering algorithm that effectively handles data with varying density distributions, addressing issues like sparse cluster loss and cluster fragmentation, and demonstrating superior results on complex datasets.
Contribution
The paper proposes a novel domain-adaptive density measurement and cluster self-ensemble method for improved clustering of data with diverse density features.
Findings
Outperforms existing algorithms on VDD, ED, and MDDM datasets
Effectively detects sparse clusters with adaptive density measurement
Reduces cluster fragmentation through self-ensemble approach
Abstract
As one type of efficient unsupervised learning methods, clustering algorithms have been widely used in data mining and knowledge discovery with noticeable advantages. However, clustering algorithms based on density peak have limited clustering effect on data with varying density distribution (VDD), equilibrium distribution (ED), and multiple domain-density maximums (MDDM), leading to the problems of sparse cluster loss and cluster fragmentation. To address these problems, we propose a Domain-Adaptive Density Clustering (DADC) algorithm, which consists of three steps: domain-adaptive density measurement, cluster center self-identification, and cluster self-ensemble. For data with VDD features, clusters in sparse regions are often neglected by using uniform density peak thresholds, which results in the loss of sparse clusters. We define a domain-adaptive density measurement method based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
