A Domain Adaptive Density Clustering Algorithm for Data with Varying   Density Distribution

Jianguo Chen; Philip S. Yu

arXiv:1911.10293·cs.LG·November 26, 2019

A Domain Adaptive Density Clustering Algorithm for Data with Varying Density Distribution

Jianguo Chen, Philip S. Yu

PDF

1 Repo

TL;DR

This paper introduces a domain-adaptive density clustering algorithm that effectively handles data with varying density distributions, addressing issues like sparse cluster loss and cluster fragmentation, and demonstrating superior results on complex datasets.

Contribution

The paper proposes a novel domain-adaptive density measurement and cluster self-ensemble method for improved clustering of data with diverse density features.

Findings

01

Outperforms existing algorithms on VDD, ED, and MDDM datasets

02

Effectively detects sparse clusters with adaptive density measurement

03

Reduces cluster fragmentation through self-ensemble approach

Abstract

As one type of efficient unsupervised learning methods, clustering algorithms have been widely used in data mining and knowledge discovery with noticeable advantages. However, clustering algorithms based on density peak have limited clustering effect on data with varying density distribution (VDD), equilibrium distribution (ED), and multiple domain-density maximums (MDDM), leading to the problems of sparse cluster loss and cluster fragmentation. To address these problems, we propose a Domain-Adaptive Density Clustering (DADC) algorithm, which consists of three steps: domain-adaptive density measurement, cluster center self-identification, and cluster self-ensemble. For data with VDD features, clusters in sparse regions are often neglected by using uniform density peak thresholds, which results in the loss of sparse clusters. We define a domain-adaptive density measurement method based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JianguoChen2015/DADC
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.