Faithful Density-Peaks Clustering via Matrix Computations on MPI   Parallelization System

Ji Xu; Tianlong Xiao; Jinye Yang; Panpan Zhu

arXiv:2406.12297·cs.LG·June 19, 2024

Faithful Density-Peaks Clustering via Matrix Computations on MPI Parallelization System

Ji Xu, Tianlong Xiao, Jinye Yang, Panpan Zhu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a parallel density peaks clustering method that effectively handles non-Euclidean data and large datasets by leveraging matrix computations on MPI, improving accuracy and scalability over existing methods.

Contribution

It proposes a novel faithful and parallel DP approach using vector-like distance matrices and inverse leading-node-finding, addressing limitations of previous methods.

Findings

01

Capable of clustering non-Euclidean data such as community detection

02

Outperforms state-of-the-art methods in accuracy on large Euclidean datasets

03

Demonstrates scalability and effectiveness through extensive experiments

Abstract

Density peaks clustering (DP) has the ability of detecting clusters of arbitrary shape and clustering non-Euclidean space data, but its quadratic complexity in both computing and storage makes it difficult to scale for big data. Various approaches have been proposed in this regard, including MapReduce based distribution computing, multi-core parallelism, presentation transformation (e.g., kd-tree, Z-value), granular computing, and so forth. However, most of these existing methods face two limitations. One is their target datasets are mostly constrained to be in Euclidian space, the other is they emphasize only on local neighbors while ignoring global data distribution due to restriction to cut-off kernel when computing density. To address the two issues, we present a faithful and parallel DP method that makes use of two types of vector-like distance matrices and an inverse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alanxuji/faithpdp
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications