A density peaks clustering algorithm with sparse search and K-d tree

Yunxiao Shan; Shu Li; Fuxiang Li; Yuxin Cui; Shuai Li; Ming Zhou,; Xiang Li

arXiv:2203.00973·stat.ML·July 21, 2022

A density peaks clustering algorithm with sparse search and K-d tree

Yunxiao Shan, Shu Li, Fuxiang Li, Yuxin Cui, Shuai Li, Ming Zhou,, Xiang Li

PDF

TL;DR

This paper introduces an improved density peaks clustering algorithm that uses sparse search and K-d trees to significantly reduce computational complexity and enhance clustering efficiency and accuracy, especially on large datasets.

Contribution

The paper proposes a novel density peaks clustering method utilizing sparse search and K-d trees to accelerate computation and improve clustering performance.

Findings

01

Reduces computational complexity from O(n^2K) to O(n(n^{1-1/K}+k))

02

Enhances clustering efficiency on large datasets

03

Improves clustering accuracy compared to existing algorithms

Abstract

Density peaks clustering has become a nova of clustering algorithm because of its simplicity and practicality. However, there is one main drawback: it is time-consuming due to its high computational complexity. Herein, a density peaks clustering algorithm with sparse search and K-d tree is developed to solve this problem. Firstly, a sparse distance matrix is calculated by using K-d tree to replace the original full rank distance matrix, so as to accelerate the calculation of local density. Secondly, a sparse search strategy is proposed to accelerate the computation of relative-separation with the intersection between the set of $k$ nearest neighbors and the set consisting of the data points with larger local density for any data point. Furthermore, a second-order difference method for decision values is adopted to determine the cluster centers adaptively. Finally, experiments are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.