A density peaks clustering algorithm with sparse search and K-d tree
Yunxiao Shan, Shu Li, Fuxiang Li, Yuxin Cui, Shuai Li, Ming Zhou,, Xiang Li

TL;DR
This paper introduces an improved density peaks clustering algorithm that uses sparse search and K-d trees to significantly reduce computational complexity and enhance clustering efficiency and accuracy, especially on large datasets.
Contribution
The paper proposes a novel density peaks clustering method utilizing sparse search and K-d trees to accelerate computation and improve clustering performance.
Findings
Reduces computational complexity from O(n^2K) to O(n(n^{1-1/K}+k))
Enhances clustering efficiency on large datasets
Improves clustering accuracy compared to existing algorithms
Abstract
Density peaks clustering has become a nova of clustering algorithm because of its simplicity and practicality. However, there is one main drawback: it is time-consuming due to its high computational complexity. Herein, a density peaks clustering algorithm with sparse search and K-d tree is developed to solve this problem. Firstly, a sparse distance matrix is calculated by using K-d tree to replace the original full rank distance matrix, so as to accelerate the calculation of local density. Secondly, a sparse search strategy is proposed to accelerate the computation of relative-separation with the intersection between the set of nearest neighbors and the set consisting of the data points with larger local density for any data point. Furthermore, a second-order difference method for decision values is adopted to determine the cluster centers adaptively. Finally, experiments are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
