Clustering by Mining Density Distributions and Splitting Manifold Structure
Zhichang Xu, Zhiguo Long, Hua Meng

TL;DR
This paper introduces a new spectral clustering method that starts from local structures to form micro-clusters, using a density and manifold-aware splitting rule, resulting in improved adaptability to complex data structures.
Contribution
It proposes a local-structure-based micro-clustering approach with a novel similarity measure, enhancing spectral clustering's effectiveness on complex, unevenly distributed data.
Findings
Outperforms granular-ball methods on synthetic datasets
Better captures complex data structures
Shows improved clustering accuracy on real-world data
Abstract
Spectral clustering requires the time-consuming decomposition of the Laplacian matrix of the similarity graph, thus limiting its applicability to large datasets. To improve the efficiency of spectral clustering, a top-down approach was recently proposed, which first divides the data into several micro-clusters (granular-balls), then splits these micro-clusters when they are not ``compact'', and finally uses these micro-clusters as nodes to construct a similarity graph for more efficient spectral clustering. However, this top-down approach is challenging to adapt to unevenly distributed or structurally complex data. This is because constructing micro-clusters as a rough ball struggles to capture the shape and structure of data in a local range, and the simplistic splitting rule that solely targets ``compactness'' is susceptible to noise and variations in data density and leads to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Data Mining Algorithms and Applications · Data Management and Algorithms
