TL;DR
This paper introduces a refined k-nearest neighbor graph for spectral clustering that reduces computational complexity while maintaining performance, by selectively keeping edges based on local statistics and an optional cluster number selection.
Contribution
A novel graph refinement method that reduces edges in k-NN graphs using local statistics, improving efficiency and consistency in spectral clustering.
Findings
Maintains clustering performance with fewer edges.
Outperforms approximate spectral clustering methods in consistency.
Effective on both synthetic and real datasets.
Abstract
Spectral clustering became a popular choice for data clustering for its ability of uncovering clusters of different shapes. However, it is not always preferable over other clustering methods due to its computational demands. One of the effective ways to bypass these computational demands is to perform spectral clustering on a subset of points (data representatives) then generalize the clustering outcome, this is known as approximate spectral clustering (ASC). ASC uses sampling or quantization to select data representatives. This makes it vulnerable to 1) performance inconsistency (since these methods have a random step either in initialization or training), 2) local statistics loss (because the pairwise similarities are extracted from data representatives instead of data points). We proposed a refined version of -nearest neighbor graph, in which we keep data points and aggressively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodsk-Means Clustering · k-Nearest Neighbors · Spectral Clustering
