Towards Scalable Spectral Clustering via Spectrum-Preserving Sparsification
Yongyu Wang, Zhuo Feng

TL;DR
This paper presents a scalable graph sparsification method that preserves spectral properties, enabling efficient spectral clustering on large datasets without losing solution quality.
Contribution
Introduces a spectrum-preserving sparsification algorithm that constructs ultra-sparse graphs with guaranteed spectral preservation, improving scalability of spectral clustering.
Findings
Significant reduction in NN graph complexity.
Speedups in spectral clustering tasks.
Effective preservation of key eigenvectors.
Abstract
The eigendeomposition of nearest-neighbor (NN) graph Laplacian matrices is the main computational bottleneck in spectral clustering. In this work, we introduce a highly-scalable, spectrum-preserving graph sparsification algorithm that enables to build ultra-sparse NN (u-NN) graphs with guaranteed preservation of the original graph spectrums, such as the first few eigenvectors of the original graph Laplacian. Our approach can immediately lead to scalable spectral clustering of large data networks without sacrificing solution quality. The proposed method starts from constructing low-stretch spanning trees (LSSTs) from the original graphs, which is followed by iteratively recovering small portions of "spectrally critical" off-tree edges to the LSSTs by leveraging a spectral off-tree embedding scheme. To determine the suitable amount of off-tree edges to be recovered to the LSSTs, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques · Remote-Sensing Image Classification · Face and Expression Recognition
MethodsSpectral Clustering
