Scalable Spectral Clustering Using Random Binning Features
Lingfei Wu, Pin-Yu Chen, Ian En-Hsu Yen, Fangli Xu, Yinglong Xia and, Charu Aggarwal

TL;DR
This paper introduces a scalable spectral clustering method using Random Binning features that significantly reduces computational complexity while maintaining accuracy, enabling efficient clustering on large datasets.
Contribution
The paper proposes a novel spectral clustering approach that uses Random Binning features to accelerate similarity graph construction and eigendecomposition, achieving linear scalability.
Findings
Reduces spectral clustering computational cost from quadratic to linear.
Achieves similar accuracy to traditional methods on large datasets.
Outperforms or matches state-of-the-art methods in accuracy and runtime.
Abstract
Spectral clustering is one of the most effective clustering approaches that capture hidden cluster structures in the data. However, it does not scale well to large-scale problems due to its quadratic complexity in constructing similarity graphs and computing subsequent eigendecomposition. Although a number of methods have been proposed to accelerate spectral clustering, most of them compromise considerable information loss in the original data for reducing computational bottlenecks. In this paper, we present a novel scalable spectral clustering method using Random Binning features (RB) to simultaneously accelerate both similarity graph construction and the eigendecomposition. Specifically, we implicitly approximate the graph similarity (kernel) matrix by the inner product of a large sparse feature matrix generated by RB. Then we introduce a state-of-the-art SVD solver to effectively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Machine Learning and ELM · Remote-Sensing Image Classification
MethodsSpectral Clustering
