Impact of regularization on Spectral Clustering
Antony Joseph, Bin Yu

TL;DR
This paper provides a theoretical analysis of how regularization improves spectral clustering, especially under the stochastic block model, by relaxing degree assumptions and introducing a data-driven method for selecting the regularization parameter.
Contribution
It quantifies the benefits of regularization in spectral clustering, removes minimum degree constraints, and proposes a practical data-driven parameter selection method.
Findings
Regularization enhances spectral clustering performance.
Minimum degree assumptions can be relaxed with large regularization.
The proposed DKest method effectively chooses the regularization parameter.
Abstract
The performance of spectral clustering can be considerably improved via regularization, as demonstrated empirically in Amini et. al (2012). Here, we provide an attempt at quantifying this improvement through theoretical analysis. Under the stochastic block model (SBM), and its extensions, previous results on spectral clustering relied on the minimum degree of the graph being sufficiently large for its good performance. By examining the scenario where the regularization parameter is large we show that the minimum degree assumption can potentially be removed. As a special case, for an SBM with two blocks, the results require the maximum degree to be large (grow faster than ) as opposed to the minimum degree. More importantly, we show the usefulness of regularization in situations where not all nodes belong to well-defined clusters. Our results rely on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Sparse and Compressive Sensing Techniques · Advanced Clustering Algorithms Research
MethodsSpectral Clustering
