Semi-Supervised Clustering of Sparse Graphs: Crossing the Information-Theoretic Threshold
Junda Sheng, Thomas Strohmer

TL;DR
This paper demonstrates that semi-supervised learning enables effective community detection in sparse graphs beyond the classical information-theoretic limits, introducing new algorithms and insights into network models.
Contribution
It shows that partial label information overcomes fundamental detection limits in sparse graphs and introduces two efficient algorithms for semi-supervised clustering.
Findings
Detection is feasible with any fraction of labels beyond the phase transition.
Two algorithms effectively integrate label info with graph structure.
The work extends stochastic block model analysis to semi-supervised settings.
Abstract
The stochastic block model is a canonical random graph model for clustering and community detection on network-structured data. Decades of extensive study on the problem have established many profound results, among which the phase transition at the Kesten-Stigum threshold is particularly interesting both from a mathematical and an applied standpoint. It states that no estimator based on the network topology can perform substantially better than chance on sparse graphs if the model parameter is below a certain threshold. Nevertheless, if we slightly extend the horizon to the ubiquitous semi-supervised setting, such a fundamental limitation will disappear completely. We prove that with an arbitrary fraction of the labels revealed, the detection problem is feasible throughout the parameter domain. Moreover, we introduce two efficient algorithms, one combinatorial and one based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Clustering Algorithms Research · Advanced Graph Neural Networks
