Phase transitions in semisupervised clustering of sparse networks
Pan Zhang, Cristopher Moore, and Lenka Zdeborov\'a

TL;DR
This paper investigates how partial label information influences the phase transitions in semisupervised community detection within sparse networks, revealing critical points where accuracy improves abruptly or continuously.
Contribution
It characterizes the phase diagram of semisupervised clustering in stochastic block models, identifying discontinuous and continuous transitions in detection accuracy as a function of known labels.
Findings
Detectability transition disappears for two groups with any label knowledge.
For larger groups, a line of easy/hard transition points emerges, with accuracy jumping at a critical label fraction.
Similar phase transition behaviors are observed in real-world network data.
Abstract
Predicting labels of nodes in a network, such as community memberships or demographic variables, is an important problem with applications in social and biological networks. A recently-discovered phase transition puts fundamental limits on the accuracy of these predictions if we have access only to the network topology. However, if we know the correct labels of some fraction of the nodes, we can do better. We study the phase diagram of this "semisupervised" learning problem for networks generated by the stochastic block model. We use the cavity method and the associated belief propagation algorithm to study what accuracy can be achieved as a function of . For groups, we find that the detectability transition disappears for any , in agreement with previous work. For larger where a hard but detectable regime exists, we find that the easy/hard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
