Phase transitions in semisupervised clustering of sparse networks

Pan Zhang; Cristopher Moore; and Lenka Zdeborov\'a

arXiv:1404.7789·cs.SI·November 20, 2014

Phase transitions in semisupervised clustering of sparse networks

Pan Zhang, Cristopher Moore, and Lenka Zdeborov\'a

PDF

TL;DR

This paper investigates how partial label information influences the phase transitions in semisupervised community detection within sparse networks, revealing critical points where accuracy improves abruptly or continuously.

Contribution

It characterizes the phase diagram of semisupervised clustering in stochastic block models, identifying discontinuous and continuous transitions in detection accuracy as a function of known labels.

Findings

01

Detectability transition disappears for two groups with any label knowledge.

02

For larger groups, a line of easy/hard transition points emerges, with accuracy jumping at a critical label fraction.

03

Similar phase transition behaviors are observed in real-world network data.

Abstract

Predicting labels of nodes in a network, such as community memberships or demographic variables, is an important problem with applications in social and biological networks. A recently-discovered phase transition puts fundamental limits on the accuracy of these predictions if we have access only to the network topology. However, if we know the correct labels of some fraction $α$ of the nodes, we can do better. We study the phase diagram of this "semisupervised" learning problem for networks generated by the stochastic block model. We use the cavity method and the associated belief propagation algorithm to study what accuracy can be achieved as a function of $α$ . For $k = 2$ groups, we find that the detectability transition disappears for any $α > 0$ , in agreement with previous work. For larger $k$ where a hard but detectable regime exists, we find that the easy/hard…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.