Stochastic Block Models and Reconstruction
Elchanan Mossel, Joe Neeman, Allan Sly

TL;DR
This paper rigorously analyzes the stochastic block model in the sparse regime, proving part of a conjecture about the threshold for successful clustering and parameter estimation, and establishing connections to spin-glass models and reconstruction problems.
Contribution
It proves the impossibility of clustering below the conjectured threshold and provides an efficient method for parameter estimation above it, confirming half of the non-rigorous predictions.
Findings
Clustering is impossible if (a - b)^2 < 2(a + b).
Parameter estimation is feasible when (a - b)^2 > 2(a + b).
Established a connection between clustering, spin-glass models, and reconstruction problems.
Abstract
The planted partition model (also known as the stochastic blockmodel) is a classical cluster-exhibiting random graph model that has been extensively studied in statistics, physics, and computer science. In its simplest form, the planted partition model is a model for random graphs on nodes with two equal-sized clusters, with an between-class edge probability of and a within-class edge probability of . Although most of the literature on this model has focused on the case of increasing degrees (ie.\ as ), the sparse case is interesting both from a mathematical and an applied point of view. A striking conjecture of Decelle, Krzkala, Moore and Zdeborov\'a based on deep, non-rigorous ideas from statistical physics gave a precise prediction for the algorithmic threshold of clustering in the sparse planted partition model. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Markov Chains and Monte Carlo Methods · Random Matrices and Applications
