Information-theoretic thresholds for community detection in sparse networks
Jess Banks, Cristopher Moore

TL;DR
This paper establishes precise bounds on the information-theoretic threshold for community detection in sparse stochastic block models, identifying when detection is statistically possible or impossible based on network parameters.
Contribution
It provides new upper and lower bounds on the detection threshold, including tight results for large numbers of communities and different regimes of community strength.
Findings
Detection is possible above the threshold with an exponential-time algorithm.
Below the threshold, no algorithm can outperform chance in labeling nodes.
The threshold scales as ( rac{\, ext{log} k}{k \, ext{lambda}^2}) for large k.
Abstract
We give upper and lower bounds on the information-theoretic threshold for community detection in the stochastic block model. Specifically, let be the number of groups, be the average degree, the probability of edges between vertices within and between groups be and respectively, and let . We show that, when is large, and , the critical value of at which community detection becomes possible -- in physical terms, the condensation threshold -- is \[ d_c = \Theta\!\left( \frac{\log k}{k \lambda^2} \right) \, , \] with tighter results in certain regimes. Above this threshold, we show that the only partitions of the nodes into groups are correlated with the ground truth, giving an exponential-time algorithm that performs better than chance -- in particular, detection is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Opinion Dynamics and Social Influence · Random Matrices and Applications
