Information-theoretic thresholds for community detection in sparse networks

Jess Banks; Cristopher Moore

arXiv:1601.02658 · math.PR · April 25, 2016 · 39 cites

TL;DR

This paper gives upper and lower bounds on the information-theoretic threshold for community detection in the sparse stochastic block model, identifying when detection is statistically possible or impossible as a function of the number of groups, the average degree, and the strength of the community structure.

Contribution

It provides new upper and lower bounds on the detection threshold, including the scaling $d_c = \Theta\!\left( \frac{\log k}{k \lambda^2} \right)$ when the number of groups $k$ is large and $\lambda = O(1/k)$, with tighter results in certain regimes.

Findings

01

Detection is possible above the threshold with an exponential-time algorithm.

02

Below the threshold, no algorithm can outperform chance in labeling nodes.

03

The threshold scales as $\frac{\log k}{k \lambda^2}$ for large $k$.
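The findings above speak of labeling nodes "better than chance". One common way to quantify this is a rescaled overlap between a guessed labeling and the ground truth, maximized over relabelings of the groups. The sketch below is only an illustration under assumptions that are not taken from this page: the function name `overlap` is hypothetical, groups are assumed to be of equal size so that chance agreement is about $1/k$, and the brute-force search over permutations is practical only for small $k$.

```python
from itertools import permutations


def overlap(truth, guess, k):
    """Rescaled agreement between two labelings with labels in 0..k-1.

    Assumption (not from the source): equal-size groups, so a random
    guess agrees with the truth on roughly a 1/k fraction of nodes.
    Returns about 0 at chance level and 1 for a perfect labeling.
    """
    n = len(truth)
    best = 0
    # Group names are arbitrary, so try every relabeling of the guess.
    for perm in permutations(range(k)):
        agree = sum(1 for t, g in zip(truth, guess) if t == perm[g])
        best = max(best, agree)
    raw = best / n
    return (raw - 1 / k) / (1 - 1 / k)


truth = [0, 0, 1, 1, 2, 2]
relabeled = [1, 1, 2, 2, 0, 0]   # same partition, different group names
constant = [0, 0, 0, 0, 0, 0]    # ignores the graph entirely

print(overlap(truth, relabeled, 3))
print(overlap(truth, constant, 3))
```

On finite graphs the maximum over permutations gives a random guess a small positive score, so "better than chance" is usually read as an overlap that stays bounded away from zero as the number of nodes grows.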

Abstract

We give upper and lower bounds on the information-theoretic threshold for community detection in the stochastic block model. Specifically, let $k$ be the number of groups, $d$ be the average degree, the probability of edges between vertices within and between groups be $c_{in}/n$ and $c_{out}/n$ respectively, and let $\lambda = (c_{in} - c_{out})/(kd)$. We show that, when $k$ is large, and $\lambda = O(1/k)$, the critical value of $d$ at which community detection becomes possible -- in physical terms, the condensation threshold -- is \[ d_c = \Theta\!\left( \frac{\log k}{k \lambda^2} \right) \, , \] with tighter results in certain regimes. Above this threshold, we show that the only partitions of the nodes into $k$ groups are correlated with the ground truth, giving an exponential-time algorithm that performs better than chance -- in particular, detection is…
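To make the parameters in the abstract concrete, the following Python sketch computes $d$, $\lambda$, and the scaling quantity $\log k / (k \lambda^2)$ from $k$, $c_{in}$, and $c_{out}$. It assumes $k$ equal-size groups, so that $d = (c_{in} + (k-1)\,c_{out})/k$; that formula and the function names `sbm_parameters` and `threshold_scale` are assumptions made for illustration, not something stated on this page. The output is only the quantity inside the $\Theta(\cdot)$: the constant it hides is not given here, so the number is not the threshold $d_c$ itself.

```python
import math


def sbm_parameters(k, c_in, c_out):
    """Return (d, lam) for a block model with k equal-size groups.

    Assumption: with equal-size groups, a node has a 1/k fraction of
    potential neighbors in its own group, so the average degree is
    d = (c_in + (k - 1) * c_out) / k.
    """
    d = (c_in + (k - 1) * c_out) / k
    lam = (c_in - c_out) / (k * d)   # lambda as defined in the abstract
    return d, lam


def threshold_scale(k, lam):
    """Quantity inside the Theta(.) of d_c; hidden constants are omitted."""
    return math.log(k) / (k * lam ** 2)


# Illustrative values chosen for this example, not taken from the paper.
k, c_in, c_out = 20, 30.0, 5.0
d, lam = sbm_parameters(k, c_in, c_out)
scale = threshold_scale(k, lam)
print(f"d = {d}, lambda = {lam}, log k / (k lambda^2) = {scale:.3f}")
```

A negative `lam` corresponds to $c_{in} < c_{out}$; the scaling quantity depends on $\lambda$ only through $\lambda^2$, so the sign does not change it.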

Taxonomy

Topics: Complex Network Analysis Techniques · Opinion Dynamics and Social Influence · Random Matrices and Applications