Classification and estimation in the Stochastic Block Model based on the   empirical degrees

Antoine Channarond; Jean-Jacques Daudin; St\'ephane Robin

arXiv:1110.6517·math.ST·November 1, 2011

Classification and estimation in the Stochastic Block Model based on the empirical degrees

Antoine Channarond, Jean-Jacques Daudin, St\'ephane Robin

PDF

TL;DR

This paper demonstrates that in the Stochastic Block Model, classification and estimation can be effectively performed using only empirical degree data, even for large networks and multiple classes.

Contribution

It introduces a new algorithm and theoretical results showing that degree-based methods suffice for classification, estimation, and model selection in the SBM.

Findings

01

Consistent estimators based on empirical degrees.

02

Algorithm effective for very large networks.

03

Bound on misclassification probability even with many classes.

Abstract

The Stochastic Block Model (Holland et al., 1983) is a mixture model for heterogeneous network data. Unlike the usual statistical framework, new nodes give additional information about the previous ones in this model. Thereby the distribution of the degrees concentrates in points conditionally on the node class. We show under a mild assumption that classification, estimation and model selection can actually be achieved with no more than the empirical degree data. We provide an algorithm able to process very large networks and consistent estimators based on it. In particular, we prove a bound of the probability of misclassification of at least one node, including when the number of classes grows.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.