Consistent model selection in a collection of stochastic block models

Lucie Arts (LPSM (UMR\_8001))

arXiv:2502.03848·math.ST·February 10, 2026

Consistent model selection in a collection of stochastic block models

Lucie Arts (LPSM (UMR\_8001))

PDF

Open Access 1 Repo

TL;DR

This paper presents a consistent penalized KT estimator for accurately determining the number of clusters in multi-layer and dynamic stochastic block models, applicable to both dense and sparse networks.

Contribution

It introduces a new penalized KT estimator that is proven to be consistent without requiring an upper bound on the number of clusters.

Findings

01

Estimator converges to the true number of clusters as network size increases

02

Works in both dense and sparse network regimes

03

Demonstrated effectiveness on synthetic datasets

Abstract

We introduce the penalized Krichevsky-Trofimov (KT) estimator as a convergent method for estimating the number of nodes clusters when observing multiple networks within both multi-layer and dynamic Stochastic Block Models. We establish the consistency of the KT estimator, showing that it converges to the correct number of clusters in both types of models when the number of nodes in the networks increases. Our estimator does not require a known upper bound on this number to be consistent. Furthermore, we show that these consistency results hold in both dense and sparse regimes, making the penalized KT estimator robust across various network configurations. We illustrate its performance on synthetic datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

arts-lucie/consistent-KT-estimator-in-the-MLSBM
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference