Optimal Estimation of the Number of Communities
Jiashun Jin, Zheng Tracy Ke, Shengming Luo, Minzhe Wang

TL;DR
This paper introduces StGoF, a stepwise method for estimating the number of communities in networks, which is consistent and achieves optimal phase transition under certain signal-to-noise ratio conditions.
Contribution
The paper proposes a novel stepwise goodness-of-fit approach, StGoF, that effectively estimates the number of communities even with degree heterogeneity and varying sparsity levels, overcoming analytical challenges.
Findings
StGoF provides a consistent estimate of K.
StGoF is uniformly consistent when SNR tends to infinity.
The method achieves the optimal phase transition for community detection.
Abstract
In network analysis, how to estimate the number of communities is a fundamental problem. We consider a broad setting where we allow severe degree heterogeneity and a wide range of sparsity levels, and propose Stepwise Goodness-of-Fit (StGoF) as a new approach. This is a stepwise algorithm, where for , we alternately use a community detection step and a goodness-of-fit (GoF) step. We adapt SCORE \cite{SCORE} for community detection, and propose a new GoF metric. We show that at step , the GoF metric diverges to in probability for all and converges to if . This gives rise to a consistent estimate for . Also, we discover the right way to define the signal-to-noise ratio (SNR) for our problem and show that consistent estimates for do not exist if , and StGoF is uniformly consistent for if…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Opinion Dynamics and Social Influence · Advanced Graph Neural Networks
