Fitting networks with a cancellation trick
Jiashun Jin, Jingming Wang

TL;DR
This paper introduces the logit-DCBM model combining ideas from DCBM, LSM, and $eta$-model, and proposes R-SCORE, a recursive community detection algorithm that improves parameter fitting and community detection accuracy.
Contribution
The paper presents the logit-DCBM model and the R-SCORE algorithm, addressing the challenge of fitting nonlinear network models and enhancing community detection performance.
Findings
R-SCORE outperforms existing spectral methods in numerical experiments.
Theoretically, R-SCORE has a faster Hamming error rate than SCORE in certain sparse regimes.
R-SCORE effectively removes nonlinear factors, approximating a low-rank model for better community detection.
Abstract
The degree-corrected block model (DCBM), latent space model (LSM), and -model are all popular network models. We combine their modeling ideas and propose the logit-DCBM as a new model. Similar as the -model and LSM, the logit-DCBM contains nonlinear factors, where fitting the parameters is a challenging open problem. We resolve this problem by introducing a cancellation trick. We also propose R-SCORE as a recursive community detection algorithm, where in each iteration, we first use the idea above to update our parameter estimation, and then use the results to remove the nonlinear factors in the logit-DCBM so the renormalized model approximately satisfies a low-rank model, just like the DCBM. Our numerical study suggests that R-SCORE significantly improves over existing spectral approaches in many cases. Also, theoretically, we show that the Hamming error rate of R-SCORE…
Peer Reviews
Decision·ICLR 2025 Poster
See summary.
Overall, I am not too sure why the logit-DCBM model is worth studying. The authors mention some motivation in lines 58-60 and lines 69-71, but I did not find it compelling enough. I found the motivation argued hand-heavily without any concrete and interesting reasons. Line 58, ",to many statisticians, (5) is preferred.": Which statisticians? References? Line 59-60: I don't think the logistic regression is the only "recommended" model by text bool for binary data. Also, what does "recommending"
1. Well-written 2. Appears rigorous and professional 3. A generalization of the DCBM model to the logit case is an interesting direction
Accessibility to readers less familiar with the literature on community detection No table of notations
The logit-DCMB model provides a nice combination of the DCMB, LSM, and $\beta$ models. The "cancellation trick" allows disregarding of nonlinear terms by means of a ratio of two large sums used as estimators. This is an interesting idea that can be applied to various other settings.
The scope of the paper is [quote] "to propose a nonlinear version of DCMB so that it will hopefully be more acceptable" and extend SCORE to it. I find this goal somewhat incremental and only marginally related to the ICLR community. The paper is hard to follow, with many acronyms that are not spelt out and sentences that are ambiguous. An example: line 085, "The logit-DCBM (and all other models mentioned above) are so-called latent variable models, where Π is the matrix of latent variables. Fo
Videos
Taxonomy
TopicsAdvanced Optical Network Technologies
