Selecting a significance level in sequential testing procedures for community detection

Riddhi Pratim Ghosh; Ian Barnett

arXiv:2209.07648 · stat.ME · September 19, 2022 · Appl. Netw. Sci.

Open Access

TL;DR

This paper introduces a principled method for selecting the significance level in sequential community detection algorithms, improving the reliability of estimating the number of communities in networks.

Contribution

It proposes a new algorithm to choose the significance level based on a user-defined tolerance ratio, enhancing existing sequential community detection methods.

Findings

01. Effective control of the tolerance ratio demonstrated in simulations

02. Improved accuracy in estimating the number of communities in real data

03. Versatile application across different network types

Abstract

Although numerous sequential algorithms have been developed to estimate community structure in networks, there is little guidance on, or study of, which significance level or stopping parameter to use in these sequential testing procedures. Most algorithms rely on prespecifying the number of communities or use an arbitrary stopping rule. We provide a principled approach to selecting a nominal significance level for sequential community detection procedures by controlling the tolerance ratio, defined as the ratio of the probability of underfitting to the probability of overfitting when estimating the number of clusters in a network. We introduce an algorithm for specifying this significance level from a user-specified tolerance ratio, and demonstrate its utility with a sequential modularity maximization approach in a stochastic block model framework. We evaluate the performance of the proposed…
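To make the tolerance ratio concrete, here is a minimal Python sketch under stated assumptions. It is not the authors' algorithm: the function names (`sequential_estimate`, `tolerance_ratio`, `select_alpha`), the grid search, and the idea of supplying precomputed p-values from simulated networks with a known number of communities are all hypothetical choices made for illustration. The sketch assumes a sequential procedure that tests "K = k" against "K > k" for k = 1, 2, ... and stops at the first test that is not rejected at level alpha.

```python
from typing import List, Optional, Sequence


def sequential_estimate(p_values: Sequence[float], alpha: float) -> int:
    """Estimate the number of communities from a sequence of p-values.

    Assumption: p_values[i] is the p-value for testing K = i + 1 against
    K > i + 1. The procedure stops at the first test not rejected at alpha.
    """
    for k, p in enumerate(p_values, start=1):
        if p >= alpha:
            return k
    # Every test was rejected: report one more than the last K tested.
    return len(p_values) + 1


def tolerance_ratio(estimates: Sequence[int], true_k: int) -> float:
    """Empirical ratio of underfitting to overfitting frequency."""
    under = sum(e < true_k for e in estimates)
    over = sum(e > true_k for e in estimates)
    if over == 0:
        # Undefined when there are no errors at all; infinite otherwise.
        return float("inf") if under > 0 else float("nan")
    return under / over


def select_alpha(
    simulated_p_values: Sequence[Sequence[float]],
    true_k: int,
    target_ratio: float,
    grid: Sequence[float],
) -> Optional[float]:
    """Pick the grid value whose empirical tolerance ratio is closest
    to the target (hypothetical grid search, for illustration only)."""
    best_alpha: Optional[float] = None
    best_gap = float("inf")
    for alpha in grid:
        estimates: List[int] = [
            sequential_estimate(p, alpha) for p in simulated_p_values
        ]
        ratio = tolerance_ratio(estimates, true_k)
        if ratio != ratio:  # NaN: no under- or overfitting observed
            continue
        gap = abs(ratio - target_ratio)
        if gap < best_gap:
            best_alpha, best_gap = alpha, gap
    return best_alpha


# Toy p-values from four hypothetical simulated networks with true K = 2.
toy_p_values = [
    [0.001, 0.20, 0.9],
    [0.03, 0.60, 0.9],
    [0.001, 0.04, 0.7],
    [0.08, 0.50, 0.9],
]
chosen = select_alpha(toy_p_values, true_k=2, target_ratio=1.0,
                      grid=[0.01, 0.05, 0.10])
print(chosen)
```

A smaller alpha makes the procedure stop earlier and so favors underfitting, while a larger alpha favors overfitting; the grid search simply looks for the level at which the two error frequencies balance in the requested proportion.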

Taxonomy

Topics: Single-cell and spatial transcriptomics · Gene Regulatory Network Analysis · Bioinformatics and Genomic Networks