Graph inference with clustering and false discovery rate control
Tabea Rebafka, Etienne Roquain, Fanny Villers

TL;DR
This paper introduces a method for graph inference in a noisy stochastic block model, combining parameter estimation, clustering, and FDR control, with theoretical guarantees and superior performance over classical methods.
Contribution
It develops a novel approach that integrates FDR control with SBM-based clustering, providing theoretical guarantees and robustness beyond the model assumptions.
Findings
The method controls FDR while achieving high TDR.
It outperforms classical FDR methods ignoring SBM topology.
The approach is robust to model mis-specification.
Abstract
In this paper, a noisy version of the stochastic block model (NSBM) is introduced and we investigate the three following statistical inferences in this model: estimation of the model parameters, clustering of the nodes and identification of the underlying graph. While the two first inferences are done by using a variational expectation-maximization (VEM) algorithm, the graph inference is done by controlling the false discovery rate (FDR), that is, the average proportion of errors among the edges declared significant, and by maximizing the true discovery rate (TDR), that is, the average proportion of edges declared significant among the true edges. Provided that the VEM algorithm provides reliable parameter estimates and clustering, we theoretically show that our procedure does control the FDR while satisfying an optimal TDR property, up to remainder terms that become small when the size…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Statistical Methods and Inference · Bayesian Methods and Mixture Models
