Community Detection with Known, Unknown, or Partially Known Auxiliary Latent Variables
Mohammad Esmaeili, Aria Nosratinia

TL;DR
This paper investigates community detection in graphs influenced by auxiliary latent variables, analyzing conditions for exact recovery with known, unknown, or partially known auxiliary information, and proposes a semidefinite programming solution.
Contribution
It introduces a framework for community detection considering auxiliary latent variables and provides recovery guarantees with a new semidefinite programming algorithm.
Findings
Exact recovery conditions are characterized for different auxiliary variable knowledge scenarios.
Semidefinite programming achieves exact recovery down to the maximum likelihood threshold.
The approach handles unknown, partially known, and fully known auxiliary latent variables.
Abstract
Empirical observations suggest that in practice, community membership does not completely explain the dependency between the edges of an observation graph. The residual dependence of the graph edges are modeled in this paper, to first order, by auxiliary node latent variables that affect the statistics of the graph edges but carry no information about the communities of interest. We then study community detection in graphs obeying the stochastic block model and censored block model with auxiliary latent variables. We analyze the conditions for exact recovery when these auxiliary latent variables are unknown, representing unknown nuisance parameters or model mismatch. We also analyze exact recovery when these secondary latent variables have been either fully or partially revealed. Finally, we propose a semidefinite programming algorithm for recovering the desired labels when the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Causal Inference Techniques · Data-Driven Disease Surveillance
