Exact Recovery and Bregman Hard Clustering of Node-Attributed Stochastic Block Model
Maximilien Dreveton, Felipe S. Fernandes, Daniel R. Figueiredo

TL;DR
This paper establishes theoretical limits and proposes a novel clustering algorithm for node-attributed stochastic block models, effectively integrating network and attribute data for exact community recovery.
Contribution
It introduces an information-theoretic criterion for exact recovery and an iterative likelihood-maximizing algorithm based on exponential family models.
Findings
The phase transition for exact recovery is characterized by Chernoff-Hellinger divergence.
The proposed algorithm outperforms existing methods in synthetic experiments.
Network and attribute information can be exchanged to improve clustering accuracy.
Abstract
Network clustering tackles the problem of identifying sets of nodes (communities) that have similar connection patterns. However, in many scenarios, nodes also have attributes that are correlated with the clustering structure. Thus, network information (edges) and node information (attributes) can be jointly leveraged to design high-performance clustering algorithms. Under a general model for the network and node attributes, this work establishes an information-theoretic criterion for the exact recovery of community labels and characterizes a phase transition determined by the Chernoff-Hellinger divergence of the model. The criterion shows how network and attribute information can be exchanged in order to have exact recovery (e.g., more reliable network information requires less reliable attribute information). This work also presents an iterative clustering algorithm that maximizes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsComplex Network Analysis Techniques · Mental Health Research Topics · Opinion Dynamics and Social Influence
