Structure and inference in annotated networks
M. E. J. Newman, Aaron Clauset

TL;DR
This paper presents a principled method for integrating node metadata with network structure to improve community detection, automatically learning when metadata are informative and enabling predictions about unobserved node memberships.
Contribution
The authors develop a novel approach that combines network data and metadata for community detection, which adaptively learns the relevance of metadata without assuming correlation.
Findings
Method improves community detection accuracy over using network or metadata alone.
The approach correctly identifies when metadata are informative or not.
Successfully applied to synthetic and real-world networks across domains.
Abstract
For many networks of scientific interest we know both the connections of the network and information about the network nodes, such as the age or gender of individuals in a social network, geographic location of nodes in the Internet, or cellular function of nodes in a gene regulatory network. Here we demonstrate how this "metadata" can be used to improve our analysis and understanding of network structure. We focus in particular on the problem of community detection in networks and develop a mathematically principled approach that combines a network and its metadata to detect communities more accurately than can be done with either alone. Crucially, the method does not assume that the metadata are correlated with the communities we are trying to find. Instead the method learns whether a correlation exists and correctly uses or ignores the metadata depending on whether they contain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
