TL;DR
This paper introduces a fast, statistically principled method for detecting overlapping and non-overlapping communities in large networks, demonstrating competitive accuracy and efficiency on real and synthetic data.
Contribution
It presents a novel generative model-based approach with a closed-form EM algorithm capable of analyzing large-scale networks efficiently.
Findings
Method is scalable to networks with millions of nodes.
Achieves accuracy comparable to existing community detection algorithms.
Effective for both overlapping and non-overlapping community detection.
Abstract
A fundamental problem in the analysis of network data is the detection of network communities, groups of densely interconnected nodes, which may be overlapping or disjoint. Here we describe a method for finding overlapping communities based on a principled statistical approach using generative network models. We show how the method can be implemented using a fast, closed-form expectation-maximization algorithm that allows us to analyze networks of millions of nodes in reasonable running times. We test the method both on real-world networks and on synthetic benchmarks and find that it gives results competitive with previous methods. We also show that the same approach can be used to extract nonoverlapping community divisions via a relaxation method, and demonstrate that the algorithm is competitively fast and accurate for the nonoverlapping problem.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
