Structure and Overlaps of Communities in Networks
Jaewon Yang, Jure Leskovec

TL;DR
This paper investigates the structure and overlaps of communities in real-world networks, revealing dense overlaps and hub nodes, and introduces a new model, AGM, to better capture these features.
Contribution
It provides empirical analysis of ground-truth communities in large networks and proposes the AGM model to accurately represent overlapping community structures.
Findings
Community overlaps are more densely connected than non-overlapping parts.
High-degree hub nodes reside in community overlaps.
Existing models fail to capture dense overlaps, but AGM succeeds.
Abstract
One of the main organizing principles in real-world social, information and technological networks is that of network communities, where sets of nodes organize into densely linked clusters. Even though detection of such communities is of great interest, understanding the structure communities in large networks remains relatively limited. Due to unavailability of labeled ground-truth data it is practically impossible to evaluate and compare different models and notions of communities on a large scale. In this paper we identify 6 large social, collaboration, and information networks where nodes explicitly state their community memberships. We define ground-truth communities by using these explicit memberships. We then empirically study how such ground-truth communities emerge in networks and how they overlap. We observe some surprising phenomena. First, ground-truth communities contain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Opinion Dynamics and Social Influence · Peer-to-Peer Network Technologies
