Community Detection Algorithm Evaluation using Size and Hashtags
Paul Wagenseller III, Feng Wang

TL;DR
This paper evaluates community detection algorithms in social media by considering community size, coverage, modularity, and user interest, proposing a clique-based baseline and analyzing the effectiveness of existing algorithms.
Contribution
It introduces a new community detection algorithm based on cliques and provides a comprehensive comparison with existing algorithms across multiple metrics.
Findings
Both the proposed clique-based algorithm and InfoMap perform well across metrics.
Many popular algorithms produce communities that are either too small or too large.
Community size should be limited to enhance stability and practical value.
Abstract
Understanding community structure in social media is critical due to its broad applications such as friend recommendations, link predictions and collaborative filtering. However, there is no widely accepted definition of community in literature. Existing work use structure related metrics such as modularity and function related metrics such as ground truth to measure the performance of community detection algorithms, while ignoring an important metric, size of the community. [1] suggests that the size of community with strong ties in social media should be limited to 150. As we discovered in this paper, the majority of the communities obtained by many popular community detection algorithms are either very small or very large. Too small communities don't have practical value and too large communities contain weak connections therefore not stable. In this paper, we compare various…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Opinion Dynamics and Social Influence · Advanced Clustering Algorithms Research
