Finding Community Structure in Mega-scale Social Networks
Ken Wakita, Toshiyuki Tsurumi

TL;DR
This paper improves the scalability of the CNM community detection algorithm for large social networks by introducing metrics to balance community merging, enabling analysis of networks with millions of nodes efficiently.
Contribution
The paper proposes three new variants of the CNM algorithm with metrics to improve scalability and efficiency in large-scale social network community detection.
Findings
Significant speedup over original CNM algorithm.
Ability to process networks with millions of nodes within minutes.
Enhanced modularity in detected community structures.
Abstract
Community analysis algorithm proposed by Clauset, Newman, and Moore (CNM algorithm) finds community structure in social networks. Unfortunately, CNM algorithm does not scale well and its use is practically limited to networks whose sizes are up to 500,000 nodes. The paper identifies that this inefficiency is caused from merging communities in unbalanced manner. The paper introduces three kinds of metrics (consolidation ratio) to control the process of community analysis trying to balance the sizes of the communities being merged. Three flavors of CNM algorithms are built incorporating those metrics. The proposed techniques are tested using data sets obtained from existing social networking service that hosts 5.5 million users. All the methods exhibit dramatic improvement of execution efficiency in comparison with the original CNM algorithm and shows high scalability. The fastest method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Peer-to-Peer Network Technologies · Opinion Dynamics and Social Influence
