Community Formation and Detection on GitHub Collaboration Networks
Behnaz Moradi-Jamei, Brandon L. Kramer, J. Bayoan Santiago Calderon, Gizem Korkmaz

TL;DR
This paper introduces a novel community detection method for GitHub collaboration networks that accounts for cyclic structures and contribution strengths, revealing insights into team formation influenced by location and programming language.
Contribution
It combines Renewal-Nonbacktracking Random Walks with the Louvain method to improve detection of small-scale OSS communities on a large-scale dataset.
Findings
Enhanced detection of small OSS teams.
Users tend to join established communities.
Location and programming language influence community formation.
Abstract
This paper studies community formation in OSS collaboration networks. While most current work examines the emergence of small-scale OSS projects, our approach draws on a large-scale historical dataset of 1.8 million GitHub users and their repository contributions. OSS collaborations are characterized by small groups of users that work closely together, leading to the presence of communities defined by short cycles in the underlying network structure. To understand the impact of this phenomenon, we apply a pre-processing step that accounts for the cyclic network structure by using Renewal-Nonbacktracking Random Walks (RNBRW) and the strength of pairwise collaborations before implementing the Louvain method to identify communities within the network. Equipping Louvain with RNBRW and the contribution strength provides a more assertive approach for detecting small-scale teams and reveals…
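The cycle-aware pre-processing step can be illustrated with a minimal sketch. It assumes the usual description of RNBRW: a non-backtracking walk starts on a random edge, runs until it revisits a node, credits the edge that closed the cycle, and then restarts. The function name `rnbrw_weights`, the adjacency-list input `adj`, the parameter `n_walks`, and the toy graph are all hypothetical and not taken from the paper. Contribution strength is left out because the abstract does not say how it is combined with the RNBRW weights.

```python
import random
from collections import defaultdict


def rnbrw_weights(adj, n_walks=10000, seed=0):
    """Estimate cycle-based edge weights with renewal non-backtracking walks.

    adj: dict mapping each node to a list of its neighbours (undirected graph).
    Returns a dict {frozenset({u, v}): weight} with weights summing to 1.
    Edges that never close a cycle are absent (weight 0).
    """
    rng = random.Random(seed)
    # Both orientations of every undirected edge, so a walk can start either way.
    directed_edges = [(u, v) for u in adj for v in adj[u]]
    counts = defaultdict(int)

    for _ in range(n_walks):
        prev, cur = rng.choice(directed_edges)
        visited = {prev, cur}
        while True:
            # Non-backtracking: never step straight back to the previous node.
            candidates = [w for w in adj[cur] if w != prev]
            if not candidates:
                break  # dead end: renew without crediting any edge
            nxt = rng.choice(candidates)
            if nxt in visited:
                # The walk closed a cycle; credit the edge that completed it.
                counts[frozenset((cur, nxt))] += 1
                break
            visited.add(nxt)
            prev, cur = cur, nxt

    total = sum(counts.values())
    return {edge: c / total for edge, c in counts.items()} if total else {}


# Toy graph (assumption): two triangles joined by a single bridge edge 2-3.
adj = {
    0: [1, 2],
    1: [0, 2],
    2: [0, 1, 3],
    3: [2, 4, 5],
    4: [3, 5],
    5: [3, 4],
}
weights = rnbrw_weights(adj, n_walks=5000, seed=1)
for edge, w in sorted(weights.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(edge), round(w, 3))
```

In this toy graph the bridge between the two triangles lies on no cycle, so it never receives credit, while the triangle edges do. Passing such weights to a weighted Louvain implementation (for example, the `weight` argument of NetworkX's `louvain_communities`) would then favor tightly knit, short-cycle groups over loosely attached edges.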
Taxonomy
Topics: Open Source Software Innovations · Software Engineering Research · Peer-to-Peer Network Technologies
