How Early Participation Determines Long-Term Sustained Activity in GitHub Projects?
Wenxin Xiao, Hao He, Weiwei Xu, Yuxia Zhang, and Minghui Zhou

TL;DR
This study uses machine learning to analyze how early participation influences long-term sustainability of GitHub projects, highlighting the importance of experienced contributors, focused effort, and detailed documentation.
Contribution
It introduces a novel predictive approach combining the Blumberg model and machine learning to identify early factors that determine open source project sustainability.
Findings
Early experienced contributors positively impact long-term activity.
Concentrated focus and steady commitment promote sustainability.
Community composition with core and peripheral developers is crucial for organizational projects.
Abstract
Although the open source model bears many advantages in software development, open source projects are always hard to sustain. Previous research on open source sustainability mainly focuses on projects that have already reached a certain level of maturity (e.g., with communities, releases, and downstream projects). However, limited attention is paid to the development of (sustainable) open source projects in their infancy, and we believe an understanding of early sustainability determinants is crucial for project initiators, incubators, newcomers, and users. In this paper, we aim to explore the relationship between early participation factors and long-term project sustainability. We leverage a novel methodology combining the Blumberg model of performance and machine learning to predict the sustainability of 290,255 GitHub projects. Specificially, we train an XGBoost model based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpen Source Software Innovations · Software Engineering Research · Knowledge Management and Sharing
