Supporting the Task-driven Skill Identification in Open Source Project Issue Tracking Systems
Fabio Santos

TL;DR
This paper explores automatic API-domain labeling of open source issues to help newcomers identify suitable tasks, using social network analysis and machine learning to improve onboarding and skill matching.
Contribution
It introduces an approach combining API-domain labeling and social network metrics to enhance task recommendation for OSS newcomers, with comparable prediction accuracy to state-of-the-art methods.
Findings
API-domain labels are relevant for experienced practitioners
Predictions achieve an average precision of 75.5%
Organizing issues with labels aids diverse OSS roles
Abstract
Selecting an appropriate task is challenging for contributors to Open Source Software (OSS), mainly for those who are contributing for the first time. Therefore, researchers and OSS projects have proposed various strategies to aid newcomers, including labeling tasks. We investigate the automatic labeling of open issues strategy to help the contributors to pick a task to contribute. We label the issues with API-domains--categories of APIs parsed from the source code used to solve the issues. We plan to add social network analysis metrics from the issues conversations as new predictors. By identifying the skills, we claim the contributor candidates should pick a task more suitable. We analyzed interview transcripts and the survey's open-ended questions to comprehend the strategies used to assist in onboarding contributors and used to pick up an issue. We applied quantitative studies to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Open Source Software Innovations · Wikis in Education and Collaboration
MethodsOntology
