Identifying Experts in Software Libraries and Frameworks among GitHub Users
Joao Eduardo Montandon, Luciana Lourdes Silva, Marco Tulio Valente

TL;DR
This paper evaluates machine learning techniques to identify software developers' expertise in popular JavaScript libraries using GitHub activity data, highlighting challenges and proposing a clustering-based method validated with LinkedIn data.
Contribution
It introduces a clustering-based approach for expert identification and provides a publicly available dataset linking GitHub activity with self-reported expertise.
Findings
Machine learning classifiers face challenges in predicting library expertise.
Clustering combined with LinkedIn data can effectively recommend experts.
A public dataset of 575 developers' expertise is provided.
Abstract
Software development increasingly depends on libraries and frameworks to increase productivity and reduce time-to-market. Despite this fact, we still lack techniques to assess developers expertise in widely popular libraries and frameworks. In this paper, we evaluate the performance of unsupervised (based on clustering) and supervised machine learning classifiers (Random Forest and SVM) to identify experts in three popular JavaScript libraries: facebook/react, mongodb/node-mongodb, and socketio/socket.io. First, we collect 13 features about developers activity on GitHub projects, including commits on source code files that depend on these libraries. We also build a ground truth including the expertise of 575 developers on the studied libraries, as self-reported by them in a survey. Based on our findings, we document the challenges of using machine learning classifiers to predict…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Open Source Software Innovations · Wikis in Education and Collaboration
