Identifying Source Code File Experts
Ot\'avio Cury, Guilherme Avelino, Pedro Santos Neto, Ricardo Britto,, Marco T\'ulio Valente

TL;DR
This study investigates methods to improve the identification of source code file experts by analyzing development history and applying machine learning, finding comparable performance between linear techniques and classifiers.
Contribution
It explores new variables and machine learning approaches to enhance expert identification accuracy in software projects.
Findings
First authorship correlates positively with source code knowledge.
Recency of modification correlates negatively with source code knowledge.
Machine learning classifiers outperform linear techniques in public datasets.
Abstract
In software development, the identification of source code file experts is an important task. Identifying these experts helps to improve software maintenance and evolution activities, such as developing new features, code reviews, and bug fixes. Although some studies have proposed repository mining techniques to automatically identify source code experts, there are still gaps in this area that can be explored. For example, investigating new variables related to source code knowledge and applying machine learning aiming to improve the performance of techniques to identify source code experts. The goal of this study is to investigate opportunities to improve the performance of existing techniques to recommend source code files experts. We built an oracle by collecting data from the development history and surveying developers of 113 software projects. Then, we use this oracle to: (i)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Open Source Software Innovations · Software Engineering Techniques and Practices
