Graph-Based Machine Learning Improves Just-in-Time Defect Prediction

Jonathan Bryan; Pablo Moriano

arXiv:2110.05371·cs.SE·April 17, 2023

Graph-Based Machine Learning Improves Just-in-Time Defect Prediction

Jonathan Bryan, Pablo Moriano

PDF

Open Access

TL;DR

This paper introduces a graph-based machine learning approach using contribution graphs of developers and source files to significantly improve the accuracy of just-in-time defect prediction in software engineering.

Contribution

It presents a novel graph-based ML method that leverages contribution graphs to outperform traditional intrinsic feature-based models in JIT defect prediction.

Findings

01

Achieved up to 77.55% F1 score in defect prediction

02

Improved MCC by 3% over state-of-the-art methods

03

Validated on 14 open-source projects

Abstract

The increasing complexity of today's software requires the contribution of thousands of developers. This complex collaboration structure makes developers more likely to introduce defect-prone changes that lead to software faults. Determining when these defect-prone changes are introduced has proven challenging, and using traditional machine learning (ML) methods to make these determinations seems to have reached a plateau. In this work, we build contribution graphs consisting of developers and source files to capture the nuanced complexity of changes required to build software. By leveraging these contribution graphs, our research shows the potential of using graph-based ML to improve Just-In-Time (JIT) defect prediction. We hypothesize that features extracted from the contribution graphs may be better predictors of defect-prone changes than intrinsic features derived from software…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection · Manufacturing Process and Optimization · Integrated Circuits and Semiconductor Failure Analysis

MethodsTest