Revisiting the Impact of Dependency Network Metrics on Software Defect Prediction
Lina Gong, Gopi Krishnan Rajbahadur, Ahmed E. Hassan, Shujuan Jiang

TL;DR
This study evaluates the effectiveness of Social Network Analysis (SNA) metrics compared to traditional code metrics in software defect prediction across various scenarios and contexts, revealing mixed but sometimes significant improvements.
Contribution
It provides a comprehensive case study comparing SNA and code metrics in multiple defect prediction scenarios and contexts, highlighting when SNA metrics are most beneficial.
Findings
SNA metrics improve SDP performance in 5 out of 9 scenarios
Improvements vary from marginal to large depending on the case
Future models should incorporate both SNA and code metrics separately
Abstract
Software dependency network metrics extracted from the dependency graph of the software modules by the application of Social Network Analysis (SNA metrics) have been shown to improve the performance of the Software Defect prediction (SDP) models. However, the relative effectiveness of these SNA metrics over code metrics in improving the performance of the SDP models has been widely debated with no clear consensus. Furthermore, some of the common SDP scenarios like predicting the number of defects in a module (Defect-count) in Cross-version and Cross-project SDP contexts remain unexplored. Such lack of clear directive on the effectiveness of SNA metrics when compared to the widely used code metrics prevents us from potentially building better performing SDP models. Therefore, through a case study of 9 open source software projects across 30 versions, we study the relative effectiveness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
