DeMuVGN: Effective Software Defect Prediction Model by Learning Multi-view Software Dependency via Graph Neural Networks
Yu Qiao, Lina Gong, Yu Zhao, Yongwei Wang, Mingqiang Wei

TL;DR
DeMuVGN is a novel graph neural network model that learns multi-view software dependencies, including code, call, and developer relations, to improve defect prediction accuracy across various software projects.
Contribution
It introduces a multi-view dependency graph and leverages GNNs combined with SMOTE to enhance defect prediction, addressing limitations of previous code-only dependency methods.
Findings
Multi-view graphs improve F1 scores by 11.1% to 12.1%.
DeMuVGN boosts F1 scores by 17.4% to 45.8% within projects.
Model performs well in both mature and new software versions.
Abstract
Software defect prediction (SDP) aims to identify high-risk defect modules in software development, optimizing resource allocation. While previous studies show that dependency network metrics improve defect prediction, most methods focus on code-based dependency graphs, overlooking developer factors. Current metrics, based on handcrafted features like ego and global network metrics, fail to fully capture defect-related information. To address this, we propose DeMuVGN, a defect prediction model that learns multi-view software dependency via graph neural networks. We introduce a Multi-view Software Dependency Graph (MSDG) that integrates data, call, and developer dependencies. DeMuVGN also leverages the Synthetic Minority Oversampling Technique (SMOTE) to address class imbalance and enhance defect module identification. In a case study of eight open-source projects across 20 versions,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software System Performance and Reliability
MethodsFocus
