Generative Adversarial Network-based Cross-Project Fault Prediction
Sourabh Pal

TL;DR
This paper introduces a GAN-based approach to reduce data divergence in cross-project software defect prediction, improving fault prediction accuracy across different projects despite class imbalance issues.
Contribution
It proposes a novel GAN-based data transformation method to enhance cross-project defect prediction by mitigating domain divergence.
Findings
GAN-based method improves defect prediction accuracy
Model performs well with JDT dataset as target
Class imbalance affects model performance
Abstract
Background: The early stage of defect prediction in the software development life cycle can reduce testing effort and ensure the quality of software. Due to the lack of historical data within the same project, Cross-Project Defect Prediction (CPDP) has become a popular research topic among researchers. CPDP trained classifiers based on labeled data sets of one project to predict fault in another project. Goals: Software Defect Prediction (SDP) data sets consist of manually designed static features, which are software metrics. In CPDP, source and target project data divergence is the major challenge in achieving high performance. In this paper, we propose a Generative Adversarial Network (GAN)-based data transformation to reduce data divergence between source and target projects. Method: We apply the Generative Adversarial Method where label data sets are choosing as real data, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software Testing and Debugging Techniques
