Defect Category Prediction Based on Multi-Source Domain Adaptation

Ying Xing; Mengci Zhao; Bin Yang; Yuwei Zhang; Wenjin Li; and Jiawei Gu; Jun Yuan

arXiv:2405.10511·cs.SE·May 20, 2024

Defect Category Prediction Based on Multi-Source Domain Adaptation

Ying Xing, Mengci Zhao, Bin Yang, Yuwei Zhang, Wenjin Li, and Jiawei Gu, Jun Yuan

PDF

TL;DR

This paper introduces a multi-source domain adaptation framework with adversarial training and attention mechanisms to improve defect category prediction across different software projects, addressing data scarcity and generalization issues.

Contribution

It reformulates defect prediction as a multi-label classification problem and proposes a novel domain adaptation approach combining adversarial training and attention mechanisms.

Findings

01

Significant performance improvements over baselines on 8 real-world projects.

02

Effective mitigation of domain discrepancies in defect prediction.

03

Enhanced generalization to new software projects.

Abstract

In recent years, defect prediction techniques based on deep learning have become a prominent research topic in the field of software engineering. These techniques can identify potential defects without executing the code. However, existing approaches mostly concentrate on determining the presence of defects at the method-level code, lacking the ability to precisely classify specific defect categories. Consequently, this undermines the efficiency of developers in locating and rectifying defects. Furthermore, in practical software development, new projects often lack sufficient defect data to train high-accuracy deep learning models. Models trained on historical data from existing projects frequently struggle to achieve satisfactory generalization performance on new projects. Hence, this paper initially reformulates the traditional binary defect prediction task into a multi-label…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.