Moving from Cross-Project Defect Prediction to Heterogeneous Defect Prediction: A Partial Replication Study
Hadi Jahanshahi, Mucahit Cevik, Ay\c{s}e Ba\c{s}ar

TL;DR
This study systematically evaluates the feasibility of heterogeneous defect prediction (HDP) models that transfer knowledge across projects with different metrics, comparing their performance to traditional cross-project defect prediction and providing practical guidelines.
Contribution
It extends prior research by replicating and validating HDP results, proposing a new ensemble voting approach, and analyzing the conditions affecting HDP's feasibility.
Findings
HDP performance is comparable to traditional methods in some cases.
HDP sensitivity to parameter selection limits its applicability.
Guidelines are provided for effective transfer learning in defect prediction.
Abstract
Software defect prediction heavily relies on the metrics collected from software projects. Earlier studies often used machine learning techniques to build, validate, and improve bug prediction models using either a set of metrics collected within a project or across different projects. However, techniques applied and conclusions derived by those models are restricted by how identical those metrics are. Knowledge coming from those models will not be extensible to a target project if no sufficient overlapping metrics have been collected in the source projects. To explore the feasibility of transferring knowledge across projects without common labeled metrics, we systematically integrated Heterogeneous Defect Prediction (HDP) by replicating and validating the obtained results. Our main goal is to extend prior research and explore the feasibility of HDP and finally to compare its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software Engineering Techniques and Practices
