A Systematic Study of Cross-Project Defect Prediction With Meta-Learning
Faimison Porto, Leandro Minku, Emilia Mendes, Adenilso Simao

TL;DR
This paper conducts an extensive comparison of 31 cross-project defect prediction methods across multiple datasets, and proposes a meta-learning approach to automatically select the most suitable method for a given project.
Contribution
It provides a comprehensive evaluation of state-of-the-art CPDP methods and introduces a meta-learning framework for dynamic method selection, highlighting its potential and limitations.
Findings
Four methods consistently performed well across datasets.
Meta-learning can recommend suitable CPDP methods based on project features.
The meta-learning approach showed only minor performance improvements over base methods.
Abstract
The prediction of defects in a target project based on data from external projects is called Cross-Project Defect Prediction (CPDP). Several methods have been proposed to improve the predictive performance of CPDP models. However, there is a lack of comparison among state-of-the-art methods. Moreover, previous work has shown that the most suitable method for a project can vary according to the project being predicted. This makes the choice of which method to use difficult. We provide an extensive experimental comparison of 31 CPDP methods derived from state-of-the-art approaches, applied to 47 versions of 15 open source software projects. Four methods stood out as presenting the best performances across datasets. However, the most suitable among these methods still varies according to the project being predicted. Therefore, we propose and evaluate a meta-learning solution designed to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software Engineering Techniques and Practices
