Feature-Oriented Defect Prediction: Scenarios, Metrics, and Classifiers
Mukelabai Mukelabai, Stefan Str\"uder, Daniel Str\"uber, Thorsten, Berger

TL;DR
This paper explores feature-oriented defect prediction at the granularity of software features, demonstrating its feasibility and effectiveness across multiple scenarios and datasets, with high predictive performance especially using Random Forest classifiers.
Contribution
It introduces a comprehensive study on feature-oriented defect prediction, analyzing various classifiers, metrics, and scenarios, and shows promising results for cross-project and just-in-time predictions.
Findings
Random Forest with process and structure metrics performs best.
High AUC-ROC values (>95%) for single-project JIT predictions.
Cross-project models achieve median AUC-ROC of 82% without retraining.
Abstract
Several software defect prediction techniques have been developed over the past decades. These techniques predict defects at the granularity of typical software assets, such as components and files. In this paper, we investigate feature-oriented defect prediction: predicting defects at the granularity of features -- domain-entities that represent software functionality and often cross-cut software assets. Feature-oriented defect prediction can be beneficial since: (i) some features might be more error-prone than others, (ii) characteristics of defective features might be useful to predict other error-prone features, and (iii) feature-specific code might be prone to faults arising from feature interactions. We explore the feasibility and solution space for feature-oriented defect prediction. Our study relies on 12 software projects from which we analyzed 13,685 bug-introducing and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software System Performance and Reliability
