On the costs and profit of software defect prediction
Steffen Herbold

TL;DR
This paper develops a cost model for software defect prediction to evaluate its actual cost-saving potential, revealing that common assumptions may lead to inaccurate cost estimations and that success thresholds need reevaluation.
Contribution
It introduces a mathematically grounded cost model for defect prediction, including boundary conditions for profitability and critiques standard assumptions in the field.
Findings
Unrealistic assumptions about defect impact lead to inaccurate cost estimates.
Standard machine learning thresholds are not suitable as success criteria.
Cost model trends vary significantly with different project assumptions.
Abstract
Defect prediction can be a powerful tool to guide the use of quality assurance resources. However, while lots of research covered methods for defect prediction as well as methodological aspects of defect prediction research, the actual cost saving potential of defect prediction is still unclear. Within this article, we close this research gap and formulate a cost model for software defect prediction. We derive mathematically provable boundary conditions that must be fulfilled by defect prediction models such that there is a positive profit when the defect prediction model is used. Our cost model includes aspects like the costs for quality assurance, the costs of post-release defects, the possibility that quality assurance fails to reveal predicted defects, and the relationship between software artifacts and defects. We initialize the cost model using different assumptions, perform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
