Using Source Code Metrics and Ensemble Methods for Fault Proneness Prediction
Lov Kumar, Santanu Rath, Ashish Sureka

TL;DR
This study develops and validates fault proneness prediction models using ensemble learning and optimized source code metrics, demonstrating improved accuracy and cost-effectiveness on open source datasets.
Contribution
It introduces a framework for selecting effective source code metrics and applies ensemble methods, especially Majority Voting, to enhance fault prediction performance.
Findings
Majority Voting Ensemble outperforms other ensemble methods
Selected source code metrics improve prediction accuracy
Fault prediction is more effective in projects with lower fault percentages
Abstract
Software fault prediction model are employed to optimize testing resource allocation by identifying fault-prone classes before testing phases. Several researchers' have validated the use of different classification techniques to develop predictive models for fault prediction. The performance of the statistical models are proven to be influenced by the training and testing dataset. Ensemble method learning algorithms have been widely used because it combines the capabilities of its constituent models towards a dataset to come up with a potentially higher performance as compared to individual models (improves generalizability). In the study presented in this paper, three different ensemble methods have been applied to develop a model for predicting fault proneness. The efficacy and usefulness of a fault prediction model also depends on the source code metrics which are considered as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software Testing and Debugging Techniques
