Employing Partial Least Squares Regression with Discriminant Analysis for Bug Prediction
Rudolf Ferenc (1), Istv\'an Siket (1), P\'eter Heged\H{u}s (1, 2),, R\'obert Rajk\'o (3) ((1) Department of Software Engineering, University of, Szeged, Szeged, Hungary, (2) MTA-SZTE Research Group on Artificial, Intelligence, University of Szeged, Szeged, Hungary

TL;DR
This study introduces a novel application of Partial Least Squares Discriminant Analysis (PLS-DA) for predicting bug-prone Java classes using static code metrics, demonstrating superior performance and faster training compared to existing methods.
Contribution
The paper pioneers the use of PLS-DA in software defect prediction, showing its effectiveness and efficiency over traditional approaches with rigorous statistical validation.
Findings
PLS-DA outperforms state-of-the-art models in bug prediction accuracy.
The model achieves a F-measure of 0.44-0.47 at 90% confidence.
Training with PLS-DA is significantly faster than other algorithms.
Abstract
Forecasting defect proneness of source code has long been a major research concern. Having an estimation of those parts of a software system that most likely contain bugs may help focus testing efforts, reduce costs, and improve product quality. Many prediction models and approaches have been introduced during the past decades that try to forecast bugged code elements based on static source code metrics, change and history metrics, or both. However, there is still no universal best solution to this problem, as most suitable features and models vary from dataset to dataset and depend on the context in which we use them. Therefore, novel approaches and further studies on this topic are highly necessary. In this paper, we employ a chemometric approach - Partial Least Squares with Discriminant Analysis (PLS-DA) - for predicting bug prone Classes in Java programs using static source code…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software Testing and Debugging Techniques
