A Meta-analytical Comparison of Naive Bayes and Random Forest for   Software Defect Prediction

Ch Muhammad Awais; Wei Gu; Gcinizwe Dlamini; Zamira Kholmatova,; Giancarlo Succi

arXiv:2306.15369·cs.SE·February 6, 2025

A Meta-analytical Comparison of Naive Bayes and Random Forest for Software Defect Prediction

Ch Muhammad Awais, Wei Gu, Gcinizwe Dlamini, Zamira Kholmatova,, Giancarlo Succi

PDF

Open Access 1 Repo

TL;DR

This study systematically compares Naive Bayes and Random Forest models for software defect prediction, finding no significant performance difference in key metrics across analyzed studies.

Contribution

It provides a meta-analytical comparison of the two models, clarifying their relative effectiveness in defect prediction tasks.

Findings

01

No significant difference in recall, f-measure, and precision between models.

02

Meta-analysis based on five studies.

03

Systematic literature review methodology used.

Abstract

Is there a statistical difference between Naive Bayes and Random Forest in terms of recall, f-measure, and precision for predicting software defects? By utilizing systematic literature review and meta-analysis, we are answering this question. We conducted a systematic literature review by establishing criteria to search and choose papers, resulting in five studies. After that, using the meta-data and forest-plots of five chosen papers, we conducted a meta-analysis to compare the two models. The results have shown that there is no significant statistical evidence that Naive Bayes perform differently from Random Forest in terms of recall, f-measure, and precision.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cm-awais/sdp_nb_rf_meta_analysis
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software System Performance and Reliability · Software Reliability and Analysis Research