A Feature-Driven Framework for Software Fault Prediction

Ahmad Nauman Ghazi; Nagajyothi Devarapalli; Ashir Javeed; Sadi Alawadi; Fahed Alkhabbas; Khalid AlKharabsheh

arXiv:2605.17611·cs.SE·May 19, 2026

A Feature-Driven Framework for Software Fault Prediction

Ahmad Nauman Ghazi, Nagajyothi Devarapalli, Ashir Javeed, Sadi Alawadi, Fahed Alkhabbas, Khalid AlKharabsheh

PDF

TL;DR

This paper presents a feature-driven framework combining feature selection and parameter tuning to enhance machine learning-based software fault prediction, achieving high accuracy and robustness.

Contribution

It introduces a combined approach of feature selection methods and hyperparameter tuning techniques to improve fault prediction performance.

Findings

01

CFS and GA combination achieved 88.40% accuracy with RF.

02

Feature selection reduced dimensionality and identified key attributes.

03

Proposed methods showed robustness with minimal variability and improved efficiency.

Abstract

Software fault prediction (SFP) is a critical task in software engineering, enabling early identification of faults in modules to improve software quality and reduce maintenance costs. This research investigates the combined effects of feature selection and parameter tuning on the performance of machine learning (ML) models for SFP. This study evaluates the interaction between feature selection methods, including correlation-based feature selection (CFS), recursive feature elimination (RFE), mutual information (MI), and L1 regularization, where hyperparameter tuning techniques such as grid search, randomized search, and genetic algorithm (GA) are used for optimization of ML algorithms, including random forest (RF), logistic regression (LR), and support vector machines (SVM) for optimized fault prediction performance. The combined application of CFS and GA yielded the highest accuracy,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.