The Integrity of Machine Learning Algorithms against Software Defect Prediction
Param Khakhar and, Rahul Kumar Dubey

TL;DR
This paper evaluates the effectiveness of the Online Sequential Extreme Learning Machine (OS-ELM) in predicting software defects, demonstrating its superior performance over traditional classifiers on imbalanced datasets.
Contribution
It introduces a comparison of OS-ELM with other classifiers using oversampled data, highlighting its faster training and better accuracy in defect prediction.
Findings
OS-ELM outperforms traditional classifiers in recall and balanced accuracy.
Oversampling with Cluster-based Over-Sampling with Noise Filtering improves results.
OS-ELM trains faster and converges to a global optimum.
Abstract
The increased computerization in recent years has resulted in the production of a variety of different software, however measures need to be taken to ensure that the produced software isn't defective. Many researchers have worked in this area and have developed different Machine Learning-based approaches that predict whether the software is defective or not. This issue can't be resolved simply by using different conventional classifiers because the dataset is highly imbalanced i.e the number of defective samples detected is extremely less as compared to the number of non-defective samples. Therefore, to address this issue, certain sophisticated methods are required. The different methods developed by the researchers can be broadly classified into Resampling based methods, Cost-sensitive learning-based methods, and Ensemble Learning. Among these methods. This report analyses the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Fault Detection and Control Systems · Spectroscopy Techniques in Biomedical and Chemical Research
MethodsLogistic Regression
