A review of ensemble learning and data augmentation models for class imbalanced problems: combination, implementation and evaluation
Azal Ahmad Khan, Omkar Chaudhari, Rohitash Chandra

TL;DR
This paper evaluates various data augmentation and ensemble learning methods for class imbalanced problems, highlighting effective combinations like SMOTE and ROS that improve classification performance efficiently.
Contribution
It provides a comprehensive framework to compare 9 data augmentation and 9 ensemble methods, identifying optimal combinations for imbalanced datasets.
Findings
Traditional methods like SMOTE and ROS outperform GAN-based augmentation in effectiveness and computational cost.
Combining data augmentation with ensemble learning significantly enhances classification accuracy on imbalanced datasets.
The study offers guidance for selecting effective methods in practical applications of imbalanced classification.
Abstract
Class imbalance (CI) in classification problems arises when the number of observations belonging to one class is lower than the other. Ensemble learning combines multiple models to obtain a robust model and has been prominently used with data augmentation methods to address class imbalance problems. In the last decade, a number of strategies have been added to enhance ensemble learning and data augmentation methods, along with new methods such as generative adversarial networks (GANs). A combination of these has been applied in many studies, and the evaluation of different combinations would enable a better understanding and guidance for different application domains. In this paper, we present a computational study to evaluate data augmentation and ensemble learning methods used to address prominent benchmark CI problems. We present a general framework that evaluates 9 data augmentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Electricity Theft Detection Techniques · Financial Distress and Bankruptcy Prediction
