Imbalanced Data Stream Classification using Dynamic Ensemble Selection
Priya.S, Haribharathi Sivakumar, Vijay Arvind.R

TL;DR
This paper introduces a new framework combining data pre-processing and dynamic ensemble selection to improve classification accuracy in nonstationary, imbalanced data streams with concept drift.
Contribution
It proposes a novel integrated framework specifically designed for nonstationary, imbalanced data streams, combining pre-processing and ensemble selection techniques.
Findings
Pre-processing plus dynamic ensemble selection improves accuracy.
Framework performs well across different imbalance ratios.
Effective against multiple types of concept drift.
Abstract
Modern streaming data categorization faces significant challenges from concept drift and class imbalanced data. This negatively impacts the output of the classifier, leading to improper classification. Furthermore, other factors such as the overlapping of multiple classes limit the extent of the correctness of the output. This work proposes a novel framework for integrating data pre-processing and dynamic ensemble selection, by formulating the classification framework for the nonstationary drifting imbalanced data stream, which employs the data pre-processing and dynamic ensemble selection techniques. The proposed framework was evaluated using six artificially generated data streams with differing imbalance ratios in combination with two different types of concept drifts. Each stream is composed of 200 chunks of 500 objects described by eight features and contains five concept drifts.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Imbalanced Data Classification Techniques · Spam and Phishing Detection
