Data augmentation on-the-fly and active learning in data stream classification
Kleanthis Malialis, Dimitris Papatheodoulou, Stylianos Filippou, Christos G. Panayiotou, Marios M. Polycarpou

TL;DR
This paper introduces Augmented Queues, a novel online learning method that combines active learning, data augmentation, and multi-queue memory to improve data stream classification, especially under class imbalance and limited labels.
Contribution
The work presents a new online learning approach that efficiently balances classes and enhances data availability without extra memory, improving learning speed and quality.
Findings
Augmented Queues outperform existing methods in classification accuracy.
The method effectively handles class imbalance and limited labels.
It does not require additional memory for synthetic data generation.
Abstract
There is an emerging need for predictive models to be trained on-the-fly, since in numerous machine learning applications data are arriving in an online fashion. A critical challenge encountered is that of limited availability of ground truth information (e.g., labels in classification tasks) as new data are observed one-by-one online, while another significant challenge is that of class imbalance. This work introduces the novel Augmented Queues method, which addresses the dual-problem by combining in a synergistic manner online active learning, data augmentation, and a multi-queue memory to maintain separate and balanced queues for each class. We perform an extensive experimental study using image and time-series augmentations, in which we examine the roles of the active learning budget, memory size, imbalance level, and neural network type. We demonstrate two major advantages of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Bandit Algorithms Research · Machine Learning and Algorithms
