Semi-supervised Drifted Stream Learning with Short Lookback
Weijieying Ren, Pengyang Wang, Xiaolin Li, Charles E. Hughes, Yanjie, Fu

TL;DR
This paper introduces a semi-supervised learning framework for real-time data streams with data drift, limited labels, and short lookback windows, addressing challenges in pseudo-labeling and anti-forgetting adaptation.
Contribution
It proposes a generic generation-replay framework with novel pseudo-labeling and minimax game-based replay methods for drifted stream learning with short lookback.
Findings
Effective anti-forgetting in drifted streams demonstrated
Robust pseudo-labeling improves model accuracy
Framework outperforms existing methods in experiments
Abstract
In many scenarios, 1) data streams are generated in real time; 2) labeled data are expensive and only limited labels are available in the beginning; 3) real-world data is not always i.i.d. and data drift over time gradually; 4) the storage of historical streams is limited and model updating can only be achieved based on a very short lookback window. This learning setting limits the applicability and availability of many Machine Learning (ML) algorithms. We generalize the learning task under such setting as a semi-supervised drifted stream learning with short lookback problem (SDSL). SDSL imposes two under-addressed challenges on existing methods in semi-supervised learning, continuous learning, and domain adaptation: 1) robust pseudo-labeling under gradual shifts and 2) anti-forgetting adaptation with short lookback. To tackle these challenges, we propose a principled and generic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Machine Learning and Data Classification · Water Systems and Optimization
