A Survey on Semi-Supervised Learning for Delayed Partially Labelled Data Streams
Heitor Murilo Gomes, Maciej Grzenda, Rodrigo Mello, Jesse Read, Minh, Huong Le Nguyen, Albert Bifet

TL;DR
This survey reviews semi-supervised learning methods for streaming data with delayed labels, highlighting challenges, existing approaches, and proposing a unified framework and improved benchmarking practices.
Contribution
It introduces a unified problem setting for semi-supervised learning with delayed labels and discusses learning guarantees, methods, and benchmarking practices.
Findings
Semi-supervised methods leverage unlabelled data effectively.
Delayed labeling impacts both supervised and semi-supervised learning.
Proposed adaptations improve benchmarking for streaming data scenarios.
Abstract
Unlabelled data appear in many domains and are particularly relevant to streaming applications, where even though data is abundant, labelled data is rare. To address the learning problems associated with such data, one can ignore the unlabelled data and focus only on the labelled data (supervised learning); use the labelled data and attempt to leverage the unlabelled data (semi-supervised learning); or assume some labels will be available on request (active learning). The first approach is the simplest, yet the amount of labelled data available will limit the predictive performance. The second relies on finding and exploiting the underlying characteristics of the data distribution. The third depends on an external agent to provide the required labels in a timely fashion. This survey pays special attention to methods that leverage unlabelled data in a semi-supervised setting. We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Machine Learning and Algorithms · Mobile Crowdsensing and Crowdsourcing
