Learning from Data Streams: An Overview and Update
Jesse Read, Indr\.e \v{Z}liobait\.e

TL;DR
This paper reviews and updates the fundamental concepts of supervised data-stream learning, emphasizing realistic assumptions, and advocates shifting research focus towards robustness, privacy, and interpretability in real-world applications.
Contribution
It reformulates data-stream learning definitions considering concept drift and temporal dependence, and provides recommendations for future research directions.
Findings
Many existing algorithms rely on unrealistic assumptions.
Learning from data streams is not limited to online or single-pass methods.
Established techniques for concept drift and temporal dependence can be adapted for data streams.
Abstract
The literature on machine learning in the context of data streams is vast and growing. However, many of the defining assumptions regarding data-stream learning tasks are too strong to hold in practice, or are even contradictory such that they cannot be met in the contexts of supervised learning. Algorithms are chosen and designed based on criteria which are often not clearly stated, for problem settings not clearly defined, tested in unrealistic settings, and/or in isolation from related approaches in the wider literature. This puts into question the potential for real-world impact of many approaches conceived in such contexts, and risks propagating a misguided research focus. We propose to tackle these issues by reformulating the fundamental definitions and settings of supervised data-stream learning with regard to contemporary considerations of concept drift and temporal dependence;…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Machine Learning and Data Classification · Advanced Bandit Algorithms Research
