An Algorithm-Centered Approach To Model Streaming Data
Fabian Hinder, Valerie Vaquet, David Komnick, Barbara Hammer

TL;DR
This paper introduces a new algorithm-centered model for streaming data that addresses concept drift, providing a theoretical framework and demonstrating its applicability through numerical evaluation and a critical infrastructure case study.
Contribution
It proposes a novel window-based data model from the algorithm's perspective, bridging the gap between theory and practical stream learning algorithms.
Findings
The new model aligns with existing approaches in many cases.
The framework offers a theoretical basis for streaming data analysis.
Application demonstrated in critical infrastructure domain.
Abstract
Besides the classical offline setup of machine learning, stream learning constitutes a well-established setup where data arrives over time in potentially non-stationary environments. Concept drift, the phenomenon that the underlying distribution changes over time poses a significant challenge. Yet, despite high practical relevance, there is little to no foundational theory for learning in the drifting setup comparable to classical statistical learning theory in the offline setting. This can be attributed to the lack of an underlying object comparable to a probability distribution as in the classical setup. While there exist approaches to transfer ideas to the streaming setup, these start from a data perspective rather than an algorithmic one. In this work, we suggest a new model of data over time that is aimed at the algorithm's perspective. Instead of defining the setup using time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Database Systems and Queries · Data Management and Algorithms
