On the Change of Decision Boundaries and Loss in Learning with Concept   Drift

Fabian Hinder; Valerie Vaquet; Johannes Brinkrolf; Barbara Hammer

arXiv:2212.01223·cs.LG·December 5, 2022·1 cites

On the Change of Decision Boundaries and Loss in Learning with Concept Drift

Fabian Hinder, Valerie Vaquet, Johannes Brinkrolf, Barbara Hammer

PDF

Open Access

TL;DR

This paper examines the theoretical justification for using interleaved test-train error to detect concept drift, relating it to actual distribution changes and model updates, supported by empirical evidence across various algorithms and datasets.

Contribution

It provides a mathematical analysis linking ITTE changes to true concept drift and model changes, enhancing understanding of drift detection methods.

Findings

01

ITTE change correlates with real distribution drift

02

Theoretical justification for ITTE-based drift detection

03

Empirical validation across multiple algorithms and datasets

Abstract

The notion of concept drift refers to the phenomenon that the distribution generating the observed data changes over time. If drift is present, machine learning models may become inaccurate and need adjustment. Many technologies for learning with drift rely on the interleaved test-train error (ITTE) as a quantity which approximates the model generalization error and triggers drift detection and model updates. In this work, we investigate in how far this procedure is mathematically justified. More precisely, we relate a change of the ITTE to the presence of real drift, i.e., a changed posterior, and to a change of the training result under the assumption of optimality. We support our theoretical findings by empirical evidence for several learning algorithms, models, and datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Advanced Bandit Algorithms Research · Air Quality Monitoring and Forecasting