Characterizing Concept Drift
Geoffrey I. Webb, Roy Hyde, Hong Cao, Hai Long Nguyen and, Francois Petitjean

TL;DR
This paper develops a comprehensive framework and formal definitions for characterizing and categorizing different types of concept drift in machine learning, addressing a key gap in understanding non-stationary data.
Contribution
It introduces the first formal, quantitative framework for analyzing concept drift, establishing a taxonomy and clarifying ambiguities in previous qualitative categorizations.
Findings
First formal definitions of concept drift types
A comprehensive taxonomy of concept drift
Clarification of ambiguities in previous definitions
Abstract
Most machine learning models are static, but the world is dynamic, and increasing online deployment of learned models gives increasing urgency to the development of efficient and effective mechanisms to address learning in the context of non-stationary distributions, or as it is commonly called concept drift. However, the key issue of characterizing the different types of drift that can occur has not previously been subjected to rigorous definition and analysis. In particular, while some qualitative drift categorizations have been proposed, few have been formally defined, and the quantitative descriptions required for precise and objective understanding of learner performance have not existed. We present the first comprehensive framework for quantitative analysis of drift. This supports the development of the first comprehensive set of formal definitions of types of concept drift. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
