Early Concept Drift Detection via Prediction Uncertainty
Pengqian Lu, Jie Lu, Anjin Liu, Guangquan Zhang

TL;DR
This paper proposes the Prediction Uncertainty Index (PU-index) as a more sensitive and robust method for early concept drift detection in streaming data, outperforming traditional error rate-based methods.
Contribution
Introduction of the PU-index derived from classifier uncertainty, with theoretical properties proving its effectiveness in early and robust drift detection.
Findings
PU-index detects drift even when error rates are stable.
PU-index responds to any change in error rate.
Empirical results show PUDD's effectiveness on synthetic and real datasets.
Abstract
Concept drift, characterized by unpredictable changes in data distribution over time, poses significant challenges to machine learning models in streaming data scenarios. Although error rate-based concept drift detectors are widely used, they often fail to identify drift in the early stages when the data distribution changes but error rates remain constant. This paper introduces the Prediction Uncertainty Index (PU-index), derived from the prediction uncertainty of the classifier, as a superior alternative to the error rate for drift detection. Our theoretical analysis demonstrates that: (1) The PU-index can detect drift even when error rates remain stable. (2) Any change in the error rate will lead to a corresponding change in the PU-index. These properties make the PU-index a more sensitive and robust indicator for drift detection compared to existing methods. We also propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsData Stream Mining Techniques · Advanced Bandit Algorithms Research · Metaheuristic Optimization Algorithms Research
