A Neighbor-Searching Discrepancy-based Drift Detection Scheme for Learning Evolving Data
Feng Gu, Jie Lu, Zhen Fang, Kun Wang, Guangquan Zhang

TL;DR
This paper introduces a novel drift detection method based on Neighbor-Searching Discrepancy that accurately identifies real concept drift in data streams, distinguishes it from virtual drift, and provides trend information for better model maintenance.
Contribution
The paper proposes a new discrepancy-based statistic for real concept drift detection that outperforms existing methods and offers insights into the direction of drift.
Findings
High accuracy in detecting real concept drift
Effective in ignoring virtual drift
Outperforms state-of-the-art methods in experiments
Abstract
Uncertain changes in data streams present challenges for machine learning models to dynamically adapt and uphold performance in real-time. Particularly, classification boundary change, also known as real concept drift, is the major cause of classification performance deterioration. However, accurately detecting real concept drift remains challenging because the theoretical foundations of existing drift detection methods - two-sample distribution tests and monitoring classification error rate, both suffer from inherent limitations such as the inability to distinguish virtual drift (changes not affecting the classification boundary, will introduce unnecessary model maintenance), limited statistical power, or high computational cost. Furthermore, no existing detection method can provide information on the trend of the drift, which could be invaluable for model maintenance. This work…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Data Storage Technologies · Machine Learning and Data Classification
MethodsSeventeen Ways to Call Uphold Helpline Full Guide USA 24 Hour Assistance
