Context-Aware Drift Detection
Oliver Cobb, Arnaud Van Looveren

TL;DR
This paper introduces a novel context-aware drift detection framework that leverages causal inference and conditional two-sample tests to identify distributional changes in machine learning systems, even when data dependencies exist.
Contribution
It develops a general framework for drift detection based on conditional distribution tests, extending traditional methods to account for context and dependencies.
Findings
Effective in detecting drift in subpopulations of data.
Applicable to large-scale vision datasets like ImageNet.
Insensitive to subpopulation prevalence variations.
Abstract
When monitoring machine learning systems, two-sample tests of homogeneity form the foundation upon which existing approaches to drift detection build. They are used to test for evidence that the distribution underlying recent deployment data differs from that underlying the historical reference data. Often, however, various factors such as time-induced correlation mean that batches of recent deployment data are not expected to form an i.i.d. sample from the historical data distribution. Instead we may wish to test for differences in the distributions conditional on \textit{context} that is permitted to change. To facilitate this we borrow machinery from the causal inference domain to develop a more general drift detection framework built upon a foundation of two-sample tests for conditional distributional treatment effects. We recommend a particular instantiation of the framework based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Distributed Sensor Networks and Detection Algorithms · Data Stream Mining Techniques
