Context-Aware Drift Detection

Oliver Cobb; Arnaud Van Looveren

arXiv:2203.08644·stat.ML·August 3, 2022·5 cites

Context-Aware Drift Detection

Oliver Cobb, Arnaud Van Looveren

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel context-aware drift detection framework that leverages causal inference and conditional two-sample tests to identify distributional changes in machine learning systems, even when data dependencies exist.

Contribution

It develops a general framework for drift detection based on conditional distribution tests, extending traditional methods to account for context and dependencies.

Findings

01

Effective in detecting drift in subpopulations of data.

02

Applicable to large-scale vision datasets like ImageNet.

03

Insensitive to subpopulation prevalence variations.

Abstract

When monitoring machine learning systems, two-sample tests of homogeneity form the foundation upon which existing approaches to drift detection build. They are used to test for evidence that the distribution underlying recent deployment data differs from that underlying the historical reference data. Often, however, various factors such as time-induced correlation mean that batches of recent deployment data are not expected to form an i.i.d. sample from the historical data distribution. Instead we may wish to test for differences in the distributions conditional on \textit{context} that is permitted to change. To facilitate this we borrow machinery from the causal inference domain to develop a more general drift detection framework built upon a foundation of two-sample tests for conditional distributional treatment effects. We recommend a particular instantiation of the framework based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SeldonIO/alibi-detect
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Distributed Sensor Networks and Detection Algorithms · Data Stream Mining Techniques