Partial Wasserstein and Maximum Mean Discrepancy distances for bridging the gap between outlier detection and drift detection
Thomas Viehmann

TL;DR
This paper introduces a novel approach that bridges outlier detection and drift detection by comparing input samples to an adaptively selected subset of the reference distribution, enhancing monitoring in machine learning applications.
Contribution
It proposes a new method using Partial Wasserstein and Maximum Mean Discrepancy distances to unify outlier and drift detection techniques.
Findings
Effective in detecting distributional shifts and outliers
Improves sensitivity by focusing on relevant reference distribution parts
Provides a unified framework for monitoring ML models
Abstract
With the rise of machine learning and deep learning based applications in practice, monitoring, i.e. verifying that these operate within specification, has become an important practical problem. An important aspect of this monitoring is to check whether the inputs (or intermediates) have strayed from the distribution they were validated for, which can void the performance assurances obtained during testing. There are two common approaches for this. The, perhaps, more classical one is outlier detection or novelty detection, where, for a single input we ask whether it is an outlier, i.e. exceedingly unlikely to have originated from a reference distribution. The second, perhaps more recent approach, is to consider a larger number of inputs and compare its distribution to a reference distribution (e.g. sampled during testing). This is done under the label drift detection. In this work,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Probabilistic and Robust Engineering Design
