Monitoring Machine Learning Models: Online Detection of Relevant Deviations
Florian Heinrichs

TL;DR
This paper introduces a sequential monitoring method for machine learning models that effectively detects significant performance degradations while minimizing false alarms, ensuring model reliability in changing environments.
Contribution
It proposes a novel monitoring scheme that accounts for temporal dependence and thresholds, improving detection of relevant changes over existing benchmark methods.
Findings
Outperforms benchmark methods in detecting relevant model quality changes
Reduces unnecessary alerts and false positives
Validated with simulated and real data
Abstract
Machine learning models are essential tools in various domains, but their performance can degrade over time due to changes in data distribution or other factors. On one hand, detecting and addressing such degradations is crucial for maintaining the models' reliability. On the other hand, given enough data, any arbitrary small change of quality can be detected. As interventions, such as model re-training or replacement, can be expensive, we argue that they should only be carried out when changes exceed a given threshold. We propose a sequential monitoring scheme to detect these relevant changes. The proposed method reduces unnecessary alerts and overcomes the multiple testing problem by accounting for temporal dependence of the measured model quality. Conditions for consistency and specified asymptotic levels are provided. Empirical validation using simulated and real data demonstrates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Data Stream Mining Techniques · Machine Learning and Data Classification
