Modeling and Controlling Deployment Reliability under Temporal Distribution Shift
Naimur Rahman, Naazreen Tabassum

TL;DR
This paper introduces a deployment-centric framework for modeling and controlling the reliability of machine learning models under temporal distribution shift, emphasizing stability and cost trade-offs.
Contribution
It proposes a novel multi-objective control approach to manage reliability dynamics, with empirical analysis on a large-scale credit dataset demonstrating its effectiveness.
Findings
Selective interventions improve reliability stability over continuous retraining.
The approach reduces operational costs significantly.
Reliability can be modeled as a controllable dynamic state.
Abstract
Machine learning models deployed in non-stationary environments are exposed to temporal distribution shift, which can erode predictive reliability over time. While common mitigation strategies such as periodic retraining and recalibration aim to preserve performance, they typically focus on average metrics evaluated at isolated time points and do not explicitly model how reliability evolves during deployment. We propose a deployment-centric framework that treats reliability as a dynamic state composed of discrimination and calibration. The trajectory of this state across sequential evaluation windows induces a measurable notion of volatility, allowing deployment adaptation to be formulated as a multi-objective control problem that balances reliability stability against cumulative intervention cost. Within this framework, we define a family of state-dependent intervention policies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
