TRACE: Theoretical Risk Attribution under Covariate-shift Effects
Hosein Anjidani, S. Yahya S. R. Tehrani, Mohammad Mahdi Mojahedian, Mohammad Hossein Yassaee

TL;DR
TRACE provides a theoretical framework to diagnose and quantify the impact of covariate shift on model performance, enabling safer model replacement decisions through interpretable risk attribution.
Contribution
We introduce TRACE, a novel framework that decomposes risk change under covariate shift into interpretable components and provides practical diagnostics for model deployment.
Findings
TRACE accurately captures risk change in linear regression.
Diagnostics correlate strongly with true performance degradation.
Deployment gate score effectively guides model replacement decisions.
Abstract
When a source-trained model is replaced by a model trained on shifted data, its performance on the source domain can change unpredictably. To address this, we study the two-model risk change, , under covariate shift. We introduce TRACE (Theoretical Risk Attribution under Covariate-shift Effects), a framework that decomposes into an interpretable upper bound. This decomposition disentangles the risk change into four actionable factors: two generalization gaps, a model change penalty, and a covariate shift penalty, transforming the bound into a powerful diagnostic tool for understanding why performance has changed. To make TRACE a fully computable diagnostic, we instantiate each term. The covariate shift penalty is estimated via a model sensitivity factor (from high-quantile input gradients) and a data-shift measure; we use…
Peer Reviews
Decision·Submitted to ICLR 2026
- TRACE provides the practical lens that ones can diagnose which factors the model performance degradations are attributed to. Such attributions can be fully computable for each factor, which leads to diagnosis in domain adaptations such as failures by model instability or harmful parameter updates. - Such diagnosis is theoretically sound and its performances seem effective when tested in real-world scenarios such as adaptations to DomainNet dataset.
- The overall scope of this paper is limited to the covariate shifts, and it cannot be easily extended to other or more complicated distribution shifts. I'm still concerned how practical it is, even considering that the method is theoretically sound.
- The idea of providing practical explanations and bounds for risk change under model replacing is interesting and novel. - The proposed framework is effective and practical, both in theory and in experiments.
- The introduction is not well motivated. The authors should provide more explanation about why it is important to explain and bound risk change under covariate shift. - The setting is limited. The authors only consider covariate shift, which enforces a strong assumption on the two distributions.
- The paper is well written and addresses important question for transfer learning - The paper describes the entire pipeline to compute the estimate
- It is hard to compare this method with previous methods: the related work could be more explicit in this regard. - The discussion over the tightness of the estimated quantities is limited, in more difficult scenarios one could be afraid that bounds will be to loose to be helpful. - The experiment are limited to 2 dimensional data for most of it and is not detailed enough to provide a good evaluation of the method
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
