Explanation Shift: How Did the Distribution Shift Impact the Model?
Carlos Mougan, Klaus Broelemann, David Masip, Gjergji Kasneci,, Thanassis Thiropanis, Steffen Staab

TL;DR
This paper introduces a novel approach to understanding how explanation characteristics change under distribution shifts, providing a better indicator for out-of-distribution model behavior than existing methods.
Contribution
It proposes modeling explanation shifts as a new way to detect distribution shifts, along with an algorithmic method and open-source tools for analysis.
Findings
Explanation shifts are effective indicators of distribution shifts.
Modeling explanation shifts outperforms state-of-the-art techniques.
The approach is validated on synthetic and real-world datasets.
Abstract
As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In practice, new input data tend to come without target labels. Then, state-of-the-art techniques model input data distributions or model prediction distributions and try to understand issues regarding the interactions between learned models and shifting distributions. We suggest a novel approach that models how explanation characteristics shift when affected by distribution shifts. We find that the modeling of explanation shifts can be a better indicator for detecting out-of-distribution model behaviour than state-of-the-art techniques. We analyze different types of distribution shifts using synthetic examples and real-world data sets. We provide an algorithmic method that allows us to inspect the interaction between data set features and learned models and compare them to…
Peer Reviews
Decision·Submitted to ICLR 2024
- Paper is written well with all relevant background explained in main text. - The idea of detecting relevant distribution shifts through their effect on explanations is original and interesting. - Work provides intuitive examples to show which distribution shifts can be detected by explanation shifts. - It tackles a significant problem. Interpretability of distribution shifts, including their effect on model behavior, is an under-studied problem.
- Motivation for considering shifts in explanation is not convincing. This, I believe, is because the goal is not stated concretely to then motivate the solution. The stated goal of investigating interaction between distribution shifts and learned model needs to be further characterized in quantifiable forms. I agree that not every type of distribution shifts are important to detect. The ones that change model behaviour in some meaningful way are important. However, it is unclear to me why expl
1. The overall paper is well written. Understanding how distribution shifts affect ML model performance is important. 2. The hypothesis that distribution shifts in explanations generated for a model could give a hint about overall distribution shifts and their impact on model performance is a good one and should be explored
1. The authors provide some interesting case studies and examples of the utility of Shapley based explanations in identifying distribution shifts. One aspect of this discussion that could be done better is trying to emphasize what additional assumptions could be required so that explanations such as Shapley can indeed be used for detecting, say, concept shift. This is not an impossible task for other methods, see for example: Liu et al [1]. See also experiments in this paper. 2. The example on
- The paper is generally easy to read and well-written. - The paper provides a nice formulation of the problem. - The paper tackles an important real-world problem. - The authors provide theoretical analyses for a few simple synthetic cases.
1. I am not convinced that the results demonstrate the empirical superiority of the proposed method relative to the baselines. The authors only compare to the baselines in Figure 2. Here, there are several other methods that are competitive with explanation shift. In addition, the authors do not show confidence intervals in this figure. I also contest that "good indicators should follow a progressive steady positive slope", as if the goal is distribution shift detection, the only thing that shou
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Time Series Analysis and Forecasting
