On the Relationship Between Explanation and Prediction: A Causal View
Amir-Hossein Karimi, Krikamol Muandet, Simon Kornblith, Bernhard, Sch\"olkopf, Been Kim

TL;DR
This paper uses causal inference to systematically analyze how various upstream factors influence explanations of machine learning models, revealing that explanations often poorly reflect the actual predictions, especially in high-performing models.
Contribution
It introduces a causal framework to quantify the relationship between explanations and predictions, highlighting limitations of current explanation methods.
Findings
Explanations often have a weak relationship with model predictions.
Higher-performing models show larger gaps between explanations and predictions.
The study provides a foundation for developing quantitative metrics for explanations.
Abstract
Being able to provide explanations for a model's decision has become a central requirement for the development, deployment, and adoption of machine learning models. However, we are yet to understand what explanation methods can and cannot do. How do upstream factors such as data, model prediction, hyperparameters, and random initialization influence downstream explanations? While previous work raised concerns that explanations (E) may have little relationship with the prediction (Y), there is a lack of conclusive study to quantify this relationship. Our work borrows tools from causal inference to systematically assay this relationship. More specifically, we study the relationship between E and Y by measuring the treatment effect when intervening on their causal ancestors, i.e., on hyperparameters and inputs used to generate saliency-based Es or Ys. Our results suggest that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Bayesian Modeling and Causal Inference
