Counterfactual Explanations in Sequential Decision Making Under Uncertainty
Stratis Tsirtsis, Abir De, Manuel Gomez-Rodriguez

TL;DR
This paper develops a method to generate counterfactual explanations for sequential decision processes modeled as Markov decision processes, providing insights to improve decision making under uncertainty.
Contribution
It introduces a formal framework and a polynomial-time dynamic programming algorithm for finding optimal counterfactual explanations in sequential decision making.
Findings
Algorithm guarantees optimal counterfactual explanations
Validated on synthetic and real data from therapy sessions
Provides valuable insights for decision making under uncertainty
Abstract
Methods to find counterfactual explanations have predominantly focused on one step decision making processes. In this work, we initiate the development of methods to find counterfactual explanations for decision making processes in which multiple, dependent actions are taken sequentially over time. We start by formally characterizing a sequence of actions and states using finite horizon Markov decision processes and the Gumbel-Max structural causal model. Building upon this characterization, we formally state the problem of finding counterfactual explanations for sequential decision making processes. In our problem formulation, the counterfactual explanation specifies an alternative sequence of actions differing in at most k actions from the observed sequence that could have led the observed process realization to a better outcome. Then, we introduce a polynomial time algorithm based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Advanced Causal Inference Techniques · Decision-Making and Behavioral Economics
