Counterfactual Strategies for Markov Decision Processes

Paul Kobialka; Lina Gerlach; Francesco Leofante; Erika Ábrahám; Silvia Lizeth Tapia Tarifa; Einar Broch Johnsen

arXiv:2505.09412 · cs.AI · May 15, 2025

TL;DR

This paper introduces counterfactual strategies for Markov Decision Processes: minimal modifications of a decision strategy that bring the probability of an undesired outcome below a given limit in sequential decision-making tasks.

Contribution

The paper extends counterfactual reasoning to MDPs by encoding minimal strategy changes as solutions to non-linear optimization problems.
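To make the idea of such an encoding concrete, the following is one plausible shape for the optimization problem, written as a sketch rather than as the paper's actual formulation. All symbols are assumptions introduced here: $\sigma$ is the initial strategy, $\sigma'$ the counterfactual strategy, $d$ an unspecified distance between strategies, $T$ the set of undesired states, $s_0$ the initial state, $\lambda$ the probability limit, $P$ the transition function, and $p_s$ the probability of reaching $T$ from state $s$.

```latex
\begin{align*}
\min_{\sigma',\,p} \quad & d(\sigma, \sigma') \\
\text{s.t.} \quad
  & p_s = 1 && \text{for } s \in T, \\
  & p_s = \sum_{a \in A(s)} \sigma'(s,a) \sum_{s'} P(s,a,s')\, p_{s'} && \text{for } s \notin T, \\
  & \sum_{a \in A(s)} \sigma'(s,a) = 1, \quad \sigma'(s,a) \ge 0 && \text{for all } s, \\
  & 0 \le p_s \le 1 && \text{for all } s, \\
  & p_{s_0} \le \lambda .
\end{align*}
```

In this sketch the non-linearity comes from the product $\sigma'(s,a)\, p_{s'}$ of two unknowns: both the strategy and the reachability probabilities it induces are decision variables.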

Findings

01 Successfully reduces undesired outcome probabilities in real-world datasets

02 Demonstrates practical viability in complex sequential decision tasks

03 Provides a method for synthesizing diverse counterfactual strategies

Abstract

Counterfactuals are widely used in AI to explain how minimal changes to a model's input can lead to a different output. However, established methods for computing counterfactuals typically focus on one-step decision-making, and are not directly applicable to sequential decision-making tasks. This paper fills this gap by introducing counterfactual strategies for Markov Decision Processes (MDPs). During MDP execution, a strategy decides which of the enabled actions (with known probabilistic effects) to execute next. Given an initial strategy that reaches an undesired outcome with a probability above some limit, we identify minimal changes to the initial strategy to reduce that probability below the limit. We encode such counterfactual strategies as solutions to non-linear optimization problems, and further extend our encoding to synthesize diverse counterfactual strategies. We evaluate…
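The setting described in the abstract can be illustrated on a toy example. The sketch below is not the paper's non-linear encoding: it swaps in a brute-force search over memoryless deterministic strategies, which is only feasible for tiny models. The MDP, the state and action names, the distance measure (number of states whose chosen action changes), and the use of "at or below" for the limit are all assumptions made for illustration.

```python
from itertools import product

# Hypothetical toy MDP: state -> enabled action -> {successor: probability}.
# "bad" (undesired) and "good" are absorbing and therefore have no entry.
MDP = {
    "s0": {"risky": {"bad": 0.5, "s1": 0.5}, "safe": {"s1": 1.0}},
    "s1": {"risky": {"bad": 0.4, "good": 0.6}, "safe": {"bad": 0.1, "good": 0.9}},
}
INIT, BAD = "s0", "bad"


def reach_probability(strategy, iterations=100):
    """Probability of reaching BAD from INIT under a memoryless
    deterministic strategy, approximated by value iteration from zero."""
    p = {s: 0.0 for s in MDP}
    p[BAD] = 1.0
    for _ in range(iterations):
        for s in MDP:
            successors = MDP[s][strategy[s]]
            # States without an entry in p (here: "good") contribute 0.
            p[s] = sum(pr * p.get(t, 0.0) for t, pr in successors.items())
    return p[INIT]


def distance(a, b):
    """Assumed distance: number of states where the chosen action differs."""
    return sum(a[s] != b[s] for s in MDP)


def counterfactual(initial, limit):
    """Brute-force stand-in for the optimization: return (distance, strategy)
    for a closest strategy whose probability of reaching BAD is at or below
    the limit, or None if no deterministic strategy meets the limit."""
    states = list(MDP)
    best = None
    for choice in product(*(MDP[s] for s in states)):
        candidate = dict(zip(states, choice))
        if reach_probability(candidate) > limit:
            continue
        d = distance(initial, candidate)
        if best is None or d < best[0]:
            best = (d, candidate)
    return best


initial = {"s0": "risky", "s1": "risky"}
result = counterfactual(initial, limit=0.45)
print(reach_probability(initial), result)
```

In this toy model the initial strategy reaches the undesired state with probability about 0.7. Switching only the action in `s0` to `safe` lowers it to about 0.4, which meets the limit of 0.45 with a single change, whereas switching only the action in `s1` leaves it at about 0.55.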

Taxonomy

Methods: Counterfactual Explanations