Robust Counterfactual Inference in Markov Decision Processes
Jessica Lally, Milad Kazemi, Nicola Paoletti

TL;DR
This paper introduces a scalable, non-parametric method to compute bounds on counterfactual transition probabilities in MDPs, enabling robust policy optimization under causal model uncertainty.
Contribution
It presents a novel approach that provides closed-form bounds on counterfactuals across all compatible causal models, improving robustness and computational efficiency.
Findings
Provides tight bounds on counterfactual transition probabilities.
Enables robust policy optimization under causal uncertainty.
Demonstrates improved robustness in case studies.
Abstract
This paper addresses a key limitation in existing counterfactual inference methods for Markov Decision Processes (MDPs). Current approaches assume a specific causal model to make counterfactuals identifiable. However, there are usually many causal models that align with the observational and interventional distributions of an MDP, each yielding different counterfactual distributions, so fixing a particular causal model limits the validity (and usefulness) of counterfactual inference. We propose a novel non-parametric approach that computes tight bounds on counterfactual transition probabilities across all compatible causal models. Unlike previous methods that require solving prohibitively large optimisation problems (with variables that grow exponentially in the size of the MDP), our approach provides closed-form expressions for these bounds, making computation highly efficient and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference
MethodsCounterfactuals Explanations · ALIGN
