Causal Imitation Learning Under Measurement Error and Distribution Shift
Shi Bo, AmirEmad Ghassami

TL;DR
This paper introduces CausIL, a causal inference-based framework for offline imitation learning that effectively handles measurement errors and distribution shifts, improving policy robustness in noisy, changing environments.
Contribution
It proposes a novel causal modeling approach for IL under measurement error, with identification conditions and estimators for discrete and continuous states, enhancing robustness to distribution shifts.
Findings
CausIL outperforms behavioral cloning in robustness to distribution shift.
The method is applicable to both discrete and continuous state spaces.
Empirical results on medical data show improved policy recovery.
Abstract
We study offline imitation learning (IL) when part of the decision-relevant state is observed only through noisy measurements and the distribution may change between training and deployment. Such settings induce spurious state-action correlations, so standard behavioral cloning (BC) -- whether conditioning on raw measurements or ignoring them -- can converge to systematically biased policies under distribution shift. We propose a general framework for IL under measurement error, inspired by explicitly modeling the causal relationships among the variables, yielding a target that retains a causal interpretation and is robust to distribution shift. Building on ideas from proximal causal inference, we introduce \texttt{CausIL}, which treats noisy state observations as proxy variables, and we provide identification conditions under which the target policy is recoverable from demonstrations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Explainable Artificial Intelligence (XAI) · Domain Adaptation and Few-Shot Learning
