Learning Causally Invariant Reward Functions from Diverse Demonstrations
Ivan Ovinnikov, Eugene Bykovets, Joachim M. Buhmann

TL;DR
This paper introduces a causal invariance regularization technique for inverse reinforcement learning, enhancing reward function generalization and policy transferability across diverse and heterogeneous demonstrations.
Contribution
It proposes a novel regularization approach based on causal invariance principles to improve reward learning robustness and generalization in inverse reinforcement learning.
Findings
Superior policy performance in transfer settings
Enhanced reward function generalization
Effective regularization for diverse demonstrations
Abstract
Inverse reinforcement learning methods aim to retrieve the reward function of a Markov decision process based on a dataset of expert demonstrations. The commonplace scarcity and heterogeneous sources of such demonstrations can lead to the absorption of spurious correlations in the data by the learned reward function. Consequently, this adaptation often exhibits behavioural overfitting to the expert data set when a policy is trained on the obtained reward function under distribution shift of the environment dynamics. In this work, we explore a novel regularization approach for inverse reinforcement learning methods based on the causal invariance principle with the goal of improved reward function generalization. By applying this regularization to both exact and approximate formulations of the learning task, we demonstrate superior policy performance when trained using the recovered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference
MethodsSparse Evolutionary Training
