Learning Causally Invariant Reward Functions from Diverse Demonstrations

Ivan Ovinnikov; Eugene Bykovets; Joachim M. Buhmann

arXiv:2409.08012·cs.LG·September 13, 2024

TL;DR

This paper introduces a causal invariance regularizer for inverse reinforcement learning. It targets reward functions learned from diverse, heterogeneous demonstrations and aims to improve their generalization and the transferability of policies trained on them.

Contribution

It proposes a novel regularization approach based on causal invariance principles to improve reward learning robustness and generalization in inverse reinforcement learning.

Findings

01. Superior policy performance in transfer settings
02. Enhanced reward function generalization
03. Effective regularization for diverse demonstrations

Abstract

Inverse reinforcement learning methods aim to retrieve the reward function of a Markov decision process based on a dataset of expert demonstrations. The commonplace scarcity and heterogeneous sources of such demonstrations can lead to the absorption of spurious correlations in the data by the learned reward function. Consequently, this adaptation often exhibits behavioural overfitting to the expert data set when a policy is trained on the obtained reward function under distribution shift of the environment dynamics. In this work, we explore a novel regularization approach for inverse reinforcement learning methods based on the causal invariance principle with the goal of improved reward function generalization. By applying this regularization to both exact and approximate formulations of the learning task, we demonstrate superior policy performance when trained using the recovered…
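The abstract does not state the exact form of the regularizer, so the following is only a minimal sketch of one common way to encode causal invariance: an IRMv1-style gradient penalty from invariant risk minimization, applied here to a discriminator-style reward loss of the kind used in adversarial imitation learning. Treating each demonstration source as a separate "environment", the choice of binary cross-entropy loss, and all function names (`discriminator_loss`, `irm_penalty`, `regularized_loss`) and the weight `lam` are assumptions made for illustration, not the authors' formulation.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def discriminator_loss(expert_logits, policy_logits, w=1.0):
    """Binary cross-entropy on reward logits scaled by a dummy factor w.

    Expert samples are labeled 1 and policy samples 0 (assumed loss form).
    """
    expert_term = -sum(math.log(sigmoid(w * r)) for r in expert_logits)
    policy_term = -sum(math.log(1.0 - sigmoid(w * r)) for r in policy_logits)
    return expert_term / len(expert_logits) + policy_term / len(policy_logits)


def irm_penalty(expert_logits, policy_logits):
    """Squared gradient of the loss w.r.t. the dummy scale w, taken at w = 1.

    d/dw[-log sigmoid(w r)]       = -(1 - sigmoid(w r)) * r
    d/dw[-log(1 - sigmoid(w r))]  =  sigmoid(w r) * r
    """
    g_expert = -sum((1.0 - sigmoid(r)) * r for r in expert_logits)
    g_policy = sum(sigmoid(r) * r for r in policy_logits)
    grad = g_expert / len(expert_logits) + g_policy / len(policy_logits)
    return grad ** 2


def regularized_loss(envs, lam=1.0):
    """Mean loss over environments plus lam times the mean invariance penalty.

    envs is a list of (expert_logits, policy_logits) pairs, one pair per
    demonstration source (hypothetical data layout).
    """
    losses = [discriminator_loss(e, p) for e, p in envs]
    penalties = [irm_penalty(e, p) for e, p in envs]
    return sum(losses) / len(envs) + lam * sum(penalties) / len(envs)
```

The penalty is zero for an environment when rescaling the reward output cannot reduce that environment's loss. Under this assumed formulation, a small penalty across all demonstration sources indicates that the same reward is simultaneously well calibrated for each of them, which is the intuition behind discouraging source-specific spurious correlations.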

Taxonomy

Topics: Bayesian Modeling and Causal Inference

Methods: Sparse Evolutionary Training