Identifiability and Generalizability in Constrained Inverse   Reinforcement Learning

Andreas Schlaginhaufen; Maryam Kamgarpour

arXiv:2306.00629·cs.LG·June 2, 2023·2 cites

Identifiability and Generalizability in Constrained Inverse Reinforcement Learning

Andreas Schlaginhaufen, Maryam Kamgarpour

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper develops a theoretical framework for constrained inverse reinforcement learning, analyzing reward identifiability and generalizability, with implications for safety and transferability in RL.

Contribution

It extends reward identifiability and generalizability results to constrained MDPs and regularizations, highlighting conditions for reward uniqueness and transferability.

Findings

01

Identifiability up to potential shaping is due to entropy regularization.

02

Reward must be identified up to a constant for generalization.

03

Finite sample guarantees for reward suboptimality are provided.

Abstract

Two main challenges in Reinforcement Learning (RL) are designing appropriate reward functions and ensuring the safety of the learned policy. To address these challenges, we present a theoretical framework for Inverse Reinforcement Learning (IRL) in constrained Markov decision processes. From a convex-analytic perspective, we extend prior results on reward identifiability and generalizability to both the constrained setting and a more general class of regularizations. In particular, we show that identifiability up to potential shaping (Cao et al., 2021) is a consequence of entropy regularization and may generally no longer hold for other regularizations or in the presence of safety constraints. We also show that to ensure generalizability to new transition laws and constraints, the true reward must be identified up to a constant. Additionally, we derive a finite sample guarantee for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andrschl/cirl
noneOfficial

Videos

Identifiability and Generalizability in Constrained Inverse Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsEntropy Regularization