Identifiability and generalizability from multiple experts in Inverse   Reinforcement Learning

Paul Rolland; Luca Viano; Norman Schuerhoff; Boris Nikolov; Volkan; Cevher

arXiv:2209.10974·cs.LG·October 14, 2022·1 cites

Identifiability and generalizability from multiple experts in Inverse Reinforcement Learning

Paul Rolland, Luca Viano, Norman Schuerhoff, Boris Nikolov, Volkan, Cevher

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates the conditions under which reward functions in Inverse Reinforcement Learning can be uniquely identified from multiple experts' behaviors and explores how this knowledge can be used to generalize to new environments.

Contribution

It provides a verifiable rank condition for reward identifiability in tabular MDPs and extends the analysis to feature-based rewards and approximate transition models.

Findings

01

Reward functions can be identified up to a constant with multiple experts under certain conditions.

02

The rank condition for identifiability is necessary and sufficient.

03

Data on multiple experts enables policy generalization to new environments.

Abstract

While Reinforcement Learning (RL) aims to train an agent from a reward function in a given environment, Inverse Reinforcement Learning (IRL) seeks to recover the reward function from observing an expert's behavior. It is well known that, in general, various reward functions can lead to the same optimal policy, and hence, IRL is ill-defined. However, (Cao et al., 2021) showed that, if we observe two or more experts with different discount factors or acting in different environments, the reward function can under certain conditions be identified up to a constant. This work starts by showing an equivalent identifiability statement from multiple experts in tabular MDPs based on a rank condition, which is easily verifiable and is shown to be also necessary. We then extend our result to various different scenarios, i.e., we characterize reward identifiability in the case where the reward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lviano/identifiability_irl
pytorchOfficial

Videos

Identifiability and generalizability from multiple experts in Inverse Reinforcement Learning· slideslive

Taxonomy

TopicsInnovation Diffusion and Forecasting · Game Theory and Applications · Experimental Behavioral Economics Studies