Identifiability and generalizability from multiple experts in Inverse Reinforcement Learning
Paul Rolland, Luca Viano, Norman Schuerhoff, Boris Nikolov, Volkan, Cevher

TL;DR
This paper investigates the conditions under which reward functions in Inverse Reinforcement Learning can be uniquely identified from multiple experts' behaviors and explores how this knowledge can be used to generalize to new environments.
Contribution
It provides a verifiable rank condition for reward identifiability in tabular MDPs and extends the analysis to feature-based rewards and approximate transition models.
Findings
Reward functions can be identified up to a constant with multiple experts under certain conditions.
The rank condition for identifiability is necessary and sufficient.
Data on multiple experts enables policy generalization to new environments.
Abstract
While Reinforcement Learning (RL) aims to train an agent from a reward function in a given environment, Inverse Reinforcement Learning (IRL) seeks to recover the reward function from observing an expert's behavior. It is well known that, in general, various reward functions can lead to the same optimal policy, and hence, IRL is ill-defined. However, (Cao et al., 2021) showed that, if we observe two or more experts with different discount factors or acting in different environments, the reward function can under certain conditions be identified up to a constant. This work starts by showing an equivalent identifiability statement from multiple experts in tabular MDPs based on a rank condition, which is easily verifiable and is shown to be also necessary. We then extend our result to various different scenarios, i.e., we characterize reward identifiability in the case where the reward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsInnovation Diffusion and Forecasting · Game Theory and Applications · Experimental Behavioral Economics Studies
