Accounting for Human Learning when Inferring Human Preferences

Harry Giles; Lawrence Chan

arXiv:2011.05596·cs.LG·December 2, 2020

Accounting for Human Learning when Inferring Human Preferences

Harry Giles, Lawrence Chan

PDF

Open Access

TL;DR

This paper explores how accounting for human learning in inverse reinforcement learning can improve the inference of human preferences, especially in unfamiliar environments, by modeling humans as learning agents rather than stationary demonstrators.

Contribution

It introduces a model that incorporates human learning into IRL, demonstrating that this approach can outperform traditional methods assuming stationarity in certain scenarios.

Findings

01

Learning-aware IRL can outperform stationary models in some cases.

02

Model misspecification can lead to poor inference results.

03

Accounting for human learning is crucial in unfamiliar environments.

Abstract

Inverse reinforcement learning (IRL) is a common technique for inferring human preferences from data. Standard IRL techniques tend to assume that the human demonstrator is stationary, that is that their policy $π$ doesn't change over time. In practice, humans interacting with a novel environment or performing well on a novel task will change their demonstrations as they learn more about the environment or task. We investigate the consequences of relaxing this assumption of stationarity, in particular by modelling the human as learning. Surprisingly, we find in some small examples that this can lead to better inference than if the human was stationary. That is, by observing a demonstrator who is themselves learning, a machine can infer more than by observing a demonstrator who is noisily rational. In addition, we find evidence that misspecification can lead to poor inference,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Computability, Logic, AI Algorithms · Advanced Bandit Algorithms Research