Repeated Inverse Reinforcement Learning

Kareem Amin; Nan Jiang; Satinder Singh

arXiv:1705.05427·cs.AI·November 7, 2017·30 cites

Repeated Inverse Reinforcement Learning

Kareem Amin, Nan Jiang, Satinder Singh

PDF

Open Access

TL;DR

This paper introduces a new repeated inverse reinforcement learning problem where an agent learns to act on behalf of a human across multiple tasks, aiming to minimize surprising the human and efficiently updating its behavior with demonstrations.

Contribution

It formalizes the repeated IRL problem, explores different task sequence models, and provides foundational theoretical results for this novel setting.

Findings

01

Formalization of the repeated IRL problem

02

Analysis of task sequence selection methods

03

Foundational theoretical results for the problem

Abstract

We introduce a novel repeated Inverse Reinforcement Learning problem: the agent has to act on behalf of a human in a sequence of tasks and wishes to minimize the number of tasks that it surprises the human by acting suboptimally with respect to how the human would have acted. Each time the human is surprised, the agent is provided a demonstration of the desired behavior by the human. We formalize this problem, including how the sequence of tasks is chosen, in a few different ways and provide some foundational results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Data Stream Mining Techniques