Repeated Inverse Reinforcement Learning
Kareem Amin, Nan Jiang, Satinder Singh

TL;DR
This paper introduces a new repeated inverse reinforcement learning problem where an agent learns to act on behalf of a human across multiple tasks, aiming to minimize surprising the human and efficiently updating its behavior with demonstrations.
Contribution
It formalizes the repeated IRL problem, explores different task sequence models, and provides foundational theoretical results for this novel setting.
Findings
Formalization of the repeated IRL problem
Analysis of task sequence selection methods
Foundational theoretical results for the problem
Abstract
We introduce a novel repeated Inverse Reinforcement Learning problem: the agent has to act on behalf of a human in a sequence of tasks and wishes to minimize the number of tasks that it surprises the human by acting suboptimally with respect to how the human would have acted. Each time the human is surprised, the agent is provided a demonstration of the desired behavior by the human. We formalize this problem, including how the sequence of tasks is chosen, in a few different ways and provide some foundational results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Data Stream Mining Techniques
