Interactively Teaching an Inverse Reinforcement Learner with Limited Feedback

Rustam Zayanov; Francisco S. Melo; Manuel Lopes

arXiv:2309.09095·cs.LG·September 19, 2023



TL;DR

This paper addresses the challenge of teaching inverse reinforcement learners with limited feedback by formalizing the problem and proposing an algorithm that combines active state selection and policy inference, validated in a synthetic driving environment.

Contribution

It introduces a formal framework for teaching with limited feedback and develops an algorithm integrating active learning and inverse reinforcement learning techniques.

Findings

01. The proposed algorithm effectively teaches learners with limited feedback.

02. It outperforms baseline methods in a synthetic driving environment.

03. The method successfully infers policies from minimal trajectory data.

Abstract

We study the problem of teaching via demonstrations in sequential decision-making tasks. In particular, we focus on the situation when the teacher has no access to the learner's model and policy, and the feedback from the learner is limited to trajectories that start from states selected by the teacher. The necessity to select the starting states and infer the learner's policy creates an opportunity for using the methods of inverse reinforcement learning and active learning by the teacher. In this work, we formalize the teaching process with limited feedback and propose an algorithm that solves this teaching problem. The algorithm uses a modified version of the active value-at-risk method to select the starting states, a modified maximum causal entropy algorithm to infer the policy, and the difficulty score ratio method to choose the teaching demonstrations. We test the algorithm in a…
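The interaction described in the abstract (the teacher picks a starting state, observes the learner's trajectory, estimates the learner's policy, then chooses a demonstration) can be sketched as a loop. The following is a minimal toy sketch, not the paper's implementation. Everything in it is an assumption made for illustration: the five-state chain environment, the deterministic teacher, the learner's update rule, and all function names. In particular, it substitutes simple stand-ins for the three components the abstract names: least-visited-state selection in place of active value-at-risk, smoothed action frequencies in place of maximum causal entropy inference, and a plain likelihood ratio in place of the difficulty score ratio.

```python
import random
from collections import Counter

N_STATES = 5  # hypothetical chain environment with states 0..4


def step(state, action):
    """Move left (-1) or right (+1), clamped to the chain."""
    return min(max(state + action, 0), N_STATES - 1)


def rollout(policy, start, rng, horizon=4):
    """Sample [(state, action), ...]; policy[s] is the probability of +1."""
    traj, s = [], start
    for _ in range(horizon):
        a = 1 if rng.random() < policy[s] else -1
        traj.append((s, a))
        s = step(s, a)
    return traj


def pick_query_state(observed):
    """Stand-in for active state selection: query the least-visited state."""
    visits = Counter(s for traj in observed for s, _ in traj)
    return min(range(N_STATES), key=lambda s: (visits[s], s))


def estimate_policy(observed, prior=1.0):
    """Stand-in for policy inference: smoothed action frequencies per state."""
    right = [prior] * N_STATES
    total = [2 * prior] * N_STATES
    for traj in observed:
        for s, a in traj:
            total[s] += 1
            right[s] += 1 if a == 1 else 0
    return [r / t for r, t in zip(right, total)]


def likelihood(policy, traj):
    """Probability of the trajectory's actions under the given policy."""
    p = 1.0
    for s, a in traj:
        p *= policy[s] if a == 1 else 1.0 - policy[s]
    return p


def pick_demo(teacher, learner_est, candidates):
    """Stand-in for demonstration choice: likely for the teacher,
    unlikely for the estimated learner."""
    def score(traj):
        return likelihood(teacher, traj) / max(likelihood(learner_est, traj), 1e-12)
    return max(candidates, key=score)


def teach(rounds=30, seed=0, lr=0.3):
    rng = random.Random(seed)
    teacher = [1.0] * N_STATES  # assumed teacher: always move right
    learner = [0.5] * N_STATES  # assumed learner: starts uniform
    observed = []
    for _ in range(rounds):
        # 1. Limited feedback: one learner trajectory from a chosen start state.
        observed.append(rollout(learner, pick_query_state(observed), rng))
        # 2. Estimate the learner's policy from recent feedback only.
        estimate = estimate_policy(observed[-10:])
        # 3. Choose and show a demonstration; the toy learner imitates it.
        candidates = [rollout(teacher, s, rng) for s in range(N_STATES)]
        for s, a in pick_demo(teacher, estimate, candidates):
            target = 1.0 if a == 1 else 0.0
            learner[s] += lr * (target - learner[s])
    return teacher, learner


if __name__ == "__main__":
    teacher, learner = teach()
    print([round(p, 2) for p in learner])
```

Note that the teacher never reads the `learner` list directly when choosing what to do: it only sees sampled trajectories, which mirrors the limited-feedback setting. The direct update in step 3 merely simulates how a learner might respond to a demonstration.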


Code & Models

Repositories

rzayanov/irl-teaching-limited-feedback (Official)


Taxonomy

Topics: Reinforcement Learning in Robotics · Supply Chain and Inventory Management · Energy Efficiency and Management
