Interactively Teaching an Inverse Reinforcement Learner with Limited Feedback
Rustam Zayanov, Francisco S. Melo, Manuel Lopes

TL;DR
This paper addresses the challenge of teaching inverse reinforcement learners with limited feedback by formalizing the problem and proposing an algorithm that combines active state selection and policy inference, validated in a synthetic driving environment.
Contribution
It introduces a formal framework for teaching with limited feedback and develops an algorithm integrating active learning and inverse reinforcement learning techniques.
Findings
The proposed algorithm effectively teaches learners with limited feedback.
It outperforms baseline methods in a synthetic driving environment.
The method successfully infers policies from minimal trajectory data.
Abstract
We study the problem of teaching via demonstrations in sequential decision-making tasks. In particular, we focus on the situation when the teacher has no access to the learner's model and policy, and the feedback from the learner is limited to trajectories that start from states selected by the teacher. The necessity to select the starting states and infer the learner's policy creates an opportunity for using the methods of inverse reinforcement learning and active learning by the teacher. In this work, we formalize the teaching process with limited feedback and propose an algorithm that solves this teaching problem. The algorithm uses a modified version of the active value-at-risk method to select the starting states, a modified maximum causal entropy algorithm to infer the policy, and the difficulty score ratio method to choose the teaching demonstrations. We test the algorithm in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Supply Chain and Inventory Management · Energy Efficiency and Management
MethodsFocus
