Asking Easy Questions: A User-Friendly Approach to Active Reward   Learning

Erdem B{\i}y{\i}k; Malayandi Palan; Nicholas C. Landolfi; Dylan P.; Losey; Dorsa Sadigh

arXiv:1910.04365·cs.RO·October 11, 2019·54 cites

Asking Easy Questions: A User-Friendly Approach to Active Reward Learning

Erdem B{\i}y{\i}k, Malayandi Palan, Nicholas C. Landolfi, Dylan P., Losey, Dorsa Sadigh

PDF

Open Access 2 Repos

TL;DR

This paper introduces a method for active reward learning in robots that prioritizes asking questions easy for humans to answer, improving learning efficiency by balancing robot and human uncertainties.

Contribution

It proposes an information gain approach that considers human answerability, enhancing question selection for faster reward learning in human-robot interaction.

Findings

01

Questions are easier for humans to answer.

02

Faster reward learning achieved in simulations.

03

User study confirms improved question quality.

Abstract

Robots can learn the right reward function by querying a human expert. Existing approaches attempt to choose questions where the robot is most uncertain about the human's response; however, they do not consider how easy it will be for the human to answer! In this paper we explore an information gain formulation for optimally selecting questions that naturally account for the human's ability to answer. Our approach identifies questions that optimize the trade-off between robot and human uncertainty, and determines when these questions become redundant or costly. Simulations and a user study show our method not only produces easy questions, but also ultimately results in faster reward learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Advanced Bandit Algorithms Research