One-shot Policy Elicitation via Semantic Reward Manipulation

Aaquib Tabrez; Ryan Leonard; Bradley Hayes

arXiv:2101.01860·cs.RO·January 7, 2021·1 cites

One-shot Policy Elicitation via Semantic Reward Manipulation

Aaquib Tabrez, Ryan Leonard, Bradley Hayes

PDF

Open Access

TL;DR

This paper introduces SPEAR, a novel algorithm that uses semantic explanations to modify agents' reward functions, enabling effective one-shot policy manipulation and improved collaboration in complex domains.

Contribution

SPEAR is a new sequential optimization method that leverages semantic explanations to efficiently manipulate policies, outperforming existing approaches in runtime and problem complexity.

Findings

01

SPEAR effectively manipulates policies in complex domains.

02

It significantly reduces runtime compared to state-of-the-art methods.

03

The method enhances agent collaboration through semantic reward augmentation.

Abstract

Synchronizing expectations and knowledge about the state of the world is an essential capability for effective collaboration. For robots to effectively collaborate with humans and other autonomous agents, it is critical that they be able to generate intelligible explanations to reconcile differences between their understanding of the world and that of their collaborators. In this work we present Single-shot Policy Explanation for Augmenting Rewards (SPEAR), a novel sequential optimization algorithm that uses semantic explanations derived from combinations of planning predicates to augment agents' reward functions, driving their policies to exhibit more optimal behavior. We provide an experimental validation of our algorithm's policy manipulation capabilities in two practically grounded applications and conclude with a performance analysis of SPEAR on domains of increasingly complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Topic Modeling