Advice Conformance Verification by Reinforcement Learning agents for   Human-in-the-Loop

Mudit Verma; Ayush Kharkwal; Subbarao Kambhampati

arXiv:2210.03455·cs.AI·October 10, 2022

Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop

Mudit Verma, Ayush Kharkwal, Subbarao Kambhampati

PDF

Open Access

TL;DR

This paper introduces a method for reinforcement learning agents to verify and communicate how well they conform to human advice, enhancing transparency and trust in human-in-the-loop systems.

Contribution

It proposes a novel Advice-Conformance Verification framework using a Preference Tree to interpret and communicate advice adherence in RL agents.

Findings

01

The method effectively indicates advice conformity in MuJoCo's Humanoid environment.

02

It provides interpretable explanations of advice usage.

03

A human study validates the approach's usefulness.

Abstract

Human-in-the-loop (HiL) reinforcement learning is gaining traction in domains with large action and state spaces, and sparse rewards by allowing the agent to take advice from HiL. Beyond advice accommodation, a sequential decision-making agent must be able to express the extent to which it was able to utilize the human advice. Subsequently, the agent should provide a means for the HiL to inspect parts of advice that it had to reject in favor of the overall environment objective. We introduce the problem of Advice-Conformance Verification which requires reinforcement learning (RL) agents to provide assurances to the human in the loop regarding how much of their advice is being conformed to. We then propose a Tree-based lingua-franca to support this communication, called a Preference Tree. We study two cases of good and bad advice scenarios in MuJoCo's Humanoid environment. Through our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Embodied and Extended Cognition · Reinforcement Learning in Robotics