Reward Shaping for Building Trustworthy Robots in Sequential Human-Robot   Interaction

Yaohui Guo; X. Jessie Yang; Cong Shi

arXiv:2308.00945·cs.RO·August 3, 2023·1 cites

Reward Shaping for Building Trustworthy Robots in Sequential Human-Robot Interaction

Yaohui Guo, X. Jessie Yang, Cong Shi

PDF

Open Access

TL;DR

This paper introduces a reward-shaping framework for human-robot interaction that balances increasing human trust with maintaining task performance, using a Markov game approach and potential-based rewards.

Contribution

It formulates trust-aware HRI as a Markov game and applies potential-based reward shaping to enhance trust without significant performance loss.

Findings

01

Successfully increased human trust in simulation

02

Bounded performance loss with potential-based rewards

03

Efficient linear program for real-world deployment

Abstract

Trust-aware human-robot interaction (HRI) has received increasing research attention, as trust has been shown to be a crucial factor for effective HRI. Research in trust-aware HRI discovered a dilemma -- maximizing task rewards often leads to decreased human trust, while maximizing human trust would compromise task performance. In this work, we address this dilemma by formulating the HRI process as a two-player Markov game and utilizing the reward-shaping technique to improve human trust while limiting performance loss. Specifically, we show that when the shaping reward is potential-based, the performance loss can be bounded by the potential functions evaluated at the final states of the Markov game. We apply the proposed framework to the experience-based trust model, resulting in a linear program that can be efficiently solved and deployed in real-world applications. We evaluate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman-Automation Interaction and Safety · Cognitive Functions and Memory · Healthcare Technology and Patient Monitoring