Reward Shaping for Building Trustworthy Robots in Sequential Human-Robot Interaction
Yaohui Guo, X. Jessie Yang, Cong Shi

TL;DR
This paper introduces a reward-shaping framework for human-robot interaction that balances increasing human trust with maintaining task performance, using a Markov game approach and potential-based rewards.
Contribution
It formulates trust-aware HRI as a Markov game and applies potential-based reward shaping to enhance trust without significant performance loss.
Findings
Successfully increased human trust in simulation
Bounded performance loss with potential-based rewards
Efficient linear program for real-world deployment
Abstract
Trust-aware human-robot interaction (HRI) has received increasing research attention, as trust has been shown to be a crucial factor for effective HRI. Research in trust-aware HRI discovered a dilemma -- maximizing task rewards often leads to decreased human trust, while maximizing human trust would compromise task performance. In this work, we address this dilemma by formulating the HRI process as a two-player Markov game and utilizing the reward-shaping technique to improve human trust while limiting performance loss. Specifically, we show that when the shaping reward is potential-based, the performance loss can be bounded by the potential functions evaluated at the final states of the Markov game. We apply the proposed framework to the experience-based trust model, resulting in a linear program that can be efficiently solved and deployed in real-world applications. We evaluate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman-Automation Interaction and Safety · Cognitive Functions and Memory · Healthcare Technology and Patient Monitoring
