Robots that Take Advantage of Human Trust

Dylan P. Losey; Dorsa Sadigh

arXiv:1909.05777·cs.RO·September 13, 2019

Robots that Take Advantage of Human Trust

Dylan P. Losey, Dorsa Sadigh

PDF

TL;DR

This paper explores how robots can exploit human trust by acting in ways that influence human perceptions and decisions, thereby improving efficiency in human-robot interactions.

Contribution

It introduces a formal game-theoretic model of human-robot interaction where robots leverage human trust, and demonstrates this through theoretical analysis and empirical studies.

Findings

01

Trusting human models lead to more communicative robot behavior.

02

Exploiting human trust can increase user involvement.

03

Different human models affect robot strategies and outcomes.

Abstract

Humans often assume that robots are rational. We believe robots take optimal actions given their objective; hence, when we are uncertain about what the robot's objective is, we interpret the robot's actions as optimal with respect to our estimate of its objective. This approach makes sense when robots straightforwardly optimize their objective, and enables humans to learn what the robot is trying to achieve. However, our insight is that---when robots are aware that humans learn by trusting that the robot actions are rational---intelligent robots do not act as the human expects; instead, they take advantage of the human's trust, and exploit this trust to more efficiently optimize their own objective. In this paper, we formally model instances of human-robot interaction (HRI) where the human does not know the robot's objective using a two-player game. We formulate different ways in which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.