Robots that Take Advantage of Human Trust
Dylan P. Losey, Dorsa Sadigh

TL;DR
This paper explores how robots can exploit human trust by acting in ways that influence human perceptions and decisions, thereby improving efficiency in human-robot interactions.
Contribution
It introduces a formal game-theoretic model of human-robot interaction where robots leverage human trust, and demonstrates this through theoretical analysis and empirical studies.
Findings
Trusting human models lead to more communicative robot behavior.
Exploiting human trust can increase user involvement.
Different human models affect robot strategies and outcomes.
Abstract
Humans often assume that robots are rational. We believe robots take optimal actions given their objective; hence, when we are uncertain about what the robot's objective is, we interpret the robot's actions as optimal with respect to our estimate of its objective. This approach makes sense when robots straightforwardly optimize their objective, and enables humans to learn what the robot is trying to achieve. However, our insight is that---when robots are aware that humans learn by trusting that the robot actions are rational---intelligent robots do not act as the human expects; instead, they take advantage of the human's trust, and exploit this trust to more efficiently optimize their own objective. In this paper, we formally model instances of human-robot interaction (HRI) where the human does not know the robot's objective using a two-player game. We formulate different ways in which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
