Reducing Human-Robot Goal State Divergence with Environment Design

Kelsey Sikes; Sarah Keren; Sarath Sreedharan

arXiv:2404.15184·cs.AI·April 24, 2024

TL;DR

This paper introduces Goal State Divergence (GSD), a metric that quantifies the mismatch between the goal state a robot reaches and the one its human user expected, and uses environment design modifications to reduce that mismatch, with the aim of improving human-robot collaboration.

Contribution

The paper proposes GSD as a novel metric for goal alignment, together with the human-robot goal alignment (HRGA) design problem, which identifies a minimal set of environment modifications that prevent goal state mismatches.

Findings

1. GSD effectively measures goal state divergence in human-robot interactions.

2. Environment modifications based on GSD reduce goal mismatches in benchmarks.

3. The approach improves safety and alignment in human-robot collaboration.

Abstract

One of the most difficult challenges in creating successful human-AI collaborations is aligning a robot's behavior with a human user's expectations. When this fails to occur, a robot may misinterpret the user's specified goals, prompting it to perform actions with unanticipated, potentially dangerous side effects. To avoid this, we propose a new metric we call Goal State Divergence ($GSD$), which represents the difference between a robot's final goal state and the one a human user expected. In cases where $GSD$ cannot be directly calculated, we show how it can be approximated using maximal and minimal bounds. We then input the $GSD$ value into our novel human-robot goal alignment (HRGA) design problem, which identifies a minimal set of environment modifications that can prevent such mismatches. To show the effectiveness of $GSD$ for reducing…
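The idea of measuring a difference between two goal states, and bounding it when the expected state is uncertain, can be illustrated with a small sketch. This is not the paper's formal definition, which is not given in this excerpt. It assumes that a state is a set of true facts, that divergence is the number of facts on which two states disagree, and that uncertainty about the human's expectation is represented as a set of candidate expected states. The names `goal_state_divergence` and `gsd_bounds` and the example facts are hypothetical.

```python
from typing import FrozenSet, Iterable, Tuple

# Assumption: a state is modeled as the set of facts that hold in it.
State = FrozenSet[str]


def goal_state_divergence(robot_state: State, expected_state: State) -> int:
    """Count the facts on which the two states disagree.

    Assumption: divergence is the size of the symmetric difference.
    The paper's actual GSD definition may differ.
    """
    return len(robot_state ^ expected_state)


def gsd_bounds(robot_state: State,
               candidate_expected: Iterable[State]) -> Tuple[int, int]:
    """Return (minimal, maximal) divergence over candidate expected states.

    Assumption: when the expected state is not known exactly, it is one of
    a finite set of candidates, so the true value lies between these bounds.
    """
    values = [goal_state_divergence(robot_state, s) for s in candidate_expected]
    if not values:
        raise ValueError("at least one candidate expected state is required")
    return min(values), max(values)


# Hypothetical example: the robot reaches the goal but breaks a vase.
robot_final = frozenset({"cup_on_table", "vase_broken"})
expected = frozenset({"cup_on_table", "vase_intact"})
print(goal_state_divergence(robot_final, expected))  # 2

# Two candidate expectations: one cares about the vase, one matches the robot.
candidates = [expected, frozenset({"cup_on_table", "vase_broken"})]
print(gsd_bounds(robot_final, candidates))  # (0, 2)
```

Under these assumptions, a lower bound of zero means some candidate expectation is fully consistent with the robot's outcome, while a nonzero upper bound signals that a mismatch is still possible.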

Taxonomy

Topics: Human-Automation Interaction and Safety