REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning?

Chenxi Jiang; Chuhao Zhou; Jianfei Yang

arXiv:2505.10872·cs.RO·May 12, 2026

REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning?

Chenxi Jiang, Chuhao Zhou, Jianfei Yang

PDF

1 Video

TL;DR

This paper introduces REI-Bench, a benchmark for robot task planning with vague human instructions, revealing significant performance drops due to referential vagueness and proposing context cognition to improve understanding.

Contribution

It is the first benchmark modeling vague referring expressions grounded in pragmatic theory and proposes a context cognition method to enhance robot task planning accuracy.

Findings

01

Vagueness in referring expressions can reduce planning success by up to 36.9%.

02

Most planning failures are due to missing objects in the robot's understanding.

03

Context cognition significantly improves robot understanding of vague instructions.

Abstract

Robot task planning decomposes human instructions into executable action sequences that enable robots to complete a series of complex tasks. Although recent large language model (LLM)-based task planners achieve amazing performance, they assume that human instructions are clear and straightforward. However, real-world users are not experts, and their instructions to robots often contain significant vagueness. Linguists suggest that such vagueness frequently arises from referring expressions (REs), whose meanings depend heavily on dialogue context and environment. This vagueness is even more prevalent among the elderly and children, who are the groups that robots should serve more. This paper studies how such vagueness in REs within human instructions affects LLM-based robot task planning and how to overcome this issue. To this end, we propose the first robot task planning benchmark that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning?· slideslive