Embodied Multi-Agent Coordination by Aligning World Models Through Dialogue
Vardhan Dongre, Dilek Hakkani-T\"ur

TL;DR
This paper investigates whether large language model-based embodied agents can effectively communicate to align their world models during collaborative tasks, revealing that dialogue improves coordination but may hinder task success.
Contribution
It introduces a framework for measuring world-model alignment in embodied agents and evaluates how dialogue influences coordination and understanding.
Findings
Dialogue reduces action conflicts by 40 to 83 percentage points.
Dialogue degrades task success compared to silent coordination.
Current models often achieve superficial coordination without genuine world-model alignment.
Abstract
Effective collaboration between embodied agents requires more than acting in a shared environment; it demands communication grounded in each agent's evolving understanding of the world. When agents can only partially observe their surroundings, coordination without communication is provably hard, but communication can, in principle, bridge this gap by allowing agents to share observations and align their world models. In this work, we examine whether LLM-based embodied agents actually realize the ability to communicate. We extend PARTNR, a benchmark for collaborative household robotics, with a natural-language dialogue channel that enables two agents with partial observability to communicate during task execution. To evaluate whether dialogue leads to genuine world-model alignment rather than superficial coordination, we propose a framework for measuring world-model alignment defined…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
