GOMA: Proactive Embodied Cooperative Communication via Goal-Oriented Mental Alignment
Lance Ying, Kunal Jha, Shivam Aarya, Joshua B. Tenenbaum, Antonio, Torralba, Tianmin Shu

TL;DR
GOMA introduces a goal-oriented mental alignment framework enabling embodied assistants to proactively communicate via natural language, improving cooperation in complex environments by reducing mental state misalignment.
Contribution
This paper presents a novel planning-based approach for proactive verbal communication that aligns agents' mental states to enhance cooperation.
Findings
GOMA outperforms strong baselines in cooperative tasks.
Large language models struggle with contextually meaningful communication.
Proactive communication improves human perception and task performance.
Abstract
Verbal communication plays a crucial role in human cooperation, particularly when the partners only have incomplete information about the task, environment, and each other's mental state. In this paper, we propose a novel cooperative communication framework, Goal-Oriented Mental Alignment (GOMA). GOMA formulates verbal communication as a planning problem that minimizes the misalignment between the parts of agents' mental states that are relevant to the goals. This approach enables an embodied assistant to reason about when and how to proactively initialize communication with humans verbally using natural language to help achieve better cooperation. We evaluate our approach against strong baselines in two challenging environments, Overcooked (a multiplayer game) and VirtualHome (a household simulator). Our experimental results demonstrate that large language models struggle with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmbodied and Extended Cognition · Cognitive Science and Mapping · Design Education and Practice
