Physical Reasoning and Object Planning for Household Embodied Agents
Ayush Agrawal, Raghav Prabhakar, Anirudh Goyal, Dianbo Liu

TL;DR
This paper introduces the COAT framework and datasets to evaluate how language models reason about substitute objects in household tasks, emphasizing physical state, utility, and context for improved embodied agent decision-making.
Contribution
It presents the novel COAT datasets and human preference mappings, advancing understanding of physical commonsense reasoning in language models for household agents.
Findings
Language models struggle with aligning object utility to tasks.
Contextual dependencies significantly affect object substitution decisions.
Physical state variables influence reasoning about object usability.
Abstract
In this study, we explore the sophisticated domain of task planning for robust household embodied agents, with a particular emphasis on the intricate task of selecting substitute objects. We introduce the CommonSense Object Affordance Task (COAT), a novel framework designed to analyze reasoning capabilities in commonsense scenarios. This approach is centered on understanding how these agents can effectively identify and utilize alternative objects when executing household tasks, thereby offering insights into the complexities of practical decision-making in real-world environments. Drawing inspiration from factors affecting human decision-making, we explore how large language models tackle this challenge through four meticulously crafted commonsense question-and-answer datasets featuring refined rules and human annotations. Our evaluation of state-of-the-art language models on these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Multi-Agent Systems and Negotiation
