From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition?

Shinas Shaji; Fabian Huppertz; Alex Mitrevski; Sebastian Houben

arXiv:2603.03148·cs.RO·March 4, 2026

From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition?

Shinas Shaji, Fabian Huppertz, Alex Mitrevski, Sebastian Houben

PDF

Open Access

TL;DR

This paper explores using large language models as core components for planning and reasoning in embodied robotic agents, demonstrating potential for task execution but also revealing significant limitations in reliability and instruction following.

Contribution

It proposes a novel cognitive architecture integrating LLMs with memory modules for robot control and evaluates its performance in household tasks within a simulated environment.

Findings

01

LLM-driven agents can complete structured household tasks.

02

Emergent adaptation and memory-guided planning are observed.

03

Limitations include hallucinations and poor instruction following.

Abstract

In order to flexibly act in an everyday environment, a robotic agent needs a variety of cognitive capabilities that enable it to reason about plans and perform execution recovery. Large language models (LLMs) have been shown to demonstrate emergent cognitive aspects, such as reasoning and language understanding; however, the ability to control embodied robotic agents requires reliably bridging high-level language to low-level functionalities for perception and control. In this paper, we investigate the extent to which an LLM can serve as a core component for planning and execution reasoning in a cognitive robot architecture. For this purpose, we propose a cognitive architecture in which an agentic LLM serves as the core component for planning and reasoning, while components for working and episodic memories support learning from experience and adaptation. An instance of the architecture…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Social Robot Interaction and HRI · AI-based Problem Solving and Planning