ALRM: Agentic LLM for Robotic Manipulation
Vitor Gaboardi dos Santos, Ibrahim Khadraoui, Ibrahim Farhat, Hamza Yous, Samy Teffahi, Hakim Hacid

TL;DR
This paper introduces ALRM, a modular agentic framework that enhances robotic manipulation by integrating large language models with reasoning loops, supporting diverse instructions and systematic evaluation through a new benchmark.
Contribution
ALRM is the first framework combining agentic reasoning with robotic control, enabling multistep planning, reflection, and revision using LLMs in a modular, interpretable manner.
Findings
Claude-4.1-Opus performs best among closed-source models.
Falcon-H1-7B excels among open-source models.
ALRM achieves reliable robotic execution across diverse tasks.
Abstract
Large Language Models (LLMs) have recently empowered agentic frameworks to exhibit advanced reasoning and planning capabilities. However, their integration in robotic control pipelines remains limited in two aspects: (1) prior \ac{llm}-based approaches often lack modular, agentic execution mechanisms, limiting their ability to plan, reflect on outcomes, and revise actions in a closed-loop manner; and (2) existing benchmarks for manipulation tasks focus on low-level control and do not systematically evaluate multistep reasoning and linguistic variation. In this paper, we propose Agentic LLM for Robot Manipulation (ALRM), an LLM-driven agentic framework for robotic manipulation. ALRM integrates policy generation with agentic execution through a ReAct-style reasoning loop, supporting two complementary modes: Code-asPolicy (CaP) for direct executable control code generation, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Robot Manipulation and Learning
