RobotxR1: Enabling Embodied Robotic Intelligence on Large Language Models through Closed-Loop Reinforcement Learning
Liam Boyle, Nicolas Baumann, Paviththiren Sivasothilingam, Michele Magno, Luca Benini

TL;DR
This paper introduces a closed-loop reinforcement learning framework that enables small language models to develop embodied robotic intelligence, achieving performance comparable or superior to larger models through environmental interaction.
Contribution
It extends the R1-zero approach to robotics by integrating closed-loop RL, allowing small LLMs to learn effective reasoning and control capabilities without relying solely on supervised fine-tuning.
Findings
Small LLMs achieve effective reasoning via environment interaction.
20.2%-point performance gain over SFT baseline in autonomous driving.
Qwen2.5-3B surpasses GPT-4o in control adaptability.
Abstract
Future robotic systems operating in real-world environments will require on-board embodied intelligence without continuous cloud connection, balancing capabilities with constraints on computational power and memory. This work presents an extension of the R1-zero approach, which enables the usage of low parameter-count Large Language Models (LLMs) in the robotic domain. The R1-Zero approach was originally developed to enable mathematical reasoning in LLMs using static datasets. We extend it to the robotics domain through integration in a closed-loop Reinforcement Learning (RL) framework. This extension enhances reasoning in Embodied Artificial Intelligence (Embodied AI) settings without relying solely on distillation of large models through Supervised Fine-Tuning (SFT). We show that small-scale LLMs can achieve effective reasoning performance by learning through closed-loop interaction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems · Reinforcement Learning in Robotics · Topic Modeling
