RobotxR1: Enabling Embodied Robotic Intelligence on Large Language Models through Closed-Loop Reinforcement Learning

Liam Boyle; Nicolas Baumann; Paviththiren Sivasothilingam; Michele Magno; Luca Benini

arXiv:2505.03238·cs.RO·September 3, 2025

RobotxR1: Enabling Embodied Robotic Intelligence on Large Language Models through Closed-Loop Reinforcement Learning

Liam Boyle, Nicolas Baumann, Paviththiren Sivasothilingam, Michele Magno, Luca Benini

PDF

Open Access

TL;DR

This paper introduces a closed-loop reinforcement learning framework that enables small language models to develop embodied robotic intelligence, achieving performance comparable or superior to larger models through environmental interaction.

Contribution

It extends the R1-zero approach to robotics by integrating closed-loop RL, allowing small LLMs to learn effective reasoning and control capabilities without relying solely on supervised fine-tuning.

Findings

01

Small LLMs achieve effective reasoning via environment interaction.

02

20.2%-point performance gain over SFT baseline in autonomous driving.

03

Qwen2.5-3B surpasses GPT-4o in control adaptability.

Abstract

Future robotic systems operating in real-world environments will require on-board embodied intelligence without continuous cloud connection, balancing capabilities with constraints on computational power and memory. This work presents an extension of the R1-zero approach, which enables the usage of low parameter-count Large Language Models (LLMs) in the robotic domain. The R1-Zero approach was originally developed to enable mathematical reasoning in LLMs using static datasets. We extend it to the robotics domain through integration in a closed-loop Reinforcement Learning (RL) framework. This extension enhances reasoning in Embodied Artificial Intelligence (Embodied AI) settings without relying solely on distillation of large models through Supervised Fine-Tuning (SFT). We show that small-scale LLMs can achieve effective reasoning performance by learning through closed-loop interaction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Automated Systems · Reinforcement Learning in Robotics · Topic Modeling