Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Emre Can Acikgoz; Cheng Qian; Jonas H\"ubotter; Heng Ji; Dilek Hakkani-T\"ur; Gokhan Tur

arXiv:2602.21320·cs.LG·February 26, 2026

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Emre Can Acikgoz, Cheng Qian, Jonas H\"ubotter, Heng Ji, Dilek Hakkani-T\"ur, Gokhan Tur

PDF

Open Access

TL;DR

Tool-R0 introduces a self-evolving framework for training general-purpose tool-calling LLM agents from scratch using self-play reinforcement learning, eliminating the need for pre-existing datasets and enabling autonomous evolution.

Contribution

It presents the first zero-data, self-evolving RL approach for training tool-use LLM agents through co-evolution of generator and solver components.

Findings

01

Achieves 92.5% relative improvement over base models.

02

Outperforms supervised baselines in tool-calling tasks.

03

Provides empirical insights into self-play dynamics and scaling behaviors.

Abstract

Large language models (LLMs) are becoming the foundation for autonomous agents that can use tools to solve complex tasks. Reinforcement learning (RL) has emerged as a common approach for injecting such agentic capabilities, but typically under tightly controlled training setups. It often depends on carefully constructed task-solution pairs and substantial human supervision, which creates a fundamental obstacle to open-ended self-evolution toward superintelligent systems. In this paper, we propose Tool-R0 framework for training general purpose tool-calling agents from scratch with self-play RL, under a zero-data assumption. Initialized from the same base LLM, Tool-R0 co-evolves a Generator and a Solver with complementary rewards: one proposes targeted challenging tasks at the other's competence frontier and the other learns to solve them with real-world tool calls. This creates a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Artificial Intelligence in Healthcare and Education