An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents

Bowen Jin; Jinsung Yoon; Priyanka Kargupta; Sercan O. Arik; Jiawei Han

arXiv:2505.15117·cs.CL·May 22, 2025

TL;DR

This paper empirically investigates how reward design, LLM characteristics, and search engine choices affect the training and performance of RL-based reasoning-search agents, providing practical guidelines for real-world deployment.

Contribution

The paper systematically analyzes the key factors that influence RL training of LLM search agents, offering new insights into reward formulation, LLM scale, and search engine impact.

Findings

01. Format rewards improve final performance.

02. Intermediate retrieval rewards have limited impact.

03. Search engine choice affects training dynamics and robustness.
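
To make the first finding concrete, here is a minimal sketch of how a format reward might be added to an outcome reward. It is not the paper's implementation: the `<think>` and `<answer>` tag names, the exact-match scoring, the additive combination, and the `format_weight` value of 0.2 are all assumptions chosen for illustration.

```python
import re

# Hypothetical tag vocabulary; the source does not specify the rollout format.
THINK = re.compile(r"<think>.*?</think>", re.DOTALL)
ANSWER = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)


def format_ok(rollout: str) -> bool:
    """True if the rollout has a reasoning block and exactly one answer block."""
    return bool(THINK.search(rollout)) and len(ANSWER.findall(rollout)) == 1


def exact_match(rollout: str, gold: str) -> bool:
    """True if the first answer block matches the gold answer (case-insensitive)."""
    match = ANSWER.search(rollout)
    if match is None:
        return False
    return match.group(1).strip().lower() == gold.strip().lower()


def reward(rollout: str, gold: str, format_weight: float = 0.2) -> float:
    """Outcome reward plus an assumed additive bonus for a well-formed rollout."""
    outcome = 1.0 if exact_match(rollout, gold) else 0.0
    bonus = format_weight if format_ok(rollout) else 0.0
    return outcome + bonus
```

Under these assumptions, a well-formed rollout with a wrong answer still earns the small bonus, while a rollout with no answer block earns nothing. An intermediate retrieval reward, the subject of the second finding, would be a further term scored on the retrieved passages and is not sketched here.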

Abstract

Reinforcement learning (RL) has demonstrated strong potential in training large language models (LLMs) capable of complex reasoning for real-world problem solving. More recently, RL has been leveraged to create sophisticated LLM-based search agents that adeptly combine reasoning with search engine use. While the use of RL for training search agents is promising, the optimal design of such agents remains not fully understood. In particular, key factors -- such as (1) reward formulation, (2) the choice and characteristics of the underlying LLM, and (3) the role of the search engine in the RL process -- require further investigation. In this work, we conduct comprehensive empirical studies to systematically investigate these and offer actionable insights. We highlight several key findings: format rewards are effective in improving final performance, whereas intermediate retrieval rewards…

Code & Models

Repositories

petergriffinjin/search-r1
PyTorch · Official

Taxonomy

Topics: Multi-Agent Systems and Negotiation