Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models

Wentao Shi; Yiqing Shen

arXiv:2506.08352·cs.IR·June 11, 2025

Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models

Wentao Shi, Yiqing Shen

PDF

Open Access 1 Repo

TL;DR

This paper introduces R-Search, a unified single-LLM framework for multi-step, multi-source search and reasoning, utilizing reinforcement fine-tuning to improve factual accuracy and efficiency in time-sensitive queries.

Contribution

It proposes a novel structured output format and a reinforcement fine-tuning method, ReFT, enabling an LLM to perform integrated multi-source search and reasoning within one inference process.

Findings

01

Outperforms state-of-the-art methods on multiple benchmarks.

02

Achieves 70% reduction in context token usage.

03

Reduces execution latency by approximately 50%.

Abstract

Large language models (LLMs) can face factual limitations when responding to time-sensitive queries about recent events that arise after their knowledge thresholds in the training corpus. Existing search-augmented approaches fall into two categories, each with distinct limitations: multi-agent search frameworks incur substantial computational overhead by separating search planning and response synthesis across multiple LLMs, while single-LLM tool-calling methods restrict themselves to sequential planned, single-query searches from sole search sources. We present Reasoning-Search (R-Search), a single-LLM search framework that unifies multi-step planning, multi-source search execution, and answer synthesis within one coherent inference process. Innovatively, it structure the output into four explicitly defined components, including reasoning steps that guide the search process (<think>),…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wentao0429/reasoning-search
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Information Retrieval and Search Behavior · Computational and Text Analysis Methods