Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models
Wentao Shi, Yiqing Shen

TL;DR
This paper introduces R-Search, a unified single-LLM framework for multi-step, multi-source search and reasoning, utilizing reinforcement fine-tuning to improve factual accuracy and efficiency in time-sensitive queries.
Contribution
It proposes a novel structured output format and a reinforcement fine-tuning method, ReFT, enabling an LLM to perform integrated multi-source search and reasoning within one inference process.
Findings
Outperforms state-of-the-art methods on multiple benchmarks.
Achieves 70% reduction in context token usage.
Reduces execution latency by approximately 50%.
Abstract
Large language models (LLMs) can face factual limitations when responding to time-sensitive queries about recent events that arise after their knowledge thresholds in the training corpus. Existing search-augmented approaches fall into two categories, each with distinct limitations: multi-agent search frameworks incur substantial computational overhead by separating search planning and response synthesis across multiple LLMs, while single-LLM tool-calling methods restrict themselves to sequential planned, single-query searches from sole search sources. We present Reasoning-Search (R-Search), a single-LLM search framework that unifies multi-step planning, multi-source search execution, and answer synthesis within one coherent inference process. Innovatively, it structure the output into four explicitly defined components, including reasoning steps that guide the search process (<think>),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Computational and Text Analysis Methods
