Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration

Bowei He; Minda Hu; Zenan Xu; Hongru Wang; Licheng Zong; Yankai Chen; Chen Ma; Xue Liu; Pluto Zhou; Irwin King

arXiv:2602.03647·cs.AI·February 4, 2026

Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration

Bowei He, Minda Hu, Zenan Xu, Hongru Wang, Licheng Zong, Yankai Chen, Chen Ma, Xue Liu, Pluto Zhou, Irwin King

PDF

Open Access

TL;DR

Search-R2 introduces an Actor-Refiner framework for search-integrated reasoning, improving training efficiency and reasoning accuracy by targeted intervention and hybrid rewards, outperforming existing baselines.

Contribution

It presents a novel Actor-Refiner collaboration method with a dense reward mechanism, formal analysis, and extensive empirical validation for enhanced reasoning in language agents.

Findings

01

Outperforms strong RAG and RL baselines across datasets

02

Achieves higher reasoning accuracy with minimal overhead

03

Demonstrates theoretical performance gains through formal analysis

Abstract

Search-integrated reasoning enables language agents to transcend static parametric knowledge by actively querying external sources. However, training these agents via reinforcement learning is hindered by the multi-scale credit assignment problem: existing methods typically rely on sparse, trajectory-level rewards that fail to distinguish between high-quality reasoning and fortuitous guesses, leading to redundant or misleading search behaviors. To address this, we propose Search-R2, a novel Actor-Refiner collaboration framework that enhances reasoning through targeted intervention, with both components jointly optimized during training. Our approach decomposes the generation process into an Actor, which produces initial reasoning trajectories, and a Meta-Refiner, which selectively diagnoses and repairs flawed steps via a 'cut-and-regenerate' mechanism. To provide fine-grained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Explainable Artificial Intelligence (XAI)