Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning
Yaorui Shi, Sihang Li, Chang Wu, Zhiyuan Liu, Junfeng Fang, Hengxing Cai, An Zhang, Xiang Wang

TL;DR
AutoRefine is a reinforcement learning framework that improves retrieval-augmented reasoning by enabling iterative knowledge refinement during the reasoning process, leading to better performance on complex question-answering tasks.
Contribution
It introduces a novel search-and-refine-during-think paradigm with explicit knowledge filtering steps and tailored rewards, advancing retrieval-augmented reasoning methods.
Findings
Outperforms existing methods on QA benchmarks
Enhances reasoning accuracy in multi-hop scenarios
Issues higher-quality, frequent searches
Abstract
Large language models have demonstrated impressive reasoning capabilities but are inherently limited by their knowledge reservoir. Retrieval-augmented reasoning mitigates this limitation by allowing LLMs to query external resources, but existing methods often retrieve irrelevant or noisy information, hindering accurate reasoning. In this paper, we propose AutoRefine, a reinforcement learning post-training framework that adopts a new "search-and-refine-during-think" paradigm. AutoRefine introduces explicit knowledge refinement steps between successive search calls, enabling the model to iteratively filter, distill, and organize evidence before generating an answer. Furthermore, we incorporate tailored retrieval-specific rewards alongside answer correctness rewards using group relative policy optimization. Experiments on single-hop and multi-hop QA benchmarks demonstrate that AutoRefine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Intelligent Tutoring Systems and Adaptive Learning · Semantic Web and Ontologies
