AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning

Jingbo Sun; Wenyue Chong; Songjun Tu; Qichao Zhang; Yaocheng Zhang; Jiajun Chai; Xiaohan Wang; Wei Lin; Guojun Yin; Dongbin Zhao

arXiv:2604.17337·cs.AI·April 21, 2026

AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning

Jingbo Sun, Wenyue Chong, Songjun Tu, Qichao Zhang, Yaocheng Zhang, Jiajun Chai, Xiaohan Wang, Wei Lin, Guojun Yin, Dongbin Zhao

PDF

TL;DR

AutoSearch uses reinforcement learning to adaptively determine the optimal search depth in agentic RAG systems, balancing accuracy and efficiency for complex question answering.

Contribution

It introduces AutoSearch, a RL framework that dynamically evaluates search steps to minimize unnecessary search while maintaining high answer quality.

Findings

01

AutoSearch achieves better accuracy-efficiency trade-offs than fixed-depth methods.

02

AutoSearch reduces over-searching and computational costs in complex question answering.

03

Experiments show AutoSearch improves answer quality on multiple benchmarks.

Abstract

Agentic retrieval-augmented generation (RAG) systems enable large language models (LLMs) to solve complex tasks through multi-step interaction with external retrieval tools. However, such multi-step interaction often involves redundant search steps, incurring substantial computational cost and latency. Prior work limits search depth (i.e., the number of search steps) to reduce cost, but this often leads to underexploration of complex questions. To address this, we first investigate how search depth affects accuracy and find a minimal sufficient search depth that defines an accuracy-efficiency trade-off, jointly determined by question complexity and the agent's capability. Furthermore, we propose AutoSearch, a reinforcement learning (RL) framework that evaluates each search step via self-generated intermediate answers. By a self-answering mechanism, AutoSearch identifies the minimal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.