Test-Time Strategies for More Efficient and Accurate Agentic RAG

Brian Zhang; Deepti Guntur; Zhiyang Zuo; Abhinav Sharma; Shreyas Chaudhari; Wenlong Zhao; Franck Dernoncourt; Puneet Mathur; Ryan Rossi; Nedim Lipka

arXiv:2603.12396·cs.IR·March 16, 2026

Test-Time Strategies for More Efficient and Accurate Agentic RAG

Brian Zhang, Deepti Guntur, Zhiyang Zuo, Abhinav Sharma, Shreyas Chaudhari, Wenlong Zhao, Franck Dernoncourt, Puneet Mathur, Ryan Rossi, Nedim Lipka

PDF

Open Access

TL;DR

This paper proposes test-time modifications to Search-R1 in Retrieval-Augmented Generation systems, introducing contextualization and de-duplication modules to enhance accuracy and efficiency in complex question answering tasks.

Contribution

It introduces novel test-time strategies, including a contextualization module and a de-duplication approach, to improve reasoning and reduce retrieval redundancy in agentic RAG systems.

Findings

01

Achieved a 5.6% increase in EM score on HotpotQA and Natural Questions datasets.

02

Reduced the number of retrieval turns by 10.5%, improving efficiency.

03

Enhanced answer correctness through the integration of GPT-4.1-mini for contextualization.

Abstract

Retrieval-Augmented Generation (RAG) systems face challenges with complex, multihop questions, and agentic frameworks such as Search-R1 (Jin et al., 2025), which operates iteratively, have been proposed to address these complexities. However, such approaches can introduce inefficiencies, including repetitive retrieval of previously processed information and challenges in contextualizing retrieved results effectively within the current generation prompt. Such issues can lead to unnecessary retrieval turns, suboptimal reasoning, inaccurate answers, and increased token consumption. In this paper, we investigate test-time modifications to the Search-R1 pipeline to mitigate these identified shortcomings. Specifically, we explore the integration of two components and their combination: a contextualization module to better integrate relevant information from retrieved documents into…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Expert finding and Q&A systems · Information Retrieval and Search Behavior