Rethinking On-policy Optimization for Query Augmentation

Zhichao Xu; Shengyao Zhuang; Xueguang Ma; Bingsen Chen; Yijun Tian; Fengran Mo; Jie Cao; Vivek Srikumar

arXiv:2510.17139·cs.CL·March 3, 2026

Rethinking On-policy Optimization for Query Augmentation

Zhichao Xu, Shengyao Zhuang, Xueguang Ma, Bingsen Chen, Yijun Tian, Fengran Mo, Jie Cao, Vivek Srikumar

PDF

Open Access

TL;DR

This paper systematically compares prompting-based and RL-based query augmentation methods for IR, revealing that simple prompting often matches or exceeds RL performance, and introduces a hybrid approach, OPQE, that outperforms both.

Contribution

It provides the first consistent comparison of query augmentation techniques and proposes a novel hybrid method, OPQE, combining prompting and RL for improved retrieval performance.

Findings

01

Prompting-based augmentation often matches or surpasses RL-based methods.

02

OPQE outperforms standalone prompting and RL approaches.

03

Hybrid approach effectively merges flexibility of prompting with targeted optimization.

Abstract

Recent advances in large language models (LLMs) have led to a surge of interest in query augmentation for information retrieval (IR). Two main approaches have emerged. The first prompts LLMs to generate answers or pseudo-documents that serve as new queries, relying purely on the model's parametric knowledge or contextual information. The second applies reinforcement learning (RL) to fine-tune LLMs for query rewriting, directly optimizing retrieval metrics. While having respective advantages and limitations, the two approaches have not been compared under consistent experimental conditions. In this work, we present the first systematic comparison of prompting-based and RL-based query augmentation across diverse benchmarks, including evidence-seeking, ad hoc, and tool retrieval. Our key finding is that simple, training-free query augmentation often performs on par with, or even surpasses,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation Retrieval and Search Behavior · Topic Modeling · Advanced Graph Neural Networks