Can QPP Choose the Right Query Variant? Evaluating Query Variant Selection for RAG Pipelines

Negar Arabzadeh; Andrew Drozdov; Michael Bendersky; Matei Zaharia

arXiv:2604.22661·cs.IR·April 27, 2026

TL;DR

This paper examines whether Query Performance Prediction (QPP) can pick the best query variant in a RAG pipeline before the retrieval and generation costs of running every reformulation are incurred.

Contribution

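It frames QPP for RAG as intra-topic discrimination, that is, choosing the best reformulation among competing variants of the same information need rather than estimating query difficulty across topics, and it evaluates pre- and post-retrieval predictors for variant selection in both ad-hoc retrieval and end-to-end RAG.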
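To make the intra-topic setting concrete, the following is a minimal sketch of how a predictor could be scored on variant selection. It is an illustration under stated assumptions, not the paper's evaluation protocol: the function name `evaluate_selection`, the dictionary layout, and the three reported quantities are all hypothetical choices made here.

```python
def evaluate_selection(predicted, actual):
    """Score a predictor on intra-topic variant selection.

    predicted: dict topic -> dict variant -> predicted score (hypothetical layout)
    actual:    dict topic -> dict variant -> measured effectiveness
    """
    hits, selected, oracle = [], [], []
    for topic, scores in predicted.items():
        # The variant the predictor would execute for this topic.
        chosen = max(scores, key=scores.get)
        # The best achievable effectiveness among this topic's variants.
        best_value = max(actual[topic].values())
        chosen_value = actual[topic][chosen]
        hits.append(1 if chosen_value == best_value else 0)
        selected.append(chosen_value)
        oracle.append(best_value)
    n = len(hits)
    return {
        "top1_accuracy": sum(hits) / n,   # how often the pick is a best variant
        "selected_mean": sum(selected) / n,  # effectiveness of the picked variants
        "oracle_mean": sum(oracle) / n,      # upper bound with perfect selection
    }


# Toy example with made-up numbers, two topics with two variants each.
predicted = {"t1": {"v1": 0.2, "v2": 0.9}, "t2": {"v1": 0.7, "v2": 0.1}}
actual = {"t1": {"v1": 0.3, "v2": 0.6}, "t2": {"v1": 0.4, "v2": 0.8}}
report = evaluate_selection(predicted, actual)
```

The comparison is made within each topic only, so a predictor that ranks topics well by difficulty but cannot separate variants of the same topic would score poorly here.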

Findings

01. QPP can reliably identify better query variants for RAG.

02. Lightweight pre-retrieval predictors often match or outperform post-retrieval methods.

03. Variants optimizing ranking metrics may not yield the best generated answers.

Abstract

Large Language Models (LLMs) have made query reformulation ubiquitous in modern retrieval and Retrieval-Augmented Generation (RAG) pipelines, enabling the generation of multiple semantically equivalent query variants. However, executing the full pipeline for every reformulation is computationally expensive, motivating selective execution: can we identify the best query variant before incurring downstream retrieval and generation costs? We investigate Query Performance Prediction (QPP) as a mechanism for variant selection across ad-hoc retrieval and end-to-end RAG. Unlike traditional QPP, which estimates query difficulty across topics, we study intra-topic discrimination: selecting the optimal reformulation among competing variants of the same information need. Through large-scale experiments on TREC-RAG using both sparse and dense retrievers, we evaluate pre- and post-retrieval…
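The selective-execution idea described above can be sketched with a simple pre-retrieval signal. The example below uses averaged inverse document frequency (IDF), a classic pre-retrieval QPP feature, to choose one variant without running retrieval. This is an assumption for illustration: the source does not say which predictors the paper evaluates, and the helper names (`build_doc_freq`, `avg_idf`, `select_variant`), the add-one smoothing, and the toy corpus are all hypothetical.

```python
import math
from collections import Counter


def build_doc_freq(corpus):
    """Count, for each term, how many documents contain it."""
    doc_freq = Counter()
    for doc in corpus:
        doc_freq.update(set(doc.lower().split()))
    return doc_freq


def avg_idf(query, doc_freq, num_docs):
    """Average smoothed IDF of the query terms (higher means more specific)."""
    terms = query.lower().split()
    if not terms:
        return 0.0
    # Add-one smoothing keeps unseen terms finite; this form is an assumption.
    idfs = [math.log((num_docs + 1) / (doc_freq.get(t, 0) + 1)) for t in terms]
    return sum(idfs) / len(terms)


def select_variant(variants, doc_freq, num_docs):
    """Pick the variant with the highest predicted score, before any retrieval."""
    return max(variants, key=lambda q: avg_idf(q, doc_freq, num_docs))


# Toy collection and two reformulations of one information need (made up).
corpus = ["the cat sat", "the dog ran", "the rare quasar"]
doc_freq = build_doc_freq(corpus)
variants = ["the the cat", "rare quasar"]
chosen = select_variant(variants, doc_freq, len(corpus))
```

Only `chosen` would then be sent through retrieval and generation. A post-retrieval predictor would instead need at least the retrieval step for every variant, which is the cost trade-off the abstract raises.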
