Query Decomposition for RAG: Balancing Exploration-Exploitation
Roxana Petcu, Kenton Murray, Daniel Khashabi, Evangelos Kanoulas, Maarten de Rijke, Dawn Lawrie, Kevin Duh

TL;DR
This paper introduces a method for query decomposition in retrieval-augmented generation that balances exploration and exploitation, improving document relevance estimation and long-form generation quality.
Contribution
It formulates query decomposition as a bandit problem, applying bandit learning methods to optimize sub-query selection in RAG systems.
Findings
35% gain in document-level precision
15% increase in α-nDCG
Improved long-form generation performance
Abstract
Retrieval-augmented generation (RAG) systems address complex user requests by decomposing them into subqueries, retrieving potentially relevant documents for each, and then aggregating them to generate an answer. Efficiently selecting informative documents requires balancing a key trade-off: (i) retrieving broadly enough to capture all the relevant material, and (ii) limiting retrieval to avoid excessive noise and computational cost. We formulate query decomposition and document retrieval in an exploitation-exploration setting, where retrieving one document at a time builds a belief about the utility of a given sub-query and informs the decision to continue exploiting or exploring an alternative. We experiment with a variety of bandit learning methods and demonstrate their effectiveness in dynamically selecting the most informative sub-queries. Our main finding is that estimating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Topic Modeling · Information Retrieval and Search Behavior
