Question Decomposition for Retrieval-Augmented Generation

Paul J. L. Ammann; Jonas Golde; Alan Akbik

arXiv:2507.00355·cs.CL·July 2, 2025

Question Decomposition for Retrieval-Augmented Generation

Paul J. L. Ammann, Jonas Golde, Alan Akbik

PDF

Open Access

TL;DR

This paper introduces a question decomposition and reranking approach for retrieval-augmented generation to improve multi-hop question answering by assembling and selecting the most relevant evidence from multiple documents.

Contribution

It proposes a practical RAG pipeline that combines LLM-based question decomposition with reranking, enhancing multi-hop retrieval without additional training or specialized indexing.

Findings

01

Significant improvements in retrieval accuracy (MRR@10 +36.7%)

02

Enhanced answer accuracy (F1 +11.6%)

03

Effective assembly of complementary documents for multi-hop questions

Abstract

Grounding large language models (LLMs) in verifiable external sources is a well-established strategy for generating reliable answers. Retrieval-augmented generation (RAG) is one such approach, particularly effective for tasks like question answering: it retrieves passages that are semantically related to the question and then conditions the model on this evidence. However, multi-hop questions, such as "Which company among NVIDIA, Apple, and Google made the biggest profit in 2023?," challenge RAG because relevant facts are often distributed across multiple documents rather than co-occurring in one source, making it difficult for standard RAG to retrieve sufficient information. To address this, we propose a RAG pipeline that incorporates question decomposition: (i) an LLM decomposes the original query into sub-questions, (ii) passages are retrieved for each sub-question, and (iii) the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Expert finding and Q&A systems