QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs
Minsang Kim, Cheoneum Park, Seungjun Baek

TL;DR
QPaug enhances open-domain QA by decomposing questions into sub-questions and augmenting retrieved passages with self-generated content, leading to improved retrieval and answer accuracy in LLM-based systems.
Contribution
Introduces QPaug, a novel method that decomposes questions and augments passages using LLMs, significantly improving retrieval and answer quality in open-domain QA.
Findings
QPaug outperforms previous state-of-the-art methods.
Significant performance gains over existing RAG approaches.
Effective handling of ambiguous or complex questions.
Abstract
Retrieval-augmented generation (RAG) has received much attention for Open-domain question-answering (ODQA) tasks as a means to compensate for the parametric knowledge of large language models (LLMs). While previous approaches focused on processing retrieved passages to remove irrelevant context, they still rely heavily on the quality of retrieved passages which can degrade if the question is ambiguous or complex. In this paper, we propose a simple yet efficient method called question and passage augmentation (QPaug) via LLMs for open-domain QA. QPaug first decomposes the original questions into multiple-step sub-questions. By augmenting the original question with detailed sub-questions and planning, we are able to make the query more specific on what needs to be retrieved, improving the retrieval performance. In addition, to compensate for the case where the retrieved passages contain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · WordPiece · Residual Connection · Weight Decay · Softmax · Layer Normalization · Byte Pair Encoding · Attention Dropout · Linear Warmup With Linear Decay
