QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs

Minsang Kim; Cheoneum Park; Seungjun Baek

arXiv:2406.14277·cs.CL·September 30, 2024·1 cites

TL;DR

QPaug enhances open-domain QA by decomposing questions into sub-questions and augmenting retrieved passages with self-generated content, leading to improved retrieval and answer accuracy in LLM-based systems.

Contribution

Introduces QPaug, a novel method that decomposes questions and augments passages using LLMs, significantly improving retrieval and answer quality in open-domain QA.

Findings

01 QPaug outperforms previous state-of-the-art methods.

02 Significant performance gains over existing RAG approaches.

03 Effective handling of ambiguous or complex questions.

Abstract

Retrieval-augmented generation (RAG) has received much attention for open-domain question answering (ODQA) as a means to compensate for the limits of the parametric knowledge of large language models (LLMs). While previous approaches focused on processing retrieved passages to remove irrelevant context, they still rely heavily on the quality of the retrieved passages, which can degrade if the question is ambiguous or complex. In this paper, we propose a simple yet efficient method called question and passage augmentation (QPaug) via LLMs for open-domain QA. QPaug first decomposes the original question into multi-step sub-questions. By augmenting the original question with detailed sub-questions and planning, we make the query more specific about what needs to be retrieved, improving retrieval performance. In addition, to compensate for the case where the retrieved passages contain…
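
The question-augmentation step described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `decompose` stands in for an LLM call, `fake_decompose` is a hard-coded stub, the retriever is a toy token-overlap scorer rather than a real dense or sparse retriever, and all function names are hypothetical.

```python
import re
from typing import Callable, List, Set


def augment_query(question: str, decompose: Callable[[str], List[str]]) -> str:
    """Append sub-questions (normally produced by an LLM) to the original question."""
    sub_questions = decompose(question)
    return " ".join([question, *sub_questions])


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and split it into a set of word tokens."""
    return set(re.findall(r"\w+", text.lower()))


def retrieve(query: str, passages: List[str], k: int = 1) -> List[str]:
    """Rank passages by token overlap with the query (toy stand-in for a retriever)."""
    query_tokens = tokenize(query)
    ranked = sorted(
        passages,
        key=lambda p: len(query_tokens & tokenize(p)),
        reverse=True,
    )
    return ranked[:k]


def fake_decompose(question: str) -> List[str]:
    # Hypothetical stub: a real system would prompt an LLM to plan sub-questions.
    return [
        "Who directed the film Inception?",
        "When was Christopher Nolan born?",
    ]


passages = [
    "Inception premiered in London in 2010.",
    "Christopher Nolan: birth date 30 July 1970.",
]
question = "When was the director of Inception born?"

augmented = augment_query(question, fake_decompose)
top_plain = retrieve(question, passages)       # query is the original question only
top_augmented = retrieve(augmented, passages)  # query includes the sub-questions
```

In this toy setup the original question shares a token only with the first passage, while the augmented query also matches "Christopher" and "Nolan", so the passage that contains the answer ranks first. The passage-augmentation half of the method is not sketched here because the abstract is truncated before describing it.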

Code & Models

Repositories

kmswin1/qpaug (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques

Methods: Attention Is All You Need · WordPiece · Residual Connection · Weight Decay · Softmax · Layer Normalization · Byte Pair Encoding · Attention Dropout · Linear Warmup With Linear Decay