QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted   Question Matching for Enhanced QA Performance

Binita Saha; Utsha Saha; Muhammad Zubair Malik

arXiv:2501.02702·cs.CL·January 9, 2025

QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA Performance

Binita Saha, Utsha Saha, Muhammad Zubair Malik

PDF

TL;DR

This paper introduces QuIM-RAG, a novel retrieval mechanism for RAG systems that improves QA performance by matching questions to document chunks through inverted question matching, demonstrated on a large, domain-specific corpus.

Contribution

The paper proposes QuIM-RAG, a new retrieval strategy for RAG systems that enhances question relevance and answer accuracy in domain-specific QA tasks.

Findings

01

Outperforms traditional RAG models on BERT-Score and RAGAS metrics.

02

Effective retrieval from large domain-specific corpora.

03

Improved accuracy in complex question answering.

Abstract

This work presents a novel architecture for building Retrieval-Augmented Generation (RAG) systems to improve Question Answering (QA) tasks from a target corpus. Large Language Models (LLMs) have revolutionized the analyzing and generation of human-like text. These models rely on pre-trained data and lack real-time updates unless integrated with live data tools. RAG enhances LLMs by integrating online resources and databases to generate contextually appropriate responses. However, traditional RAG still encounters challenges like information dilution and hallucinations when handling vast amounts of data. Our approach addresses these challenges by converting corpora into a domain-specific dataset and RAG architecture is constructed to generate responses from the target document. We introduce QuIM-RAG (Question-to-question Inverted Index Matching), a novel approach for the retrieval…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Layer Normalization · Dense Connections · Attention Dropout · Softmax · Byte Pair Encoding · Linear Warmup With Linear Decay · WordPiece · Linear Layer