R^2AG: Incorporating Retrieval Information into Retrieval Augmented   Generation

Fuda Ye; Shuangyin Li; Yongqi Zhang; Lei Chen

arXiv:2406.13249·cs.CL·October 31, 2024·2 cites

R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation

Fuda Ye, Shuangyin Li, Yongqi Zhang, Lei Chen

PDF

Open Access 1 Repo

TL;DR

R^2AG enhances retrieval augmented generation by integrating retrieval information into LLMs, bridging the semantic gap and improving performance especially in low-source scenarios, validated through extensive experiments.

Contribution

The paper introduces R^2AG, a novel framework that incorporates retrieval features into LLMs using a specialized transformer and prompting strategy, addressing semantic misalignment in RAG.

Findings

01

Improves generation quality across five datasets

02

Enhances robustness and efficiency of RAG systems

03

Fills semantic gap between retrievers and LLMs

Abstract

Retrieval augmented generation (RAG) has been applied in many scenarios to augment large language models (LLMs) with external documents provided by retrievers. However, a semantic gap exists between LLMs and retrievers due to differences in their training objectives and architectures. This misalignment forces LLMs to passively accept the documents provided by the retrievers, leading to incomprehension in the generation process, where the LLMs are burdened with the task of distinguishing these documents using their inherent knowledge. This paper proposes R $^{2}$ AG, a novel enhanced RAG framework to fill this gap by incorporating Retrieval information into Retrieval Augmented Generation. Specifically, R $^{2}$ AG utilizes the nuanced features from the retrievers and employs a R $^{2}$ -Former to capture retrieval information. Then, a retrieval-aware prompting strategy is designed to integrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yefd/RRAG
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · WordPiece · Residual Connection · Weight Decay · Softmax · Layer Normalization · Byte Pair Encoding · Attention Dropout · Linear Warmup With Linear Decay