R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation
Fuda Ye, Shuangyin Li, Yongqi Zhang, Lei Chen

TL;DR
R^2AG enhances retrieval augmented generation by integrating retrieval information into LLMs, bridging the semantic gap and improving performance especially in low-source scenarios, validated through extensive experiments.
Contribution
The paper introduces R^2AG, a novel framework that incorporates retrieval features into LLMs using a specialized transformer and prompting strategy, addressing semantic misalignment in RAG.
Findings
Improves generation quality across five datasets
Enhances robustness and efficiency of RAG systems
Fills semantic gap between retrievers and LLMs
Abstract
Retrieval augmented generation (RAG) has been applied in many scenarios to augment large language models (LLMs) with external documents provided by retrievers. However, a semantic gap exists between LLMs and retrievers due to differences in their training objectives and architectures. This misalignment forces LLMs to passively accept the documents provided by the retrievers, leading to incomprehension in the generation process, where the LLMs are burdened with the task of distinguishing these documents using their inherent knowledge. This paper proposes RAG, a novel enhanced RAG framework to fill this gap by incorporating Retrieval information into Retrieval Augmented Generation. Specifically, RAG utilizes the nuanced features from the retrievers and employs a R-Former to capture retrieval information. Then, a retrieval-aware prompting strategy is designed to integrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · WordPiece · Residual Connection · Weight Decay · Softmax · Layer Normalization · Byte Pair Encoding · Attention Dropout · Linear Warmup With Linear Decay
