R4: Reinforced Retriever-Reorder-Responder for Retrieval-Augmented Large   Language Models

Taolin Zhang; Dongyang Li; Qizhou Chen; Chengyu Wang; Longtao Huang,; Hui Xue; Xiaofeng He; Jun Huang

arXiv:2405.02659·cs.CL·June 26, 2024

R4: Reinforced Retriever-Reorder-Responder for Retrieval-Augmented Large Language Models

Taolin Zhang, Dongyang Li, Qizhou Chen, Chengyu Wang, Longtao Huang,, Hui Xue, Xiaofeng He, Jun Huang

PDF

Open Access

TL;DR

This paper introduces R$^4$, a pipeline that optimizes the order and representation of retrieved documents to improve retrieval-augmented LLM responses without fine-tuning the models.

Contribution

It proposes a novel document reordering and enhancement method that significantly boosts factual QA performance in retrieval-augmented LLMs.

Findings

01

Improved factual question-answering accuracy on knowledge-intensive tasks.

02

Effective document reordering using graph attention learning.

03

Enhanced document representations via gradient adversarial learning.

Abstract

Retrieval-augmented large language models (LLMs) leverage relevant content retrieved by information retrieval systems to generate correct responses, aiming to alleviate the hallucination problem. However, existing retriever-responder methods typically append relevant documents to the prompt of LLMs to perform text generation tasks without considering the interaction of fine-grained structural semantics between the retrieved documents and the LLMs. This issue is particularly important for accurate response generation as LLMs tend to "lose in the middle" when dealing with input prompts augmented with lengthy documents. In this work, we propose a new pipeline named "Reinforced Retriever-Reorder-Responder" (R $^{4}$ ) to learn document orderings for retrieval-augmented LLMs, thereby further enhancing their generation abilities while the large numbers of parameters of LLMs remain frozen. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques