R4: Reinforced Retriever-Reorder-Responder for Retrieval-Augmented Large Language Models
Taolin Zhang, Dongyang Li, Qizhou Chen, Chengyu Wang, Longtao Huang,, Hui Xue, Xiaofeng He, Jun Huang

TL;DR
This paper introduces R$^4$, a pipeline that optimizes the order and representation of retrieved documents to improve retrieval-augmented LLM responses without fine-tuning the models.
Contribution
It proposes a novel document reordering and enhancement method that significantly boosts factual QA performance in retrieval-augmented LLMs.
Findings
Improved factual question-answering accuracy on knowledge-intensive tasks.
Effective document reordering using graph attention learning.
Enhanced document representations via gradient adversarial learning.
Abstract
Retrieval-augmented large language models (LLMs) leverage relevant content retrieved by information retrieval systems to generate correct responses, aiming to alleviate the hallucination problem. However, existing retriever-responder methods typically append relevant documents to the prompt of LLMs to perform text generation tasks without considering the interaction of fine-grained structural semantics between the retrieved documents and the LLMs. This issue is particularly important for accurate response generation as LLMs tend to "lose in the middle" when dealing with input prompts augmented with lengthy documents. In this work, we propose a new pipeline named "Reinforced Retriever-Reorder-Responder" (R) to learn document orderings for retrieval-augmented LLMs, thereby further enhancing their generation abilities while the large numbers of parameters of LLMs remain frozen. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
