ERAGent: Enhancing Retrieval-Augmented Language Models with Improved Accuracy, Efficiency, and Personalization
Yunxiao Shi, Xing Zi, Zijing Shi, Haimin Zhang, Qiang Wu, Min Xu

TL;DR
ERAGent advances retrieval-augmented language models by improving retrieval quality, efficiency, and personalization through novel modules and learning mechanisms, demonstrating superior performance across multiple datasets and tasks.
Contribution
Introduction of ERAGent, a framework with modules for enhanced retrieval, efficiency, and personalization, addressing current limitations in retrieval-augmented language models.
Findings
ERAGent achieves higher accuracy on six datasets.
It improves retrieval efficiency and reduces unnecessary knowledge searches.
ERAGent effectively personalizes responses using learned user profiles.
Abstract
Retrieval-augmented generation (RAG) for language models significantly improves language understanding systems. The basic retrieval-then-read pipeline of response generation has evolved into a more extended process due to the integration of various components, sometimes even forming loop structures. Despite its advancements in improving response accuracy, challenges like poor retrieval quality for complex questions that require the search of multifaceted semantic information, inefficiencies in knowledge re-retrieval during long-term serving, and lack of personalized responses persist. Motivated by transcending these limitations, we introduce ERAGent, a cutting-edge framework that embodies an advancement in the RAG area. Our contribution is the introduction of the synergistically operated module: Enhanced Question Rewriter and Knowledge Filter, for better retrieval quality. Retrieval…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Multi-Head Attention · Dense Connections · Attention Dropout · Weight Decay · Dropout · Residual Connection · Byte Pair Encoding · Adam
