Towards Trustworthy Retrieval Augmented Generation for Large Language   Models: A Survey

Bo Ni; Zheyuan Liu; Leyao Wang; Yongjia Lei; Yuying Zhao; Xueqi Cheng,; Qingkai Zeng; Luna Dong; Yinglong Xia; Krishnaram Kenthapadi; Ryan Rossi,; Franck Dernoncourt; Md Mehrab Tanjim; Nesreen Ahmed; Xiaorui Liu; Wenqi Fan,; Erik Blasch; Yu Wang; Meng Jiang; Tyler Derr

arXiv:2502.06872·cs.CL·February 12, 2025·3 cites

Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey

Bo Ni, Zheyuan Liu, Leyao Wang, Yongjia Lei, Yuying Zhao, Xueqi Cheng,, Qingkai Zeng, Luna Dong, Yinglong Xia, Krishnaram Kenthapadi, Ryan Rossi,, Franck Dernoncourt, Md Mehrab Tanjim, Nesreen Ahmed, Xiaorui Liu, Wenqi Fan,, Erik Blasch, Yu Wang, Meng Jiang, Tyler Derr

PDF

Open Access 1 Repo

TL;DR

This survey reviews the challenges and solutions for making Retrieval-Augmented Generation systems trustworthy, focusing on reliability, privacy, safety, fairness, explainability, and accountability, and provides a comprehensive framework for future research.

Contribution

It offers a unified framework and taxonomy for developing trustworthy RAG systems, addressing key challenges and guiding future research directions.

Findings

01

Identifies critical trustworthiness challenges in RAG systems

02

Proposes a structured framework and taxonomy for trustworthiness

03

Highlights future research directions and applications

Abstract

Retrieval-Augmented Generation (RAG) is an advanced technique designed to address the challenges of Artificial Intelligence-Generated Content (AIGC). By integrating context retrieval into content generation, RAG provides reliable and up-to-date external knowledge, reduces hallucinations, and ensures relevant context across a wide range of tasks. However, despite RAG's success and potential, recent studies have shown that the RAG paradigm also introduces new risks, including robustness issues, privacy concerns, adversarial attacks, and accountability issues. Addressing these risks is critical for future applications of RAG systems, as they directly impact their trustworthiness. Although various methods have been developed to improve the trustworthiness of RAG methods, there is a lack of a unified perspective and framework for research in this topic. Thus, in this paper, we aim to address…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Arstanley/Awesome-Trustworthy-Retrieval-Augmented-Generation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Weight Decay · WordPiece · Attention Dropout · Byte Pair Encoding · Layer Normalization · Residual Connection · Dense Connections