LRP4RAG: Detecting Hallucinations in Retrieval-Augmented Generation via Layer-wise Relevance Propagation
Haichuan Hu, Congqing He, Xiaochen Xie, Quanjun Zhang

TL;DR
LRP4RAG introduces a novel Layer-wise Relevance Propagation-based approach to detect hallucinations in Retrieval-Augmented Generation, significantly improving accuracy over existing methods.
Contribution
This paper pioneers the use of LRP for hallucination detection in RAG, combining relevance analysis with classifiers for better identification.
Findings
LRP4RAG outperforms existing baselines in hallucination detection
Relevance-based features improve detection accuracy
First application of LRP in RAG hallucination detection
Abstract
Retrieval-Augmented Generation (RAG) has become a primary technique for mitigating hallucinations in large language models (LLMs). However, incomplete knowledge extraction and insufficient understanding can still mislead LLMs to produce irrelevant or even contradictory responses, which means hallucinations persist in RAG. In this paper, we propose LRP4RAG, a method based on the Layer-wise Relevance Propagation (LRP) algorithm for detecting hallucinations in RAG. Specifically, we first utilize LRP to compute the relevance between the input and output of the RAG generator. We then apply further extraction and resampling to the relevance matrix. The processed relevance data are input into multiple classifiers to determine whether the output contains hallucinations. To the best of our knowledge, this is the first time that LRP has been used for detecting RAG hallucinations, and extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Functional Brain Connectivity Studies · Topological and Geometric Data Analysis
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Adam · Layer Normalization · Weight Decay · Attention Is All You Need · Dense Connections · WordPiece · Attention Dropout · Linear Warmup With Linear Decay · Byte Pair Encoding
