Detecting Hallucinations in Retrieval-Augmented Generation via Semantic-level Internal Reasoning Graph

Jianpeng Hu; Yanzeng Li; Jialun Zhong; Wenfa Qi; Lei Zou

arXiv:2601.03052·cs.CL·January 7, 2026

Detecting Hallucinations in Retrieval-Augmented Generation via Semantic-level Internal Reasoning Graph

Jianpeng Hu, Yanzeng Li, Jialun Zhong, Wenfa Qi, Lei Zou

PDF

Open Access

TL;DR

This paper introduces a semantic-level internal reasoning graph approach to detect faithfulness hallucinations in retrieval-augmented generation systems, improving detection accuracy by capturing the model's internal reasoning processes.

Contribution

It extends relevance propagation to the semantic level and develops a framework leveraging LLM dependencies for more effective hallucination detection.

Findings

01

Outperforms state-of-the-art baselines on RAGTruth and Dolly-15k datasets.

02

Provides a more faithful semantic dependency representation.

03

Demonstrates improved detection of faithfulness hallucinations.

Abstract

The Retrieval-augmented generation (RAG) system based on Large language model (LLM) has made significant progress. It can effectively reduce factuality hallucinations, but faithfulness hallucinations still exist. Previous methods for detecting faithfulness hallucinations either neglect to capture the models' internal reasoning processes or handle those features coarsely, making it difficult for discriminators to learn. This paper proposes a semantic-level internal reasoning graph-based method for detecting faithfulness hallucination. Specifically, we first extend the layer-wise relevance propagation algorithm from the token level to the semantic level, constructing an internal reasoning graph based on attribution vectors. This provides a more faithful semantic-level representation of dependency. Furthermore, we design a general framework based on a small pre-trained language model to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning