Detecting Hallucinations in Graph Retrieval-Augmented Generation via Attention Patterns and Semantic Alignment

Shanghao Li; Jinda Han; Yibo Wang; Yuanjie Zhu; Zihe Song; Langzhou He; Kenan Kamel A Alghythee; Philip S. Yu

arXiv:2512.09148·cs.CL·December 11, 2025

Detecting Hallucinations in Graph Retrieval-Augmented Generation via Attention Patterns and Semantic Alignment

Shanghao Li, Jinda Han, Yibo Wang, Yuanjie Zhu, Zihe Song, Langzhou He, Kenan Kamel A Alghythee, Philip S. Yu

PDF

Open Access

TL;DR

This paper introduces interpretability metrics and a hallucination detection method for GraphRAG systems, revealing how structural reliance and semantic misalignment in LLMs lead to hallucinations, and improving detection accuracy.

Contribution

It proposes two new interpretability metrics, PRD and SAS, and a post-hoc hallucination detector GGA, advancing understanding and mitigation of hallucinations in GraphRAG.

Findings

01

High PRD correlates with over-reliance on salient paths.

02

Low SAS indicates weak semantic grounding.

03

GGA outperforms baseline detectors in AUC and F1.

Abstract

Graph-based Retrieval-Augmented Generation (GraphRAG) enhances Large Language Models (LLMs) by incorporating external knowledge from linearized subgraphs retrieved from knowledge graphs. However, LLMs struggle to interpret the relational and topological information in these inputs, resulting in hallucinations that are inconsistent with the retrieved knowledge. To analyze how LLMs attend to and retain structured knowledge during generation, we propose two lightweight interpretability metrics: Path Reliance Degree (PRD), which measures over-reliance on shortest-path triples, and Semantic Alignment Score (SAS), which assesses how well the model's internal representations align with the retrieved knowledge. Through empirical analysis on a knowledge-based QA task, we identify failure patterns associated with over-reliance on salient paths and weak semantic grounding, as indicated by high PRD…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Multimodal Machine Learning Applications