Hallucination Detection in LLMs Using Spectral Features of Attention Maps

Jakub Binkowski; Denis Janiak; Albert Sawczyn; Bogdan Gabrys; Tomasz Kajdanowicz

arXiv:2502.17598 · cs.LG · October 21, 2025

TL;DR

This paper introduces a spectral analysis method that uses the Laplacian eigenvalues of attention maps to detect hallucinations in Large Language Models, achieving state-of-the-art results among attention-based methods.

Contribution

The paper proposes the $\text{LapEigvals}$ method, which leverages spectral features of attention maps for more effective hallucination detection in LLMs.

Findings

01 Achieves state-of-the-art performance among attention-based methods.

02 Demonstrates robustness and generalisation across different models.

03 Provides extensive ablation studies validating the approach.

Abstract

Large Language Models (LLMs) have demonstrated remarkable performance across various tasks but remain prone to hallucinations. Detecting hallucinations is essential for safety-critical applications, and recent methods leverage attention map properties to this end, though their effectiveness remains limited. In this work, we investigate the spectral features of attention maps by interpreting them as adjacency matrices of graph structures. We propose the $LapEigvals$ method, which utilises the top-$k$ eigenvalues of the Laplacian matrix derived from the attention maps as an input to hallucination detection probes. Empirical evaluations demonstrate that our approach achieves state-of-the-art hallucination detection performance among attention-based methods. Extensive ablation studies further highlight the robustness and generalisation of $LapEigvals$, paving the way for…
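
The core idea in the abstract (treat an attention map as a graph adjacency matrix, form a Laplacian, keep the top-$k$ eigenvalues as features) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes the plain Laplacian $L = D - A$ with $D$ built from column sums of the attention map; the paper's exact degree definition and normalisation may differ, and the function name `laplacian_topk_eigvals` and the toy attention map are hypothetical.

```python
import numpy as np


def laplacian_topk_eigvals(attn: np.ndarray, k: int) -> np.ndarray:
    """Return the k largest eigenvalues of L = D - A for one attention map.

    Assumption (not taken from the paper): D is the diagonal matrix of
    column sums of A, i.e. the total attention each token receives.
    """
    attn = np.asarray(attn, dtype=float)
    degree = np.diag(attn.sum(axis=0))
    laplacian = degree - attn
    # For a causal (lower-triangular) attention map, L is also lower
    # triangular, so its eigenvalues coincide with its diagonal entries.
    # The general solver is used here so non-causal maps work as well.
    eigvals = np.linalg.eigvals(laplacian).real
    return np.sort(eigvals)[::-1][:k]


# Toy causal attention map: row-wise softmax over masked random scores.
rng = np.random.default_rng(0)
T = 6
scores = rng.normal(size=(T, T))
causal_mask = np.tril(np.ones((T, T), dtype=bool))
scores = np.where(causal_mask, scores, -np.inf)
attn = np.exp(scores - scores.max(axis=1, keepdims=True))
attn /= attn.sum(axis=1, keepdims=True)

features = laplacian_topk_eigvals(attn, k=3)
print(features)
```

In a full pipeline, one such feature vector would presumably be computed per attention head and layer, concatenated, and passed to a probe such as a logistic regression classifier; that wiring is omitted here because the abstract does not specify it.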

Code & Models

Repositories

graphml-lab-pwr/lapeig (PyTorch, official)

Videos

Hallucination Detection in LLMs Using Spectral Features of Attention Maps · Underline

Taxonomy

Topics: Brain Tumor Detection and Classification · Anomaly Detection Techniques and Applications

Methods: Softmax · Attention Is All You Need