Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in Multimodal LLMs

Supratik Sarkar; Swagatam Das

arXiv:2508.19366·cs.LG·December 11, 2025

Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in Multimodal LLMs

Supratik Sarkar, Swagatam Das

PDF

3 Reviews

TL;DR

This paper introduces a spectral-graph framework grounded in diffusion dynamics to quantify and analyze hallucinations in multimodal large language models, enabling principled evaluation and mitigation.

Contribution

It develops a novel information-geometric approach using spectral decompositions of graph Laplacians to measure semantic distortions in multimodal LLM outputs.

Findings

01

Derived Courant-Fischer bounds on hallucination profiles

02

Introduced modality-aware, interpretable measures of hallucination

03

Tracked hallucination evolution over prompts and time

Abstract

Hallucinations in LLMs--especially in multimodal settings--undermine reliability. We present a rigorous information-geometric framework, grounded in diffusion dynamics, to quantify hallucinations in MLLMs where model outputs are embedded via spectral decompositions of multimodal graph Laplacians, and their gaps to a truth manifold define a semantic distortion metric. We derive Courant-Fischer bounds on a temperature-dependent hallucination profile and use RKHS eigenmodes to obtain modality-aware, interpretable measures that track evolution over prompts and time. This reframes hallucination as quantifiable and bounded, providing a principled basis for evaluation and mitigation.

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 2Confidence 2

Strengths

• Clear positioning: gap = no theory-backed, modality-aware framework that quantifies hallucinations. • Proposes a KL-smoothed semantic distortion $d^{(\varepsilon,h)}_{\text{sem}}$ that is $0$ on an admissible set $K$ and $>0$ off $K$, via a $K$-restricted vs. unconditional smoother (Eq. 6). • Hypergraph construction is naturally multimodal, yields a hallucination energy, and admits CF bounds. • Empirically validates theory (three datasets × three stacks) and outperforms baselines (AUROC/AUP

Weaknesses

1. **Primary weakness:** Dense formalism buries the practical message—what problems does this framework actually solve, and under what conditions should practitioners prefer it over standard uncertainty baselines? Also, please explain how the mathematical assumptions for the theorems (beyond “g-free” which is adequately explained) are translated to real world scenarios. 2. “Reference-free” vs. operational $K$: Reconcile the “independent-of-$g$” claim with the use of a finite admissible set $K$ a

Reviewer 02Rating 2Confidence 3

Strengths

1. The paper aims to quantify hallucination via information metrics, which is a novel perspective. 2. The paper aims to define measure from mathematical formulations, which paves the way for rigorous evaluations and analysis.

Weaknesses

1. There is a general lack of motivation for the development of formulations in the paper. While $K_g$ appears in early set-ups, all subsequent developments are built on $K$. The paper neither discuss how $K$ and $K_g$ are different in practical LLM usages, nor provide clues on how $K$ is obtained in experiments (is it dependent on the specific LLM, training data or evaluation data?), making the definition of hallucination idealized and not connected to practice. Furthermore, the paper does not

Reviewer 03Rating 4Confidence 3

Strengths

1. The study of hallucination in LLMs and MLLMs is an important and timely research topic. 2. This work provides a theoretical grounding for hallucination quantification, offering valuable insights and potential guidance for future research in this area. 3. The open-sourced codebase enhances reproducibility and supports further validation by the community.

Weaknesses

As I am not familiar with the theoretical aspects, my assessment focuses mainly on the experimental design and empirical evaluation: 1. The experimental setup does not fully align with the paper’s claim. Although the authors aim to address hallucination in LLMs/MLLMs, the experiments only involve traditional multimodal models such as BLIP, CLIP, Whisper, and T5, rather than modern autoregressive LLMs like Qwen-Audio/VL/Omni or the GPT-4/5 series. This limitation significantly constrains the pap

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.