Semantic Uncertainty Quantification of Hallucinations in LLMs: A Quantum Tensor Network Based Method
Pragatheeswaran Vipulanandan, Kamal Premaratne, Dilip Sarkar

TL;DR
This paper introduces a quantum tensor network based framework for quantifying semantic uncertainty in LLM hallucinations, enabling more reliable detection and prioritization of trustworthy outputs.
Contribution
It presents a novel quantum physics inspired method for uncertainty quantification in LLMs, improving hallucination detection and output reliability assessment.
Findings
Consistent AUROC and AURAC improvements over baselines
Robustness across different generation lengths and quantization levels
Effective in resource-constrained deployment scenarios
Abstract
Large language models (LLMs) exhibit strong generative capabilities but remain vulnerable to confabulations, fluent yet unreliable outputs that vary arbitrarily even under identical prompts. Leveraging a quantum tensor network based pipeline, we propose a quantum physics inspired uncertainty quantification framework that accounts for aleatoric uncertainty in token sequence probability for semantic equivalence based clustering of LLM generations. This offers a principled and interpretable scheme for hallucination detection. We further introduce an entropy maximization strategy that prioritizes high certainty, semantically coherent outputs and highlights entropy regions where LLM decisions are likely to be unreliable, offering practical guidelines for when human oversight is warranted. We evaluate the robustness of our scheme under different generation lengths and quantization levels,…
Peer Reviews
Decision·ICLR 2026 Poster
- The paper is generally well written although a bit heavy on theoretical notations. It could be simplified a bit more though. - It brings quantum physics-inspired UQ to the LLM hallucination detection problem. - The paper tackles a very significant issue. Hallucination risk in LLMs is a key challenge for safe AI.
- Heavy reliance on advanced mathematical machinery. Makes it a bit harder to adopt. - The apporach does not come significant performance boost.
1. Clear motivation: addresses the instability of entropy-based uncertainty when probability mass collapses into a few clusters. 2. Conceptual novelty: combining semantic clustering with quantum-mechanical perturbation analysis is original and theoretically interesting. 3. Methodological soundness: the derivations are mathematically consistent, and the optimization formulation is well justified. 4. Robustness: results are shown across several models, datasets, and quantization levels, indicating
1. Complexity and interpretability: the proposed QTN embedding and perturbation analysis are computationally heavier than simpler uncertainty measures; inference-time cost is not reported. 2. Baselines could be broader: comparisons omit recent energy- or logits-based approaches (e.g., Semantic Energy, LogTokU). 3. Sensitivity analysis: the method introduces several hyperparameters (kernel bandwidth, λ, Rényi α), but their tuning process and sensitivity are not discussed in depth. 4. Clustering d
* Interesting and novel way of thinking about uncertainty in LLMs * Small, but persistent, improvement over baselines (in a domain where the baselines are already pretty strong)
* Not enough background/intuition on QTNs and the physics-inspired approach used * Lack of technical detail: I am not sure how to construct all the estimators and would certainly not be able to implement the method myself. * Experiments do not fully tease apart the benefit of the proposed method I elaborate on all these points in my questions below.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum many-body systems · Topic Modeling · Advanced Graph Neural Networks
