# An Information–Theoretic Model of Abduction for Detecting Hallucinations in Explanations

**Authors:** Boris Galitsky

PMC · DOI: 10.3390/e28020173 · 2026-02-02

## TL;DR

This paper introduces a new model that detects hallucinations in AI-generated explanations by combining information theory and abductive reasoning.

## Contribution

The novel contribution is a neuro-symbolic framework using entropy-based inference and abductive reasoning for hallucination detection.

## Key findings

- The model outperforms existing neural and symbolic methods in hallucination detection accuracy and interpretability.
- It successfully identifies hallucinations in GPT-5.1 outputs through abductive reasoning and information divergence.
- Discourse structure integration improves differentiation between valid and unsupported claims.

## Abstract

We present an Information–Theoretic Model of Abduction for Detecting Hallucinations in Generative Models, a neuro-symbolic framework that combines entropy-based inference with abductive reasoning to identify unsupported or contradictory content in large language model outputs. Our approach treats hallucination detection as a dual optimization problem: minimizing the information gain between source-conditioned and response-conditioned belief distributions, while simultaneously selecting the minimal abductive hypothesis capable of explaining discourse-salient claims. By incorporating discourse structure through RST-derived EDU weighting, the model distinguishes legitimate abductive elaborations from claims that cannot be justified under any computationally plausible hypothesis. Experimental evaluation across medical, factual QA, and multi-hop reasoning datasets demonstrates that the proposed method outperforms state-of-the-art neural and symbolic baselines in both accuracy and interpretability. Qualitative analysis further shows that the framework successfully exposes plausible-sounding but abductively unsupported model errors, including real hallucinations generated by GPT-5.1. Together, these results indicate that integrating Information–Theoretic divergence and abductive explanation provides a principled and effective foundation for robust hallucination detection in generative systems.

## Full-text entities

- **Genes:** ATHS (atherosclerosis susceptibility (lipoprotein associated)) [NCBI Gene 470] {aka ALP}, MAP3K8 (mitogen-activated protein kinase kinase kinase 8) [NCBI Gene 1326] {aka AURA2, COT, EST, ESTF, MEKK8, TPL2}, GPT (glutamic--pyruvic transaminase) [NCBI Gene 2875] {aka AAT1, ALT, ALT1, GPT1, SGPT}, CHR [NCBI Gene 1125]
- **Diseases:** pain (MESH:D010146), flu (MESH:D007251), Gout (MESH:D006073), headache (MESH:D006261), inflammation (MESH:D007249), injury to (MESH:D014947), swelling (MESH:D004487), LLMs (MESH:D007806), IG (MESH:D015430), rash (MESH:D005076), arrhythmia (MESH:D001145), lupus (MESH:D008180), XAI (MESH:C538243), Fever (MESH:D005334), CoT. (MESH:D007161), hallucinatory (MESH:C000726587), arthritis (MESH:D001168), Hallucinations (MESH:D006212), rheumatoid arthritis (MESH:D001172), swollen joints (MESH:D007592), rubella (MESH:D012409), joint pain (MESH:D018771), scarlet fever (MESH:D012541), allergic reaction (MESH:D004342), NEUTRAL (MESH:C536560), measles (MESH:D008457), explosion (MESH:D007174), hyperuricemia (MESH:D033461), viral arthritis (MESH:D001170)
- **Chemicals:** oxygen (MESH:D010100), Hc (MESH:D006854), urate (MESH:D014527), L (MESH:D007930), colchicine (MESH:D003078), Aleph (-), D (MESH:D003903), purine (MESH:C030985), Ar (MESH:D001128), H. (MESH:D006859), As (MESH:D001151)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12939550/full.md

---
Source: https://tomesphere.com/paper/PMC12939550