Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations

Kalyan Cherukuri; Lav R. Varshney

arXiv:2604.04743·cs.CL·April 7, 2026

Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations

Kalyan Cherukuri, Lav R. Varshney

PDF

TL;DR

This paper introduces a geometric dynamical systems framework to understand LLM hallucinations, revealing task-dependent basin structures in latent space and proposing geometry-aware steering to reduce hallucinations.

Contribution

It formalizes the behavior of hallucinations with task-complexity and multi-basin theorems, and demonstrates geometry-aware steering as a method to mitigate hallucinations without retraining.

Findings

01

Factoid tasks show clearer basin separation.

02

Summarization tasks have overlapping basin structures.

03

Geometry-aware steering reduces hallucination probability.

Abstract

Large language models (LLMs) hallucinate: they produce fluent outputs that are factually incorrect. We present a geometric dynamical systems framework in which hallucinations arise from task-dependent basin structure in latent space. Using autoregressive hidden-state trajectories across multiple open-source models and benchmarks, we find that separability is strongly task-dependent rather than universal: factoid settings can show clearer basin separation, whereas summarization and misconception-heavy settings are typically less stable and often overlap. We formalize this behavior with task-complexity and multi-basin theorems, characterize basin emergence in L-layer transformers, and show that geometry-aware steering can reduce hallucination probability without retraining.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.