Whitening Reveals Cluster Commitment as the Geometric Separator of Hallucination Types
Matic Korun

TL;DR
This paper introduces a geometric taxonomy of hallucination types in language models and demonstrates that PCA-whitening reveals cluster commitment as a key separator, highlighting capacity limits and prompt-set sensitivity.
Contribution
It presents a novel whitening method to distinguish hallucination types in embedding space, clarifies the role of capacity limits, and uncovers prompt-set fragility in micro-signal detection.
Findings
Whitening separates Type 2 from Type 3 hallucinations significantly.
Type 1/2 separation is limited by model capacity, not measurement artifacts.
Prompt diversification affects micro-signal detection, revealing prompt-set sensitivity.
Abstract
A geometric hallucination taxonomy distinguishes three failure types -- center-drift (Type~1), wrong-well convergence (Type~2), and coverage gaps (Type~3) -- by their signatures in embedding cluster space. Prior work found Types~1 and~2 indistinguishable in full-dimensional contextual measurement. We address this through PCA-whitening and eigenspectrum decomposition on GPT-2-small, using multi-run stability analysis (20 seeds) with prompt-level aggregation. Whitening transforms the micro-signal regime into a space where peak cluster alignment (max\_sim) separates Type~2 from Type~3 at Holm-corrected significance, with condition means following the taxonomy's predicted ordering: Type~2 (highest commitment) Type~1 (intermediate) Type~3 (lowest). A first directionally stable but underpowered hint of Type~1/2 separation emerges via the same metric, generating a capacity prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPsychedelics and Drug Studies · Hallucinations in medical conditions · Functional Brain Connectivity Studies
