Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers

Yixiao Huang; Hanlin Zhu; Tianyu Guo; Jiantao Jiao; Somayeh Sojoudi; Michael I. Jordan; Stuart Russell; Song Mei

arXiv:2506.10887·cs.CL·February 3, 2026

Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers

Yixiao Huang, Hanlin Zhu, Tianyu Guo, Jiantao Jiao, Somayeh Sojoudi, Michael I. Jordan, Stuart Russell, Song Mei

PDF

Open Access

TL;DR

This paper investigates out-of-context reasoning in large language models, revealing it as a key factor behind both their ability to generalize from new facts and their tendency to hallucinate, supported by theoretical and empirical analysis.

Contribution

It introduces a formal framework for out-of-context reasoning, demonstrating how model architecture and implicit bias influence this phenomenon.

Findings

01

OCR drives both generalization and hallucination depending on causal relations

02

A simple attention-only transformer can learn OCR in a synthetic task

03

Gradient descent bias explains the model's ability to associate facts efficiently

Abstract

Large language models (LLMs) can acquire new knowledge through fine-tuning, but this process exhibits a puzzling duality: models can generalize remarkably from new facts, yet are also prone to hallucinating incorrect information. However, the reasons for this phenomenon remain poorly understood. In this work, we argue that both behaviors stem from a single mechanism known as out-of-context reasoning (OCR): the ability to deduce implications by associating concepts, even those without a causal link. Our experiments across five prominent LLMs confirm that OCR indeed drives both generalization and hallucination, depending on whether the associated concepts are causally related. To build a rigorous theoretical understanding of this phenomenon, we then formalize OCR as a synthetic factual recall task. We empirically show that a one-layer single-head attention-only transformer with factorized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications