On the Notion that Language Models Reason

Bertram H{\o}jer

arXiv:2511.11810·cs.CL·November 18, 2025

On the Notion that Language Models Reason

Bertram H{\o}jer

PDF

Open Access

TL;DR

This paper critically examines the concept of reasoning in language models, arguing that their outputs are better understood as statistical regularities rather than genuine logical reasoning, which impacts how we evaluate their epistemic uncertainty.

Contribution

The paper clarifies the mismatch between reasoning definitions and how language models operate, proposing a view of LMs as implicit finite-order Markov kernels rather than true reasoners.

Findings

01

LM outputs reflect statistical regularities, not logical reasoning

02

Reasoning-like behavior arises from learned statistical invariances

03

Understanding LMs as pattern matchers affects epistemic uncertainty evaluation

Abstract

Language models (LMs) are said to be exhibiting reasoning, but what does this entail? We assess definitions of reasoning and how key papers in the field of natural language processing (NLP) use the notion and argue that the definitions provided are not consistent with how LMs are trained, process information, and generate new tokens. To illustrate this incommensurability we assume the view that transformer-based LMs implement an \textit{implicit} finite-order Markov kernel mapping contexts to conditional token distributions. In this view, reasoning-like outputs correspond to statistical regularities and approximate statistical invariances in the learned kernel rather than the implementation of explicit logical mechanisms. This view is illustrative of the claim that LMs are "statistical pattern matchers"" and not genuine reasoners and provides a perspective that clarifies why…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications