Are language models rational? The case of coherence norms and belief revision

Thomas Hofweber; Peter Hase; Elias Stengel-Eskin; Mohit Bansal

arXiv:2406.03442·cs.CL·November 17, 2025

Are language models rational? The case of coherence norms and belief revision

Thomas Hofweber, Peter Hase, Elias Stengel-Eskin, Mohit Bansal

PDF

Open Access

TL;DR

This paper examines whether language models adhere to rational coherence norms, introducing a new credence measure based on internal probabilities, and finds that some models do align with these norms while others do not.

Contribution

It introduces the Minimal Assent Connection (MAC) and a novel credence account, providing a framework to evaluate rational coherence in language models.

Findings

01

Some language models satisfy coherence norms

02

The new credence measure correlates with model behavior

03

Implications for AI safety and alignment

Abstract

Do norms of rationality apply to machine learning models, in particular language models? In this paper we investigate this question by focusing on a special subset of rational norms: coherence norms. We consider both logical coherence norms as well as coherence norms tied to the strength of belief. To make sense of the latter, we introduce the Minimal Assent Connection (MAC) and propose a new account of credence, which captures the strength of belief in language models. This proposal uniformly assigns strength of belief simply on the basis of model internal next token probabilities. We argue that rational norms tied to coherence do apply to some language models, but not to others. This issue is significant since rationality is closely tied to predicting and explaining behavior, and thus it is connected to considerations about AI safety and alignment, as well as understanding model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multi-Agent Systems and Negotiation