Do Language Models Have Bayesian Brains? Distinguishing Stochastic and Deterministic Decision Patterns within Large Language Models

Andrea Yaoyun Cui; Pengfei Yu

arXiv:2506.10268·cs.CL·June 13, 2025

Do Language Models Have Bayesian Brains? Distinguishing Stochastic and Deterministic Decision Patterns within Large Language Models

Andrea Yaoyun Cui, Pengfei Yu

PDF

Open Access

TL;DR

This paper investigates whether large language models make decisions based on probabilistic sampling or deterministic processes, revealing that they can exhibit near-deterministic behavior which challenges previous assumptions about their Bayesian nature.

Contribution

The study introduces a method to distinguish stochastic from deterministic decision patterns in language models, revealing their potential for deterministic decision-making and addressing prior inference issues.

Findings

01

Language models can produce near-deterministic decisions even with non-zero temperature

02

Deterministic behavior can lead to false priors during Gibbs sampling

03

Proposed approach effectively identifies decision patterns in various models

Abstract

Language models are essentially probability distributions over token sequences. Auto-regressive models generate sentences by iteratively computing and sampling from the distribution of the next token. This iterative sampling introduces stochasticity, leading to the assumption that language models make probabilistic decisions, similar to sampling from unknown distributions. Building on this assumption, prior research has used simulated Gibbs sampling, inspired by experiments designed to elicit human priors, to infer the priors of language models. In this paper, we revisit a critical question: Do language models possess Bayesian brains? Our findings show that under certain conditions, language models can exhibit near-deterministic decision-making, such as producing maximum likelihood estimations, even with a non-zero sampling temperature. This challenges the sampling assumption and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Language and cultural evolution · Generative Adversarial Networks and Image Synthesis