On the Failure of Latent State Persistence in Large Language Models

Jen-tse Huang; Kaiser Sun; Wenxuan Wang; Mark Dredze

arXiv:2505.10571·cs.CL·January 27, 2026

On the Failure of Latent State Persistence in Large Language Models

Jen-tse Huang, Kaiser Sun, Wenxuan Wang, Mark Dredze

PDF

TL;DR

This paper investigates the inability of large language models to maintain persistent internal states, revealing fundamental limitations in their reasoning capabilities and internal representation stability.

Contribution

The paper formalizes the Latent State Persistence gap in LLMs and introduces three novel experiments to quantify and analyze this failure.

Findings

01

LLMs fail to allocate probability mass to a single hidden choice in a guessing game.

02

Lack of LSP causes concept drift and self-contradictions in LLMs during multi-question tasks.

03

Models struggle with tracking transformations on hidden variables, indicating poor internal state management.

Abstract

While Large Language Models (LLMs) excel in reasoning, whether they can sustain persistent latent states remains under-explored. The capacity to maintain and manipulate unexpressed, internal representations-analogous to human working memory-is a cornerstone of complex reasoning. In this paper, we formalize and quantify the "Latent State Persistence" (LSP) gap through three novel experiments. First, we utilize a Number Guessing Game, demonstrating that across independent queries, LLMs fail to allocate probability mass to a singular hidden choice, violating a fundamental probabilistic principle. Second, we employ a Yes-No Game to show that as the number of questions increases, LLMs suffer from "concept drift," leading to inevitable self-contradictions due to the lack of LSP. Finally, inspired by Mathematical Mentalism, we task models with tracking transformations on hidden variables,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.