On the Failure of Latent State Persistence in Large Language Models
Jen-tse Huang, Kaiser Sun, Wenxuan Wang, Mark Dredze

TL;DR
This paper investigates the inability of large language models to maintain persistent internal states, revealing fundamental limitations in their reasoning capabilities and internal representation stability.
Contribution
The paper formalizes the Latent State Persistence gap in LLMs and introduces three novel experiments to quantify and analyze this failure.
Findings
LLMs fail to allocate probability mass to a single hidden choice in a guessing game.
Lack of LSP causes concept drift and self-contradictions in LLMs during multi-question tasks.
Models struggle with tracking transformations on hidden variables, indicating poor internal state management.
Abstract
While Large Language Models (LLMs) excel in reasoning, whether they can sustain persistent latent states remains under-explored. The capacity to maintain and manipulate unexpressed, internal representations-analogous to human working memory-is a cornerstone of complex reasoning. In this paper, we formalize and quantify the "Latent State Persistence" (LSP) gap through three novel experiments. First, we utilize a Number Guessing Game, demonstrating that across independent queries, LLMs fail to allocate probability mass to a singular hidden choice, violating a fundamental probabilistic principle. Second, we employ a Yes-No Game to show that as the number of questions increases, LLMs suffer from "concept drift," leading to inevitable self-contradictions due to the lack of LSP. Finally, inspired by Mathematical Mentalism, we task models with tracking transformations on hidden variables,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
