Stochastic Thermodynamics for Autoregressive Generative Models: A Non-Markovian Perspective
Takahiro Sagawa

TL;DR
This paper introduces a stochastic thermodynamics framework for autoregressive generative models, enabling entropy production estimation in non-Markovian processes like language models, with applications to GPT-2 and Kalman filters.
Contribution
It develops a general theoretical approach to quantify irreversibility in non-Markovian generative models, including entropy production estimation and decomposition into meaningful components.
Findings
Entropy production can be estimated efficiently from sampled trajectories.
Token-level entropy production is dominated by syntactic artifacts.
Sentence-level entropy production can distinguish causal from non-causal text.
Abstract
Autoregressive generative models -- including Transformers, recurrent neural networks, classical Kalman filters, state space models, and Mamba -- all generate sequences by sampling each output from a deterministic summary of the past, producing genuinely non-Markovian observed processes. We develop a general theoretical framework based on stochastic thermodynamics for this class of architectures and introduce the entropy production, which can be efficiently estimated from sampled trajectories without exponential sampling cost, despite the non-Markovian nature of the observed dynamics. As a proof-of-concept experiment with a large language model (LLM), we evaluate the entropy production for a pre-trained Transformer-based model, GPT-2. We find that the token-level entropy production is dominated by a syntactic artifact, while the sentence-level entropy production may yield a more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
