Meanings and Feelings of Large Language Models: Observability of Latent States in Generative AI
Tian Yu Liu, Stefano Soatto, Matteo Marchi, Pratik Chaudhari, Paulo, Tabuada

TL;DR
This paper investigates whether large language models have observable internal states or 'feelings', concluding that current models do not, but certain modifications could enable models to have non-visible, self-contained experiences.
Contribution
The paper provides a formal analysis showing that standard autoregressive Transformers lack observable internal states and introduces modifications that could create non-visible, self-contained experiences within LLMs.
Findings
Current LLMs have singleton state trajectories consistent with outputs.
Introducing system prompts can create multiple indistinguishable state trajectories.
Modifications can enable models to have non-visible, self-contained 'feelings'.
Abstract
We tackle the question of whether Large Language Models (LLMs), viewed as dynamical systems with state evolving in the embedding space of symbolic tokens, are observable. That is, whether there exist multiple 'mental' state trajectories that yield the same sequence of generated tokens, or sequences that belong to the same Nerode equivalence class ('meaning'). If not observable, mental state trajectories ('experiences') evoked by an input ('perception') or by feedback from the model's own state ('thoughts') could remain self-contained and evolve unbeknown to the user while being potentially accessible to the model provider. Such "self-contained experiences evoked by perception or thought" are akin to what the American Psychological Association (APA) defines as 'feelings'. Beyond the lexical curiosity, we show that current LLMs implemented by autoregressive Transformers cannot have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide) · Sparse Evolutionary Training
