Meanings and Feelings of Large Language Models: Observability of Latent   States in Generative AI

Tian Yu Liu; Stefano Soatto; Matteo Marchi; Pratik Chaudhari; Paulo; Tabuada

arXiv:2405.14061·cs.AI·May 24, 2024

Meanings and Feelings of Large Language Models: Observability of Latent States in Generative AI

Tian Yu Liu, Stefano Soatto, Matteo Marchi, Pratik Chaudhari, Paulo, Tabuada

PDF

Open Access

TL;DR

This paper investigates whether large language models have observable internal states or 'feelings', concluding that current models do not, but certain modifications could enable models to have non-visible, self-contained experiences.

Contribution

The paper provides a formal analysis showing that standard autoregressive Transformers lack observable internal states and introduces modifications that could create non-visible, self-contained experiences within LLMs.

Findings

01

Current LLMs have singleton state trajectories consistent with outputs.

02

Introducing system prompts can create multiple indistinguishable state trajectories.

03

Modifications can enable models to have non-visible, self-contained 'feelings'.

Abstract

We tackle the question of whether Large Language Models (LLMs), viewed as dynamical systems with state evolving in the embedding space of symbolic tokens, are observable. That is, whether there exist multiple 'mental' state trajectories that yield the same sequence of generated tokens, or sequences that belong to the same Nerode equivalence class ('meaning'). If not observable, mental state trajectories ('experiences') evoked by an input ('perception') or by feedback from the model's own state ('thoughts') could remain self-contained and evolve unbeknown to the user while being potentially accessible to the model provider. Such "self-contained experiences evoked by perception or thought" are akin to what the American Psychological Association (APA) defines as 'feelings'. Beyond the lexical curiosity, we show that current LLMs implemented by autoregressive Transformers cannot have…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide) · Sparse Evolutionary Training