Predictive Minds: LLMs As Atypical Active Inference Agents

Jan Kulveit; Clem von Stengel; Roman Leventov

arXiv:2311.10215·cs.CL·November 20, 2023·1 cites

Predictive Minds: LLMs As Atypical Active Inference Agents

Jan Kulveit, Clem von Stengel, Roman Leventov

PDF

Open Access

TL;DR

This paper reinterprets large language models through the lens of active inference, highlighting their current limitations and potential for future self-aware, adaptive behavior driven by feedback loops.

Contribution

It introduces a novel perspective by framing LLMs as atypical active inference agents, contrasting them with traditional systems and discussing future enhancements.

Findings

01

LLMs currently lack a tight feedback loop between action and perception.

02

LLMs fit within the active inference paradigm despite current limitations.

03

Closing the feedback loop could lead to self-aware, adaptive models.

Abstract

Large language models (LLMs) like GPT are often conceptualized as passive predictors, simulators, or even stochastic parrots. We instead conceptualize LLMs by drawing on the theory of active inference originating in cognitive science and neuroscience. We examine similarities and differences between traditional active inference systems and LLMs, leading to the conclusion that, currently, LLMs lack a tight feedback loop between acting in the world and perceiving the impacts of their actions, but otherwise fit in the active inference paradigm. We list reasons why this loop may soon be closed, and possible consequences of this including enhanced model self-awareness and the drive to minimize prediction error by changing the world.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLanguage and cultural evolution · Computability, Logic, AI Algorithms · Machine Learning and Algorithms

MethodsMulti-Head Attention · Attention Is All You Need · Residual Connection · Byte Pair Encoding · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Adam · Softmax · Dense Connections · Dropout