Predictive Minds: LLMs As Atypical Active Inference Agents
Jan Kulveit, Clem von Stengel, Roman Leventov

TL;DR
This paper reinterprets large language models through the lens of active inference, highlighting their current limitations and potential for future self-aware, adaptive behavior driven by feedback loops.
Contribution
It introduces a novel perspective by framing LLMs as atypical active inference agents, contrasting them with traditional systems and discussing future enhancements.
Findings
LLMs currently lack a tight feedback loop between action and perception.
LLMs fit within the active inference paradigm despite current limitations.
Closing the feedback loop could lead to self-aware, adaptive models.
Abstract
Large language models (LLMs) like GPT are often conceptualized as passive predictors, simulators, or even stochastic parrots. We instead conceptualize LLMs by drawing on the theory of active inference originating in cognitive science and neuroscience. We examine similarities and differences between traditional active inference systems and LLMs, leading to the conclusion that, currently, LLMs lack a tight feedback loop between acting in the world and perceiving the impacts of their actions, but otherwise fit in the active inference paradigm. We list reasons why this loop may soon be closed, and possible consequences of this including enhanced model self-awareness and the drive to minimize prediction error by changing the world.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Computability, Logic, AI Algorithms · Machine Learning and Algorithms
MethodsMulti-Head Attention · Attention Is All You Need · Residual Connection · Byte Pair Encoding · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Adam · Softmax · Dense Connections · Dropout
