Embedded Universal Predictive Intelligence: a coherent framework for multi-agent learning

Alexander Meulemans; Rajai Nasser; Maciej Wo{\l}czyk; Marissa A. Weis; Seijin Kobayashi; Blake Richards; Guillaume Lajoie; Angelika Steger; Marcus Hutter; James Manyika; Rif A. Saurous; Jo\~ao Sacramento; Blaise Ag\"uera y Arcas

arXiv:2511.22226·cs.AI·December 1, 2025

Embedded Universal Predictive Intelligence: a coherent framework for multi-agent learning

Alexander Meulemans, Rajai Nasser, Maciej Wo{\l}czyk, Marissa A. Weis, Seijin Kobayashi, Blake Richards, Guillaume Lajoie, Angelika Steger, Marcus Hutter, James Manyika, Rif A. Saurous, Jo\~ao Sacramento, Blaise Ag\"uera y Arcas

PDF

Open Access

TL;DR

This paper introduces a unified framework for multi-agent learning based on self-prediction and Bayesian reinforcement learning, enabling agents to model each other and achieve advanced cooperation and theory of mind.

Contribution

It extends universal artificial intelligence to embedded multi-agent settings, proposing a mathematical framework for prospective self-prediction and mutual modeling.

Findings

01

Agents can reason about others with similar algorithms.

02

Self-prediction leads to new cooperation strategies.

03

Universal agents can develop infinite-order theory of mind.

Abstract

The standard theory of model-free reinforcement learning assumes that the environment dynamics are stationary and that agents are decoupled from their environment, such that policies are treated as being separate from the world they inhabit. This leads to theoretical challenges in the multi-agent setting where the non-stationarity induced by the learning of other agents demands prospective learning based on prediction models. To accurately model other agents, an agent must account for the fact that those other agents are, in turn, forming beliefs about it to predict its future behavior, motivating agents to model themselves as part of the environment. Here, building upon foundational work on universal artificial intelligence (AIXI), we introduce a mathematical framework for prospective learning and embedded agency centered on self-prediction, where Bayesian RL agents predict both future…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmbodied and Extended Cognition · Child and Animal Learning Development · Domain Adaptation and Few-Shot Learning