Multi-agent cooperation through in-context co-player inference

Marissa A. Weis; Maciej Wo{\l}czyk; Rajai Nasser; Rif A. Saurous; Blaise Ag\"uera y Arcas; Jo\~ao Sacramento; Alexander Meulemans

arXiv:2602.16301·cs.AI·February 19, 2026

Multi-agent cooperation through in-context co-player inference

Marissa A. Weis, Maciej Wo{\l}czyk, Rajai Nasser, Rif A. Saurous, Blaise Ag\"uera y Arcas, Jo\~ao Sacramento, Alexander Meulemans

PDF

Open Access

TL;DR

This paper demonstrates that sequence models can naturally learn to cooperate in multi-agent settings by in-context adaptation to diverse co-players, without hardcoded assumptions or explicit timescale separation.

Contribution

It shows that in-context learning in sequence models enables natural co-player adaptation and cooperation without relying on predefined learning rules or timescale distinctions.

Findings

01

In-context learning induces best-response strategies in multi-agent interactions.

02

Vulnerability to extortion naturally leads to mutual shaping and cooperation.

03

Diverse co-player training promotes scalable cooperative behavior.

Abstract

Achieving cooperation among self-interested agents remains a fundamental challenge in multi-agent reinforcement learning. Recent work showed that mutual cooperation can be induced between "learning-aware" agents that account for and shape the learning dynamics of their co-players. However, existing approaches typically rely on hardcoded, often inconsistent, assumptions about co-player learning rules or enforce a strict separation between "naive learners" updating on fast timescales and "meta-learners" observing these updates. Here, we demonstrate that the in-context learning capabilities of sequence models allow for co-player learning awareness without requiring hardcoded assumptions or explicit timescale separation. We show that training sequence model agents against a diverse distribution of co-players naturally induces in-context best-response strategies, effectively functioning as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Game Theory and Cooperation · Adversarial Robustness in Machine Learning