Multi-agent cooperation through in-context co-player inference
Marissa A. Weis, Maciej Wo{\l}czyk, Rajai Nasser, Rif A. Saurous, Blaise Ag\"uera y Arcas, Jo\~ao Sacramento, Alexander Meulemans

TL;DR
This paper demonstrates that sequence models can naturally learn to cooperate in multi-agent settings by in-context adaptation to diverse co-players, without hardcoded assumptions or explicit timescale separation.
Contribution
It shows that in-context learning in sequence models enables natural co-player adaptation and cooperation without relying on predefined learning rules or timescale distinctions.
Findings
In-context learning induces best-response strategies in multi-agent interactions.
Vulnerability to extortion naturally leads to mutual shaping and cooperation.
Diverse co-player training promotes scalable cooperative behavior.
Abstract
Achieving cooperation among self-interested agents remains a fundamental challenge in multi-agent reinforcement learning. Recent work showed that mutual cooperation can be induced between "learning-aware" agents that account for and shape the learning dynamics of their co-players. However, existing approaches typically rely on hardcoded, often inconsistent, assumptions about co-player learning rules or enforce a strict separation between "naive learners" updating on fast timescales and "meta-learners" observing these updates. Here, we demonstrate that the in-context learning capabilities of sequence models allow for co-player learning awareness without requiring hardcoded assumptions or explicit timescale separation. We show that training sequence model agents against a diverse distribution of co-players naturally induces in-context best-response strategies, effectively functioning as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Game Theory and Cooperation · Adversarial Robustness in Machine Learning
