Autotelic Reinforcement Learning in Multi-Agent Environments
Eleni Nisioti, El\'ias Masquil, Gautier Hamon, and Cl\'ement, Moulin-Frier

TL;DR
This paper introduces a decentralized approach for multi-agent reinforcement learning where agents autonomously generate and align goals, enabling cooperative skills acquisition without external supervision, demonstrated through navigation tasks.
Contribution
It proposes the Dec-IMSAP framework and a goal-coordination game that facilitate emergent goal alignment and cooperation in multi-agent intrinsically motivated learning.
Findings
Goal alignment improves goal diversity and cooperation.
Decentralized goal coordination matches centralized training performance.
Emergent communication enables autonomous cooperation.
Abstract
In the intrinsically motivated skills acquisition problem, the agent is set in an environment without any pre-defined goals and needs to acquire an open-ended repertoire of skills. To do so the agent needs to be autotelic (deriving from the Greek auto (self) and telos (end goal)): it needs to generate goals and learn to achieve them following its own intrinsic motivation rather than external supervision. Autotelic agents have so far been considered in isolation. But many applications of open-ended learning entail groups of agents. Multi-agent environments pose an additional challenge for autotelic agents: to discover and master goals that require cooperation agents must pursue them simultaneously, but they have low chances of doing so if they sample them independently. In this work, we propose a new learning paradigm for modeling such settings, the Decentralized Intrinsically Motivated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Reinforcement Learning in Robotics · Auction Theory and Applications
