Multi-agent Deep Covering Skill Discovery
Jiayu Chen, Marina Haliem, Tian Lan, Vaneet Aggarwal

TL;DR
This paper introduces a hierarchical multi-agent option discovery framework that minimizes joint state space cover time, enabling more efficient exploration and collaboration in multi-agent reinforcement learning tasks.
Contribution
It proposes a novel multi-agent option discovery method using cover time minimization and hierarchical learning with attention mechanisms for scalable multi-agent coordination.
Findings
Outperforms prior methods in exploration speed and task rewards.
Effectively captures agent interactions with attention mechanisms.
Successfully identifies collaborative options for sub-tasks.
Abstract
The use of skills (a.k.a., options) can greatly accelerate exploration in reinforcement learning, especially when only sparse reward signals are available. While option discovery methods have been proposed for individual agents, in multi-agent reinforcement learning settings, discovering collaborative options that can coordinate the behavior of multiple agents and encourage them to visit the under-explored regions of their joint state space has not been considered. In this case, we propose Multi-agent Deep Covering Option Discovery, which constructs the multi-agent options through minimizing the expected cover time of the multiple agents' joint state space. Also, we propose a novel framework to adopt the multi-agent options in the MARL process. In practice, a multi-agent task can usually be divided into some sub-tasks, each of which can be completed by a sub-group of the agents.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Machine Learning and Data Classification
