Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space
Xiaoyang Yu, Youfang Lin, Shuo Wang, Kai Lv, Sheng Han

TL;DR
This paper introduces the Unified Action Space (UAS) to improve parameter-sharing among heterogeneous agents in multi-agent reinforcement learning, enhancing cooperation without excessive computational costs.
Contribution
The paper proposes UAS and a Cross-Group Inverse loss to better handle action semantics, enabling effective parameter-sharing in heterogeneous multi-agent systems.
Findings
UAS improves cooperation in heterogeneous MAS
U-QMIX and U-MAPPO outperform state-of-the-art methods
Effective in SMAC environment
Abstract
In a multi-agent system (MAS), action semantics indicates the different influences of agents' actions toward other entities, and can be used to divide agents into groups in a physically heterogeneous MAS. Previous multi-agent reinforcement learning (MARL) algorithms apply global parameter-sharing across different types of heterogeneous agents without careful discrimination of different action semantics. This common implementation decreases the cooperation and coordination between agents in complex situations. However, fully independent agent parameters dramatically increase the computational cost and training difficulty. In order to benefit from the usage of different action semantics while also maintaining a proper parameter-sharing structure, we introduce the Unified Action Space (UAS) to fulfill the requirement. The UAS is the union set of all agent actions with different semantics.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsSparse Evolutionary Training · Mixing Adam and SGD
