Improving Global Parameter-sharing in Physically Heterogeneous   Multi-agent Reinforcement Learning with Unified Action Space

Xiaoyang Yu; Youfang Lin; Shuo Wang; Kai Lv; Sheng Han

arXiv:2408.07395·cs.MA·August 15, 2024

Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space

Xiaoyang Yu, Youfang Lin, Shuo Wang, Kai Lv, Sheng Han

PDF

Open Access

TL;DR

This paper introduces the Unified Action Space (UAS) to improve parameter-sharing among heterogeneous agents in multi-agent reinforcement learning, enhancing cooperation without excessive computational costs.

Contribution

The paper proposes UAS and a Cross-Group Inverse loss to better handle action semantics, enabling effective parameter-sharing in heterogeneous multi-agent systems.

Findings

01

UAS improves cooperation in heterogeneous MAS

02

U-QMIX and U-MAPPO outperform state-of-the-art methods

03

Effective in SMAC environment

Abstract

In a multi-agent system (MAS), action semantics indicates the different influences of agents' actions toward other entities, and can be used to divide agents into groups in a physically heterogeneous MAS. Previous multi-agent reinforcement learning (MARL) algorithms apply global parameter-sharing across different types of heterogeneous agents without careful discrimination of different action semantics. This common implementation decreases the cooperation and coordination between agents in complex situations. However, fully independent agent parameters dramatically increase the computational cost and training difficulty. In order to benefit from the usage of different action semantics while also maintaining a proper parameter-sharing structure, we introduce the Unified Action Space (UAS) to fulfill the requirement. The UAS is the union set of all agent actions with different semantics.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsSparse Evolutionary Training · Mixing Adam and SGD