Deep Multi-Agent Reinforcement Learning with Hybrid Action Spaces based   on Maximum Entropy

Hongzhi Hua; Kaigui Wu; Guixuan Wen

arXiv:2206.05108·cs.LG·June 13, 2022

Deep Multi-Agent Reinforcement Learning with Hybrid Action Spaces based on Maximum Entropy

Hongzhi Hua, Kaigui Wu, Guixuan Wen

PDF

Open Access

TL;DR

This paper introduces MAHSAC, a novel deep multi-agent reinforcement learning algorithm designed for environments with hybrid action spaces, demonstrating superior performance and stability in complex multi-agent tasks.

Contribution

The paper presents MAHSAC, extending Soft Actor-Critic to handle hybrid action spaces in multi-agent settings under the CTDE paradigm, filling a significant research gap.

Findings

01

MAHSAC shows fast training and high stability.

02

It outperforms existing methods in cooperative and competitive scenarios.

03

Demonstrates effectiveness in environments with hybrid action spaces.

Abstract

Multi-agent deep reinforcement learning has been applied to address a variety of complex problems with either discrete or continuous action spaces and achieved great success. However, most real-world environments cannot be described by only discrete action spaces or only continuous action spaces. And there are few works having ever utilized deep reinforcement learning (drl) to multi-agent problems with hybrid action spaces. Therefore, we propose a novel algorithm: Deep Multi-Agent Hybrid Soft Actor-Critic (MAHSAC) to fill this gap. This algorithm follows the centralized training but decentralized execution (CTDE) paradigm, and extend the Soft Actor-Critic algorithm (SAC) to handle hybrid action space problems in Multi-Agent environments based on maximum entropy. Our experiences are running on an easy multi-agent particle world with a continuous observation and discrete action space,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics