Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid   Action Spaces

Haotian Fu; Hongyao Tang; Jianye Hao; Zihan Lei; Yingfeng Chen,; Changjie Fan

arXiv:1903.04959·cs.LG·June 4, 2019·19 cites

Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces

Haotian Fu, Hongyao Tang, Jianye Hao, Zihan Lei, Yingfeng Chen,, Changjie Fan

PDF

Open Access

TL;DR

This paper introduces two novel deep multi-agent reinforcement learning algorithms designed for hybrid action spaces, demonstrating superior performance on complex simulated tasks compared to existing methods.

Contribution

The paper proposes Deep MAPQN and Deep MAHHQN algorithms to address multi-agent DRL with hybrid action spaces, a previously unexplored problem.

Findings

01

Both algorithms outperform existing independent deep parameterized Q-learning.

02

Effective in complex tasks like RoboCup Soccer and Ghost Story.

03

Show significant improvements in training efficiency and policy effectiveness.

Abstract

Deep Reinforcement Learning (DRL) has been applied to address a variety of cooperative multi-agent problems with either discrete action spaces or continuous action spaces. However, to the best of our knowledge, no previous work has ever succeeded in applying DRL to multi-agent problems with discrete-continuous hybrid (or parameterized) action spaces which is very common in practice. Our work fills this gap by proposing two novel algorithms: Deep Multi-Agent Parameterized Q-Networks (Deep MAPQN) and Deep Multi-Agent Hierarchical Hybrid Q-Networks (Deep MAHHQN). We follow the centralized training but decentralized execution paradigm: different levels of communication between different agents are used to facilitate the training process, while each agent executes its policy independently based on local observations during execution. Our empirical results on several challenging tasks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Sports Analytics and Performance · Autonomous Vehicle Technology and Safety

MethodsQ-Learning