Hierarchical Reinforcement Learning for Optimal Agent Grouping in   Cooperative Systems

Liyuan Hu

arXiv:2501.06554·cs.LG·January 14, 2025

Hierarchical Reinforcement Learning for Optimal Agent Grouping in Cooperative Systems

Liyuan Hu

PDF

TL;DR

This paper introduces a hierarchical reinforcement learning framework for optimal agent grouping in cooperative multi-agent systems, combining CTDE, permutation-invariant networks, and option-critic algorithms for scalable and effective coordination.

Contribution

It proposes a novel hierarchical RL approach that jointly learns agent grouping and policies, enhancing coordination in cooperative multi-agent systems.

Findings

01

Effective agent grouping achieved in cooperative tasks

02

Scalable learning with CTDE paradigm

03

Improved coordination through permutation-invariant networks

Abstract

This paper presents a hierarchical reinforcement learning (RL) approach to address the agent grouping or pairing problem in cooperative multi-agent systems. The goal is to simultaneously learn the optimal grouping and agent policy. By employing a hierarchical RL framework, we distinguish between high-level decisions of grouping and low-level agents' actions. Our approach utilizes the CTDE (Centralized Training with Decentralized Execution) paradigm, ensuring efficient learning and scalable execution. We incorporate permutation-invariant neural networks to handle the homogeneity and cooperation among agents, enabling effective coordination. The option-critic algorithm is adapted to manage the hierarchical decision-making process, allowing for dynamic and optimal policy adjustments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.