HARP: Human-Assisted Regrouping with Permutation Invariant Critic for   Multi-Agent Reinforcement Learning

Huawen Hu; Enze Shi; Chenxi Yue; Shuocun Yang; Zihao Wu; Yiwei Li,; Tianyang Zhong; Tuo Zhang; Tianming Liu; Shu Zhang

arXiv:2409.11741·cs.LG·September 19, 2024

HARP: Human-Assisted Regrouping with Permutation Invariant Critic for Multi-Agent Reinforcement Learning

Huawen Hu, Enze Shi, Chenxi Yue, Shuocun Yang, Zihao Wu, Yiwei Li,, Tianyang Zhong, Tuo Zhang, Tianming Liu, Shu Zhang

PDF

Open Access

TL;DR

HARP is a multi-agent reinforcement learning framework that combines automatic regrouping with minimal human assistance, enabling non-experts to effectively guide multi-agent collaboration and improve task performance.

Contribution

HARP introduces a novel permutation invariant critic and dynamic regrouping mechanism for multi-agent RL, reducing human workload and enhancing scalability.

Findings

01

Effective in multiple collaboration scenarios

02

Leverages limited human guidance to improve performance

03

Allows non-experts to contribute valuable suggestions

Abstract

Human-in-the-loop reinforcement learning integrates human expertise to accelerate agent learning and provide critical guidance and feedback in complex fields. However, many existing approaches focus on single-agent tasks and require continuous human involvement during the training process, significantly increasing the human workload and limiting scalability. In this paper, we propose HARP (Human-Assisted Regrouping with Permutation Invariant Critic), a multi-agent reinforcement learning framework designed for group-oriented tasks. HARP integrates automatic agent regrouping with strategic human assistance during deployment, enabling and allowing non-experts to offer effective guidance with minimal intervention. During training, agents dynamically adjust their groupings to optimize collaborative task completion. When deployed, they actively seek human assistance and utilize the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsFocus