Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent   Reinforcement Learning

Yangkun Chen; Kai Yang; Jian Tao; Jiafei Lyu

arXiv:2412.15517·cs.LG·December 23, 2024

Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning

Yangkun Chen, Kai Yang, Jian Tao, Jiafei Lyu

PDF

Open Access 2 Repos

TL;DR

This paper introduces MANGER, a novel method that uses observation novelty to dynamically reuse data in multi-agent reinforcement learning, significantly improving efficiency and behavioral diversity in complex cooperative tasks.

Contribution

The paper presents MANGER, a new approach employing Random Network Distillation to enhance sample reuse and diversity in multi-agent reinforcement learning.

Findings

01

Improved sample efficiency in MARL tasks.

02

Enhanced exploration and behavioral diversity.

03

Superior performance in Google Research Football and StarCraft II tasks.

Abstract

Recently, deep Multi-Agent Reinforcement Learning (MARL) has demonstrated its potential to tackle complex cooperative tasks, pushing the boundaries of AI in collaborative environments. However, the efficiency of these systems is often compromised by inadequate sample utilization and a lack of diversity in learning strategies. To enhance MARL performance, we introduce a novel sample reuse approach that dynamically adjusts policy updates based on observation novelty. Specifically, we employ a Random Network Distillation (RND) network to gauge the novelty of each agent's current state, assigning additional sample update opportunities based on the uniqueness of the data. We name our method Multi-Agent Novelty-GuidEd sample Reuse (MANGER). This method increases sample efficiency and promotes exploration and diverse agent behaviors. Our evaluations confirm substantial improvements in MARL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics