Efficient Episodic Memory Utilization of Cooperative Multi-Agent   Reinforcement Learning

Hyungho Na; Yunkyeong Seo; Il-chul Moon

arXiv:2403.01112·cs.LG·March 8, 2024·3 cites

Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning

Hyungho Na, Yunkyeong Seo, Il-chul Moon

PDF

Open Access 1 Repo

TL;DR

This paper introduces EMU, a novel method for cooperative multi-agent reinforcement learning that leverages semantic episodic memory and a new reward structure to accelerate learning and avoid local optima, demonstrated in complex environments.

Contribution

The paper proposes EMU, combining semantic episodic memory with a new reward incentive, to improve learning efficiency and policy quality in multi-agent reinforcement learning.

Findings

01

EMU accelerates learning in complex tasks.

02

EMU outperforms conventional methods in StarCraft II.

03

The episodic incentive improves policy convergence.

Abstract

In cooperative multi-agent reinforcement learning (MARL), agents aim to achieve a common goal, such as defeating enemies or scoring a goal. Existing MARL algorithms are effective but still require significant learning time and often get trapped in local optima by complex tasks, subsequently failing to discover a goal-reaching policy. To address this, we introduce Efficient episodic Memory Utilization (EMU) for MARL, with two primary objectives: (a) accelerating reinforcement learning by leveraging semantically coherent memory from an episodic buffer and (b) selectively promoting desirable transitions to prevent local convergence. To achieve (a), EMU incorporates a trainable encoder/decoder structure alongside MARL, creating coherent memory embeddings that facilitate exploratory memory recall. To achieve (b), EMU introduces a novel reward structure called episodic incentive based on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hyunghona/emu
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsQ-Learning