Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning

Filippos Christianos; Lukas Sch\"afer; Stefano V. Albrecht

arXiv:2006.07169·cs.MA·May 20, 2021·22 cites

Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning

Filippos Christianos, Lukas Sch\"afer, Stefano V. Albrecht

PDF

Open Access 3 Repos 1 Video

TL;DR

This paper introduces SEAC, a shared experience actor-critic method that enhances exploration in multi-agent reinforcement learning, especially in sparse reward settings, leading to faster learning and higher returns.

Contribution

The paper presents a novel experience sharing approach within an actor-critic framework for multi-agent RL, improving exploration efficiency in sparse reward environments.

Findings

01

SEAC outperforms baselines and state-of-the-art algorithms in multiple environments.

02

Experience sharing accelerates learning and achieves higher returns.

03

In difficult tasks, sharing experience can be the difference between success and failure.

Abstract

Exploration in multi-agent reinforcement learning is a challenging problem, especially in environments with sparse rewards. We propose a general method for efficient exploration by sharing experience amongst agents. Our proposed algorithm, called Shared Experience Actor-Critic (SEAC), applies experience sharing in an actor-critic framework. We evaluate SEAC in a collection of sparse-reward multi-agent environments and find that it consistently outperforms two baselines and two state-of-the-art algorithms by learning in fewer steps and converging to higher returns. In some harder environments, experience sharing makes the difference between learning to solve the task and not learning at all.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Artificial Intelligence in Games