Memory Augmented Self-Play

Shagun Sodhani; Vardaan Pahuja

arXiv:1805.11016·cs.LG·June 4, 2018

Memory Augmented Self-Play

Shagun Sodhani, Vardaan Pahuja

PDF

Open Access 1 Repo

TL;DR

This paper introduces a memory-augmented self-play framework that enhances exploration and performance of reinforcement learning agents by leveraging external memory to store past experiences, leading to more diverse tasks and faster learning.

Contribution

It proposes a novel memory-augmented self-play method that improves exploration efficiency and performance in reinforcement learning by integrating external memory into the self-play process.

Findings

01

Memory augmentation leads to more diverse self-play tasks.

02

Agents with memory outperform those without in exploration speed.

03

Pretrained agents in the memory-augmented setting perform better.

Abstract

Self-play is an unsupervised training procedure which enables the reinforcement learning agents to explore the environment without requiring any external rewards. We augment the self-play setting by providing an external memory where the agent can store experience from the previous tasks. This enables the agent to come up with more diverse self-play tasks resulting in faster exploration of the environment. The agent pretrained in the memory augmented self-play setting easily outperforms the agent pretrained in no-memory self-play setting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shagunsodhani/memory-augmented-self-play
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Mind wandering and attention · Ferroelectric and Negative Capacitance Devices