Alchemy: A benchmark and analysis toolkit for meta-reinforcement   learning agents

Jane X. Wang; Michael King; Nicolas Porcel; Zeb Kurth-Nelson; Tina; Zhu; Charlie Deck; Peter Choy; Mary Cassin; Malcolm Reynolds; Francis Song,; Gavin Buttimore; David P. Reichert; Neil Rabinowitz; Loic Matthey; Demis; Hassabis; Alexander Lerchner; Matthew Botvinick

arXiv:2102.02926·cs.LG·October 22, 2021·6 cites

Alchemy: A benchmark and analysis toolkit for meta-reinforcement learning agents

Jane X. Wang, Michael King, Nicolas Porcel, Zeb Kurth-Nelson, Tina, Zhu, Charlie Deck, Peter Choy, Mary Cassin, Malcolm Reynolds, Francis Song,, Gavin Buttimore, David P. Reichert, Neil Rabinowitz, Loic Matthey, Demis, Hassabis, Alexander Lerchner, Matthew Botvinick

PDF

Open Access 1 Repo

TL;DR

Alchemy introduces a complex, procedurally generated 3D benchmark for meta-reinforcement learning, enabling detailed analysis of agent capabilities and limitations in structured, dynamic environments.

Contribution

The paper presents Alchemy, a novel, richly structured 3D benchmark for meta-RL, along with analysis tools and initial agent evaluations.

Findings

01

Meta-learning agents exhibit specific failures on Alchemy

02

Alchemy provides a challenging environment for meta-RL research

03

Tools and sample trajectories are released publicly

Abstract

There has been rapidly growing interest in meta-learning as a method for increasing the flexibility and sample efficiency of reinforcement learning. One problem in this area of research, however, has been a scarcity of adequate benchmark tasks. In general, the structure underlying past benchmarks has either been too simple to be inherently interesting, or too ill-defined to support principled analysis. In the present work, we introduce a new benchmark for meta-RL research, emphasizing transparency and potential for in-depth analysis as well as structural richness. Alchemy is a 3D video game, implemented in Unity, which involves a latent causal structure that is resampled procedurally from episode to episode, affording structure learning, online inference, hypothesis testing and action sequencing based on abstract domain knowledge. We evaluate a pair of powerful RL agents on Alchemy and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

deepmind/dm_alchemy
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Viral Infectious Diseases and Gene Expression in Insects · Machine Learning and Data Classification