Alchemy: A benchmark and analysis toolkit for meta-reinforcement learning agents
Jane X. Wang, Michael King, Nicolas Porcel, Zeb Kurth-Nelson, Tina, Zhu, Charlie Deck, Peter Choy, Mary Cassin, Malcolm Reynolds, Francis Song,, Gavin Buttimore, David P. Reichert, Neil Rabinowitz, Loic Matthey, Demis, Hassabis, Alexander Lerchner, Matthew Botvinick

TL;DR
Alchemy introduces a complex, procedurally generated 3D benchmark for meta-reinforcement learning, enabling detailed analysis of agent capabilities and limitations in structured, dynamic environments.
Contribution
The paper presents Alchemy, a novel, richly structured 3D benchmark for meta-RL, along with analysis tools and initial agent evaluations.
Findings
Meta-learning agents exhibit specific failures on Alchemy
Alchemy provides a challenging environment for meta-RL research
Tools and sample trajectories are released publicly
Abstract
There has been rapidly growing interest in meta-learning as a method for increasing the flexibility and sample efficiency of reinforcement learning. One problem in this area of research, however, has been a scarcity of adequate benchmark tasks. In general, the structure underlying past benchmarks has either been too simple to be inherently interesting, or too ill-defined to support principled analysis. In the present work, we introduce a new benchmark for meta-RL research, emphasizing transparency and potential for in-depth analysis as well as structural richness. Alchemy is a 3D video game, implemented in Unity, which involves a latent causal structure that is resampled procedurally from episode to episode, affording structure learning, online inference, hypothesis testing and action sequencing based on abstract domain knowledge. We evaluate a pair of powerful RL agents on Alchemy and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Viral Infectious Diseases and Gene Expression in Insects · Machine Learning and Data Classification
