XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX

Alexander Nikulin; Vladislav Kurenkov; Ilya Zisman; Artem Agarkov; Viacheslav Sinii; Sergey Kolesnikov

arXiv:2312.12044 · cs.LG · November 20, 2024 · 2 cites

TL;DR

XLand-MiniGrid introduces a scalable, JAX-based suite of grid-world environments for meta-reinforcement learning, enabling large-scale experimentation with diverse tasks and efficient training on accelerators.

Contribution

XLand-MiniGrid provides a scalable, easy-to-use platform for meta-RL research, with pre-sampled benchmarks and ready-made baselines, combining the diversity of XLand with the simplicity of MiniGrid.

Findings

01. Baselines reach millions of steps per second during training.

02. The proposed benchmarks are challenging for current algorithms.

03. The platform facilitates rapid experimentation.

Abstract

Inspired by the diversity and depth of XLand and the simplicity and minimalism of MiniGrid, we present XLand-MiniGrid, a suite of tools and grid-world environments for meta-reinforcement learning research. Written in JAX, XLand-MiniGrid is designed to be highly scalable and can potentially run on GPU or TPU accelerators, democratizing large-scale experimentation with limited resources. Along with the environments, XLand-MiniGrid provides pre-sampled benchmarks with millions of unique tasks of varying difficulty and easy-to-use baselines that allow users to quickly start training adaptive agents. In addition, we have conducted a preliminary analysis of scaling and generalization, showing that our baselines are capable of reaching millions of steps per second during training and validating that the proposed benchmarks are challenging. XLand-MiniGrid is open-source and available at…
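To illustrate why a JAX implementation can scale this way, here is a minimal sketch of a batched grid-world step. It is not the XLand-MiniGrid API: the `step` function, the `GRID_SIZE` constant, and the reward rule are hypothetical stand-ins chosen for illustration. The only point is that a pure single-environment step function can be vectorized with `jax.vmap` and compiled with `jax.jit`, so thousands of environments advance in one call.

```python
import jax
import jax.numpy as jnp

GRID_SIZE = 5  # hypothetical toy grid, not an XLand-MiniGrid layout

# Action index -> (row, col) displacement: up, right, down, left.
MOVES = jnp.array([[-1, 0], [0, 1], [1, 0], [0, -1]])


def step(pos, goal, action):
    """Advance one toy environment by one step; a pure function of its inputs."""
    # Move the agent and keep it inside the grid bounds.
    new_pos = jnp.clip(pos + MOVES[action], 0, GRID_SIZE - 1)
    # Reward 1.0 when the agent lands on the goal cell, else 0.0.
    reward = jnp.all(new_pos == goal).astype(jnp.float32)
    return new_pos, reward


# vmap batches the single-environment step over a leading axis;
# jit compiles the batched step for the available device (CPU, GPU, or TPU).
batched_step = jax.jit(jax.vmap(step))

num_envs = 1024
key = jax.random.PRNGKey(0)
pos_key, goal_key, act_key = jax.random.split(key, 3)

# Each environment gets its own start and goal, loosely mirroring "many tasks".
positions = jax.random.randint(pos_key, (num_envs, 2), 0, GRID_SIZE)
goals = jax.random.randint(goal_key, (num_envs, 2), 0, GRID_SIZE)
actions = jax.random.randint(act_key, (num_envs,), 0, 4)

new_positions, rewards = batched_step(positions, goals, actions)
print(new_positions.shape, rewards.shape)
```

Because the step function carries no hidden state, the same code runs unchanged for one environment or for many; throughput then depends mainly on batch size and hardware.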

Code & Models

Videos

XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX · SlidesLive

Taxonomy

Topics: Advanced Control Systems Optimization · Distributed and Parallel Computing Systems · Reinforcement Learning in Robotics