PufferLib: Making Reinforcement Learning Libraries and Environments Play Nice

Joseph Suarez

arXiv:2406.12905 · cs.LG · June 21, 2024

TL;DR

PufferLib simplifies integrating reinforcement learning environments and libraries by providing compatibility wrappers and fast vectorization, enabling scalable training across diverse benchmarks and complex simulators.

Contribution

The paper introduces PufferLib, a library that provides one-line compatibility wrappers and fast vectorization so that existing reinforcement learning libraries can train on a wide range of environments.

Findings

01

Enables seamless use of popular RL libraries with various environments.

02

Provides fast vectorization for accelerated training.

03

Supports numerous benchmarks and complex simulators.

Abstract

You have an environment, a model, and a reinforcement learning library that are designed to work together but don't. PufferLib makes them play nice. The library provides one-line environment wrappers that eliminate common compatibility problems and fast vectorization to accelerate training. With PufferLib, you can use familiar libraries like CleanRL and SB3 to scale from classic benchmarks like Atari and Procgen to complex simulators like NetHack and Neural MMO. We release pip packages and prebuilt images with dependencies for dozens of environments. All of our code is free and open-source software under the MIT license, complete with baselines, documentation, and support at pufferai.github.io.
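To make the two ideas in the abstract concrete, here is a minimal sketch in plain Python. It does not use PufferLib and does not reproduce its API: `ToyDictEnv`, `flatten`, `FlatObsWrapper`, and `SerialVecEnv` are hypothetical names invented for illustration. It assumes one common compatibility problem is an environment that returns nested observations while the learning library expects flat vectors, and it shows the simplest possible (serial, not fast) form of vectorization.

```python
from typing import Any, Dict, List, Tuple


class ToyDictEnv:
    """Hypothetical environment whose observations are nested dicts."""

    def __init__(self, horizon: int = 3) -> None:
        self.horizon = horizon
        self.t = 0

    def reset(self) -> Dict[str, Any]:
        self.t = 0
        return self._obs()

    def step(self, action: int) -> Tuple[Dict[str, Any], float, bool]:
        self.t += 1
        reward = float(action)          # toy reward: just echo the action
        done = self.t >= self.horizon   # episode ends after `horizon` steps
        return self._obs(), reward, done

    def _obs(self) -> Dict[str, Any]:
        return {"position": [float(self.t), 0.0], "inventory": {"gold": 1.0}}


def flatten(obs: Any) -> List[float]:
    """Recursively flatten nested dicts/lists into one flat list of floats.

    Dict keys are visited in sorted order so the layout is deterministic.
    """
    if isinstance(obs, dict):
        out: List[float] = []
        for key in sorted(obs):
            out.extend(flatten(obs[key]))
        return out
    if isinstance(obs, (list, tuple)):
        out = []
        for item in obs:
            out.extend(flatten(item))
        return out
    return [float(obs)]


class FlatObsWrapper:
    """Illustrative wrapper (an assumption, not PufferLib's implementation):
    exposes the wrapped env with flat observations."""

    def __init__(self, env: ToyDictEnv) -> None:
        self.env = env

    def reset(self) -> List[float]:
        return flatten(self.env.reset())

    def step(self, action: int) -> Tuple[List[float], float, bool]:
        obs, reward, done = self.env.step(action)
        return flatten(obs), reward, done


class SerialVecEnv:
    """Steps several envs in a loop and auto-resets the finished ones.

    This is the slow baseline form of vectorization, shown only to
    illustrate the batched interface a learning library would consume.
    """

    def __init__(self, envs: List[FlatObsWrapper]) -> None:
        self.envs = list(envs)

    def reset(self) -> List[List[float]]:
        return [env.reset() for env in self.envs]

    def step(
        self, actions: List[int]
    ) -> Tuple[List[List[float]], List[float], List[bool]]:
        obs_batch, rewards, dones = [], [], []
        for env, action in zip(self.envs, actions):
            obs, reward, done = env.step(action)
            if done:
                obs = env.reset()  # return the first obs of the next episode
            obs_batch.append(obs)
            rewards.append(reward)
            dones.append(done)
        return obs_batch, rewards, dones
```

In this sketch, wrapping is a single call per environment, for example `SerialVecEnv([FlatObsWrapper(ToyDictEnv()) for _ in range(2)])`, after which `reset()` and `step(actions)` return batches of flat observations. A faster vectorizer would typically run environments in parallel workers rather than in a Python loop.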

Code & Models

Repositories

emerge-lab/gpudrive
jax

Taxonomy

Topics: E-Learning and Knowledge Management · ICT in Developing Communities

Methods: Lib