Brax -- A Differentiable Physics Engine for Large Scale Rigid Body   Simulation

C. Daniel Freeman; Erik Frey; Anton Raichuk; Sertan Girgin; Igor; Mordatch; Olivier Bachem

arXiv:2106.13281·cs.RO·June 28, 2021·21 cites

Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation

C. Daniel Freeman, Erik Frey, Anton Raichuk, Sertan Girgin, Igor, Mordatch, Olivier Bachem

PDF

Open Access 1 Repo 1 Video

TL;DR

Brax is an open-source, high-performance, differentiable physics engine built in JAX that enables scalable reinforcement learning by integrating environment simulation and learning algorithms on accelerators.

Contribution

Brax introduces a novel, efficient physics simulation library in JAX with integrated reinforcement learning algorithms, facilitating large-scale, accelerated policy training.

Findings

01

Achieves high performance and parallelism on accelerators.

02

Supports seamless integration of environment simulation and learning algorithms.

03

Enables training of policies on MuJoCo-like tasks in minutes.

Abstract

We present Brax, an open source library for rigid body simulation with a focus on performance and parallelism on accelerators, written in JAX. We present results on a suite of tasks inspired by the existing reinforcement learning literature, but remade in our engine. Additionally, we provide reimplementations of PPO, SAC, ES, and direct policy optimization in JAX that compile alongside our environments, allowing the learning algorithm and the environment processing to occur on the same device, and to scale seamlessly on accelerators. Finally, we include notebooks that facilitate training of performant policies on common OpenAI Gym MuJoCo-like tasks in minutes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google/brax
jaxOfficial

Videos

[ML News] GitHub Copilot - Copyright, GPL, Patents & more | Brickit LEGO app | Distill goes on break· youtube

Taxonomy

TopicsReinforcement Learning in Robotics · Robotic Locomotion and Control · Modeling and Simulation Systems

MethodsConvolution · Average Pooling · Global Average Pooling · Dilated Convolution · Entropy Regularization · 1x1 Convolution · Proximal Policy Optimization · Switchable Atrous Convolution