Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow

John McLeod; Hrvoje Stojic; Vincent Adam; Dongho Kim; Jordi Grau-Moya,; Peter Vrancx; Felix Leibfried

arXiv:2103.14407·cs.LG·April 14, 2021·1 cites

Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow

John McLeod, Hrvoje Stojic, Vincent Adam, Dongho Kim, Jordi Grau-Moya,, Peter Vrancx, Felix Leibfried

PDF

Open Access 2 Repos

TL;DR

Bellman introduces a comprehensive, modular toolbox for model-based reinforcement learning in TensorFlow, facilitating research and application development in sample-efficient, real-world scenarios.

Contribution

It is the first thoroughly designed and tested model-based RL toolbox with modular components and evaluation tools, filling a significant gap in RL software resources.

Findings

01

Provides a flexible environment for combining models and agents.

02

Enables systematic comparison of model-based and model-free methods.

03

Supports research into uncertainty-aware environment models.

Abstract

In the past decade, model-free reinforcement learning (RL) has provided solutions to challenging domains such as robotics. Model-based RL shows the prospect of being more sample-efficient than model-free methods in terms of agent-environment interactions, because the model enables to extrapolate to unseen situations. In the more recent past, model-based methods have shown superior results compared to model-free methods in some challenging domains with non-linear state transitions. At the same time, it has become apparent that RL is not market-ready yet and that many real-world applications are going to require model-based approaches, because model-free methods are too sample-inefficient and show poor performance in early stages of training. The latter is particularly important in industry, e.g. in production systems that directly impact a company's revenue. This demonstrates the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Evolutionary Algorithms and Applications