Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
John McLeod, Hrvoje Stojic, Vincent Adam, Dongho Kim, Jordi Grau-Moya,, Peter Vrancx, Felix Leibfried

TL;DR
Bellman introduces a comprehensive, modular toolbox for model-based reinforcement learning in TensorFlow, facilitating research and application development in sample-efficient, real-world scenarios.
Contribution
It is the first thoroughly designed and tested model-based RL toolbox with modular components and evaluation tools, filling a significant gap in RL software resources.
Findings
Provides a flexible environment for combining models and agents.
Enables systematic comparison of model-based and model-free methods.
Supports research into uncertainty-aware environment models.
Abstract
In the past decade, model-free reinforcement learning (RL) has provided solutions to challenging domains such as robotics. Model-based RL shows the prospect of being more sample-efficient than model-free methods in terms of agent-environment interactions, because the model enables to extrapolate to unseen situations. In the more recent past, model-based methods have shown superior results compared to model-free methods in some challenging domains with non-linear state transitions. At the same time, it has become apparent that RL is not market-ready yet and that many real-world applications are going to require model-based approaches, because model-free methods are too sample-inefficient and show poor performance in early stages of training. The latter is particularly important in industry, e.g. in production systems that directly impact a company's revenue. This demonstrates the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Evolutionary Algorithms and Applications
