Learning Accurate Long-term Dynamics for Model-based Reinforcement   Learning

Nathan O. Lambert; Albert Wilcox; Howard Zhang; Kristofer S. J.; Pister; Roberto Calandra

arXiv:2012.09156·cs.LG·September 2, 2021

Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning

Nathan O. Lambert, Albert Wilcox, Howard Zhang, Kristofer S. J., Pister, Roberto Calandra

PDF

1 Repo

TL;DR

This paper introduces a trajectory-based model for long-term dynamics prediction in robotic systems, which improves accuracy and sample efficiency over traditional methods by directly predicting future states at specified time indices.

Contribution

The paper proposes a novel trajectory-based modeling approach that enhances long-term prediction accuracy and stability in model-based reinforcement learning.

Findings

01

Trajectory-based models outperform traditional models in long-term prediction accuracy.

02

The approach improves sample efficiency in robotic tasks.

03

It enables direct prediction of task rewards from the model.

Abstract

Accurately predicting the dynamics of robotic systems is crucial for model-based control and reinforcement learning. The most common way to estimate dynamics is by fitting a one-step ahead prediction model and using it to recursively propagate the predicted state distribution over long horizons. Unfortunately, this approach is known to compound even small prediction errors, making long-term predictions inaccurate. In this paper, we propose a new parametrization to supervised learning on state-action data to stably predict at longer horizons -- that we call a trajectory-based model. This trajectory-based model takes an initial state, a future time index, and control parameters as inputs, and directly predicts the state at the future time index. Experimental results in simulated and real-world robotic tasks show that trajectory-based models yield significantly more accurate long term…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/mbrl-lib
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.