Prediction and Control with Temporal Segment Models

Nikhil Mishra; Pieter Abbeel; Igor Mordatch

arXiv:1703.04070 · cs.LG · July 14, 2017 · 22 cites

TL;DR

This paper presents a deep generative modeling approach for learning complex nonlinear system dynamics over temporal segments, enabling stable long-horizon predictions and effective trajectory optimization.

Contribution

The paper combines convolutional autoregressive models and variational autoencoders to model the distribution over future state trajectories, conditioned on past states, past actions, and planned future actions.

Findings

01. Stable long-horizon predictions for stochastic systems

02. Effective modeling of uncertainty and noise effects

03. Improved sample efficiency in trajectory optimization

Abstract

We introduce a method for learning the dynamics of complex nonlinear systems based on deep generative models over temporal segments of states and actions. Unlike dynamics models that operate over individual discrete timesteps, we learn the distribution over future state trajectories conditioned on past state, past action, and planned future action trajectories, as well as a latent prior over action trajectories. Our approach is based on convolutional autoregressive models and variational autoencoders. It makes stable and accurate predictions over long horizons for complex, stochastic systems, effectively expressing uncertainty and modeling the effects of collisions, sensory noise, and action delays. The learned dynamics model and action prior can be used for end-to-end, fully differentiable trajectory optimization and model-based policy optimization, which we use to evaluate the…
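
The conditioning structure described in the abstract can be illustrated with a small data-preparation sketch. It slices one recorded rollout into pairs of past and future segments, so that a model could be trained to predict the future states from the past states, past actions, and future actions. The function name `make_segments`, the `horizon` parameter, the array shapes, and the use of overlapping windows are all assumptions made for illustration; the abstract does not specify how segments are built.

```python
import numpy as np


def make_segments(states, actions, horizon):
    """Slice one rollout into past/future segment pairs of length `horizon`.

    Hypothetical helper, not taken from the paper.
    states:  array of shape (T, state_dim)
    actions: array of shape (T, action_dim)
    Returns a dict of arrays with leading dimension T - 2 * horizon + 1.
    """
    states = np.asarray(states)
    actions = np.asarray(actions)
    T = states.shape[0]
    if actions.shape[0] != T:
        raise ValueError("states and actions must have the same length")
    if T - 2 * horizon + 1 <= 0:
        raise ValueError("rollout too short for the requested horizon")

    past_x, past_u, fut_u, fut_x = [], [], [], []
    # t is the first timestep of the future segment.
    for t in range(horizon, T - horizon + 1):
        past_x.append(states[t - horizon:t])   # conditioning: past states
        past_u.append(actions[t - horizon:t])  # conditioning: past actions
        fut_u.append(actions[t:t + horizon])   # conditioning: planned actions
        fut_x.append(states[t:t + horizon])    # prediction target

    return {
        "past_states": np.stack(past_x),
        "past_actions": np.stack(past_u),
        "future_actions": np.stack(fut_u),
        "future_states": np.stack(fut_x),
    }


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(50, 4))  # toy rollout: 50 steps, 4-dim state
    u = rng.normal(size=(50, 2))  # 2-dim action
    segs = make_segments(x, u, horizon=8)
    for name, arr in segs.items():
        print(name, arr.shape)
```

Under these assumptions, a segment-level model would take the first three arrays as input and output a distribution over `future_states`, rather than predicting one timestep at a time.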

Taxonomy

Topics: Gaussian Processes and Bayesian Inference · Reinforcement Learning in Robotics · Human Pose and Action Recognition