Data-Efficient Reinforcement Learning with Probabilistic Model   Predictive Control

Sanket Kamthe; Marc Peter Deisenroth

arXiv:1706.06491·cs.SY·February 23, 2018·110 cites

Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control

Sanket Kamthe, Marc Peter Deisenroth

PDF

Open Access 1 Repo

TL;DR

This paper introduces a data-efficient reinforcement learning framework using probabilistic model predictive control with Gaussian Processes, effectively handling constraints and reducing environment interactions.

Contribution

It presents a novel model-based RL approach that incorporates probabilistic models and theoretical guarantees for constrained environments, improving data efficiency.

Findings

01

Achieves state-of-the-art data efficiency in RL tasks.

02

Effectively handles state and control constraints.

03

Provides theoretical guarantees for optimality with Gaussian Process models.

Abstract

Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, especially with the advent of deep neural networks. However, the majority of autonomous RL algorithms require a large number of interactions with the environment. A large number of interactions may be impractical in many real-world applications, such as robotics, and many practical systems have to obey limitations in the form of state space or control constraints. To reduce the number of system interactions while simultaneously handling constraints, we propose a model-based RL framework based on probabilistic Model Predictive Control (MPC). In particular, we propose to learn a probabilistic transition model using Gaussian Processes (GPs) to incorporate model uncertainty into long-term predictions, thereby, reducing the impact of model errors. We then use MPC to find a control sequence that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SimonRennotte/Data-Efficient-Reinforcement-Learning-with-Probabilistic-Model-Predictive-Control
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Fault Detection and Control Systems · Control Systems and Identification

MethodsGaussian Process