Effects of sampling and horizon in predictive reinforcement learning

Pavel Osinenko; Dmitrii Dobriborsci

arXiv:2108.04802·math.DS·August 25, 2021

Effects of sampling and horizon in predictive reinforcement learning

Pavel Osinenko, Dmitrii Dobriborsci

PDF

Open Access 2 Repos

TL;DR

This paper investigates how sampling time and prediction horizon affect the performance of hybrid reinforcement learning and model-predictive control agents in a mobile robot parking task, revealing optimal hyper-parameter settings.

Contribution

It provides an empirical analysis of the impact of sampling and horizon hyper-parameters on hybrid RL-MPC agents in a real-world control scenario.

Findings

01

Sampling exhibits a 'sweet spot' behavior for optimal performance.

02

RL agents perform better at shorter prediction horizons.

03

Benchmarking against a simple MPC variant highlights the advantages of hybrid approaches.

Abstract

Plain reinforcement learning (RL) may be prone to loss of convergence, constraint violation, unexpected performance, etc. Commonly, RL agents undergo extensive learning stages to achieve acceptable functionality. This is in contrast to classical control algorithms which are typically model-based. An direction of research is the fusion of RL with such algorithms, especially model-predictive control (MPC). This, however, introduces new hyper-parameters related to the prediction horizon. Furthermore, RL is usually concerned with Markov decision processes. But the most of the real environments are not time-discrete. The factual physical setting of RL consists of a digital agent and a time-continuous dynamical system. There is thus, in fact, yet another hyper-parameter -- the agent sampling time. In this paper, we investigate the effects of prediction horizon and sampling of two hybrid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Reinforcement Learning in Robotics · Control Systems and Identification