Deep Reinforcement Learning with Embedded LQR Controllers

Wouter Caarls

arXiv:2101.07175·cs.RO·June 23, 2021

Deep Reinforcement Learning with Embedded LQR Controllers

Wouter Caarls

PDF

TL;DR

This paper explores integrating Linear Quadratic Regulator (LQR) control with reinforcement learning to improve control performance in reaching tasks, addressing issues like chattering and enhancing generalization.

Contribution

It introduces methods that embed LQR control into reinforcement learning algorithms, enabling better performance and robustness in control tasks.

Findings

01

Adding LQR control improves RL performance

02

Embedding LQR into discrete actions is particularly effective

03

LQR integration helps mitigate chattering in reaching tasks

Abstract

Reinforcement learning is a model-free optimal control method that optimizes a control policy through direct interaction with the environment. For reaching tasks that end in regulation, popular discrete-action methods are not well suited due to chattering in the goal state. We compare three different ways to solve this problem through combining reinforcement learning with classical LQR control. In particular, we introduce a method that integrates LQR control into the action set, allowing generalization and avoiding fixing the computed control in the replay memory if it is based on learned dynamics. We also embed LQR control into a continuous-action method. In all cases, we show that adding LQR control can improve performance, although the effect is more profound if it can be used to augment a discrete action set.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.