Structured Online Learning-based Control of Continuous-time Nonlinear   Systems

Milad Farsi; Jun Liu

arXiv:2010.10577·math.OC·October 22, 2020

Structured Online Learning-based Control of Continuous-time Nonlinear Systems

Milad Farsi, Jun Liu

PDF

Open Access

TL;DR

This paper introduces a structured, online, model-based reinforcement learning method for continuous-time nonlinear systems that efficiently computes optimal control policies using a quadratic value function and matrix differential equations.

Contribution

It presents a novel online learning algorithm leveraging system structure and quadratic value functions to reduce computational complexity in continuous-time nonlinear control.

Findings

01

Successfully applied to four benchmark nonlinear systems.

02

Achieved regulation with bounded model prediction error.

03

Provided a computationally efficient update rule for control parameters.

Abstract

Model-based reinforcement learning techniques accelerate the learning task by employing a transition model to make predictions. In this paper, a model-based learning approach is presented that iteratively computes the optimal value function based on the most recent update of the model. Assuming a structured continuous-time model of the system in terms of a set of bases, we formulate an infinite horizon optimal control problem addressing a given control objective. The structure of the system along with a value function parameterized in the quadratic form provides a flexibility in analytically calculating an update rule for the parameters. Hence, a matrix differential equation of the parameters is obtained, where the solution is used to characterize the optimal feedback control in terms of the bases, at any time step. Moreover, the quadratic form of the value function suggests a compact…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Model Reduction and Neural Networks · Mechanical Circulatory Support Devices