Structured Online Learning-based Control of Continuous-time Nonlinear Systems
Milad Farsi, Jun Liu

TL;DR
This paper introduces a structured, online, model-based reinforcement learning method for continuous-time nonlinear systems that efficiently computes optimal control policies using a quadratic value function and matrix differential equations.
Contribution
It presents a novel online learning algorithm leveraging system structure and quadratic value functions to reduce computational complexity in continuous-time nonlinear control.
Findings
Successfully applied to four benchmark nonlinear systems.
Achieved regulation with bounded model prediction error.
Provided a computationally efficient update rule for control parameters.
Abstract
Model-based reinforcement learning techniques accelerate the learning task by employing a transition model to make predictions. In this paper, a model-based learning approach is presented that iteratively computes the optimal value function based on the most recent update of the model. Assuming a structured continuous-time model of the system in terms of a set of bases, we formulate an infinite horizon optimal control problem addressing a given control objective. The structure of the system along with a value function parameterized in the quadratic form provides a flexibility in analytically calculating an update rule for the parameters. Hence, a matrix differential equation of the parameters is obtained, where the solution is used to characterize the optimal feedback control in terms of the bases, at any time step. Moreover, the quadratic form of the value function suggests a compact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Model Reduction and Neural Networks · Mechanical Circulatory Support Devices
