Accelerated Continuous-Time Approximate Dynamic Programming via Data-Assisted Hybrid Control
Daniel E. Ochoa, Jorge I. Poveda

TL;DR
This paper presents a novel continuous-time control algorithm that accelerates convergence using dynamic momentum and data-assisted techniques, eliminating the need for persistent excitation and ensuring stability.
Contribution
It introduces the first momentum-based actor-critic algorithm for continuous-time systems that leverages past data and hybrid resets to improve convergence and stability.
Findings
Accelerates convergence of continuous-time control systems.
Eliminates the need for persistent excitation conditions.
Ensures asymptotic stability using hybrid dynamical systems theory.
Abstract
We introduce a new closed-loop architecture for the online solution of approximate optimal control problems in the context of continuous-time systems. Specifically, we introduce the first algorithm that incorporates dynamic momentum in actor-critic structures to control continuous-time dynamic plants with an affine structure in the input. By incorporating dynamic momentum in our algorithm, we are able to accelerate the convergence properties of the closed-loop system, achieving superior transient performance compared to traditional gradient-descent based techniques. In addition, by leveraging the existence of past recorded data with sufficiently rich information properties, we dispense with the persistence of excitation condition traditionally imposed on the regressors of the critic and the actor. Given that our continuous-time momentum-based dynamics also incorporate periodic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Advanced Control Systems Optimization · Reinforcement Learning in Robotics
