Learning-Based Stable Optimal Control for Infinite-Time Nonlinear Regulation Problems
Han Wang, Di Wu, Lin Cheng, Shengping Gong, Xu Huang

TL;DR
This paper introduces a learning-based framework that guarantees stability in infinite-time nonlinear regulation control by combining optimal control theory, data generation, and Lyapunov stability, demonstrated through simulations.
Contribution
It develops a novel method that ensures stability in learning-based optimal control for nonlinear systems by integrating Lyapunov conditions with data-driven approaches.
Findings
Achieves near-optimal regulation control in simulations
Ensures stability of learned policies through Lyapunov conditions
Provides an efficient data generation method for state space coverage
Abstract
Infinite-time nonlinear optimal regulation control is widely utilized in aerospace engineering as a systematic method for synthesizing stable controllers. However, conventional methods often rely on linearization hypothesis, while recent learning-based approaches rarely consider stability guarantees. This paper proposes a learning-based framework to learn a stable optimal controller for nonlinear optimal regulation problems. First, leveraging the equivalence between Pontryagin Maximum Principle (PMP) and Hamilton-Jacobi-Bellman (HJB) equation, we improve the backward generation of optimal examples (BGOE) method for infinite-time optimal regulation problems. A state-transition-matrix-guided data generation method is then proposed to efficiently generate a complete dataset that covers the desired state space. Finally, we incorporate the Lyapunov stability condition into the learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Model Reduction and Neural Networks · Control and Stability of Dynamical Systems
