Stabilizing Optimal Control for Nonlinear Stochastic Systems: A Parametric Gradient-Based Approach
Yuji Ito, Kenji Fujimoto

TL;DR
This paper introduces a parametric gradient-based method with a penalty function to design stabilizing suboptimal controllers for nonlinear stochastic systems with uncertainties, ensuring robust stability and improved performance.
Contribution
It presents a novel gradient-based approach with explicit cost and gradient formulations, addressing the challenges of stochastic parameters and stability in nonlinear control.
Findings
The method guarantees robust stability of the controlled system.
Numerical simulations show improved performance over baseline methods.
The approach effectively handles stochastic uncertainties in control design.
Abstract
This study proposes a method for designing stabilizing suboptimal controllers for nonlinear stochastic systems. These systems include time-invariant stochastic parameters that represent uncertainty of dynamics, posing two key difficulties in optimal control. Firstly, the time-invariant stochastic nature violates the principle of optimality and Hamilton-Jacobi equations, which are fundamental tools for solving optimal control problems. Secondly, nonlinear systems must be robustly stabilized against these stochastic parameters. To overcome these difficulties simultaneously, this study presents a parametric-gradient-based method with a penalty function. A controller and cost function are parameterized using basis functions, and a gradient method is employed to optimize the controller by minimizing the parameterized cost function. Crucial challenges in this approach are parameterizing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Variational Analysis · Advanced Optimization Algorithms Research · Advanced Control Systems Optimization
