Ensemble based Closed-Loop Optimal Control using Physics-Informed Neural Networks
Jostein Barry-Straume, Adwait D. Verulkar, Arash Sarshar, Andrey A. Popov, and Adrian Sandu

TL;DR
This paper introduces an ensemble physics-informed neural network framework for solving the Hamilton-Jacobi-Bellman equation to design optimal control systems, enabling effective closed-loop control of nonlinear systems without stabilizer terms.
Contribution
It proposes a multistage ensemble PINN approach for learning optimal control policies directly from the HJB equation, avoiding stabilizer terms used in prior methods.
Findings
Successfully controls a nonlinear system with noisy states.
Demonstrates ensemble and singular control effectiveness.
Handles varying initial conditions and perturbations.
Abstract
The objective of designing a control system is to steer a dynamical system with a control signal, guiding it to exhibit the desired behavior. The Hamilton-Jacobi-Bellman (HJB) partial differential equation offers a framework for optimal control system design. However, numerical solutions to this equation are computationally intensive, and analytical solutions are frequently unavailable. Knowledge-guided machine learning methodologies, such as physics-informed neural networks (PINNs), offer new alternative approaches that can alleviate the difficulties of solving the HJB equation numerically. This work presents a multistage ensemble framework to learn the optimal cost-to-go, and subsequently the corresponding optimal control signal, through the HJB equation. Prior PINN-based approaches rely on a stabilizing the HJB enforcement during training. Our framework does not use stabilizer terms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Adaptive Dynamic Programming Control · Neural Networks and Reservoir Computing
