Neural Co-state Projection Regulator: A Model-free Paradigm for Real-time Optimal Control with Input Constraints
Lihan Lian, Uduak Inyang-Udoh

TL;DR
The paper introduces a model-free neural control framework based on Pontryagin's Minimum Principle that efficiently computes real-time optimal controls with input constraints, demonstrating superior generalization and sample efficiency over RL.
Contribution
It proposes the neural co-state projection regulator (NCPR), a novel model-free approach that integrates PMP with neural networks for real-time constrained optimal control.
Findings
Outperforms RL in generalization to unseen states and constraints
Achieves better sampling efficiency compared to RL
Demonstrates effectiveness on unicycle and pendulum tasks
Abstract
Learning-based approaches, notably Reinforcement Learning (RL), have shown promise for solving optimal control tasks without explicit system models. However, these approaches are often sample-inefficient, sensitive to reward design and hyperparameters, and prone to poor generalization, especially under input constraints. To address these challenges, we introduce the neural co-state projection regulator (NCPR), a model-free learning-based optimal control framework that is grounded in Pontryagin's Minimum Principle (PMP) and capable of solving quadratic regulator problems in nonlinear control-affine systems with input constraints. In this framework, a neural network (NN) is trained in a self-supervised setting to take the current state of the system as input and predict a finite-horizon trajectory of projected co-states (i.e., the co-state weighted by the system's input gain).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning
