Neural Co-state Projection Regulator: A Model-free Paradigm for Real-time Optimal Control with Input Constraints

Lihan Lian; Uduak Inyang-Udoh

arXiv:2508.00283·eess.SY·August 4, 2025

Neural Co-state Projection Regulator: A Model-free Paradigm for Real-time Optimal Control with Input Constraints

Lihan Lian, Uduak Inyang-Udoh

PDF

Open Access

TL;DR

The paper introduces a model-free neural control framework based on Pontryagin's Minimum Principle that efficiently computes real-time optimal controls with input constraints, demonstrating superior generalization and sample efficiency over RL.

Contribution

It proposes the neural co-state projection regulator (NCPR), a novel model-free approach that integrates PMP with neural networks for real-time constrained optimal control.

Findings

01

Outperforms RL in generalization to unseen states and constraints

02

Achieves better sampling efficiency compared to RL

03

Demonstrates effectiveness on unicycle and pendulum tasks

Abstract

Learning-based approaches, notably Reinforcement Learning (RL), have shown promise for solving optimal control tasks without explicit system models. However, these approaches are often sample-inefficient, sensitive to reward design and hyperparameters, and prone to poor generalization, especially under input constraints. To address these challenges, we introduce the neural co-state projection regulator (NCPR), a model-free learning-based optimal control framework that is grounded in Pontryagin's Minimum Principle (PMP) and capable of solving quadratic regulator problems in nonlinear control-affine systems with input constraints. In this framework, a neural network (NN) is trained in a self-supervised setting to take the current state of the system as input and predict a finite-horizon trajectory of projected co-states (i.e., the co-state weighted by the system's input gain).…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning