Undiscounted Control Policy Generation for Continuous-Valued Optimal Control by Approximate Dynamic Programming
Jonathan Lock, Tomas McKelvey

TL;DR
This paper introduces a numerical method based on approximate dynamic programming for generating control policies in undiscounted, infinite-horizon nonlinear optimal control problems with continuous states, allowing for convergence of both policy and system state.
Contribution
It extends existing methods by enabling convergence criteria that include both control policy and system state, applicable to undiscounted, bounded, continuous cost functions.
Findings
Method successfully generates control policies for complex nonlinear systems.
Convergence of policy and state is achieved without discounting.
MATLAB implementation and examples demonstrate practical applicability.
Abstract
We present a numerical method for generating the state-feedback control policy associated with general undiscounted, constant-setpoint, infinite-horizon, nonlinear optimal control problems with continuous state variables. The method is based on approximate dynamic programming, and is closely related to approximate policy iteration. Existing methods typically terminate based on the convergence of the control policy and either require a discounted problem formulation or demand the cost function to lie in a specific subclass of functions. The presented method extends on existing termination criteria by requiring both the control policy and the resulting system state to converge, allowing for use with undiscounted cost functions that are bounded and continuous. This paper defines the numerical method, derives the relevant underlying mathematical properties, and validates the numerical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy, Environment, and Transportation Policies · Adaptive Dynamic Programming Control · Advanced Control Systems Optimization
