Dynamic Programming: From Local Optimality to Global Optimality

John Stachurski, Jingni Yang, and Ziyue Yang

arXiv:2411.11062 · math.OC · May 13, 2025

Open Access

TL;DR

This paper gives sufficient conditions under which a policy that is optimal from a single initial state is optimal from every state, with implications for policy-based algorithms that solve large-scale dynamic programs, including those that build policies from neural networks.

Contribution

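It provides sufficient conditions, related to reachability and irreducibility, under which optimality from a single state implies optimality from every state, and illustrates them with a gradient ascent algorithm over a policy space constructed from neural networks.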

Findings

01

Established sufficient conditions, related to reachability and irreducibility, under which optimality from a single state implies optimality from every state.

02

Applied the results to an optimal savings problem, solved by gradient ascent in a policy space constructed from neural networks.

03

Drew out implications for modern policy-based algorithms used to solve large-scale dynamic programs.

Abstract

In the theory of dynamic programming, an optimal policy is a policy whose lifetime value dominates that of all other policies from every possible initial condition in the state space. This raises a natural question: when does optimality from a single state imply optimality from every state? Working in a general setting, we provide sufficient conditions for this property that relate to reachability and irreducibility. Our results have significant implications for modern policy-based algorithms used to solve large-scale dynamic programs. We illustrate our findings by applying them to an optimal savings problem via an algorithm that implements gradient ascent in a policy space constructed from neural networks.
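The question posed in the abstract can be illustrated numerically in the simplest case. The sketch below is not the paper's algorithm or its general setting: it assumes a small finite-state, finite-action discounted problem with strictly positive transition probabilities (so every state is reachable from every other under any policy), and all names (`random_mdp`, `policy_value`, `best_at_0`) are hypothetical. It enumerates every stationary policy, picks the one with the highest lifetime value from state 0 alone, and checks how far that policy falls short of the best achievable value at the other states.

```python
import itertools
import random


def random_mdp(n_states, n_actions, seed=0):
    """Random rewards r[x][a] and transition rows P[x][a][y] (assumed setup)."""
    rng = random.Random(seed)
    r = [[rng.random() for _ in range(n_actions)] for _ in range(n_states)]
    P = []
    for _ in range(n_states):
        rows = []
        for _ in range(n_actions):
            # Strictly positive weights make every state reachable in one step.
            w = [rng.random() + 0.1 for _ in range(n_states)]
            total = sum(w)
            rows.append([wi / total for wi in w])
        P.append(rows)
    return r, P


def policy_value(policy, r, P, beta, iters=2000):
    """Lifetime value of a stationary policy via v <- r_sigma + beta * P_sigma v."""
    n = len(policy)
    v = [0.0] * n
    for _ in range(iters):
        v = [
            r[x][policy[x]]
            + beta * sum(P[x][policy[x]][y] * v[y] for y in range(n))
            for x in range(n)
        ]
    return v


n_states, n_actions, beta = 3, 2, 0.9
r, P = random_mdp(n_states, n_actions)

# Every stationary policy: one action index per state.
policies = list(itertools.product(range(n_actions), repeat=n_states))
values = {s: policy_value(s, r, P, beta) for s in policies}

# Best achievable lifetime value from each state, taken over all policies.
v_star = [max(values[s][x] for s in policies) for x in range(n_states)]

# Select a policy by looking at state 0 only.
best_at_0 = max(policies, key=lambda s: values[s][0])

# Largest shortfall of that policy relative to v_star across all states.
gap = max(abs(values[best_at_0][x] - v_star[x]) for x in range(n_states))
print("policy chosen from state 0:", best_at_0)
print("max shortfall across states:", gap)
```

Under these assumptions the shortfall should be zero up to floating-point error, which is the finite-state version of the property the abstract asks about. Brute-force enumeration is used only because the example is tiny; it is the kind of approach that does not scale and that policy-based methods are meant to replace.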

Taxonomy

Topics: Economic theories and models