Near Optimality of Lipschitz and Smooth Policies in Controlled Diffusions
Somnath Pradhan, Serdar Yuksel

TL;DR
This paper proves that Lipschitz and smooth policies can approximate optimal control policies for controlled diffusions across various cost criteria, bridging theoretical optimality and practical policy design.
Contribution
It establishes the density of Lipschitz and smooth policies in the space of optimal policies under the Borkar topology for controlled diffusions.
Findings
Lipschitz and smooth policies are dense in the space of optimal policies.
Optimal costs are continuous with respect to policies under the Borkar topology.
Smooth/Lipschitz policies can approximate optimal policies arbitrarily closely.
Abstract
For optimal control of diffusions under several criteria, due to computational or analytical reasons, many studies have a apriori assumed control policies to be Lipschitz or smooth, often with no rigorous analysis on whether this restriction entails loss. While optimality of Markov/stationary Markov policies for expected finite horizon/infinite horizon (discounted/ergodic) cost and cost-up-to-exit time optimal control problems can be established under certain technical conditions, an optimal solution is typically only measurable in the state (and time, if the horizon is finite) with no apriori additional structural properties. In this paper, building on our recent work [S. Pradhan and S. Y\"uksel, Continuity of cost in Borkar control topology and implications on discrete space and time approximations for controlled diffusions under several criteria, Electronic Journal of Probability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Mathematical Modeling in Engineering
