A Mixed Value and Policy Iteration Method for Stochastic Control with   Universally Measurable Policies

Huizhen Yu; Dimitri P. Bertsekas

arXiv:1308.3814·math.OC·February 26, 2016·Math. Oper. Res.·2 cites

A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies

Huizhen Yu, Dimitri P. Bertsekas

PDF

Open Access

TL;DR

This paper introduces a mixed value and policy iteration method for stochastic control models with complex measurability issues, ensuring convergence in various cost scenarios and addressing limitations of standard policy iteration.

Contribution

The paper proposes a novel mixed iteration method that overcomes measurability challenges in stochastic control, with proven convergence for infinite horizon problems and new insights into value iteration.

Findings

01

Convergence of the proposed method in discounted and undiscounted cases.

02

A new convergence theorem for value iteration with bounded initial functions.

03

The method effectively handles models with Borel spaces and measurable policies.

Abstract

We consider stochastic control models with Borel spaces and universally measurable policies. For such models the standard policy iteration is known to have difficult measurability issues and cannot be carried out in general. We present a mixed value and policy iteration method that circumvents this difficulty. The method allows the use of stationary policies in computing the optimal cost function, in a manner that resembles policy iteration. It can also be used to address similar difficulties of policy iteration in the context of upper and lower semicontinuous models. We analyze the convergence of the method in infinite horizon total cost problems, for the discounted case where the one-stage costs are bounded, and for the undiscounted case where the one-stage costs are nonpositive or nonnegative. For undiscounted total cost problems with nonnegative one-stage costs, we also give a new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Economic theories and models · Stochastic processes and financial applications