Policy Iteration for Stationary Discounted Hamilton--Jacobi--Bellman Equations: A Viscosity Approach
Namkyeong Cho, Yeoneung Kim

TL;DR
This paper develops a viscosity-based policy iteration method for stationary Hamilton--Jacobi--Bellman equations, introducing a semi-discrete scheme with artificial viscosity to ensure convergence and stability.
Contribution
It proposes a novel regularization and discretization approach for policy iteration in stationary HJB equations, enabling convergence analysis and error estimates.
Findings
Semi-discrete PI converges monotonically and geometrically for fixed mesh size.
The method achieves a sharp $ orm{V^h - V}_ ext{L^ ext{infty}} \
Numerical experiments confirm theoretical convergence and error decay behaviors.
Abstract
We study policy iteration (PI) for deterministic infinite-horizon discounted optimal control problems, whose value function is characterized by a stationary Hamilton--Jacobi--Bellman (HJB) equation. At the PDE level, PI is fundamentally ill-posed: the improvement step requires pointwise evaluation of , which is not well defined for viscosity solutions, and thus the associated nonlinear operator cannot be interpreted in a stable functional sense. We develop a monotone semi-discrete formulation for the stationary discounted setting by introducing a space-discrete scheme with artificial viscosity of order . This regularization restores comparison, ensures monotonicity of the discrete operator, and yields a well-defined pointwise policy improvement via discrete gradients. Our analysis reveals a convergence mechanism fundamentally different from the finite-horizon case. For…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
