Convergence of Proximal Policy Gradient Method for Problems with Control Dependent Diffusion Coefficients

Ashley Davey; Harry Zheng

arXiv:2505.18379·math.OC·May 27, 2025

Convergence of Proximal Policy Gradient Method for Problems with Control Dependent Diffusion Coefficients

Ashley Davey, Harry Zheng

PDF

Open Access 1 Repo

TL;DR

This paper proves the convergence of a proximal policy gradient method for complex stochastic control problems with control-dependent diffusion, introducing scalable algorithms and validating their effectiveness through numerical examples.

Contribution

It establishes convergence conditions for the proximal policy gradient method in control problems with control-dependent diffusion and develops scalable deep learning algorithms.

Findings

01

Proximal policy gradient converges linearly under certain conditions.

02

Algorithms achieve high accuracy in high-dimensional stochastic control.

03

Numerical results validate theoretical convergence guarantees.

Abstract

We prove convergence of the proximal policy gradient method for a class of constrained stochastic control problems with control in both the drift and diffusion of the state process. The problem requires either the running or terminal cost to be strongly convex, but other terms may be non-convex. The inclusion of control-dependent diffusion introduces additional complexity in regularity analysis of the associated backward stochastic differential equation. We provide sufficient conditions under which the control iterates converge linearly to the optimal control, by deriving representations and estimates of solutions to the adjoint backward stochastic differential equations. We introduce numerical algorithms that implement this method using deep learning and ordinary differential equation based techniques. These approaches enable high accuracy and scalability for stochastic control…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ashley-davey/ppgm
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRadiative Heat Transfer Studies · Optimization and Variational Analysis · Differential Equations and Numerical Methods