Convergence of Proximal Policy Gradient Method for Problems with Control Dependent Diffusion Coefficients
Ashley Davey, Harry Zheng

TL;DR
This paper proves the convergence of a proximal policy gradient method for complex stochastic control problems with control-dependent diffusion, introducing scalable algorithms and validating their effectiveness through numerical examples.
Contribution
It establishes convergence conditions for the proximal policy gradient method in control problems with control-dependent diffusion and develops scalable deep learning algorithms.
Findings
Proximal policy gradient converges linearly under certain conditions.
Algorithms achieve high accuracy in high-dimensional stochastic control.
Numerical results validate theoretical convergence guarantees.
Abstract
We prove convergence of the proximal policy gradient method for a class of constrained stochastic control problems with control in both the drift and diffusion of the state process. The problem requires either the running or terminal cost to be strongly convex, but other terms may be non-convex. The inclusion of control-dependent diffusion introduces additional complexity in regularity analysis of the associated backward stochastic differential equation. We provide sufficient conditions under which the control iterates converge linearly to the optimal control, by deriving representations and estimates of solutions to the adjoint backward stochastic differential equations. We introduce numerical algorithms that implement this method using deep learning and ordinary differential equation based techniques. These approaches enable high accuracy and scalability for stochastic control…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiative Heat Transfer Studies · Optimization and Variational Analysis · Differential Equations and Numerical Methods
