A Convergence-Guaranteed Algorithm for Stochastic Optimal Control Problems
Mohsen Amidzadeh

TL;DR
This paper introduces a new algorithm for stochastic optimal control that avoids complex backward equations by using Malliavin calculus, reducing dimensionality issues and maintaining competitive performance.
Contribution
It presents a novel convergence-guaranteed iterative algorithm based on Malliavin calculus, eliminating the need for adjoint processes in stochastic control.
Findings
Reduces complexity compared to traditional methods
Alleviates the curse of dimensionality
Maintains competitive performance in empirical tests
Abstract
Stochastic Optimal Control Problems (SOCPs) plays a major role in the sequential decision-making challenges. There exist various iterative algorithms, under framework of stochastic maximum principle, that sequentially find the optimal control decision. However, they are based on the adjoint sensitivity analysis that necessitates simulation of an adjoint process, typically a backward stochastic differential equation (SDE) that must simultaneously be adapted to a forward filtration and satisfy a terminal condition, which substantially increases complexity and exacerbates the curse of dimensionality. We instead develop a stochastic maximum principle based on the Malliavin calculus, which enables us to devise an iterative algorithm without need of an adjoint process. Our algorithm however needs the Malliavin derivative that can be efficiently computed based on a forward simulator. Empirical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Reinforcement Learning in Robotics · Stochastic processes and financial applications
