Randomized dynamic programming principle and Feynman-Kac representation for optimal control of McKean-Vlasov dynamics
Erhan Bayraktar, Andrea Cosso, Huy\^en Pham (LPMA)

TL;DR
This paper develops a probabilistic framework using a Feynman-Kac representation and randomized dynamic programming for optimal control of McKean-Vlasov dynamics, allowing for open-loop controls and degenerate diffusions.
Contribution
It introduces a novel randomized dynamic programming principle for McKean-Vlasov control problems with open-loop controls, extending previous methods.
Findings
Proves the value function admits a nonlinear Feynman-Kac representation.
Establishes the dynamic programming principle for the control problem.
Shows the randomized control problem has the same value as the original.
Abstract
We analyze a stochastic optimal control problem, where the state process follows a McKean-Vlasov dynamics and the diffusion coefficient can be degenerate. We prove that its value function V admits a nonlinear Feynman-Kac representation in terms of a class of forward-backward stochastic differential equations, with an autonomous forward process. We exploit this probabilistic representation to rigorously prove the dynamic programming principle (DPP) for V. The Feynman-Kac representation we obtain has an important role beyond its intermediary role in obtaining our main result: in fact it would be useful in developing probabilistic numerical schemes for V. The DPP is important in obtaining a characterization of the value function as a solution of a non-linear partial differential equation (the so-called Hamilton-Jacobi-Belman equation), in this case on the Wasserstein space of measures. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Gas Dynamics and Kinetic Theory · Optimization and Variational Analysis
