A fast iterative PDE-based algorithm for feedback controls of nonsmooth mean-field control problems
Christoph Reisinger, Wolfgang Stockinger, Yufei Zhang

TL;DR
This paper introduces a fast PDE-based iterative algorithm for computing optimal feedback controls in complex mean-field control problems involving nonsmooth costs, leveraging forward-backward splitting and neural network approximations.
Contribution
It develops a novel accelerated gradient method combining PDE solvers and neural networks for high-dimensional mean-field control problems with nonsmooth costs.
Findings
Algorithm effectively captures control structures.
Achieves robustness against parameter perturbations.
Performs well in high-dimensional scenarios.
Abstract
We propose a PDE-based accelerated gradient algorithm for optimal feedback controls of McKean-Vlasov dynamics that involve mean-field interactions both in the state and action. The method exploits a forward-backward splitting approach and iteratively refines the approximate controls based on the gradients of smooth costs, the proximal maps of nonsmooth costs, and dynamically updated momentum parameters. At each step, the state dynamics is approximated via a particle system, and the required gradient is evaluated through a coupled system of nonlocal linear PDEs. The latter is solved by finite difference approximation or neural network-based residual approximation, depending on the state dimension. We present exhaustive numerical experiments for low and high-dimensional mean-field control problems, including sparse stabilization of stochastic Cucker-Smale models, which reveal that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Gas Dynamics and Kinetic Theory · Markov Chains and Monte Carlo Methods
