A fast iterative PDE-based algorithm for feedback controls of nonsmooth mean-field control problems

Christoph Reisinger; Wolfgang Stockinger; Yufei Zhang

arXiv:2108.06740·math.OC·May 3, 2024·SIAM J. Sci. Comput.·5 cites

Open Access

TL;DR

This paper introduces a fast iterative PDE-based algorithm for computing optimal feedback controls of mean-field control problems with nonsmooth costs. It combines forward-backward splitting with momentum acceleration, and solves the PDEs that supply the gradient by finite differences or a neural network approximation, depending on the state dimension.

Contribution

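It develops an accelerated gradient method for mean-field control problems with nonsmooth costs, in which the required gradient is evaluated through a coupled system of nonlocal linear PDEs solved by finite differences or neural networks, depending on the state dimension.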

Findings

01. The algorithm effectively captures control structures.

02. It is robust to parameter perturbations.

03. It performs well in high-dimensional settings.

Abstract

We propose a PDE-based accelerated gradient algorithm for optimal feedback controls of McKean-Vlasov dynamics that involve mean-field interactions both in the state and action. The method exploits a forward-backward splitting approach and iteratively refines the approximate controls based on the gradients of smooth costs, the proximal maps of nonsmooth costs, and dynamically updated momentum parameters. At each step, the state dynamics is approximated via a particle system, and the required gradient is evaluated through a coupled system of nonlocal linear PDEs. The latter is solved by finite difference approximation or neural network-based residual approximation, depending on the state dimension. We present exhaustive numerical experiments for low and high-dimensional mean-field control problems, including sparse stabilization of stochastic Cucker-Smale models, which reveal that our…
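To make the forward-backward splitting idea in the abstract concrete, here is a minimal finite-dimensional sketch. It is not the authors' PDE-based method: it swaps the mean-field control problem for an l1-regularized least-squares toy problem (an assumption made purely for illustration), and uses the standard FISTA momentum rule, which may differ from the paper's momentum update. The function names `soft_threshold`, `accelerated_proximal_gradient`, and `objective` are hypothetical. Each iteration takes a gradient step on the smooth cost, applies the proximal map of the nonsmooth cost, and then extrapolates with an updated momentum parameter.

```python
import numpy as np


def soft_threshold(v, tau):
    """Proximal map of tau * ||.||_1, i.e. componentwise shrinkage."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)


def objective(A, b, lam, x):
    """Smooth cost 0.5*||A x - b||^2 plus nonsmooth cost lam*||x||_1."""
    return 0.5 * np.sum((A @ x - b) ** 2) + lam * np.sum(np.abs(x))


def accelerated_proximal_gradient(A, b, lam, n_iter=500):
    """FISTA-type iteration for the toy problem (illustrative stand-in only)."""
    # Step size 1/L, where L = ||A||_2^2 is the Lipschitz constant
    # of the gradient of the smooth cost.
    step = 1.0 / np.linalg.norm(A, 2) ** 2
    x = np.zeros(A.shape[1])
    y = x.copy()  # extrapolated point
    t = 1.0       # momentum parameter
    for _ in range(n_iter):
        grad = A.T @ (A @ y - b)                              # forward (gradient) step
        x_new = soft_threshold(y - step * grad, step * lam)   # backward (proximal) step
        t_new = 0.5 * (1.0 + np.sqrt(1.0 + 4.0 * t * t))      # momentum update
        y = x_new + ((t - 1.0) / t_new) * (x_new - x)         # extrapolation
        x, t = x_new, t_new
    return x


# Toy data (assumed for illustration): recover a sparse vector.
rng = np.random.default_rng(0)
A = rng.standard_normal((40, 100))
x_true = np.zeros(100)
x_true[:5] = 1.0
b = A @ x_true
x_hat = accelerated_proximal_gradient(A, b, lam=0.1)
```

In the setting described in the abstract, the role of `grad` would be played by a gradient obtained from the coupled PDE system, and the iterate would be a feedback control rather than a vector; the sketch only mirrors the gradient, proximal, and momentum structure.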

Taxonomy

Topics: Model Reduction and Neural Networks · Gas Dynamics and Kinetic Theory · Markov Chains and Monte Carlo Methods