Gaussian process policy iteration with additive Schwarz acceleration for forward and inverse HJB and mean field game problems
Xianjin Yang, Jingguo Zhang

TL;DR
This paper introduces a Gaussian Process-based policy iteration method for solving forward and inverse HJB and MFG problems, enhanced with additive Schwarz acceleration to boost computational efficiency.
Contribution
The paper presents a novel GP-based policy iteration framework with explicit solutions and Schwarz acceleration, improving efficiency in solving complex HJB and MFG problems.
Findings
Schwarz acceleration significantly improves convergence speed.
Explicit GP solutions eliminate the need for numerical optimization.
Method effectively handles both forward and inverse problems.
Abstract
We propose a Gaussian Process (GP)-based policy iteration framework for addressing both forward and inverse problems in Hamilton--Jacobi--Bellman (HJB) equations and mean field games (MFGs). Policy iteration is formulated as an alternating procedure between solving the value function under a fixed control policy and updating the policy based on the resulting value function. By exploiting the linear structure of GPs for function approximation, each policy evaluation step admits an explicit closed-form solution, eliminating the need for numerical optimization. To improve convergence, we incorporate the additive Schwarz acceleration as a preconditioning step following each policy update. Numerical experiments demonstrate the effectiveness of Schwarz acceleration in improving computational efficiency.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research · Stochastic processes and financial applications
MethodsGreedy Policy Search · Gaussian Process
