DeepPAAC: A New Deep Galerkin Method for Principal-Agent Problems

Michael Ludkovski; Changgen Xie; Zimu Zhu

arXiv:2511.04309·math.NA·December 9, 2025

DeepPAAC: A New Deep Galerkin Method for Principal-Agent Problems

Michael Ludkovski, Changgen Xie, Zimu Zhu

PDF

Open Access

TL;DR

DeepPAAC introduces a novel deep learning algorithm to numerically solve complex principal-agent problems modeled by Hamilton-Jacobi-Bellman equations, accommodating multi-dimensional strategies and constraints.

Contribution

The paper develops DeepPAAC, a new deep learning-based Actor-Critic method specifically designed for solving high-dimensional principal-agent problems in continuous time.

Findings

01

DeepPAAC effectively handles multi-dimensional states and controls.

02

The method demonstrates convergence across five case studies.

03

Neural network architecture impacts solver performance.

Abstract

We consider numerical resolution of principal-agent (PA) problems in continuous time. We formulate a generic PA model with continuous and lump payments and a multi-dimensional strategy of the agent. To tackle the resulting Hamilton-Jacobi-Bellman equation with an implicit Hamiltonian we develop a novel deep learning method: the Deep Principal-Agent Actor Critic (DeepPAAC) Actor-Critic algorithm. DeepPAAC is able to handle multi-dimensional states and controls, as well as constraints. We investigate the role of the neural network architecture, training designs, loss functions, etc. on the convergence of the solver, presenting five different case studies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Reinforcement Learning in Robotics · Generative Adversarial Networks and Image Synthesis