Global Convergence of Adjoint-Optimized Neural PDEs

Konstantin Riedl; Justin Sirignano; Konstantinos Spiliopoulos

arXiv:2506.13633·cs.LG·March 30, 2026

Global Convergence of Adjoint-Optimized Neural PDEs

Konstantin Riedl, Justin Sirignano, Konstantinos Spiliopoulos

PDF

TL;DR

This paper proves the global convergence of adjoint gradient descent in training neural PDE models with infinite neurons and training time, addressing complex mathematical challenges.

Contribution

It provides the first convergence proof for neural PDE training involving infinite-width neural networks and non-convex optimization.

Findings

01

Neural PDE solutions converge to target data in the infinite limit.

02

The proof handles non-local kernel operators without spectral gaps.

03

Numerical studies validate the theoretical convergence results.

Abstract

Many engineering and scientific fields have recently become interested in modeling terms in partial differential equations (PDEs) with neural networks, which requires solving the inverse problem of learning neural network terms from observed data in order to approximate missing or unresolved physics in the PDE model. The resulting neural-network PDE model, being a function of the neural network parameters, can be calibrated to the available ground truth data by optimizing over the PDE using gradient descent, where the gradient is evaluated in a computationally efficient manner by solving an adjoint PDE. These neural PDE models have emerged as an important research area in scientific machine learning. In this paper, we study the convergence of the adjoint gradient descent optimization method for training neural PDE models in the limit where both the number of hidden units and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.