Dual Natural Gradient Descent for Scalable Training of Physics-Informed Neural Networks

Anas Jnini; Flavio Vella

arXiv:2505.21404·cs.LG·October 9, 2025

Dual Natural Gradient Descent for Scalable Training of Physics-Informed Neural Networks

Anas Jnini, Flavio Vella

PDF

Open Access

TL;DR

This paper introduces Dual Natural Gradient Descent (D-NGD), a scalable second-order optimization method for Physics-Informed Neural Networks that significantly improves training efficiency and accuracy at large scales.

Contribution

The paper proposes D-NGD, a novel method that computes natural-gradient steps in a residual space, enabling scalable, efficient training of large PINNs with improved accuracy.

Findings

01

Scales to networks with 12.8 million parameters

02

Achieves 10-1000x lower final error than first-order methods

03

Enables training of large PINNs on a single GPU

Abstract

Natural-gradient methods markedly accelerate the training of Physics-Informed Neural Networks (PINNs), yet their Gauss--Newton update must be solved in the parameter space, incurring a prohibitive $O (n^{3})$ time complexity, where $n$ is the number of network trainable weights. We show that exactly the same step can instead be formulated in a generally smaller residual space of size $m = \sum_{γ} N_{γ} d_{γ}$ , where each residual class $γ$ (e.g. PDE interior, boundary, initial data) contributes $N_{γ}$ collocation points of output dimension $d_{γ}$ . Building on this insight, we introduce \textit{Dual Natural Gradient Descent} (D-NGD). D-NGD computes the Gauss--Newton step in residual space, augments it with a geodesic-acceleration correction at negligible extra cost, and provides both a dense direct solver for modest $m$ and a Nystrom-preconditioned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications