Dual Natural Gradient Descent for Scalable Training of Physics-Informed Neural Networks
Anas Jnini, Flavio Vella

TL;DR
This paper introduces Dual Natural Gradient Descent (D-NGD), a scalable second-order optimization method for Physics-Informed Neural Networks that significantly improves training efficiency and accuracy at large scales.
Contribution
The paper proposes D-NGD, a novel method that computes natural-gradient steps in a residual space, enabling scalable, efficient training of large PINNs with improved accuracy.
Findings
Scales to networks with 12.8 million parameters
Achieves 10-1000x lower final error than first-order methods
Enables training of large PINNs on a single GPU
Abstract
Natural-gradient methods markedly accelerate the training of Physics-Informed Neural Networks (PINNs), yet their Gauss--Newton update must be solved in the parameter space, incurring a prohibitive time complexity, where is the number of network trainable weights. We show that exactly the same step can instead be formulated in a generally smaller residual space of size , where each residual class (e.g. PDE interior, boundary, initial data) contributes collocation points of output dimension . Building on this insight, we introduce \textit{Dual Natural Gradient Descent} (D-NGD). D-NGD computes the Gauss--Newton step in residual space, augments it with a geodesic-acceleration correction at negligible extra cost, and provides both a dense direct solver for modest and a Nystrom-preconditioned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
