Kronecker-Factored Approximate Curvature for Physics-Informed Neural Networks
Felix Dangel, Johannes M\"uller, Marius Zeinhofer

TL;DR
This paper introduces a Kronecker-factored approximation method for second-order optimization in physics-informed neural networks, significantly improving scalability and performance over existing methods.
Contribution
It extends KFAC to PDE-based loss functions using Taylor-mode automatic differentiation, enabling efficient training of larger PINNs.
Findings
KFAC-based optimizers are competitive with second-order methods on small problems.
The approach scales better to larger networks and PDEs.
KFAC outperforms first-order methods and LBFGS in experiments.
Abstract
Physics-informed neural networks (PINNs) are infamous for being hard to train. Recently, second-order methods based on natural gradient and Gauss-Newton methods have shown promising performance, improving the accuracy achieved by first-order methods by several orders of magnitude. While promising, the proposed methods only scale to networks with a few thousand parameters due to the high computational cost to evaluate, store, and invert the curvature matrix. We propose Kronecker-factored approximate curvature (KFAC) for PINN losses that greatly reduces the computational cost and allows scaling to much larger networks. Our approach goes beyond the established KFAC for traditional deep learning problems as it captures contributions from a PDE's differential operator that are crucial for optimization. To establish KFAC for such losses, we use Taylor-mode automatic differentiation to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Numerical Analysis Techniques · Model Reduction and Neural Networks · Neural Networks and Applications
