Optimizing the Optimizer for Physics-Informed Neural Networks and Kolmogorov-Arnold Networks
Elham Kiyani, Khemraj Shukla, Jorge F. Urb\'an, J\'er\^ome Darbon, George Em Karniadakis

TL;DR
This paper explores advanced quasi-Newton optimization methods, such as SSBFGS and SSBroyden, to improve the training efficiency and accuracy of Physics-Informed Neural Networks and Kolmogorov-Arnold Networks for solving complex PDEs.
Contribution
It introduces and systematically evaluates self-scaled quasi-Newton optimizers, demonstrating significant accuracy improvements over traditional methods in SciML applications.
Findings
Enhanced training efficiency with self-scaled quasi-Newton methods.
Achieved state-of-the-art accuracy on multiple PDE benchmarks.
Demonstrated effectiveness for Deep Operator Network architectures.
Abstract
Physics-Informed Neural Networks (PINNs) have revolutionized the computation of PDE solutions by integrating partial differential equations (PDEs) into the neural network's training process as soft constraints, becoming an important component of the scientific machine learning (SciML) ecosystem. More recently, physics-informed Kolmogorv-Arnold networks (PIKANs) have also shown to be effective and comparable in accuracy with PINNs. In their current implementation, both PINNs and PIKANs are mainly optimized using first-order methods like Adam, as well as quasi-Newton methods such as BFGS and its low-memory variant, L-BFGS. However, these optimizers often struggle with highly non-linear and non-convex loss landscapes, leading to challenges such as slow convergence, local minima entrapment, and (non)degenerate saddle points. In this study, we investigate the performance of Self-Scaled BFGS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Computational Physics and Python Applications
MethodsAdam
