When and why PINNs fail to train: A neural tangent kernel perspective
Sifan Wang, Xinling Yu, Paris Perdikaris

TL;DR
This paper investigates why physics-informed neural networks (PINNs) sometimes fail to train effectively by analyzing their behavior through the Neural Tangent Kernel (NTK) framework, leading to new insights and an adaptive training algorithm.
Contribution
It derives the NTK for PINNs, proves its convergence properties, and introduces an adaptive gradient descent method based on NTK eigenvalues to improve training.
Findings
NTK of PINNs converges to a deterministic kernel in the infinite-width limit.
Discrepancy in convergence rates of loss components affects training success.
The proposed adaptive algorithm improves training stability and convergence.
Abstract
Physics-informed neural networks (PINNs) have lately received great attention thanks to their flexibility in tackling a wide range of forward and inverse problems involving partial differential equations. However, despite their noticeable empirical success, little is known about how such constrained neural networks behave during their training via gradient descent. More importantly, even less is known about why such models sometimes fail to train at all. In this work, we aim to investigate these questions through the lens of the Neural Tangent Kernel (NTK); a kernel that captures the behavior of fully-connected neural networks in the infinite width limit during training via gradient descent. Specifically, we derive the NTK of PINNs and prove that, under appropriate conditions, it converges to a deterministic kernel that stays constant during training in the infinite-width limit. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Neural Networks and Applications · Nuclear reactor physics and engineering
MethodsNeural Tangent Kernel
