Understanding and mitigating gradient pathologies in physics-informed neural networks
Sifan Wang, Yujun Teng, Paris Perdikaris

TL;DR
This paper investigates the training challenges of physics-informed neural networks, identifies issues caused by numerical stiffness, and proposes new training algorithms and architectures that significantly enhance predictive accuracy in physical systems.
Contribution
It introduces a gradient-based learning rate annealing algorithm and a resilient neural network architecture to address gradient pathologies in physics-informed neural networks.
Findings
Improved predictive accuracy by 50-100x across various physics problems.
Identified numerical stiffness as a key failure mode in PINNs.
Proposed methods effectively mitigate gradient pathologies.
Abstract
The widespread use of neural networks across different scientific domains often involves constraining them to satisfy certain symmetries, conservation laws, or other domain knowledge. Such constraints are often imposed as soft penalties during model training and effectively act as domain-specific regularizers of the empirical risk loss. Physics-informed neural networks is an example of this philosophy in which the outputs of deep neural networks are constrained to approximately satisfy a given set of partial differential equations. In this work we review recent advances in scientific machine learning with a specific focus on the effectiveness of physics-informed neural networks in predicting outcomes of physical systems and discovering hidden physics from noisy data. We will also identify and analyze a fundamental mode of failure of such approaches that is related to numerical stiffness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Nuclear Engineering Thermal-Hydraulics · Nuclear reactor physics and engineering
