Self-Adaptive Physics-Informed Neural Networks using a Soft Attention Mechanism
Levi McClenny, Ulisses Braga-Neto

TL;DR
This paper introduces Self-Adaptive PINNs with trainable weights that focus learning on difficult regions of PDE solutions, improving accuracy and efficiency through a soft attention mechanism and Gaussian Process mapping.
Contribution
It proposes a novel adaptive training method for PINNs with trainable weights, a soft attention mechanism, and a Gaussian Process-based weight mapping, enhancing solution accuracy for stiff PDEs.
Findings
SA-PINNs outperform state-of-the-art PINNs in L2 error
SA-PINNs require fewer training epochs
Self-adaptive weights improve focus on difficult solution regions
Abstract
Physics-Informed Neural Networks (PINNs) have emerged recently as a promising application of deep neural networks to the numerical solution of nonlinear partial differential equations (PDEs). However, it has been recognized that adaptive procedures are needed to force the neural network to fit accurately the stubborn spots in the solution of "stiff" PDEs. In this paper, we propose a fundamentally new way to train PINNs adaptively, where the adaptation weights are fully trainable and applied to each training point individually, so the neural network learns autonomously which regions of the solution are difficult and is forced to focus on them. The self-adaptation weights specify a soft multiplicative soft attention mask, which is reminiscent of similar mechanisms used in computer vision. The basic idea behind these SA-PINNs is to make the weights increase as the corresponding losses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Nanofluid Flow and Heat Transfer · Fluid Dynamics and Turbulent Flows
MethodsNeural Tangent Kernel
