Dual Cone Gradient Descent for Training Physics-Informed Neural Networks

Youngsik Hwang; Dong-Young Lim

arXiv:2409.18426·cs.LG·January 16, 2025·2 cites

Dual Cone Gradient Descent for Training Physics-Informed Neural Networks

Youngsik Hwang, Dong-Young Lim

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Dual Cone Gradient Descent (DCGD), a novel optimization method for training physics-informed neural networks (PINNs), improving stability and accuracy by addressing gradient imbalance issues.

Contribution

The paper proposes DCGD, a new gradient adjustment framework ensuring stable training of PINNs by maintaining gradient directions within a dual cone region, with theoretical convergence analysis.

Findings

01

DCGD outperforms existing optimization algorithms on benchmark PDEs.

02

DCGD improves predictive accuracy and training stability for complex PDEs.

03

Combining DCGD with learning rate annealing and NTK enhances PINN performance.

Abstract

Physics-informed neural networks (PINNs) have emerged as a prominent approach for solving partial differential equations (PDEs) by minimizing a combined loss function that incorporates both boundary loss and PDE residual loss. Despite their remarkable empirical performance in various scientific computing tasks, PINNs often fail to generate reasonable solutions, and such pathological behaviors remain difficult to explain and resolve. In this paper, we identify that PINNs can be adversely trained when gradients of each loss function exhibit a significant imbalance in their magnitudes and present a negative inner product value. To address these issues, we propose a novel optimization framework, Dual Cone Gradient Descent (DCGD), which adjusts the direction of the updated gradient to ensure it falls within a dual cone region. This region is defined as a set of vectors where the inner…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

youngsikhwang/Dual-Cone-Gradient-Descent
jaxOfficial

Videos

Dual Cone Gradient Descent for Training Physics-Informed Neural Networks· slideslive

Taxonomy

TopicsNeural Networks and Applications

MethodsSparse Evolutionary Training