Dual Cone Gradient Descent for Training Physics-Informed Neural Networks
Youngsik Hwang, Dong-Young Lim

TL;DR
This paper introduces Dual Cone Gradient Descent (DCGD), a novel optimization method for training physics-informed neural networks (PINNs), improving stability and accuracy by addressing gradient imbalance issues.
Contribution
The paper proposes DCGD, a new gradient adjustment framework ensuring stable training of PINNs by maintaining gradient directions within a dual cone region, with theoretical convergence analysis.
Findings
DCGD outperforms existing optimization algorithms on benchmark PDEs.
DCGD improves predictive accuracy and training stability for complex PDEs.
Combining DCGD with learning rate annealing and NTK enhances PINN performance.
Abstract
Physics-informed neural networks (PINNs) have emerged as a prominent approach for solving partial differential equations (PDEs) by minimizing a combined loss function that incorporates both boundary loss and PDE residual loss. Despite their remarkable empirical performance in various scientific computing tasks, PINNs often fail to generate reasonable solutions, and such pathological behaviors remain difficult to explain and resolve. In this paper, we identify that PINNs can be adversely trained when gradients of each loss function exhibit a significant imbalance in their magnitudes and present a negative inner product value. To address these issues, we propose a novel optimization framework, Dual Cone Gradient Descent (DCGD), which adjusts the direction of the updated gradient to ensure it falls within a dual cone region. This region is defined as a set of vectors where the inner…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications
MethodsSparse Evolutionary Training
