Overcoming the Loss Conditioning Bottleneck in Optimization-Based PDE Solvers: A Novel Well-Conditioned Loss Function
Wenbo Cao, Weiwei Zhang

TL;DR
This paper introduces a new loss function called Stabilized Gradient Residual (SGR) that improves the convergence speed and stability of optimization-based PDE solvers by better conditioning the loss landscape, outperforming traditional MSE loss.
Contribution
The paper proposes the SGR loss, which modulates the condition number of the system, leading to significantly faster convergence in optimization-based PDE solvers compared to MSE loss.
Findings
SGR loss achieves orders-of-magnitude faster convergence than MSE loss in ODIL.
SGR outperforms MSE loss in PINNs despite neural network nonlinearity.
Theoretical analysis links loss conditioning to solver efficiency.
Abstract
Optimization-based PDE solvers that minimize scalar loss functions have gained increasing attention in recent years. These methods either define the loss directly over discrete variables, as in Optimizing a Discrete Loss (ODIL), or indirectly through a neural network surrogate, as in Physics-Informed Neural Networks (PINNs). However, despite their promise, such methods often converge much more slowly than classical iterative solvers and are commonly regarded as inefficient. This work provides a theoretical insight, attributing the inefficiency to the use of the mean squared error (MSE) loss, which implicitly forms the normal equations, squares the condition number, and severely impairs optimization. To address this, we propose a novel Stabilized Gradient Residual (SGR) loss. By tuning a weight parameter, it flexibly modulates the condition number between the original system and its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
