Overcoming the Loss Conditioning Bottleneck in Optimization-Based PDE Solvers: A Novel Well-Conditioned Loss Function

Wenbo Cao; Weiwei Zhang

arXiv:2508.02692·cs.CE·August 6, 2025

Overcoming the Loss Conditioning Bottleneck in Optimization-Based PDE Solvers: A Novel Well-Conditioned Loss Function

Wenbo Cao, Weiwei Zhang

PDF

TL;DR

This paper introduces a new loss function called Stabilized Gradient Residual (SGR) that improves the convergence speed and stability of optimization-based PDE solvers by better conditioning the loss landscape, outperforming traditional MSE loss.

Contribution

The paper proposes the SGR loss, which modulates the condition number of the system, leading to significantly faster convergence in optimization-based PDE solvers compared to MSE loss.

Findings

01

SGR loss achieves orders-of-magnitude faster convergence than MSE loss in ODIL.

02

SGR outperforms MSE loss in PINNs despite neural network nonlinearity.

03

Theoretical analysis links loss conditioning to solver efficiency.

Abstract

Optimization-based PDE solvers that minimize scalar loss functions have gained increasing attention in recent years. These methods either define the loss directly over discrete variables, as in Optimizing a Discrete Loss (ODIL), or indirectly through a neural network surrogate, as in Physics-Informed Neural Networks (PINNs). However, despite their promise, such methods often converge much more slowly than classical iterative solvers and are commonly regarded as inefficient. This work provides a theoretical insight, attributing the inefficiency to the use of the mean squared error (MSE) loss, which implicitly forms the normal equations, squares the condition number, and severely impairs optimization. To address this, we propose a novel Stabilized Gradient Residual (SGR) loss. By tuning a weight parameter, it flexibly modulates the condition number between the original system and its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.