Are Deep Learning Based Hybrid PDE Solvers Reliable? Why Training Paradigms and Update Strategies Matter

Yuhan Wu; Jan Willem van Beek; Victorita Dolean; Alexander Heinlein

arXiv:2602.06842 · math.NA · February 9, 2026

Open Access

TL;DR

This paper investigates the reliability of deep learning-based hybrid PDE solvers. It shows that training paradigms and update strategies critically influence convergence and accuracy, and it proposes a physics-aware Anderson acceleration that improves convergence.

Contribution

It demonstrates that training objectives and update strategies significantly affect solver reliability and introduces physics-aware Anderson acceleration to enhance convergence.

Findings

01. Physics-aware Anderson acceleration improves convergence

02. Training objectives aligned with physics reduce residuals

03. Classical Anderson acceleration is unsuitable for neural operators

Abstract

Deep learning-based hybrid iterative methods (DL-HIMs) integrate classical numerical solvers with neural operators, utilizing their complementary spectral biases to accelerate convergence. Despite this promise, many DL-HIMs stagnate at false fixed points where neural updates vanish while the physical residual remains large, raising questions about reliability in scientific computing. In this paper, we provide evidence that performance is highly sensitive to training paradigms and update strategies, even when the neural architecture is fixed. Through a detailed study of a DeepONet-based hybrid iterative numerical transferable solver (HINTS) and an FFT-based Fourier neural solver (FNS), we show that significant physical residuals can persist when training objectives are not aligned with solver dynamics and problem physics. We further examine Anderson acceleration (AA) and demonstrate that…
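To make the abstract's terms concrete, here is a minimal sketch of classical Anderson acceleration applied to a damped Jacobi fixed-point map for a 1D Poisson problem. It is not the paper's physics-aware variant or its HINTS/FNS solvers; the test problem, the function names (`poisson_1d`, `damped_jacobi`, `anderson`), and all parameter values are assumptions chosen for illustration. The loop records two quantities per step: the update norm ||g(u) - u|| and the physical residual ||b - Au|| / ||b||. For Jacobi these are proportional, so they vanish together. For a learned corrector the update can vanish while the residual stays large, which is the false fixed point the abstract describes, so monitoring both is the useful habit.

```python
import numpy as np

def poisson_1d(n):
    """Dense 1D Poisson matrix with Dirichlet boundaries (illustrative test problem)."""
    h = 1.0 / (n + 1)
    return (2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)) / h**2

def damped_jacobi(A, b, omega=2.0 / 3.0):
    """Return the fixed-point map g(u) = u + omega * D^{-1} (b - A u)."""
    d = np.diag(A)
    return lambda u: u + omega * (b - A @ u) / d

def anderson(g, u0, residual, m=5, iters=100, tol=1e-10):
    """Classical Anderson acceleration with window m (m=0 is the plain iteration).

    Returns the final iterate and a history of (update norm, physical residual).
    """
    u = u0.copy()
    G, F = [], []          # histories of g(u_k) and f_k = g(u_k) - u_k
    history = []
    for _ in range(iters):
        gu = g(u)
        f = gu - u
        history.append((np.linalg.norm(f), residual(u)))
        if np.linalg.norm(f) < tol:
            break          # update vanished; residual(u) tells us if that is genuine
        G.append(gu)
        F.append(f)
        G, F = G[-(m + 1):], F[-(m + 1):]
        if len(F) == 1:
            u = gu         # not enough history yet: plain fixed-point step
        else:
            # Least-squares mixing: minimize ||f - dF @ gamma|| over the window.
            dF = np.stack([F[i + 1] - F[i] for i in range(len(F) - 1)], axis=1)
            dG = np.stack([G[i + 1] - G[i] for i in range(len(G) - 1)], axis=1)
            gamma = np.linalg.lstsq(dF, f, rcond=None)[0]
            u = gu - dG @ gamma
    return u, history

# Usage: compare plain damped Jacobi (m=0) with Anderson acceleration (m=15).
n = 15
A = poisson_1d(n)
b = np.ones(n)
rel_res = lambda u: np.linalg.norm(b - A @ u) / np.linalg.norm(b)
g = damped_jacobi(A, b)
u_plain, hist_plain = anderson(g, np.zeros(n), rel_res, m=0)
u_aa, hist_aa = anderson(g, np.zeros(n), rel_res, m=15)
print("plain Jacobi residual:", rel_res(u_plain))
print("Anderson residual:   ", rel_res(u_aa))
```

In a hybrid solver, `g` would alternate between a classical smoother and a neural correction; that part is deliberately left out here because the listing does not specify how the paper's solvers or its physics-aware acceleration are constructed.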

Taxonomy

Topics: Model Reduction and Neural Networks · Numerical methods for differential equations · Stochastic Gradient Optimization Techniques