Are Deep Learning Based Hybrid PDE Solvers Reliable? Why Training Paradigms and Update Strategies Matter
Yuhan Wu, Jan Willem van Beek, Victorita Dolean, Alexander Heinlein

TL;DR
This paper investigates the reliability of deep learning hybrid PDE solvers, emphasizing that training paradigms and update strategies critically influence convergence and accuracy, with proposed physics-aware acceleration improving performance.
Contribution
It demonstrates that training objectives and update strategies significantly affect solver reliability and introduces physics-aware Anderson acceleration to enhance convergence.
Findings
Physics-aware Anderson acceleration improves convergence
Training objectives aligned with physics reduce residuals
Classical Anderson acceleration is unsuitable for neural operators
Abstract
Deep learning-based hybrid iterative methods (DL-HIMs) integrate classical numerical solvers with neural operators, utilizing their complementary spectral biases to accelerate convergence. Despite this promise, many DL-HIMs stagnate at false fixed points where neural updates vanish while the physical residual remains large, raising questions about reliability in scientific computing. In this paper, we provide evidence that performance is highly sensitive to training paradigms and update strategies, even when the neural architecture is fixed. Through a detailed study of a DeepONet-based hybrid iterative numerical transferable solver (HINTS) and an FFT-based Fourier neural solver (FNS), we show that significant physical residuals can persist when training objectives are not aligned with solver dynamics and problem physics. We further examine Anderson acceleration (AA) and demonstrate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Numerical methods for differential equations · Stochastic Gradient Optimization Techniques
