Preconditioning and Numerical Stability in Neural Network Training for Parametric PDEs

Markus Bachmayr, Wolfgang Dahmen, Chenguang Duan, and Mathias Oster

arXiv:2601.23185 · math.NA · February 2, 2026

TL;DR

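This paper shows that preconditioning via well-conditioned frame representations of operators significantly improves standard training methods for neural network approximations of parametric PDE solutions. It also proposes stable representations of the preconditioned matrices that allow computations in single and half precision without loss of precision.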

Contribution

It proposes a generally applicable form of stable representation for preconditioned matrices that permits computations with single- and half-precision floating point numbers without loss of precision.

Findings

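01 Preconditioning via well-conditioned frame representations of operators significantly improves the performance of standard training methods.

02 Stable representations of the preconditioned matrices enable single- and half-precision computations without loss of precision.

03 Standard representations of preconditioned matrices are insufficient for obtaining numerical stability.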

Abstract

In the context of training neural network-based approximations of solutions of parameter-dependent PDEs, we investigate the effect of preconditioning via well-conditioned frame representations of operators and demonstrate a significant improvement on the performance of standard training methods. We also observe that standard representations of preconditioned matrices are insufficient for obtaining numerical stability and propose a generally applicable form of stable representations that enables computations with single- and half-precision floating point numbers without loss of precision.
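As a rough illustration of why conditioning matters for gradient-based methods, the sketch below compares plain gradient descent on a 1D Poisson stiffness system in the nodal basis with the same iteration after a change to an energy-normalized hierarchical hat basis. This is a linear toy problem, not the neural network setting of the paper, and the hierarchical basis is only a stand-in assumed here for a well-conditioned multilevel representation; it is not the frame construction used by the authors. All function names (`stiffness_1d`, `hierarchical_hats`, `gradient_descent`) are hypothetical.

```python
import numpy as np


def stiffness_1d(n):
    """Nodal P1 stiffness matrix of -u'' on (0, 1) with n interior nodes."""
    h = 1.0 / (n + 1)
    return (2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)) / h


def hierarchical_hats(levels):
    """Energy-normalized hierarchical hat functions, sampled at the fine nodes."""
    n = 2**levels - 1
    x = np.arange(1, n + 1) / (n + 1)
    cols = []
    for level in range(1, levels + 1):
        H = 2.0 ** (-level)  # half-width of a hat on this level
        for k in range(1, 2**level, 2):  # odd multiples of H: nodes new on this level
            hat = np.maximum(0.0, 1.0 - np.abs(x - k * H) / H)
            cols.append(hat * np.sqrt(H / 2.0))  # the unscaled hat has energy 2 / H
    return np.stack(cols, axis=1)


def gradient_descent(M, rhs, steps):
    """Gradient descent on 0.5 * x^T M x - rhs^T x with step size 1 / lambda_max(M)."""
    step = 1.0 / np.linalg.eigvalsh(M)[-1]
    x = np.zeros_like(rhs)
    for _ in range(steps):
        x = x - step * (M @ x - rhs)
    return x


levels, steps = 6, 50
n = 2**levels - 1
A = stiffness_1d(n)
T = hierarchical_hats(levels)  # assumption: toy stand-in for a well-conditioned frame
P = T.T @ A @ T  # preconditioned matrix in the multilevel basis
b = np.full(n, 1.0 / (n + 1))  # load vector for the right-hand side f = 1
u_exact = np.linalg.solve(A, b)

u_nodal = gradient_descent(A, b, steps)
u_pre = T @ gradient_descent(P, T.T @ b, steps)

cond_nodal = np.linalg.cond(A)
cond_pre = np.linalg.cond(P)
err_nodal = np.linalg.norm(u_nodal - u_exact) / np.linalg.norm(u_exact)
err_pre = np.linalg.norm(u_pre - u_exact) / np.linalg.norm(u_exact)
print(f"condition numbers: nodal {cond_nodal:.1f}, preconditioned {cond_pre:.6f}")
print(f"relative error after {steps} steps: nodal {err_nodal:.2e}, preconditioned {err_pre:.2e}")
```

In one dimension the hierarchical hats are orthogonal in the energy inner product, so the preconditioned matrix is the identity up to rounding and the iteration converges almost immediately, while the nodal iteration is limited by a condition number that grows like the inverse square of the mesh size. The sketch runs entirely in double precision and forms `T.T @ A @ T` explicitly, so it says nothing about the low-precision stability issue that the abstract raises for standard representations of preconditioned matrices.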

Taxonomy

Topics: Model Reduction and Neural Networks · Numerical Methods and Algorithms · Neural Networks and Applications