Complexity bounds on neural networks for the solution of structured linear systems of equations

Benjamin D\"orich; Roland Maier; Lukas Ullmer

arXiv:2603.19043·math.NA·March 20, 2026

Complexity bounds on neural networks for the solution of structured linear systems of equations

Benjamin D\"orich, Roland Maier, Lukas Ullmer

PDF

Open Access

TL;DR

This paper establishes explicit complexity bounds for ReLU neural networks approximating solutions to structured linear systems, especially symmetric positive definite and sparse matrices, extending classical iterative methods.

Contribution

It provides new upper bounds on neural network size for solving structured linear systems, incorporating matrix properties like condition number and sparsity.

Findings

01

Bounds depend explicitly on matrix size and condition number

02

Extends classical iterative methods to neural network approximation

03

Applicable to finite difference and finite element matrices

Abstract

We derive upper bounds on the complexity of ReLU neural networks approximating the solution of a linear system given the matrix and the right-hand side. We focus on matrices which are symmetric positive definite and sparse, as they appear in the context of finite difference and finite element methods. For such matrices, we extend available results for the matrix inversion to the task of solving a linear system, where we leverage favorable properties of classical methods such as the modified Richardson and the conjugate gradient method. Our bounds on the number of layers and neurons are not only explicit with respect to the size of the matrices, but also with respect to their condition numbers.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Neural Networks and Applications