Optimization and generalization analysis for two-layer physics-informed neural networks without over-parametrization

Zhihan Zeng; Yiqi Gu

arXiv:2507.16380·cs.LG·July 23, 2025

Optimization and generalization analysis for two-layer physics-informed neural networks without over-parametrization

Zhihan Zeng, Yiqi Gu

PDF

Open Access

TL;DR

This paper analyzes the optimization and generalization of two-layer physics-informed neural networks trained with SGD, demonstrating that under certain conditions, the loss can be reduced below a specified threshold without over-parameterization.

Contribution

It provides a new analysis of SGD training for two-layer PINNs under non-over-parameterized regimes, avoiding the computational costs of over-parameterization.

Findings

01

Training loss decreases below O(ε) with sufficient network width.

02

Analysis avoids reliance on over-parameterization assumptions.

03

Results applicable to practical PINN training scenarios.

Abstract

This work focuses on the behavior of stochastic gradient descent (SGD) in solving least-squares regression with physics-informed neural networks (PINNs). Past work on this topic has been based on the over-parameterization regime, whose convergence may require the network width to increase vastly with the number of training samples. So, the theory derived from over-parameterization may incur prohibitive computational costs and is far from practical experiments. We perform new optimization and generalization analysis for SGD in training two-layer PINNs, making certain assumptions about the target function to avoid over-parameterization. Given $ϵ > 0$ , we show that if the network width exceeds a threshold that depends only on $ϵ$ and the problem, then the training loss and expected loss will decrease below $O (ϵ)$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Neural Networks and Reservoir Computing