Error Analysis of Three-Layer Neural Network Trained with PGD for Deep   Ritz Method

Yuling Jiao; Yanming Lai; and Yang Wang

arXiv:2405.11451·math.NA·May 21, 2024

Error Analysis of Three-Layer Neural Network Trained with PGD for Deep Ritz Method

Yuling Jiao, Yanming Lai, and Yang Wang

PDF

Open Access

TL;DR

This paper provides a comprehensive error analysis for a three-layer neural network trained with projected gradient descent within the deep Ritz method framework, offering insights into approximation, generalization, and optimization errors for PDE solutions.

Contribution

It is the first to analyze overparameterized networks for PDEs, including error estimates and training guidance without restrictive assumptions.

Findings

01

Established global convergence of the training algorithm.

02

Derived error bounds based on sample size and network parameters.

03

Provided practical guidance on network design and training settings.

Abstract

Machine learning is a rapidly advancing field with diverse applications across various domains. One prominent area of research is the utilization of deep learning techniques for solving partial differential equations(PDEs). In this work, we specifically focus on employing a three-layer tanh neural network within the framework of the deep Ritz method(DRM) to solve second-order elliptic equations with three different types of boundary conditions. We perform projected gradient descent(PDG) to train the three-layer network and we establish its global convergence. To the best of our knowledge, we are the first to provide a comprehensive error analysis of using overparameterized networks to solve PDE problems, as our analysis simultaneously includes estimates for approximation error, generalization error, and optimization error. We present error bound in terms of the sample size $n$ and our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsSparse Evolutionary Training · Focus