Convergence Rate Analysis for Deep Ritz Method
Chenguang Duan, Yuling Jiao, Yanming Lai, Xiliang Lu, Zhijian Yang

TL;DR
This paper provides the first rigorous convergence rate analysis for the deep Ritz method applied to second order elliptic PDEs, explaining how network architecture affects accuracy and sample complexity.
Contribution
It establishes a nonasymptotic convergence rate in $H^1$ norm for DRM with ReLU^2 networks and analyzes hyper-parameter settings for optimal convergence.
Findings
First nonasymptotic convergence rate in $H^1$ norm for DRM.
Bounds on approximation error and Rademacher complexity for deep ReLU^2 networks.
Guidelines for setting network depth and width for desired accuracy.
Abstract
Using deep neural networks to solve PDEs has attracted a lot of attentions recently. However, why the deep learning method works is falling far behind its empirical success. In this paper, we provide a rigorous numerical analysis on deep Ritz method (DRM) \cite{wan11} for second order elliptic equations with Neumann boundary conditions. We establish the first nonasymptotic convergence rate in norm for DRM using deep networks with activation functions. In addition to providing a theoretical justification of DRM, our study also shed light on how to set the hyper-parameter of depth and width to achieve the desired convergence rate in terms of number of training samples. Technically, we derive bounds on the approximation error of deep network in norm and on the Rademacher complexity of the non-Lipschitz composition of gradient norm and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
