ReLU Deep Neural Networks and Linear Finite Elements

Juncai He; Lin Li; Jinchao Xu; Chunyue Zheng

arXiv:1807.03973·math.NA·June 2, 2020

ReLU Deep Neural Networks and Linear Finite Elements

Juncai He, Lin Li, Jinchao Xu, Chunyue Zheng

PDF

TL;DR

This paper explores the theoretical relationship between ReLU deep neural networks and linear finite element methods, establishing minimal layer requirements for representing finite element functions and demonstrating potential applications in solving PDEs.

Contribution

It provides a theoretical framework linking ReLU DNNs with finite element functions, including minimal layer requirements and neuron estimates for representing CPWL functions.

Findings

01

ReLU DNNs require at least 2 hidden layers to represent finite element functions in dimensions 2 and 3.

02

A general CPWL function in can be represented with +1 layers and an estimated number of neurons.

03

Numerical experiments show ReLU DNNs can be used to solve boundary value problems, indicating potential for PDE applications.

Abstract

In this paper, we investigate the relationship between deep neural networks (DNN) with rectified linear unit (ReLU) function as the activation function and continuous piecewise linear (CPWL) functions, especially CPWL functions from the simplicial linear finite element method (FEM). We first consider the special case of FEM. By exploring the DNN representation of its nodal basis functions, we present a ReLU DNN representation of CPWL in FEM. We theoretically establish that at least $2$ hidden layers are needed in a ReLU DNN to represent any linear finite element functions in $Ω \subseteq R^{d}$ when $d \geq 2$ . Consequently, for $d = 2, 3$ which are often encountered in scientific and engineering computing, the minimal number of two hidden layers are necessary and sufficient for any CPWL function to be represented by a ReLU DNN. Then we include a detailed account on how a general…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.