TL;DR
PIELM is a fast, physics-informed machine learning method that efficiently solves linear partial differential equations, matching or surpassing PINNs in accuracy and offering a scalable distributed version for large domains.
Contribution
This paper introduces PIELM, a rapid physics-informed neural network alternative for PDEs, and proposes DPIELM, a distributed extension for large-scale problems.
Findings
PIELM achieves comparable or better accuracy than PINNs.
DPIELM provides effective solutions for large domain PDEs.
Neural network approaches can be competitive with traditional methods.
Abstract
There has been rapid progress recently on the application of deep networks to the solution of partial differential equations, collectively labelled as Physics Informed Neural Networks (PINNs). In this paper, we develop Physics Informed Extreme Learning Machine (PIELM), a rapid version of PINNs which can be applied to stationary and time dependent linear partial differential equations. We demonstrate that PIELM matches or exceeds the accuracy of PINNs on a range of problems. We also discuss the limitations of neural network based approaches, including our PIELM, in the solution of PDEs on large domains and suggest an extension, a distributed version of our algorithm -{}- DPIELM. We show that DPIELM produces excellent results comparable to conventional numerical techniques in the solution of time-dependent problems. Collectively, this work contributes towards making the use of neural…
| Steady/ Unsteady | 1D/2D | Test case ID | Description |
| Steady | 1D | TC-1 | Linear advection |
| TC-2 | Linear diffusion | ||
| TC-3 | Linear advection-diffusion | ||
| 2D | TC-4 | Linear advection in a star shaped computational domain | |
| TC-5 | Linear diffusion in a star shaped computational domain | ||
| TC-6 | Linear diffusion in a complex computational domain | ||
| Unsteady | 1D | TC-7 | Linear advection |
| TC-8 | Quasi-linear advection | ||
| TC-9 | Linear advection-diffusion | ||
| 2D | TC-10 | Linear advection-diffusion |
| TC | ||
| TC-1 | ||
| TC-2 | ||
| TC-3 |
| TC | ||
| TC-4 | ||
| TC-5 | ||
| TC-6 |
| Test case ID | Description | |
| Representation of functions with PIELM | TC-11 | Representation of sharp and discontinuous functions in 1D |
| TC-12 | Representation of a sharp peaked 2D Gaussian | |
| Solution of PDEs with PIELM | TC-13 | TC-7 with |
| TC-14 | TC-3 with | |
| Solution of PDEs with PINN | TC-15 | TC-7 with |
| Test Case ID | Description | Architecture |
| TC-9 | 1D unsteady linear advection-diffusion | |
| TC-10 | 2D unsteady linear advection-diffusion equation | |
| TC-11 | Representation of 1D sharp and discontinuous functions | |
| TC-12 | Representation of a sharp peaked 2D Gaussian with DPIELM | |
| TC-13 | Advection of a sharp peaked Gaussian | |
| TC-14 | 1D steady advection-diffusion | |
| TC-15 | Advection of a high frequency wave packet |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Physics Informed Extreme Learning Machine (PIELM)– A rapid method
for the numerical solution of partial differential equations
Vikas Dwivedi
Department of Mechanical Engineering
Indian Institute of Technology, Madras
Chennai-600036, India
&Balaji Srinivasan
Department of Mechanical Engineering
Indian Institute of Technology, Madras
Chennai-600036, India
[email protected] Use footnote for providing further information about author (webpage, alternative address)—not for acknowledging funding agencies.
Abstract
There has been rapid progress recently on the application of deep networks to solution of partial differential equations, collectively labelled as Physics Informed Neural Networks (PINNs). In this paper, we develop Physics Informed Extreme Learning Machine (PIELM), a rapid version of PINNs which can be applied to stationary and time dependent linear partial differential equations. We demonstrate that PIELM matches or exceeds the accuracy of PINNs on a range of problems. We also discuss the limitations of neural network based approaches, including our PIELM, in the solution of PDEs on large domains and suggest an extension, a distributed version of our algorithm – DPIELM. We show that DPIELM produces excellent results comparable to conventional numerical techniques in the solution of time-dependent problems. Collectively, this work contributes towards making the use of neural networks in the solution of partial differential equations in complex domains as a competitive alternative to conventional discretization techniques.
K****eywords Partial differential equations Physics informed neural networks Extreme learning machine Advection-diffusion equation
1 Introduction
Partial differential equations (PDEs) are extensively used in the mathematical modelling of various problems in physics, engineering and finance. In practical situations, these equations typically lack analytical solutions and are solved numerically. In current practice, most numerical approaches to solve PDEs like finite element method (FEM), finite difference method (FDM) and finite volume method (FVM) are mesh based. A typical implementation of a mesh based approach involves three steps: (1) Grid generation, (2) Discretization of governing equation and (3) Solution of the discretized equations with some iterative method.
However, there are limitations to these approaches. Some of the limitations of these methods are as follows:
They cannot be used to solve PDEs in complex computational domains because grid generation (step 1) itself becomes infeasible. 2. 2.
The process of discretization (step 2) creates a discrepancy between the mathematical nature of actual PDE and its approximate difference equation [1]. Sometimes this can lead to quite serious problems [2].
One of the options to fix these issues is to use neural networks. In this approach, the data set consists of some randomly selected points in the domain and on the boundary. The governing equations and the boundary conditions are fitted using neural network. There are two main motivations for this approach. First, being universal approximators, neural networks can potentially represent any PDE. So this avoids the discretization step and thus discretization based physics errors too. Second, it is meshfree and therefore complex geometries can be easily handled [3]. Initial work in this direction can be credited to Lagaris et al. [4, 5]. Firstly, they solved the initial boundary value problem using neural networks and later they extended their work to handle irregular boundaries. Since then, a lot of work has been done in this field [6, 7, 8, 9, 10, 11, 12, 13]. In particular, we refer to the physics-informed neural networks (PINN) approach by Raissi and Karniadakis [11] and Raissi et. al [12, 13]. This approach has produced promising results for a series of benchmark nonlinear problems.
Recently, Berg et al. [3] have developed a PINN based method to solve PDEs on complex domains and produced several results. However, in spite of various advantages of using deep networks for solving PDEs, PINNs have several problems [13]. Firstly, there is no theoretical basis to know the size of neural network architecture and the amount of data needed. Then, there is no guarantee that the algorithm will not hit upon a local minima. Finally, their learning time is slower than the traditional numerical methods making them very expensive for practical problems.
We show that some of the problems mentioned above can be easily handled by using an alternative network called the extreme learning machine (ELM). The basic ELM was proposed by Huang et al. [14] for a single hidden layer feed forward networks (SLFNs) and later it was extended to generalized SLFNs. The essence of ELM is that the weights of the hidden layer of SLFNs need not to be learnt. It is much faster than the traditional gradient based optimization methods alleviating the learning time problem. Previously, ELMs have been used in approximating functions [15] and solving ordinary differential equations (ODEs) and stationary PDEs [16, 17] using Legendre and Bernstein polynomial basis functions respectively.
In this paper, we propose a new machine learning algorithm to solve stationary and time dependent PDEs in complex geometries. We have named it physics informed extreme learning machine (PIELM) because it is a combination of two algorithms namely ELM and PINN. Theoretically, there is no question over the employment of ELM as a PDE solver because it is a universal approximator [18] and therefore it can approximate any PDE. We have made our ELM “physics informed” by incorporating the information about the physics of PDE as the cost function. In addition to this, we have also proposed an extension to original PIELM called distributed PIELM that enhances the representation power of PIELM without adding any extra hidden layers. We demonstrate that both PIELM and DPIELM exibit superior performance on a range of stationary and time-dependent problems in comparison to existing methods.
This paper proceeds as follows. We give a brief review of PINN and ELM in Section 2. The proposed PIELM is described in Section 3.In Section 4, we evaluate the performance of PIELM in solving various stationary and time-dependent PDEs. To our knowledge, this is the first application of an ELM based algorithm to solve a 2D unsteady PDE. In Section 5, we discuss the limitation of PIELM to represent discontinuous functions and the functions with sharp gradient. We also illustrate a test case where even a deep PINN fails to represent a complicated function. In Section 6, DPIELM, the distributed version of PIELM for enhanced representation is described. In Section 7, the results of implementation of DPIELM algorithm in test cases involving representation of functions with sharp gradients have been presented. We have also shown that DPIELM outperforms the deep PINN in representing the complicated function described earlier. Finally, conclusion and future work are given in Section 8.
2 Brief review of ELM and PINN
The PIELM is combination of two learning algorithms: ELM and PINN. In this section, we review these two algorithms in brief.
2.1 Extreme learning machine
Traditional gradient-based learning algorithms [19] have prohibitively slow learning speed and they suffer from various problems like improper learning rate, local minima etc. Huang et al. [14] originally proposed a novel learning algorithm called ELM to fix these issues. ELM is extremely fast and mostly it shows better generalization performance than gradient-based learning approaches like back-propagation. A typical implementation of the algorithm involves the following steps:
Select a shallow neural network. 2. 2.
Fix the hidden layer weights and biases randomly. These parameters will not be learned and therefore no iterative optimization is required for them. 3. 3.
Apply a nonlinear transformation to the input data set. This gives the input to the final layer. 4. 4.
Take the linear combination of all the inputs of the final layer. This is the output of ELM. 5. 5.
Learn the output layer weights using the least squares method.
Mathematical formulation
Consider the basic ELM shown in Fig (1). It is a single layer feed forward neural network with neurons in the hidden layer. Input is a vector of size and output is the component of the output vector of size . We denote the non-linear activation by and the weights and biases of node of hidden layer by and respectively. The output of the hidden node is . For a given data set with distinct samples, the ELM output is given by
[TABLE]
where,
[TABLE]
[TABLE]
and is vector of output layer weights.
On writing the Eq. (1) for all the components, the resulting ELM is given by
[TABLE]
where is the training error vector. The ELM tends to reach the smallest training error together with the smallest norm of the output weights. Mathematically saying, the loss function to be minimized for the ELM is given by
[TABLE]
where is the regularization parameter. The correct weights that minimize can be calculated by solving the normal equations as given below.
[TABLE]
2.2 Physics informed neural network
Raissi et al. [13] proposed a data efficient PINN for approximating solutions to general non-linear PDEs and validated it with a series of benchmark test cases. The main feature of the PINN is the inclusion of the prior knowledge of physics in the learning algorithm as cost function. As a result, the algorithm imposes penalty for any non-physical solution and quickly directs it towards the correct solution. This physics informed approach enhances the information content of the data. As a result, the algorithm has good generalization property even in the small data set regime.
Mathematical formulation
Consider a PDE of the following form
[TABLE]
[TABLE]
[TABLE]
where may be a linear or nonlinear differential operator and is the boundary of computational domain . We approximate with the output of PINN. The network architecture may be shallow or deep depending upon the non-linearity . The essence of PINN lies in the definition of its loss function. In order to make the neural network “physics informed”, the loss function is defined such that a penalty is imposed whenever the network output doesn’t respect the physics of the problem. If we denote the training errors in approximating the PDE, BCs and IC by , and respectively. Then, the expressions for these errors are as follows:
[TABLE]
[TABLE]
[TABLE]
For shallow networks, and can be determined using hand calculations. However, for deep networks, we have to use automatic differentiation [20]. The loss function to be minimized for a PINN is given by
[TABLE]
where , and refer to number of collocation points, boundary condition points and initial condition points respectively. Finally, any gradient based optimization routine may be used to minimize .
This completes the mathematical formulation of PINN. The key steps in its implementation are as follows:
Identify the PDE to be solved along with the initial and boundary conditions. 2. 2.
Decide the architecture of PINN. 3. 3.
Approximate the correct solution with PINN. 4. 4.
Find expressions for the PDE, BCs and IC in terms of PINN and its derivatives. 5. 5.
Define a loss function which penalizes for error in PDE, BCs and IC. 6. 6.
Minimize the loss with gradient based algorithms.
3 Proposed PIELM
Consider the following unsteady linear PDE
[TABLE]
[TABLE]
[TABLE]
where is a linear differential operator and is the boundary of computational domain . We approximate with the output of PIELM. For simplicity, we consider the 1D unsteady version of Eqns (12, 13, 14). The extension to higher dimensional problems is straightforward. The PIELM for 1D unsteady problem is schematically shown in Fig (2). The number of neurons in the hidden layers is . If we define , , , and then, the output of the hidden neuron is
[TABLE]
where and is the nonlinear activation function. The PIELM output is given by
[TABLE]
Similarly, the formulae for and are given by
[TABLE]
[TABLE]
We denote the training errors in approximating the PDE, BCs and IC by , and respectively. The expressions for these errors are as follows:
[TABLE]
[TABLE]
[TABLE]
Next, we put a hard constraint on to solve the PDE exactly with zero error by setting
[TABLE]
[TABLE]
[TABLE]
Eqns (21 to 23) lead to the a system of linear equations which can be represented as
[TABLE]
The form of and depends on the , and i.e. on the type of PDE, boundary condition and initial condition. In order to find , Moore–Penrose generalized inverse [21] (also called pseudo-inverse) should be used as it works well for singular and non square too. An additional advantage with this formulation is that we have a basis to guess the scale of the PIELM architecture. When we are solving Eqns (21 to 23) simultaneously, we know that a unique solution will exist when number of unknowns are equal to number of equations which means that . This gives us an idea of the size of the hidden layer. However, in practice we get the correct solution even with lesser number of neurons. For example, if we supply a large number of points to approximate a linear function, the PIELM would not require the same number of neurons for learning.
This completes the mathematical formulation of PIELM. The key steps in its implementation are as follows:
Assign the input layer weights randomly. 2. 2.
Depending on the PDE and the initial and boundary conditions, find the expressions for , and . 3. 3.
Assemble the three sets of equations in the form of 4. 4.
Output layer weight vector is given by where refers to pseudo-inverse.
It is to be noted that unlike conventional ELMs , we are not solving an optimization problem. The loss function to be minimized for a conventional physics informed ELM would be given by
[TABLE]
where is a regularization parameter and , and refer to number of collocation points, boundary condition points and initial condition points respectively. The correct ELM weights that minimize can be calculated by solving the normal equations which is given below.
[TABLE]
Although a PIELM can be made with this minimization approach, we have opted for the direct approach due to the following reasons:
The direct approach is straightforward to formulate and code. It saves the effort of calculating loss function and setting the derivatives equal to zero. 2. 2.
The learning of the minimization approach is comparatively less “physics informed” because physics is not being imposed in an exact sense.
4 Performance evaluation of PIELM
To evaluate the performance of PIELM, we rigorously test it on various linear and quasi-linear PDEs described in Table(1). TC-1, TC-2, TC-4, TC-5, TC-6 are taken from Berg et al. [3]. TC-8 is taken from Kopriva et al. [22]. TC-9 and TC-10 are taken from Borker et al. [23]. All the experiments are conducted in Matlab 2017b environment running in an Intel Core i5 2.20GHz CPU and 8GB RAM Dell laptop. The error is defined as the difference between the PIELM prediction and the exact solution.
4.1 1D steady cases [ TC-1, TC-2, TC-3 ]
The 1D stationary advection, diffusion and advection-diffusion equations are given by
[TABLE]
[TABLE]
[TABLE]
respectively. The expressions for and the Dirichlet boundary conditions for these cases are calculated by assuming the following exact solutions
[TABLE]
[TABLE]
[TABLE]
respectively. In order to solve these equations in PIELM framework, we have to solve Eqn(21 to 23). The expression for depends on the linear differential operator . The definitions of in these cases are , and respectively. When acts on , the corresponding expressions for may be written as follows:
[TABLE]
[TABLE]
[TABLE]
where is collocation points vector, is bias vector, and ’’ refers to Hadamardt product. Referring to Fig (3a), . Similarly, expression for is given by
[TABLE]
where is boundary points vector, and is the boundary condition.
The results for these test cases are given in Fig(4), Fig(5) and Fig (6) respectively and the summary of the experiments is given in Tab(2).
Remark
It should be noted that the unified deep ANN algorithm [3] took 100 points to achieve an order of accuracy of and in TC-1 and TC-2 respectively. In comparison, PIELM took less than half of the points and still achieved an order of accuracy of in both the cases.
4.2 2D steady cases [ TC-4, TC-5, TC-6 ]
The stationary 2D advection and diffusion equations for the three cases are given by
[TABLE]
[TABLE]
[TABLE]
where and are advection coefficients. The computational domains and are shown in respectively. The expressions for and the for these cases are constructed by choosing the following exact solutions
[TABLE]
[TABLE]
[TABLE]
respectively. PIELM equations for these problems are as follows:
=
- •
Case 1: Advection
[TABLE]
- •
Case 2: Diffusion
[TABLE]
where and (refer Fig (2)). 2. 2.
[TABLE]
where .
The results for the advection and diffusion cases on are shown in Fig (8) and Fig (9) respectively. The result for diffusion case on is given in Fig (10). Summary of the experiments is given in
Remarks
The unified deep ANN algorithm [3] took points to achieve an order of accuracy of in TC-4 and TC-5. In comparison, PIELM solved these cases with merely points and still achieved an order of accuracy of and respectively. 2. 2.
False diffusion is an error which gives diffusion like appearance to solution of pure advection equation when it is solved using upwind discretization. In Fig (8) these errors can be seen flowing along streamlines. However, PIELM reduces the order of these errors to an insignificant level. 3. 3.
Computational domain is the map of state of Illinois in USA. This is an example of a complicated polygon which has very short line segments and fine grained details in various regions. Conventional mesh based methods are not feasible for these kind of geometries. The latitudes and longitudes of the boundary are available in MATLAB’s in-built function “usamap”. We have re-scaled the data in the range . PIELM solved this case with just points to an accuracy level of in just seconds.
4.3 1D unsteady advection cases [ TC-7, TC-8 ]
The unsteady 1D advection equation is given by
[TABLE]
[TABLE]
where is the advection coefficient. In this problem, we consider two cases: (1) constant coefficient case with and (2) variable coefficient case [22] with . The two cases have periodic and inflow boundary conditions respectively. The value of is . The exact solutions to the 1D unsteady advection problems with constant and variable coefficient are respectively given by
[TABLE]
[TABLE]
PIELM equations
=
- •
TC-7: Linear case
[TABLE]
- •
TC-8: Quasi-linear case
[TABLE]
where are collocation point vectors, , and \boldsymbol{A_{f}}=[\begin{array}[]{cccc}a\left(\overrightarrow{x_{f}},\overrightarrow{t_{f}}\right),&...&,&a\left(\overrightarrow{x_{f}},\overrightarrow{t_{f}}\right)\end{array}]_{N\text{x}N^{*}}. 2. 2.
[TABLE]
where , are left boundary points vectors, , are right boundary points vectors, and . 3. 3.
[TABLE]
where are initial condition vectors, and is initial condition.
The results are shown in Fig (11). PIELM predicts the exact solution correctly for both linear and quasi-linear advection. In this case, we took , , and . The total learning time is within 2-3 seconds.
Remark
For advection problems, the time step of the traditional numerical schemes like upwinding can not exceed the mesh size due to stability issues. However, PIELM doesn’t impose any such restriction and we can take larger time steps.
4.3.1 1D unsteady advection-diffusion [ TC-9 ]
The 1D equivalent of the unsteady 2D advection-diffusion equation solved by Borker et al. [23] is given by
[TABLE]
where is the advection coefficient. The expressions for the initial condition and the boundary condition are constructed on the basis of the following exact solution
[TABLE]
The PIELM equations are as follows:
=
[TABLE]
where are collocation point vectors, and . 2. 2.
[TABLE]
where , are boundary points vectors, and is the boundary condition. 3. 3.
[TABLE]
where are initial condition vectors, and is initial condition.
The results are shown in Fig (12). In this case, PIELM prediction clearly goes wrong in the following grounds:
Initial and boundary conditions aren’t captured correctly. 2. 2.
Solution has unphysical oscillations throughout the domain. 3. 3.
The hump of the Gaussian decays a lot slower than the correct rate.
We increased the size of hidden layer but didn’t see any improvement in results. For example, we took points in the computational domain and put as many as neurons in the hidden layer. Still the PIELM predictions were poor.
4.3.2 2D unsteady advection-diffusion [ TC-10 ]
We solve the following unsteady 2D advection-diffusion equation[23]
[TABLE]
[TABLE]
[TABLE]
where . and are advection coefficients in and directions and is the diffusion coefficient. The expressions for the initial and the boundary conditions are constructed by choosing the following exact solution:
[TABLE]
PIELM equations to be solved are as follows:
=
[TABLE]
where are collocation point vectors, . Referring to Fig (3b), . 2. 2.
[TABLE]
where , , are boundary points vectors and . 3. 3.
[TABLE]
where are initial condition vectors and .
The results are shown in Fig (13-14). They are 2D equivalents of 1D results. The solution is diffusive and contains unphysical oscillations throughout the domain. In spite of all these issues, it should be noted that:
The advection component of the equation is captured correctly. The two humps are moving at the same speed. 2. 2.
The non-physical oscillations don’t grow with time.
For this case, we have taken a total of 125000 data points and 1000 neurons in the hidden layer. The results show signs of improvement if we keep increasing the number of points. However, that takes a lot of time which makes the option impractical. The limitations of the PIELM will be further investigated in the next section. We close this section by summarizing the advantages of the PIELM which are as follows:
It is extremely fast as well as data efficient.(TC-1 to TC-6) 2. 2.
It can be seamlessly extended to higher dimensions. (TC-10) 3. 3.
It reduces numerical artefacts like false diffusion.(TC-4) 4. 4.
It is meshfree method and can handle the complex geometries.(TC-6) 5. 5.
It reduces the arbitrariness of the number of neurons in the hidden layer.
5 Limitations of PIELM
A potential reason for the failure of PIELM in solving advection-diffusion equation [23] could be the limited representation capacity of PIELM to represent a complex function. A PDE consists of function and its derivatives. If a neural network cannot represent the function itself, then calculation of derivatives only adds to the error.
Representation of functions with sharp gradients [ TC-11, TC-12
]
To put the representation power of PIELM to test we choose the following two functions: (1) A 1D composite function that contains both discontinuous functions and functions with sharp gradients and (2) A 2D Gaussian function with a sharp peak. The expressions for 2D Gaussian function and 1D composite function are and
[TABLE]
respectively. The results are shown in figure (15)and Fig (16) respectively. These cases clearly expose the limitation of PIELM in representing profiles with sharp gradients and corners.
PDEs with sharp solutions [ TC-13, TC-14 ]
Due to this limitation, PIELM fails to solve any PDE which admits functions with sharp gradients. For example, we have already seen the failure of our algorithm in solving 1D and 2D advection-diffusion equation. We further illustrate the impact of this limitation on two simpler equations. Firstly, we consider pure advection of a sharp peaked Gaussian. This equation has been solved in TC-7 for a smooth sine function. Next , we take steady advection-diffusion equation (TC-3) with a low value of diffusion coefficient.
The exact and PIELM solutions for these problems are shown in Fig (17) and Fig (18) respectively.
Representation of a high frequency wavelet with PINN [TC-15]
An obvious idea to improve the representation capacity of a neural network is to make it deep. Therefore, we test the performance of PINN in TC-7 with i.e. pure advection of a high frequency wavelet. The PINN code is freely available at https://github.com/maziarraissi/PINNs. We modified the original code by replacing the Burgers equation with pure linear advection equation. Fig(19) shows the architecture of PINN. It consists of 9 hidden layers with 20 neurons each. Each layer is activated by functions.
The exact and the PINN solutions are shown in Fig (20). We can see that even a deep network is unable to capture the sharp gradients of this wavelet.
List of the experiments conducted in this section are given in Tab 4. We summarize this section as follows:
The main limitation of a PIELM is its inability to represent complex functions. 2. 2.
This limitation restricts the PIELM to solve the PDEs with sharp exact solutions. 3. 3.
Adding extra layers is not the practical solution to this problem. (TC-15)
6 Distributed PIELM
In this section, we propose a distributed version of PIELM called DPIELM. This algorithm takes motivation from finite volume methods in which the whole computational domain is partitioned into multiple cells and governing equations are solved at each cell. The solutions of these individual cells are stitched together with additional convective and diffusive fluxes conditions at the cell interfaces. We adopt a similar strategy in DPIELM. As representation of a complex function is very hard for a single PIELM or PINN in the whole domain, we divide the domain into multiple cells and install a PIELM in each cell. Therefore, each PIELM uses different representations in different portions of domain while satisfying some additional constraints of continuity and differentiability.
6.1 Mathematical formulation
Consider the following 1D unsteady problem
[TABLE]
[TABLE]
[TABLE]
where is a linear differential operator and is the boundary of computational domain . In this problem, the rectangular domain is given by . On uniformly dividing into non-overlapping rectangular cells, may be written as
[TABLE]
The boundary of the cell is denoted by . For rectangular cells,
[TABLE]
where represents the interface of .
Fig (21a) shows the distribution of PIELMs in a rectangular computational domain with cells (i.e. ). We denote the PIELM on the cell by . Fig (21b) shows a PIELM with collocation points at the interior and the boundary points at the four interfaces. The weights and output corresponding to a given are denoted by \text{[\overrightarrow{m}}^{(i)},\text{\overrightarrow{n}}^{(i)},\text{\overrightarrow{b}}^{(i)},\text{\overrightarrow{c}}^{(i)}] and respectively.
At each , we enforce additional constraints of continuity (or smoothness) of solution at the cell interfaces depending on the differential operator . For example, continuity of solution is sufficient for advection problems. For diffusion problem, the solution should be continuously differentiable. For the computational domain shown in the Fig (21), the system of equations to be solved in the DPIELM framework are given below.
6.1.1 Regular PIELM equations
=
[TABLE] 2. 2.
[TABLE]
[TABLE] 3. 3.
[TABLE]
6.1.2 Additional interface equations
Constraints for solutions i.e. .
- •
Continuity along direction
[TABLE]
where .
- •
Continuity along direction
[TABLE]
where . 2. 2.
Constraints for solutions i.e. .
- •
Smooth solutions along direction
[TABLE]
where .
Assembly of Eqns (72 to 78) leads to a system of linear equations which can be represented as
[TABLE]
where The form of and depends on the , and . Finally can be found using pseudo inverse. It is to be noted that although we have shown the formulation for 1D unsteady problems, no special adjustment is needed to extend the formulation to higher dimensional problems.
This completes the mathematical formulation of DPIELM. The main steps in its implementation are as follows:
Divide the computational domain into uniformly distributed non overlapping cells and install a PIELM in each cell. 2. 2.
Depending on the cell location, PDE and the initial and boundary conditions, find the expressions for , and at each cell. 3. 3.
Depending on the PDE, find the expressions for and at each cell interface. 4. 4.
Assemble these equations in the form of where 5. 5.
Find the value of using pseudo inverse.
7 Performance evaluation of DPIELM
In this section, we evaluate the performance of DPIELMs by testing it on all the cases in which regular PIELM and PINN failed to perform. The details of the architecture is given in Tab(5).
Representation of functions with sharp gradient [ TC-11, TC-12]
The mathematical formulation of DPIELM equations for 1D case are as follows:
=
[TABLE] 2. 2.
[TABLE]
where refer to left and right cell interfaces respectively. 3. 3.
[TABLE]
The equations for the 3D case can be written in a similar fashion. The exact and DPIELM solutions for these cases are shown in Figs (22) and (23) respectively.
1D steady advection-diffusion equation with low value of diffusion
constant [TC-14]
The DPIELM equations are as follows:
=
[TABLE] 2. 2.
[TABLE] 3. 3.
[TABLE] 4. 4.
[TABLE]
The results of the exact and DPIELM solution is given in Fig(24).
7.1 1D unsteady advection of a sharp peaked Gaussian and a high frequency
wavelet and [ TC-13, TC-15 ]
The DPIELM equation to be solved are as follows:
=
[TABLE] 2. 2.
[TABLE]
[TABLE] 3. 3.
[TABLE] 4. 4.
- •
Continuity along direction
[TABLE]
where , and .
- •
Continuity along direction
[TABLE]
where , and .
The results for the TC-13 and TC-15 are given in Fig (25) and Fig (26) respectively.
7.2 1D and 2D unsteady advection-diffusion
equations [ TC-9, TC-10]
In this section, we present the DPIELM equation for the 1D case. The equations for the 2D case can be formulated in a similar fashion. The equations to be solved for 1D unsteady advection-diffusion equation are as follows:
=
[TABLE] 2. 2.
[TABLE]
where , and .
Rest of the equations i.e. , and are same as that used in 1D unsteady linear advection. The results for the 1D and 2D cases are given in Fig (27) and Fig (28 to 30) respectively.
This brings us to the end of our numerical experiments. We close this section by highlighting the key points which are as follows:
The process of partitioning of the whole computational domain into multiple cells simplifies the representation of the complicated function ( and thus PDE ) in the individual cells. As a result, local PIELMs are able to capture not just the functions with sharp gradients, but also discontinuous functions ( TC-11 ). 2. 2.
To our knowledge, this is the first demonstration of capability of ELM based algorithms to solve 2D unsteady PDEs and produce results comparable to sophisticated numerical methods. (TC-10) 3. 3.
The distributed version of PIELM exhibits better representation ability than a deep PINN (TC-15).
8 Conclusion and future work
We have presented in this paper PIELM – an efficient method to utilize physics informed neural networks to solve stationary and time dependent linear PDEs. As PIELM inherits the unique qualities of its parent algorithms (PINN and ELM), it works very well on complex geometries, respects the inherent physics of the PDEs and is extremely fast as well. This leads to several advantages over existing conventional numerical methods; PIELM reduces numerical artefacts such as false diffusion as well can handle complex geometries in a meshfree approach. PIELM also reduces, to a certain extent, the arbitrariness of the number of neurons in typical deep PINNs. Our numerical tests also confirm that, for a fixed problem, our minimal PIELM is more accurate than prior deep NN results [3] while being faster.
We have also presented in this paper the limitations in representing complex functions using a single PIELM or PINN for the whole domain. For practical problems using PINNs can lead to very deep networks with the concomitant training problems and efficiency issues. Our proposed solution is to use a distributed PIELM (DPIELM) which uses different representations in different portions of the domain while imposing some continuity and differentiability constraints. The resultant DPIELM easily captures profiles that PINNs have difficulty with. Further, on time-dependent problems, DPIELM gives results that are comparable to sophisticated conventional numerical techniques as seen in Section 7.2.
We believe that the method, as formulated, is already very powerful for linear PDEs with constant or space-varying coefficients. Two primary areas of development remain, in our opinion. Our preliminary tests show that, for linear PDEs, unlike deep PINNs [13], our method may actually be competitive with conventional techniques in terms of speed and accuracy. However, a firm conclusion on this cannot be reached until a more thorough and fair study is done. Theoretical and numerical evidence for this efficacy is the first area which deserves attention, as the number of practical applications (such as heat conduction, etc) where numerical methods for linear equations are a staple is large. Having an efficient neural network framework for such problems can be tremendously beneficial to practitioners.
Even more importantly, PIELM’s efficacy is thanks to the linear nature of the final problem. We have, therefore, limited our present study to linear problems. Extending this method to nonlinear equations is the obvious next frontier. This may be approached in one of two ways. The first approach would be to use the PIELM structure as is and then solve the resultant, non-convex optimization problem. Another approach would be to linearize the equation around the current time step to predict a future time step. This would be the equivalent of a linearized, explicit time-stepping method in conventional time marching techniques. We are currently investigating these approaches and will report progress in future publications.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Versteeg, Henk Kaarle, and Weeratunge Malalasekera. An introduction to computational fluid dynamics: the finite volume method. Pearson education, 2007.
- 2[2] Quirk, James J. " A contribution to the great Riemann solver debate." Upwind and High-Resolution Schemes. Springer, Berlin, Heidelberg, 1997. 550-569.
- 3[3] Berg, Jens, and Kaj Nyström. A unified deep artificial neural network approach to partial differential equations in complex geometries. Neurocomputing 317 (2018): 28-41.
- 4[4] Lagaris, Isaac E., Aristidis Likas, and Dimitrios I. Fotiadis. " Artificial neural networks for solving ordinary and partial differential equations." IEEE transactions on neural networks 9.5 (1998): 987-1000.
- 5[5] Lagaris, Isaac E., Aristidis C. Likas, and Dimitris G. Papageorgiou. " Neural-network methods for boundary value problems with irregular boundaries." IEEE Transactions on Neural Networks 11.5 (2000): 1041-1049.
- 6[6] van Milligen, B. Ph, V. Tribaldos, and J. A. Jiménez. " Neural network differential equation and plasma equilibrium solver." Physical review letters 75.20 (1995): 3594.
- 7[7] Mc Fall, K. S., & Mahan, J. R. (2009). Artificial neural network method for solution of boundary value problems with exact satisfaction of arbitrary boundary conditions. IEEE Transactions on Neural Networks, 20(8), 1221-1233.
- 8[8] Kumar, Manoj, and Neha Yadav. "Multilayer perceptrons and radial basis function neural network methods for the solution of differential equations: a survey." Computers & Mathematics with Applications 62.10 (2011): 3796-3811.
