Error Estimation of the Besse Relaxation Scheme for a Semilinear Heat Equation
Georgios E. Zouraris

TL;DR
This paper provides the first error estimate for a fully discrete Besse relaxation scheme applied to a semilinear heat equation, demonstrating optimal second-order accuracy in a combined temporal and spatial norm.
Contribution
It introduces a new stability argument and establishes the first error bounds for the fully discrete Besse relaxation scheme in this context.
Findings
Achieved optimal second-order error estimate in discrete $L_t^{ar{ ext{infinity}}}(H_x^1)$-norm.
Developed a novel composite stability argument.
First literature report of error estimates for fully discrete Besse relaxation scheme.
Abstract
The solution to the initial and Dirichlet boundary value problem for a semilinear, one dimensional heat equation is approximated by a numerical method that combines the Besse relaxation scheme in time (C. R. Acad. Sci. Paris S{\'e}r. I, vol. 326 (1998)) with a central finite difference method in space. A new, composite stability argument is developed, leading to an optimal, second-order error estimate in the discrete norm. It is the first time in the literature where an error estimate for fully discrete approximations based on the Besse relaxation scheme is provided.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Error Estimation of the Besse Relaxation Scheme
for a Semilinear Heat Equation
Georgios E. Zouraris*‡*
Abstract.
The solution to the initial and Dirichlet boundary value problem for a semilinear, one dimensional heat equation is approximated by a numerical method that combines the Besse relaxation scheme in time (C. R. Acad. Sci. Paris Sér. I, vol. 326 (1998)) with a central finite difference method in space. A new, composite stability argument is developed, leading to an optimal, second-order error estimate in the discrete norm. It is the first time in the literature where an error estimate for fully discrete approximations based on the Besse relaxation scheme is provided.
Key words and phrases:
Besse relaxation method, semilinear heat equation, finite differences, Dirichlet boundary conditions, optimal order error estimates
1991 Mathematics Subject Classification:
65M12, 65M60
‡ Department of Mathematics and Applied Mathematics, University of Crete, GR-700 13 Panepistimioupolis, Heraklion, Crete, Greece.
1. Introduction
1.1. Formulation of the problem
Let , , with , and be the solution of the following initial and boundary value problem:
[TABLE]
where , and with
[TABLE]
Furthermore, we assume that the data , and are smooth enough and compatible, in order to guarantee the existence and uniqueness of a solution to the problem above that is sufficiently smooth for our purposes.
Two decades ago, for the discretization in time of the nonlinear Schrödinger equation, C. Besse [4] introduced a new linear-implicit time-stepping method (called Relaxation Scheme) as an attempt to avoid the numerical solution of the nonlinear systems of algebraic equations that the application of the implicit Crank-Nicolson method yields. The proposed time discretization technique, combined with a finite element or a finite difference space discretization, is computationally efficient (see, e.g., [3], [8], [6]) and performs as a second order method (see, e.g., [5], [8]). Later, C. Besse [5] analyzing the Relaxation Scheme as a semidiscrete in time method to approximate the solution of the Cauchy problem (i.e. without the presence of boundary conditions) shows, using that it is local well-posedness and convergent without concluding a convergent rate with respect to the time-step. Until today, in spite of the results in [5], there is no scientific work in the literature providing an error estimate for the Relaxation Scheme. Since the Relaxation Scheme can not be classified as a Runge-Kutta or a linear multistep method, a natural question arises: “is the Relaxation Scheme a special method or a representative member of a new family of linear implicit time-discretization methods?” One way moving toward to find an answer is first to understand its convergence and then to construct methods with similar characteristics.
The aim of the work at hands is to contribute to the understanding of the convergence nature of the Besse relaxation scheme, by investigating its use, along with a finite difference space discretization, to obtain approximations of the solution to the parabolic problem (1.1)-(1.4). By building up a proper stability argument and using energy techniques, we are able to prove an optimal, second order error estimate in a discrete norm. The result is new and opens the discussion on the applicability and the extension of the Relaxation Scheme to other non-linear evolution equations.
1.2. Formulation of the numerical method
1.2.1. Notation
Let be the set of all positive integers and . For given , we define a uniform partition of the time interval with time-step , nodes for , and intermediate nodes for . Also, for given , we consider a uniform partition of with mesh-width and nodes for . Then, we introduce the discrete spaces
[TABLE]
a discrete product operator by
[TABLE]
and a discrete Laplacian operator by
[TABLE]
In addition, we introduce operators and , which, for given , are defined by for and and for . Finally, for and for any function and any , we define by for .
1.2.2. The Besse Relaxation Finite Difference method
The Besse Relaxation Finite Difference (BRFD) method combines a standard finite difference discetization in space with the Besse relaxation scheme in time (cf. [4]). Its algorithm consists of the following steps:
Step I: Define by
[TABLE]
and then find such that
[TABLE]
Step II: Define by
[TABLE]
and then find such that
[TABLE]
Step III: For , first define by
[TABLE]
and then find such that
[TABLE]
Obviously, the numerical method above requires, at each time step, the solution of a tridiagonal linear system of algebraic equations.
1.3. An overview of the paper
In the error analysis of the (BRFD) method, we face the locally Lipschitz nonlinearity of the problem by introducing the (MBRFD) scheme (see Section 4.2), which follows from the (BRFD) method after molifying properly the terms with nonlinear structure (cf. [1], [9], [7]). The (MBRFD) approximations depend on a parameter and have the following key property: when their discrete -norm is bounded by , then they are also (BRFD) approximations, because, in that case, the molifier (see (4.1)) acts as an indentity. Assuming that is large enough and is sufficiently small, for the non computable (ΜBRFD) approximations, first we show that are well-defined (see Proposition 4.1), and then we establish an optimal, second order error estimate in the discrete -norm (see Theorem 4.2). Letting and be sufficiently small (see (4.58)) and applying a discrete Sobolev inequality (see (2.1)), the latter convergence result implies that the discrete norm of the (MBRFD) approximations are lower than and thus they, also, are (BRFD) approximations. Finally, we are show that the (BRFD) approximations are unique and hence inherit the convergence properties of the (MBRFD) scheme (see Theorem 4.3), i.e. that there exist constants and , independent of and , such that
[TABLE]
and
[TABLE]
where is a discrete norm which is stronger than the discrete norm.
At every time-step, the (BRFD) method computes first an approximation of at the midpoint of the current time interval (see (1.7) and (1.9)) and then an approximation of at the next time node (see (1.8) and (1.10)). However, the computation of the approximations of at the midpoints is a simple postprocessing procedure and has no obvious discrete dynamic structure. The stability argument we employ is based first on taking a discrete derivative of the error equation that corresponds to (1.9) (see (4.27)) and then on including the discrete and discrete norm of the time increment of the error in the stability norm (see (4.32) and (4.52)).
We close this section by giving a brief overview of the paper. In Section 2, we introduce additional notation and provide a series of auxiliary results. Section 3 is dedicated to the estimation of several type of consistency errors and of the approximation error of a discrete elliptic projection. In Section 4, we define a modified version of the (BRFD) method, and then analyze its convergence properties and arrive at a set of conditions that ensure the well-posedness and convergence of the (BRFD) method.
2. Preliminaries
Let us introduce another discrete space by and the discrete space derivative operator by
[TABLE]
We define on an inner product by for , and we will denote by the corresponding norm, i.e. for . Also, we define a discrete maximum norm on by for .
We provide with the discrete inner product given by for , and we shall denote by its induced norm, i.e. for . Also, we equip with a discrete -norm defined by for , and with a discrete -seminorm given by for . It is easily seen that becomes a norm when it is restricted on and satisfies the following useful inequalities:
[TABLE]
for . In the sequel, we present a series of auxiliary results that they will be in often use in the rest of the work.
Lemma 2.1**.**
For all it holds that
[TABLE]
Proof.
Let . First, we establish (2.3) proceeding as follows:
[TABLE]
Then, we set in (2.3) to get (2.4). ∎
Lemma 2.2**.**
Let . Then, for , it holds that
[TABLE]
*where and . *
Proof.
Let . First, we define , by and for and . Then, we use the mean value theorem, to conclude that
[TABLE]
where given by and for . Observing that
[TABLE]
and
[TABLE]
we, easily, arrive at
[TABLE]
Thus, (2.5) follows as a simple consequence of (2.6), (2.7) and (2.8). ∎
Lemma 2.3**.**
Let . Then, for , it holds that
[TABLE]
and
[TABLE]
where , ,
[TABLE]
and .
Proof.
Let . We simplify the notation, first, by defining , by and for , and then, by introducing by and by . Also, we set and .
[math] First, we use the definition of and the mean value theorem, to get
[TABLE]
and
[TABLE]
which, obviously, yields
[TABLE]
Next, we use the definition of and the mean value theorem, to obtain
[TABLE]
which, leads to
[TABLE]
Finally, for , we apply (2.5) and (2.2), to arrive at
[TABLE]
Observing that and using (2.14) we have
[TABLE]
[math] Using the mean value theorem, we obtain
[TABLE]
where , are defined by and . Thus, using (2.11) and (2.13), we have
[TABLE]
The desired inequality (2.9) follows, easily, as a simple outcome of (2.16) and (2.17).
[math] For the discrete derivative of and , we, easily, obtain the following formulas:
[TABLE]
for , which yield
[TABLE]
Using (2.18), (2.1), (2.11) and (2.12), we have
[TABLE]
Combining (2.18), (2.13), (2.15) and (2.1), we arrive at
[TABLE]
Finally, (2.10) follows, easily, in view of (2.16), (2.19) and (2.20). ∎
3. Consistency Errors
To simplify the notation, we set , , for , and for . In view of the Dirichlet boundary conditions (1.2) and the compatibility conditions (1.4), it holds that , for and for .
3.1. Time consistency error at the nodes
Let be defined by
[TABLE]
and let be specified by
[TABLE]
for . Assuming that the solution is smooth enough on , and using (1.4) and the Dirichlet boundary conditions (1.2), we conclude that for and . Thus, we have and for .
Substracting (1.1) with from (3.1), and (1.1) with from (3.2), we get
[TABLE]
where and be defined by
[TABLE]
and
[TABLE]
Applying the Taylor formula we obtain
[TABLE]
for and , and
[TABLE]
for . Then, from (3.3), (3.4) and (3.5), we arrive at
[TABLE]
and
[TABLE]
3.2. Space consistency error
Also, let be defined by
[TABLE]
and, for , let be given by
[TABLE]
Subtracting (3.10) from (3.1) and (3.11) from (3.2), we obtain
[TABLE]
The use of the Taylor formula yields
[TABLE]
for and , which along with (3.12) yields
[TABLE]
3.3. Time consistency error at the intermediate nodes
For , let be determined by
[TABLE]
Setting and using, again, the Taylor formula we have
[TABLE]
for and , which, easily, yields
[TABLE]
3.4. A Discrete Ellliptic Projection
Let . Then, we define (cf. [2]) by requiring
[TABLE]
Using the Taylor formula, it follows that
[TABLE]
where is defined by
[TABLE]
First, subtract (3.18) from (3.19) to get
[TABLE]
Then, take the inner product of both sides of (3.21) with and use (2.4), the Cauchy-Schwarz inequality and (2.2) to obtain
[TABLE]
Finally, we use (3.22) to have
[TABLE]
4. Convergence Analysis
4.1. A mollifier
For , let (cf. [7], [9]) be an odd fuction defined by
[TABLE]
where is the unique polynomial of that satisfies the following conditions:
[TABLE]
4.2. The (MBRFD) scheme
The modified version of the (BRFD) method (cf. [1], [7], [9]) is a recursive procedure that, for given , derives approximations of the solution performing the steps below.
Step 1: Let be defined by
[TABLE]
and be specified by
[TABLE]
Step 2: Define by
[TABLE]
and find such that
[TABLE]
Step 3: For , first define by
[TABLE]
and, then, find such that
[TABLE]
4.3. Existence and uniqueness of the (MBRFD) approximations
Proposition 4.1**.**
Let , and . When , then the modified (BRFD) approximations are well-defined.
Proof.
Let , and be a linear operator given by
[TABLE]
Since , the definition of yields that . Thus, from (4.3), (4.5) and (4.7) it is easily seen that the well-posedness of and follows easily by securing the invertibility of . Moving towards to this target, first we use (2.4) to obtain
[TABLE]
Let us assume that . When , then , which, along with (4.8), yields , or, equivalently, . The latter argument shows that and, thus, is invertible, since has finite dimension. ∎
Remark 4.1**.**
Let us assume that and . Since and is well-defined, in view of (4.3) and (1.6), we conclude that is, also, well-defined and .
4.4. Convergence of the (MBRFD) scheme
In the theorem below, we investigate the convergence properties of the modified (BRFD) approximations.
Theorem 4.2**.**
Let , , and , where is the constant specified in Proposition 4.1. Then, there exist constants , , and , independent of and , such that: if , then
[TABLE]
[TABLE]
and
[TABLE]
Proof.
To simplify the notation, we set , , for , and for . In the sequel, we will use the symbol to denote a generic constant that is independent of , and , and may changes value from one line to the other. Also, we will use the symbol to denote a generic constant that depends on but is independent of , , and may changes value from one line to the other.
Since , after subtracting (4.3) from (3.10) we obtain
[TABLE]
Next, take the inner product of (4.12) with , and then use (2.4), the Cauchy-Schwarz inequality, (3.3), (3.6), (3.7), (3.13) and the arithmetic mean inequality to get
[TABLE]
Let and . Then, the inequality above yields that
[TABLE]
Taking the inner product of (4.12) with , and then using (2.4), we obtain
[TABLE]
where
[TABLE]
Now, we use the Cauchy-Schwarz inequality, the arithmetic mean inequality and (4.13), to have
[TABLE]
Also, (3.3), the Cauchy-Schwarz inequality, (2.3), (3.6), (3.9), (3.13) and the arithmetic mean inequality, yield
[TABLE]
In view of (4.14), (4.15) and (4.16), we arrive at
[TABLE]
which, obviously, yields (4.9).
Since , using (4.1), (4.4) and (4.13), we have
[TABLE]
Also, using Lemma 2.2, (2.2) and (4.17), we get
[TABLE]
We subtract (4.5) and (4.7) from (3.11), to obtain the following error equations:
[TABLE]
where
[TABLE]
We take the inner product of (4.20) with , and then, use (2.3), to have
[TABLE]
where
[TABLE]
Let . Using the Cauchy-Schwarz inequality, the arithmetic mean inequality, (3.6) and (3.13), we have
[TABLE]
Next, we use the Cauchy-Schwarz inequality, (2.2), (4.1) and the arithmetic mean inequality, to get
[TABLE]
Finally, taking into account that , we apply the Cauchy-Schwarz inequality, (4.1) and the arithmetic mean inequality to obtain
[TABLE]
From (4.21), (4.22), (4.23) and (4.24), we conclude that there exists a constant , such that
[TABLE]
Let us find an error equation governing the midpoint error . Subtracting (4.6) from (3.14) and using (4.1) and the assumption , we obtain
[TABLE]
which, easily, yields that
[TABLE]
where is defined by
[TABLE]
Then, we use (2.9), (4.1) and the mean value theorem, to get
[TABLE]
Taking the inner product of both sides of (4.27) with \tau\big{(}\boldsymbol{\theta}^{n}+\boldsymbol{\theta}^{n-2}\big{)}, and then using the Cauchy-Schwarz inequality, (4.29), (3.17) and (2.2), it follows that
[TABLE]
which, along with the application of the arithmetic mean inequality, yields
[TABLE]
Thus, from (4.25) and (4.30), we conclude that there exists a constant such that:
[TABLE]
where
[TABLE]
Assuming that with , a standard discrete Gronwall argument based on (4.31) yields
[TABLE]
Since , after setting in (4.25) and then using (4.18), we obtain
[TABLE]
Also, setting in (4.26) and then using (4.2), we get
[TABLE]
which, along with (4.18) and (3.16), yields
[TABLE]
Also, setting in (4.25), and then using (4.34) and (4.36), we have
[TABLE]
Thus, (4.33), (4.37), (4.36) and (4.18) yield
[TABLE]
Since , (4.10) follows, easily, from (4.32), (4.38) and (4.34).
Let us define and for . Then, using (4.5), (4.7), (3.2) and (3.18) we get
[TABLE]
where
[TABLE]
Take the inner product of (4.39) with , and then, use (2.4) and (2.3), to have
[TABLE]
where
[TABLE]
Let . Using the Cauchy-Schwarz inequality, the arithmetic mean inequality, (3.8) and (3.23), we have
[TABLE]
and
[TABLE]
Using, again, the Cauchy-Schwarz inequality and the arithmetic mean inequality, we get
[TABLE]
where
[TABLE]
Then, we use (4.1), (2.1), (4.10), (2.2), (2.5) and the assumption to get
[TABLE]
and
[TABLE]
Thus, (4.43), (4.44) and (4.45) yield
[TABLE]
From (4.40), (4.41), (4.42) and (4.46), we conclude that there exists a constant , such that
[TABLE]
Taking the inner product of both sides of (4.27) by , and using (2.3), the Cauchy-Schwarz inequality and (3.17), we have
[TABLE]
Using (4.28), (2.10), (2.2), (4.10) and (3.23), we get
[TABLE]
Then, (4.48), (4.49) and the arithmetic mean inequality, yield
[TABLE]
Combining (4.47) and (4.50), we conclude that there exists a positive constant such that:
[TABLE]
where
[TABLE]
Assuming that , where , and using a standard discrete Gronwall argument based on (4.51), we obtain
[TABLE]
After setting in (4.47) and then using (4.19) and (3.21), we obtain
[TABLE]
Using (4.35), (4.19) and (3.16), we have
[TABLE]
Set in (4.47) to conclude that
[TABLE]
which, along with, (4.54) and (4.55), yields
[TABLE]
Thus, from (4.53), (4.56), (4.55) and (4.19), we obtain
[TABLE]
Finally, (4.11) follows, easily, from (4.52) and (4.57). ∎
4.5. Convergence of the (BRFD) method
Theorem 4.3**.**
Let , , , be the constant determined in Proposition 4.1, , , and be the constants specified in Theorem 4.2, where . If
[TABLE]
then, the method (BRFD) is well-defined and the following error estimates hold
[TABLE]
and
[TABLE]
Proof.
Since , the convergence estimates (4.9) and (4.11), the discrete Sobolev inequality (2.1) and the mesh size conditions (4.58) imply that the (MBRFD) are well-defined and
[TABLE]
and
[TABLE]
which, along with (4.1), yield
[TABLE]
Thus, the (MBRFD) approximations are (BRFD) approximations when , i.e. (1.5)-(1.10) hold after replacing by , by for , and by for .
Let , and be approximations derived by the (BRFD) method. Then, we introduce the errors , for , and for . Since and , Remark 4.1 and (1.5) yield , and . Now, we assume that for a given it holds that and . Subracting (1.10) from (4.7) (or (1.8) from (4.5) when ), and then using (4.61), we obtain
[TABLE]
Next, taking the inner product with and then using (2.4), the Cauchy-Schwarz inequality, (4.1) and the definion of , we get
[TABLE]
which, obviously, yields that . When , observing that
[TABLE]
we arrive at . The induction argument above, shows that, under our assumptions the (BRFD) approximations are those derived from of the (MBRFD) scheme when , and thus the error estimates (4.59) and (4.60) follow as a natural outcome of (4.9), (4.10) and (4.11). ∎
Remark 4.2**.**
Let us make the choice (see [4], [5]) instead of (1.7). Then, we obtain , and . Thus, from (4.33) we arrive at a suboptimal error estimate of the form . Here, we skip the problem by introducing (1.6) (cf. [9]) that derives a higher order approximation of .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] G. D. Akrivis, Finite difference discretization of the cubic Schrödinger equation , IMA J. Numer. Anal. 13 (1993), 115-124.
- 2[2] G. D. Akrivis and V. A. Dougalis, Finite difference discretizations of some initial and boundary value problems with interface , Math. Comp. 56 (1991), 505-522.
- 3[3] X. Antoine, W. Bao and C. Besse, Computational methods for the dynamics of the nonlinear Schrödinger/Gross-Pitaevskii equations , Comput. Phys. Commun. 184 (2013), 2621-2633.
- 4[4] C. Besse, Schéma de relaxation pour l’ équation de Schrödinger non linéaire et les systèmes de Davey et Stewartson , C. R. Acad. Sci. Paris Sér. I 326 (1998), 1427-1432.
- 5[5] C. Besse, A relaxation scheme for the nonlinear Schrödinger equation , SIAM J. Numer. Anal. 42 (2004), 934-952.
- 6[6] P. Henning and J. Wärnegård, Numerical comparison of mass-conservative schemes for the Gross-Pitaevskii equation , ar Xiv:1804.10547 (2018).
- 7[7] O. Karakashian and Ch. Makridakis, A space-time finite element method for the nonlinear Schrödinger equation: The discontinuous Galerkin method , Math. Comp. 67 (1998), 479-499.
- 8[8] T. Katsaounis and D. Mitsotakis, On the reflection of solitons of the cubic nonlinear Schrödinger equation , Mathematical Methods in the Applied Sciences 41 (2018), 1013-1018.
