Arbitrary high order A-stable and B-convergent numerical methods for ODEs via deferred correction
Saint-Cyr E.R. Koyaguerebo-Ime, Yves Bourgault

TL;DR
This paper develops a sequence of high-order, A-stable, B-convergent numerical methods for solving first-order ODEs using recursive deferred correction schemes based on the implicit midpoint method.
Contribution
It introduces a new family of high-order deferred correction schemes that are A-stable and B-convergent, with proven order enhancement through correction steps.
Findings
Numerical experiments confirm high accuracy of schemes DC2 to DC10.
Theoretical order of accuracy is achieved in practice.
Schemes demonstrate satisfactory stability on stiff and non-stiff ODEs.
Abstract
This paper presents a sequence of deferred correction (DC) schemes built recursively from the implicit midpoint scheme for the numerical solution of general first order ordinary differential equations (ODEs). It is proven that each scheme is A-stable, satisfies a B-convergence property, and that the correction on a scheme DC(2j) of order 2j of accuracy leads to a scheme DC2j+2 of order 2j+2. The order of accuracy is guaranteed by a deferred correction condition. Numerical experiments with standard stiff and non-stiff ODEs are performed with the DC2, ..., DC10 schemes. The results show a high accuracy of the method. The theoretical orders of accuracy are achieved together with a satisfactory stability.
| 1 | ||||||||
| 2 | ||||||||
| 3 | ||||||||
| 4 |
| DC2 | DC4 | DC6 | DC8 | DC10 | |
| 1 | 0.18 | 1.7e-2 | 1.8e-4 | 2.3e-4 | 1.3e-4 |
| 2.03e-3 | 3.71e-2 (0.26) | 6.16e-4 (0.53) | 7.14e-5 (0.14) | 1.47e-6 (0.81) | 9.42e-7 (0.79) |
| 1.00e-4 | 1.92e-3 (0.98) | 2.93e-5 (1.01) | 4.31e-6 (0.93) | 3.72e-7 (0.45) | 5.78e-8 (0.94) |
| 1.00e-5 | 2.22e-5 (1.94) | 1.30e-7 (2.35) | 3.92e-9 (3.04) | 1.9e-10 (3.27) | 1.1e-11 (3.73) |
| 5.00e-6 | 5.55e-6 (2.0) | 1.04e-8 (3.70) | 1.4e-10 (3.70) | 4.4e-12 (5.50) | 4.4e-13 (4.64) |
| 3.33e-6 | 2.46e-6 (1.99) | 2.59e-9 (3.33) | 1.6e-11 (5.31) | 4.5e-13 (5.63) | 2.0e-13 (2.02) |
| 2.25e-6 | 1.39e-6 (1.99) | 8.7e-10 (3.79) | 3.3e-12 (5.54) | 4.2e-13 (0.16) | 4.2e-13 (-2.66) |
| BDF2 | BDF4 | BDF6 | RK4 | ||
| 1 | 0.14 | 0.83 | 6.1e-2 | – | |
| 2.03e-3 | 4.3e-2 (0.19) | 2.5e-2 (0.19) | 1.9e-3 (0.19) | – | |
| 1.00e-4 | 6.61e-3 (0.62) | 2.98e-3 (0.71) | 1.79e-3 (0.79) | 1.27e-3 | |
| 1.00e-5 | 2.59e-4 (1.41) | 1.92e-5 (2.19) | 3.15e-6 (2.76) | 4.91e-8 (4.41) | |
| 5.00e-6 | 7.29e-5 (1.91) | 1.92e-6 (3.58) | 1.35e-7 (5.11) | 2.53e-9 (4.28) |
| DC2 | DC4 | DC6 | DC8 | DC10 | |
| 5.00e-2 | 3418 | 456.26 | 42.665 | 3.2350 | 0.2132 |
| 2.50e-2 | 790.2 (2.1) | 25.351 (4.2) | 0.5959 (6.2) | 1.17e-2 (8.1) | 1.9e-4 (10.1) |
| 1.25e-2 | 193.8 (2.0) | 1.5493 (4.0) | 9.17e-3 (6.0) | 5.28e-5 (7.8) | 2.79e-6 (6.1) |
| 6.25e-3 | 48.23 (2.0) | 9.67e-2 (4.0) | 1.4e-4 (5.99) | 2.78e-6 (0.0) | 2.78e-6 (0.0) |
| 1.56e-3 | 3.010 (2.0) | 3.8e-4 (3.99) | 4.72e-6 (2.5) | 4.67e-6 (-0.3) | 4.7e-6 (-0.3) |
| BDF2 | BDF4 | BDF6 | rkf | stiff | |
| 1.56e-3 | 22026.46 | 14836.76 | 5578.40 | 22026.46 | 2636.00 |
| DC2 | DC4 | DC6 | DC8 | DC10 | |
| 2.000e-5 | 0.2152 | 6.51e-2 | 2.22e-2 | 8.00e-3 | 2.98e-3 |
| 5.000e-6 | 1.35e-2 (2) | 2.59e-4 (4) | 5.59e-6 (6) | 1.27e-7 (8) | 2.97e-9 (10) |
| 2.500e-6 | 3.38e-3 (2) | 1.62e-5 (4) | 8.74e-8 (6) | 4.9e-10 (8) | 2.9e-12 (10) |
| 1.250e-6 | 8.47e-4 (2) | 1.01e-6 (4) | 1.36e-9 (6) | 1.9e-12 (8) | 7.4e-14 (5.3) |
| 3.125e-7 | 5.29e-5 (2) | 4.00e-9 (4) | 3.6e-13 (6) | 7e-14 (2.4) | 6.3e-14 |
| 6.250e-8 | 2.11e-6 (2) | 6.3e-12 (4) | 6.02e-13 | 2.33e-13 | 1.19e-13 |
| BDF2 | BDF4 | BDF6 | rkf | stiff | |
| 1.25e-6 | 3.38e-3 | 7.94e-8 | 2.3e-12 | 2.36e-6 | 6.6e-10 |
| DC2 | DC4 | DC6 | DC8 | DC10 | |
| 100 | 2.79e-07 | 5.34e-08 | 8.31e-09 | 4.26e-09 | 1.04e-09 |
| 8.30e-12 | 9.68e-13 | 6.86e-14 | 6.14e-14 | 1.66e-14 | |
| 4.47e-13 | 5.31e-14 | 3.28e-15 | 3.40e-15 | 8.42e-16 | |
| 7.85e-12 | 9.14e-13 | 6.54e-14 | 5.81e-14 | 1.57e-14 | |
| 50 | 7.52e-08(1.89) | 1.02e-08(2.38) | 1.56e-09(2.41) | 8.53e-11(5.64) | 4.92e-11(4.41) |
| 1.96e-12(2.08) | 6.46e-14(3.90) | 3.16e-14(1.12) | 2.94e-15(4.38) | 5.07e-16(5.03) | |
| 1.07e-13(2.06) | 3.73e-15(3.83) | 1.61e-15(1.02) | 2.21e-16(3.94) | 9.78e-17(3.11) | |
| 1.86e-12(2.08) | 6.14e-14(3.89) | 3.00e-14(1.12) | 2.85D-15(4.35) | 4.09D-16(5.26) | |
| 10 | 3.16e-09(1.99) | 2.37e-11(4.03) | 5.26e-13(5.23) | 1.28e-14(6.72) | 4.51e-16(8.89) |
| 7.77e-14(1.99) | 2.79e-16(3.68) | 3.02e-18(5.74) | 1.15e-19(7.94) | 7.28e-21(8.09) | |
| 4.31e-15(1.97) | 7.08e-17(1.79) | 5.91e-17(0.24) | 6.27e-17(0.12) | 6.84e-17(0.09) | |
| 7.34e-14(1.99) | 3.20e-16(3.37) | 6.18e-17(1.79) | 6.28e-17(0.57) | 6.84e-17(0.11) | |
| BDF2 | BDF4 | BDF6 | RK4 | stiff | |
| 10 | 5.7e-8 | 6.6e-10 | 3.5e-11 | 2.03e-16 | 1.29e-16 |
| DC2 | DC4 | DC6 | DC8 | DC10 | |
|---|---|---|---|---|---|
| 0.5 | 3.63e-5 | 4.46e-6 | 2.08e-6 | 2.91e-6 | 3.09e-6 |
| 3.63e-5 | 4.46e-6 | 2.08e-6 | 2.91e-6 | 3.09e-6 | |
| 7.12e-5 | 4.37e-7 | 1.02e-7 | 4.12e-7 | 4.26e-7 | |
| 1/300 | 4.7e-9 (1.8) | 1.09e-9 (1.7) | 4.0e-10 (1.7) | 3.0e-10 (1.9) | 2.0e-10 (1.9) |
| 7.4e-9 (1.7) | 2.23e-8 (1.1) | 4.16e-8 (0.8) | 2.9e-8 (0.9) | 2.5e-8 (0.9) | |
| 4.7e-9 (1.9) | 2.12e-8 (0.6) | 4.12e-8 (0.6) | 2.8e-8 (0.5) | 2.5e-8 (0.6) | |
| 1/600 | 1.0e-9 (2.2) | 1.5e-10 (2.8) | 1.0e-12 (8.6) | 9.9e-13 (8.) | 7.5e-13 (8.2) |
| 5e-13 (14.) | 3.0e-14 (19.6) | 2.0e-16 (27.7) | 2.0e-16 (27.1) | 3.0e-16 (26.1) | |
| 1.0e-9 (2.2) | 1.5e-10 (7.1) | 1.0e-12 (15.3) | 9.9e-13 (15) | 4.0e-13 (15.8) | |
| 1/6000 | 9.24e-12 | 7.31e-14 | 1.48e-14 | 4.57e-14 | – |
| 5.38e-15 | 0. | 0. | 0. | – | |
| 9.25e-12 | 2.07e-13 | 1.36e-13 | 8.27e-14 | – | |
| BDF2 | BDF4 | BDF6 | RK4 | stiff | |
| 0.5 | 5.3e-4 | 3.6e-5 | 4.1e-6 | – | 7.76e-13 |
| 1/600 | 2.8e-6 | 1.2e-6 | 6.9e-7 | – | 7.28e-13 |
| DC2 | DC4 | DC6 | DC8 | DC10 | |
|---|---|---|---|---|---|
| 3.75e-5 | 3.0089 | 2.9999 | 2.9440 | 0.1838 | 3.12e-3 |
| 1322.9 | 1327.5 | 1320.6 | 197.79 | 3.26792 | |
| 1.50e-5 | 2.9769 (0) | 2.9999 (0) | 0.1080 (3.6) | 1.90e-4 (7.5) | 5.1e-5 (4.5) |
| 1333.3 (0) | 1330.3 (0) | 113.69 (2.7) | 0.18281 (7.6) | 5.1e-2 (4.5) | |
| 7.50e-6 | 2.8706 (0) | 2.6947 (0) | 1.60e-3 (6.0) | 1.74e-6 (6.7) | 1.27e-5 (1.9) |
| 1327.4 (0) | 1286.5 (0) | 1.6349 (6.1) | 1.80e-3 (6.7) | 1.29e-2 (1.9) | |
| 1.875e-6 | 0.74(0.9) | 0.339 (1.5) | 2.50e-7 (6.3) | – | 2.88e-7 (2.7) |
| 659. (0.5) | 373.2 (0.9) | 2.91e-4 (6.2) | – | 2.92e-4 (2.7) | |
| – | stiff | rkf | |||
| 2.16e-6 | 3.54e-2 | ||||
| 3.48e-3 | 64.76 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
∎
11institutetext: Saint-Cyr E.R. Koyaguerebo-Imé 22institutetext: Yves Bourgault††thanks: Department of Mathematics and Statistics, University of Ottawa, STEM Complex, 150 Louis-Pasteur Pvt, Ottawa, ON, Canada, K1N 6N5, Tel.: +613-562-5800x2013 (22email: [email protected], 22email: [email protected]
Arbitrary high order A-stable and B-convergent numerical methods for ODEs via deferred correction††thanks: The authors would like to acknowledge the financial support of the Discovery Grant Program of the Natural Sciences and Engineering Research Council of Canada (NSERC) and a scholarship to the first author from the NSERC CREATE program “Génie par la Simulation”.
Saint-Cyr E.R. Koyaguerebo-Imé
Yves Bourgault
(Received: date / Accepted: date)
Abstract
This paper presents a sequence of deferred correction (DC) schemes built recursively from the implicit midpoint scheme for the numerical solution of general first order ordinary differential equations (ODEs). It is proven that each scheme is A-stable, satisfies a B-convergence property, and that the correction on a scheme DC2j of order 2j of accuracy leads to a scheme DC2j+2 of order 2j+2. The order of accuracy is guaranteed by a deferred correction condition. Numerical experiments with standard stiff and non-stiff ODEs are performed with the DC2, …, DC10 schemes. The results show a high accuracy of the method. The theoretical orders of accuracy are achieved together with a satisfactory stability.
Keywords:
Ordinary differential equations high order time-stepping methodsdeferred correctionA-stability
MSC:
MSC 65B05 65L04 65L05 65L12 65L20
††journal: BIT
1 Introduction
In MR2058857 ; kress2002deferred , Gustafsson and Kress introduced a new version of deferred correction (DC) strategy for the numerical solution of linear systems of ordinary differential equations (ODE) MR2058857 and initial boundary value problems kress2002deferred , under a monotonicity condition. Numerical experiments with one-dimensional linear parabolic and hyperbolic equations were performed and showed that the method is effective (orders 2, 4 and 6 of accuracy are achieved). We propose to extend the method from MR2058857 ; kress2002deferred to the time-discretization of more general time-evolution partial differential equations (PDEs). In this paper, we restrict to the case of the initial value problem (IVP)
[TABLE]
where the unknown is from into a Banach space , is a given data and is a sufficiently differentiable function such that exists and is sufficiently differentiable. The main objective is to show the properties of the numerical method (consistency, stability, convergence and order of accuracy). A complete analysis of the DC method applied to reaction-diffusion equations leads to an arbitrary high order and unconditionally stable method (see koyaguerebo2020unconditionally ).
The DC method is used to improve the order of accuracy of numerical methods of lower order. This method is explored by many authors, e.g. schild1990gaussian ; auzinger2016encyclopedia ; MR2058857 ; kushnir2012highly ; hansen2011order ; dutt2000spectral ; daniel1967interated ; IntegralDC2010 . The method in daniel1967interated is an application of iterative deferred correction (IDC). The authors proved that an asymptotic improvement of order can be accomplished, from a scheme of order , at each step of the IDC procedure, provided suitable finite difference operators are employed. Numerical experiments are performed with the IDC applied to the trapezoidal rule, Taylor-2 and Adams-Bashforth of order 2. The results are promising even though they point out some difficulties of the proposed algorithms: inaccuracy for “large” time step and no asymptotic improvement for high levels of correction. The approaches in kushnir2012highly ; hansen2011order ; dutt2000spectral ; auzinger2016encyclopedia ; MR2058857 ; IntegralDC2010 are quite similar and consist in a linear perturbation of a low order scheme. However, solving stiff problems (problems extremely hard to solve by standard explicit methods spijker1996stiffness ) is a challenge unfavorable for these methods. In particular, the method in kushnir2012highly , concerning a highly accurate solver for stiff ODEs, requires sufficiently small time steps for moderately stiff problems while convergence is reduced to order 2 for “very stiff” problems.
Our schemes are based on nonlinear perturbations (corrections) of the implicit midpoint rule and inherit the A-stable property of the trapezoidal rule MR0170477 at any stage of the correction. Starting from an approximation of the exact solution by the implicit midpoint rule on a uniform partition of , at the stage of the correction we obtain an approximation of , expected to be of order of accuracy, on the same partition. Each approximate solution to be corrected is subject to a deferred correction condition (DCC) which guarantees the improvement of the order of accuracy. We prove that if satisfies the DCC and its correction converges to at the discrete points (or is simply bounded, when is finite dimensional) then approximates with order . Moreover, provided the function is Lipschitz with respect to its second variable or satisfies a one-sided Lipschitz condition, each satisfies the DCC and then converges with order of accuracy, for arbitrary positive integer . We also prove that each DC scheme involving is -stable. The theory is illustrated by numerical tests, for the schemes of order 2, 4, …, 10.
The paper is organized as follows: in section 2 we recall some basic results from finite difference approximations and present the DC schemes; section 3 deals with the consistency of the method; the analysis of convergence and order of accuracy together with a B-convergence result are given in section 4; absolute stability is proved is section 5, and section 6 is devoted to numerical experiments.
2 Deferred correction schemes for the implicit midpoint rule
We suppose that , for a positive integer , so that (1) has a unique solution . We simply denote by , the norm in the Banach space . For a time step , we denote and , for each integer . This implies that . We consider the time steps such that is a partition of , for a non-negative integer . The centered, forward and backward difference operators , and , respectively, related to and applied to , are defined as follows:
[TABLE]
[TABLE]
and
[TABLE]
The average operator is denoted by :
[TABLE]
The composition of and is defined recursively. They commute, that is , and satisfy the identities
[TABLE]
and
[TABLE]
for each integer such that . We have the estimate
[TABLE]
provided and (see (isaacson1966analysis, , p.249) or koyaguerebo2020finite ).
If is a sequence of approximation of at the discrete points , the finite difference operators apply to , and we define
[TABLE]
and
[TABLE]
From the centered finite difference approximation (see (koyaguerebo2020finite, , Thm 5) or hildebrand1974introduction ; chung2010computational ; dahlquist2008numerical ) we have
[TABLE]
and
[TABLE]
for each integer . These approximations lead to the schemes
[TABLE]
The schemes (7) are multi-steps and prone to stability restrictions. We resort to DC method to transform them into a sequence of one step schemes as follows: For , we have the implicit midpoint rule
[TABLE]
For ,
[TABLE]
[TABLE]
The scheme (LABEL:a27)-(10) has unknowns , , and is deduced from (7) by substituting the unknown under the summation symbols by . The index indicates that is expected to be an approximation of the exact solution with order of accuracy. We call the schemes (LABEL:a27)-(10) Deferred Correction of order for the implicit midpoint rule, denoted DC2j+2.
Remark 1
The scheme (LABEL:a27)-(10), for , should involve unknowns which represent approximate solutions of (1) at the discrete points , respectively. To avoid those approximations for , we propose the following scheme which is efficient for the computation of , using only points within the solution interval .
[TABLE]
[TABLE]
The finite difference operator in (11) are related to the time step . The approximations and are computed from the same scheme, (8) or (LABEL:a27)-(10), but for the time steps and , respectively. The scheme (11) results from the finite difference approximations
[TABLE]
and
[TABLE]
where , with , for . Table 1 gives the coefficients for .
Remark 2
Each , , is an iterative solution of the system
[TABLE]
where is the unknown, and and are constants depending on and . The total number of vectors (in the solution space ) stored for the computation of is : and the , for , and .
Remark 3
From Remark 2, only the implicit midpoint rule, DC2, is an implicit Runge-Kutta (RK) methods. Starting with DC4, all the DC2j methods of the form (LABEL:a27)-(10) are not RK methods. For instance, depends on and some of the , which evolve independently and are not stages computed from . As we will see in Section 5, the analysis of A-stability, in particular the proof of lemma 3, shows that it is impossible to write a recurrence from (LABEL:a27) when , as one would get by applying any RK method to Dahlquist equation. This is the main ingredient behind the A-stability of our DC2j methods independently of the order of accuracy.
3 Deferred correction condition (DCC)
In this section we give a sufficient condition for the scheme (LABEL:a27)-(10) to achieve order of accuracy. Hereafter, the letter will denote any constant independent from , and that can be calculated explicitly in terms of known quantities. The exact value of may change. We have the following definition:
Definition 1
(Deferred Correction Condition) Let be the exact solution of the Cauchy problem (1). Given a positive integer , a sequence of approximations of , at the discrete points , is said to satisfy the Deferred Correction Condition for the implicit midpoint rule if approximates with order of accuracy, and we have
[TABLE]
for and , where is fixed and is a constant independent from .
Remark 4
Condition (16) is equivalent to
[TABLE]
and
[TABLE]
for . This is due to the transform
[TABLE]
and a similar transform for .
We have the following result:
Theorem 3.1
Let be the exact solution of (1) and , , a sequence of approximations of satisfying DCC for the implicit midpoint rule. Let be the solution of (LABEL:a27)-(10) built from . We suppose that are given and satisfy
[TABLE]
where is a constant independent from . Furthermore, we suppose that one of the following four conditions holds:
* is Lipschitz with respect to the second variable : there exists such that*
[TABLE]
* is finite dimensional, and remains close to in the sense that there exists such that*
[TABLE]
* is infinite dimensional, and converges to the exact solution .*
* is a Hilbert space with inner product , and satisfies the following so-called one-sided Lipschitz condition, with a one-sided Lipschitz constant :*
[TABLE]
Then approximates with order of accuracy, that is
[TABLE]
where is a constant depending only on , , DCC, a Lipschitz constant on and the derivatives of up to order , for time steps sufficiently small.
Proof
First we consider the case where the function is Lipschitz with respect to the second variable . Combining (1) and (LABEL:a27), we obtain the identity
[TABLE]
where and are finite difference operators defined for arbitrary integer by
[TABLE]
and
[TABLE]
provided exists for . We have defined
[TABLE]
and
[TABLE]
From (5) we have
[TABLE]
and, since is differentiable and is sufficiently regular, we deduce from the mean value theorem and the approximation (6) that
[TABLE]
for each , where is a constant depending only on , , a Lipschitz constant from and the derivatives of up to order . The last two inequalities imply that
[TABLE]
Since the sequence satisfies DCC, from Remark 4 we have
[TABLE]
From the Lipschitz condition on we have
[TABLE]
Substituting inequalities (26)-(28) in the identity (24), we deduce that
[TABLE]
and it follows from the triangle inequality that
[TABLE]
for . We then deduce by induction on that
[TABLE]
From hypothesis (19) and the DCC we have
[TABLE]
where is a constant independent from . Moreover, the sequence is bounded above by , for . Whence
[TABLE]
Finally, by the triangle inequality, identity (25) and DCC, we have
[TABLE]
where is a constant depending only on , , the DCC constant, and the derivatives of up to order .
Suppose that satisfies (21) and is finite dimensional. We can write
[TABLE]
From (21) and the DCC there exists such that implies
[TABLE]
On the other hand, we have
[TABLE]
where
[TABLE]
It follows (28) for
[TABLE]
Since is differentiable and the set is compact in the finite dimensional linear space , the supremum exists and is finite. The theorem is then deduced from the case (i).
If converges to the exact solution , taking the DDC and the finite difference formula (6) into account, we have
[TABLE]
It follows from the continuity of that there exists such that implies
[TABLE]
The theorem, in this case, follows by taking in (i).
Here we consider the case where is a Hilbert space and satisfies the monotonicity condition (22). Then, taking the inner product of the identity (24) with , we deduce the inequality
[TABLE]
since, according to (22), we have
[TABLE]
Inequalities (26)-(27) together with the Cauchy-Schwartz inequality yield
[TABLE]
and
[TABLE]
where is a constant depending only on , , a Lipschitz constant on and the derivatives of up to order . Substituting the last three inequalities into (33), we obtain
[TABLE]
and we deduce from the identity
[TABLE]
and the inequality
[TABLE]
that
[TABLE]
The conclusion follows from the case (i), for .
Remark 5
Theorem 3.1 shows that the correction may be applied for any other scheme satisfying DCC.
4 Convergence and order of accuracy
The aim of this section is to prove the following theorem:
Theorem 4.1
Let be the exact solution of the problem (1). Suppose that one of the four conditions (i)-(iv) of Theorem 3.1 holds, with condition (ii) or (iii) holding for all . Then each sequence , , solution of the scheme (8) or (LABEL:a27)-(10), approximates with order of accuracy. Furthermore, we have the estimate
[TABLE]
for and , where is a constant depending only on , , and the derivatives of and up to order and , respectively.
To prove this theorem we need Theorem 3.1 and the the following lemma:
Lemma 1
Let be the solution of the scheme (8). Suppose that one of the conditions (i), (iii) or (iv) of Theorem 3.1 holds, or is bounded in the sense of the condition (ii) of this theorem. Then approximates with order 2 of accuracy, and we have the inequality
[TABLE]
for and , where is a constant depending only on , , and the derivatives of and up to order and , respectively.
Proof (Proof of Lemma 1)
For the sake of simplification we suppose that . The general case can be handled by transforming (1) to an autonomous system. From the hypotheses of the Lemma, Theorem 3.1 implies that approximates with order two of accuracy:
[TABLE]
where is a constant depending only on , and the derivatives of up to order 3. To establish (35) we proceed by induction on the integer .
Inequality (35) for .
As in Theorem 3.1, we combine (1) and (8) and deduce the identity
[TABLE]
where
[TABLE]
and
[TABLE]
From Taylor’s formula with integral remainder and the estimate (4), there exists a function such that
[TABLE]
with
[TABLE]
for each nonnegative integers and such that , where is a constant depending only on , , and the derivatives of up to order . We can write
[TABLE]
where
[TABLE]
The last identities substituted into (37) yield
[TABLE]
Proceeding as in Theorem 3.1, we deduce from (36) and the regularity of that
[TABLE]
Therefore, taking the norm on both sides of (39), we deduce by the triangle inequality and the inequalities (36) and (38), for , that
[TABLE]
where is a constant depending only on and the derivatives of and up to order 3 and 1, respectively. The last inequality combined with (36) implies that (35) holds for .
Here we are going to prove that inequality (35) remains true for , assuming that it holds for an arbitrary integer such that .
We apply to (39) and obtain
[TABLE]
where we set
[TABLE]
The main difficulty is to bound . We have
[TABLE]
[TABLE]
where , and
[TABLE]
It follows the general formula
[TABLE]
where , and is a linear combination, with properly chosen coefficients, of the quantities
[TABLE]
where with , for . From (42) and (36) we have
[TABLE]
and we deduce that there exists such that implies
[TABLE]
where is a constant depending only on , , and the derivatives of and up to order and , respectively. From the inductions hypothesis (35) and inequality (4) we have
[TABLE]
and
[TABLE]
where is a constant depending only on , , and the derivatives of and up to order and , respectively. Each being multilinear continuous, we deduce from (44)-(46) and the relation , for , that
[TABLE]
It follows by the triangle inequality that (43) for yields
[TABLE]
for , where is a constant depending only on , , and the derivatives of and up to order and , respectively . Passing to the norm in identity (41), we deduce from (38) and the last inequality that
[TABLE]
Otherwise, applying to (41), inequalities (44)-(46) and (47) yield
[TABLE]
for , where is a constant depending only on , , and the derivatives of and up to order and , respectively. Therefore, passing to the norm in the identity obtained by applying to (41), we deduce from (41) and the last inequality that
[TABLE]
for , with the constant depending only on , , and the derivatives of and up to order and , respectively. Inequalities (47) and (48) imply that the induction hypothesis is also true for , and we deduce that (35) is true for each integer .
Proof (Proof of Theorem 4.1)
We proceed by induction on . The case is immediate from Lemma 1. Suppose that approximates with order of accuracy and satisfies (34), for an arbitrary such that . We are going to prove that approximates with order of accuracy and (34) holds substituting by .
From the induction hypothesis, satisfies DCC. Because and are computed from the same scheme DC2j, but for different time steps, also satisfies DCC. Therefore, as in 29, Theorem 3.1 applied to the approximation , built from , yields
[TABLE]
where
[TABLE]
According to the DCC and the condition , we have
[TABLE]
By the triangle inequality and the DCC, the last two inequalities yield
[TABLE]
From the DCC on and the inequality (49), Theorem 3.1 again implies that approximates the exact solution with order of accuracy. Therefore, it is enough to establish (34) for , . To this end we rewrite identity (24) as follows
[TABLE]
with
[TABLE]
where and are as in Theorem 3.1. Proceeding as in Lemma 1 and taking the finite difference formulae (5) and (6) into account, we can write
[TABLE]
where
[TABLE]
is a constant depending only on , , and the derivatives of and . According to the inequality (34) from the induction hypothesis, we may write
[TABLE]
where
[TABLE]
Therefore, writing (50) as follows
[TABLE]
with
[TABLE]
the induction hypothesis and the reasoning from Lemma 1, substituting the functions and , respectively, by and , by , and by , yields
[TABLE]
for and , where is a constant depending only on , , and the derivatives of and up to order and , respectively. Inequality (34) holds for by the triangle inequality from the last inequality.
We end this section by the following corollary that gives an important convergence property of the DC method. This property is useful for a time-stepping method to solve stiff and large dimensional differential equations arising from the space discretization of time-dependent PDEs.
Corollary 1
Suppose that the function is from , for a positive integer , and satisfies the one-sided Lipschitz condition (22). Then, each approximate solution from satisfies the inequality
[TABLE]
where is a constant independent from any global Lipschitz constant on , and either for or for .
Proof
From the regularity assumption on and and the one sided-Lipschitz condition, we deduce from Theorem 4.1 that each , , satisfies DCC. Therefore, inequality (51) is immediate from the part (4) of Theorem 3.1. The constant depends only on the derivatives of up to order and, according to (31)-(32) and the mean value theorem, on the bound of the Jacobian on the compact set .
Remark 6
The convergence property satisfied by the schemes in Corollary 1 is in fact -convergence (see, e.g., frank1981concept ; kraaijevanger1985b ) since the constant of the global error in (51) is independent from any global Lipschitz constant of the function . Nevertheless, since in the definition of -convergence the constant depends on high order derivatives of the exact solution , the identity
[TABLE]
can make any requirement on the independence of the constant with respect to somewhat artificial. The numerical test on Bernoulli ODE in Section 6 gives an application of Corollary 1.
Remark 7
In practice, from part 4 of the proof of Theorem 3.1, the global error for an approximate solution of the IVP (1) under the one-sided Lipschitz condition (22) by a DC2j+2 method, , takes the form
[TABLE]
for . The constant depends on the derivative of the exact solution of order and can be very large in magnitude. However, if and is not too small, the factor is sufficiently small such that , leading to very accurate approximate solutions for large time steps . Nevertheless, independently of the sign of , when is sufficiently small in the asymptotic region , where is the global Lipschitz constant of , becomes closed to 1, for example when , so that only must dominate the constant . Consequently, a non B-convergent method can be competitive with respect to a B-convergent one for sufficiently small time steps. This situation will be illustrated by the Bernoulli ODE in Section 6.
5 Absolute stability
In this section we prove the absolute stability of the DC schemes. The notion of absolute stability is introduced by Dahlquist MR0170477 to characterize methods able to solve stiff ODEs. Considering the following IVP,
[TABLE]
where is a complex number, we have the following definition (see quarteroni2010 ; MR0170477 ):
Definition 2
A numerical method is said to be absolutely stable if the corresponding solution for the problem (53) for fixed and some is such that
[TABLE]
The region of absolute stability of a numerical method is defined as the subset of the complex plane
[TABLE]
If , , the numerical method is said to be A-stable.
Before establishing absolute stability results for the deferred correction schemes (8) and (LABEL:a27)-(10), we recall the following result.
Lemma 2 ( see (tuenter2006frobenius, , formula (6)) )
Let be a polynomial of degree in one variable. Then the sum is a polynomial of degree in the variable .
Lemma 3
Suppose that and in the initial value problem (1), where is a complex number with negative real part (). Then the corresponding approximate solutions from the schemes (8) and (LABEL:a27)-(10) can be written as follows
[TABLE]
where is a polynomial of degree in the variable .
Proof
We suppose that , otherwise we trivially have , for . Since , we can rewrite (LABEL:a27) as follows
[TABLE]
where, according to formulae (2) and (3), we have
[TABLE]
and
[TABLE]
Combining the last three identities, we deduce that
[TABLE]
where is affine in . Under the hypothesis of the lemma, (8) matches the trapezoidal rule, and we have
[TABLE]
that is (56) is true for . Suppose that (56) holds for an arbitrary integer . From (57) we have
[TABLE]
with , and, substituting each by the formula given by the induction hypothesis (56), we deduce that
[TABLE]
where
[TABLE]
It follows that
[TABLE]
It is clear that is a polynomial of degree in the variable as . Therefore, according to the Lemma 2, is a polynomial of degree in the variable . Whence,
[TABLE]
where
[TABLE]
is a polynomial of degree in the variable . We then deduce by induction that the lemma is true for arbitrary non-negative integer .
Theorem 5.1
Each of the deferred correction schemes (8) and (LABEL:a27)-(10) is A-stable.
Proof
From Lemma 3 we have, for ,
[TABLE]
since, under the condition , we have .
6 Numerical experiments
In this section we evaluate the accuracy and order of convergence of the schemes , implemented using the Scilab programming language. The starting values are computed using the scheme (11)-(12).
We choose six standard problems for the evaluation. The first problem concerns -convergence by considering a Bernoulli equation. The second problem is about long term integration with an oscillatory solution of large amplitude. The four other problems are about stiffness. The third and fourth problems (B5 modified and E5, respectively) both involve complex eigenvalues of negative real parts, where the imaginary parts of the eigenvalues for the third problem have larger magnitudes while those from the fourth problem have smaller magnitudes. The fifth problem (Robertson) is nonlinear and stiff with real negative eigenvalues, and it also addresses B-convergence. The sixth problem is the van der Pol oscillator, which is stiff with arbitrary complex eigenvalues.
The first three problems have analytic solutions. For problems (61), (62) and (63) that do not have an analytic solution, we consider a small time step such that the approximate solutions with are almost identical (to machine precision for problem (62)), and we choose one of the approximate solutions as reference solution.
For solutions , , the absolute error on the approximate solutions , , is computed with the norm
[TABLE]
For very large we extract solutions at or discrete times evenly spread over the interval .
For a comparison of accuracy, we implement in Scilab the backward differentiation formulae (BDF) of order 2, 4 and 6, and the explicit Runge-Kutta (RK) of order 4. The implemented BDF are run with exact starting values for the first three problems that have analytic solutions, while for problems four and five the starting values are provided by the function stiff (implementing BDF with adaptive steps) of the solver ode from Scilab. For the van der Pol oscillator, the comparison of our DC methods is done only with the solutions from stiff and rkf from the solver ode. For each of the problems, except the first one, we give a table of absolute errors and orders of convergence for pairs of two consecutive time steps, for the approximate solutions with the DC methods. We denote by the maximal time step allowed to compute an approximate solution with the solver stiff or rkf (see enright1975comparing for a discussion on maximal time steps).
6.1 Bernoulli differential equation
[TABLE]
Table 2 gives the absolute error and the order of convergence for each pair of consecutive time steps, in the case of DC, BDF and RK4 methods. The dash for RK4 indicates that the method is unstable for the corresponding time steps.
This problem addresses -convergence since the function is one-sided Lipschitz with , when positive solutions are considered. Moreover, the problem is strongly nonlinear with exponentially increasing magnitude of derivatives of the right side function . Such derivatives of large magnitude generally limit the accuracy of high order methods that are not B-convergent. The one-sided Lipschitz constant being negative, in accordance with Corollary 1, DC methods provide very accurate approximate solutions for large time steps, and their accuracy increases with the order of the method. However, the convergence of the DC methods is suboptimal, due to the effect of the strong nonlinearity of the ODE. While and almost achieve their proper order for , the order of convergence of and are not observed since these methods quickly achieve machine accuracy. In fact, DC10 achieves order 8.05 of convergence for to . BDF methods are stable for large time steps, but they are less accurate than their corresponding DC methods. RK4 is completely unstable for . For sufficiently small time steps in the asymptotic region, RK4 is more accurate than DC4 and any of the BDF methods, as stated in Remark 7, while DC6-10 achieve better accuracy.
6.2 Oscillatory problem hull1972comparing
[TABLE]
The exact solution is . The original problem is set with in hull1972comparing . The author in karouma2015class solved this problem with Runge-Kutta methods of orders 4 and 8, for and , to “illustrate the need of higher order methods when a long-term integration problem is considered”. Table 3 gives the absolute error and the order of convergence for each pair of consecutive time steps. The BDF methods are run only for the smallest time step. The solvers rkf and stiff use adaptive time stepping with a maximal time step and tolerances .
The magnitude of the exact solution of the modified oscillatory problem is large, resulting in a relatively large absolute error obtained by the DC schemes (absolute errors of about is possible for a good choice of stepsize). Moreover, the long term integration influences the accuracy of these schemes since they achieve absolute errors of about when the solution interval is reduced to . Nevertheless, each DC scheme converges with its proper order. The DC methods are considerably more accurate than standard methods (both with fixed and variable stepsizes) which are inaccurate for this problem. For instance, for BDF2 and rkf, the solutions remain bounded with bounds close to the maximal amplitude of the exact solution but the phase of the oscillation is completely wrong.
6.3 Problem B5 modified enright1975comparing , stiff with complex eigenvalues of negative real parts and larger (in magnitude) imaginary parts
[TABLE]
This problem, originally set with , is an illustration of ODEs resulting from a semi-discretization by finite element methods of parabolic PDEs stewart1990avoiding . We choose to make the problem a little more difficult. Table 4 gives the absolute errors for the first component of the approximate solutions which is similar for the second component. The absolute errors for the others components quickly achieve machine precision. The solvers stiff and rkf are run with and .
The imaginary parts of the Jacobian eigenvalues of the modified B5 problem are large. Even though the real parts of the eigenvalues are negative, we observe that smaller time steps are required by DC schemes to obtain accurate approximations. DC schemes achieve their proper order of convergence, but BDF methods perform better for this problem than DC schemes.
6.4 Problem E5enright1975comparing , stiff with complex eigenvalues of negative real parts and smaller (in magnitude) imaginary parts
[TABLE]
A reference solution is computed with for . The solution of this problem has small magnitude in and the eigenvalues of the Jacobian matrix along the solution curve belong to the region of the complex plane. Table 5 gives the absolute errors and order of convergence for the four components of the approximate solutions. For BDF, RK4 and stiff, the absolute errors are provided only for the first component. The absolute error on the other components is smaller by 2 (RK4) to 5 (stiff) orders of magnitude, as we should expect from the magnitude of the solution components. The implemented BDF methods are run with starting values deduced from the solver stiff. The implemented RK4 is unstable for time steps , and the absolute error is reported for in table 5. The solver stiff is run with and .
Imaginary parts of eigenvalues for the problem E5 are smaller, and larger time steps allow DC schemes to produce very accurate approximations, compared to the modified B5 problem. DC schemes perform better for this problem than BDF methods. They achieve their proper order of convergence but on a relatively small range of time steps, for higher order DC methods, since the solution is already very accurate for large time steps.
6.5 Robertson (1966) wanner1991solving , stiff with real negative eigenvalues
[TABLE]
This is one of the three problems considered as stiffest in wanner1991solving . We compute a reference solution with DC10 for the time step . The solution belongs to the region and the eigenvalues of the Jacobian dF(y) along the solution curve belong to . Table 6 gives absolute errors and orders of convergence of DC methods for each component of the solution. For other methods, we give only the maximal errors on the three components of the approximate solutions. The solver stiff is run with and . The solver rkf fails in solving this problem for various tolerances and , and Scilab reported: “it is likely that rkf45 is inefficient for solving this problem”. The implemented BDF methods are run with starting values deduced from the solver stiff using the preceding tolerances.
The Robertson problem is stiff and addresses B-convergence since its Jacobian matrix has real negative eigenvalues with some having large magnitude. For this problem, DC schemes produce accurate approximate solutions even for large time steps, and high order DC methods can be avoided (DC6 is enough). The convergence is slow for , but faster convergence happens for in the asymptotic region (). The DC schemes perform better than BDF methods at equal order and time step. A comparison of the errors for suggests that the error constants might be 3 to 5 orders of magnitude smaller for DC than BDF methods.
6.6 van der Pol oscillator enright1975comparing ; shampine1981evaluation , stiff with arbitrary complex eigenvalues
[TABLE]
This problem was initially proposed for and in enright1975comparing . The actual version results from a suggestion by Shampine shampine1981evaluation . We compute a reference solution with DC8 for . The solution belong to the region of the real plane and the eigenvalues along the solution curve belong to the region of the complex plane. Table 7 gives the absolute errors and orders of convergence. For rkf and stiff, we use and , .
The van der Pol oscillator is stiff and the solution has a large magnitude. DC6 and DC8 reached their order of convergence. This shows that the DC strategy works well in spite of the fact that DC2 and DC4 would require much smaller time steps to produce reasonably accurate solutions. The order of convergence for DC10 is not observed, though the solutions obtained are accurate.
6.7 Discussion of the numerical results
In general, a careful assessment of the proof of Theorem 3.1 points out to the fact that, for a system with complex eigenvalues , we only need a time step such that for a good accuracy (faster convergence happens when ). These situations are well illustrated by the test cases of Sections 6.3 and 6.4, where the required time step for accuracy is much smaller for modified B5 than E5. However, time steps such that , , is necessary for an asymptotic convergence with proper order. For example, in the case of the Bernoulli equation we have and . Large time steps provide accurate approximations (as expected from B-convergent methods), but asymptotic convergences are observed only for .
For the computational effort of the DC methods, we recall that to compute an approximate solution on discrete points , solves nonlinear systems while , , solves systems. In the case of the Bernoulli equation, for example, achieves the maximal error of about by solving approximately nonlinear systems while the maximal absolute error for is about for . We did not report any CPU time since our code is written in Scilab, an interpreted language. All methods that we implemented are consequently interpreted, while rkf and stiff provided with Scilab are compiled. Nevertheless, the main burden in implicit time-stepping solvers is the resolution of nonlinear systems, and we have shown that higher order DC methods give the most accurate approximations by solving fewer systems of equations. This gives a clue on the CPU time required and the efficiency of these methods. High order DC methods should be competitive in situations where using fully implicit methods is unavoidable.
7 Conclusions
We have presented a new approach of deferred correction methods for the numerical solution of general first order ordinary differential equations. Proofs for consistency, order of convergence and stability of the method are given, which rely on a recursive argument using a new deferred correction condition. The numerical experiments comply with the theory and show a high accuracy of the method and its satisfactory A-stable property and B-convergence. Globally, each DC scheme reaches its proper order of convergence and applies to any category of problem, providing accurate approximations for time steps not necessarily small. The accuracy of the DC schemes increases with the level of correction.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) Auzinger, W.: Encyclopedia of Applied and Computational Mathematics, chap. Defect Correction Methods, pp. 323–332. Springer, Berlin, Heidelberg (2015)
- 2(2) Christlieb, A., Ong, B., Qiu, J.M.: Integral deferred correction methods constructed with high order Runge-Kutta integrators. Math. Comp. 79 , 761–783 (2010)
- 3(3) Chung, T.: Computational Fluid Dynamics, 2nd edn. Cambridge university press (2010)
- 4(4) Dahlquist, G., Björck, A.k.: Numerical methods in scientific computing. Vol. I. SIAM, Philadelphia, PA (2008)
- 5(5) Dahlquist, G.G.: A special stability problem for linear multistep methods. Nordisk Tidskr. Informationsbehandling (BIT) 3 , 27–43 (1963)
- 6(6) Daniel, J.W., Pereyra, V., Schumaker, L.L.: Iterated deferred corrections for initial value problems. Acta Cient. Venezolana 19 , 128–135 (1968)
- 7(7) Dutt, A., Greengard, L., Rokhlin, V.: Spectral deferred correction methods for ordinary differential equations. BIT 40 , 241–266 (2000)
- 8(8) Enright, W.H., Hull, T., Lindberg, B.: Comparing numerical methods for stiff systems of ODE:s. BIT 15 , 1–48 (1975)
