Error analysis of randomized Runge-Kutta methods for differential equations with time-irregular coefficients
Raphael Kruse, Yue Wu

TL;DR
This paper provides an error analysis and convergence rates for two randomized Runge-Kutta methods applied to ODEs with irregular time-dependent coefficients, including those with singularities or weak regularity.
Contribution
It introduces precise error bounds and convergence rates for randomized Runge-Kutta schemes handling time-irregular coefficients in ODEs, extending applicability to Carathéodory and singular cases.
Findings
Derived $L^p$-norm error bounds for the methods.
Established almost sure convergence rates.
Validated results through numerical experiments.
Abstract
This paper contains an error analysis of two randomized explicit Runge-Kutta schemes for ordinary differential equations (ODEs) with time-irregular coefficient functions. In particular, the methods are applicable to ODEs of Carath\'eodory type, whose coefficient functions are only integrable with respect to the time variable but are not assumed to be continuous. A further field of application are ODEs with coefficient functions that contain weak singularities with respect to the time variable. The main result consists of precise bounds for the discretization error with respect to the -norm. In addition, convergence rates are also derived in the almost sure sense. An important ingredient in the analysis are corresponding error bounds for the randomized Riemann sum quadrature rule. The theoretical results are illustrated through a few numerical experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Error analysis of randomized Runge-Kutta methods
for differential equations with time-irregular coefficients
Raphael Kruse
Raphael Kruse
Technische Universität Berlin
Institut für Mathematik, Secr. MA 5-3
Straße des 17. Juni 136
DE-10623 Berlin
Germany
and
Yue Wu
Yue Wu
Technische Universität Berlin
Institut für Mathematik, Secr. MA 5-3
Straße des 17. Juni 136
DE-10623 Berlin
Germany
Abstract.
This paper contains an error analysis of two randomized explicit Runge-Kutta schemes for ordinary differential equations (ODEs) with time-irregular coefficient functions. In particular, the methods are applicable to ODEs of Carathéodory type, whose coefficient functions are only integrable with respect to the time variable but are not assumed to be continuous. A further field of application are ODEs with coefficient functions that contain weak singularities with respect to the time variable.
The main result consists of precise bounds for the discretization error with respect to the -norm. In addition, convergence rates are also derived in the almost sure sense. An important ingredient in the analysis are corresponding error bounds for the randomized Riemann sum quadrature rule. The theoretical results are illustrated through a few numerical experiments.
Key words and phrases:
randomized Riemann sum quadrature rule, randomized Euler method, randomized Runge-Kutta method, ordinary differential equations with time-irregular coefficients, Carathéodory differential equations, almost sure convergence, -convergence
2010 Mathematics Subject Classification:
65C05, 65L05, 65L06, 65L20
1. Introduction
In this paper we investigate the numerical solution of ordinary differential equations by randomized one-step methods. More precisely, let and let , , denote the exact solution to the initial value problem
[TABLE]
where is the initial condition. Let us recall that the initial value problem (1) is said to be of Carathéodory type if the measurable coefficient function is (locally) integrable with respect to the temporal variable and continuous with respect to the state variable. If is additionally assumed to fulfill a (local) Lipschitz condition with respect to the state variable, then it is well-known that the initial value problem (1) admits a unique (local) solution . Recall that a measurable mapping is called a (global) solution to (1) if is absolutely continuous and satisfies
[TABLE]
for any . In particular, Equation (2) comes implicitly with the assumption that the mapping is integrable. For instance, we refer to [17, Chap. I, Thm 5.3].
Our motivation for studying Carathéodory type initial value problems stems from the fact that certain stochastic differential equations [26, 28] or rough differential equations [11] that are driven by an additive noise can be transformed into a problem of the form (1). For instance, let be Lipschitz continuous and let be the solution to a rough differential equation of the form
[TABLE]
where is a non-smooth but integrable perturbation. Then, the mapping given by , , solves an ODE of the form (1) with for each . Depending on the smoothness of the perturbation , the resulting mapping is then often only integrable or Hölder continuous with exponent with respect to the temporal variable .
Due to the low regularity of the coefficient function the numerical approximation of the solution to Carathéodory type differential equations is a challenging task. Indeed, it can be shown that any deterministic numerical one-step method is in general divergent if it only uses finitely many point evaluations of . This is easily seen from a simple adaptation of arguments presented in [27, Kap. 2.3]. We discuss this aspect in more depth in Section 1.1 below.
If the coefficient function enjoys slightly more regularity with respect to the temporal variable, say Hölder continuous with some exponent (compare with Assumption 5.1 further below), then classical deterministic numerical algorithms such as Runge-Kutta methods or linear multi-step methods become applicable and will converge to the exact solution. However, since is not assumed to be differentiable we cannot expect high order of convergence from these schemes. In fact, in [21] it is shown that if then the minimum error of any deterministic method depending only on point evaluations of the coefficient function is of order . Similarly, for arbitrary values of the minimum error among all deterministic algorithms that use at most point evaluations of decays only with order , see [19]. Therefore, especially in the case of small values for , deterministic methods may still be considered as impracticable.
For these reasons it is necessary to extend the class of considered numerical algorithms. For instance, the method could additionally make use of linear functionals of the coefficient function , say integrals, instead of mere point evaluations. However, it is often not clear how to implement these methods if is not (piecewise) continuous. Here, we therefore follow a different path that considers randomized numerical methods. The prototype of this class of numerical algorithms is the classical Monte Carlo method, which converges already under an integrability condition.
In the literature, several randomized numerical methods have been developed for the specific initial value problem (1) under a variety of mild regularity assumptions. For instance, we mention the results in [7, 19, 20, 22, 30, 31] and the references therein. These randomized methods are usually found to be superior over corresponding deterministic methods in the sense that the resulting discretization error decays already with order under the same smoothness assumptions as sketched above. Let us also mention that a further application of randomized methods to initial value problems in Banach spaces is found in [8, 18], while the approximation of stochastic ODEs by a randomized Euler-Maruyama method is considered in [29]. In [6] a related family of quasi-randomized methods is studied.
In this paper we present a precise error analysis for two randomized Runge-Kutta methods that are applicable to the numerical solution of ODEs with time-irregular coefficient functions. The purpose is to prove convergence of the two methods with an order of at least with respect to the -norm under very mild conditions on the coefficient function . Hereby we relax several conditions on often found in the literature. In particular, we do not assume that the coefficient function is (locally) bounded which allows to treat functions with a weak singularity of the form , , , . In addition, we also estimate the order of convergence in the almost sure sense. The precise conditions on the coefficient function are stated in Assumption 4.1 and Assumption 5.1.
We now introduce the two randomized Runge-Kutta methods in more detail. If the reader is not familiar with standard notations and concepts in probability we suggest to first consult Section 2.
Let be a sequence of independent and -distributed random variables on a probability space . Then, for any step size we define to be the integer determined by . Set for every . The first numerical approximation of considered in this paper is determined by setting and by the recursion
[TABLE]
for all . This method is usually termed randomized Euler method and it is a particular case of a Runge-Kutta Monte Carlo method studied in [20, 30, 31]. Let us emphasize that, as it is customary for Monte Carlo methods, the result of the numerical scheme is a discrete time stochastic process defined on the same probability space as the random input .
The second randomized Runge-Kutta method is determined by setting and by the recursion
[TABLE]
for all . This scheme is a member of a family of methods that has been introduced in [7, 19].
The two methods (3) and (4) can indeed be interpreted as randomized Runge-Kutta methods. In fact, in the -th step of (3) and (4) we randomly choose one particular Runge-Kutta method from the families with Butcher tableaux
[TABLE]
respectively, where the value of the parameter is determined by the outcome of the random variable . For more details on Runge-Kutta methods and their Butcher-tableaux [3] we refer to standard references, for example [4, 12, 16].
The remainder of this paper is organized as follows. In Section 2 we introduce our notation and recall some prerequisites from probability that are needed later. In Section 3 we state and prove precise error estimates for the randomized Riemann sum quadrature rule, which are an important ingredient in our error analysis for the randomized Runge-Kutta methods (3) and (4). Randomized quadrature rules are well-known to the literature, see [14, 15]. However, this is apparently the first time they are applied in the error analysis of randomized Runge-Kutta methods.
Section 4 contains the first main result of this paper. Here we prove that the randomized Euler method (3) converges to the exact solution of a Carathéodory type ODE (1) with order with respect to the norm in . See Assumption 4.1 for a precise statement of the conditions on the coefficient function . In addition we also derive the order of convergence in the almost sure sense, hereby generalizing results from [20] to unbounded coefficient functions. Note that the computationally more expensive method (4) does not offer any additional advantages in terms of convergence speed in case of possibly discontinuous coefficient functions. We therefore omit an error analysis in this situation.
In Section 5 we then consider the classical ODE setting with a Hölder continuous coefficient function . We determine the order of convergence of the two numerical methods in dependence of the Hölder exponent and with respect to the -norm. We see that the randomized Runge-Kutta method (4) is superior to the randomized Euler method (3) if . These results generalize the error analysis from [7, 19] to the case . Since they are based on the -convergence result, we believe that our almost sure convergence rates are new to the literature as well. Lastly, we present several numerical experiments in the final section.
1.1. Divergence of deterministic algorithms
As announced in the introduction let us briefly follow a line of arguments from [27, Kap. 2.3]. Our aim is to give a sketch of proof that all deterministic algorithms that only use point evaluations of will in general diverge if applied to Carathéodory type ODEs.
To this end, let , , and and consider the problem (1) with the coefficient function for all and . Clearly, in this case the exact solution satisfies . If we apply an arbitrary but fixed deterministic algorithm for the approximation of with evaluations of it will return an approximation . Let us assume that for the computation of the deterministic algorithm evaluated the coefficient function at the points , , in the extended phase space. For each number define now the set as well as .
Then, we consider a further initial value problem with the same initial condition but with the coefficient function for all . Obviously, the mapping is also measurable and bounded. Since does not depend on the state variable, it also fits into the framework of Carathéodory type ODEs. In fact, the mapping fulfills all conditions of Assumption 4.1 further below. Because the set has Lebesgue measure zero the exact solution satisfies in this case. However, if we now apply the same deterministic algorithm as above, it cannot distinguish between and and it will return the same numerical approximation . Since this is true for any and since the deterministic algorithm will not converge to the exact solution of at least one of the problems.
2. Preliminaries
In this section we collect a few important results and inequalities in particular from probability, which are needed later. But first we fix some notation and terminology that is frequently used throughout this paper.
As usual we denote by the set of all positive integers and , while denotes the set of all real numbers. By we denote the standard norm on the Euclidean space , . Further, for every we denote by the set of all -Hölder continuous mappings . Note that the space becomes a Banach space if endowed with the Hölder norm
[TABLE]
In particular, it then holds true that
[TABLE]
The next inequality is a useful tool to bound the error of a numerical approximation. For a proof and more general variants see for instance Proposition 4.1 in [9].
Lemma 2.1** (Discrete Gronwall’s inequality).**
Consider two nonnegative sequences and which for some given satisfy
[TABLE]
then for all it also holds true that
[TABLE]
For the introduction and the error analysis of Monte Carlo methods, we also require some fundamental concepts from probability and stochastic analysis. For a general introduction readers are referred to standard monographs on this topic, for instance [23, 24, 28]. For the measure theoretical background see [1, 5].
First let us recall that a probability space consists of a measurable space endowed with a finite measure satisfying . The value is interpreted as the probability of the event . A mapping is called a random variable if is -measurable, where denotes the Borel--algebra generated by the set of all open subsets of . More precisely, it holds true that
[TABLE]
for all . Every random variable induces a probability measure on its image space. In fact, the measure given by for all is a probability measure on the measurable space . Usually, is called the distribution of .
In this paper, we frequently encounter a family of -distributed random variables . This means that for each the real-valued mapping is a random variable which is uniformly distributed on the interval with , . In particular, the distribution of is given by \mu_{\tau_{j}}(A)=\frac{1}{(b-a)}\lambda\big{(}A\cap(a,b)\big{)}, where denotes the Lebesgue measure on the real line.
Next, let us recall that a random variable is called integrable if . Then, the expectation of is defined as
[TABLE]
Moreover, we write with if . In addition, the set becomes a Banach space if we identify all random variables which only differ on a set of measure zero (i.e. probability zero) and if we endow with the norm
[TABLE]
This definition coincides with the definition of the standard spaces of -fold Lebesgue-integrable measurable functions. If for some the Chebyshev inequality yields for all
[TABLE]
Further, we say that a family of -valued random variables is independent if for any finite subset and for arbitrary events we have the multiplication rule
[TABLE]
On the level of the distributions of this basically means that the joint distribution of each finite subfamily is equal to the product measure of the single distributions. From this we directly get the multiplication rule for the expectation
[TABLE]
provided is integrable for each .
If we interpret the index as a time parameter, we say that a family of -valued random variables is a discrete time stochastic process. A very important class of stochastic processes are martingales. Without stating a precise definition of martingales it suffices for the understanding of this paper to be aware of the fact that if is an independent family of integrable random variables satisfying for each , then the stochastic process defined by the partial sums
[TABLE]
is a discrete time martingale. This enables us to apply powerful inequalities for martingales, such as the following discrete time version of the Burkholder–Davis–Gundy inequality, see [2].
Theorem 2.2** (Burkholder–Davis–Gundy inequality).**
For each there exist positive constants and such that for every discrete time martingale and for every we have
[TABLE]
where denotes the quadratic variation of up to .
Another well-known lemma considers the limiting behaviour of sequences of sets under probability measure (see Theorem 2.7 in [24]).
Lemma 2.3** (Borel-Cantelli Lemma).**
If and , then , where
[TABLE]
3. Error estimates for randomized Riemann sums
In this section we give precise error estimates for a randomized Riemann sum quadrature rule for integrals whose integrands have various degrees of smoothness. Randomized quadrature rules have been first introduced in [14, 15]. Usually, they consist of a randomized version of classical deterministic quadrature rules and are known to offer advantages if the integrand is not smooth. However, in contrast to most Monte Carlo methods, randomized quadrature rules still suffer from the curse of dimensionality in the same way as their deterministic counter-parts. The main field of application therefore is the numerical approximation of integrals with a non-smooth integrand over a low-dimensional domain. See also [10, Sec. 6.4.5] or [27, Sec. 5.5] for further details.
As in the introduction, for any step size we define to be the integer determined by . Let us recall that for every measurable function with for some the randomized Riemann sum approximation of with step size is given by
[TABLE]
where and is an independent family of -distributed random variables on a probability space .
The first theorem contains an error estimate with respect to the -norm. Further below, we also study the almost sure convergence of .
Theorem 3.1** (-error estimate).**
Let be a measurable mapping satisfying for some . Then, for every and the randomized Riemann sum is an unbiased estimator for the integral , i.e., . Further, for all we have
[TABLE]
In addition, if the mapping is -Hölder continuous for some , then for all we have
[TABLE]
Proof.
First, due to and it follows
[TABLE]
Hence, . After taking the expected value of we get
[TABLE]
Thus, the randomized Riemann sum is an unbiased estimator for . Further, by linearity of the integral we obtain for the error
[TABLE]
Now, as above it follows that each summand is a centered random variable, that is
[TABLE]
for every . Moreover, the summands are mutually independent due to the independence of . In addition, we also obtain from (16) that
[TABLE]
Therefore, is a discrete time -martingale. Thus, we can apply the Burkholder-Davis-Gundy inequality from Theorem 2.2 and obtain
[TABLE]
After inserting the quadratic variation we arrive at
[TABLE]
Now, by an application of the triangle inequality we get
[TABLE]
The first term is then bounded by
[TABLE]
since by Hölder’s inequality.
If we directly obtain the same bound for the second term by making use of (16). If we first apply Hölder’s inequality with exponents and . This yields
[TABLE]
due to (16). Altogether, since and after noting that we derive from (17), (18) and (19) that
[TABLE]
This completes the proof of (14).
Next, if in addition , then we can improve the estimate in (17) by
[TABLE]
Thus, inserting this into (17) gives
[TABLE]
This completes the proof of (15). ∎
Error estimates with respect to the -norm are sometimes unsatisfactory, since they allow for the possibility that single realizations of the randomized Riemann sum may differ significantly from its expected value. But in practice often just one realization of the estimator is computed. To some extent this is justified by the next theorem. This indicates that already on the level of single “typical” realizations of we observe convergence provided the step size is sufficiently small. However, depending on the value of the order of convergence may be significantly reduced.
Theorem 3.2** (Almost sure convergence).**
Let be a given measurable mapping with for some . Let be an arbitrary sequence of step sizes with . Then, there exist a nonnegative random variable and a measurable set with such that for all and we have
[TABLE]
Moreover, if in addition for some , then for every there exist a nonnegative random variable and a measurable set with such that for all and we have
[TABLE]
For the proof we need the following result, which is a simple consequence of the Borel-Cantelli lemma. It is a version of [25, Lemma 2.1], which in turn is based on a technique developed in [13].
Lemma 3.3**.**
Let and be given. Consider an arbitrary sequence of step sizes with . Then, for every sequence satisfying
[TABLE]
there exist a nonnegative random variable and a measurable set with such that for every and it holds true that
[TABLE]
Proof.
For each consider the event
[TABLE]
Then, by the Chebyshev inequality (11) it holds true that
[TABLE]
for all . Consequently,
[TABLE]
due to our assumptions on and . Thus, the Borel-Cantelli lemma (see Lemma 2.3) yields . Since
[TABLE]
the assertion follows with being the complement of . Finally, the random variable is defined by
[TABLE]
and for all . ∎
Remark 3.4**.**
The result of Lemma 3.3 can equivalently be reformulated as follows: Under the same assumptions there exist a measurable set with and a nonnegative random variable such that for all and we have
[TABLE]
Hence, it is possible to relax the -dependent step size restriction in Lemma 3.3 in form of the random variable on the cost of introducing an -dependent error constant . For further details we refer to the proof of [25, Lemma 2.1].
The proof of Theorem 3.2 is now a simple consequence of Theorem 3.1 and Lemma 3.3:
Proof of Theorem 3.2.
First, we assume that for some . Let be an arbitrary sequence of step sizes with . Then define
[TABLE]
Clearly, for each . In particular, from (14) it follows that
[TABLE]
Thus, since the conditions of Lemma 3.3 are fulfilled with and assertion (20) follows directly.
Next, if we additionally assume that for some then we immediately have for every . Let be arbitrary. Choose a value for such that . Then, if we define as above we obtain from (15) that
[TABLE]
for every . Thus, a further application of Lemma 3.3 with yields
[TABLE]
for all with probability one. ∎
4. Numerical approximation of Carathéodory ODEs
In this section we investigate the numerical approximation of the exact solution to the Carathéodory type ordinary differential equation (1). In particular, we derive the order of convergence of the randomized Euler method (3) with respect to the norm in . We also state the order of convergence in the almost sure sense. Throughout this section, we shall allow the following assumptions on the coefficient function .
Assumption 4.1**.**
The coefficient function is assumed to be measurable. Further, there exist and a measurable mapping with such that
[TABLE]
for almost all and . In addition, there is a measurable mapping with such that
[TABLE]
for almost all .
Let us stress, that the mapping is not necessarily continuous with respect to the temporal variable . In addition, the mappings and are not assumed to be bounded, in contrast to other results found in the literature [6, 20, 30, 31]. Moreover, from (22) and (23) we directly deduce the linear growth condition
[TABLE]
for almost all and . Here, is the -integrable mapping determined by , . Assumption 4.1 is more than sufficient to ensure the existence of a unique solution to the initial value problem (1), see [17, Chap. I, Thm 5.3].
In the following proposition we collect a few properties of the solution to (1).
Proposition 4.2**.**
Let Assumption 4.1 be fulfilled with . Then, the solution to the initial value problem (1) satisfies
[TABLE]
Moreover, if then for any it holds true that
[TABLE]
In particular, is Hölder continuous with exponent .
Proof.
Let be the solution to (1). Then, from (2) and (24) we get that
[TABLE]
Then, an application of Gronwall’s inequality (see e.g. [17, Chap. I, Cor. 6.6]) yields the assertion (25).
Next, assume that and let be arbitrary. Then, from (2) and (24) we further deduce that
[TABLE]
Since is bounded and since the mapping is -fold integrable we obtain from the Hölder inequality with exponents and that
[TABLE]
Due to this proves the asserted Hölder continuity of if . The case is treated similarly. ∎
Now we are well prepared to state the main result of this section. The following theorem provides an error estimate of the randomized Euler method (3) under Assumption 4.1 with respect to the norm in . We give an explicit expression for the error constant further below.
Theorem 4.3** (-error estimate).**
Let Assumption 4.1 be fulfilled with . Let denote the exact solution to (1). For given let denote the numerical approximation determined by (3) with initial condition . Then, there exists , independent of , such that
[TABLE]
Proof.
Let and be arbitrary. Since and by using a telescopic sum argument as well as (2) and (3) and we get
[TABLE]
In order to simplify the notation we write . Note that the family of random variables is independent and is uniformly distributed on the interval for each . Then, after adding and subtracting several terms we have to estimate the following three sums
[TABLE]
First, we give an estimate of the term . To this end we apply (22) and arrive at
[TABLE]
Observe that this inequality is only valid in the almost sure sense, since (22) holds for almost all . However, this is sufficient, since the expected value will eventually be applied. Therefore, after taking the Euclidean norm and the maximum over the time levels in (27) we obtain
[TABLE]
almost surely for every . Next, we apply the -th power of the -norm to both sides of the inequality. From the fact that for all we then get
[TABLE]
The last term is further estimated by Hölder’s inequality as follows
[TABLE]
For the next step, first take note of
[TABLE]
since . Moreover, we observe that , and therefore also , is independent of the errors at earlier time levels. Thus, from (12) we obtain
[TABLE]
Inserting this into (28) yields
[TABLE]
Therefore, an application of Lemma 2.1 results in
[TABLE]
It remains to give estimates for the terms and with respect to the -norm. For this we observe that the sum is the error of a randomized Riemann sum approximation. Since by (24)
[TABLE]
Theorem 3.1 is applicable and we deduce from (14) that
[TABLE]
Regarding the estimate of we make use of (22) and the -Hölder continuity of from (26). Then we obtain
[TABLE]
where, as already noted above, this inequality is only valid in the almost sure sense. Next, by an application of Hölder’s inequality it holds true that
[TABLE]
Together with (29) we conclude from (30) that
[TABLE]
This completes the proof. ∎
Remark 4.4**.**
Let us mention, that the proof of Theorem 4.3 admits an explicit expression of the error constant , namely
[TABLE]
One could further estimate the supremum of by (25).
We observe that the error constant grows at least exponentially with and . This indicates that the numerical method requires very small values for the step size if applied to initial value problems on large time intervals or with huge Lipschitz bounds .
In the same way as in Theorem 3.2 we also have a result on the almost sure convergence of the randomized Euler method (3). Compare also with [20, Theorem 2], if the coefficient function is additionally assumed to be locally bounded.
Theorem 4.5** (Almost sure convergence).**
Let Assumption 4.1 be fulfilled with and let denote the exact solution to (1). For a given sequence of step sizes with let denote the numerical approximation determined by (3) with initial condition and step size , . Then, there exist a random variable and a measurable set with such that for every and it holds true that
[TABLE]
Since the proof of Theorem 4.5 follows from the same steps as the proof of the first part of Theorem 3.2, it is omitted.
5. Randomized Runge-Kutta methods for ODEs
In this section, we consider initial value problems (1) whose coefficient function enjoys slightly more regularity with respect to the temporal variable than those considered in Section 4. However, we still do not assume any differentiability of .
Assumption 5.1**.**
The coefficient function is assumed to be continuous. Further, there exists such that
[TABLE]
for all and . In addition, there exist and with
[TABLE]
for all and .
As a direct consequence of Assumption 5.1 we take note of the linear growth bound
[TABLE]
with . Clearly, under Assumption 5.1 the initial value problem (1) is a classical ordinary differential equation. Therefore, there exists a (global) unique solution . In particular, the solution is continuously differentiable with
[TABLE]
for all , and
[TABLE]
for all .
The following theorem contains the error estimates for the randomized Euler method (3) and the randomized Runge-Kutta method (4) under Assumption 5.1. We provide explicit expressions for the error constants further below.
Theorem 5.2** (-error estimate).**
Let Assumption 5.1 be fulfilled with . Let be the exact solution to (1). For given step size we denote by and the sequences generated by the numerical methods (3) and (4), respectively. Then, for every there exists a constant , independent of , such that
[TABLE]
Moreover, for every there exists a constant , independent of , such that
[TABLE]
Proof.
Let be an arbitrary step size. As in the proof of Theorem 4.3 we write for every .
We first prove the error estimate (35) for the randomized Euler method (3). For this let be arbitrary. As in the proof of Theorem 4.3 in (27) we split the error into three sums of the form
[TABLE]
Due to (31) we can estimate the term by
[TABLE]
Thus, applying the Euclidean norm and then taking the maximum over all time steps in (37) yields
[TABLE]
In contrast to the situation in Theorem 4.3 the Lipschitz constant is now deterministic. Thus, after applying the -norm to this inequality we obtain
[TABLE]
Then, an application of Gronwall’s lemma (see Lemma 2.1) yields
[TABLE]
and it remains to estimate the norms of the sums and .
Regarding the term it follows from (31) and (32) that
[TABLE]
for all . Hence, due to (34) we see that the mapping is -Hölder continuous. In particular,
[TABLE]
Therefore, we can apply the estimate (15) from Theorem 3.1 to . This gives
[TABLE]
Finally, the estimate of follows the same lines as in (30) but we additionally make use of the Lipschitz continuity (34) of . Then we get
[TABLE]
This completes the proof of (35).
Let us now turn to the proof of the error estimate (36) for the randomized Runge-Kutta method (4). This time we apply a slightly modified version of (37):
[TABLE]
Note that actually for all . Thus we directly obtain
[TABLE]
Moreover, due to (38) the estimate of reads as follows
[TABLE]
For the last step recall the definition of from (4). Thus, by using (31) we get
[TABLE]
Consequently, since we have
[TABLE]
for every . Then, the error estimate (36) follows from a further application of Lemma 2.1 as demonstrated above. ∎
Remark 5.3**.**
As in the previous section, the proof of Theorem 5.2 also admits an explicit expressions of the error constants and , namely
[TABLE]
and
[TABLE]
where the supremum of can be further estimated by (33).
Again, we take note of the fact that the error constants and both grow at least exponentially with the final time and the Lipschitz constant . Both methods are therefore not necessarily well-suited for long-time simulations or if the ODE is stiff.
We close this section with the following result on the almost sure convergence of the randomized Euler method (3) and the randomized Runge-Kutta method (4).
Theorem 5.4** (Almost sure convergence).**
Let Assumption 5.1 be fulfilled for some and let denote the exact solution to (1). For a given sequence of step sizes with let and denote the numerical approximations determined by (3) and (4) with initial condition and step size , , respectively. Then, for every there exist a random variable and a measurable set with such that for every and we have
[TABLE]
In addition, for every there exist a random variable and a measurable set with such that for every and we have
[TABLE]
The proof of Theorem 5.4 is similar to the proof of the second part of Theorem 3.2 and is therefore omitted.
6. Numerical Examples
In this section we illustrate our theoretical results through a few numerical experiments.
6.1. State-independent case with weak singularities
Consider the following ODE with a state-independent coefficient function
[TABLE]
with varying values for the parameter . In dependence of we have different regularity of the coefficient function in terms of the -spaces. It is not hard to see that the exact solution at the final time is given by . In the experiment, we take to be and , and simulate the solutions via scheme (3), which in fact simplifies to the randomized Riemann sum (13). We approximate the error of the quadrature rule with respect to the -norm at terminal time by a Monte Carlo simulation with independent samples. The result is shown in Figure 1.
According to Theorem 3.1, the convergence order depends on the integrability of the function . In Figure 1, the root-mean-squared errors were plotted versus the 2-logarithm of the underlying step size, i.e., the number on the x-axis indicates the step size . When , the observed order of convergence is as expected around , since is only integrable. When increasing the value for from to , the regularity of is raised, which in turn gives an increase in the observed order of convergence from to .
6.2. convergence for an ODE with jumps
Consider the following ODE with a non-continuous coefficient function:
[TABLE]
where and
[TABLE]
Here we have three jump points at , and . It is easy to see that the exact solution at terminal time equals . We perform the numerical experiment with the classical Euler scheme, the randomized Euler scheme (3) and the randomized Runge-Kutta scheme (4), respectively. A comparison of the -errors at the final time is shown in Figure 2, where the errors have been approximated by a Monte Carlo simulation with independent samples for the same step sizes as in Section 6.1.
Note that by our choice of the step sizes the classical Euler scheme always evaluates the mapping at the three jump points. But due to the definition of the sign function, one of the summands in the definition of is always equal to zero at the jump points, causing to be neither left continuous nor right continuous at these points. For instance, we have , while and for all . This causes an additional error of order in each step of the classical Euler scheme, where a jump point of is involved.
On the other hand, this type of error is avoided by both randomized numerical methods, since the random variable will prevent the evaluation of at jump points almost surely. This explains why both randomized methods perform better than the classical method if we compare the -errors for the same step sizes. Further, although the coefficient function is not continuous with respect to the time variable, we observe an experimental convergence of order for the randomized Runge-Kutta method (4). This is well in agreement with the maximum order of convergence that has been proven for that method in Theorem 5.2.
Next, let us briefly compare the computational efficiency of the three methods under consideration. By also taking the necessity of drawing a random number at each step into consideration, the two randomized Runge-Kutta methods (3) and (4) are of course computationally more expensive than the classical Euler method. For this reason we compare in Figure 3 the average CPU times of these three schemes versus their accuracy. From this figure we can see that the classical Euler method is as expected the fastest method and, since it still converges with the same experimental order as the randomized Euler method (3), it is in total more efficient than its randomized counter-part. On the other hand, the computationally even more expensive randomized Runge-Kutta method (4) quickly offsets its higher cost with its higher order of convergence.
Acknowledgement
The authors like to thank Wolf-Jürgen Beyn, Monika Eisenmann, Mihály Kovács, and Stig Larsson for inspiring discussions and helpful comments. This research was carried out in the framework of Matheon supported by Einstein Foundation Berlin. The authors also gratefully acknowledge financial support by the German Research Foundation through the research unit FOR 2402 – Rough paths, stochastic partial differential equations and related topics – at TU Berlin.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. Bauer. Measure and integration theory , volume 26 of de Gruyter Studies in Mathematics . Walter de Gruyter & Co., Berlin, 2001. Translated from the German by Robert B. Burckel.
- 2[2] D. L. Burkholder. Martingale transforms. Ann. Math. Statist. , 37:1494–1504, 1966.
- 3[3] J. C. Butcher. On Runge-Kutta processes of high order. J. Austral. Math. Soc. , 4:179–194, 1964.
- 4[4] J. C. Butcher. Numerical Methods for Ordinary Differential Equations . John Wiley & Sons, Ltd., Chichester, second edition, 2008.
- 5[5] D. L. Cohn. Measure theory . Birkhäuser Advanced Texts: Basler Lehrbücher. Birkhäuser/Springer, New York, second edition, 2013.
- 6[6] I. Coulibaly and C. Lécot. A quasi-randomized Runge-Kutta method. Math. Comp. , 68(226):651–659, 1999.
- 7[7] T. Daun. On the randomized solution of initial value problems. J. Complexity , 27(3-4):300–311, 2011.
- 8[8] T. Daun and S. Heinrich. Complexity of parametric initial value problems in Banach spaces. J. Complexity , 30(4):392–429, 2014.
