On the Numerical Approximation of the Karhunen-Lo\`{e}ve Expansion for Lognormal Random Fields
Michael Griebel, Guanglian Li

TL;DR
This paper provides a detailed error analysis for the numerical approximation of the Karhunen-Loève expansion applied to lognormal random fields, crucial for PDEs with random coefficients, balancing truncation, sampling, and discretization errors.
Contribution
It introduces new error estimates that optimize the approximation of the KL expansion for lognormal fields, integrating truncation, sampling, and numerical errors.
Findings
Derived error bounds for KL expansion approximation
Quantified the impact of truncation, sampling, and discretization errors
Validated theoretical results with numerical experiments in multiple stochastic dimensions
Abstract
The Karhunen-Lo\`{e}ve (KL) expansion is a popular method for approximating random fields by transforming an infinite-dimensional stochastic domain into a finite-dimensional parameter space. Its numerical approximation is of central importance to the study of PDEs with random coefficients. In this work, we analyze the approximation error of the Karhunen-Lo\`eve expansion for lognormal random fields. We derive error estimates that allow the optimal balancing of the truncation error of the expansion, the Quasi Monte-Carlo error for sampling in the stochastic domain and the numerical approximation error in the physical domain. The estimate is given in the number of terms maintained in the KL expansion, in the number of sampling points , and in the discretization mesh size in the physical domain employed in the numerical solution of the eigenvalue problems during the expansion.…
| 2 | 4 | 8 | |
|---|---|---|---|
| 1/16 | 4.1217e-2 | 1.1933e-2 | 3.6194e-3 |
| 1/64 | 4.1216e-2 | 1.1931e-2 | 3.6060e-3 |
| 1/128 | 4.1216e-2 | 1.1931e-2 | 3.6059e-3 |
| 1/256 | 4.1216e-2 | 1.1931e-2 | 3.6059e-3 |
| 2 | 4 | 8 | |
|---|---|---|---|
| 1/16 | 6.0723e-3 | 1.8676e-4 | 1.7078e-4 |
| 1/64 | 7.2336e-3 | 1.0267e-4 | 1.0837e-5 |
| 1/128 | 6.2803e-3 | 7.9009e-5 | 2.7146e-6 |
| 1/256 | 6.1652e-3 | 6.6978e-5 | 2.7102e-6 |
| 2 | 4 | 8 | |
|---|---|---|---|
| 1/16 | 2.8446e-3 | 2.8332e-3 | 2.8335e-3 |
| 1/64 | 3.7890e-4 | 2.9770e-4 | 2.9743e-4 |
| 1/128 | 2.8832e-4 | 7.8182e-5 | 7.8268e-5 |
| 1/256 | 2.2743e-4 | 1.9821e-5 | 1.9772e-5 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Mathematical Approximation and Integration · Numerical Methods and Algorithms
On the Numerical Approximation of the Karhunen-Loève Expansion for Lognormal Random Fields
Michael Griebel and Guanglian Li Institut für Numerische Simulation, Universität Bonn, Endenicher Allee 19B, 53115 Bonn, Germany. E-mail: [email protected] Fraunhofer SCAI, Schloss Birlinghoven, 53754 Sankt Augustin, GermanyBernoulli Institute, University of Groningen, Nijenborgh 9, 9747AG, Groningen, The Netherlands. E-mail: [email protected], [email protected].
Abstract
The Karhunen-Loève (KL) expansion is a popular method for approximating random fields by transforming an infinite-dimensional stochastic domain into a finite-dimensional parameter space. Its numerical approximation is of central importance to the study of PDEs with random coefficients. In this work, we analyze the approximation error of the Karhunen-Loève expansion for lognormal random fields. We derive error estimates that allow the optimal balancing of the truncation error of the expansion, the Quasi Monte-Carlo error for sampling in the stochastic domain and the numerical approximation error in the physical domain. The estimate is given in the number of terms maintained in the KL expansion, in the number of sampling points , and in the discretization mesh size in the physical domain employed in the numerical solution of the eigenvalue problems during the expansion. The result is used to quantify the error in PDEs with random coefficients. We complete the theoretical analysis with numerical experiments in one and multiple stochastic dimensions.
Keywords: Karhunen-Loève expansion, eigenvalue decay, approximation of bivariate functions, error estimates, lognormal random field.
1 Introduction
Partial differential equations (PDEs) with random coefficient have been widely employed to describe applications that are affected by a certain amount of uncertainty arising from imperfect/insufficient information about the problem, e.g., in the input data. The range of applications is broad and diverse and includes, e.g., oil field modelling, quantum mechanics and finance [9, 16, 23]. The dimension of the random coefficient can be huge or even infinite, which poses enormous computational challenge. To reduce its dimensionality, one can parameterize the random coefficient by means of the Karhunen-Loève (KL) expansion or the polynomial chaos (PC) expansion [15, 22], which greatly facilitates the subsequent numerical treatment, e.g., by the stochastic Galerkin method or the stochastic collocation method. Alternatively, one may expand the random field with respect to the hierarchical Faber basis or some wavelet type basis; see [5, 12] for details. In this paper, we will focus on the KL expansion, which is known to be optimal in the sense of the mean square error.
To formulate the problem, let be an open bounded domain with a strong local Lipschitz boundary and let be a complete separable probability space with -field and probability measure . We will denote for notational simplicity. Now, we consider a stochastic field with its logarithm being a centered Gaussian field. The lognormal random field is frequently used in stochastic PDEs as a random diffusion coefficient.
In practical computation, its numerical approximation usually proceeds in three steps. In the first step, the centered random field is approximated by its -term KL expansion for some . The truncation error relies on the regularity of the bivariate function in the physical variable , see [18] for details. In the second step, the covariance function of the centered Gaussian random field is approximated via a sampling method. By its very definition, the covariance function involves an integral over the stochastic domain , which is often of very high dimensional. For its approximation, various quadrature-type sampling methods, e.g., Monte-Carlo methods, (Quasi) Monte-Carlo (QMC) methods and sparse grids [10, 13, 14] can be applied, say with sampling points. These methods essentially require boundedness of the variation, the first or higher mixed derivatives of for fixed and then yield a corresponding order of convergence. In this paper, we focus on the QMC method, which has only a low regularity requirement on , namely that the first mixed derivative of is bounded. The outcome of this second step is a function that approximates the covariance function . The associated self-adjoint operators are denoted as and , respectively. Note that is a finite rank operator with rank not larger than . We shall prove in Proposition 3.1 that only the first terms in the KL expansion of are relevant to approximate the spectrum of . Here, the nonnegative parameter denotes the regularity of the bivariate function in the physical variable . This result implies that the number of KL truncation terms satisfies . The third step is to approximate the eigenvalue problem of the self-adjoint operator by means of a conforming Galerkin finite element method (FEM) over a regular mesh with a mesh size . Now, to estimate the error between and its numerical approximation with being the number of truncation terms, being the number of sampling points and being the mesh size, the eigenvalue approximation error is derived. Moreover, to balance the decay of the eigenvalues of the covariance kernel and the numerical approximation error, we need to take in order to ensure convergence in the first place. Otherwise, no convergence rate is guaranteed when solving the eigenvalue problems numerically.
The main contribution of this work is threefold. First, we present the spectral analysis of the finite rank operator , which allows us to specify the number of truncation terms. Second, we recall the error rate of QMC quadrature and provide an error estimate of the numerical approximation to the eigenvalue problem associated with the operator in terms of mesh size . Third, we derive an error estimate of both, and in various norms. Our final estimates of and with are presented in Theorem 5.6. For example, we obtain the bound
[TABLE]
Moreover, we discuss the example of an elliptic PDE with lognormal random diffusion coefficient. There, using our previous results on the approximation of the lognormal random field, we can deduce bounds of the error between the solution of the PDE and its induced approximation .
The remainder of the paper is organized as follows. We formulate in Section 2 the approximation of by the KL expansion, explain the general sampling method and discuss the Galerkin approximation. In Section 3, we analyze the Quasi Monte-Carlo method to approximate the covariance kernel , and derive a spectral estimate for and by means of the maximin principle and an eigenvalue decay estimate. In Section 4, we discuss the conforming Galerkin approximation of the eigenvalue problems of and derive spectral estimates. The main error estimates between and in the -norm and the -norm with , respectively, are established in Section 5. Furthermore, we present an application of our results for an elliptic operator with lognormal random coefficients in Section 6. Two numerical tests are provided in Section 7 to verify our findings. Finally, we give some concluding remarks in Section 8.
2 Preliminaries
This section collects elementary facts on the KL expansion and its numerical approximation. To this end, the overall numerical approximation error is divided into three parts: the truncation error, the sampling error and the resulting approximation error of the eigenvalue problems.
We start with some notation. Let two Banach spaces and be given. Then, stands for the Banach space composed of all continuous linear operators from to and stands for . The set of nonnegative integers is denoted by . For any index , is the sum of all components. The letters , and are reserved for the truncation number of the KL modes, the number of sampling points and the mesh size. We write if for some absolute constant which is independent of , and , and we likewise write . Moreover, for any , , we follow [1] and define the Sobolev space by
[TABLE]
It is equipped with the norm
[TABLE]
The space is the closure of in . Its dual space is , with . Also we use for . denotes the inner product in .
2.1 Karhunen-Loève expansion: continuous level
In this work, we consider a stochastic field with its logarithm being a centered Gaussian field, i.e.,
[TABLE]
where is the probability measure on introduced in Section 1. We denote the associated integral operator by
[TABLE]
whereas its adjoint operator is defined by
[TABLE]
Let be defined by . Then is a nonnegative self-adjoint Hilbert-Schmidt operator with kernel given by
[TABLE]
This is just the covariance function of the stochastic process . Moreover, for any , we have
[TABLE]
The standard spectral theory for compact operators [25] implies that the operator has at most countably many discrete eigenvalues, with zero being the only accumulation point, and each non-zero eigenvalue has only finite multiplicity. Let be the sequence of eigenvalues (with multiplicity counted) associated to , which are ordered nonincreasingly, and let be the corresponding eigenfunctions that are orthonormal in . Furthermore, for any , define
[TABLE]
One can verify that the sequence is uncorrelated and orthonormal in , and therefore, are i.i.d normal random functions.
Note that the sequence can be characterized by the so-called approximation numbers (cf. [21, Section 2.3.1]). They are defined by
[TABLE]
where denotes the set of the finite rank operators on . This equivalency is frequently employed to estimate eigenvalues by constructing finite rank approximation operators to .
The KL expansion of the bivariate function then refers to the expression
[TABLE]
where the series converges in .
2.2 Karhunen-Loève expansion: -term truncation
Now, we will truncate the KL expansion and discuss the resulting error. The studies on the -term KL approximation to random fields are extensive. In [22], the authors derived the eigenvalue decay rates for random fields with their corresponding covariance kernels possessing certain regularity and considered the generalized fast multipole methods to solve the associated eigenvalue problems. Robust eigenvalue computation for smooth covariance kernels was studied in [24]. A comparison of -term KL truncation and the sparse grids approximation was given in [17].
The result of this section is based on our recent paper [18], which proves a sharp eigenvalue decay rate under a mild assumption on the regularity of the bivariate function in the physical domain. To this end, we make the following assumption.
Assumption 2.1** (Regularity of ).**
There exists some such that .
Under Assumption 2.1, by the definition of the kernel , we have .
The following eigenvalue decay estimate [18, Theorems 3.2, 3.3 and 3.4] will be used repeatedly.
Theorem 2.1**.**
Let Assumption 2.1 hold. Then, for any sufficiently large, there holds
[TABLE]
with the constant . Here, denotes the embedding constant between the Lorentz sequence spaces and is a constant depending only on and .
The next lemma gives the regularity of the eigenfunctions .
Lemma 2.1** (Regularity of the eigenfunctions ).**
Let Assumption 2.1 be valid. Then for all , there holds
[TABLE]
Here, denotes a positive constant depending only on , and .
Proof.
We will only prove the result for . The case for can be obtained by the interpolation method.
Let with . The combination of Assumption 2.1 and decomposition (2.5) and an application of Lebesgue’s dominated convergence theorem lead to the expansion
[TABLE]
After taking the squared -norm on both sides, we arrive at
[TABLE]
Now we sum over all with , and obtain by the definition of the Sobolev space that
[TABLE]
At last, an application of Theorem 2.1 gives
[TABLE]
for any positive parameter . With we obtain from the relation (2.7) that, when is sufficiently large, there holds
[TABLE]
This verifies (2.6) for . By noting that , an application of [2, Theorem 3.3] yields then the desired estimate. ∎
It is worth to emphasize the optimality of the eigenfunctions in the sense that the mean-square error resulting from a finite-rank approximation of is minimized [15]. Thus, the eigenfunctions indeed minimize the truncation error in the -sense, i.e.
[TABLE]
2.3 Sampling estimate of the continuous Karhunen-Loève approximation
Clearly, any numerical computation of the covariance function by a conventional quadrature method quickly becomes expensive and impractical when the dimensionality of the random domain is large. This is due to the curse of dimensionality. To this end, depending on the regularity prerequisites with respect to the stochastic variable , the Monte Carlo method, the Quasi-Monte Carlo (QMC) methods or the sparse grid method may be employed in approximating . In this paper, we will focus on QMC.
Anyway, a numerical quadrature gives , which is defined by
[TABLE]
Here, denotes the number of quadrature points and and are the corresponding quadrature points and weights. Clearly, and .
Analogously, we denote by the nonnegative self-adjoint Hilbert-Schmidt operator with kernel . The operator is of rank no greater than and hence compact. Analogously, we can define in nondecreasing order its eigenvalues and its normalized eigenfunctions in as and , respectively.
Note at this point the following: If we are interested in a specific approximate realization of for some , then we have to consider the function defined by
[TABLE]
To estimate the error between and , we can apply finite elements over as introduced in Subsection 2.4. This error depends on the regularity of for given . On the other hand, if we are only interested in certain statistical quantities of the Gaussian random field , then there is no need to calculate , and we can take directly i.i.d normal random functions, e.g., . This is indeed the situation many articles are concerned with, see e.g., [5, 11, 19].
2.4 Galerkin discretization of the sampled, truncated continuous Karhunen-Loève approximation
Now we describe the conforming Galerkin approximation of the eigenvalue problem on . To this end, let be a regular quasi-uniform triangulation over the physical domain with a maximal mesh size and let . The associated finite element space is defined by
[TABLE]
Let be the dimension of . We then have Q=\mathcal{O}\Big{(}(\frac{h}{k})^{d}\Big{)}. The -projection has the approximation property [8, Theorem 4.4.20]
[TABLE]
Here, the positive constant depends only on the regularity parameter of and is independent of the mesh size .
The conforming Galerkin approximation of the eigenvalue problem of is to find such that
[TABLE]
This is equivalent to the eigenvalue problem of the finite-rank operator on defined by
[TABLE]
Let be the corresponding eigenpairs with eigenvalues in nonincreasing order and eigenvectors orthonormal in . Then the -term truncated KL expansion, denoted by , is defined by
[TABLE]
Note at this point the following: Again, if we are mainly concerned with the approximation to a specific bivariate function via the expression (2.14), then we have to replace with its numerical approximation
[TABLE]
Here, represents the quadrature points on each finite element and denotes the Legendre polynomials of order . Note that is the numerical approximation by interpolation with sampling points to defined as
[TABLE]
In view of the KL expansion (2.5) and the -term truncation estimate (2.14), an application of the triangle inequality yields
[TABLE]
A main goal of this paper is to derive a sharp estimate of in (2.17). To this end, it suffices to analyze the three terms on the right hand side of (2.17). Here, the first term represents the truncation error that can be estimated by Theorem 2.1, the second term is due to sampling of the KL approximation and the third term is induced by the Galerkin approximation error.
3 QMC method approximation error
In this section, we apply the QMC method based on the randomly shifted lattice rule and derive the sampling error corresponding to the second term in (2.17). To this end, we map the quadrature points to by the inverse of the cumulative distribution function of the standard normal distribution. The cumulative distribution function is defined by
[TABLE]
and its inverse is Upon changing variables, we obtain
[TABLE]
Then by taking and for in (2.9), we get an approximation to the covariance function , which is denoted by .
To this end, we introduce the construction of the quadrature points , which is based on the fast CBC construction of randomly shifted lattice rules in the unanchored space [20]. The unanchored space is defined by
[TABLE]
Here, the positive function controls the boundary behavior of the functions in . The collection of parameters for all controls the relative importance of various groups of variables, and
[TABLE]
Note that we will choose the weight function and the weight parameters , such that the bivariate function belongs to for all .
We apply the CBC approach [20, Algorithm 6] to derive the generating vector with the number of sampling points being . To this end, let the shift be an i.i.d uniformly distributed vector. Then we obtain the randomly shifted (rank-1) lattice rule by
[TABLE]
Now, in (2.9) can be approximated by taking for .
The error between and is measured by the shifted-averaged worse-case error defined by
[TABLE]
Thus, using the CBC Algorithm to calculate defined in (2.9) yields a shifted-averaged worse-case error of for any with the construction cost of . Therefore, we start with the following setting.
Assumption 3.1** (Assumption on the sampling error).**
For some , there holds
[TABLE]
To approximate a bivariate function or a specific realization of the random field , we have introduced in the last section the quantities , cf. (2.10), which are not orthonormal in . Nevertheless, they are very close to an orthonormal basis when the approximation error between and is very small.
Lemma 3.1** (Near orthonormality of ).**
Let be defined as in (2.10). There holds
[TABLE]
Proof.
This is a direct consequence of the definition (2.10) and the eigenvalue problem for . ∎
Next, we give some estimates on the finite-rank approximation and its spectrum.
Proposition 3.1** (Spectral estimate for ).**
Let Assumption 2.1 hold, let be sufficiently large and let . Then with
[TABLE]
For , there holds
[TABLE]
Furthermore, let be an eigenvalue of with multiplicity for and for some . Assume that for sufficiently large , there holds
[TABLE]
Then, for , there holds
[TABLE]
In addition,
[TABLE]
Proof.
We can obtain from the definition (2.9) and the triangle inequality
[TABLE]
Then Assumption 2.1 leads to (3.5). The relation (3.6) is derived from the definition. To prove (3.8), fix and let be a -dimension subspace. Since and are nonnegative and self-adjoint, we obtain
[TABLE]
Next we estimate the lower bound of the minimum on the right hand side of (3.10). To this end, note that any admits the expression for some . Let , then . For any , plugging in the expression for and applying (3.4) lead to
[TABLE]
The lower bound of the minimum can now be estimated using Lagrange multipliers. To this end, let and define
[TABLE]
Let be the optimal point to the unconstrained minimization problem associated to . Then have the same sign by the definition of . Let for all . The optimality conditions read for , and . This immediately implies
[TABLE]
The second relation in (3.12) implies for all that there holds
[TABLE]
Summing over yields
[TABLE]
Recall that for all . Together with (3.12), this implies Now combining (3.7) and (3.13) results in and therefore, for some positive constant independent of . This, together with (3.12), (3.11) and (3.10), gives Analogously, by changing the roles of and , we can show
[TABLE]
Consequently, (3.8) follows by Theorem 2.1.
It remains to prove (3.9). We only present the proof for . For , (3.9) can be shown similarly to [4, Theorem 9.1]. Since the whole space is orthogonally decomposed as the direct sum of the range of and its kernel, can be split into
[TABLE]
for some satisfying . Recall that . This leads to
[TABLE]
which, combined with (3.4) and (3.14), gives
[TABLE]
By redefining to be \big{(}\sum\limits_{i=1}^{q_{1}}c_{i}^{2}\big{)}^{-\frac{1}{2}}\sum\limits_{i=1}^{q_{1}}c_{i}\phi_{i}, (3.9) is proved due to the spectral gap assumption (3.7). ∎
Remark 3.1**.**
If the number of sampling points is not sufficiently large then the spectral gap assumption (3.7) is not fulfilled. Then one can show that for , there holds
[TABLE]
Thus, to make an accurate approximation to for , we have to impose a much more stringent restriction on . In this manner, we can show that for .
Finally, we can give an estimate to the second term in (2.17):
Proposition 3.2** (Root mean square error to the second term in (2.17)).**
Let be sufficiently large and . Furthermore, let (3.7) be satisfied. Then there holds
[TABLE]
Proof.
Employing the triangle inequality yields
[TABLE]
Then the desired result follows from (3.14) and (3.9). ∎
Next we give an estimate on .
Lemma 3.2** (Estimate on ).**
Let be sufficiently large and . Furthermore, let (3.7) be satisfied. Then there holds
[TABLE]
Proof.
By (2.3), we obtain
[TABLE]
Therefore, taking the -norm on both sides yields
[TABLE]
where, in the last inequality, we have applied (3.9) and the inequality for all . ∎
In order to numerically approximate each realization of the Gaussian random field for given , we need the following result.
Proposition 3.3**.**
Let be sufficiently large and . Furthermore, let (3.7) be satisfied. Then
[TABLE]
Proof.
The triangle inequality yields
[TABLE]
Then the desired result follows from (3.14), (3.9) and Lemma 3.2. ∎
4 Conforming Galerkin approximation estimate
In this section we derive an estimate for the third term in (2.17) by means of the approximation theory of conforming finite element methods. To this end, let
[TABLE]
Then is a self-adjoint operator on and we have the following error representation.
Lemma 4.1**.**
The error operator has the property
[TABLE]
Proof.
For given and since and , we obtain
[TABLE]
∎
A direct consequence of Lemma 4.1, together with the approximation property (2.12) and Proposition 3.1, is the upper bound estimate for the operator norm of
[TABLE]
Finally, we are ready to present the main result in this section.
Proposition 4.1** (Conforming Galerkin approximation estimate).**
Let Assumption 2.1 hold and let be sufficiently large and . Assume that the spectral gap condition (3.7) is valid. Then there are constants , and such that
[TABLE]
Furthermore, the eigenvectors can be selected such that
[TABLE]
Here, the constants and are independent of and and is sufficiently small.
Proof.
The proof follows from [4, Theorem 9.1], where the following identity plays a crucial role. We have
[TABLE]
for all satisfying . This identity can be derived by definition directly. Together with Proposition 3.1 and the estimate (4.2), this completes the proof. ∎
Recall that if we want to approximate a certain realization of the random field for some , then we have to estimate the error between and . To this end, we make the following assumption.
Assumption 4.1**.**
Let and be as defined in (2.15) and (2.16), respectively. Assume that, for some sufficiently large and a sufficiently small , there holds
[TABLE]
Note that Assumption 4.1 requires a certain regularity of the bivariate function over the stochastic domain since the computation of involves the polynomial interpolation of over .
We then get the following result.
Lemma 4.2**.**
Let Assumption 4.1 be valid. Let and be as defined in (2.10) and (2.15), respectively. Then, for some sufficiently large and a sufficiently small , there holds
[TABLE]
Proof.
An application of the triangle inequality leads to
[TABLE]
The second term can be estimated by Assumption 4.1. The definitions (2.3) and (2.16) imply
[TABLE]
Taking the -norm on both sides leads to
[TABLE]
Then the desired result follows from Lemma 3.1 and Proposition 4.1. ∎
5 Main estimates
In this section, we present the main estimate of the error between the lognormal random field and its -term numerical approximation in the -norm and in the -norm for . The overall procedure is as follows: We first derive in Sections 5.1 and 5.2 an estimate on with respect to the -norm and the -norm, cf. Theorems 5.1 and 5.4. Then we establish the final results on in Theorem 5.6 by employing Fernique’s Theorem.
5.1 error estimate
First, we give an estimate for the third term in (2.17).
Proposition 5.1** (Galerkin approximation estimate in (2.17)).**
Let Assumption 2.1 hold, let be sufficiently large and let and . Assume the spectral gap condition (3.7) to be valid. Then there holds
[TABLE]
Proof.
Due to the orthogonality of the basis functions in , an application of the triangle inequality leads to
[TABLE]
Here, we have used the orthogonality of over in the last step. Then, an application of Proposition 4.1 and Theorem 2.1 gives the desired result. ∎
Now, using Theorem 2.1, Propositions 3.2 and 5.1, we are finally ready to present an estimate for (2.17).
Theorem 5.1** (Root mean square error for -term KL expansion).**
Let Assumption 2.1 hold. Let be large, let and . Assume the spectral gap condition (3.7) to be valid. Let be given as in (2.14). Then there holds
[TABLE]
Remark 5.1** (Complexity).**
According to Theorem 5.1, in order to approximate the Gaussian random field by formula (2.14) with root mean square error of for a certain threshold accuracy , we need to take the number of sampling points , the number of truncation terms and the mesh size .
To numerically approximate the realization of for given by the -term truncation formula (2.14), we can replace the i.i.d normal random functions with defined in (2.15). The error in this process can be estimated as follows.
Theorem 5.2** (Root mean square error for -term KL expansion of a bivariate function).**
Let Assumptions 2.1 and 4.1 hold. Let be large, let and . Assume the spectral gap condition (3.7) to be valid. Let
[TABLE]
Then there holds
[TABLE]
Proof.
An application of the triangle inequality together with the KL expansion (2.5) implies
[TABLE]
Now, the first term and the third term above can be bounded by Proposition 3.3 and Theorem 2.1, respectively. We only need to estimate the second term. The triangle inequality gives
[TABLE]
By Lemma 3.1 and (3.4), for all Then, Proposition 4.1, Lemma 4.2 and Theorem 2.1 together show the desired result. ∎
Remark 5.2** (Complexity).**
According to Theorem 5.2, in order to approximate a specific realization of the Gaussian random field by formula (5.1) with root mean square error for a certain threshold accuracy , we need to choose the number of sampling points , the number of truncation terms and the mesh size .
5.2 Uniform error estimate
In order to derive a uniform error estimate of the Gaussian random field , we require a further regularity assumption on to guarantee that . To this end, we make the following assumption.
Assumption 5.1** (Regularity of ).**
Let Assumption 2.1 hold. Furthermore, assume .
Then, the following estimate is valid.
Theorem 5.3** (Uniform estimate on the eigenfunctions).**
Let Assumption 5.1 be satisfied. Then there holds
[TABLE]
Proof.
Due to Assumption 5.1, an application of (2.6) with together with the Sobolev embedding implies the desired result. ∎
Remark 5.3** (Optimality of the uniform estimate (5.2)).**
In [6, Section 3] the uniform estimate of eigenfunctions was studied under certain assumptions on the stationary covariance kernel. There, the authors derived a similar uniform estimate as (5.2) by utilizing essentially the regularity of the Fourier transform of the covariance kernel. They showed the sharpness of their uniform estimate in the case when and for the stationary covariance kernel with its Fourier transform . Then, the uniform estimate of the n-th eigenfunction is , see [7].
Proposition 5.2** (Uniform truncation estimate).**
Let Assumption 5.1 be satisfied. Then, for any , there holds
[TABLE]
Proof.
By the Sobolev embedding theorem, Assumption 5.1 implies . Then one can obtain
[TABLE]
This and Theorem 2.1 yield the desired estimate. ∎
Proposition 5.3**.**
Let Assumption 5.1 be fulfilled. Then and are bounded from to , i.e., and . In addition, it holds
[TABLE]
Furthermore, the eigenfunctions . Let be sufficiently large and . Assume the spectral gap condition (3.7) to be valid. When for some sufficiently small , there holds
[TABLE]
Proof.
The proof of (5.3) and (5.4) follows directly from basic operator theory. The bound (5.5) is a result of (2.13) and (5.4). By the definitions of and , we obtain
[TABLE]
Together with Proposition 4.1 and (5.5), this yields
[TABLE]
Since , the second term can be bounded from above by the third term, and this completes the proof. ∎
Remark 5.4**.**
In a similar manner as in the proof to (5.6), if is sufficiently large and , one can show that
[TABLE]
Combining (5.7) and Theorem 5.3, analogously to Proposition 3.1, we can derive the following estimate.
Proposition 5.4**.**
Let be sufficiently large and . Furthermore, let (3.7) be satisfied. Then there holds
[TABLE]
Proof.
This result follows from an application of the triangle inequality and (5.7). ∎
Proposition 5.5**.**
Let be sufficiently large and . There holds
[TABLE]
Proof.
An application of the triangle inequality leads to
[TABLE]
Then, an application of the inequalities (5.2) and (5.7), Propositions 4.1 and 5.3 and Theorem 2.1 reveals the desired result. ∎
Finally, the uniform estimate between and can be derived from Propositions 5.4, 5.5 and 5.2. We then obtain the following result.
Theorem 5.4** (Uniform estimate on -term KL truncation of ).**
Let Assumption 5.1 hold and let be large and . Assume the spectral gap condition (3.7) to be valid. Then there exists sufficiently small, such that
[TABLE]
for all .
5.3 Numerical estimate for the error between and
In this section, by utilizing the preceding results on together with the mean value theorem, we will derive an error estimate between and . Note at this point that the results in this part can be only applied to the case when is a normal random field. One crucial tool which we will employ repeatedly below is Fernique’s theorem. For convenience, we recall it in the following.
Theorem 5.5** (Fernique’s theorem).**
Let be a real, separable Banach space and suppose that is an -valued random variable which is a centered and Gaussian in the sense that, for each , is a centered, -valued Gaussian random variable. If R=\inf\Big{\{}r\in[0,\infty):\mathbb{P}(\|X\|_{E}\leq r)\geq\frac{3}{4}\Big{\}}, then
[TABLE]
First, we give a priori bounds on and .
Proposition 5.6**.**
Let Assumption 5.1 hold and let be large and . Assume the spectral gap condition (3.7) to be valid. Then there exists sufficiently small such that, for all , there holds
[TABLE]
Proof.
Note that is a symmetric Gaussian random variable defined on and valued in . By Fernique’s theorem, there exists such that
[TABLE]
Hence, by Young’s inequality, we obtain
[TABLE]
and (5.8) leads to
[TABLE]
This shows the first assertion. The second one can be obtained in a similar manner. ∎
Now we can state the main result of this section.
Theorem 5.6**.**
Let Assumption 5.1 hold. Let be sufficiently large, let and . Assume the spectral gap condition (3.7) to be valid. Then, for all , there holds
[TABLE]
Proof.
The mean value theorem indicates
[TABLE]
This, combined with Hölder’s inequality, leads to
[TABLE]
where . In view of Theorem 5.4 and Proposition 5.6, this proves (5.9). The second assertion (5.10) can be shown similarly using Theorem 5.1. This completes the proof. ∎
6 Application to elliptic PDEs with random diffusion coefficient
In this section, we use the results of Theorem 5.6 to analyze a model order reduction algorithm for a class of elliptic PDEs with lognormal random coefficient in the multi-query context. In the algorithm, we apply the Karhunen-Loève approximation to the stochastic diffusion coefficient to arrive at a truncated model with finite-dimensional noise. We shall provide an error analysis below. Throughout this section, we assume that the conditions of Theorem 5.6 are satisfied.
Let be an open bounded domain in with a strong local Lipchitz boundary and let be a given probability space. Consider the elliptic PDE with random coefficient
[TABLE]
for a.e. , where the elliptic operator is defined by
[TABLE]
and denotes the derivative with respect to the spatial variable . We assume the force term to be in . In the model problem (6.1), the dependence of the diffusion coefficient on a stochastic variable reflects imprecise knowledge or lack of information.
The extra-coordinate poses significant computational challenges. One popular approach is the stochastic Galerkin method [3]. There, one often approximates the stochastic diffusion coefficient by a finite sum of products of deterministic and stochastic orthogonal bases (with respect to a certain probability measure). This gives a computationally more tractable finite-dimensional noise model. There, the choice of the employed orthogonal basis is crucial for the accurate and efficient approximation to . In this work, we consider the KL approximation of the random field in (2.14).
First, we specify the functional analytic setting. Let and let be its dual space. Then, for any given , the weak formulation of problem (6.1) is to find such that
[TABLE]
We first discuss the well-posedness of problem (6.2) for each , which was proven in [11, Theorem 2.2]. By Assumption 5.1, a.e.. Let and for all .
Proposition 6.1** (Ellipticity and boundedness of ).**
The following bounds hold:
[TABLE]
Proposition 6.1, together with the Lax-Milgram theorem, guarantees that the weak formulation (6.2) is well-posed. Furthermore,
[TABLE]
After substituting the numerical KL approximation of the diffusion coefficient into problem (6.1), we arrive at a truncated problem with finite-dimensional noise: For a.e.
[TABLE]
where is the elliptic differential operator with the diffusion coefficient . The corresponding weak formulation is then to find such that
[TABLE]
for any given . Analogous to the continuous case, let and for all . We can then state the well-posedness of problem (6.5).
Proposition 6.2** (Ellipticity and boundedness of ).**
The following bounds hold:
[TABLE]
Proof.
This follows from the proof of [11, Theorem 2.2]. ∎
Due to Proposition 6.2 and the Lax-Milgram theorem, we obtain the well-posedness of problem (6.5). Furthermore, (6.5) and Proposition 6.2, together with Poincarè’s inequality, give the following a priori estimate
[TABLE]
The next result quantifies the effect of the perturbation of the coefficient on the solution .
Theorem 6.1**.**
Let Assumption 5.1 hold. Let be large, let and . Assume the spectral gap condition (3.7) to be valid. Let and be solutions to (6.1) and (6.4), respectively. Then for all , there holds
[TABLE]
Proof.
From the weak formulations for and , cf. (6.2) and (6.5), we obtain for any
[TABLE]
By setting in the weak formulation (6.7) and using Proposition 6.1 and the generalized Hölder inequality, we have
[TABLE]
Consequently, we arrive at
[TABLE]
Finally, taking the -norm on both sides and employing the generalized Hölder’s inequality, combined with Theorem 5.6, Proposition 6.1 and the *a priori *estimate (6.6), shows the desired result. ∎
Remark 6.1**.**
Note that this work is mainly concerned with the numerical approximation of the lognormal random coefficient. Therefore, we refrain from discussing the important issue of the numerical approximation of the associated elliptic problems, i.e., Problems (6.1) and (6.4). We refer to [19] for related results in this direction.
7 Numerical simulation
In this section, we provide numerical tests to verify the theoretical results presented in Section 5. Recall that the three parameters , and denote the number of terms in the KL approximation, the number of sampling points and the mesh size. These parameters determine directly the computational cost involved.
We take in the following simulation. In order to obtain the -term KL truncation estimate to in the form of (2.14), we employ the fast CBC construction of randomly shifted lattice rules in the unanchored space [20] to estimate the kernel defined in (2.9). To this end, we employ the unanchored space as defined by (3), where the weight function and the weight parameters are
[TABLE]
We apply the CBC method [20, Algorithm 6] to derive the generating vector with the number of sampling points being . To this end, let the shift be an i.i.d uniformly distributed vector. Then we obtain the randomly shifted (rank-1) lattice rule by formula (3.3). Now, in (2.9) can be approximated by taking for .
The bivariate functions employed in the following examples belong to for all . Thus, using the CBC Algorithm to calculate as defined in (2.9) yields a shifted-averaged worse-case error of for any with the construction cost of .
7.1 Example 1: and
Let
[TABLE]
see Fig. 1 for an illustration. One can verify that for any with the physical domain and the stochastic domain . Thus, we have in this case. According to the definition of the finite element space , we will use conforming quadratic finite element. Now as in Remark 5.1, let the tolerance be chosen as . Note that we always fix the sampling points . Then we can take the number of truncation terms and the mesh size .
Since the dimension of the stochastic domain equals to one, we can choose the generating vector . We then can derive the sampling points from formula (3.3). The shifted-averaged worse-case error is 8.0171e-6. We present in Table 1 the root mean square error between and for different numbers of truncation terms and different mesh sizes .
Now, let us compare these computed results with the values that were predicted from our theory. To this end, we plug the fixed number of sampling points into Remark 5.1 and derive that we can take the accuracy , the number of truncation terms and the mesh size . Indeed, for , we also obtain the optimal error in Table 1. This shows that our estimates are quite sharp and involve just small constants.
7.2 Example 2: and
Let the bivariate function be given by
[TABLE]
Then one can verify that for any , with the physical domain and the stochastic domain . According to the definition of the finite element space , we will use spectral element up to degree 10. The basis functions are Lagrange interpolation polynomials through the local Gauss-Lobatto integration points defined per element. Due to Theorem 2.1, the eigenvalues decay very fast since in this case.
We present the root mean square errors between and in Tables 2 and 3 for different mesh sizes and different numbers of truncation terms with and , respectively.
Furthermore, for our fixed number of sampling points and for the accuracy , we can compare our computed results with the predicted ones due to Remark 5.1. We see that our estimates are again qualitatively quite sharp and involve just small constants.
8 Concluding remarks
In this work, we have analyzed the numerical approximation error in the Karhunen-Loève expansion to log normal random coefficients. We derived the numerical error in terms of the number of terms in the Karhunen-Loève expansion, the number of QMC sampling points to estimate the covariance function and the mesh size for the conforming Galerkin approximation to the eigenvalue problem. Our results show the basic relation
[TABLE]
among those three parameters, where is the dimension of the physical domain and denotes the regularity of the bivariate function in the physical domain. These results are also useful for the study of stochastic elliptic problems. We presented numerical results for one and multiple stochastic dimensions to support our theory.
The QMC method can be replaced by some properly adapted sparse grid method, if there is higher mixed regularity in present with respect to the stochastic variables. Analogously, if the physical problem possesses higher regularity, then a more suitable FEM of higher order can be utilized. Then, of course, the sampling estimate, the Galerkin estimate and the resulting error estimates have to be modified accordingly. This would lead to a different balancing of the terms that in Remark 5.1.
Acknowledgements
The authors were supported by the Hausdorff Center for Mathematics in Bonn and the Sonderforschungsbereich 1060 The Mathematics of Emergent Effects funded by the Deutsche Forschungsgemeinschaft. Guanglian Li acknowledges the support from the Royal Society via the Newton International fellowship. Part of this work was done during her visit to IPAM in the Long program: Computational Issues in Oil Field Applications. We thank Prof. Dr. Markus Bachmayr for fruitful discussions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Adams and J. Fournier. Sobolev Spaces . Elsevier/Academic Press, Amsterdam, 2003.
- 2[2] S. Agmon. Lectures on Elliptic Boundary Value Problems . Princeton, N.J.-Toronto-London, 1965.
- 3[3] I. Babuška, R. Tempone, and G. Zouraris. Galerkin finite element approximations of stochastic elliptic partial differential equations. SIAM J. Numer. Anal. , 42(2):800–825, 2004.
- 4[4] I. Babuška and J. Osborn. Eigenvalue problems. Handbook of Numerical Analysis , 2:641–787, 1991.
- 5[5] M. Bachmayr, A. Cohen, D. Dũng, and C. Schwab. Fully discrete approximation of parametric and stochastic elliptic PD Es. SIAM J. Numer. Anal. , 55(5):2151–2186, 2017.
- 6[6] M. Bachmayr, A. Cohen, and G. Migliorati. Representations of Gaussian random fields and approximation of elliptic PD Es with lognormal coefficients. J. Fourier Anal. Appl. , 2017. in press.
- 7[7] A. Bonami and A. Karoui. Uniform approximation and explicit estimates for the prolate spheroidal wave functions. Constructive Approximation , 43(1):15–45, 2016.
- 8[8] S. Brenner and L. Scott. The Mathematical Theory of Finite Element Methods , volume 15 of Texts in Applied Mathematics . Springer, New York, third edition, 2008.
