Identification of the degradation coefficient for an anomalous diffusion process in hydrology
Guang-Hui Zheng, Ming-Hui Ding

TL;DR
This paper develops a hybrid variational regularization and Laplace approximation method to accurately identify the degradation coefficient in anomalous diffusion processes in hydrology, addressing ill-posed inverse problems.
Contribution
It introduces a novel combined deterministic-stochastic approach for inverse degradation coefficient problems, ensuring uniqueness, stability, and quantifying uncertainty.
Findings
Proved unique determination of the degradation coefficient from average flux data.
Established existence, stability, and convergence of the solution.
Numerical examples demonstrate the method's efficiency and robustness.
Abstract
In hydrology, the degradation coefficient is one of the key parameters to describe the water quality change and to determine the water carrying capacity. This paper is devoted to identify the degradation coefficient in an anomalous diffusion process by using the average flux data at the accessible part of boundary. The main challenges in inverse degradation coefficient problems (IDCP) is the average flux measurement data only provide very limited information and cause the severe ill-posedness of IDCP. Firstly, we prove the average flux measurement data can uniquely determine the degradation coefficient. The existence and uniqueness of weak solution for the direct problem are established, and the Lipschitz continuity of the corresponding forward operator is also obtained. Secondly, to overcome the ill-posedness, we combine the variational regularization method with Laplace approximations…
| Type of basis function | |||
| polynomial basis | |||
| Trigonometric basis | |||
| Type of data | ||||||
| Average flux data | ||||||
| Direct flux data | ||||||
| Measurement data | skewness |
| Data on | |
| Data on | |
| Data on | |
| Measurement data | skewness |
| Data on | |
| Data on | |
| Data on | |
| y | |||
| 0.2 | |||
| 0.5 | |||
| 0.8 | |||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical methods in inverse problems · Groundwater flow and contamination studies · Advanced Mathematical Modeling in Engineering
Identification of the degradation coefficient for an anomalous diffusion process in hydrology
Guang-Hui Zheng Corresponding author. College of Mathematics and Econometrics, Hunan University, Changsha 410082, Hunan Province, China. Email: [email protected]
Ming-Hui Ding College of Mathematics and Econometrics, Hunan University, Changsha 410082, Hunan Province, China. Email: [email protected]
ABSTRACT
In hydrology, the degradation coefficient is one of the key parameters to describe the water quality change and to determine the water carrying capacity. This paper is devoted to identify the degradation coefficient in an anomalous diffusion process by using the average flux data at the accessible part of boundary. The main challenges in inverse degradation coefficient problems (IDCP) is the average flux measurement data only provide very limited information and cause the severe ill-posedness of IDCP. Firstly, we prove the average flux measurement data can uniquely determine the degradation coefficient. The existence and uniqueness of weak solution for the direct problem are established, and the Lipschitz continuity of the corresponding forward operator is also obtained. Secondly, to overcome the ill-posedness, we combine the variational regularization method with Laplace approximations (LA) to solve the IDCP. This hybrid method is essentially the combination of deterministic regularization method and stochastic method. Thus, it is able to calculate the minimizer (MAP point) more rapidly and accurately, but also enables captures the statistics information and quantifying the uncertainty of the solution. Furthermore, the existence, stability and convergence of the minimizer of the variational problem are proved. The convergence rate estimate between the LA posterior distribution and the actual posterior distribution in the sense of Hellinger distance is given, and the skewness are introduced for characterizing the symmetry or slope of LA solution, especially the relationship with the symmetry of the measurement data. Finally, the one-dimensional and two-dimensional numerical examples are presented to confirm the efficiency and robustness of the proposed method.
keywords: fractional diffusion equation, average flux data, degradation coefficient, Laplace approximations, Hellinger distance, skewness.
1 Introduction
In recent years, fractional calculus and fractional differential equations have been more and more extensively used in many scientific fields. For example, physical, chemical, biology, engineering, medicine, hydrology, finance and so on, refer to [1, 2, 3, 4, 5, 6, 7, 8, 9, 10].
As is known to all, in hydrology, the normal solute diffusion obeys Darcy s law:
[TABLE]
and mass conservation law:
[TABLE]
where is diffusion flux, is diffusion coefficient, is concentration of solute and denotes some source or sink. By substituting (1.1) into (1.2), the following classical diffusion equation can be derived
[TABLE]
It is well known that the classical diffusion equation can describe the normal diffusion quite well, and in probability theory, it corresponds to Brownian motion. However, there are an increasing number of so-called anomalous diffusion arises in real world, especially the diffusion phenomena occurred in some media with memory and hereditary properties [1, 2, 3, 4, 5]. The anomalous diffusion is not consistent with the classical mass conservation law, but satisfies what is called time fractional mass conservation law [11, 12], i.e.
[TABLE]
where ,and is the Caputo fractional derivative defined by [1, 2, 3, 4]
[TABLE]
Similarly, by combining Darcy s law (1.1), one can obtain the time fractional diffusion equation (TFDE)
[TABLE]
The time fractional diffusion equation, compared with the classical Brownian motion, is closely related to fractional Brownian motion and gradually accepted as an important tool for describing anomalous diffusion. Particularly in hydrology, the TFDE models sticking and trapping between mobile periods for contaminant particles in a porous medium [13] or a river flow [14]. About the direct problems for TFDE, i.e., initial value problem and initial boundary value problem, which have been studied extensively in the past few years [15, 16, 17, 18, 19, 20, 21, 22, 23, 24]. However, in some practical problems, the boundary data on the whole boundary cannot be obtained. We only know the noisy data on a part of the boundary or at some interior points of the concerned domain, which will lead to some inverse problems, i.e., fractional inverse diffusion problems. Recently, there are also rapidly growing publications on the time fractional inverse diffusion problems, such as a tutorial review [25], inverse initial boundary value problems [25, 26, 27, 28, 29, 30], inverse source problems [31, 32, 33, 34, 35], and inverse coefficient problems [36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48].
In hydrology and water resources management, the pollutant degradation coefficient is one of the key parameters to describe the water quality change and to determine the water carrying capacity [49, 50]. So, identification of degradation coefficient is very important for water quality evaluation, monitoring and protection. In anomalous diffusion, identification of degradation coefficient is often correspond to the inverse reaction coefficient problems (IRCP) for TFDE.
As for the inverse reaction (or degradation) coefficient problems for TFDE, there are only limited papers studying this topic. For example, Tuan [41] gave the uniqueness of IRCP by only using finite measurement data on the boundary. Jin and Rundell [39] established the uniqueness in determining the reaction coefficient from the direct flux measurements for one dimensional TFDE, and an algorithm of the quasi-Newton type is proposed for the numerical reconstruction. In [48], Yamamoto and Zhang obtained conditional stability in recovering the reaction coefficient in a one dimensional TFDE with one half order Caputo derivative by a Carleman estimate. By means of integral transform method, Miller and Yamamoto [40] proved the uniqueness for the IRCP for TFDE from the internal measurement data. Li et al. [37] suggested an optimal perturbation algorithm for the simultaneous numerical recovery of the diffusion coefficient and fractional order in a one dimensional TFDE. In [44], Sun and Wei proved the uniqueness in identifying the reaction coefficient for one dimensional TFDE, and used the conjugate gradient method to solve it numerically. The IRCP for TFDE from the final time data measurement was discussed by Jin and Rundell in [25].
In this paper, based on pollutant degradation model in hydrology and compared with the above one dimensional case which was studied more extensively, we consider the inverse degradation coefficient problem for TFDE in higher dimensions. Let be a bounded domain in with smooth boundary . The pollutant degradation model can be described by TFDE as follows
[TABLE]
where the degradation coefficient , , the diffusion coefficient Matrix and satisfy , ,
[TABLE]
The input source formed with separated variables , where is the time-varying strength of source, and denotes the space-position information. For example, the usual modelling of point source has the form , here is Dirac function and is the source location.
Throughout this paper, The solutions to system (1.6) will be denoted by in order to indicate its dependence on the degradation coefficient , and correspond to the input sources , ; . Hereafter, refers to a generic constant which may differ at different occurrences. Then, our aim is to solve the following inverse problem.
Inverse degradation coefficient problem (IDCP) for TFDE: Given the input source , ; , such that the measurement data set
[TABLE]
determine the degradation coefficient (see Figure 1.1 for a schematic illustration). Here is an accessible part of the boundary, is a nonzero nonnegative function, and
[TABLE]
where is the th component of the outward unit normal vector .
The measurement data set (1.7) is a weighted integral data on accessible part of the boundary, the weight function can be interpreted as a characterization of measure instrument. Compared with the direct flux measurement (or Neumann measurement data) on boundary, i.e. \frac{\partial u_{j}^{i}(x,t;q)}{\partial\nu_{A}}\big{|}_{\Lambda\times(0,T)} (see Jin and Rundell [39]), it is a average flux measurement, which is rather easier to measure as a practical matter [51, 52]. However, because of the average effect, the amounts of average flux measurement data is much less than the amounts of direct flux measurement data. The similar situation also arises in the popular topic called inverse scattering problem with phaseless data [53, 54, 55]. For IDCP, owing to the reduced measurement data, the average flux measurement reconstruction is much more severely ill-posed than the direct flux measurement reconstruction. Moreover, the high nonlinearity is also the inherent difficulty for IDCP. In this paper, we will prove that the limited average flux measurement is enough to identify the degradation coefficient uniquely. Then we combine the variational regularization method with Laplace approximations to solve the IDCP. This hybrid method is essentially the organic combination of deterministic method and stochastic method.
Our contributions of this work are fourfold:
- •
We establish the uniqueness of IDCP, i.e., the average flux measurement data (1.7) can uniquely determine the degradation coefficient .
- •
Due to the severe ill-posedness of IDCP, the reconstruction accuracy of degradation coefficient are very sensitive to the data noise, we combine the () variational regularization method and Laplace approximations theory to overcome this difficulty. Especially, it can effectively capture the statistics information and quantifying the uncertainty of the solution.
- •
We analyze the Hellinger distance between the exact posterior measure and its approximation given by Laplace approximations and prove the second order convergence rate at the MAP point.
- •
Since the average flux measurement only provide very limited information, such as the symmetry of degradation coefficient. We first use a term called ”skewness” from statistics to characterize the symmetry of solution, and show the relation between the symmetry of solution and the symmetry of data. Furthermore, we recover numerically the skewness of degradation coefficient by improving the symmetry of measurement data.
The paper is organized as follows. In Section 2, we present the uniqueness for the IDCP. In Section 3, we consider the well-posedness of weak solution of the direct problem, and prove the Lipschitz continuity of the corresponding forward operator. Furthermore, the variational regularization scheme with () penalty term is introduced to overcome the ill-posedness of IDCP. The existence, stability, convergence theorem of the minimizer of the variational problem are established. Moreover, the first-order Fréchet derivative and second-order Fréchet derivative of forward operator are obtained by using adjoint method, and the conjugate gradient method is presented for calculating the minimizer (MAP point). In Section 4, the Bayesian theory and Laplace approximations method are recalled. The convergence rate estimate between the LA posterior distribution and the actual posterior distribution in the sense of Hellinger distance is given, and the confidence region and skewness are introduced for characterizing the accuracy and reliability of LA. Several one-dimensional and two-dimensional numerical examples are shown in Section 5. Finally, the conclusions are given in Section 6.
2 The uniqueness of the IDCP
In this section, the classical solutions of the problem (1.6) which belong to the space are considered [17], here denotes the space of functions such that . Motivated by the ideas in [57], we show that the average flux measurement data can determine the degradation coefficient uniquely.
Theorem 1.1. Let be a complete set in , and be given nonzero nonnegative functions, and satisfies . Assume , , , on . Let , be the classical solutions of problem (1.6) corresponding to the input sources (; ) with the degradation coefficients and respectively. If we choose , such that
[TABLE]
then in .
Proof. Notice that be a given nonzero nonnegative function. Then we set on and on , and introduce the function as the solution of the following adjoint problem
[TABLE]
In fact, by using the transform formula , (2.9) becomes [48]
[TABLE]
Since we choose , from , and the Green’s formula we compute
[TABLE]
[TABLE]
where the second equality uses the following fractional integration by parts formula (see [46] Lemma 2.1):
[TABLE]
here is the space of functions which are absolutely continuous on . Similarly, by setting , it follows that
[TABLE]
For and the corresponding function given by ,we see that
[TABLE]
Then we see from that
[TABLE]
By the completeness of , we obtain
[TABLE]
Multiplying equation (2.9) by , integrating by parts over , we find
[TABLE]
The two expressions of are subtracted from each other, and using (2.11) we have
[TABLE]
we finally combine , to discover
[TABLE]
The maximum principle for time factional diffusion equation (2.9) or (2.10) (see [17], [56]) can be applied to deduce that , then in .
Remark 1.2. Obviously, when the average flux measurement (2.8) is replaced by the direct flux measurement and the other assumptions of Theorem 1.1 are satisfied, the uniqueness result still holds, i.e., if
[TABLE]
then in . In [39], the direct flux measurement was used by Jin and Rundell for coefficient identification in one dimensional TFDE (see Theorem 3.1 (a) in [39]). However, they assume that the reaction coefficient must be known beforehand in the neighborhood of right boundary .
Remark 1.3. Let , satisfying in , in , and the other assumptions of Theorem 1.1 are satisfied. By choosing , , we see the uniqueness also holds. The proof is the same as in Theorem 1.1.
3 Variational regularization method
3.1 Weak solution of TFDE
In order to obtain the continuity of forward map and provide feasibility for numerical computation (such as finite element method), we study the weak solution of problem (1.6) with . The existence, uniqueness and stability of weak solution are given. Let denote the space of infinitely differentiable functions on with compact support in . The Sobolev space is the closure of with respect to the norm , where denotes the norm in the usual fractional Sobolev spaces [58, 72].
We recall and define a Hilbert space
[TABLE]
equipped with the norm
[TABLE]
Definition 3.1.1. (Weak solution) We call that is a weak solution of the initial boundary value problem provide
[TABLE]
where the bilinear form and are defined by
[TABLE]
Theorem 3.1.2. If , , and , , satisfy the same assumption in Theorem 1.1. Then the problem exists a unique solution in and the solution satisfies
[TABLE]
where is a constant independent of .
By using the Lax-Milgram theorem, the proof of Theorem 2.1 is standard (refer to [58, 19, 46]). Hence, we omit it.
3.2 The forward operator
In this section, we solve numerically the reaction coefficient by problem . The inverse coefficient problem is formulated into a variational problem by using the Tikhonov regularization. Then the existence, stability and convergence of minimizer for the variational problem are provided.
Define a forward or solution operator
[TABLE]
where , and
[TABLE]
and the norm of is defined by . To get the continuity of the forward operator, we restrict .
Theorem 3.2.1. If , and the other assumption is the same as Theorem 1.1. Then the nonlinear forward map is the Lipschitz continuous.
Proof. Setting , it follows that
[TABLE]
and
[TABLE]
Let , then satisfies the following equation
[TABLE]
Since and , by using Theorem 3.1.2, we have
[TABLE]
Similarly from
[TABLE]
Furthermore, by trace theorem, it implies
[TABLE]
and then
[TABLE]
Therefore, the Lipschitz continuity of is proved.
3.3 Variational regularization method with () penalty term
In order to overcome the ill-posedness of the problem, we apply the variational regularization method with () penalty term to deal with it. Then the corresponding variational functional is defined as follows
[TABLE]
where denotes the measure data matrix and satisfies , here is the noise level. is the regularization parameter. Next, we will prove the existence, stability and convergence of minimizer of the variational functional (3.23). Although the proof is similar to [44, 59], for the completeness, we give some details.
Theorem 3.3.1. There exists a minimizer for variation functional .
Proof. Because of the nonnegativity of , there is a minimizing sequence in such that
[TABLE]
Since is bounded in , there exists a subsequence, which is again denoted by , such that in . Moreover, by using the reflexivity of and the density of in [60], it follows that in . Owing to the closed convexity of , by using Mazur Theorem, we find , and then . From the weak lower semicontinuity of norm, and the Lipschitz continuity of , we have
[TABLE]
Therefore, is a minimizer of .
Theorem 3.3.2. Assume that is a sequences which satisfy in , and is a minimizer of with replaced by . Then the minimizers of are stable with respect to the measurement data .
Proof. From the definition of , we find
[TABLE]
Since is bounded in , there has a subsequence, still denoted by , such that . Similar to the proof of Theorem 3.3.1, it implies . Based on continuity of and weak lower semicontiunity of norm, by it follows that
[TABLE]
for all . This deduce that is a minimizer of . Furthermore, we set , and find that
[TABLE]
Thus, the stability result holds.
Definition 3.3.3. is called an -minimizing solution if
[TABLE]
Theorem 3.3.4. Suppose that the noise level sequence convergence monotonically to [math], and the corresponding measurement data satisfy . Moreover, assume that the regularization parameter satisfies , and , (as ), and is also monotonically increasing. is a minimizer of with replaced by . Then has a subsequence which convergent to an -minimizing solution, and satisfies that .
Proof. From the definition of , we get
[TABLE]
Then, take , we obtain
[TABLE]
Moreover, notice that
[TABLE]
and , it implies that
[TABLE]
Since is bounded, there has a subsequence, which denoted again by satisfying . Likewise, by Mazur Theorem, we also get . Hence, by (3.27), a similar argument implies that
[TABLE]
for all satisfying . Setting deduce that . That is, is an -minimizing solution, and satisfies .
3.4 The Fréchet derivative of variation functional
For simplicity, we only focus on the case , then the variation functional becomes
[TABLE]
In order to find the minimizer of variation functional (3.29), the efficient evaluation of the Fréchet derivative is critical, here we adopt the adjoint method. First, we claim that the following asymptotic expansion formula of solution holds.
Theorem 3.4.1. The solution is differentiable in the sense that: for any direction , we have that
[TABLE]
where satisfies the sensitive equation:
[TABLE]
Proof. Setting . It is easy to verify that satisfies
[TABLE]
By Theorem 3.1.2, it deduces that . Similarly, by using Theorem 3.1.2 to (3.31), we find , and then the claim now holds.
The following theorem shows that the solution is twice Fréchet differentiable. Since the proof is analogous to Theorem 3.4.1 and is omitted.
Theorem 3.4.2. is differentiable in the sense that: for the direction , we have that
[TABLE]
where satisfies the second-order sensitive equation:
[TABLE]
Theorem 3.4.3. For the direction , the Fréchet derivative of variation functional at is given by
[TABLE]
where satisfies the adjoint equation:
[TABLE]
Proof. In fact, we only need to prove that the Fréchet derivative of is given by
[TABLE]
Notice that can be written as the Lagrangian multiplier formula,
[TABLE]
for any multiplier function . Then, from Theorem 3.4.2 and integration by parts, we have that
[TABLE]
By choosing , and satisfies (3.35), then we obtain (3.36).
3.5 The conjugate gradient method
The conjugate gradient method (CGM) combined with an appropriate stopping rule can serve as a regularization method, and it has been applied to various inverse problems [44, 46, 71]. However, the CGM is a deterministic regularization method which yield only a point estimate of the solution, without quantifying the associated uncertainties of measurement data noise. Here, we use CGM to calculate a maximum a posteriori (MAP) estimator of the solution, and then in conjunction with the Laplace approximations to sample the associated posterior distribution. The CGM we utilized is presented in Algorithm 1.
4 Bayesian theory and Laplace approximation
4.1 Convergence of Laplace approximation
From classical Bayesian theory, and notice that the observation error is an independent and identically distributed (i.i.d) Gauss random vector with mean zero and the covariance matrix , here is the unit matrix, we can write the minimization functional as [61]
[TABLE]
where is a covariance weighted norm on given by , and is a stretched vector version of the one defined in . Furthermore, the minimizer of defines the maximum a posteriori estimator
[TABLE]
The Laplace approximation (LA) in essence is a linearization around the MAP point for sampling the posterior distribution of the solution (refer to [62]). It consists of approximating the posterior measure(or distribution) by , here is the inverse of Hessian of . Next, motivated by [63], we analyze the Hellinger distance between the exact posterior measure and its approximation given by and obtain the convergence as . In [64], Wacher gave a bound of the Hellinger distance between the posterior and its LA, but it seems not to deduce the convergence. Moreover, the estimation of other distances (such as Kullback-Leibler divergence) between the posterior distribution and its approximation were applied in analysing the stochastic surrogate models (e.g., [65, 66]).
Lemma 5.1.1. For every and , there exists such that, the forward map satisfies,
[TABLE]
Proof. From theorem 3.1.2, and trace theorem, it follows that
[TABLE]
Thus
[TABLE]
and then
[TABLE]
here, .
From the Laplace approximation, the Taylor expansion of at MAP point is given by
[TABLE]
In the infinite dimensional version of Beyesian theory. The posterior measure is absolutely continuous with respect to the prior, and the Rodon-Nikodym derivative between them is determined by the data likelihood, i.e.
[TABLE]
Similarly, the Laplace approximation version of Bayesian formula is defined as
[TABLE]
Lemma 5.1.2. The forward map satisfies:
(i) for every and , there exsits such that, for all and ,
[TABLE]
(ii) for every , there exsits a such that for all and with ,
[TABLE]
The Lemma 5.1.2 is the direct results of Lemma 5.1.1, by using Lemma 2.1 in [63], so we omit the proof.
Theorem 5.1.3. Assume that and are defined in Lemma 5.1.2, then the measure and its Laplace approximation measure are close with respect to the Hellinger distance, i.e., there is a constant , such that
[TABLE]
and
[TABLE]
where the Hellinger distance is defined by
[TABLE]
Proof. By using Lemma 5.1.2 (ii), we obtain
[TABLE]
where , , and
[TABLE]
From the second-order necessary condition for minimizer, lemma 5.1.2 (i) and Fernique theorem (see [63]), we see that
[TABLE]
Moreover, notice , it follows that
[TABLE]
From the definition of Hellinger distance, we get
[TABLE]
Now, using and , it deduces
[TABLE]
and then
[TABLE]
Hence, we can find that
[TABLE]
Remark 5.1.4. From the proof of Theorem 5.1.3, we can see that the approximation error is derived from the Taylor expansion (4.40), and second-order convergence rate is given. Moreover, from the convergence estimation (4.45), we also find that when the noise level is decreased or the regularization parameter is increased, the Hellinger distance become small, and it provides some inspiration for selecting the regularization parameter.
4.2 Numerical algorithm of Laplace approximation
For effective numerical simulation, we take a finite dimensional approximation of the previous minimization problem as follows
[TABLE]
where , and is a finite-dimensional approximation of the continuous forward map . The Laplace approximation theory shows the posterior distribution . In the finite dimensional, the covariance matrix (see [67], Section 10.5)
[TABLE]
where is Jacobian matrix of forward operator at point. Notice that the covariance formula (4.53) only uses the first order derivatives of . A standard implementation of Laplace approximation is presented in the following algorithm [67]:
Samples generated by (4.55) are drawn from , and so the ensemble of realizations provides an approximation to and hence the posterior. Finally, we use the mean of the sample as an approximation of , where the convergence of follows from the strong law of large numbers. From the classical Gaussian statistic theory, we find that are consistent and best unbiased estimate of .
4.3 Confidence region and skewness
We can calculate the confidence region for the inferred parameters (such as degradation coefficient in IDCP) by the LA posterior samples. The confidence region is a set of points in an -dimensional space, often represented as an ellipsoid around a point which is an estimated parameter. The confidence region shows that the real value of the identified parameter has a certain probability of falling around the numerical construction result and quantifies the level of confidence that the parameter lies in the region. Therefore, the confidence region gives the reliability of the construction result of the inferred parameters.
Definiton 4.3.1. (Confidence region) [68] The confidence region of is defined as follows
[TABLE]
It is an ellipsoid centered on point, and assume the eigenvalue of are
[TABLE]
We find that the axes of the ellipsoid are . Since , it deduces that , and the length of each axis is . Let be the component of , then it satisfies . Thus, we can see that the confidence region is a multi-dimensional generalization of a confidence interval. It also can be seen that when the number of samples , the regularization parameter and the eigenvalue of increase or the measurement noise level decrease, the size of confidence region will decrease and give higher reliability.
Skewness is a statistic that studies the symmetry of data distribution. By measuring the skewness, we can determine the degree and direction of the asymmetry of the data distribution. The definition of skewness is given below
Definition 4.3.2. (Skewness) [69] Assume the third central moment of the random variable is exists, the following ratios:
[TABLE]
is called the skewness coefficient for X, i.e., skewness. Here is the mean, is the standard deviation, is the expectation operator, is the third central moment.
By normalization we can transform the identified parameter into a well defined probability density function for a hypothetical random variable, and calculate the skewness of random variable to estimate the symmetry of the inferred parameter (see Section 5).
5 Numerical experiments and discussions
In this section, we present some numerical examples to illustrate the feasibility of the Laplace approximation method for IDCP.
The noisy data are generated by adding random perturbations as follows [71]
[TABLE]
here is a Gaussian random variable with zero mean and unit standard deviation. indicates the noise level.
To show the accuracy of numerical solutions, we compute the relative error denoted by
[TABLE]
where is the approximate degradation coefficient reconstructed by the LA algorithm, and is the exact solution.
In an iteration algorithm, aim to make to approximate point, the important work is to find a suitable stopping rule. Using to represent the residuals of iterations of and steps, i.e.,
[TABLE]
when , we stop iterating as a stopping criterion.
5.1 Inversion for one-dimensional degradation coefficient
The domain under consideration is a unite line , and the boundary is indicated as , . The direct problem is discretized by using uniform rectangular finite element. Set , and the grid point on is . The boundary data are obtained by solving the direct problem with , and setting positive function . We solve the direct problem, sensitive problem and adjoint problem by using the finite element method and construction of data using difference method.
The obtained by conjugate gradient method is an approximation to . Samples generated by algorithm 2 are drawn from , and so the ensemble , the length is taken to be , provides an approximation to .
5.1.1 Smooth solution
In , the accessible boundary is taken to be , and the degradation coefficient is given by
[TABLE]
First we investigate the effect of the amounts of basis functions. Trigonometric basis functions are used, i.e., in Example 1. In Table 1, for fractional order , we list the error of LA solution and exact solution under different amounts of basis functions and different noise levels, is the number of basis functions. From the Table 1, we can see that the error becomes smaller at the same noise level as the number of basis function increases. When the number of basis functions is the same, the result become worse with the increase of noise levels. In the following calculations, we chose .
In Theorem 3.3.4, we introduce a slightly crude rule of regularization parameter selection, i.e.,
[TABLE]
The numerical results for Example 1 with various and are shown in the Table 2. We find that at the same noise level, was selected to meet the rule , but the results are unsatisfactory. By choosing which satisfies , the error between the LA solution and the exact solution is small, and the result was desirable. However, the crude rule for regularization parameter selection is not satisfied with , and the error between LA solution and exact solution will gradually increase and result will be shock. Thus, the regularization parameter can be chosen neither too large nor too small, even if it satisfies .
The numerical result for Example 1 for various noise levels , and in the case of are shown in Figure 5.2. The basis function is the trigonometric basis function in . Compare the Figure with Figure , the results in Figure is worse than Figure . In Table 3, we further show the numerical errors of the Example 1 for different and . It can be seen that the numerical results become worse as the noise levels increase. The error become larger as fractional order increases. However, when the noise level is 0.0001, the numerical result is insensitive to fractional order .
Next, we change trigonometric basis function into polynomial basis function in , i.e., . We show the reconstruction results for Example 1 under various error levels and different types of basis function with in the Table 4. The data indicate that the difference in types of basis function will affect the numerical results. The numerical results obtained by trigonometric basis functions are better than those obtained by polynomial basis functions. In the following example, we only consider the triangle basis functions.
In the Figure 5.3 show that the results by using the direct flux data is not sensitive to the fractional order . From Table 5, we find that the reconstruction results by using the direct flux data are better than the ones by using the average flux data. However, we also get satisfactory numerical results by using the average flux data when the noise levels are respectively. The reconstruction error caused by using average flux data increases sharply when the noise level exceeds , but using the direct flux data still give good results when the noise level exceeds and even reaches . This is because, compared to direct flux data, the amount of the average flux data is less and provides limited information. Moreover, the limited measurement data lead to higher sensitivity to noise and severally ill-posedness of IDCP. But, the average flux data is rather easier to measure as a practical matter, and it has been widely used recently. Hence, recovering the degradation coefficient accurately by using limited measurement data can be a real challenge.
For Example 1, we draw the 95% confidence interval in Figure 5.4 for the noise level with . The posterior mean is in excellent agreement with the exact solution, and the confidence interval quantifies its associated uncertainty. The confidence interval shrinks as the noise level decreases. We observe that the confidence interval on one side near the observation data is relatively narrow, and the corresponding confidence interval is also relatively accurate. The confidence interval far from the observation data is wide and the corresponding confidence interval is relatively inaccurate. It’s a surprise that the position of the data affects the accuracy of our reconstruction results. In Example 2, Example 3, we will continue to verify this result.
. In , the accessible boundary is taken to be , and the degradation coefficient is given by
[TABLE]
The numerical results for Example 2 for various levels of noise in the data are shown in Figure 5.5 with . The results in Figure is worse than Figure as the fractional order increase. Moreover, we also find that the position of the measurement data have a great influence on the reconstruction results. In the Figure 5.2, we let , the average flux data are measured on the for Example 1, when the exact solution is biased to the right of the region, we obtain good numerical results. In the Figure 5.5, setting , when the average flux data are measured on the for Example 2, the left biased give desired approximate result (see Example 3 for further discussion). We used LA algorithm to sample, calculated the corresponding posterior mean value , and drew 95% confidence interval in the Figure 5.6, we can get similar results of Example 1. The confidence interval shrinks as the noise level decreases and near observation data is more accurate, far from the observation data is wide and inaccurate, see the Figure 5.6.
. In , the accessible boundary is taken to be , and the degradation coefficient is given by
[TABLE]
The numerical results for Example 3 with various fractional order and noise levels are presented in Figure 5.7, it shows that the fractional order increases but the result is not good. When we measure average flux data from both sides, and use LA method to recover the degradation coefficient, the error is small with noise levels are respectively. Furthermore, the confidence interval has higher precision on both sides and lower precision in the middle of the region. The confidence interval are shown in Figure 5.8. This is consistent with our previous conclusions.
Now, we introduce the concept of skewness to describe the degree of LA solution’s deviation from the mean value, and further demonstrate the relation between the symmetry of solution and the symmetry of measurement data. The degradation coefficient is taken as Example 3, and the boundary average flux data are measured on different positions. Then the corresponding numerical results for Example 3 are displayed in Figure 5.9. As the error level increase to 0.001, when measuring the data on the boundary of , the degradation coefficient recovered by LA method is left skewed, i.e., its skewness is positive. When the average flux data are measured on the boundary of , the degradation coefficient is right skewed, i.e., its skewness is negative. When we measure average flux data on both sides of the boundary (symmetric data), that is to say , the reconstructed degradation coefficient is also symmetric and give a accurate approximation, and its skewness is zero. Thus, we can use the symmetry of measurement data to capture the symmetry feature of the identified object. By using (4.58), the calculation of the corresponding skewness can be referred to Table 6.
5.1.2 Nonsmooth function
In this example, we consider the more challenging case of reconstructing a nonsmooth example with a cusp, and the degradation coefficient is prescribed as follows:
[TABLE]
We also consider an discontinuous example, and the degradation coefficient is:
[TABLE]
The numerical results for Example 4 and Example 5 for various noise levels in the case of are shown in the Figure 5.10 with the regularization parameter . It can be seen that the smaller the noise level, the better the numerical results. The error for nonsmooth Example 4 in the neighborhood of the cusp is large because of the smoothing nature of the prior. And the same result can be obtained from Example 5, the numerical results obtained in the neighborhood of segment point are not very desired. For the nonsmooth case, different regularization methods and different prior information are needed, such as TV prior [70], and we do not discuss the details here.
5.2 Inversion for two-dimensional degradation coefficient
The domain under consideration is a unit square . Set , and the grid point on is . In the equation , we take , the diffusion coefficient matrix is unitary. The forward problem is discretized using uniform rectangular finite element. The number of basis functions is take N=5 and sample size . Set positive function . The , where , , , . Next, we presents the numerical results for 2D cases.
We take the , the degradation coefficient is:
[TABLE]
The LA solution of Example 6 tends to the front right of the region, we can see the Figure 5.11. In this case, we use the asymmetric boundary data to reconstruct the degradation coefficients, here the data is on the , and it verifies the LA solution is close to the data location. The solution dependence on data location is more obvious in two dimensions.
We take the , the degradation coefficient is:
[TABLE]
It is easy to see is central symmetry, and the reconstruction results are displayed in Figure 5.12. Here, Figure is the exact solution and the Figure , , are correspond to average flux data measured on the different part of boundary. The skewness is calculated from the marginal density function. The data measured on boundary , the graph is skewed to the front right of the region, that is, the skewness is negative in the x direction and positive in the y direction. When the average flux data measured on , the graph slant toward the front of the area i.e., the skewness is zero in the x direction and positive in the y direction. We use the average flux data on the whole boundary , the graph is accordance with the exact solution. The skewness correspond to the average flux data collected on different position of boundary can be seen in Table 7. Notice that here the corresponding marginal distributions of 2-D random variable are used to calculate the skewness by applying formula (4.58). In addition, the 95% confidence intervals of LA solutions at different are shown in the Table 8. It can be seen that, owing to the appropriate regularization parameter selection and average flux measurement on whole boundary, the size of confidence region is very small, i.e., the proposed LA algorithm gives a higher reliability.
6 Conclusions
In this paper, we study the identification of degeneracy coefficients in time-fractional diffusion equations (TFDE) by using the average flux data at the accessible part of boundary. We mainly prove that the average flux measurement data can uniquely determine the degradation coefficient. The Lipchitz continuity of the corresponding forward operator is obtained. Due to the average flux measurement data only provide very limited information, and lead to serious ill-posedness of IDCP. This paper combines Tikhonov regularization with Laplace method to overcome the ill-posedness. The existence, stability and convergence of solutions of variational problems are given. The paper introduce the sensitivity problem and the adjoint problem to find the minimizer of the variational problem by using the conjugate gradient method, and derive the mean and variance of the approximate posterior distribution by applying Bayesian theory and Laplace approximation. The Hellinger distance between the exact posterior measure and Laplace approximation is analyzed, and the second order convergence rate at MAP point is proved. The symmetry of the LA solution described by skewness is proposed, and find that the symmetry of solution is closely related to the symmetry of data. Finally, some numerical examples show that the method is not only accurate and flexible, but also can capture statistical information and quantify the uncertainty of the solution.
Acknowledgments
The work described in this paper was supported by the NSF of China (11301168).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Metzler, J. Klafter , The random walk s guide to anomalous diffusion: a fractional dynamics approach , Phys. Rep. 339 (2000) 1-77.
- 2[2] I. Podlubny , Fractional differential equations. An introduction to fractional derivatives, fractional differential equations, some methods of their solution and some of their applications , Academic Press, 1999.
- 3[3] A. Kilbas, H. Srivastava, J. Trujillo , Theory and Applications of Fractional Differential Equations , Elsevier, Amsterdam, 2006.
- 4[4] S. Samko, A. Kilbas, O. Marichev , Fractional Integrals and Derivatives , Gordon and Breach Science Publishers, Philadelphia, 1993.
- 5[5] V. Uchaikin , Fractional Derivatives for Physicists and Engineers , Springer, 2013.
- 6[6] D. Brockmann, L. Hufnagel, T. Geisel , The scaling laws of human travel , Nature 439 (2006) 462-465 .
- 7[7] E. Scalas, R. Gorenflo, F. Mainardi , Fractional calculus and continuous-time finance , Phys. A. 284 (2000) 376-384.
- 8[8] D. Benson, S. Wheatcraft, M. Meerschaert , Application of a fractional advection-dispersion equation , Water Resour. Res. 36 (6) (2000) 1403-1412.
