Pad\'{e}-type approximations to the resolvent of fractional powers of operators
Lidia Aceto, Paolo Novati

TL;DR
This paper develops a reliable pole selection method for Padé-type rational approximations of the resolvent of fractional powers of operators, providing accurate error estimates and demonstrating effectiveness through numerical examples.
Contribution
It introduces a new pole selection strategy based on hypergeometric functions for better rational approximation of fractional operator resolvents.
Findings
Accurate error estimates for Padé approximation of fractional powers
Numerical validation of theoretical error bounds
Enhanced rational Krylov methods based on the new approximation approach
Abstract
We study a reliable pole selection for the rational approximation of the resolvent of fractional powers of operators in both the finite and infinite dimensional setting. The analysis exploits the representation in terms of hypergeometric functions of the error of the Pad\'{e} approximation of the fractional power. We provide quantitatively accurate error estimates that can be used fruitfully for practical computations. We present some numerical examples to corroborate the theoretical results. The behavior of the rational Krylov methods based on this theory is also presented.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Padé-type approximations to the resolvent of fractional powers of operators
Lidia Aceto
Departments of Mathematics, University of Pisa, Via F. Buonarroti, 1/C, 56127 Pisa, Italy
and
Paolo Novati
Departments of Mathematics and Geosciences, University of Trieste, via Valerio 12/1, 34127 Trieste, Italy
Abstract.
We study a reliable pole selection for the rational approximation of the resolvent of fractional powers of operators in both the finite and infinite dimensional setting. The analysis exploits the representation in terms of hypergeometric functions of the error of the Padé approximation of the fractional power. We provide quantitatively accurate error estimates that can be used fruitfully for practical computations. We present some numerical examples to corroborate the theoretical results. The behavior of the rational Krylov methods based on this theory is also presented.
2010 Mathematics Subject Classification:
Primary 47A58, 65F60, 65D32
The authors are members of the INdAM research group GNCS
This work was supported by GNCS-INdAM and by FRA-University of Trieste
1. Introduction
Let be a self-adjoint positive operator with spectrum , acting on a Hilbert space endowed with norm and operator norm . We assume that possesses a compact inverse so that it can be written in terms of its spectral decomposition and the operational calculus can be defined by working on the eigenvalues as in the finite dimensional setting. Moreover, denoting by the eigenvalues of and assuming that they are numbered in increasing order of magnitude, we have This paper deals with the numerical approximation of the resolvent of fractional powers
[TABLE]
where denotes the identity operator. This kind of resolvent appears for instance when using an implicit multistep or a Runge-Kutta method for solving fractional in space parabolic-type equations in which represents the Laplacian operator with Dirichlet boundary conditions and depends both on the time step and the parameters of the integrator. We quote here [14] and the references therein contained for a comprehensive treatment of the operational calculus involving fractional powers, in the more generic setting of linear operators on Banach spaces.
Clearly the computation of is closely connected to the approximation of the fractional power because
[TABLE]
and hence, any approximant of the function in can be employed to define a method for the resolvent. In this view, recalling the analysis given in [1], the basic aim of this work is to consider Padé-type approximations of the fractional power, centered at points that allows to minimize as much as possible the error for . This idea is justified by the fact that the function behaves like for large values of .
For any given , let be the -Padé approximant of centered at , and consider the approximation
[TABLE]
It is well known that the choice of the parameter is fundamental for the quality of the approximation, see [1, 2]. Since is assumed to be self-adjoint the analysis of this approach can be made by working scalarly in the real interval . In particular, working with unbounded operator, in [1] it has been shown how to suitably define the parameter by looking for an approximation of the optimal value given by the solution of
[TABLE]
As for the resolvent, using in (1.1) and writing
[TABLE]
where denotes the set of polynomials of degree at most , we have
[TABLE]
Obviously inherits the dependence on , and the main contribute of this paper is to define the parameter in order to minimize as much as possible the error
[TABLE]
so that the idea here is to define by looking for the solution of
[TABLE]
We derive an approximate solution of (1.5) that depends on and , and we are able to show that the error decays like that is, sublinearly. We experimentally show that using this new parameter sequence it is possible to improve the approximation attainable by taking as in [1] for and then using (1.1) to compute . The latter approach has recently been used in [4].
In the applications, where one works with a discretization of , if the largest eigenvalue of (or an approximation of it) is known, then the theory developed for the unbounded case can be refined. In particular, here we present a new sequence of parameters that can be use to handle this situation and that allows to compute with a linear decay of the error, that is, of the type , . In both situations, unbounded and bounded, we provide error estimates that are quantitatively quite accurate and therefore useful for an a-priori choice of that, computationally, represents the number of operator/matrix inversions (cf. (1.3)).
The poles of can also be used to define a rational Krylov method for the computation of , , and the error estimates as hints for the a-priori definition of the dimension of the Krylov space. We remark that the poles of completely change with , so that for a Krylov method it is fundamental to decide at the beginning the dimension to reach, that is, the set of poles. The construction of rational Krylov methods based on the theory presented in the paper is considered at the end of the paper.
The paper is organized as follows. In Section 2 the basic features concerning the rational approximation of the fractional power are recalled. Section 3 and Section 4 contain the error analysis for the infinite and finite dimensional case, respectively. Finally, in Section 5 some numerical results are reported, including some experiments with rational Krylov methods.
2. The rational approximation
The -Padé type approximation to recalled in (1.2) can be obtained starting from the integral representation (see [6, Eq. (V.4) p. 116])
[TABLE]
and using the change of variable
[TABLE]
that yields
[TABLE]
Using the -point Gauss-Jacobi rule with respect to the weight function we obtain the rational approximation (see (1.2))
[TABLE]
The coefficients and are given by
[TABLE]
where and are, respectively, the weights and nodes of the Gauss-Jacobi quadrature rule. Denoting by the th zero of the Jacobi polynomial and setting
[TABLE]
from [3, Proposition 2] we can express as the rational function
[TABLE]
where
[TABLE]
We refer here to [7, 13] for other effective rational approaches based on different integral representations and quadrature rules.
As for the resolvent, using the rational form defined in (1.3) we have that
[TABLE]
where are the coefficients of the partial fraction expansion of and are the roots of the polynomial From [4, Proposition 1] we know that all the values are real and simple. To locate them on the real axis, we recall that are the zeros of the Jacobi polynomial . So, using (2.5) it is immediate to verify that the roots of are all real, simple and negative, which implies that its coefficients are strictly positive. The same conclusions apply to Therefore, since all the coefficients of are strictly positive by construction, according to the Descartes’ rule of signs, we are also sure that are negative and therefore that the approximation (2.8) is well-defined.
3. Error analysis
Before starting we want to emphasize that the rational forms and depend on the value of in (2.2). Anyway, in order to keep the notations as simple as possible, in what follows we avoid to write the explicit dependence on this parameter. Moreover, throughout this and the following section we frequently use the symbol to compare sequences, with the underlying meaning that if where as
First of all we recall the following result given in [1, Proposition 2], and based on the representation of the error arising from the Padé approximation of the fractional power
[TABLE]
in terms of hypergeometric functions whose detailed analysis can be found in [10].
Proposition 3.1**.**
For large values of , the following representation of the error holds
[TABLE]
Now, let
[TABLE]
Proposition 3.2**.**
For large values of , the following representation holds
[TABLE]
Proof.
By (3.1) we have
[TABLE]
Therefore by Proposition 3.1 we find the result. ∎
In order to minimize the error defined in (1.4), by Proposition 3.2 the basic aim is now to study the nonnegative function
[TABLE]
and, in particular, to approximate the solution of
[TABLE]
Proposition 3.3**.**
The function given in (3.5) has the following properties:
- (1)
* for * 2. (2)
* for and for * 3. (3)
* has exactly two maximums and such that*
[TABLE]
Proof.
Items (1) and (2) are obvious. As for item (3) the study of , after some algebra, leads to the equation
[TABLE]
The function on the right
[TABLE]
is the ratio of two parabolas in the variable . Moreover for and for , and it is not defined (in ) at
[TABLE]
Moreover for
[TABLE]
Therefore starting from the point with coordinates , is growing and for Moreover, for . From to the function is still growing, and for and for . As consequence the equation (3.7) has exactly two solutions, and . ∎
Proposition 3.4**.**
For the maximum it holds
[TABLE]
where
[TABLE]
and denotes the Lambert-W function.
Proof.
Since , there exists such that
[TABLE]
Therefore
[TABLE]
Using (3.7), for large values of we find
[TABLE]
Moreover, since the function is concave for and as we have that as , so that we can use the approximation
[TABLE]
By (3.7) and (3.10) we then have to solve
[TABLE]
whose solution is given by (3.9). ∎
Remark 3.5*.*
Since for close to [math] (see, e.g., [8, Eq. (3.1)]), for defined in (3.9) we have
[TABLE]
Proposition 3.6**.**
For the function defined in (3.5) it holds
[TABLE]
Proof.
Observe first that for (cf. (3.9) and (3.14)), and hence by (3.8)
[TABLE]
Moreover,
[TABLE]
The result then follows from (3.5). ∎
At this point we need to remember that our aim is to solve (3.6). By Proposition 3.3 we have that
[TABLE]
Moreover, since for , for large enough we have
[TABLE]
Since we need to minimize the above quantity with respect to , let us consider the functions
[TABLE]
and
[TABLE]
It is easy to see that is monotone increasing for , whereas is monotone decreasing. Therefore, for large enough, the exact solution of (3.6) can be approximated by solving .
Proposition 3.7**.**
Let
[TABLE]
For large enough the solution of is approximated by
[TABLE]
Proof.
By (3.18), the equation leads to
[TABLE]
Since the exact solution of (3.20) goes to infinity with , we set
[TABLE]
so that by (3.20) we obtain
[TABLE]
Since using the approximation
[TABLE]
we then want to solve
[TABLE]
whose solution, approximation to the one of (3.22), is given by
[TABLE]
The result then follows immediately from (3.21). ∎
In Figure 1 we consider the graphical interpretation of the analysis that leads us to the definition of in Proposition 3.7. Assuming to work with a spectrum contained in with we define by solving As already pointed out, the leftmost maximum becomes smaller than for large enough.
Finally, we are on the point to give the following result, that provides an error estimate since (see (1.4) and (3.3))
[TABLE]
Theorem 3.8**.**
Let be defined according to (3.19). Then for large enough
[TABLE]
Proof.
By Proposition 3.2 and (3.5) we have that
[TABLE]
Then using Proposition 3.7, that is, taking as in (3.19), we have
[TABLE]
Since for large
[TABLE]
cf. [16], we have that
[TABLE]
and hence
[TABLE]
By inserting this approximation in (3.24) we obtain the result. ∎
4. The case of bounded operators
Let be an arbitrary sharp discretization of with spectrum contained in where denotes the largest eigenvalue of The theory just developed can easily be adapted to the approximation of In this situation, in order to define a nearly optimal value for , similarly to (3.6) we want to solve
[TABLE]
Looking at Proposition 3.4 we have as . As a consequence, for ( small), the solution of (4.1) remains the one approximated by (3.19) and the estimate given in Theorem 3.8 is still valid. On the contrary, for ( large), the estimate can be improved as follows. Remembering the features of the function given in Proposition 3.3, we have that for the solution of (4.1) is obtained by solving
[TABLE]
where is defined in (3.17) and
[TABLE]
It can be easily verified that the equation has in fact two solutions, one in the interval and the other in . Anyway since is monotone decreasing in we have to look for the one in as stated in (4.2).
Proposition 4.1**.**
For large enough, the solution of (4.2) is approximated by
[TABLE]
where
[TABLE]
Proof.
From (4.2) we have
[TABLE]
Setting and by (4.4) we obtain
[TABLE]
Using (3.23) we solve
[TABLE]
Therefore
[TABLE]
which implies
[TABLE]
Substituting by and by after some algebra we obtain
[TABLE]
Then, solving this equation and taking the positive solution, we obtain the expression of ∎
Observe that by (4.3), for we have
[TABLE]
Moreover, using (3.23) and the above expression we obtain
[TABLE]
that proves the following result.
Theorem 4.2**.**
Let be such that for each we have Then for each , taking in (2.2) where is given in (4.3), the following estimate holds
[TABLE]
with denoting the induced Euclidean norm.
In order to compute a fairly accurate estimate of we need to solve the equation where is defined in Proposition 3.4. Neglecting the factor in (3.9) and taking as in (3.19), we obtain the equation
[TABLE]
Since if and only if we have
[TABLE]
from which it easily follows that
[TABLE]
In practice, assuming to have a good estimate of the interval containing the spectrum of one should use as in (3.19) whenever and then switch to as in (4.3) for In other words, for bounded operators we consider the sequence
[TABLE]
5. Numerical experiments
In this section we present the numerical results obtained by considering two simple cases of self-adjoint positive operators. The first one is totally artificial since we just consider a diagonal matrix with a large spectrum. In the second one we consider the standard central difference discretization of the one dimensional Laplace operator with Dirichlet boundary conditions.
We remark that in all the experiments the weights and nodes of the Gauss-Jacobi quadrature rule are computed by using the Matlab function jacpts implemented in Chebfun by Hale and Townsend [12]. In addition, the errors are always plotted with respect to the Euclidean norm.
Example 5.1**.**
We define and so that . Taking and in Figure 2, for we plot the error obtained using taken as in (3.19) and as defined in [1, Eq. (24)], that is,
[TABLE]
In addition, we draw the values of the estimate given in Theorem 3.8.
In Figure 3 we consider the choice of as in (4.5) since we take , that is, an operator with a moderately large spectrum. We compare this choice with the analogous one proposed in [1, Eq. (37)] and given by
[TABLE]
In the pictures we also plot the error estimate (4.2).
Example 5.2**.**
We consider the linear operator with Dirichlet boundary conditions . It is known that has a point spectrum consisting entirely of eigenvalues
[TABLE]
Using the standard central difference scheme on a uniform grid and setting , in this example we work with the operator
[TABLE]
The eigenvalues are
[TABLE]
so that
Taking and , in Figure 4, for we plot the error obtained using taken as in (4.5) and as in (5.2).
Example 5.3**.**
In this final example we want to consider the use of the poles arising from the rational approximation introduced in Section 2 for the construction of rational Krylov methods (RKM), see e.g. [5, 9, 11]. In this view, let
[TABLE]
be the -dimensional rational Krylov subspace in which is the set of abscissas as in (2.8), defined by (5.3), and is a given vector. Denoting by the orthogonal matrix whose columns span we consider the rational Krylov approximation
[TABLE]
in which . We remark that is just the result of one step of length of the implicit Euler method applied to the discrete fractional diffusion problem
[TABLE]
By taking as in (3.19) to define the set , in Figure 5 we consider the error of the approxomation (5.4), for and corresponding to the discretization of the scalar function , for Since the construction of the Krylov subspace of dimension requires the knowledge of the whole set , for we compute the corresponding set and consider the final Krylov approximation. In order to appreciate the quality of the approximation we compare this approach with the analogous one in which the set of shifts arises from as in (5.1), and also with respect to the shift-and-invert Krylov method (SIKM), in which we take , following the analysis given in [15].
We remark that for practical purposes one should be able to a-priori set the dimension of the Krylov subspace and this of course requires an accurate error estimate. In this view, the estimate given in Theorem 3.8 can be used to this purpose, since (cf. [11, Corollary 3.4])
[TABLE]
Anyway we have to point out that using Theorem 3.8 in (5.5) the resulting bound may be much conservative for two main reasons. The first one is that we are in fact considering a approximation. The second one is that Theorem 3.8 provides an estimate for general unbounded operator whereas the Krylov method is tailored on the initial vector and also depends on the eigenvalue distribution. For these reasons a practical hint can be to define at the beginning using Theorem 3.8 and then monitor the quality of the approximation at each Krylov iteration by means of the generalized residual given by
[TABLE]
where , , are the columns of and .
6. Concluding remarks
In this paper we have presented a reliable rational approximation for the function on a positive unbounded interval, that can be fuitfully used to compute the resolvent of the fractional power in both the infinite and finite dimensional setting. Moreover the theory can also be employed for the construction of rational Krylov methods, with very good results. With respect to the simple use of rational approximations to , extended to compute by means of (1.1), in this work we have shown that allowing a dependence on it is possible to improve the quality of the approximation. We have provided sharp error estimates that can be used for the a-priori choice of the number of poles, that is, the number of inversions.
Remaining in the framework of Padé-type approximations, we want to point out that many other strategies are possible. Among the others we present here two of them already tested experimentally.
- (1)
Writing
[TABLE]
we can consider the Padé-type approximation (1.2), with replaced by . In this way we obtain the approximation
[TABLE]
that in fact represents a form. Unfortunately this approach is observed to be in general less effective than the one defined in (1.3)-(2.8). 2. (2)
Let be the -Padé approximant of centered at , whose error representation with respect to the variable has been derived in [10, Section 3]. As in (1.2) we can consider the formula
[TABLE]
that yields the rational approximation
[TABLE]
in which are such that . Using this approach, the relationship with the Gauss-Jacobi rule explained in Section 2 is lost. Nevertheless, the error analysis is identical to the one given in Section 3, since the representation (3.2) is still valid with replaced by . Also experimentally, this approach is almost identical to the one presented in the paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] L. Aceto, P. Novati, Rational approximation to fractional powers of self-adjoint positive operators. Numer. Math. (2019), DOI: 10.1007/s 00211-01.
- 2[2] L. Aceto, P. Novati, Rational approximation to the fractional Laplacian operator in reaction-diffusion problems. SIAM J. Sci. Comput. 39 (2017), A 214–A 228.
- 3[3] L. Aceto, P. Novati, Efficient implementation of rational approximations to fractional differential operators. J. Sci. Comput. 76 (2018), 651–671.
- 4[4] L. Aceto, D. Bertaccini, F. Durastante, P. Novati, Rational Krylov methods for functions of matrices with applications to fractional partial differential equations. ar Xiv:1812.01405 (2018), submitted.
- 5[5] B. Beckermann, L. Reichel, Error estimates and evaluation of matrix functions via the Faber transform. SIAM J. Numer. Anal. 47 (2009), 3849–3883.
- 6[6] R. Bhatia, Matrix analysis , vol. 169 of Graduate Texts in Mathematics. Springer-Verlag, New York, 1997.
- 7[7] A. Bonito, J.E. Pasciak, Numerical approximation of fractional powers of elliptic operators. Math. Comp. 84 (2015), 2083–2110.
- 8[8] R.M. Corless, G.H. Gonnet, D.E.G. Hare, D.J. Jeffrey, D.E. Knuth, On the Lambert W function , Adv. Comput. Math. 5 (1996), 329–359.
