Robust formula for $N$-point Pad\'e approximant calculation based on Wynn identity
T. M. Mishonov, A. M. Varonov

TL;DR
This paper introduces a new criterion based on Wynn's identity for selecting the optimal N-point Padé approximant, enhancing the accuracy and reliability of rational approximations in physics and applied mathematics.
Contribution
The work presents a novel formula and criterion for optimal Padé approximation using Wynn's identity, with practical applications demonstrated in physics and differential equations.
Findings
Wynn's identity provides a criterion for minimal |η| as the optimal Padé approximant
The method improves multipoint Padé approximation accuracy
Application to physics problems like solar corona heating series summation
Abstract
The performed numerical analysis reveals that Wynn's identity for the compass (here C stands for center, the other letters correspond to the four directions of the compass) gives the long sought criterion, the minimal , for the choice of the optimal Pad\'e approximant. The work of this method is illustrated by calculation of multipoint Pad\'e approximation by a new formula for calculation of this best rational approximation. The work of the criterion for the calculation of optimal Pad\'e approximant is illustrated by the frequently seen in the theoretical physics problems of calculation of series summation and multipoint Pad\'e approximation used as a predictor for solution of differential equations motivated by the magneto-hydrodynamic problem of heating of solar corona by Alv\'en waves. In such a way, an efficient and valuable control…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Robust formula for -point Padé approximant calculation
based on Wynn identity
Todor M. Mishonov
Institute of Solid State Physics, Bulgarian Academy of Sciences, 72 Tzarigradsko Chaussee Blvd., BG-1784 Sofia, Bulgaria
Faculty of Physics, St. Clement of Ohrid University at Sofia, 5 James Bourchier Blvd., BG-1164 Sofia, Bulgaria
Albert M. Varonov
Institute of Solid State Physics, Bulgarian Academy of Sciences, 72 Tzarigradsko Chaussee Blvd., BG-1784 Sofia, Bulgaria
Faculty of Physics, St. Clement of Ohrid University at Sofia, 5 James Bourchier Blvd., BG-1164 Sofia, Bulgaria
(21 April 2020)
Abstract
The performed numerical analysis reveals that Wynn’s identity for the compass (here C stands for center, the other letters correspond to the four directions of the compass) gives the long sought criterion, the minimal , for the choice of the optimal Padé approximant. The work of this method is illustrated by calculation of multipoint Padé approximation by a new formula for calculation of this best rational approximation. The work of the criterion for the calculation of optimal Padé approximant is illustrated by the frequently seen in the theoretical physics problems of calculation of series summation and multipoint Padé approximation used as a predictor for solution of differential equations motivated by the magneto-hydrodynamic problem of heating of solar corona by Alvén waves. In such a way, an efficient and valuable control mechanism for -point Padé approximant calculation is proposed. We believe that the suggested method and criterion can be useful for many applied problems in numerous areas not only in physics but in any scientific application where differential equations are solved. The solution of the Cauchy-Jacobi problem is illustrated by a Fortran program. The algorithm is generalized for the case of the first derivatives at nodal points.
I Introduction and motivation
As the analytical structures take central place in the theoretical and mathematical physics, the Padé approximants of functions become one of the most important problems of applied mathematics. For a general introduction in the problem of Padé approximants, see the well-known monographs Refs. Pade ; Brezinski:91 ; Brezinski:96 . Since the problem was first systematically studied by Frobenius Brezinski:H ; Frobenius:81 and the Padé table introduced by Padé Brezinski:H ; Pade:92 in the 19th century, the contemporary literature is enormous. We will mention only two popular problems, which can be of broad interest for physicists:
- series summation and 2) extrapolation of functions related to predictor-corrector methods for solution of ordinary differential equations. Searching for numerical recipes in this field, a beginner can read that the downside of the Padé approximation is that it is uncontrolled and the dangers of extrapolation cannot be overemphasized NumRep . And after that our beginner will most probably start searching for a commercial software, where the appropriate analytical formulae are programmed. By the very famous law: whatever can go wrong, will go wrong, in very interesting for the physics problems, the commercial software stops working and the research would continue only in co-authorship with the source code authors.
In 1947 Hannes Alvfén Alfven:47 concluded that the solar corona heating is due to the absorption of magneto-hydrodynamic (now called Alfvén) waves. In order to describe the almost 100-time increase of the plasma temperature in a very narrow region of the solar atmosphere. The programming of the magneto-hydrodynamic equations is relatively easy, but after 30-time temperature increase all available for us commercial computation software stops working. Facing such a problem, we had to develop an own method for a frequently encountered numerical problem (four first order linear differential equations coupled to four first order non-linear differential equations) MHD ; MHD:19 . Our research goal was to develop this method based on calculation of Padé approximants with a mechanism to select the most accurate approximant. This mechanism is precisely the main development of the current research and allows the usage of the described method for solution of differential equations. That is why we suppose that many colleagues are in our situation and we share know-how in calculation of Padé approximants.
Before starting let us recall the definition of multipoint Padé approximationPade . A rational function which fits given function values at various points , , i.e. is called multipoint Padé approximant. The associate problem of interpolation by rational functions is called Cauchy-Jacobi problem. Multipoint Padé approximants are also called: rational interpolants, N-point Padé approximants, Newton-Padé approximants, etc, depending on the context. Addressing to experts in the present paper we present a general robust solution of the Cauchy-Jacobi problem which can attract interest and become a convenient tool in the applied numerical analysis.
II Announcement of the results
For a practical implementation of an analytical result, we have to use a computer with finite accuracy of digital representation of real numbers. The Padé approximants give fast convergence and actually the best rational approximation but they are sensitive with respect to the noise of the discrete representation of real numbers. Roughly speaking every value representing a real number suffers from error of discretization as it is multiplied by a random factor , where is the machine epsilon (the biggest positive value for which ), and “” is a programming operator generating a random number homogeneously distributed between 0 and 1.
As formulas for Padé approximants picturally speaking are very sensitive to the random noise of truncation, it is necessary to study the influence of the machine error on the final result. In such a way beyond the analytical problem we have also a statistical one which already belongs to another branch of mathematics. Probably for some special cases it is possible to calculate the probability distribution functions (PDF) of the errors of the final result of calculation of Padé approximants, but it deserves to start with descriptive statistics of some well-known examples, which in our opinion illustrate how to calculate and choose the optimal Padé approximant in physical applications. Here the determination of the optimal Padé approximant is crucial, as our beginner has already learned from the numerical recipes book. To experts we propose a robust method which at least empirically works well.
Let are different Padé approximants of the complex function
[TABLE]
where is the maximal power of the numerator and is the maximal power in the polynomial in the denominator. For these approximants in 1966 Wynn discovered the remarkable relation (Wynn:66, , Eqs. (15) and (16))
[TABLE]
later on baptized by Gragg (Gragg:72, , Theorem 5.5) as missing identity of Frobenius. The criterion for choosing of optimal Padé approximant is based on this Wynn relation which can be rewritten also as
[TABLE]
In the analysis of algorithms is is convenient to represent different approximants in a Padé table. If is in the Center, the notations of the other elements follow the direction of the compass: North, East, Wests, and South, also found in the original paper of Wynn Wynn:66 (i.e. we use Central NEWS algorithm). Probably this representation was an idea of Gragg, who has also described it Gragg:72 , because of the footnote in Wynn’s paper on page 266 (if that’s the case, Gragg was a referee of Wynn’s paper). For convergent Padé approximants
[TABLE]
in the Padé table , , and
[TABLE]
In numerical implementations this limit leads to minimal- criterion at CNEWS algorithm.
Let us repeat in other words. Often the difference between sequential Padé approximants gives a reasonable evaluation of the error, see Ref. Pade . In case of convergence, we have vanishing differences between the values of different cells of the Padé table , for and . In this case also and the minimal value of gives a reasonable criterion for the minimal error and the choice of the optimal Padé approximant. This theoretical hint we prove on many examples of calculation of Padé approximation and arrive at the conclusion that the long sought criterion for the choice of the optimal Padé approximant is simply a search for the minimal value , i.e.
[TABLE]
and . It is technologically and aesthetically attracting that the Wynn identity gives simultaneously
- explicit method for the calculation of Padé approximants in -table and
- method for the evaluation of the error; is actually a criterion for convergence of Padé approximation if we consider real numbers. Our criterion is an alternative of the singular value decomposition (SVD) method described in great detail in Refs. Gonnet:13 ; Beckermann:13 , see also Refs. Karlsson:76 ; Gonchar:78 ; Gonchar:89 ; Levis:18 ; Yattselev:18 . Our minimal criterion and the tolerance level of SVD are perhaps different implementations of one and the same idea. For both methods the rounding error is something external for the theory of Padé approximation. There are no theoretical justifications which method is better and the numerical experiment and the descriptive statistics are just the first steps in the practical realizations on the robust Padé approximants.
The purpose of the present work is to demonstrate how this criterion works and can be used in applications. In the next section we illustrate in detail this criterion on the problem of of calculation of limes of a numerical series.
III The new algorithm for determination of the optimal Padé approximant
Let us have a numerical sequence and we need to calculate the limit . The well-known method is to initialize , for , and also for . In other words we treat every term of the initial series as polynomial approximation of some function
[TABLE]
The method can work even if this sequence is convergent, and those are the most interesting cases for applications. Then we calculate in the “south direction” the corresponding Padé approximants
[TABLE]
or
[TABLE]
and simultaneously calculate the empirical error
[TABLE]
The minimal absolute value in the -table is our criterion for the determination of the optimal Padé approximant. According to the best we know, this minimal- criterion has never been implemented in the numerical recipes so far (authors will very much appreciate any information about this).
For the programming task, we have to calculate all the values, for which division is possible. In the next section we thoroughly describe 2 technical examples.
IV Two easy pieces
IV.1 Calculation of divergent series
Perhaps the most famous example summation of divergent series by Padé approximants is the calculation of the divergent Taylor series beyond the radius of convergence
[TABLE]
The application of Eq. (10) and Eq. (11) gives the optimal Padé approximant for every positive shown in Fig. 1.
Even for for the calculation of the series a pixel accuracy is present. In many articles results of numerical calculations are illustrated by the figures presented by computer graphics with million pixels. For such a figures pixel accuracy means maximal error in which the approximation is undistinguished from the exact result when graphically presented on screen. For many applied studies this is an acceptable beginning. For larger values of the dots representing the calculation evaporate from the line representing the exact value. The most important detail is how and when the method stops working and when the resources of the numerical accuracy are exhausted. The real and empirical of the calculation are shown in Fig. 2.
The empirical error shows saturation and is a reliable indicator for the calculation accuracy. Statistically analysis of the correlation between both errors from Fig. 2 shows very strong dependence between them in Fig. 3.
Now we can safely tell how accurate our Padé approximation is and even how far out in it can be extended. This dependence reveals that the long sought criterion for empirical evaluation of the accuracy of the Padé approximants calculated by -algorithm has already been found.
IV.2 Extrapolation of a sine arch from a preceding one
The second technical example we present is the problem of extrapolation of function which we illustrate in the case of the function. We take equidistant interpolation points from one arch of the function and extrapolate the next arch and even beyond. We use the well-known Aitken interpolation method Aitken (1932); Bronshtein & Semedyayev (1955); Abramowitz & Stegun (1972); Korn2 to order the interpolation points within the arch and the point to be extrapolated. In more details: we calculate for all points The digital noise is minimized if the points are ordered with the proximity with the point of interpolation, i.e.
[TABLE]
Then using sequentially interpolation points in the Aitken scheme we calculate the corresponding polynomial approximation for Next we apply the CNEWS method and use the minimal- for the best Padé approximant. In this manner we have a numerical sequence that we give to the -algorithm to calculate its limit. The described calculation for the function is shown in Fig. 4.
The preceding arch is in the interval and contains 21 interpolation points. Using these points, we extrapolate one arch with 2000 points in the interval and continue the extrapolation in an attempt to obtain a second arch with the same number of points in the next interval . Repeating, every extrapolated point is calculated independently using the Aitken method for interpolation followed by calculation of Padé approximants by the Wynn identity and the optimal Padé approximant is chosen by the advocated in the present paper criterion. The deviation of the points from the real function represented with the line in Fig. 4 shows the limit of applicability of the Aitken-Wynn extrapolation algorithm. Detailed error estimates of the extrapolation are shown in Fig. 5.
The criterion shown in Fig. 5 gives the reliable order estimation of the error. The similar behaviour of both errors shows that our criterion is a reliable method for error estimation. Furthermore, a statistical analysis of the correlation between the real and the empirical error shows very strong dependence in Fig. 6.
The literature on the problem of extrapolation is enormous, however we have not found a discussion of the problem of the mathematical statistics. If we know the values of the function with some accuracy, how to choose the best Padé approximant among the many, some of which have serious noise of rounding and others are even uncontrolled NumRep . There is a huge literature about the formulae, but a convenient criterion for the choice of optimal Padé approximant is not discussed. We give our illustrative example not as an exercise of programming, but as an illustration that we have a good working criterion for many practical cases. As a rule, articles on Padé approximation are illustrated by examples describing how some method is good working, but the instructive analysis reveals where the method stops working and what is most important whether some criterion warns that calculation resource has been exhausted. From a practical point of view, exhaustion of resources for extrapolation is demonstrated by evaporation of the extrapolated points from the analytical curve, and one of the achievements of the present work is that order of the magnitude of this deviation is reliably indicated by our criterion based on the Wynn identity.
V Discussion and conclusions
In spite that the literature for Padé approximants is enormous we were unable to find alternative formulas, methods, algorithms and programs to compare the work of our method. One can put in the agenda for the development of the numerical analysis a simple test: which method for extrapolation of functions from tabulated values in points gives the best extrapolation of a function far from the interpolation interval. This juxtaposition will give the final verdict which method is appropriate to be implemented in commercial software like Mathematica and Maple.
The new result of the implementation of calculation of Padé approximants and their application is the modulus minimization of Wynn’s as reliable empirical criterion of the error.
The preformed analysis of several simple examples has revealed that for practical implementation of Padé approximants we can use the empirical error extracted from the Wynn’s identity. In the agenda the statistical problem of calculation of PDF of the Padé approximants has already been set. The comparison of descriptive statistics data for the PDF of errors of calculation of Padé approximants by different criteria will give what the answer what general recommendation as a numerical recipe have to be given to users not willing to understand how.
In short, the practical implementation of Padé approximants can reach one order of magnitude more applications in theoretical physics and applied mathematics. More than half a century after its discovery, the -algorithm has not been included for calculation of divergent series with convergent Padé approximants and for extrapolation of functions in commercial software. Now the time for this inclusion has come, the herein implemented control mechanism has rendered this mission possible.
Last but not least, the suggested criterion Eq. (11) is applicable in solution of differential equations, numerical analytical continuation, perturbation, series summation and other analogous problems of theoretical physics.
Let us summarize our results:
- we have suggested a method for empirical choice of optimal Padé approximant which woks for all examples we have met in the literature and seems to have universal applicability,
- we derived a new formula for the -point Padé approximant, for a function tabulated in -points. This formula based on Eitken interpolation formula and Wynn identity is maximally robust with respect of truncation errors in the numerical calculations and can be used even for extrapolation. We have developed this method in order to obtain one predictor method for predictor-corrector methods for solving of one irrelevant for the present paper magnetohydrodynamic problem, the programs where our formulae are implemented are given as appendices of the arXiv version of the present paper arxiv and early and more detailed version has already been published in conference proceedings AIP_Pade . The figures in the paper are calculated by those programs. Reproducing the figures is the criterion that our formula can be used directly in numerical calculations. Up to now a non-specialist knows that extrapolation and summation of divergent series is a forbidden procedure used only by elite mathematicians. We have made a popularization addressed to user of applied mathematics working for the industry.
- The described method can be used as a predictor method for solving ordinary differential equations . The described N-point Padé approximation can be used as a predictor using former calculated points and their first derivatives as interpolation points to calculate the predicted value of the function at the new point of the argument. Then we calculate the derivative at the new point . We consider this possibility as a new method for solution of ordinary differential equations. The method can be additionally improved if in the (-) plane we perform a rotation, and the new abscissa of the interpolation to be taken along the local tangent of the curve. We started with the problem of calculation of the temperature profile of the solar corona and velocity of solar windMHD:19 but this method can be applied for many other problems for which standard software is not (good) working.
- If we know the first derivatives of the function using of Taylor expansion we can calculate
[TABLE]
Then we can use the Taylor approximants as interpolation points in the already described method. At the basic points the interpolated rational function will have exact coincidence not only for the function but also for its first derivatives. The main idea of the algorithm is the same: using a node point we calculate the interpolated value at the argument of interpolation, and then use this calculated value at this point as a new value in the node point . If we need a formal proof of convergence it is necessary in the right side of Eq. (14) to put insert the evaluation of error , where is the maximal distance between interpolating an node argument and is the supremum of the modulus of the derivative. This estimation of the error reveal why the analytical approximation is not applicable to function and the trajectory of a Brownial particle.
Finally we have to remark. There is no general criterion for the convergence of the Padé approximation, and even to choose the optimal approximant when the method is convergent, that is why there are no theorems which could be cited. As all known existing methods are given without any empirical prescription for the choice of the optimal approximant the considered minimal- criterion of the CNEWS method can not be compared with other methods. The area of the applicability of the suggested method is however well defined. For many applied problems it is known that solution exists, while for the analytical problems the solution is analytical. In many problems in the physics, for example, there are alternative method to check whether the searched solution is numerically obtained in spite that there will be no alternative methods. Contra-examples of mathematical series for which the Padé method does not work can be easily constructed, but we are unaware of a real problem for which the solution is experimentally measured but the series can not be summarized by the exciting combinations of Euler-McLaurin summation and Padé approximation. That is why we believe that suggested method, criterion and software can be useful for many applied problems in numerous areas not only in physics but in any scientific application where differential equations are solved.
VI Acknowledgments
The authors are thankful to Evgeni Penev for the collaboration in early stages in the present research, and for writing the -algorithm in Fortran-90, Michail Mishonov gave a version in Java and Nedeltcho Zahariev and Zlatan Dimitrov in C-language. The authors are also grateful to Claude Brezinski for pointing out and recommendation of Refs. Gonnet:13 ; Beckermann:13 , to Ognyan Kounchev for Refs. Karlsson:76 ; Gonchar:78 ; Gonchar:89 ; Levis:18 ; Yattselev:18 , to Emil Horozov, Radostina Kamburova, Dantchi Koulova, Peter Kenderov and Hassan Chamati for considerations, comments and stimulating discussions. This work is partially supported by grant KP-06-N38/6 from 5.12.2019 of the Bulgarian National Science Fund and National program “Young scientists and postdoctoral researchers” approved by DCM 577, 17.08.2018.
Appendix A -algorithm source code in Fortran-90
This section includes the -algorthm written in Fortran-90 subroutine and used for the calculations presented in the figures. This Fortran program is our understaning of Cauhi-Jakobi problem an if the time of floating point operation and also the machine epsilon we obtain the analytical result. The program can be converted in a constructive proof.
Appendix B Aitken algorithm
In order to explain the novelty of the present achievement in the beginning we will recall the well known Aitken algorithm. Later on using this basis we will explain the general polynomial exponential. We follow the well-known reference book by Bronstein and SemendyayevBronshtein & Semedyayev (1955). If we need to calculate value of the function at some fixed argument , one can use following schematics (“crest by crest”) which is especially convenient for computer programming
[TABLE]
Every symbol denotes the value in the point of the interpolating polynomial function, build on nodes , i.e. , , …, Those values calculates column by column, in the following manner. Numbers of the column are obtained by
[TABLE]
Every following column is calculating by the former one at the same schematics, for example
[TABLE]
The rounding error is minimized if we reorder the points as
[TABLE]
In such a manner we obtain the series
[TABLE]
In order to obtain an approximation of limit of this sequence
[TABLE]
we use the Wynn algorithm CNEWS algorithm with minimal- criterion, which works even if the sequence is divergent. It is only necessary the problem which we solve to be analytical one born from a real physical problem.
Let us repeat the Aitken formula as an algorithm. Having fixed node point we can calculate the the linear approximation using a second node point
[TABLE]
where node arguments and are parameters of the linear function. In such a way omittin in the notations the argument we can calculate the sequence
[TABLE]
In order to rewrite this conveniently for programming algorithm, we can use the same notations for the changing new values
[TABLE]
Then for the next column of the Aitken table according Eq. (24) using so re-denoted values we calculate
[TABLE]
The formula for a general column is the same
[TABLE]
i.e. linearly interpolated value of the function at point is ascribed as the new function value at the node point . In other words polynomial interpolation is reduces to sequential linear interpolations This procedure repeats until . In such a way we program the series identically by the so defined polynomial interpolations
[TABLE]
using sequentially
[TABLE]
set of nodes. According to the general theory of Padé approximants the Wynn CNEWS algorithm converts polynomial approximation into Padé ones, and for we obtain -point Padé approximant.
If the first points coincide the corresponding polynomial approximant is just the Taylor series
[TABLE]
where are the first derivatives of the approximated function, and is the parameter of this series expansion. Re-denoting
[TABLE]
we can apply the regular Aitken procedure using the new Taylor expanded functional values. In such a way finally we have solved the generalized Cauchy-Jacobi problem: to obtain a rational approximation of a function passing through the N-points with fixed values of its first derivatives at every at those points We gave only an idea for proof and a Fortran program illustrate our understanding.
We will use this achievement for many applied problems. For example, for solution of ordinary differential equation
[TABLE]
Using former N-points with first derivative Taylor approximation we can calculate the functional value using the explained method of Padé extrapolation and then calculate the first derivative at this point explicitly . Renumbering later , … and we can continue the procedure. The number of former points is determined by the indices of the optimal Padé approximants according to the minimal- criterion. For the step for the next point one can apply many different criteria. To our knowledge such combination of methods for solution of ordinary differential equation without fixed order is a new one. We consider that content of this Appendix is implicitly given in the body of the article we give this clarification only for the beginner users.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) G. A. Baker Jr. and P. Graves-Morris, Padé Approximants (2nd ed., Cambridge Univ. Press, Cambridge 1996), Sec. 7.1 Multipoint Padé approximation.
- 2(2) C. Brezinski, and M. Redivo Zaglia, Extrapolation Methods. Theory and Practice , (North-Holland, 1991).
- 3(3) C. Brezinski, “Extrapolation algorithms and Padé approximations: a historical survey”, Applied Numerical Mathematics 20 , 299–318 (1996).
- 4(4) C. Brezinski, History of Continued Fractions and Padé Approximants. , (Springer-Verlag, Berlin, 1991); Sec. 5.2.5 Padé approximants.
- 5(5) G. Frobenius, “Über Relationen zwischen den Näherungsbrüchen von Potenzreihen”, J. Reine Angew. Mathem. 90 , 1–17 (1881), (in German).
- 6(6) H. Padé, “Sur la représentation approchée d’une fonction par des fractions rationelles”, Ann. Sci. Ec. Norm. Super. 9 , 1–93 (1892), (in French).
- 7(7) W. H. Press, S. A. Teukolsky, W. T. Vetterling and B. P. Flannery, Numerical Recipes in C: The Art of Scientific Computing (2nd ed., Cambridge University Press, Cambridge, 1992).
- 8(8) H. Alfvén, “Granulation, magneto-hydrodynamic waves, and the heating of the solar corona”, MNRAS 107 , 211 (1947).
