Fractional Newton-Raphson Method and Some Variants for the Solution of Non-linear Systems
A. Torres-Hernandez, F. Brambila-Paz

TL;DR
This paper introduces novel fractional derivative-based numerical methods for solving non-linear systems in complex space, improving convergence properties and addressing discontinuities present in traditional fractional Newton-Raphson methods.
Contribution
It proposes new fractional Newton-Raphson variants with variable derivative orders, enhancing convergence and stability for complex non-linear systems.
Findings
Methods achieve at least quadratic convergence.
New approaches avoid discontinuities in fractional derivatives.
Applicable to systems with real initial conditions.
Abstract
The following document presents some novel numerical methods valid for one and several variables, which using the fractional derivative, allow to find solutions for some non-linear systems in the complex space using real initial conditions. The origin of these methods is the fractional Newton-Raphson method but unlike the latter, the orders of fractional derivatives proposed here are functions. In the first method, a function is used to guarantee an order of convergence (at least) quadratic, and in the others, a function is used to avoid the discontinuity that is generated when the fractional derivative of the constants is used, and with this, it is possible that the methods have at most an order of convergence (at least) linear.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Fractional Newton-Raphson Method and Some Variants for the Solution of Non-linear Systems
A. Torres-Hernandez
Department of Physics - UNAM
F. Brambila-Paz
Department of Mathematics - UNAM
1. Introduction
When starting to study the fractional calculus, the first difficulty is that, when wanting to solve a problem related to physical units, such as determining the velocity of a particle, the fractional derivative seems to make no sense, this is due to the presence of physical units such as meter and second raised to non-integer exponents, opposite to what happens with operators of integer order. The second difficulty, which is a recurring topic of debate in the study of fractional calculus, is to know what is the order “optimal” that should be used when the goal is to solve a problem related to fractional operators. To face these difficulties, in the first case, it is common to dimensionless any equation in which non-integer operators are involved, while for the second case different orders are used in fractional operators to solve some problem, and then it is chosen the order that provides the “best solution” based about an established criteria.
Based on the two previous difficulties, arises the idea of looking for applications with dimensionless nature such that the need to use multiple orders can be exploited in some way. The aforementioned led to the study of Newton-Raphson method and a particular problem related to the search for roots in the complex space for polynomials: if it is needed to find a complex root of a polynomial using Newton-Raphson method, it is necessary to provide a complex initial condition and, if the right conditions are selected, this leads to a complex solution, but there is also the possibility that this leads to a real solution. If the root obtained is real, it is necessary to change the initial condition and expect that this leads to a complex solution, otherwise, it is necessary to change the value of the initial condition again.
The process described above, it is very similar to what happens when using different values in fractional operators until we find a solution that meets some desired condition. Seeing Newton-Raphson method from the perspective of fractional calculus, one can consider that an order remains fixed , in this case , and the initial conditions are varied until obtaining one solution that satisfies an established criteria. Then reversing the behavior of and , i.e., leave the initial condition fixed and varying the order , the fractional Newton-Raphson method is obtained, which is nothing other thing than Newton-Raphson method using any definition of fractional derivative that fits the function with which one is working. This change, although in essence simple, allows to find roots in the complex space using real initial conditions, because fractional operators generally do not carry polynomials to polynomials.
1.1. Fixed Point Method
A classic problem of common interest in both Physics and Mathematics is to find the zeros of a function , i.e.,
[TABLE]
this problem often arises as a consequence of wanting to solve other problems, for instance, if we want to determine the eigenvalues of a matrix or want to build a box with a given volume but with minimal surface; in the first example, we need to find the zeros (or roots) of the characteristic polynomial of the matrix, while in the second one we need to find the zeros of the gradient of a function that relates the surface of the box with its volume.
Although finding the zeros of a function may seem like a simple problem, in general, it involves solving nonlinear equations, which in many cases does not have an analytical solution, an example of this is present when we are trying to determine the zeros of the following function
[TABLE]
Because in many cases there is no analytical solution, numerical methods are needed to try to determine the solutions to these problems; it should be noted that when using numerical methods, the word “determine” should be interpreted as approach a solution with a degree of precision desired. The numerical methods mentioned above are usually of the iterative type and work as follows: suppose we have a function and we search a value such that , then we can start by giving an initial value and then calculate a value close to the searched value using an iteration function as follows [3]:
[TABLE]
this generates a sequence , which under certain conditions satisfies that
[TABLE]
To understand the convergence of the iteration function it is necessary to have the following definition [4]:
Definition 1.1**.**
Let be an iteration function. The method given in (1) to determine , it is called (locally) convergent, if exists such that for all initial value
[TABLE]
it holds that
[TABLE]
where denotes any vector norm.
When it is assumed that the iteration function is continuous at and that the sequence converges to under the condition given in (2), it is true that
[TABLE]
the previous result is the reason why the method given in (1) is called Fixed Point Method.
1.1.1 Convergence and Order of Convergence
The (local) convergence of the iteration function established in (2), it is useful for demonstrating certain intrinsic properties of the fixed point method. Before continuing it is necessary to have the following definition [3]:
Definition 1.2**.**
Let . The function is a contraction on a set , if exists a non-negative constant such that
[TABLE]
where is called a contraction constant.
The previous definition guarantee that if the iteration function is a contraction on a set , then it is Lipschitz continuous and has at least one fixed point. The existence of a fixed point is guaranteed by the following theorem [5]:
** Theorem 1.3****.**
Contraction Mapping Theorem: Let . Assuming that is a contraction on a closed set , and that . Then has a unique fixed point and for any initial value , the sequence generated by (1) converges to . Moreover
[TABLE]
where is the contraction constant given in (4).
When the fixed point method given by (1) is used, in addition to convergence, exists a special interest in the order of convergence, which is defined as follows [4]:
Definition 1.4**.**
Let be an iteration function with a fixed point . Then the method (1) is called (locally) convergent of (at least) order (), if exists and exists a non-negative constant (with if ) such that for any initial value it is true that
[TABLE]
where is called convergence factor.
The order of convergence is usually related to the speed at which the sequence generated by (1) converges. For the particular cases or it is said that the method has (at least) linear or quadratic convergence, respectively. The following theorem for the one-dimensional case [4], allows characterizing the order of convergence of an iteration function with its derivatives
** Theorem 1.5****.**
Let be an iteration function with a fixed point . Assuming that is -times differentiable in for some , and furthermore
[TABLE]
then is (locally) convergent of (at least) order .
Proof.
Assuming that is an iteration function -times differentiable with a fixed point , then we can expand in Taylor series the function around and order
[TABLE]
then we obtain
[TABLE]
assuming that the sequence generated by (1) converges to and also that , the previous expression implies that
[TABLE]
as consequence, exists a value such that
[TABLE]
∎
A version of the previous theorem for the case -dimensional may be found in the reference [3].
1.2. Newton-Raphson Method
The previous theorem in its n-dimensional form is usually very useful to generate a fixed point method with an order of convergence desired, an order of convergence that is usually appreciated in iterative methods is the (at least) quadratic order. If we have a function and we search a value such that , we may build an iteration function in general form as [6]:
[TABLE]
with a matrix like
[TABLE]
where (). Notice that the matrix is determined according to the order of convergence desired. Before continuing, it is necessary to mention that the following conditions are needed:
Suppose we can generalize the Theorem 1.5 to the case -dimensional, although for this it is necessary to guarantee that the iteration function given by (10) near the value can be expressed in terms of its Taylor series in several variables. 2. 2.
It is necessary to guarantee that the norm of the equivalent of the first derivative in several variables of the iteration function tends to zero near the value .
Then, we will assume that the first condition is satisfied; for the second condition we have that the equivalent to the first derivative in several variables is the Jacobian matrix of the function , which is defined as follows [5]:
[TABLE]
where
[TABLE]
with , the competent -th of the iteration function . Now considering that
[TABLE]
Assume that we have a function with a zero , such that all of its first partial derivatives are defined in . Then taking the iteration function given by (10), the -th component of the iteration function may be written as
[TABLE]
then
[TABLE]
where is the Kronecker delta, which is defined as
[TABLE]
Assuming that (14) is fulfilled
[TABLE]
the previous expression may be written in matrix form as
[TABLE]
where and are the Jacobian matrix of the function and the identity matrix of , respectively. Denoting by the determinant of the matrix , then any matrix that fulfill the following condition
[TABLE]
guarantees that exists such that the iteration function given by (10), converges (locally) with an order of convergence (at least) quadratic in . The following fixed point method can be obtained from the previous result
[TABLE]
which is known as Newton-Raphson method (-dimensional), also known as Newton’s method [7].
Although the condition given in (16) could seem that Newton-Raphson method always has an order of convergence (at least) quadratic, unfortunately, this is not true; the order of convergence is now conditioned by the way in which the function f is constituted, the mentioned above may be appreciated in the following proposition
Proposition 1.6**.**
Let be a function that is at least twice differentiable in . So if is a zero of with algebraic multiplicity (), i.e.,
[TABLE]
the Newton-Raphson method (one-dimensional) has an order of convergence (at least) linear.
Proof.
Suppose we have a function with a zero of algebraic multiplicity , and that is at least twice differentiable in , then
[TABLE]
as consequence the derivative of the iteration function of Newton-Raphson method may be expressed as
[TABLE]
therefore
[TABLE]
and by the Theorem 1.5, then Newton-Raphson method under the hypothesis of the proposition converges (locally) with an order of convergence (at least) linear.
∎
Notice that the order of convergence (at least) quadratic, does not necessarily imply a higher speed of convergence when using Newton-Raphson method; to verify this, just is necessary to take a polynomial of the form
[TABLE]
now assuming that is a zero of the polynomial and that the sequence generated by (17) has an initial condition away from , then
[TABLE]
so a small change is obtained between the values and ; therefore, the sequence generated by Newton method will have a slow convergence to the value of [3].
2. Basic Definitions of the Fractional Derivative
2.1. Introduction to the Definition of Riemann-Liouville
One of the key pieces in the study of fractional calculus is the iterated integral, which is defined as follows [8]:
Definition 2.1**.**
Let , the space of locally integrable functions in the interval . If is a function such that , then the -th iterated integral of the function is given by [8]:
[TABLE]
where
[TABLE]
Considerate that , a generalization of (20) may be obtained for an arbitrary order
[TABLE]
similarly, if , we may define
[TABLE]
the equations (21) and (22) correspond to the definitions of right and left fractional integral of Riemann-Liouville, respectively.
Below, there are some valid properties for right fractional integrals. However, the same properties for left fractional integrals have some changes. Fractional integrals satisfy the semigroup property, which is given in the following proposition [8]:
Proposition 2.2**.**
Let be a function. If , then the fractional integrals of satisfy that
[TABLE]
From the previous result, it is obtained that in particular
[TABLE]
since the operator is the inverse operator to the left of the operator , any integral -th of a function may be written as
[TABLE]
It should be mentioned that the above results are also valid for in complexes, with . Under appropriate conditions in (24) its follows
[TABLE]
then, from (24) and (25) the following operator is built, which corresponds to the ** (right) fractional derivative of Riemann-Liouville**
[TABLE]
where , with the largest integer less than or equal to . We can unify the definitions of integral and fractional derivative of Riemann-Liouville, given by (21) and (26), as follows
[TABLE]
where . Applying the operator (30) with and to the function , we obtain
[TABLE]
2.2. Introduction to the Definition of Caputo
Michele Caputo (1969) published a book and introduced a new definition of fractional derivative, he created this definition with the objective of modeling anomalous diffusion phenomena. The definition of Caputo had already been discovered independently by Gerasimov (1948). This fractional derivative is of the utmost importance since it allows us to give a physical interpretation of the initial value problems, moreover to being used to model fractional time. In some texts, it is known as the fractional derivative of Gerasimov-Caputo [9].
Let be a function, such that is -times differentiable with , then the (right) fractional derivative of Caputo is defined as:
[TABLE]
where . It should be mentioned that the fractional derivative of Caputo behaves as the inverse operator to the left of fractional integral of Riemann-Liouville , that is,
[TABLE]
On the other hand, the relation between the fractional derivatives of Caputo and Riemann-Liouville is given by the following expression [9]
[TABLE]
then if , we obtain
[TABLE]
considering the previous particular case, it is possible to unify the definitions of fractional integral of Riemann-Liouville and fractional derivative of Caputo as follows
[TABLE]
3. Fractional Newton Method
Let , the space of polynomials of degree less than or equal to with real coefficients. The zeros of a function are usually named as roots. The Newton-Raphson method is useful for finding the roots of a function . However, this method is limited because it cannot find roots , if the sequence generated by (17) has an initial condition . To solve this problem and develop a method that has the ability to find roots, both real and complex, of a polynomial if the initial condition is real, we propose a new method called fractional Newton-Raphson method, which consists of Newton-Raphson method with the implementation of the fractional derivative. Before continuing, it is necessary to define the fractional Jacobian matrix of a function as follows
[TABLE]
where
[TABLE]
with . The operator denotes any fractional derivative, applied only to the variable , that satisfies the following condition of continuity respect to the order of the derivative
[TABLE]
then, the matrix (37) satisfies that
[TABLE]
where denotes the Jacobian matrix of the function .
Taking into account that a polynomial of degree it is composed of monomials of the form , with , we can take the equation (31) with (17), to define the following iteration function that results in the Fractional Newton-Raphson Method [1, 2]
[TABLE]
To try to guarantee that the sequence generated by (40) has a order of convergence (at least) quadratic, the condition (16) is combined with (38) to define the following function
[TABLE]
then, in (40) we make obtaining the following iteration function that results in the Fractional Newton Method
[TABLE]
the difference between the methods (40) and (45), is that the just for the second can exists such that if the sequence generated by (45) converges to a root of , exists such that , the sequence has an order of convergence (at least) quadratic in .
The value of in (43) is assigned with the following reasoning: when we use the definitions of fractional derivatives given by (30) and (36) in a function , it is necessary that the function be -times integrable and -times differentiable, where , therefore and, on the other hand, for using Newton method it is just necessary that the function be one-time differentiable, as a consequence of it is obtained that
[TABLE]
Without loss of generality, to understand why the sequence generated by the fractional Newton method when we use a function , has the ability to enter the complex space starting from an initial condition , it is only necessary to observe the fractional derivative of Riemann-Liouville of order of the monomial
[TABLE]
whose result is a function with a rational exponent, contrary to what would happen when using the conventional derivative. When the iteration function given by (45) is used, we must taken an initial condition , as a consequence of the fact that the fractional derivative of order of a constant is usually proportional to the function .
When we take , the sequence generated by the iteration function (45), presents among its behaviors, the following particular cases depending on the initial condition :
If we take an initial condition , the sequence may be divided into three parts, this occurs because it may exists a value for which with , in consequence . 2. 2.
On the other hand, if we take an initial condition, the sequence may be divided into two parts, and .
Unlike classical Newton-Raphson method; which uses tangent lines to generate a sequence , the fractional Newton method uses lines more similar to secants (see Figure 1). A consequence of the fact that the lines are not tangent when using (45), is that different trajectories can be obtained for the same initial condition just by changing the order of the derivative (see Figure 2).
A typical inconvenience that arises in problems related to fractional calculus, it is the fact that it is not always known what is the appropriate order to solve these problems. As a consequence, different values of are generally tested and we choose the value that allows to find the best solution considering an criterion of precision established. Based on the aforementioned, it is necessary to follow the instructions below when using the fractional Newton method to find the zeros of a function :
Without considering the integers and , a partition of the interval is created as follows
[TABLE]
and using the partition, the following sequence is created. 2. 2.
We choose a non-negative tolerance , a limit of iterations for all , an initial condition and a value . 3. 3.
We choose a value to use given by (43), such that . In addition, it is taken a fractional derivative that satisfies the condition of continuity (38), and it is unified with the fractional integral in the same way as in the equations (30) and (36). 4. 4.
The iteration function (45) is used with all the values of the partition , and for each value is generated a sequence , where
[TABLE]
then is generated a sequence , with , such that
[TABLE] 5. 5.
We choose a value and we take the values and , then is defined . If the following condition is fulfilled
[TABLE]
is taken . On the other hand if
[TABLE]
is defined . Without loss of generality, it may be assumed that the second condition is fulfilled, then is taken and are checked the conditions (49) and (50) with the values and . The process described before is repeated for all values , with , and that generates a sequence , with , such that
[TABLE]
By following the steps described before to implement the fractional Newton method, a subset of the solution set of zeros, both real and complex, may be obtained from the function . We will proceed to give an example where is found the solution set of zeros of a function .
Example 3.1**.**
Let the function:
[TABLE]
then the following values are chosen to use the iteration function given by (45)
[TABLE]
and using the fractional derivative given by (30), we obtain the results of the Table 1
Although the fractional Newton method was originally defined for polynomials, the method can be extended to a broader class of functions, as shown in the following examples:
Example 3.2**.**
Let the function:
[TABLE]
then the following values are chosen to use the iteration function given by (45)
[TABLE]
and using the fractional derivative given by (30), we obtain the results of Table 2
In the previous example, a subset of the solution set of zeros of the function (56) was obtained, because this function has an infinite amount of zeros. Using the fractional Newton method does not guarantee that all zeros of a function can be found, leaving an initial condition fixed and varying the orders of the derivative. As in the classical Newton’s method, finding most of the zeros of the function will depend on giving a proper initial condition . If we want to find a larger subset of zeros of the function (56), there are some strategies that are usually useful, for example:
To change the initial condition . 2. 2.
To use a larger amount of values. 3. 3.
To increase the value of .
In general, the last strategy is usually the most useful, but this causes the method (45) to become more costly, because it requires a longer runtime for all values .
Example 3.3**.**
Let the function:
[TABLE]
then the following values are chosen to use the iteration function given by (45)
[TABLE]
and using the fractional derivative given by (30), we obtain the results of Table 3
Example 3.4**.**
Let the function:
[TABLE]
then the following values are chosen to use the iteration function given by (45)
[TABLE]
and using the fractional derivative given by (30), we obtain the results of Table 4
3.1. Fractional Quasi-Newton Method
Although the fractional Newton method is useful for finding multiple zeros of a function , it has the disadvantage that in many cases calculating the fractional derivative of a function is not a simple task. To try to minimize this problem, we use that commonly for many definitions of the fractional derivative, the arbitrary order derivative of a constant is not always zero, that is,
[TABLE]
Then, we may define the function
[TABLE]
it should be noted that the previous function is almost a linear approximation of the function in the initial condition . Then, for any fractional derivative that satisfies the condition (65), and using (37) the Fractional Quasi-Newton Method may be defined as
[TABLE]
with given by the following matrix
[TABLE]
where is the component -th of the value , and is defined as follows
[TABLE]
Since the iteration function (68) does not satisfy the condition (16), then any sequence generated by this iteration function has at most one order of convergence (at least) linear. As a consequence, the speed of convergence is slower compared to what would be obtained when using (45), and then it is necessary to use a larger value of . It should be mentioned that the value in (72), is not taken to try to guarantee an order of convergence as in (43), but to avoid the discontinuity that is generated when using the fractional derivative of constants in the value . An example is given using the fractional quasi-Newton method, where is found a subset of the solution set of zeros of the function .
Example 3.5**.**
Let the function:
[TABLE]
then the following values are chosen to use the iteration function given by (68)
[TABLE]
and using the fractional derivative given by (30), we obtain the results of Table 5
It is worth mentioning that the function (74) has infinite zeros, so it will be used with the following method.
3.2. Fractional Pseudo-Newton Method
A conventional way in which Newton’s method is usually deduced is based on a linear approximation of a function in a value . To perform the aforementioned, we may consider the Taylor polynomial of the function around a value , and disregarding higher order terms we have that
[TABLE]
then, assuming that is a zero of , from the previous expression we have that
[TABLE]
as consequence, a sequence that approximates the value may be generated using the iteration function
[TABLE]
However, the equation (76) is not the only way to generate a linear approximation to the function in the point , in general it may be taken as
[TABLE]
where is any constant value of a slope, that allows the approximation (77) to the function to be valid. The previous equation allows to obtain the following iteration function
[TABLE]
which originates the Parallel Chord Method [7]. The iteration function (78) can be generalized to larger dimensions as follows
[TABLE]
where , is the identity matrix of . The previous equation implies that it is enough to apply the method (78) component to component for the case in several variables.
Using as a basis the idea of (79), and considering any fractional derivative that satisfies the condition (65), we can define the Fractional Pseudo-Newton Method as follows
[TABLE]
with given by the following matrix evaluated at the value
[TABLE]
where is the component -th of the value , with the Kronecker delta, a positive constant , and is given by (72).
Two examples are given using the fractional pseudo-Newton method, where is found a subset of the solution set of zeros of some functions.
Example 3.6**.**
Let the function:
[TABLE]
then the following values are chosen to use the iteration function given by (81)
[TABLE]
and using the fractional derivative given by (30), we obtain the results of Table 6
Example 3.7**.**
Let the function:
[TABLE]
then the following values are chosen to use the iteration function given by (81)
[TABLE]
and using the fractional derivative given by (30), we obtain the results of Table 7
4. Conclusions
The fractional Newton-Raphson method and its variants are useful for finding multiple solutions of nonlinear systems, in the complex space using real initial conditions. However, it should be clarified that they present some advantages and disadvantages between each of them, for example, although the fractional Newton method generally has an order of convergence (at least) quadratic, this method has the disadvantage that it is not an easy task to find the fractional Jacobian matrix for many functions, and also the need to reverse this matrix must be added for each new iteration. But it has an advantage over the other methods, and this is because it can be used with few iterations, which allows occupying a greater number of values belonging to the partition of the interval .
The quasi-Newton fractional method has the advantage that the fractional Jacobian matrix with which it works, compared to the fractional Newton method, is easy to obtain. But a disadvantage is that the method may have at most an order of convergence (at least) linear, so the speed of convergence is lower and it is necessary to use a greater number of iterations to ensure success in the search for solutions. As a consequence, the method is more costly because it requires a longer runtime to use all values . An additional advantage of the method is that if the initial condition is close enough to a solution, its behavior is very similar to fractional Newton method and may converge with a relatively small number of iterations, but it still has the disadvantage that we need to reverse a matrix in each iteration.
Finally, the fractional pseudo-Newton method seems to solve the problem of the need to invert a matrix in each iteration that are present in the previous methods. However, this method, as the fractional quasi-Newton method, may have at most an order of convergence (at least) linear, and it has a lower speed of convergence compared to the latter. As a consequence, it is necessary to use a larger number of values and a greater number of iterations, then we require a longer runtime to use all values to find solutions, so it can be considered as a slow and costly method. But its advantage over the other methods is that it is not necessary to calculate and invert any fractional Jacobian matrix.
The methods may solve some nonlinear systems and are really efficient to find multiple solutions, both real and complex, using real initial conditions. It should be mentioned that these methods are extremely recommended in systems that have infinite solutions or a large number of them.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Brambila-Paz and A. Torres-Hernandez. Fractional newton-raphson method. ar Xiv preprint ar Xiv:1710.07634 , 2017. https://arxiv.org/pdf/1710.07634.pdf .
- 2[2] F. Brambila-Paz, A. Torres-Hernandez, U. Iturrarán-Viveros, and R. Caballero-Cruz. Fractional newton-raphson method accelerated with aitken’s method. ar Xiv preprint ar Xiv:1804.08445 , 2018. https://arxiv.org/pdf/1804.08445.pdf .
- 3[3] Josef Stoer and Roland Bulirsch. Introduction to numerical analysis , volume 12. Springer Science & Business Media, 2013.
- 4[4] Robert Plato. Concise numerical mathematics . Number 57. American Mathematical Soc., 2003.
- 5[5] James M Ortega. Numerical analysis: a second course . SIAM, 1990.
- 6[6] Richard L Burden and J Douglas Faires. Análisis numérico . Thomson Learning,, 2002.
- 7[7] James M Ortega and Werner C Rheinboldt. Iterative solution of nonlinear equations in several variables , volume 30. Siam, 1970.
- 8[8] Rudolf Hilfer. Applications of fractional calculus in physics . World Scientific, 2000.
