On Generalizations of the Newton-Raphson-Simpson Method
Mario DeFranco

TL;DR
This paper introduces a family of algorithms called NRS(m) that generalize the Newton-Raphson-Simpson method, enabling the evaluation of sums of formal zeros of functions and connecting to hypergeometric series via combinatorial structures.
Contribution
The paper develops a new class of algorithms NRS(m) that extend the Newton-Raphson-Simpson method and relate to hypergeometric series using novel combinatorial objects.
Findings
NRS(1) recovers the classical Newton-Raphson-Simpson iterations.
NRS(m) can evaluate certain hypergeometric series.
The algorithms utilize trees with negative vertex degrees for their construction.
Abstract
We present generalizations of the Newton-Raphson-Simpson method. Specifically, for a positive integer and the sequence of coefficients of a Taylor series of a function , we define an algorithm we denote by NRS() which is a way to evaluate, in our terminology, a sum of formal zeros of . We prove that NRS(1) yields the familiar iterations of the Newton-Raphson-Simpson method. We also prove that NRS() is way to evaluate certain -hypergeometric series defined by Sturmfels. In order to define these algorithms, we make use of combinatorial objects which we call trees with negative vertex degree.
| 0 | 0 | 5.161290323 |
| 1 | 3.099986240, | 8.261276563 |
| 2 | 1.403785124 | 9.665061687 |
| 3 | 3.188553499 | 9.983917037 |
| 4 | 1.604320113 | 9.999960238 |
| 5 | 3.976182710 | 1.000000000 |
| 6 | 2.439269432 | 1.000000000 |
| 7 | 9.180054555 | 1.000000000 |
| 0 | 0 | 0 | 0 | 1.60000000 |
| 1 | -4.659688684 | 1.285700049 | 8.197311805 | 2.419731181 |
| 2 | -2.893844613 | 7.161536865 | 4.267692251 | 2.846500406 |
| 3 | -1.070644295 | 2.455302622 | 1.384658328 | 2.984966238 |
| 4 | -1.243741433 | 2.730562457 | 1.486821024 | 2.999834449 |
| 5 | -1.448081738 | 3.103391679 | 1.655309941 | 2.999999980 |
| 6 | -1.827406122 | 3.861697182 | 2.034291060 | 3.000000000 |
| 7 | -2.798594637 | 5.864244005 | 3.065649367 | 3.000000000 |
| 8 | -6.410472972 | 1.336321214 | 6.952739166 | 3.000000000 |
| 0 | 0 | 0 | 0 | 0 |
|---|---|---|---|---|
| 1 | -1.469709756 | -5.895234873, | 3.873351306 | 1.814118062 |
| 2 | -8.795058030 | -6.561933355 | 2.427500757 | 8.918016189 |
| 3 | -3.065450285 | -2.976185716 | 8.727716252 | 2.686080251 |
| 4 | -3.145514266 | -3.445847512 | 9.116888144 | 2.525526366 |
| 5 | -2.842482660 | -3.311045690 | 8.323671070 | 2.170142720 |
| 6 | -2.143169948 | -2.585019447 | 6.315545741 | 1.587356346 |
| 7 | -1.163725119 | -1.433465253 | 3.443052895 | 8.458625235 |
| 8 | -3.335397374 | -4.162656330 | 9.893837244 | 2.395783540 |
| 0 | 4.000000000 |
|---|---|
| 1 | 5.814118062 |
| 2 | 6.705919681 |
| 3 | 6.974527706 |
| 4 | 6.999782970 |
| 5 | 6.999999984 |
| 6 | 7.000000000 |
| 7 | 7.000000000 |
| 8 | 7.000000000 |
| 0 | 0 | 0 | 0 | 0 |
|---|---|---|---|---|
| 1 | -2.901096310 | -1.381474433, | 4.104059794 | 3.730963449 |
| 2 | -1.242997894 | -1.092366178 | 5.026520639 | 3.078851277 |
| 3 | -2.246822248 | -2.526877753 | -6.146352087 | 7.352225534 |
| 4 | -6.219368047 | -7.840976033 | -4.228779292 | 2.330509308 |
| 5 | -4.203554555 | -5.637329986 | -3.932363674 | 1.700311697 |
| 6 | -1.780482352 | -2.477579062 | -1.965670765 | 7.550765040 |
| 7 | -3.047097262 | -4.339903585 | -3.710732333 | 1.332463930 |
| 0 | 0 | 10.00000000 |
| 1 | 3.552452499 | 13.552452499 |
| 2 | 1.246139269 | 14.798591768 |
| 3 | 1.963890324 | 14.994980800 |
| 4 | 5.015969708 | 14.999996770 |
| 5 | 3.229868758 | 15.000000000 |
| 6 | 1.327032861 | 15.000000000 |
| 7 | 2.226906125 | 15.000000000 |
| 0 | 0 | 14.421 |
| 1 | 3.0425 | 17.463 |
| 2 | 1.37564 | 17.601 |
| 3 | 2.7830 | 17.601 |
| 4 | 1.1384 | 17.601 |
| 5 | 1.9048 | 17.601 |
| 0 | 0 | 0 | 0 | 93.850 |
|---|---|---|---|---|
| 1 | -4.5493 | 28.506 | 23.957 | 117.81 |
| 2 | -4.3017 | 2.6095 | 2.1794 | 119.99 |
| 3 | -3.7235 | 2.2057 | 1.8333 | 120.00 |
| 4 | -2.6905 | 1.5663 | 1.2972 | 120.00 |
| 5 | -1.3678 | 7.8611 | 6.4933 | 120.00 |
| 6 | -3.4665 | 1.9734 | 1.6268 | 120.00 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Combinatorial Mathematics · Polynomial and algebraic computation · Advanced Mathematical Identities
On Generalizations of the Newton-Raphson-Simpson Method
Mario DeFranco
Abstract
We present generalizations of the Newton-Raphson-Simpson method. Specifically, for a positive integer and the sequence of coefficients of a Taylor series of a function , we define an algorithm we denote by NRS() which is a way to evaluate, in our terminology, a sum of formal zeros of . We prove that NRS(1) yields the familiar iterations of the Newton-Raphson-Simpson method. We also prove that NRS() is way to evaluate certain -hypergeometric series defined by Sturmfels [3]. In order to define these algorithms, we make use of combinatorial objects which we call trees with negative vertex degree.
1 Introduction
The main purpose of this paper is to define a sequence of algorithms NRS( for positive integer which generalize the Newton-Raphson-Simpson method.
We review the Newton-Raphson-Simpson method. Let be a a differentiable function and . Recall that the Newton-Raphson-Simpson method constructs a sequence defined by
[TABLE]
Then the limit , if it exists, is a zero of . Depending on and , the limit may or may not exist. See Kollerstrom [1] for information about the Newton-Raphson-Simpson method.
Given an integer , the algorithm NRS() constructs a sequence of rational functions in the variables . We think of the as being the coefficients of a polynomial
[TABLE]
We prove that
[TABLE]
is equal to the -th iteration of the Newton-Raphson-Simpson method applied to with . For larger , we claim that
[TABLE]
if convergent, is equal to a sum of zeros of . In Section 5 we apply the NRS() for certain polynomials and give tables of values for the partial sums of (1). These tables indicate the series for these examples converges to the sum of the zeros of that are closest to 0. In Section 6 we talk more about the claimed sufficient conditions on that yield these results, namely that the zeros of be positive.
We obtain the rational functions by considering certain infinite sums in the ; choosing a certain order of summation for yields the series (1). We define the in Section 3 in terms of combinatorial objects which we call trees with negative vertex degree. Next we just present a high-level description of the and why they are relevant, including their appearance in [3].
The sums (where denotes 0) are examples of what we call formal zeros for . For a power series
[TABLE]
we view the as indeterminates in some suitable and we view as a function . Then a formal zero for is an element such that
[TABLE]
There are some different ways to approach the .
One way, for example, is to view
[TABLE]
as a function of the independent variables for and to view and as constants. Then we set
[TABLE]
where
[TABLE]
is a sequence of non-negative integers , almost all zero; where
[TABLE]
and where are some coefficients. We can solve for the by using the set of equations
[TABLE]
for all . This method yields a sum for that is equal to . More generally, for , we may also view as a function of the independent variables for and view and as constants. Then we set
[TABLE]
and solve for as before.
Another method to obtain the is to consider the limits of functions in a suitable ring. For example, if we let
[TABLE]
then the limit
[TABLE]
is equal to . Again we view and as constants, and we interpret expressions with denominators as geometric series.
In [3], Sturmfels considers differential equations satisfied by the roots of a polynomial and expresses their solutions using certain -hypergeometric series. He gives formulas for the coefficients and denotes some of these solutions by
[TABLE]
In Section 3 we prove that
[TABLE]
We now describe the outline of this paper. In Section 2 we prove that NRS(1) is equivalent to Newton-Raphson-Simpson method. In Section 3 we define NRS() using the trees with negative vertex degree and some functions built from that we call auxiliary functions. In Section 4 we show how to explicitly compute the auxiliary functions. In Section 5 we apply NRS() to actual polynomials and present numerical tables of the associated quantities. In Section 6 we discuss further work.
2 NRS(1) and the type number of a tree
For each plane tree, we define what we call its type number. We then show how the Newton-Raphson-Simpson method is actually summing trees ordered by this type number (Theorem 2). The NRS() will also sum trees by type number, but the trees will have negative vertex degree.
We recall the definition of rooted trees and plane trees. See Chapter 5 of [2]. We will use the convention that each vertex has degree 0 or degree .
Definition 1**.**
A rooted tree is a finite acyclic graph with one vertex distinguished as the root which we denote by . Let be a vertex of . The subtrees of are the components of that do not contain the root of . A plane tree is a rooted tree such that the subtrees of each vertex are linearly ordered. The vertex degree , or just degree, of a vertex in is the number of subtrees of . We also require that for any vertex of .
Remark 1**.**
A plane tree is equivalent to an ordered sequence of other plane trees, for . We call the -th root subtree of . We denote the plane tree that consists of a single vertex by ; this tree corresponds to and the empty sequence. We let be the number of vertices of that have degree . We call the degree sequence of . We let denote the set of all plane trees (after the Łukasiewicz words reviewed in Section 3).
2.1 The ring
We define a ring and a map from the set of plane trees into
For integer , let be the -vector space spanned by monomials of the form
[TABLE]
where are non-negative integers, almost all 0 and with , satisfying
[TABLE]
Thus an element is a finite sum of the monomials of the form (2). For and , clearly
[TABLE]
We let be the ring consisting of all elements of the form
[TABLE]
where , and where addition and multiplication in are the usual operations on graded infinite sums. Note that in the sum (3) for we allow infinitely many of the to be non-zero; in this case we say that is an infinite sum. Otherwise we say that is a finite sum. We let denote a general power series of the form :
[TABLE]
We view the coefficients as indeterminates. We define expressions using the coefficients and a plane tree.
Definition 2**.**
Let be a rooted planar tree. Define to be
[TABLE]
2.2 The element
We wish to find a way to compute the sum
[TABLE]
Note that this sum is a well-defined element of because, for each , there are only finitely many with
[TABLE]
Thus we write
[TABLE]
with . For a given power series whose coefficients are actual complex numbers, we would like to evaluate as a complex number. By direct substitution, the easily yield complex numbers (because the are polynomials in the ). Then an obvious way to evaluate is to take the limit of partial sums
[TABLE]
However, we find that for general this series does not have desirable convergence properties. We thus specify a different way of ordering the sum (4). We prove that this ordering is equivalent to the Newton-Raphson-Simpson method. If the partial sums of this ordering converge, then corresponds to a zero of .
To specify this different ordering, we define what we call the type number of a plane tree.
2.3 The type number of a plane tree
Definition 3**.**
Let be a plane tree. We define a non-negative integer , which we call the type number of , and we say that is of type if . If consists of a single vertex, then define to be 0. Otherwise, define to be if either of the following two conditions holds:
1. Exactly one of ’s root subtrees is of type and the rest are of type at most .
2. Two or more of ’s root subtrees are of type and the rest are of type less than .
If satisfies the second condition, we say that is final.
Definition 4**.**
Let
[TABLE]
Let
[TABLE]
and
[TABLE]
Note that the elements and are well-defined elements of .
The ordering that we specify is
[TABLE]
Now as an element of is itself an infinite sum; instead of trying to find an ordering to evaluate that sum, we establish an equation (Theorem 1) in that is linear in with coefficients in terms of for . Then we solve those equations for . This allows us to express as a ratio of elements in .
We show how to establish the equations for . We use the auxiliary function :
Definition 5**.**
Define the auxiliary function by
[TABLE]
The two necessary properties of are given below in Property 1.
Definition 6**.**
Let be a subset of . Define the set to be the set of trees such that if is a root subtree of with , then .
Let such that . Define the set to be the set of trees such that has exactly one root subtree that is equal to ; and if is a root subtree of with , then .
Property 1**.**
For , let
[TABLE]
Then
[TABLE]
Let and . Then
[TABLE]
The next theorem establishes equations that determine .
Theorem 1**.**
[TABLE]
[TABLE]
Proof.
We first establish the equation for . Let be of type 1. If is final, then all of its root subtrees of type [math]; that is, all the root subtrees are single vertices. Thus
[TABLE]
If is not final, then it has exactly one root subtree of type 1 and the rest are single vertices.
[TABLE]
Therefore
[TABLE]
We now determine . For any positive integer , we have that
[TABLE]
represents trees whose root subtrees are of type at most ; and
[TABLE]
represents trees with exactly one root subtree of type and the rest of type at most . Therefore
[TABLE]
[TABLE]
[TABLE]
∎
Lemma 1**.**
For , we have
[TABLE]
Proof.
We prove this by induction. When , we have by Lemma 1
[TABLE]
Now
[TABLE]
so
[TABLE]
Now assume the statement of the lemma is true for some . By Lemma 1
[TABLE]
Applying the definition of and simplifying, we have
[TABLE]
[TABLE]
and
[TABLE]
The sum of the above three expressions is
[TABLE]
where we use the fact that
[TABLE]
and the induction hypothesis. Furthermore
[TABLE]
Substituting these results into (5) proves the induction step and the lemma. ∎
Theorem 2**.**
Let and be defined by the Newton-Raphson-Simpson method applied to . Let be as defined above. For , then
[TABLE]
Proof.
If , then
[TABLE]
and the statement of the theorem is true. We re-express the statement as
[TABLE]
and assume it is true for some . Now we apply Lemma 1 to obtain
[TABLE]
This proves the theorem. ∎
3 NRS() and trees with negative vertex degree
To define the algorithms, we define generalized Łukasiewicz words (Definition 8) and trees with negative vertex degree (Construction 1). The plane trees discussed above have possible vertex degrees of either 0 or an integer that is at least 2. We call these classical plane trees. The trees with negative vertex degree have the same structure as classical plane trees but their vertices may have degree that is any integer except .
3.1 Generalized Łukasiewicz words and trees with negative vertex degree
Recall that a plane tree is uniquely determined by the preorder (depth-first order) sequence of its vertex degrees. We will call this sequence the prodder sequence. See Chapter 5 of [2] for the definition of preorder. For classical plane trees, this preorder sequence of non-negative integers is called the Łukasiewicz word for the tree. We recall the defining properties of Łukasiewicz words.
Definition 7**.**
A Łukasiewicz word may be defined as a sequence of integers such that
[TABLE]
for each . (Note that according to our convention each as well.)
Definition 8**.**
Define a generalized Łukasiewicz word to be a sequence of integers such that
[TABLE]
for each . Define to be the smallest (most negative) integer that occurs in . For , define to be the set of all generalized Łukasiewicz words such that .
Construction 1**.**
Given a generalized Łukasiewicz word , we construct a tree with negative vertex degree in the following way. We construct a new word from by taking each in with and replacing it with a sequence of 0’s of length . Thus the generalized Łukasiewicz word
[TABLE]
yields
[TABLE]
By construction is a non-generalized Łukasiewicz word and thus is the preorder sequence for some classical plane tree which we call . Now from we construct the tree : we give the same structure as , but for each set of vertices of degree 0 in that came from an in , we say that the rightmost vertex of these vertices in the preorder has degree ; that the vertices of degree 0 immediately to the left of in the preorder are “canceled” by ; and that these canceled vertices do not contribute to the number of vertices of degree 0 in . We say that a canceled vertex does not have any degree but we do consider it a subtree of its parent vertex. We say that the classical plane tree is the* underlying tree of . We say that has the preorder sequence . See Figure 1.*
For , we identify the set of all plane trees whose vertex degrees are at least with .
Definition 9**.**
Let be a tree with negative vertex degree with preorder sequence . We define the type number of to be equal to , where is the underlying tree of . We say that is final if is final. We define to be the number of consecutive 0’s at the right end of .
Remark 2**.**
We can construct any tree with negative vertex degree by specifying a sequence of trees , where each is a tree of negative vertex degree, and then appropriately assigning negative degrees to those trees that consist of a single vertex. That is, suppose is a single vertex and we assign it to have degree . Then there must be a subsequence of the form
[TABLE]
*where consists of a single vertex for , and . This motivates the following definition. *
Definition 10**.**
For integers and with , define a -block to be a sequence
[TABLE]
of trees in where there are trees after , and . Define a -block to be a sequence consisting of a single tree
[TABLE]
where is any tree in . We refer to both -blocks and blocks as blocks.
Remark 3**.**
We identify with the set of sequences
[TABLE]
where and is either a -block or a -block. The tree corresponds to the empty sequence (when ). We compare this identification to that of Remark 1.**
3.2 The number of generalized Łukasiewicz words with a given degree sequence
Let
[TABLE]
be a sequence of non-negative integers such that ; only finitely many of the are non-zero; and
[TABLE]
Then the number of Łukasiewicz words
[TABLE]
such that the integer appears times in is equal to
[TABLE]
Theorem 5.3.10 of [2] proves this statement. We present a corresponding result about generalized Łukasiewicz words. The proof in [2] directly carries over and we present it here in that generality.
Theorem 3**.**
Let
[TABLE]
be a sequence of non-negative integers such that ; only finitely many of the are non-zero; and
[TABLE]
The the number of generalized Łukasiewicz words
[TABLE]
with degree sequence is
[TABLE]
Proof.
Let
[TABLE]
Consider the set of all sequences
[TABLE]
such that of the equal and
[TABLE]
The order of is thus
[TABLE]
Let and let denote the -th conjugate of :
[TABLE]
We claim that these conjugates are distinct. If for , then that means
[TABLE]
whenever . This implies that divides and that each is a multiple of . By assumption
[TABLE]
so divides . But that means , which is impossible since . Therefore the conjugates of are distinct.
We claim that exactly one of these conjugates is a generalized Łukasiewicz word. First we show that at least one conjugate is a generalized Łukasiewicz word. Suppose that the negative integer is an attained lower bound for the partial sums:
[TABLE]
for all and that
[TABLE]
with minimal (we may assume that , or else and we are done). Then we claim that the conjugate
[TABLE]
is a generalized Łukasiewicz word. We have
[TABLE]
for all , or else would not be a lower bound.
Now suppose
[TABLE]
for some . Since
[TABLE]
that implies
[TABLE]
contradicting the minimality of . Therefore is a generalized Łukasiewicz word.
Now suppose
[TABLE]
is a generalized Łukasiewicz word. If some conjugate
[TABLE]
for is also a generalized Łukasiewicz word, then
[TABLE]
and
[TABLE]
Therefore
[TABLE]
But this contradicts the assumption that is a generalized Łukasiewicz word. Therefore the only conjugate of that is a generalized Łukasiewicz word is itself.
Let denote the set of generalized Łukasiewicz words with degree sequence . Now , and we have partitioned into subsets that each have order such that each subset contains exactly one generalized Łukasiewicz word. Thus
[TABLE]
This proves the theorem.
∎
3.3 The ring
We define the ring . For , let be the -vector space spanned by monomials of the form
[TABLE]
where are non-negative integers, almost all zero with , satisfying
[TABLE]
Thus an element is a finite sum of the monomials of the form (7). For and , then
[TABLE]
We let be the ring consisting of all elements of the form
[TABLE]
where ; and where addition and multiplication in are the usual operations on infinite sums. Note that in the sum (8) we allow infinitely many of the to be non-zero.
Definition 11**.**
Let . Define
[TABLE]
We call the -expression of .
3.4 The element
Definition 12**.**
[TABLE]
As for in section 2, the elements are well-defined elements of because for any , there are only finitely many trees with
[TABLE]
We let
[TABLE]
denote the -hypergeometric series of [3].
Theorem 4**.**
The -hypergeometric series may be viewed as an element of . As elements of ,
[TABLE]
Proof.
To agree with the notation of [3], we let . In equation 4.2 of [3], Sturmfels defines to be the infinite sum
[TABLE]
where the sum is over all sequences of non-negative integers such that
[TABLE]
and
[TABLE]
Using equation (10), equation (11) may be rewritten as
[TABLE]
And
[TABLE]
Thus we can interpret each as the number of vertices in a tree with negative vertex degree that have degree ; as the number of vertices that have degree [math]; and as the number of vertices that have vertex degree (that is, are not canceled). By Theorem 3, expression (12) counts the number of all such . The monomial factor in (9) is then . ∎
3.5 Strategy to determine
We perform this sum by ordering the trees according to their type number: letting
[TABLE]
we write
[TABLE]
As in Section 2 for , the quantity is an infinite sum. Instead of specifying an ordering for this sum, we establish equations in that allow us to solve for . Specifically, we introduce the quantities for and establish a system of equations that are linear in the . We define these now.
First, recall the construction of trees in discussed in Remark 2. Given integers and , the number determines whether is a valid choice for the first tree in the -block. Therefore we partition into disjoint subsets based on :
Definition 13**.**
Let denote the tree consisting of a single vertex and let . If , define
[TABLE]
For and , define
[TABLE]
and
[TABLE]
Thus
[TABLE]
Refining this partition by the type number yields the following terms.
Definition 14**.**
For , let
[TABLE]
[TABLE]
Thus for
[TABLE]
We next explain how to establish the system of linear equations to determine the . When , the quantity
[TABLE]
and we used the single auxiliary function to establish an equation for . For general , we will use auxiliary functions in variables . The two properties that these auxiliary have which generalize the those of are listed in Property 2 below.
Definition 15**.**
Let be a subset of . Define the set to be the set of trees such that if is a subtree of the root of with , then .
Let such that . Define the set to be the set of trees such that the root of has exactly one subtree that is equal to ; and if is a subtree of the root of with , then .
Property 2**.**
For , let
[TABLE]
and
[TABLE]
Then
[TABLE]
Let and . Then
[TABLE]
To construct the auxiliary functions that satisfy these properties, we define the map , partial blocks, and partial trees next.
3.6 The forgetful map
Recall Remark 3 in which we identify a tree with negative vertex degree with a sequence of blocks. Let denote the first tree in a block. We define a forgetful map on the set of blocks such that forgets everything about the tree except the integer and whether . Specifically, let be a -block. If , define to be the triple
[TABLE]
and if , define to be the triple
[TABLE]
On the set of -blocks, define
[TABLE]
and, if ,
[TABLE]
We call the images of partial blocks which for concreteness are defined next along with their -expressions.
Definition 16**.**
Let denote the tree consisting of a single vertex. In the following four cases we define a partial block , its length which we denote , and its -expression . The expression will be a monomial in and . We define to be:
1. A triple of integers
[TABLE]
where ; ; and . Define
[TABLE]
and
[TABLE]
2. The triple
[TABLE]
where . Define
[TABLE]
and
[TABLE]
3. For , the -tuple
[TABLE]
Define
[TABLE]
and
[TABLE]
4. The 1-tuple
[TABLE]
Define
[TABLE]
and
[TABLE]
The map extends naturally to : writing as a sequence of blocks
[TABLE]
we set
[TABLE]
We call these images partial trees which we define next. The part about the empty subtrees will be necessary to express a recurrence relation among the in Section 4.
Definition 17**.**
For integer , define a partial tree with empty subtrees to be a sequence:
[TABLE]
where ; each is either a -partial block or a -partial block; and there are ’s representing empty subtrees. Let denote the set of all such partial trees. We say that
[TABLE]
Thus
[TABLE]
For , then by construction for some . Define to be this .
Define
[TABLE]
Define the -expression of the partial tree to be
[TABLE]
For , we view as a function of the variables . Next we show that satisfies properties similar to those listed in Property 2:
Lemma 2**.**
Let and . Suppose and .
[TABLE]
[TABLE]
Proof.
. This follows immediately from the definition of a partial tree. The expression is obtained by substituting each in with
[TABLE]
Expanding out, we obtain a sum of terms; each term is an expression of a tree whose root subtrees are in if they have root degree . Every such corresponds to some term.
. By the product rule, the expression is obtained by choosing each factor of that appears in and replacing it with , and then adding all such expressions. This corresponds to making a root subtree of in the spot where the was removed. Substituting it the as in part now yields the sum of terms that correspond to elements in .
∎
3.7 The system of linear equations for
Now we can define the auxiliary functions . The auxiliary functions will satisfy the properties in Property 2 because the satisfy similar properties. We can then establish the system of equations for in Theorem 5.
Definition 18**.**
[TABLE]
[TABLE]
Corolllary 1**.**
The auxiliary functions satisfy the two properties in Property 2.
Proof.
This is immediate from the definition of , Lemma 2 and the disjoint union
[TABLE]
∎
Recall the definitions
[TABLE]
and
[TABLE]
[TABLE]
Theorem 5**.**
Let be a function. For , define to be:
[TABLE]
The following is a system of linear equations in the unknowns :
[TABLE]
Assuming that has been evaluated for , the following is a system of linear equations in the unknowns :
[TABLE]
Proof.
Let . Using the properties of the auxiliary functions in Property 2, we obtain
[TABLE]
and
[TABLE]
Adding these two equations yields
[TABLE]
Considering all , , gives a system of equations in the unknowns .
Now let . Again using the properties of the auxiliary functions in Property 2, we obtain
[TABLE]
and
[TABLE]
Adding these two equations yields
[TABLE]
Considering all , , gives a system of equations in the unknowns .
∎
Solving this system allows us to express as a ratio of elements in , and via
[TABLE]
we can also can be express as a ratio of elements in . Then is the sum
[TABLE]
We will explicitly construct the auxiliary functions and find solutions to these systems in Section 5.
4 Explicit construction of the auxiliary functions
We next show how to compute . We use a recurrence relation (equation (16)) that can be implemented by a computer. We use the function defined next.
Definition 19**.**
[TABLE]
To compute , we use the following functions.
The function for is the sum of -expressions of all -partial blocks:
[TABLE]
Let denote
[TABLE]
For , the function is the sum of -expressions of all partial trees with empty subtrees such that the sequence of partial blocks for contains partial blocks of length :
[TABLE]
Then is the sum of the functions over all :
[TABLE]
We note that if is a polynomial, then the above sum over is a finite sum, and then is a polynomial in whose coefficients are polynomials in the .
For an and , let be a partial tree with empty subtrees:
[TABLE]
Recall that is the sum of -expressions of all partial trees with empty subtrees and . We consider the three cases of whether is equal to 0; greater than 0 and less than ; and equal to .
1. i = 0.
We have the following three subcases.
subcase(1) and
The sum of the -expressions for such partial trees is
[TABLE]
because we take any partial tree with empty subtrees and replace the leftmost empty to be the partial block . This partial block has -expression .
subcase(2) and
Such partial trees are obtained from taking a partial tree with empty subtrees, and replacing the first empty subtrees with a partial block of length . The sum of -expressions of all such partial trees is
[TABLE]
subcase(3)
The -expression of this partial tree is
[TABLE]
Adding the -expressions for these three cases yields
[TABLE]
2. 1 i m 1
We have the following two subcases.
subcase(1) and
Such partial trees are obtained by taking a partial tree with empty subtrees and replacing the leftmost empty subtree with the partial block . The sum of the -expressions of all such is
[TABLE]
subcase(2) and
Such partial trees are obtained by taking a partial tree with empty subtrees and and replacing the leftmost empty subtree with the partial block . The sum of the -expressions of all such created this way is
[TABLE]
Adding the -expressions for these two subcases yields
[TABLE]
3. i m 1
Then we have the following two sub cases.
subcase(1)
There is an integer where such that there is a subsequence of
[TABLE]
where
[TABLE]
for some , and
[TABLE]
for . Such a is obtained from a with empty subtrees and replacing the leftmost empty subtree with the partial block , and then replacing each of the next empty subtrees with the partial block . Summing over all , the sum of -expressions of all such trees is
[TABLE]
subcase(2)
We have the subsequence
[TABLE]
where for . Such a is obtained from a with empty subtrees and replacing each of the empty subtrees with the partial block . The sum of the -expressions of such trees created this way is
[TABLE]
Therefore
[TABLE]
We list the auxiliary functions for a quintic polynomial
[TABLE]
:
[TABLE]
:
[TABLE]
:
[TABLE]
:
[TABLE]
:
[TABLE]
5 Numerical examples
5.1 A quintic polynomial with rational zeros
Now we specialize to be the following polynomial with real coefficients and map the various expressions to real numbers.
[TABLE]
5.2 A Jensen polynomial for
We apply NRS algorithms to the third degree Jensen polynomial for . We recall the definition of Jensen polynomials for a power series
[TABLE]
The -th degree Jensen polynomial is
[TABLE]
It is a theorem that a power series series has all real zeros if and only all its Jensen polynomials have all real zeros.
Let
[TABLE]
Let denote the power series coefficients of :
[TABLE]
To compute , we use the formula
[TABLE]
where
[TABLE]
and
[TABLE]
is the unsigned Stirling number of the first kind. In another paper we prove that the formula (17) holds by proving
[TABLE]
where
[TABLE]
and . There we prove that the coefficients of the powers of in are positive if and the series is absolutely convergent for any . We use formula (17) summing up to to compute
[TABLE]
The third degree Jensen polynomial is
[TABLE]
6 Further work
Suppose
[TABLE]
is a polynomial of degree with and with all positive zeros such that .
- •
The NRS() algorithm applied to the coefficients is convergent and outputs the sum
[TABLE]
- •
For each , the series converges quadratically when the zeros are distinct.
- •
The convergence of NRS() implies that those outputs yield zeros of .
Now let
[TABLE]
where and are indeterminates. We say a polynomial is -positive if it is a polynomial in the that has positive coefficients. A rational function is -positive if it is a ration of -positive polynomials.
- •
is positive.
- •
The difference
[TABLE]
is positive.
- •
The expressions arising from formal contour integrals are -positive.
- •
The may also have -positive properties.
- •
Combinatorial proof that is a formal zero. We proof for .
- •
Interpret the other hypergeometric series in [3] using trees and the Newton-Raphson-Simpson method.
- •
Use more general rings to express formal zeros. For example, let . Let
[TABLE]
[TABLE]
and a zero of . Set
[TABLE]
with a multinomial in the .
- •
Use other orderings to evaluate . For a suitable degree 2 polynomial, the sum can be summed by an ordering that yields the Taylor series of the square root in the quadratic formula. We would find corresponding orderings for formal zeros that generalize this ordering to higher degree, for example by expressing expressing formal zeros in terms of Taylor series, some of evaluate to radical expressions. See how these hypothetical Taylor series are related to Turán inequalities.
- •
Relationship between Turán inequalities and expressions for . See if Turán inequalities or other set of conditions imply convergence of NRS().
- •
Householder methods expressed in terms of trees and generalized as was NRS(). Maybe altering to include higher-order terms or making the type number of a tree to include multiple parameters.
- •
For arbitrary analytic functions , express the convergence of in terms of the convergence of .
- •
Apply NRS() to coefficients of using (17). Perhaps try -analogues of terms in (17) to try to prove positivity of NRS() quantities. See if -analogues of coefficients of (17) satisfy Turán inequalities. For example, the -analogue of
[TABLE]
may be
[TABLE]
The unsigned Stirling numbers have -analogues, and the may be expressed via elliptic integrals in terms of a rational number sequence we denote by . Each number is defined as a sum over a certain finite subset of classical plane trees by:
[TABLE]
[TABLE]
This allows us to express as a series of rational numbers. A -analogue of could lead to -analogues of .
- •
Galois theory applied to formal zeros.
- •
NRS() applied to the function or its -analogues.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Kollerstrom, Nick. “Thomas Simpson and ‘Newton’s Method of Approximation’: An Enduring Myth”. The British Journal for the History of Science Vol. 25, No. 3 (Sep., 1992), pp. 347-354
- 2[2] Stanley, Richard P. Enumerative Combinatorics, Volume 2 . Cambridge University Press, Cambridge, UK, 1999.
- 3[3] Sturmfels, Bernd. “Solving algebraic equations in terms of 𝒜 𝒜 \mathscr{A} -hypergeometric series”. Discrete Mathematics 210, (2000), pp. 171-181.
