On Generalizations of the Newton-Raphson-Simpson Method

Mario DeFranco

arXiv:1903.10697·math.CO·September 18, 2025

On Generalizations of the Newton-Raphson-Simpson Method

Mario DeFranco

PDF

Open Access

TL;DR

This paper introduces a family of algorithms called NRS(m) that generalize the Newton-Raphson-Simpson method, enabling the evaluation of sums of formal zeros of functions and connecting to hypergeometric series via combinatorial structures.

Contribution

The paper develops a new class of algorithms NRS(m) that extend the Newton-Raphson-Simpson method and relate to hypergeometric series using novel combinatorial objects.

Findings

01

NRS(1) recovers the classical Newton-Raphson-Simpson iterations.

02

NRS(m) can evaluate certain hypergeometric series.

03

The algorithms utilize trees with negative vertex degrees for their construction.

Abstract

We present generalizations of the Newton-Raphson-Simpson method. Specifically, for a positive integer $m$ and the sequence of coefficients of a Taylor series of a function $f (z)$ , we define an algorithm we denote by NRS( $m$ ) which is a way to evaluate, in our terminology, a sum of $m$ formal zeros of $f (z)$ . We prove that NRS(1) yields the familiar iterations of the Newton-Raphson-Simpson method. We also prove that NRS( $m$ ) is way to evaluate certain $A$ -hypergeometric series defined by Sturmfels. In order to define these algorithms, we make use of combinatorial objects which we call trees with negative vertex degree.

Tables8

Table 1. Table 1: m = 1 𝑚 1 m=1

$n$	$J_{1} (n)$	$- \frac{a_{0}}{a_{1}} + S_{1} (n)$
0	0	5.161290323 $\times 10^{- 1}$
1	3.099986240, $\times 10^{- 1}$	8.261276563 $\times 10^{- 1}$
2	1.403785124 $\times 10^{- 1}$	9.665061687 $\times 10^{- 1}$
3	3.188553499 $\times 10^{- 2}$	9.983917037 $\times 10^{- 1}$
4	1.604320113 $\times 10^{- 3}$	9.999960238 $\times 10^{- 1}$
5	3.976182710 $\times 10^{- 6}$	1.000000000
6	2.439269432 $\times 10^{- 11}$	1.000000000
7	9.180054555 $\times 10^{- 22}$	1.000000000

Table 2. Table 2: m = 2 𝑚 2 m=2

$n$	$J_{0, 2} (n)$	$J_{1, 2} (n)$	$J_{2} (n)$	$- \frac{a_{1}}{a_{2}} + S_{2} (n)$
0	0	0	0	1.60000000
1	-4.659688684 $\times 10^{- 1}$	1.285700049	8.197311805 $\times 10^{- 1}$	2.419731181
2	-2.893844613 $\times 10^{- 1}$	7.161536865 $\times 10^{- 1}$	4.267692251 $\times 10^{- 1}$	2.846500406
3	-1.070644295 $\times 10^{- 1}$	2.455302622 $\times 10^{- 1}$	1.384658328 $\times 10^{- 1}$	2.984966238
4	-1.243741433 $\times 10^{- 2}$	2.730562457 $\times 10^{- 2}$	1.486821024 $\times 10^{- 2}$	2.999834449
5	-1.448081738 $\times 10^{- 4}$	3.103391679 $\times 10^{- 4}$	1.655309941 $\times 10^{- 4}$	2.999999980
6	-1.827406122 $\times 10^{- 8}$	3.861697182 $\times 10^{- 8}$	2.034291060 $\times 10^{- 8}$	3.000000000
7	-2.798594637 $\times 10^{- 16}$	5.864244005 $\times 10^{- 16}$	3.065649367 $\times 10^{- 16}$	3.000000000
8	-6.410472972 $\times 10^{- 32}$	1.336321214 $\times 10^{- 32}$	6.952739166 $\times 10^{- 32}$	3.000000000

Table 3. Table 3: m = 3 𝑚 3 m=3

$n$	$J_{0, 3} (n)$	$J_{1, 3} (n)$	$J_{2, 3} (n)$	$J_{3} (n)$
0	0	0	0	0
1	-1.469709756	-5.895234873 $\times 10^{- 1}$ ,	3.873351306	1.814118062
2	-8.795058030 $\times 10^{- 1}$	-6.561933355 $\times 10^{- 1}$	2.427500757	8.918016189 $\times 10^{- 1}$
3	-3.065450285 $\times 10^{- 1}$	-2.976185716 $\times 10^{- 1}$	8.727716252 $\times 10^{- 1}$	2.686080251 $\times 10^{- 1}$
4	-3.145514266 $\times 10^{- 2}$	-3.445847512 $\times 10^{- 2}$	9.116888144 $\times 10^{- 2}$	2.525526366 $\times 10^{- 2}$
5	-2.842482660 $\times 10^{- 4}$	-3.311045690 $\times 10^{- 4}$	8.323671070 $\times 10^{- 4}$	2.170142720 $\times 10^{- 4}$
6	-2.143169948 $\times 10^{- 8}$	-2.585019447 $\times 10^{- 8}$	6.315545741 $\times 10^{- 8}$	1.587356346 $\times 10^{- 8}$
7	-1.163725119 $\times 10^{- 16}$	-1.433465253 $\times 10^{- 16}$	3.443052895 $\times 10^{- 16}$	8.458625235 $\times 10^{- 17}$
8	-3.335397374 $\times 10^{- 33}$	-4.162656330 $\times 10^{- 33}$	9.893837244 $\times 10^{- 33}$	2.395783540 $\times 10^{- 33}$

Table 4. Table 4: m = 3 𝑚 3 m=3 continued

$n$	$- \frac{a_{2}}{a_{3}} + S_{3} (n)$
0	4.000000000
1	5.814118062
2	6.705919681
3	6.974527706
4	6.999782970
5	6.999999984
6	7.000000000
7	7.000000000
8	7.000000000

Table 5. Table 5: m = 4 𝑚 4 m=4

$n$	$J_{0, 4} (n)$	$J_{1, 4} (n)$	$J_{2, 4} (n)$	$J_{3, 4} (n)$
0	0	0	0	0
1	-2.901096310	-1.381474433,	4.104059794	3.730963449
2	-1.242997894	-1.092366178	5.026520639 $\times 10^{- 1}$	3.078851277
3	-2.246822248 $\times 10^{- 1}$	-2.526877753 $\times 10^{- 1}$	-6.146352087 $\times 10^{- 2}$	7.352225534 $\times 10^{- 1}$
4	-6.219368047 $\times 10^{- 3}$	-7.840976033 $\times 10^{- 3}$	-4.228779292 $\times 10^{- 3}$	2.330509308 $\times 10^{- 2}$
5	-4.203554555 $\times 10^{- 6}$	-5.637329986 $\times 10^{- 6}$	-3.932363674 $\times 10^{- 6}$	1.700311697 $\times 10^{- 5}$
6	-1.780482352 $\times 10^{- 12}$	-2.477579062 $\times 10^{- 12}$	-1.965670765 $\times 10^{- 12}$	7.550765040 $\times 10^{- 12}$
7	-3.047097262 $\times 10^{- 25}$	-4.339903585 $\times 10^{- 25}$	-3.710732333 $\times 10^{- 25}$	1.332463930 $\times 10^{- 24}$

Table 6. Table 6: m = 4 𝑚 4 m=4 continued

$n$	$J_{4} (n)$	$- \frac{a_{3}}{a_{4}} + S_{4} (n)$
0	0	10.00000000
1	3.552452499	13.552452499
2	1.246139269	14.798591768
3	1.963890324 $\times 10^{- 1}$	14.994980800
4	5.015969708 $\times 10^{- 3}$	14.999996770
5	3.229868758 $\times 10^{- 6}$	15.000000000
6	1.327032861 $\times 10^{- 12}$	15.000000000
7	2.226906125 $\times 10^{- 25}$	15.000000000

Table 7. Table 7: m = 1 𝑚 1 m=1

$n$	$J_{1} (n)$	$- \frac{a_{0}}{a_{1}} + S_{1} (n)$
0	0	14.421 $\times 10^{- 1}$
1	3.0425	17.463
2	1.37564 $\times 10^{- 1}$	17.601
3	2.7830 $\times 10^{- 4}$	17.601
4	1.1384 $\times 10^{- 9}$	17.601
5	1.9048 $\times 10^{- 20}$	17.601

Table 8. Table 8: m = 2 𝑚 2 m=2

$n$	$J_{0, 2} (n)$	$J_{1, 2} (n)$	$J_{2} (n)$	$- \frac{a_{1}}{a_{2}} + S_{2} (n)$
0	0	0	0	93.850
1	-4.5493	28.506	23.957	117.81
2	-4.3017 $\times 10^{- 1}$	2.6095	2.1794	119.99
3	-3.7235 $\times 10^{- 2}$	2.2057 $\times 10^{- 2}$	1.8333 $\times 10^{- 2}$	120.00
4	-2.6905 $\times 10^{- 7}$	1.5663 $\times 10^{- 6}$	1.2972 $\times 10^{- 6}$	120.00
5	-1.3678 $\times 10^{- 15}$	7.8611 $\times 10^{- 15}$	6.4933 $\times 10^{- 15}$	120.00
6	-3.4665 $\times 10^{- 32}$	1.9734 $\times 10^{- 31}$	1.6268 $\times 10^{- 31}$	120.00

Equations455

c_{N + 1} = c_{N} - \frac{f ( c _{N} )}{f ^{'} ( c _{N} )} .

c_{N + 1} = c_{N} - \frac{f ( c _{N} )}{f ^{'} ( c _{N} )} .

f (z) = k = 0 \sum d a_{k} z^{k} .

f (z) = k = 0 \sum d a_{k} z^{k} .

n = 0 \sum N - 1 J_{1} (n)

n = 0 \sum N - 1 J_{1} (n)

n = 0 \sum \infty J_{m} (n),

n = 0 \sum \infty J_{m} (n),

f (z) = k = 0 \sum \infty a_{k} z^{k},

f (z) = k = 0 \sum \infty a_{k} z^{k},

f (Z) = 0 \in R .

f (Z) = 0 \in R .

Z_{m} = Z_{m} (a_{0}, a_{1}, ..., a_{m - 2}, a_{m + 1}, a_{m + 2}, ...)

Z_{m} = Z_{m} (a_{0}, a_{1}, ..., a_{m - 2}, a_{m + 1}, a_{m + 2}, ...)

Z_{m} = - \frac{a _{m - 1}}{a _{m}} + \accentset ⇀ n \sum c (\accentset ⇀ n) a^{\accentset ⇀ n}

Z_{m} = - \frac{a _{m - 1}}{a _{m}} + \accentset ⇀ n \sum c (\accentset ⇀ n) a^{\accentset ⇀ n}

\accentset ⇀ n = (n_{0}, n_{1}, ..., n_{m - 2}, n_{m + 1}, n_{m + 2}, ...)

\accentset ⇀ n = (n_{0}, n_{1}, ..., n_{m - 2}, n_{m + 1}, n_{m + 2}, ...)

a^{\accentset ⇀ n} = i = 0, \neq = m - 1, m \prod \infty a_{i}^{n_{i}};

a^{\accentset ⇀ n} = i = 0, \neq = m - 1, m \prod \infty a_{i}^{n_{i}};

\frac{\partial^{\accentset{\rightharpoonup}{n}}f}{(\partial a)^{\accentset{\rightharpoonup}{n}}}(Z_{m})\big{|}_{a_{i}=0,i\neq m,m-1}=0

\frac{\partial^{\accentset{\rightharpoonup}{n}}f}{(\partial a)^{\accentset{\rightharpoonup}{n}}}(Z_{m})\big{|}_{a_{i}=0,i\neq m,m-1}=0

Z_{m_{1}, m_{2}} = (- \frac{a _{m_{1}}}{a _{m_{2}}})^{\frac{1}{m _{2} - m _{1}}} + \accentset ⇀ n \sum c (\accentset ⇀ n) a^{\accentset ⇀ n}

Z_{m_{1}, m_{2}} = (- \frac{a _{m_{1}}}{a _{m_{2}}})^{\frac{1}{m _{2} - m _{1}}} + \accentset ⇀ n \sum c (\accentset ⇀ n) a^{\accentset ⇀ n}

g_{m} (z) = z - \frac{f ( z )}{a _{m} z ^{m - 1}},

g_{m} (z) = z - \frac{f ( z )}{a _{m} z ^{m - 1}},

n \to \infty lim g_{m}^{n} (- \frac{a _{m - 1}}{a _{m}})

n \to \infty lim g_{m}^{n} (- \frac{a _{m - 1}}{a _{m}})

- [\frac{a _{m - 1}}{a _{m}}] + [\frac{a _{m - 2}}{a _{m - 1}}] .

- [\frac{a _{m - 1}}{a _{m}}] + [\frac{a _{m - 2}}{a _{m - 1}}] .

A_{m} = - [\frac{a _{m - 1}}{a _{m}}] .

A_{m} = - [\frac{a _{m - 1}}{a _{m}}] .

i = 0 \prod \infty (- \frac{a _{i}}{a _{1}})^{n_{i}}

i = 0 \prod \infty (- \frac{a _{i}}{a _{1}})^{n_{i}}

i = 0 \sum \infty n_{i} = k .

i = 0 \sum \infty n_{i} = k .

r_{k_{1}} r_{k_{2}} \in R_{1} (k_{1} + k_{2}) .

r_{k_{1}} r_{k_{2}} \in R_{1} (k_{1} + k_{2}) .

r = k = 0 \sum \infty r (k)

r = k = 0 \sum \infty r (k)

f (z) = k = 0 \sum \infty a_{k} z^{k} .

f (z) = k = 0 \sum \infty a_{k} z^{k} .

R_{1} (T) = k = 0 \prod \infty (- \frac{a _{k}}{a _{1}})^{d_{k} (T)} .

R_{1} (T) = k = 0 \prod \infty (- \frac{a _{k}}{a _{1}})^{d_{k} (T)} .

A_{1} = T \in Luk_{1} \sum R_{1} (T) .

A_{1} = T \in Luk_{1} \sum R_{1} (T) .

k = i = 0 \sum \infty d_{i} (T) .

k = i = 0 \sum \infty d_{i} (T) .

A_{1} = k = 0 \sum \infty A_{1} (k)

A_{1} = k = 0 \sum \infty A_{1} (k)

N \to \infty lim k = 0 \sum N A_{1} (k) .

N \to \infty lim k = 0 \sum N A_{1} (k) .

Luk_{1} (n) = {T \in Luk_{1} and type (T) = n}

Luk_{1} (n) = {T \in Luk_{1} and type (T) = n}

J_{1} (n) = T \in Luk_{1} (n) \sum R_{1} (T)

J_{1} (n) = T \in Luk_{1} (n) \sum R_{1} (T)

S_{1} (0) = 0, S_{1} (n) = k = 1 \sum n J (k) .

S_{1} (0) = 0, S_{1} (n) = k = 1 \sum n J (k) .

A_{1} = n = 0 \sum \infty J_{1} (n) .

A_{1} = n = 0 \sum \infty J_{1} (n) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Combinatorial Mathematics · Polynomial and algebraic computation · Advanced Mathematical Identities

Full text

On Generalizations of the Newton-Raphson-Simpson Method

Mario DeFranco

Abstract

We present generalizations of the Newton-Raphson-Simpson method. Specifically, for a positive integer $m$ and the sequence of coefficients of a Taylor series of a function $f(z)$ , we define an algorithm we denote by NRS( $m$ ) which is a way to evaluate, in our terminology, a sum of $m$ formal zeros of $f(z)$ . We prove that NRS(1) yields the familiar iterations of the Newton-Raphson-Simpson method. We also prove that NRS( $m$ ) is way to evaluate certain $\mathscr{A}$ -hypergeometric series defined by Sturmfels [3]. In order to define these algorithms, we make use of combinatorial objects which we call trees with negative vertex degree.

1 Introduction

The main purpose of this paper is to define a sequence of algorithms NRS( $m)$ for positive integer $m$ which generalize the Newton-Raphson-Simpson method.

We review the Newton-Raphson-Simpson method. Let $f(z):\mathbb{C}\rightarrow\mathbb{C}$ be a a differentiable function and $c_{0}\in\mathbb{C}$ . Recall that the Newton-Raphson-Simpson method constructs a sequence $c_{N},N\geq 0$ defined by

[TABLE]

Then the limit $c=\lim_{N\rightarrow\infty}c_{N}$ , if it exists, is a zero of $f(z)$ . Depending on $f(z)$ and $c_{0}$ , the limit may or may not exist. See Kollerstrom [1] for information about the Newton-Raphson-Simpson method.

Given an integer $d\geq m$ , the algorithm NRS( $m$ ) constructs a sequence of rational functions $J_{m}(n),n\geq 0$ in the variables $a_{0},a_{1},...,a_{d}$ . We think of the $a_{k}$ as being the coefficients of a polynomial

[TABLE]

We prove that

[TABLE]

is equal to the $N$ -th iteration $c_{N}$ of the Newton-Raphson-Simpson method applied to $f(z)$ with $c_{0}=0$ . For larger $m$ , we claim that

[TABLE]

if convergent, is equal to a sum of $m$ zeros of $f(z)$ . In Section 5 we apply the NRS( $m$ ) for certain polynomials and give tables of values for the partial sums of (1). These tables indicate the series for these examples converges to the sum of the $m$ zeros of $f(z)$ that are closest to 0. In Section 6 we talk more about the claimed sufficient conditions on $f(z)$ that yield these results, namely that the zeros of $f(z)$ be positive.

We obtain the rational functions $J_{m}(n)$ by considering certain infinite sums $A_{m}$ in the $a_{i}$ ; choosing a certain order of summation for $A_{m}$ yields the series (1). We define the $A_{m}$ in Section 3 in terms of combinatorial objects which we call trees with negative vertex degree. Next we just present a high-level description of the $A_{m}$ and why they are relevant, including their appearance in [3].

The sums $A_{m}-A_{m-1}$ (where $A_{0}$ denotes 0) are examples of what we call formal zeros for $f(z)$ . For a power series

[TABLE]

we view the $a_{k}$ as indeterminates in some suitable $R_{1}$ and we view $f(z)$ as a function $f(z):R\rightarrow R$ . Then a formal zero for $f(z)$ is an element $Z\in R$ such that

[TABLE]

There are some different ways to approach the $A_{m}$ .

One way, for example, is to view

[TABLE]

as a function of the independent variables $a_{k}$ for $k\neq m-1,m$ and to view $a_{m-1}$ and $a_{m}$ as constants. Then we set

[TABLE]

where

[TABLE]

is a sequence of non-negative integers $n_{i}$ , almost all zero; where

[TABLE]

and where $c({\accentset{\rightharpoonup}{n}})$ are some coefficients. We can solve for the $c({\accentset{\rightharpoonup}{n}})$ by using the set of equations

[TABLE]

for all $\accentset{\rightharpoonup}{n}$ . This method yields a sum for $Z_{m}$ that is equal to $A_{m}-A_{m-1}$ . More generally, for $m_{1}<m_{2}$ , we may also view $Z_{m_{1},m_{2}}$ as a function of the independent variables $a_{i}$ for $i\neq m_{1},m_{2}$ and view $a_{m_{1}}$ and $a_{m_{2}}$ as constants. Then we set

[TABLE]

and solve for $c(\accentset{\rightharpoonup}{n})$ as before.

Another method to obtain the $A_{m}$ is to consider the limits of functions in a suitable ring. For example, if we let

[TABLE]

then the limit

[TABLE]

is equal to $A_{m}-A_{m-1}$ . Again we view $a_{m-1}$ and $a_{m}$ as constants, and we interpret expressions with denominators as geometric series.

In [3], Sturmfels considers differential equations satisfied by the roots of a polynomial and expresses their solutions using certain $\mathscr{A}$ -hypergeometric series. He gives formulas for the coefficients $c(\accentset{\rightharpoonup}{n})$ and denotes some of these solutions by

[TABLE]

In Section 3 we prove that

[TABLE]

We now describe the outline of this paper. In Section 2 we prove that NRS(1) is equivalent to Newton-Raphson-Simpson method. In Section 3 we define NRS( $m$ ) using the trees with negative vertex degree and some functions built from $f(z)$ that we call auxiliary functions. In Section 4 we show how to explicitly compute the auxiliary functions. In Section 5 we apply NRS( $m$ ) to actual polynomials and present numerical tables of the associated quantities. In Section 6 we discuss further work.

2 NRS(1) and the type number of a tree

For each plane tree, we define what we call its type number. We then show how the Newton-Raphson-Simpson method is actually summing trees ordered by this type number (Theorem 2). The NRS( $m$ ) will also sum trees by type number, but the trees will have negative vertex degree.

We recall the definition of rooted trees and plane trees. See Chapter 5 of [2]. We will use the convention that each vertex has degree 0 or degree $\geq 2$ .

Definition 1.

A rooted tree $R$ is a finite acyclic graph with one vertex distinguished as the root which we denote by $\mathrm{root}(R)$ . Let $v$ be a vertex of $R$ . The subtrees of $v$ are the components of $R\backslash v$ that do not contain the root of $R$ . A plane tree $T$ is a rooted tree such that the subtrees of each vertex are linearly ordered. The vertex degree $\mathrm{deg}(v)$ , or just degree, of a vertex $v$ in $T$ is the number of subtrees of $v$ . We also require that $\deg(v)\neq 1$ for any vertex $v$ of $T$ .

Remark 1.

A plane tree $T$ is equivalent to an ordered sequence $\{T_{1},T_{2},...,T_{k}\}$ of other plane trees, for $k\geq 0$ . We call $T_{i}$ the $i$ -th root subtree of $T$ . We denote the plane tree that consists of a single vertex by $T_{0}$ ; this tree corresponds to $k=0$ and the empty sequence. We let $d_{i}(T)$ be the number of vertices of $T$ that have degree $i$ . We call $\displaystyle\{d_{i}(T)\}_{i=1}^{\infty}$ the degree sequence of $T$ . We let $\mathrm{Luk}_{1}$ denote the set of all plane trees (after the Łukasiewicz words reviewed in Section 3).

2.1 The ring $R_{1}$

We define a ring $R_{1}$ and a map from the set of plane trees into $R_{1}$

For integer $k\geq 0$ , let $R_{1}(k)$ be the $\mathbb{Q}$ -vector space spanned by monomials of the form

[TABLE]

where $n_{i}$ are non-negative integers, almost all 0 and with $n_{1}=0$ , satisfying

[TABLE]

Thus an element $r\in R_{1}(k)$ is a finite sum of the monomials of the form (2). For $r_{k_{1}}\in R_{1}(k_{1})$ and $r_{k_{2}}\in R_{1}(k_{2})$ , clearly

[TABLE]

We let $R_{1}$ be the ring consisting of all elements $R_{1}$ of the form

[TABLE]

where $r(k)\in R_{1}(k)$ , and where addition and multiplication in $R_{1}$ are the usual operations on graded infinite sums. Note that in the sum (3) for $R_{1}$ we allow infinitely many of the $r(k)$ to be non-zero; in this case we say that $r$ is an infinite sum. Otherwise we say that $r$ is a finite sum. We let $f(z)$ denote a general power series of the form :

[TABLE]

We view the coefficients $a_{k}$ as indeterminates. We define expressions $R_{1}(T)$ using the coefficients $a_{k}$ and a plane tree $T$ .

Definition 2.

Let $T$ be a rooted planar tree. Define $R_{1}(T)\in R_{1}$ to be

[TABLE]

2.2 The element $A_{1}$

We wish to find a way to compute the sum

[TABLE]

Note that this sum $A_{1}$ is a well-defined element of $R_{1}$ because, for each $k\geq 0$ , there are only finitely many $T$ with

[TABLE]

Thus we write

[TABLE]

with $A_{1}(k)\in R_{1}(k)$ . For a given power series $f(z)$ whose coefficients $a_{k}$ are actual complex numbers, we would like to evaluate $A_{1}$ as a complex number. By direct substitution, the $A_{1}(k)$ easily yield complex numbers (because the $A_{1}(k)$ are polynomials in the $-\frac{a_{i}}{a_{1}}$ ). Then an obvious way to evaluate $A_{1}$ is to take the limit of partial sums

[TABLE]

However, we find that for general $f(z)$ this series does not have desirable convergence properties. We thus specify a different way of ordering the sum (4). We prove that this ordering is equivalent to the Newton-Raphson-Simpson method. If the partial sums of this ordering converge, then $A_{1}$ corresponds to a zero of $f(z)$ .

To specify this different ordering, we define what we call the type number of a plane tree.

2.3 The type number of a plane tree

Definition 3.

Let $T$ be a plane tree. We define a non-negative integer $\mathrm{type}(T)$ , which we call the type number of $T$ , and we say that $T$ is of type $n$ if $\mathrm{type}(T)=n$ . If $T=T_{0}$ consists of a single vertex, then define $\mathrm{type}(T)$ to be 0. Otherwise, define $\mathrm{type}(T)$ to be $n+1$ if either of the following two conditions holds:

1. Exactly one of $T$ ’s root subtrees is of type $n+1$ and the rest are of type at most $n$ .

2. Two or more of $T$ ’s root subtrees are of type $n$ and the rest are of type less than $n$ .

If $T$ satisfies the second condition, we say that $T$ is final.

Definition 4.

Let

[TABLE]

Let

[TABLE]

and

[TABLE]

Note that the elements $J_{1}(n)$ and $S_{1}(n)$ are well-defined elements of $R_{1}$ .

The ordering that we specify is

[TABLE]

Now $J_{1}(n)$ as an element of $R_{1}$ is itself an infinite sum; instead of trying to find an ordering to evaluate that sum, we establish an equation (Theorem 1) in $R_{1}$ that is linear in $J_{1}(n)$ with coefficients in terms of $J(k)$ for $k<n$ . Then we solve those equations for $J_{1}(n)$ . This allows us to express $J_{1}(n)$ as a ratio of elements in $R_{1}$ .

We show how to establish the equations for $J_{1}(n)$ . We use the auxiliary function $f_{0}(x)$ :

Definition 5.

Define the auxiliary function $f_{0}(x)$ by

[TABLE]

The two necessary properties of $f_{0}(x)$ are given below in Property 1.

Definition 6.

Let $X$ be a subset of $\mathrm{Luk}_{1}$ . Define the set $\mathrm{Subtrees}_{m}(X)\subset\mathrm{Luk}_{1}$ to be the set of trees $T$ such that if $T^{\prime}$ is a root subtree of $T$ with $\deg(\mathrm{root}(T^{\prime}))\geq 2$ , then $T^{\prime}\in X$ .

Let $T_{1}\in\mathrm{Luk}_{1}$ such that $T_{1}\notin X$ . Define the set $\mathrm{Subtrees}_{1}(X,T_{1})\subset\mathrm{Luk}_{1}$ to be the set of trees $T$ such that $T$ has exactly one root subtree that is equal to $T_{1}$ ; and if $T^{\prime}\neq T_{1}$ is a root subtree of $T$ with $\deg(\mathrm{root}(T^{\prime}))\geq 2$ , then $T^{\prime}\in X$ .

Property 1.

For $X\subset\mathrm{Luk}_{m}$ , let

[TABLE]

Then

[TABLE]

Let $T_{1}\in\mathrm{Luk}_{1}$ and $\notin X$ . Then

[TABLE]

The next theorem establishes equations that determine $J_{1}(n)$ .

Theorem 1.

[TABLE]

Proof.

We first establish the equation for $J(1)$ . Let $T$ be of type 1. If $T$ is final, then all of its root subtrees of type [math]; that is, all the root subtrees are single vertices. Thus

[TABLE]

If $T$ is not final, then it has exactly one root subtree of type 1 and the rest are single vertices.

[TABLE]

Therefore

[TABLE]

We now determine $J_{1}(n)$ . For any positive integer $k$ , we have that

[TABLE]

represents trees whose root subtrees are of type at most $k$ ; and

[TABLE]

represents trees with exactly one root subtree of type $k$ and the rest of type at most $k-1$ . Therefore

[TABLE]

∎

Lemma 1.

For $n\geq 1$ , we have

[TABLE]

Proof.

We prove this by induction. When $n=1$ , we have by Lemma 1

[TABLE]

Now

[TABLE]

so

[TABLE]

Now assume the statement of the lemma is true for some $n\geq 1$ . By Lemma 1

[TABLE]

Applying the definition of $f_{0}(x)$ and simplifying, we have

[TABLE]

and

[TABLE]

The sum of the above three expressions is

[TABLE]

where we use the fact that

[TABLE]

and the induction hypothesis. Furthermore

[TABLE]

Substituting these results into (5) proves the induction step and the lemma. ∎

Theorem 2.

Let $c_{0}=0$ and $c_{N}$ be defined by the Newton-Raphson-Simpson method applied to $f(z)$ . Let $J_{1}(n)$ be as defined above. For $N\geq 1$ , then

[TABLE]

Proof.

If $N=1$ , then

[TABLE]

and the statement of the theorem is true. We re-express the statement as

[TABLE]

and assume it is true for some $N\geq 1$ . Now we apply Lemma 1 to obtain

[TABLE]

This proves the theorem. ∎

3 NRS( $m$ ) and trees with negative vertex degree

To define the algorithms, we define generalized Łukasiewicz words (Definition 8) and trees with negative vertex degree (Construction 1). The plane trees discussed above have possible vertex degrees of either 0 or an integer that is at least 2. We call these classical plane trees. The trees with negative vertex degree have the same structure as classical plane trees but their vertices may have degree that is any integer except $1$ .

3.1 Generalized Łukasiewicz words and trees with negative vertex degree

Recall that a plane tree is uniquely determined by the preorder (depth-first order) sequence of its vertex degrees. We will call this sequence the prodder sequence. See Chapter 5 of [2] for the definition of preorder. For classical plane trees, this preorder sequence of non-negative integers is called the Łukasiewicz word for the tree. We recall the defining properties of Łukasiewicz words.

Definition 7.

A Łukasiewicz word $l$ may be defined as a sequence $\{l_{i}\}_{i=1}^{N}$ of integers such that

[TABLE]

for each $n<N$ . (Note that according to our convention each $l_{i}\neq 1$ as well.)

Definition 8.

Define a generalized Łukasiewicz word $l$ to be a sequence $\{l_{i}\}_{i=1}^{N}$ of integers such that

[TABLE]

for each $n<N$ . Define $\mathrm{minDegree}(l)$ to be the smallest (most negative) integer $l_{i}$ that occurs in $l$ . For $m\geq 1$ , define $\mathrm{Luk}_{m}$ to be the set of all generalized Łukasiewicz words $l$ such that $\mathrm{minDegree}(l)\geq-m+1$ .

Construction 1.

Given a generalized Łukasiewicz word $l=\{l_{i}\}_{i=1}^{N}$ , we construct a tree $T$ with negative vertex degree in the following way. We construct a new word $U(l)$ from $l$ by taking each $l_{i}$ in $l$ with $l_{i}<0$ and replacing it with a sequence of 0’s of length $|l_{i}|+1$ . Thus the generalized Łukasiewicz word

[TABLE]

yields

[TABLE]

By construction $U(l)$ is a non-generalized Łukasiewicz word and thus is the preorder sequence for some classical plane tree which we call $U(T)$ . Now from $U(T)$ we construct the tree $T$ : we give $T$ the same structure as $U(T)$ , but for each set of $|l_{i}|+1$ vertices of degree 0 in $U(T)$ that came from an $l_{i}<0$ in $l$ , we say that the rightmost vertex $v$ of these vertices in the preorder has degree $l_{i}$ ; that the $|l_{i}|$ vertices of degree 0 immediately to the left of $v$ in the preorder are “canceled” by $v$ ; and that these canceled vertices do not contribute to the number of vertices of degree 0 in $T$ . We say that a canceled vertex does not have any degree but we do consider it a subtree of its parent vertex. We say that the classical plane tree $U(T)$ is the* underlying tree of $T$ . We say that $T$ has the preorder sequence $l$ . See Figure 1.*

For $m\geq 1$ , we identify the set of all plane trees whose vertex degrees are at least $-m+1$ with $\mathrm{Luk}_{m}$ .

Definition 9.

Let $T$ be a tree with negative vertex degree with preorder sequence $l=\{l_{i}\}_{i=1}^{N}$ . We define the type number $\mathrm{type}(T)$ of $T$ to be equal to $\mathrm{type}(U(T))$ , where $U(T)$ is the underlying tree of $T$ . We say that $T$ is final if $U(T)$ is final. We define $\mathrm{terminal}(T)$ to be the number of consecutive 0’s at the right end of $l$ .

Remark 2.

We can construct any tree $T$ with negative vertex degree by specifying a sequence of trees $\{T_{1},T_{2},...,T_{k}\}$ , where each $T_{i}$ is a tree of negative vertex degree, and then appropriately assigning negative degrees to those trees $T_{i}$ that consist of a single vertex. That is, suppose $T_{i}$ is a single vertex and we assign it to have degree $-h<0$ . Then there must be a subsequence of the form

[TABLE]

*where $T_{j}$ consists of a single vertex for $i-k+2\leq j<i$ , and $\mathrm{terminal}(T_{i-k+1})\geq h-(k-2)$ . This motivates the following definition. *

Definition 10.

For integers $k$ and $h$ with $m-1\geq h\geq k-1\geq 1$ , define a $(k,h)_{m}$ -block to be a sequence

[TABLE]

of trees in $\mathrm{Luk}_{m}$ where there are $k-1$ trees $T_{0}$ after $T_{1}$ , and $\mathrm{terminal}(T_{1})\geq h-(k-2)$ . Define a $1_{m}$ -block to be a sequence consisting of a single tree

[TABLE]

where $T_{1}$ is any tree in $\mathrm{Luk}_{m}$ . We refer to both $(k,h)_{m}$ -blocks and $1_{m}$ blocks as blocks.

Remark 3.

We identify $\mathrm{Luk}_{m}$ with the set of sequences

[TABLE]

where $N\geq 0$ and $B_{i}$ is either a $(k,h)_{m}$ -block or a $1_{m}$ -block. The tree $T_{0}$ corresponds to the empty sequence (when $N=0$ ). We compare this identification to that of Remark 1.**

3.2 The number of generalized Łukasiewicz words with a given degree sequence

Let

[TABLE]

be a sequence of non-negative integers such that $d_{1}=0$ ; only finitely many of the $d_{k}$ are non-zero; and

[TABLE]

Then the number of Łukasiewicz words

[TABLE]

such that the integer $k$ appears $d_{k}$ times in $l$ is equal to

[TABLE]

Theorem 5.3.10 of [2] proves this statement. We present a corresponding result about generalized Łukasiewicz words. The proof in [2] directly carries over and we present it here in that generality.

Theorem 3.

Let

[TABLE]

be a sequence of non-negative integers such that $d_{1}=0$ ; only finitely many of the $d_{i}$ are non-zero; and

[TABLE]

The the number of generalized Łukasiewicz words

[TABLE]

with degree sequence $d$ is

[TABLE]

Proof.

Let

[TABLE]

Consider the set $\mathcal{A}_{d}$ of all sequences

[TABLE]

such that $d_{k}$ of the $l_{i}$ equal $k$ and

[TABLE]

The order of $\mathcal{A}_{d}$ is thus

[TABLE]

Let $l\in\mathcal{A}_{d}$ and let $C(i,l)$ denote the $i$ -th conjugate of $l$ :

[TABLE]

We claim that these $N$ conjugates are distinct. If $C(i;l)=C(j;l)$ for $j>i$ , then that means

[TABLE]

whenever $k\equiv k^{\prime}\mod(j-i)$ . This implies that $j-i$ divides $N$ and that each $d_{k}$ is a multiple of $\frac{N}{j-i}$ . By assumption

[TABLE]

so $\frac{N}{j-i}$ divides $1$ . But that means $j-i=N$ , which is impossible since $1\leq i,j\leq N$ . Therefore the $N$ conjugates of $l$ are distinct.

We claim that exactly one of these conjugates is a generalized Łukasiewicz word. First we show that at least one conjugate is a generalized Łukasiewicz word. Suppose that the negative integer $M$ is an attained lower bound for the partial sums:

[TABLE]

for all $1\leq k\leq N$ and that

[TABLE]

with $k_{1}$ minimal (we may assume that $k_{1}\neq N$ , or else $M=-1$ and we are done). Then we claim that the conjugate $w$

[TABLE]

is a generalized Łukasiewicz word. We have

[TABLE]

for all $k_{1}\leq k\leq N$ , or else $M$ would not be a lower bound.

Now suppose

[TABLE]

for some $1\leq k<k_{1}$ . Since

[TABLE]

that implies

[TABLE]

contradicting the minimality of $k_{1}$ . Therefore $w$ is a generalized Łukasiewicz word.

Now suppose

[TABLE]

is a generalized Łukasiewicz word. If some conjugate $w^{\prime}$

[TABLE]

for $j\neq 1$ is also a generalized Łukasiewicz word, then

[TABLE]

and

[TABLE]

Therefore

[TABLE]

But this contradicts the assumption that $w$ is a generalized Łukasiewicz word. Therefore the only conjugate of $w$ that is a generalized Łukasiewicz word is $w$ itself.

Let $\mathcal{L}_{d}$ denote the set of generalized Łukasiewicz words with degree sequence $d$ . Now $\mathcal{L}_{d}\subset\mathcal{A}_{d}$ , and we have partitioned $\mathcal{A}_{d}$ into subsets that each have order $N$ such that each subset contains exactly one generalized Łukasiewicz word. Thus

[TABLE]

This proves the theorem.

∎

3.3 The ring $R_{m}$

We define the ring $R_{m}$ . For $k\geq 0$ , let $R_{m}(k)$ be the $\mathbb{Q}$ -vector space spanned by monomials of the form

[TABLE]

where $n_{i}$ are non-negative integers, almost all zero with $n_{1}=0$ , satisfying

[TABLE]

Thus an element $r\in R_{m}(k)$ is a finite sum of the monomials of the form (7). For $r_{k_{1}}\in R_{m}(k_{1})$ and $r_{k_{2}}\in R_{m}(k_{2})$ , then

[TABLE]

We let $R_{m}$ be the ring consisting of all elements $r$ of the form

[TABLE]

where $r(k)\in R_{m}(k)$ ; and where addition and multiplication in $R_{m}$ are the usual operations on infinite sums. Note that in the sum (8) we allow infinitely many of the $r(k)$ to be non-zero.

Definition 11.

Let $T\in\mathrm{Luk}_{m}$ . Define

[TABLE]

We call $R_{m}(T)$ the $R_{m}$ -expression of $T$ .

3.4 The element $A_{m}$

Definition 12.

[TABLE]

As for $A_{1}$ in section 2, the elements $A_{m}$ are well-defined elements of $R_{m}$ because for any $k\geq 0$ , there are only finitely many trees $T\in\mathrm{Luk}_{m}$ with

[TABLE]

We let

[TABLE]

denote the $\mathscr{A}$ -hypergeometric series of [3].

Theorem 4.

The $\mathscr{A}$ -hypergeometric series $\displaystyle\left[\frac{a_{m-1}}{a_{m}}\right]$ may be viewed as an element of $R_{m}$ . As elements of $R_{m}$ ,

[TABLE]

Proof.

To agree with the notation of [3], we let $j=m$ . In equation 4.2 of [3], Sturmfels defines $\displaystyle\left[\frac{a_{j-1}}{a_{j}}\right]$ to be the infinite sum

[TABLE]

where the sum is over all sequences $i$ of non-negative integers $\{i_{0},i_{1},...,i_{n}\}$ such that

[TABLE]

and

[TABLE]

Using equation (10), equation (11) may be rewritten as

[TABLE]

And

[TABLE]

Thus we can interpret each $i_{k},k\neq j,j-1$ as the number of vertices in a tree $T$ with negative vertex degree that have degree $1+k-j$ ; $i_{j-1}+1$ as the number of vertices that have degree [math]; and $i_{j}$ as the number of vertices that have vertex degree (that is, are not canceled). By Theorem 3, expression (12) counts the number of all such $T$ . The monomial factor in (9) is then $-R_{j}(T)$ . ∎

3.5 Strategy to determine $J_{m}(n)$

We perform this sum by ordering the trees $T$ according to their type number: letting

[TABLE]

we write

[TABLE]

As in Section 2 for $m=1$ , the quantity $J_{m}(n)$ is an infinite sum. Instead of specifying an ordering for this sum, we establish equations in $R_{m}$ that allow us to solve for $J_{m}(n)$ . Specifically, we introduce the quantities $J_{i,m}(n)$ for $0\leq i\leq m-1$ and establish a system of $m$ equations that are linear in the $J_{i,m}(n)$ . We define these $J_{i,m}(n)$ now.

First, recall the construction of trees in $\mathrm{Luk}_{m}$ discussed in Remark 2. Given integers $h$ and $k$ , the number $\mathrm{terminal}(T)$ determines whether $T$ is a valid choice for the first tree in the $(k,h)_{m}$ -block. Therefore we partition $\mathrm{Luk}_{m}$ into $m+1$ disjoint subsets based on $\mathrm{terminal}(T)$ :

Definition 13.

Let $T_{0}$ denote the tree consisting of a single vertex and let $m\geq 1$ . If $m>1$ , define

[TABLE]

For $i\neq 1$ and $0\leq i<m-1$ , define

[TABLE]

and

[TABLE]

Thus

[TABLE]

Refining this partition by the type number yields the following terms.

Definition 14.

For $0\leq i\leq m-1$ , let

[TABLE]

Thus for $n\geq 1$

[TABLE]

We next explain how to establish the system of linear equations to determine the $J_{i,m}(n)$ . When $m=1$ , the quantity

[TABLE]

and we used the single auxiliary function $f_{0}(x)$ to establish an equation for $J_{0,1}(n)$ . For general $m$ , we will use $m$ auxiliary functions $f_{i,m}(\accentset{\rightharpoonup}{x})$ in $m$ variables $\accentset{\rightharpoonup}{x}=(x_{0},x_{1},...,x_{m-1})$ . The two properties that these auxiliary have which generalize the those of $f_{0}(x)$ are listed in Property 2 below.

Definition 15.

Let $X$ be a subset of $\mathrm{Luk}_{m}$ . Define the set $\mathrm{Subtrees}_{m}(X)\subset\mathrm{Luk}_{m}$ to be the set of trees $T$ such that if $T^{\prime}$ is a subtree of the root of $T$ with $\deg(\mathrm{root}(T^{\prime}))\geq 2$ , then $T^{\prime}\in X$ .

Let $T_{1}\in\mathrm{Luk}_{m}$ such that $T_{1}\notin X$ . Define the set $\mathrm{Subtrees}_{m}(X,T_{1})\subset\mathrm{Luk}_{m}$ to be the set of trees $T$ such that the root of $T$ has exactly one subtree that is equal to $T_{1}$ ; and if $T^{\prime}\neq T_{1}$ is a subtree of the root of $T$ with $\deg(\mathrm{root}(T^{\prime}))\geq 2$ , then $T^{\prime}\in X$ .

Property 2.

For $X\subset\mathrm{Luk}_{m}$ , let

[TABLE]

and

[TABLE]

Then

[TABLE]

Let $T_{1}\in\mathrm{Luk}_{i,m}$ and $\notin X$ . Then

[TABLE]

To construct the auxiliary functions $f_{i,m}(\accentset{\rightharpoonup}{x})$ that satisfy these properties, we define the map $P$ , partial blocks, and partial trees next.

3.6 The forgetful map $P$

Recall Remark 3 in which we identify a tree with negative vertex degree with a sequence of blocks. Let $T_{1}$ denote the first tree in a block. We define a forgetful map $P$ on the set of blocks such that $P$ forgets everything about the tree $T_{1}$ except the integer $\mathrm{terminal}(T_{1})$ and whether $T_{1}=T_{0}$ . Specifically, let $B$ be a $(k,h)_{m}$ -block. If $T_{1}\neq T_{0}$ , define $P(B)$ to be the triple

[TABLE]

and if $T_{1}=T_{0}$ , define $P(B)$ to be the triple

[TABLE]

On the set of $1_{m}$ -blocks, define

[TABLE]

and, if $T_{1}\neq T_{0}$ ,

[TABLE]

We call the images of $P$ partial blocks which for concreteness are defined next along with their $R_{m}$ -expressions.

Definition 16.

Let $T_{0}$ denote the tree consisting of a single vertex. In the following four cases we define a partial block $b$ , its length which we denote $\mathrm{length}(b)$ , and its $R_{m}$ -expression $R_{m}(b)$ . The expression $R_{m}(b)$ will be a monomial in $x_{i}$ and $(-\frac{a_{i}}{a_{m}})^{\pm 1}$ . We define $b$ to be:

1. A triple of integers

[TABLE]

where $1\leq h\leq m-1$ ; $2\leq k\leq h+1$ ; and $h-(k-2)\leq i\leq m-1$ . Define

[TABLE]

and

[TABLE]

2. The triple

[TABLE]

where $k\geq 2$ . Define

[TABLE]

and

[TABLE]

3. For $0\leq i\leq m-1$ , the $1$ -tuple

[TABLE]

Define

[TABLE]

and

[TABLE]

4. The 1-tuple

[TABLE]

Define

[TABLE]

and

[TABLE]

The map $P$ extends naturally to $\mathrm{Luk}_{m}$ : writing $T$ as a sequence of blocks

[TABLE]

we set

[TABLE]

We call these images partial trees which we define next. The part about the $s$ empty subtrees will be necessary to express a recurrence relation among the $f_{i,m}(\accentset{\rightharpoonup}{x})$ in Section 4.

Definition 17.

For integer $s\geq 0$ , define a partial tree with $s$ empty subtrees to be a sequence:

[TABLE]

where $N\geq 0$ ; each $b_{i}$ is either a $(k,h)_{m}$ -partial block or a $1_{m}$ -partial block; and there are $s$ $\emptyset$ ’s representing empty subtrees. Let $\overline{\mathrm{Luk}}_{m;s}$ denote the set of all such partial trees. We say that

[TABLE]

Thus

[TABLE]

For $\overline{T}\in\overline{\mathrm{Luk}}_{m;0}$ , then by construction $P^{-1}(\overline{T})\subset\mathrm{Luk}_{i,m}$ for some $i$ . Define $\mathrm{terminal}(\overline{T})$ to be this $i$ .

Define

[TABLE]

Define the $R_{m}$ -expression $R_{m}(\overline{T})$ of the partial tree $\overline{T}$ to be

[TABLE]

For $\overline{T}\in\overline{\mathrm{Luk}}_{m;0}$ , we view $R_{m}(\overline{T})=R_{m}(\overline{T})(x_{0},x_{1},...,x_{m-1})$ as a function of the variables $x_{i}$ . Next we show that $R_{m}(\overline{T})$ satisfies properties similar to those listed in Property 2:

Lemma 2.

Let $\overline{T}\in\overline{\mathrm{Luk}}_{m;0}$ and $X\subset\mathrm{Luk}_{m}$ . Suppose $T_{1}\notin X$ and $T_{1}\in\mathrm{Luk}_{i,m}$ .

[TABLE]

Proof.

$1$ . This follows immediately from the definition of a partial tree. The expression $R_{m}(\overline{T})(\accentset{\rightharpoonup}{S}(X))$ is obtained by substituting each $x_{i}$ in $R_{m}(\overline{T})$ with

[TABLE]

Expanding out, we obtain a sum of terms; each term is an $R_{m}$ expression of a tree $T\in P^{-1}(\overline{T})$ whose root subtrees are in $X$ if they have root degree $\geq 2$ . Every such $T$ corresponds to some term.

$2$ . By the product rule, the expression $R_{m}(T_{1})\frac{\partial{R_{m}(\overline{T})}}{\partial x_{i}}(\accentset{\rightharpoonup}{x})$ is obtained by choosing each factor of $x_{i}$ that appears in $R_{m}(\overline{T})$ and replacing it with $R_{m}(T_{1})$ , and then adding all such expressions. This corresponds to making $T_{i}$ a root subtree of $T$ in the spot where the $x_{i}$ was removed. Substituting it the $S(X)$ as in part $1$ now yields the sum of terms that correspond to elements in $P^{-1}(\overline{T})\cap\mathrm{Subtrees}_{m}(X,T_{1})$ .

∎

3.7 The system of linear equations for $J_{i,m}(n)$

Now we can define the auxiliary functions $f_{i,m}(\accentset{\rightharpoonup}{x})$ . The auxiliary functions will satisfy the properties in Property 2 because the $R_{m}(\overline{T})$ satisfy similar properties. We can then establish the system of equations for $J_{i,m}(n)$ in Theorem 5.

Definition 18.

[TABLE]

Corolllary 1.

The auxiliary functions $f_{i,m}(\accentset{\rightharpoonup}{x})$ satisfy the two properties in Property 2.

Proof.

This is immediate from the definition of $f_{i,m}(\accentset{\rightharpoonup}{x})$ , Lemma 2 and the disjoint union

[TABLE]

∎

Recall the definitions

[TABLE]

and

[TABLE]

Theorem 5.

Let $F(x_{0},x_{1},...,x_{m-1})$ be a function. For $n\geq 1$ , define $L_{m}(F,n)$ to be:

[TABLE]

The following is a system of linear equations in the unknowns $J_{i,m}(1)$ :

[TABLE]

Assuming that $J_{i,m}(k)$ has been evaluated for $1\leq k\leq n$ , the following is a system of linear equations in the unknowns $J_{i,m}(n+1)$ :

[TABLE]

Proof.

Let $n=1$ . Using the properties of the auxiliary functions in Property 2, we obtain

[TABLE]

and

[TABLE]

Adding these two equations yields

[TABLE]

Considering all $i$ , $0\leq i\leq m-1$ , gives a system of $m$ equations in the $m$ unknowns $J_{i,m}(1)$ .

Now let $n>1$ . Again using the properties of the auxiliary functions in Property 2, we obtain

[TABLE]

and

[TABLE]

Adding these two equations yields

[TABLE]

Considering all $i$ , $0\leq i\leq m-1$ , gives a system of $m$ equations in the $m$ unknowns $J_{i,m}(n+1)$ .

∎

Solving this system allows us to express $J_{i,m}(n)$ as a ratio of elements in $R_{m}$ , and via

[TABLE]

we can also can be express $J_{m}(n)$ as a ratio of elements in $R_{m}$ . Then $A_{m}$ is the sum

[TABLE]

We will explicitly construct the auxiliary functions and find solutions to these systems in Section 5.

4 Explicit construction of the auxiliary functions

We next show how to compute $f_{i,m}(\accentset{\rightharpoonup}{x})$ . We use a recurrence relation (equation (16)) that can be implemented by a computer. We use the function $\mathrm{PartialTrees}_{m}(\accentset{\rightharpoonup}{x},s)$ defined next.

Definition 19.

[TABLE]

To compute $\mathrm{PartialTrees}_{m}(\accentset{\rightharpoonup}{x},s)$ , we use the following functions.

The function $\mathrm{PartialBlock}_{m}(\accentset{\rightharpoonup}{x};k,h)$ for $h\geq k-1\geq 1$ is the sum of $R_{m}$ -expressions of all $(k,h)_{m}$ -partial blocks:

[TABLE]

Let $\accentset{\rightharpoonup}{n}$ denote

[TABLE]

For $s\geq 0$ , the function $\mathrm{PartialTrees}_{m}(\accentset{\rightharpoonup}{x};s,\accentset{\rightharpoonup}{n})$ is the sum of $R_{m}$ -expressions of all partial trees $\overline{T}$ with $s$ empty subtrees such that the sequence of partial blocks for $\overline{T}$ contains $n_{k}$ partial blocks of length $k$ :

[TABLE]

Then $\mathrm{PartialTrees}_{m}(\accentset{\rightharpoonup}{x};s)$ is the sum of the functions $\mathrm{PartialTrees}_{m}(\accentset{\rightharpoonup}{x};s,\accentset{\rightharpoonup}{n})$ over all $\accentset{\rightharpoonup}{n}$ :

[TABLE]

We note that if $f(z)$ is a polynomial, then the above sum over $\accentset{\rightharpoonup}{n}$ is a finite sum, and then $\mathrm{PartialTrees}_{m}(\accentset{\rightharpoonup}{x};s)$ is a polynomial in $x_{0},x_{1},...,x_{m-1}$ whose coefficients are polynomials in the $\displaystyle-\frac{a_{k}}{a_{m}}$ .

For an $N\geq 0$ and $s\geq 0$ , let $\overline{T}$ be a partial tree with $s$ empty subtrees:

[TABLE]

Recall that $f_{i,m}(\accentset{\rightharpoonup}{x},s)$ is the sum of $R_{m}$ -expressions of all partial trees $\overline{T}$ with $s$ empty subtrees and $\mathrm{terminal}(\overline{T})=i$ . We consider the three cases of whether $i$ is equal to 0; greater than 0 and less than $m-1$ ; and equal to $m-1$ .

1. i = 0.

We have the following three subcases.

subcase(1) $N>0$ and $b_{N}=(0)$

The sum of the $R_{m}$ -expressions for such partial trees is

[TABLE]

because we take any partial tree with $s+1$ empty subtrees and replace the leftmost empty to be the partial block $b=(0)$ . This partial block $b$ has $R_{m}$ -expression $x_{0}$ .

subcase(2) $N>0$ and $\mathrm{length}(b_{N})=k\geq 2$

Such partial trees are obtained from taking a partial tree with $s+k$ empty subtrees, and replacing the first $k$ empty subtrees with a partial block $b$ of length $k$ . The sum of $R_{m}$ -expressions of all such partial trees is

[TABLE]

subcase(3) $N=0$

The $R_{m}$ -expression of this partial tree $\overline{T}$ is

[TABLE]

Adding the $R_{m}$ -expressions for these three cases yields

[TABLE]

2. 1 $\mathbf{\leq}$ i $\mathbf{<}$ m $-$ 1

We have the following two subcases.

subcase(1) $N>0$ and $b_{N}=(i)$

Such partial trees are obtained by taking a partial tree with $s+1$ empty subtrees and replacing the leftmost empty subtree with the partial block $b=(i)$ . The sum of the $R_{m}$ -expressions of all such $\overline{T}$ is

[TABLE]

subcase(2) $N>0$ and $b_{N}=(T_{0})$

Such partial trees are obtained by taking a partial tree $\overline{T}^{\prime}$ with $s+1$ empty subtrees and $\mathrm{terminal}(\overline{T}^{\prime})=i-1$ and replacing the leftmost empty subtree with the partial block $b=(T_{0})$ . The sum of the $R_{m}$ -expressions of all such $\overline{T}$ created this way is

[TABLE]

Adding the $R_{m}$ -expressions for these two subcases yields

[TABLE]

3. i $\mathbf{=}$ m $-$ 1

Then we have the following two sub cases.

subcase(1)

There is an integer $k$ where $0\leq k\leq m-2$ such that there is a subsequence of $\overline{T}$

[TABLE]

where

[TABLE]

for some $i\geq m-1-k$ , and

[TABLE]

for $1\leq r\leq k$ . Such a $\overline{T}$ is obtained from a $\overline{T}^{\prime}$ with $k+1$ empty subtrees and replacing the leftmost empty subtree with the partial block $(i)$ , and then replacing each of the next $k$ empty subtrees with the partial block $(T_{0})$ . Summing over all $k$ , the sum of $R_{m}$ -expressions of all such trees $\overline{T}$ is

[TABLE]

subcase(2)

We have the subsequence

[TABLE]

where $b_{r}=(T_{0})$ for $N-(m-1)-1\leq r\leq N$ . Such a $\overline{T}$ is obtained from a $\overline{T}^{\prime}$ with $m-1$ empty subtrees and replacing each of the empty subtrees with the partial block $(T_{0})$ . The sum of the $R_{m}$ -expressions of such trees created this way is

[TABLE]

Therefore

[TABLE]

$\square$

We list the auxiliary functions for a quintic polynomial

[TABLE]

$\mathbf{m=1}$ :

[TABLE]

$\mathbf{m=2}$ :

[TABLE]

$\mathbf{m=3}$ :

[TABLE]

$\mathbf{m=4}$ :

[TABLE]

$\mathbf{m=5}$ :

[TABLE]

5 Numerical examples

5.1 A quintic polynomial with rational zeros

Now we specialize $f(z)$ to be the following polynomial with real coefficients and map the various expressions to real numbers.

[TABLE]

5.2 A Jensen polynomial for $\xi(\frac{1}{2}+i\sqrt{t})$

We apply NRS $(m)$ algorithms to the third degree Jensen polynomial for $\xi(\frac{1}{2}+i\sqrt{t})$ . We recall the definition of Jensen polynomials for a power series

[TABLE]

The $N$ -th degree Jensen polynomial is

[TABLE]

It is a theorem that a power series series $f(z)$ has all real zeros if and only all its Jensen polynomials have all real zeros.

Let

[TABLE]

Let $a_{i}$ denote the power series coefficients of $\xi(\frac{1}{2}+i\sqrt{t})$ :

[TABLE]

To compute $a_{k}$ , we use the formula

[TABLE]

where

[TABLE]

and

[TABLE]

is the unsigned Stirling number of the first kind. In another paper we prove that the formula (17) holds by proving

[TABLE]

where

[TABLE]

and $p>1$ . There we prove that the coefficients of the powers of $s$ in $g_{n}(s,p)$ are positive if $p>1$ and the series is absolutely convergent for any $s\in\mathbb{C}$ . We use formula (17) summing up to $n=100$ to compute

[TABLE]

The third degree Jensen polynomial is

[TABLE]

6 Further work

Suppose

[TABLE]

is a polynomial of degree $N$ with $a_{i}\in\mathbb{C}$ and with all positive zeros $z_{k}$ such that $z_{k}<z_{k+1}$ .

•

The NRS( $m$ ) algorithm applied to the coefficients $a_{k}$ is convergent and outputs the sum

[TABLE]

•

For each $0\leq i\leq m-1$ , the series $\sum_{n=1}^{\infty}J_{i,m}(n)$ converges quadratically when the zeros $z_{i}$ are distinct.

•

The convergence of NRS( $m$ ) implies that those outputs yield zeros of $f(z)$ .

Now let

[TABLE]

where $a_{k}$ and $z_{k}$ are indeterminates. We say a polynomial is $z_{i}$ -positive if it is a polynomial in the $z_{1},...,z_{N}$ that has positive coefficients. A rational function is $z_{i}$ -positive if it is a ration of $z_{i}$ -positive polynomials.

•

$J_{m}(n)$ is $z_{i}$ positive.

•

The difference

[TABLE]

is $z_{i}$ positive.

•

The expressions arising from formal contour integrals are $z_{i}$ -positive.

•

The $J_{i,m}(n)$ may also have $z_{i}$ -positive properties.

•

Combinatorial proof that $A_{m}-A_{m-1}$ is a formal zero. We proof for $m=1,2$ .

•

Interpret the other hypergeometric series in [3] using trees and the Newton-Raphson-Simpson method.

•

Use more general rings to express formal zeros. For example, let $I\subset\mathbb{N}_{0}$ . Let

[TABLE]

and $z_{0}$ a zero of $g(z)$ . Set

[TABLE]

with $a^{\accentset{\rightharpoonup}{n}}$ a multinomial in the $a_{i},i\notin I$ .

•

Use other orderings to evaluate $A_{m}$ . For a suitable degree 2 polynomial, the sum $A_{1}$ can be summed by an ordering that yields the Taylor series of the square root in the quadratic formula. We would find corresponding orderings for formal zeros that generalize this ordering to higher degree, for example by expressing expressing formal zeros in terms of Taylor series, some of evaluate to radical expressions. See how these hypothetical Taylor series are related to Turán inequalities.

•

Relationship between Turán inequalities and expressions for $J_{m}(n)$ . See if Turán inequalities or other set of conditions imply convergence of NRS( $m$ ).

•

Householder methods expressed in terms of trees and generalized as was NRS( $m$ ). Maybe altering $L_{(}F,n)$ to include higher-order terms or making the type number of a tree to include multiple parameters.

•

For arbitrary analytic functions $f(z)$ , express the convergence of $f_{i,m}(\accentset{\rightharpoonup}{x})$ in terms of the convergence of $f(z)$ .

•

Apply NRS( $m$ ) to coefficients of $\xi(\frac{1}{2}+\sqrt{t})$ using (17). Perhaps try $q$ -analogues of terms in (17) to try to prove positivity of NRS( $m$ ) quantities. See if $q$ -analogues of coefficients of (17) satisfy Turán inequalities. For example, the $q$ -analogue of

[TABLE]

may be

[TABLE]

The unsigned Stirling numbers have $q$ -analogues, and the $b_{k}$ may be expressed via elliptic integrals in terms of a rational number sequence we denote by $\{\kappa(n)\}_{n=0}^{\infty}=1,-3,26,-378,8136,-244728,...$ . Each number $\kappa(n)$ is defined as a sum over a certain finite subset $\mathcal{E}(2n+1)$ of classical plane trees by:

[TABLE]

This allows us to express $b_{k}$ as a series of rational numbers. A $q$ -analogue of $\kappa(n)$ could lead to $q$ -analogues of $b_{k}$ .

•

Galois theory applied to formal zeros.

•

NRS( $m$ ) applied to the function $f(z)=\sum_{n=0}^{\infty}\frac{z^{n}}{(\alpha n)!}$ or its $q$ -analogues.

Bibliography3

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Kollerstrom, Nick. “Thomas Simpson and ‘Newton’s Method of Approximation’: An Enduring Myth”. The British Journal for the History of Science Vol. 25, No. 3 (Sep., 1992), pp. 347-354
2[2] Stanley, Richard P. Enumerative Combinatorics, Volume 2 . Cambridge University Press, Cambridge, UK, 1999.
3[3] Sturmfels, Bernd. “Solving algebraic equations in terms of 𝒜 𝒜 \mathscr{A} -hypergeometric series”. Discrete Mathematics 210, (2000), pp. 171-181.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

On Generalizations of the Newton-Raphson-Simpson Method

Abstract

1 Introduction

2 NRS(1) and the type number of a tree

Definition 1**.**

Remark 1**.**

2.1 The ring R1R_{1}R1​

Definition 2**.**

2.2 The element A1A_{1}A1​

2.3 The type number of a plane tree

Definition 3**.**

Definition 4**.**

Definition 5**.**

Definition 6**.**

Property 1**.**

Theorem 1**.**

Proof.

Lemma 1**.**

Proof.

Theorem 2**.**

Proof.

3 NRS(mmm) and trees with negative vertex degree

3.1 Generalized Łukasiewicz words and trees with negative vertex degree

Definition 7**.**

Definition 8**.**

Construction 1**.**

Definition 9**.**

Remark 2**.**

Definition 10**.**

Remark 3**.**

3.2 The number of generalized Łukasiewicz words with a given degree sequence

Theorem 3**.**

Proof.

3.3 The ring RmR_{m}Rm​

Definition 11**.**

3.4 The element AmA_{m}Am​

Definition 12**.**

Theorem 4**.**

Proof.

3.5 Strategy to determine Jm(n)J_{m}(n)Jm​(n)

Definition 13**.**

Definition 14**.**

Definition 15**.**

Property 2**.**

3.6 The forgetful map PPP

Definition 16**.**

Definition 17**.**

Lemma 2**.**

Proof.

3.7 The system of linear equations for Ji,m(n)J_{i,m}(n)Ji,m​(n)

Definition 18**.**

Corolllary 1**.**

Proof.

Theorem 5**.**

Proof.

4 Explicit construction of the auxiliary functions

Definition 19**.**

5 Numerical examples

5.1 A quintic polynomial with rational zeros

5.2 A Jensen polynomial for ξ(12+it)\xi(\frac{1}{2}+i\sqrt{t})ξ(21​+it​)

6 Further work

Definition 1.

Remark 1.

2.1 The ring $R_{1}$

Definition 2.

2.2 The element $A_{1}$

Definition 3.

Definition 4.

Definition 5.

Definition 6.

Property 1.

Theorem 1.

Lemma 1.

Theorem 2.

3 NRS( $m$ ) and trees with negative vertex degree

Definition 7.

Definition 8.

Construction 1.

Definition 9.

Remark 2.

Definition 10.

Remark 3.

Theorem 3.

3.3 The ring $R_{m}$

Definition 11.

3.4 The element $A_{m}$

Definition 12.

Theorem 4.

3.5 Strategy to determine $J_{m}(n)$

Definition 13.

Definition 14.

Definition 15.

Property 2.

3.6 The forgetful map $P$

Definition 16.

Definition 17.

Lemma 2.

3.7 The system of linear equations for $J_{i,m}(n)$

Definition 18.

Corolllary 1.

Theorem 5.

Definition 19.

5.2 A Jensen polynomial for $\xi(\frac{1}{2}+i\sqrt{t})$