Efficiently Computing Real Roots of Sparse Polynomials

Gorav Jindal; Michael Sagraloff

arXiv:1704.06979·cs.SC·April 25, 2017

Efficiently Computing Real Roots of Sparse Polynomials

Gorav Jindal, Michael Sagraloff

PDF

Open Access

TL;DR

This paper introduces an efficient algorithm for computing and isolating real roots of sparse polynomials with guaranteed accuracy, leveraging coefficient oracles and providing complexity bounds.

Contribution

It presents a novel root-finding algorithm for sparse polynomials that efficiently isolates roots with complexity bounds polynomial in key parameters.

Findings

01

Algorithm computes disjoint disks containing roots with specified precision.

02

Bit complexity is polynomial in polynomial degree and coefficient bounds.

03

Effective root isolation for polynomials with simple roots.

Abstract

We propose an efficient algorithm to compute the real roots of a sparse polynomial $f \in R [x]$ having $k$ non-zero real-valued coefficients. It is assumed that arbitrarily good approximations of the non-zero coefficients are given by means of a coefficient oracle. For a given positive integer $L$ , our algorithm returns disjoint disks $Δ_{1}, \dots, Δ_{s} \subset C$ , with $s < 2 k$ , centered at the real axis and of radius less than $2^{- L}$ together with positive integers $μ_{1}, \dots, μ_{s}$ such that each disk $Δ_{i}$ contains exactly $μ_{i}$ roots of $f$ counted with multiplicity. In addition, it is ensured that each real root of $f$ is contained in one of the disks. If $f$ has only simple real roots, our algorithm can also be used to isolate all real roots. The bit complexity of our algorithm is polynomial in $k$ and $lo g n$ , and near-linear in $L$ …

Equations42

f (x) = \sum_{i = 1}^{k} f_{i} x^{e_{i}} \in R [x],

f (x) = \sum_{i = 1}^{k} f_{i} x^{e_{i}} \in R [x],

\tilde{O} ((k + lo g n) \cdot (L + n lo g max (1, ∣ c ∣) + lo g n + τ + k)) .

\tilde{O} ((k + lo g n) \cdot (L + n lo g max (1, ∣ c ∣) + lo g n + τ + k)) .

\frac{f ( m ^{*} )}{f ( b )} = i = 1 \prod n \frac{m ^{*} - z _{i}}{b - z _{i}}

\frac{f ( m ^{*} )}{f ( b )} = i = 1 \prod n \frac{m ^{*} - z _{i}}{b - z _{i}}

\frac{m ^{*} - z _{i}}{b - z _{i}} \leq 2 t + 1 \leq 2 k^{2} + 1,

\frac{m ^{*} - z _{i}}{b - z _{i}} \leq 2 t + 1 \leq 2 k^{2} + 1,

\frac{m ^{*} - z _{i}}{b - z _{i}} \leq 1 + \frac{2 t}{2 k ^{2} n} \leq 1 + \frac{1}{n}

\frac{m ^{*} - z _{i}}{b - z _{i}} \leq 1 + \frac{2 t}{2 k ^{2} n} \leq 1 + \frac{1}{n}

\frac{f ( m ^{*} )}{f ( b )}

\frac{f ( m ^{*} )}{f ( b )}

M_{G} (x) := min (∣ g_{1} (x) ∣, ∣ g_{2} (x) ∣,, \dots, ∣ g_{t} (x) ∣) .

M_{G} (x) := min (∣ g_{1} (x) ∣, ∣ g_{2} (x) ∣,, \dots, ∣ g_{t} (x) ∣) .

O (t \cdot lo g lo g max (λ^{- 1}, 1) \cdot (T (lo g max (λ^{- 1}, 1))) .

O (t \cdot lo g lo g max (λ^{- 1}, 1) \cdot (T (lo g max (λ^{- 1}, 1))) .

M_{i}^{L} := min (\tilde{g}_{i}^{L} (m_{i}) ∣, ∣ \tilde{g}_{2}^{L} (m_{i}) ∣, \dots, ∣ \tilde{g}_{t}^{L} (m_{i}) ∣) \geq 4 \cdot 2^{- L} = 2^{- L + 2} .

M_{i}^{L} := min (\tilde{g}_{i}^{L} (m_{i}) ∣, ∣ \tilde{g}_{2}^{L} (m_{i}) ∣, \dots, ∣ \tilde{g}_{t}^{L} (m_{i}) ∣) \geq 4 \cdot 2^{- L} = 2^{- L + 2} .

2^{ℓ *- 1} \leq M_{D_{f}} (m^{*}) \leq λ \leq 2^{ℓ^{*} + 1}

2^{ℓ *- 1} \leq M_{D_{f}} (m^{*}) \leq λ \leq 2^{ℓ^{*} + 1}

∣ M_{D_{f}} (a) ∣ = 2^{- O (k (k l o g n + τ + l o g m a x (1, \frac{1}{r}) + n l o g m a x (1, a + r)))} .

∣ M_{D_{f}} (a) ∣ = 2^{- O (k (k l o g n + τ + l o g m a x (1, \frac{1}{r}) + n l o g m a x (1, a + r)))} .

∣ f (t) ∣

∣ f (t) ∣

x \in I_{2} in f ∣ f (x) ∣ > r \cdot ε \cdot 2^{- 2 τ - 1 - 2 l o g n - n l o g m a x (1, a + r)} .

x \in I_{2} in f ∣ f (x) ∣ > r \cdot ε \cdot 2^{- 2 τ - 1 - 2 l o g n - n l o g m a x (1, a + r)} .

x \in I_{i} in f ∣ f^{[k - i]} (x) ∣ > 2^{- τ - i \cdot (2 τ - 1 - 2 k l o g n - n l o g m a x (1, a + r))} \cdot j = 1 \prod i - 1 \frac{r}{2 ^{j}} .

x \in I_{i} in f ∣ f^{[k - i]} (x) ∣ > 2^{- τ - i \cdot (2 τ - 1 - 2 k l o g n - n l o g m a x (1, a + r))} \cdot j = 1 \prod i - 1 \frac{r}{2 ^{j}} .

\tilde{O} (k^{5} \cdot (k + lo g n) \cdot lo g n \cdot (k lo g n + τ + L + n lo g max (1, b_{0})))

\tilde{O} (k^{5} \cdot (k + lo g n) \cdot lo g n \cdot (k lo g n + τ + L + n lo g max (1, b_{0})))

M_{D_{f}} (p) = 2^{- O (ℓ + k (k l o g n + τ + L + n l o g m a x (1, b_{0})))} .

M_{D_{f}} (p) = 2^{- O (ℓ + k (k l o g n + τ + L + n l o g m a x (1, b_{0})))} .

w (L^{'}) \leq (2 + λ)^{∣ L ∣} \cdot max (2^{- L}, w (L)),

w (L^{'}) \leq (2 + λ)^{∣ L ∣} \cdot max (2^{- L}, w (L)),

T_{l} (Δ, K, F) : \frac{F ^{(l)} ( m ) r ^{l}}{l !} - K \cdot i \neq = l \sum \frac{F ^{(i)} ( m ) r ^{i}}{i !} > 0.

T_{l} (Δ, K, F) : \frac{F ^{(l)} ( m ) r ^{l}}{l !} - K \cdot i \neq = l \sum \frac{F ^{(i)} ( m ) r ^{i}}{i !} > 0.

L_{0} \leq 2 \cdot (max (1, lo g max (E_{ℓ}, E_{r})^{- 1}) + 4) .

L_{0} \leq 2 \cdot (max (1, lo g max (E_{ℓ}, E_{r})^{- 1}) + 4) .

\frac{∣ a _{i} ∣}{∣ a _{0} ∣}

\frac{∣ a _{i} ∣}{∣ a _{0} ∣}

\leq \frac{1}{i ! \cdot i !} \cdot \frac{k ^{4 k^{2} + 2 k}}{n ^{14 i - 15 k}} \leq \frac{1}{i ! \cdot i !} \cdot \frac{k ^{4 k^{2} + 2 k}}{n ^{6 i}}

\leq \frac{1}{5 ! \cdot n \cdot i !} \cdot \frac{k ^{4 k^{2} + 2 k}}{k ^{6 k^{2}}} \leq \frac{1}{120 \cdot n \cdot i !} \cdot (\frac{1}{k ^{2}})^{k^{2} - 2 k} < \frac{1}{128 n}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPolynomial and algebraic computation · Commutative Algebra and Its Applications · Numerical Methods and Algorithms

Full text

Efficiently Computing Real Roots of Sparse

Polynomials

Gorav Jindal

Michael Sagraloff

Max-Planck-Institut für Informatik

Germany

[email protected]

Max-Planck-Institut für Informatik

Germany

[email protected]

Abstract

We propose an efficient algorithm to compute the real roots of a sparse polynomial $f\in\mathbb{R}[x]$ having $k$ non-zero real-valued coefficients. It is assumed that arbitrarily good approximations of the non-zero coefficients are given by means of a coefficient oracle. For a given positive integer $L$ , our algorithm returns disjoint disks $\Delta_{1},\ldots,\Delta_{s}\subset\mathbb{C}$ , with $s<2k$ , centered at the real axis and of radius less than $2^{-L}$ together with positive integers $\mu_{1},\ldots,\mu_{s}$ such that each disk $\Delta_{i}$ contains exactly $\mu_{i}$ roots of $f$ counted with multiplicity. In addition, it is ensured that each real root of $f$ is contained in one of the disks. If $f$ has only simple real roots, our algorithm can also be used to isolate all real roots.

The bit complexity of our algorithm is polynomial in $k$ and $\log n$ , and near-linear in $L$ and $\tau$ , where $2^{-\tau}$ and $2^{\tau}$ constitute lower and upper bounds on the absolute values of the non-zero coefficients of $f$ , and $n$ is the degree of $f$ . For root isolation, the bit complexity is polynomial in $k$ and $\log n$ , and near-linear in $\tau$ and $\log\sigma^{-1}$ , where $\sigma$ denotes the separation of the real roots.

1 Introduction

1.1 Problem Definition and Contribution

In this paper, we study the problem of computing the real roots of a sparse polynomial

[TABLE]

where $e_{i}$ are non-negative integers, with $0\leq e_{1}<e_{2}<\ldots<e_{k}\leq n$ , and $2^{-\tau}\leq|f_{i}|\leq 2^{\tau}$ for all $i.$ We call such a polynomial $f$ an $(n,k,\tau)$ -nomial or simply a $k-$ nomial if $n$ and $\tau$ are either not specified or clear from the context. We may assume that $k\geq 2$ and $e_{1}=0$ as $1-$ nomials do not have any real root different from [math] and as $f\cdot x^{-e_{1}}$ has exactly the same roots as $f$ except for a possible root at [math]. We further assume that, as input, we receive the exponents $e_{i}$ as well as approximations $\tilde{f}_{i}$ of the non-zero coefficients $f_{i}.$ More specifically, we assume the existence of a coefficient oracle that, for any positive integer $\kappa$ , provides dyadic approximations $\tilde{f}_{i}=\frac{m_{i}}{2^{\kappa+1}},$ with $m_{i}\in\mathbb{Z}$ and $|f_{i}-\tilde{f}_{i}|<2^{-\kappa}$ for all $i$ . We call such an approximation $\tilde{f}=\sum_{i=1}^{k}\tilde{f}_{i}x^{e_{i}}$ an (absolute) $\kappa-$ bit approximations of $f$ . Notice that the numbers $n$ and $k$ are directly part of the input, whereas this is not the case for $\tau$ . However, we may easily compute (i.e. for a cost bounded by $\tilde{O}(k\tau)$ ) a good approximation $\tilde{\tau}\in\mathbb{Z}$ of $\tau$ with $\tau<\tilde{\tau}<\tau+2$ by asking the oracle for an $\kappa$ -bit approximations $\tilde{f}$ of $f$ for $\kappa=1,2,4,\ldots$ until $|\tilde{f}_{i}|>2^{-\kappa+1}$ for all $i$ . Then, $\tilde{\tau}:=\max_{i}\lceil|\log|\tilde{f}_{i}||\rceil$ fulfills the above inequality.

Within recent years, the problem of isolating all (real) roots of a (square-free) polynomial has attracted a lot of interest in the literature; e.g. consider [3, 9, 16] and the references therein. The most efficient algorithms [3, 10, 11, 16] for root isolation achieve running times that are considered to be near-optimal for dense polynomials (i.e. if $k$ is of comparable size as $n$ ) $f\in\mathbb{R}[x]$ . For polynomials with integer coefficients, the best known bound on the bit complexity of this problem is of size $\tilde{O}(n^{2}\tau)$ . The additional cost for refining isolating intervals to a size less than $2^{-\tau},$ and thus for computing $L$ -bit approximations of all real roots, is $\tilde{O}(n\tau)$ ; e.g. see [7, 10, 13, 16]. Notice that, for $k-$ nomials with integer coefficients, the above bounds are not polynomial in the size of the sparse input representation of $\tilde{f}$ , which is bounded by $O(k(\log n+\tau))$ as we need $\log n$ bits to store each exponent $e_{i}$ and $\tau+1$ bits to store each $f_{i}$ . Hence, it is natural to ask whether there exists an algorithm for either root isolation or approximation that runs in polynomial time in the size of the sparse input representation. In [6], Cucker et al. showed how to compute all integer roots of a sparse integer polynomial in polynomial time. Lenstra [8] further improves upon this result giving a polynomial time algorithm to compute all rational factors of $f$ of a fixed constant degree. Furthermore, for polynomials with only a very few non-zero coefficients, there exist polynomial time algorithms to approximate (and also count) the real roots of $f.$ Rojas and Ye [14, 18] propose an algorithm for $3$ -nomials that uses only $O(\log n)$ arithmetic operations in the field over $\mathbb{Q}$ generated by the coefficients of $f$ . Bastani et al. [2] propose a polynomial time algorithm to count the number of real roots for most $4-$ nomials.

For isolating the roots of a sparse integer polynomial, we recently proposed a method [15] that has polynomial arithmetic complexity and whose bit complexity is $\tilde{\Omega}(n\tau\cdot k^{4}).$ The latter bound is also near-optimal for small $k$ as there exists a family of Mignotte-like $4-$ nomials, for which the output complexity is always lower bounded by $O(n\tau).$ This result already rules out the existence of a polynomial time algorithm for isolating the roots of a sparse polynomial, however, it remains an open question whether counting the real roots or computing $L-$ bit approximations of the real roots can be achieved in polynomial time.

In this paper, we give a positive answer for a slight relaxation of the latter problem. That is, we give a polynomial time algorithm to compute a partial clustering of the roots that contains all real roots of $f$ . For a more precise statement, we need the following definitions, where $\Delta_{r}(m)\subset\mathbb{C}$ denotes the open disk in complex space with center $m$ and radius $r$ .

Definition 1 ( $(L,I)$ -covering).

For a polynomial $f$ as in (1.1), an integer $L\in\mathbb{N}$ , and an interval $I\subset\mathbb{R}$ , we call a list $((\Delta_{r_{1}}(m_{1}),\mu_{1}),(\Delta_{r_{2}}(m_{2}),\mu_{2}),\ldots,(\Delta_{r_{t}}(m_{t}),\mu_{t}))$ an * $(L,I)$ -covering for $f$ *if the following conditions are fulfilled:

The disks $\Delta_{r_{i}}(m_{i})$ are pairwise disjoint, $m_{j}$ are real values with $m_{1}<\cdots<m_{t}$ , and $r_{j}\leq 2^{-L}$ for all $j$ . 2. 2.

$\Delta_{r_{j}}(m_{j})$ contains exactly $\mu_{j}$ roots of $f$ for all $j$ . 3. 3.

For every real root $\xi$ of $f$ in $I,$ there exists some disk $\Delta_{r_{j}}(m_{j})$ that contains $\xi$ .

We further introduce a weaker version of $L$ -covering:

Definition 2 (Weak $(L,I)$ -covering).

A weak $(L,I)$ -covering for $f$ is a list $(I_{1},\ldots,,I_{t})$ of open disjoint and sorted real intervals that fulfills the following conditions:

The width of each interval $I_{j}$ is at most $2^{-L}.$ 2. 2.

For every real root $\xi$ of $f$ in $I$ , there exists an interval $I_{j}$ that contains $\xi.$

If $I=\mathbb{R},$ we omit $I$ and just call a (weak) * $(L,\mathbb{R})$ -covering for $f$ a (weak) * $L$ -covering for f. Then, our main contribution is a polynomial-time algorithm for computing an $L-$ covering:

Theorem 3.

For an $(n,k,\tau)$ -nomial, we can compute an $L$ -covering $\mathcal{L}$ of size $|\mathcal{L}|<2k$ in time $\tilde{O}(\mathrm{poly}(k,\log n)\cdot(\tau+L))$ .

Notice that our algorithm computes $L-$ bit approximations of all real roots but might also return (real-valued) $L-$ bit approximations of some non-real roots with a small imaginary part. Further notice that unless $\mu_{j}$ is odd, we also do not know whether $m_{j}$ actually approximates a real root, and unless $\mu_{j}=1,$ we cannot conclude that a disk $\Delta_{r_{j}}(m_{j})$ in an $L$ -covering is isolating for a root of $f$ . Hence, in general, our algorithm does not yield the correct number of distinct real roots. However, if $f$ has only simple roots, we may compute an $L-$ covering for $f$ for $L=2,4,8,\ldots$ until $\mu_{j}=1$ for all $j$ . Then, the disks $\Delta_{r_{i}}(m_{i})$ isolate all real roots.

Theorem 4.

Let $f$ be an $(n,k,\tau)$ -nomial with only simple real roots, and let $\sigma$ be the minimal distance between any two (complex) distinct roots of $f$ (i.e. the separation of $f$ ). Then, we can compute isolating intervals for all real roots in $\tilde{O}(\mathrm{poly}(k,\log n)(\tau+\log\max(1,1/\sigma)))$ bit operations.

We improve upon [15] in several ways. Namely, [15] only applies to integer polynomials, whereas our novel approach applies to polynomials with arbitrary real coefficients. In addition, the running time of the algorithm in [15] does not adapt to the actual hardness of the roots, whereas the complexity of our novel approach rather depends on the actual separation than on the worst-case bound [17] of size $2^{-O(n(\tau+\log n))}$ for the separation of an integer polynomial. In the worst case, our method isolates all real roots of a very sparse integer polynomial (i.e. $k=(\log(n\tau))^{O(1)}$ ) in time $\tilde{O}(n\tau)$ , and is thus near optimal.; see [15]

1.2 Overview of the Algorithm

Before we go into detail, we give a brief overview of our algorithm, where we omit technical details. We first remark that the problem of computing an $(L,[1,\infty))$ -covering can be reduced to the problem of computing an $(L,[0,1])$ -covering (in fact, we are computing an $(L,[0,1+1/n])$ -covering but this for technical reasons only) by means of the coordinate transformation $x\mapsto\frac{1}{x}$ followed by multiplication with $x^{n}$ . We may also reduce the problem of computing an $(L,(-\infty,0])$ -covering of $f$ to the problem of computing an $(L,[0,\infty))$ -covering by means of the coordinate transformation $x\mapsto-x$ . Hence, we are eventually left with merging $(L,[0,1])$ -coverings for the polynomials $f,$ $x^{n}\cdot f(1/x),$ $f(-x),$ and $x^{n}\cdot f(-1/x)$ in a suitable manner. We give details for this step in Section 7. Notice that the considered coordinate transformation preserves the sparseness of the input polynomial, hence we may concentrate on the problem of computing an $(L,[0,1])$ -covering for $f$ only. For this, we first compute a weak $(L,[0,1])$ -covering of $f$ , which is achieved by recursively computing weak $(L,[0,1])$ -coverings of the so-called *fractional derivatives of $f$ . *

Definition 5 (Fractional Derivatives).

Let $f$ be a polynomial as in (1.1). Then, we define $f^{[1]}:=\frac{f^{\prime}}{x^{e_{2}-1}}$ as the (first) fractional derivative of $f$ . In other words, we divide the first derivative $f^{\prime}$ of $f$ by the highest possible power of $x$ that divides $f^{\prime}$ . The $i-$ th fractional derivative $f^{[i]}$ of $f$ is then recursively defined as the first fractional derivative of $f^{[i-1]}.$ Notice that, for $i\leq k-1$ , $f^{[i]}$ is an $(n,k-i,\tau+k\cdot\log n)-$ nomial with a non-zero constant term and $f^{[i]}\equiv 0$ for $i\geq k.$ We further use the notation $\mathcal{D}_{f}$ to denote the tuple of all non-zero fractional derivatives $f,f^{[1]},f^{[2]},f^{[2]},\ldots,f^{[k-1]}$ , i.e, $\mathcal{D}_{f}=(f,f^{[1]},f^{[2]},f^{[3]},\ldots,f^{[k-1]})$ .

The general idea of recursively computing the real roots of $f$ from the real roots of its fractional derivatives has already been considered in previous work; e.g. [1, 4, 5, 8, 12, 14, 15]. The simple idea is that, given a weak $(L,[0,1])$ -covering $(I_{1}^{\prime},\ldots,I_{t^{\prime}}^{\prime})$ for $f^{[1]}$ , we already know that in between two consecutive intervals $I_{j}=(a,b)$ and $I_{j+1}=(c,d)$ , the polynomial $f$ is monotone, and thus there can be at most one real root in between $b$ and $c$ , which then must be simple. In order to check for the existence of such a root, it suffices to check whether $f$ changes signs at the points $b$ and $c.$ In case of a sign change, we may then refine the interval $(a,b)$ , which is known to be isolating for a real root of $f$ , to a width less than $2^{-L}.$ If we proceed in this way for all intervals in between two consecutive intervals as well as with the leftmost interval, whose endpoints111For technical reasons, we will indeed consider slight perturbations of [math] and $1$ in our algorithm. are [math] and the left endpoint of $I_{1}^{\prime}$ , and the rightmost interval, whose endpoints are the right endpoint of $I_{t^{\prime}}^{\prime}$ and $1,$ then we obtain a set of intervals $I_{j}^{\prime\prime}$ of size at most $2^{-L}$ that cover all real roots of $f$ that are contained in $[0,1]$ but in none of the intervals $I_{j}^{\prime}$ . Hence, the union of the intervals $I_{j}^{\prime}$ and $I_{j}^{\prime\prime}$ constitutes an $(L,[0,1])$ -covering for $f$ . This shows how to compute an $(L,[0,1])$ -covering for $f$ from recursively computing $(L,[0,1])$ -coverings for its fractional derivatives.

We remark that, in this simplistic description, we have omitted several key problems one faces when formalizing the algorithm: Evaluating the sign of a polynomial $f$ at given points $b,c$ may require a very high precision, which should be avoided to ensure a polynomial bit complexity. In addition, we need an efficient refinement method that uses only a polynomial number of iterations. For the latter problem, we use a slightly modified variant of our algorithm from [15, 16]. For the computation of the sign of $f$ (and its higher order fractional derivatives) at certain points, we consider an approach that allows us to slightly perturb the evaluation points such that the absolute value of each of the considered polynomials does not become too small. One major contribution of this paper, when compared to our previous work [15], is to show that this can be done in way such that the precision always stays polynomial in $\log n,$ $k,$ $\tau,$ and $L$ .

In the second step, we derive an $(L,[0,1])-$ covering from a weak $(L^{\prime},[0,1])-$ covering, where $L^{\prime}$ has been chosen sufficiently large. A straight forward approach would be to use a method for computing the number of roots in the one-circle region $\Delta(I)=\Delta_{r}(m)$ of each interval $I$ in the weak $(L^{\prime},[0,1])-$ covering. Here, $\Delta(I)$ is defined as the disk centered at the midpoint $m=m(I)$ of $I$ and passing through the endpoints of the interval. In the literature, several methods have been proposed to count the number of roots in a disk in complex space. Unfortunately, these algorithm are not sparsity aware, which rules out a straight-forward application of them. Recent work [3] introduces the so-called $T_{l}$ -test, a method for root counting based on Pellet’s Theorem. The method only needs to compute approximations of the coefficients of the polynomial $f(m+r\cdot x)$ , however, we cannot afford to compute all coefficients. Fortunately, in our situation, only the first $k^{2}$ coefficients are actually needed to determine the outcome of the test. In order to guarantee success of the test, it may further be necessary to merge some of the intervals in the weak covering and to consider disks that are larger than the one-circle regions of the merged intervals. This explains why we need a weak $(L^{\prime},[0,1])-$ covering with a sufficiently large $L^{\prime}>L.$ We consider our method for counting the roots of a sparse polynomial in a disk as the second main contribution of our paper.

2 On the Geometry of Roots

Descartes’ Rule of Signs states that the number $\mathrm{var}(F)$ of sign changes in the coefficient sequence of a polynomial $F\in\mathbb{R}[x]$ constitutes an upper bound on the number of real roots (counted with multiplicity). Hence, it follows immediately that a $k-$ nomial $f$ as in (1.1) has at most $k-1$ negative and at most $k-1$ positive real roots. Apart from this simple fact, $k$ -nomials have indeed much more structure on their roots, which we will briefly survey in this section.

Let $I=(a,b)$ be an interval, $F_{I}(x):=(x+1)^{n}\cdot F\left(\frac{ax+b}{x+1}\right)$ , and $v_{I}:=\mathrm{var}(F,I)$ be the number of sign changes in the coefficient sequence of the polynomial $F_{I}$ . Notice that there is a one-to-one correspondence between the roots of $F$ in $I$ and the positive real roots of $F_{I}$ via the Möbius transformation that maps a point $x\in\mathbb{C}\setminus\{-1\}$ to $\frac{ax+b}{x+1}\in\mathbb{C}$ . Thus, $v_{I}$ constitutes an upper bound on the number of roots of $F$ in $I.$ In fact, $v_{I}$ also constitutes a lower bound on the number of roots in the so called Obreshkoff lens $L_{n}$ of the interval $I$ . $L_{n}$ is defined as the intersection $L_{n}:=\overline{C}_{n}\cap\underline{C}_{n}$ of the two open disks $\overline{C}_{n},\underline{C}_{n}\subset\mathbb{C}$ that intersect the real axis in the endpoints $a$ and $b$ of $I$ , and whose centers see the line segment $(a,b)$ under the angle $\frac{2\pi}{n+2}$ . For an illustration, see [16, Fig. 1]. It further holds [15, 16]) that $\mathrm{var}(F,I)\leq\mathrm{var}(F)\leq k-1$ for any interval $I\subset\mathbb{R}^{+}$ , hence we conclude that the Obreshkoff lens $L_{n}$ of any such interval contains at most $k-1$ roots. For $b\mapsto\infty$ , the Obreshkoff lens $L_{n}$ of the interval $I=(0,b)$ converges to the cone $C_{n}$ whose boundary are the two half-lines starting at the origin and intersecting the real axis at an angle $\pm\frac{\pi}{n+2}$ ; see Figure 2.1. Hence, it follows that the interior of $C_{n}$ contains at most $k-1$ roots of any given $k-$ nomial of degree $n$ .

Theorem 6.

The cone $C_{n}$ contains at most $k-1$ roots of any $k$ -sparse polynomial of degree $n$ .

3 Polynomial arithmetic

Our algorithm only needs to perform basic operations on polynomials. In particular, we need to evaluate the sign of a given sparse polynomial at some points $x$ . As we already mentioned in the overview of our algorithm, the complexity of this operation becomes too large if the value of the polynomial at a given point $x$ is almost zero as then one needs to perform computations with a very high precision. Also, exact evaluation of a sparse polynomial at a rational point (even of small bitsize) is expensive as the output has bitsize linear in $n$ . Instead, we consider approximate evaluation, which allows us to evaluate a sparse polynomial $f$ as in (1.1) at an arbitrary point $x\in(0,1+1/n)$ to an absolute error less than $2^{-L}$ in a time that is polynomial222Notice that, for $c\in(0,1+1/n^{O(1)})$ , we may omit the term $n\log\max(1,|c|)$ in the bounds stated in Lemma 1. in $\log n,$ $k,$ $\tau,$ and $L.$ More precisely, we derive the following result:

Lemma 1.

Let $f\in\mathbb{R}[x]$ be an $(n,k,\tau)$ -nomial, $c$ be a positive real number, and $L$ a non-negative integer. Then, we can compute an $L$ -bit approximation $\lambda$ of $f(c)$ (i.e. $|\lambda-f(c)|<2^{-L}$ ) in a number of bit operations bounded by

[TABLE]

Proof.

In essence, we follow the same approach as in [7]. That is, for a fixed non-negative integer $K$ , we perform each occurring operation $\circ$ (i.e. either addition or multiplication) with fixed precision $K$ . More precisely, the input is initially rounded after the $K$ -th bit after the binary point. Then, in each of the following steps, we replace each exact operation $\circ$ between two numbers $a$ and $b$ by a corresponding approximate operation $\tilde{\circ}$ , where we define $a\tilde{\circ}b$ to be the result obtained by rounding $a\circ b$ after the $K$ -th bit after the binary point. Suppose that we have computed approximations $\tilde{a}=a+\varepsilon_{1}$ and $\tilde{b}=b+\varepsilon_{2}$ of two intermediate results $a$ and $b$ , where we assume that $\varepsilon:=\max(|\varepsilon_{1}|,|\varepsilon_{2}|,2^{-K})<1.$ Then, we have $|a\cdot b-\tilde{a}\tilde{\cdot}\tilde{b}|<|a|\cdot|\varepsilon_{1}|+|b|\cdot\varepsilon_{2}+|\varepsilon_{1}\varepsilon_{2}|+2^{-K}<4\cdot\varepsilon\cdot\max(1,|a|,|b|)$ and $|a+b-\tilde{a}\tilde{+}\tilde{b}|<|\varepsilon_{1}|+|\varepsilon_{2}|+2^{-K}$ . Hence, when evaluating one term $f_{i}\cdot x^{e_{i}}$ of $f$ at the point $x=c$ with absolute precision $K>L+\log k+1+\tau+(2\log n+1)\cdot(n\cdot\log\max(1,|c|)+2)$ via repeated squaring, we induce a total error $\varepsilon_{i}$ for the computation of $f_{i}\cdot c^{e_{i}}$ of size less than $2^{\tau}c^{n(2\log n+1)}\cdot 4^{2\log n+1}\cdot 2^{-K-L}<(2k)^{-1}\cdot 2^{-L}$ as there are at most $2\log n+1$ multiplications, and each (exact) intermediate result has absolute value bounded by $\max(2^{\tau},c^{n}).$ When eventually summing up the approximations of all terms $f_{i}\cdot x^{e_{i}}$ , we thus induce an error of size less than $\sum_{i}\varepsilon_{i}+k\cdot 2^{-K}<2^{-L}$ for the computation of the final result. The bound on the bit complexity follows from the fact that we need $O(k+\log n)$ arithmetic operations on integers of bitsize $O(K+\tau+\log k+n\log\max(1,|c|))=O(K)$ , and each such operation uses $\tilde{O}(K)$ bit operations.

∎

We already mentioned that evaluating the sign of a polynomial $f$ at a point $x$ might be costly if $f(x)$ has a small absolute value. In order to avoid such undesired computations, we first perturb $x$ in a suitable manner. That is, instead of evaluating the sign of $f$ at $x,$ we evaluate its sign at a nearby point, where $f$ becomes large enough. This can be done in a way such that the actual behavior of the algorithm does not change. We will call such points “admissible”. We remark that we already used this concept in previous work [15, 16]. Here, we modify the approach to choose an admissible point, where the sign of each fractional derivative of a sparse polynomial $f$ can be evaluated in polynomial time.

Definition 7 (Admissible point).

Let $g:\mathbb{R}\rightarrow\mathbb{R}$ be a function and $m[t;\delta]=\{m_{i}:=m+(i-k)\cdot\delta;i=0,1,\ldots,2t\}$ be a multipoint. We call a point $m^{*}\in m[t;\delta]$ to be $(g,m[t;\delta])$ -admissible if $\left|g(m^{*})\right|\geq\frac{1}{8}\cdot\max_{x\in m[t;\delta]}\left|g(x)\right|$ .

If $t$ and $\delta$ (or even $m$ and $g$ ) are clear from the context, we simply call a $(g,m[t;\delta])$ -admissible point $(g,m)$ -admissible (or just admissible). Since the value of a polynomial $g$ at an admissible points is “relatively large”, we expect that $g$ has no root in a neighborhood of an admissible point. The following lemma formalizes this intuition.

Lemma 2.

Suppose that $m\in\mathbb{R}_{+}$ and $m^{*}\in m[t;\delta]$ is an $(f,m[t;\delta])$ -admissible point for an $(n,k,\tau)$ -nomial $f$ with $k\geq 2$ and $k\leq t\leq k^{2}$ . Further assume that $\frac{m}{\delta}>4k^{2}n^{2}$ . Then, the disk $\Delta_{\frac{\delta}{k^{4k}}}(m^{*})$ does not contain any root of $f$ .

Proof.

Let $z_{1},\ldots,,z_{n}$ denote the complex roots of $f.$ Since $f(x)$ has at most $k-1$ roots in the cone $C_{n}$ (see Figure 2.1 ) and $t\geq k$ , there exists a point $b\in m[t;\delta]$ such that $\Delta_{\delta/2}(b)$ does not contains any of these $k-1$ roots.

By way of contradiction assume that there is a root $z_{l}$ of $f$ in the ball of radius $\frac{\delta}{k^{4k}}$ around $m^{*}$ .

We have

[TABLE]

By using the triangle inequality for the distance between $m^{*}$ and $z_{i}$ , one can see that, for the roots $z_{i}\neq z_{l}$ that are contained in $C_{n}$ , we have

[TABLE]

whereas $\frac{m^{*}-z_{l}}{b-z_{l}}<2\cdot k^{-4k}.$ Now, consider the roots $z_{i}$ of $f$ that are outside of $C_{n}$ . Since $\frac{m}{\delta}>4k^{2}n^{2}$ , it follows that the distance of $m^{*}$ to any such $b$ is at at least $32k^{2}\delta n$ .

Again using the triangle inequality for distance between $m^{*}$ and $z_{i}$ , this implies that

[TABLE]

Hence

[TABLE]

This contradicts the fact that $m^{*}$ is an admissible point. ∎

Definition 8.

Let $\mathcal{G}=(g_{1},g_{2},\ldots,g_{t})$ be a tuple of $t$ functions $g_{i}:\mathbb{R}\rightarrow\mathbb{R}$ . Then, $M_{\mathcal{G}}(x)$ is defined as follows:

[TABLE]

For a fixed real $x$ , we call $\mathcal{\tilde{G}}(x)=(\tilde{g}_{1}(x),\tilde{g}_{2}(x),\ldots\tilde{g}_{t}(x))$ an $L$ -approximation of $\mathcal{G}(x)$ if $\left|\tilde{g}_{i}(x)-g_{i}(x)\right|\leq 2^{-L}$ for all $i$ .

We first show how to compute an admissible point $m^{*}\in m[t;\delta]$ for $M_{\mathcal{G}}(x)$ under the assumption that we can compute an $L$ -approximation of $\mathcal{G}(x)$ for any $x\in m[t;\delta]$ in time $T(L)$ .

Lemma 3.

Let $\mathcal{G}=(g_{1},g_{2},\ldots,g_{t})$ be as in Definition 8, $m[t;\delta]$ a multipoint and $m_{i}:=\max_{a\in m[t;\delta]}\left|M_{\mathcal{G}}(a)\right|$ . Suppose that for for a point $x\in m[t;\delta]$ we can compute an $L$ -approximation of $\mathcal{G}(m_{i})$ in time $T(L,x)$ , then we can compute an $(M_{\mathcal{G}},m[t;\delta])$ -admissible point $m^{*}\in m[t;\delta]$ in time

[TABLE]

Within the same time, we may compute an integer $\ell^{*}$ with $2^{\ell^{*}-1}\leq\left|M_{\mathcal{G}}(m^{*})\right|\leq\lambda\leq 2^{\ell^{*}+1}.$

Proof.

We proceed in the same fashion as in Lemma 8 of [16]. For $L=1,2,4,\ldots,$ , we compute $L$ -approximations $\tilde{\mathcal{G}}_{i}^{L}=(\tilde{g}_{1}^{L}(m_{i}),\tilde{g}_{2}^{L}(m_{i}),\ldots\tilde{,g}_{t}^{L}(m_{i}))$ of $\mathcal{G}(m_{i})$ for all points $m_{i}\in m[t;\delta]$ until the following condition is satisfied for at least one $m_{i}$ :

[TABLE]

Then, let $i_{0}$ be the index such that $M_{i_{0}}^{L}$ is maximal among all $M_{i}^{L}$ , and let $\ell^{*}$ be an integer such that $\left|\ell^{*}-\log M_{i_{0}}^{L}\right|\leq\frac{1}{2}$ . We output $\ell^{*}$ and $m^{*}:=m_{i_{0}}$ . It is now straight-forward (cf. the proof of Lemma 8 in [16]) to show that $2^{\ell^{*}-1}\leq M_{\mathcal{\mathcal{G}}}(m^{*})\leq\lambda\leq 2^{\ell^{*}+1}$ .

Following the above approach, we must succeed for an $L\leq 2\log\max(\frac{1}{\lambda},1)$ . Since we double $L$ at most $O(\log\log\max(\frac{1}{\lambda},1))$ many times and since we approximately evaluate the functions $g_{i}$ at $t$ points, the stated complexity bound follows. ∎

We now apply the above lemma to the special case where $\mathcal{G}=\mathcal{D}_{f}$ is the sequence of fractional derivatives of $f$ . Then, Lemma 1 yields a bound of the bit complexity of computing $L$ -approximations of $\mathcal{D}_{f}(m_{i})$ for all points $m_{i}\in m[t;\delta]$ .

Corollary 9.

Assume that $f(x)$ is a $(n,k,\tau)$ -nomial, $m[t;\delta]$ a multipoint and $\lambda:=\max_{m_{i}\in m[t;\delta]}\left|M_{\mathcal{D}_{f}}(m_{i})\right|$ . Further assume that $m[t;\delta]\subset(0,\alpha)$ for some positive real number $\alpha$ . Then, we can determine an $(M_{\mathcal{D}_{f}},m[t;\delta])$ -admissible point $m^{*}$ and an integer $\ell^{*}$ with

[TABLE]

using $\tilde{O}(t\cdot k\cdot(k+\log n)\cdot(\tau+k\log n+n\log\max(1,\alpha)+\log\max(1,\lambda^{-1})))$ many bit operations.

Notice that the running time of the above algorithm depends on the value $\lambda:=\max_{m_{i}\in m[t;\delta]}\left|M_{\mathcal{D}_{f}}(m_{i})\right|$ . We will now derive a bound on $\lambda$ that shows that, for a sufficiently large $t$ and suitably chosen $m$ and $\delta$ , we can always compute an $(M_{\mathcal{D}_{f}},m[t;\delta])$ -admissible point $m^{*}$ in polynomial time.

Lemma 4.

Let $f\in\mathbb{R}[x]$ be a $(n,k,\tau)$ -nomial as in (1.1), and let $a,r$ be positive real numbers with $r<a$ and such that $(a-r,a+r)$ does not contain any real root of any fractional derivative of $f(x)$ . Then, it holds that

[TABLE]

Proof.

We may assume that $r$ is small enough to guarantee that $\frac{a}{r}>2n$ . This implies that, for any two points $x,x^{\prime}\in I_{1}:=(a-r,a+r)$ , we have that $x/x^{\prime}\in(1-1/n,1+1/n)$ . Now, let us write $f=c+x^{j}\cdot g$ with a constant $c$ of absolute value at least $2^{-\tau}$ and $g$ an $(n-j,k-1,\tau+\log n)-$ nomial that is not divisible by $x.$ Then, it holds that $f^{[1]}=j\cdot g+x\cdot g^{\prime}$ , and thus $f^{\prime}=x^{j-1}\cdot f^{[1]}$ . In addition, since $I_{1}:=(a-r,a+r)$ does not contain any root of $f$ and $f^{[1]}$ , it follows that $f$ is monotone on $I$ and only takes positive or negative values. This implies that $|f(t)-f(t^{\prime})|=||f(t)|-|f(t^{\prime})||$ for all $t,t^{\prime}\in I$ . In addition, for any $t\in I_{2}:=(a-r/2,a+r/2)$ , we can choose a point $t^{\prime}=t\pm r/2$ such that $|f(t)|>|f(t^{\prime})|$ . Now, according to the mean value theorem, there exists a $\xi$ in between $t$ and $t^{\prime}$ with $f(t)-f(t^{\prime})=(t-t^{\prime})\cdot f^{\prime}(\xi)=\frac{r}{2}\cdot\xi^{j-1}\cdot f^{[1]}(\xi)$ . Hence, we obtain $|f(t)|>|f(t)|-|f(t^{\prime})|=||f(t)|-|f(t^{\prime})||=|f(t)-f(t^{\prime})|\geq\frac{r}{2}\cdot\xi^{j-1}\cdot f^{[1]}(\xi)\geq\frac{r}{8}\cdot t^{j-1}\cdot f^{[1]}(\xi),$ where the latter inequality follows from $(\xi/t)^{j-1}>(1-1/n)^{n}>1/2$ . Also, $|f(t)|\geq|c|-t^{j}\cdot|g(t)|\geq 2^{-\tau}-t^{j-1}\cdot k\cdot 2^{\tau+\log n}\cdot\max(1,a+r)^{n}$ . With $\varepsilon:=\min(1,\inf_{x\in I_{1}}|f^{[1]}(x)|)$ , the above inequalities thus imply that

[TABLE]

Now, if $t^{j-1}<2^{-\tau-1}(k2^{\tau+\log n}\cdot\max(1,a+r)^{n})^{-1}$ , then the second argument in the above term becomes larger than $2^{-\tau-1}$ . Otherwise, the first term becomes larger than $\frac{r\varepsilon}{8}\cdot 2^{-\tau-1}(k2^{\tau+\log n}\cdot\max(1,a+r)^{n})^{-1}$ . Hence, we conclude that

[TABLE]

We now recursively apply the above result to the fractional derivatives $f^{[k-i]}$ and the intervals $I_{i}:=(a-\frac{r}{2^{i-1}},a+\frac{r}{2^{i-1}})$ , where $i=1,2,\ldots,k$ . Notice that each of the polynomials is an $(n,k,\tau+k\log n)-$ nomial and that $\inf_{x\in I_{1}}|f^{[k-1]}(x)|>2^{-\tau}$ as $f^{[k-1]}$ is a constant of absolute value at least $2^{-\tau}$ . Hence, it follows that

[TABLE]

∎

Combining the above lemma and Corollary 9 now yields

Theorem 10.

Let $f$ be a $(n,k,\tau)$ -nomial as in (1.1), and let $m[t;\delta]$ be a multipoint with $t\geq k^{2}$ and $m[t;\delta]\subset(0,\alpha)$ for some for some real number $\alpha$ . Then, we can compute an $(M_{\mathcal{D}_{f}},m[t;\delta])$ -admissible point $m^{*}$ using $\tilde{O}(t\cdot k^{2}\cdot(k+\log n)\cdot(k\log n+\tau+\log\max(1,\frac{1}{\delta})+n\log\max(1,\alpha)))$ bit operations.

Proof.

Since each fractional derivative of $f$ has at most $k-1$ positive real roots and since $t\geq k^{2}$ , there exists an $a\in m[t;\delta]$ such that $(a-\delta/2,a+\delta/2)$ does not contain any real root of any of fractional derivative. Hence, Lemma 4 implies that $\lambda:=\max_{x\in m[t;\delta]}|M_{\mathcal{D}_{f}}(x)|\geq|M_{\mathcal{D}_{f}}(a)|$ is lower bounded by $2^{-O(k(k\log n+\tau+\log\frac{1}{\delta}+n\log\max(1,a+\delta)))}$ . Corollary 9 then yields the claimed bound on the running time. ∎

4 Refinement

A crucial subroutine of our overall algorithm is an efficient method for refining an interval $I_{0}=(a_{0},b_{0})\subset\mathbb{R}_{+}$ , with $\max(|\log a_{0}|,|\log b_{0}|)=O(\tau)$ , that is known to be isolating for a simple real root of a $k$ -nomial $f$ . It is assumed that the algorithm receives the sign of $f$ at the endpoints of $I_{0}$ as additional input. For the refinement, we consider the algorithm NewRefine from Section 3 in [15] (see also Section 5 in [16]), however, we make a single (minor) modification. As the argument from [15] directly applies, we only state the main results and refer the reader to [15] for details.

NewRefine recursively refines $I_{0}$ to a size less than $2^{-L}$ using a trial and error approach that combines Newton iteration and bisection. For this, only $f$ and its first derivative $f^{\prime}$ need to be evaluated. More precisely, in each iteration, the algorithm computes $(f,m[\lceil k/2\rceil;\delta])-$ admissible points $m^{*}$ for a constant number of points $m\in I$ and a corresponding $\delta$ of size $2^{-O(\tau+\log n+L)}$ . In addition, $f$ and $f^{\prime}$ are evaluated at these admissible points to an absolute precision that is bounded by $O(\log\max(1,|f(m^{*})|^{-1})+\log n+L+\tau)$ . Each endpoint of the interval returned by NewRefine is then either one of the admissible points computed in a previous iteration or one of the endpoints of $I_{0}$ .

We now propose the following modification of NewRefine, which we denote NewRefine∗: Whenever NewRefine asks for an $(f,m[\lceil k/2\rceil;\delta])-$ admissible point $m^{*}$ , we compute an $(M_{\mathcal{D}_{f}},m[k^{2};\delta^{\prime}])-$ admissible point $m^{*}$ , with $\delta^{\prime}=\delta\cdot\frac{\lceil k/2\rceil}{k^{2}}$ , instead. Then, the same argument333The argument in [15] only uses that, in each iteration, we choose an arbitrary point $m^{*}\in[m-\lceil k/2\rceil\cdot\delta,m+\lceil k/2\rceil\cdot\delta]$ . as in [15] yields:

Theorem 11.

For refining $I_{0}$ to a size less than $2^{-L}$ , the algorithm NewRefine∗ needs $O(k\cdot(\log n+\log(\tau+L)))$ iterations. In each iteration, we need to compute a constant number of $(M_{\mathcal{D}_{f}},m[k^{2};\delta^{\prime}])-$ admissible points $m^{*}$ , with $m[k^{2};\delta^{\prime}]\subset I_{0}$ and $\delta^{\prime}=2^{-O(\tau+\log n+L)}$ . In addition, the polynomials $f$ and $f^{\prime}$ have to evaluated at $m^{*}$ to an absolute precision bounded by $O(\log\max(1,|f(m^{*})|^{-1})+\log n+L+\tau)$ .

Combining Theorems 11 and 10, we obtain a bound on the complexity of refining $I_{0}$ to a size less than $2^{-L}$ :

Corollary 12.

For refining $I_{0}$ to a size less than $2^{-L}$ , the algorithm NewRefine∗ needs

[TABLE]

bit operations. For each endpoint $p$ of the interval returned by NewRefine, it holds that

[TABLE]

with $\ell:=\log\min(1,M_{\mathcal{D}_{f}}(a_{0}),M_{\mathcal{D}_{f}}(b_{0}))^{-1}$ .

5 Computing a Weak Covering

We now describe how to compute a weak * $(L,[0,1+1/n])$ -*covering for a given $(n,k,\tau)$ -nomial $f$ in polynomial time. We first compute an upper bound $\tilde{\tau}\in\mathbb{Z}$ for $\tau$ with $\tau\leq\tilde{\tau}\leq\tau+2$ , and define $\delta:=\min(2^{-2\tau-2},1/n)\cdot k^{-2}$ . Then, in the first step, we compute $(M_{\mathcal{D}_{f}},m[k^{2};\delta])-$ admissible points $a^{*}$ and $b^{*}$ for $m:=2^{-2\tau-2}$ and $m:=1+2/n$ , respectively. Then, we follow the approach as outlined in the first part of Section 1.2 to compute a weak $(L,[a^{*},b^{*}])$ -covering for $f$ , where we use the algorithm NewRefine∗ from the previous Section to refine isolating intervals for the roots of the fractional derivatives of $f$ to a size less than $2^{-L}$ . The so obtained covering is indeed also a weak * $(L,[0,1+1/n])$ -*covering for $f$ , which follows from the fact that $b^{*}\geq 1+1/n$ and each positive root of $f$ is lower bounded by $(1+\max_{i=1}^{k}|f_{i}|/|f_{1}|)^{-1}$ due to Cauchy’s root bound [17]. For details, consider the exact definition of Algorithm 1.

Correctness of the algorithm follows directly from our considerations in Section 1.2. Further notice that, for each $i$ in the outermost for-loop of the algorithm, we add at most $k-i-1$ intervals to $W_{i}$ to obtain $W_{i+1}$ as $f^{[i]}$ has at most $k-i-1$ positive real roots. Hence, each list $W_{i}$ contains at most $k^{2}$ many intervals. It remains to bound the running time of Algorithm 1. The proof of the following Lemma follows in a straight forward manner from Theorem 10, Corollary 12, and the fact that we need to call the refinement algorithm at most $k$ times for each fractional derivative.

Lemma 5.

Algorithm 1 computes a weak $(L,[0,1+\frac{1}{n}])$ -covering for $f$ consisting of at most $k^{2}$ many intervals. Its bit complexity is $\tilde{O}(k^{7}(k+\log n\cdot(k\log n+\tau+L)\cdot)\log n).$

Proof.

According to Theorem 10, the cost for computing $a^{*}$ and $b^{*}$ is bounded by $\tilde{O}(k^{4}(k+\log n)(k\log n+\tau))$ . The refinement algorithm is called at most $k^{2}$ many times for refining the roots of the fractional derivatives. Corollary 12 thus yields a bound of size $\tilde{O}(k^{7}\cdot(k+\log n)(k\log n+\tau))$ for the bit complexity of the refinements. The computations of the signs of the factional derivative $f^{[i]}$ at the endpoints of the intervals in $W_{i+1}$ is dominated by this bound as the refinement algorithm returns intervals whose endpoints are admissible with respect to $M_{\mathcal{D}_{f}}$ . Thus, the computation of each such admissible point already yields the sign of all fractional derivative at this point. ∎

In order to further process a weak $(L,[0,1+1/n])$ -covering for $f$ , we need the intervals in the weak covering to be well separated. For given $L,\lambda\in\mathbb{N}_{0}$ , we say that a list $\mathcal{L}$ of intervals is $(L,\lambda)$ -separated if the distance $\operatorname{dist}(I,J)$ between $I$ and its neighboring intervals is at least $\min(2^{-L},\lambda\cdot w(I))$ . Notice that, starting from an arbitrary list $\mathcal{L}$ of intervals, we can always deduce an $(L,\lambda)$ -separated list $\mathcal{L}^{\prime}$ from $\mathcal{L}$ in a way such that each interval in $\mathcal{L}$ is contained in an interval from $\mathcal{L}^{\prime}$ . Namely, this can be achieved by recursively merging pairs of intervals $I,J\in\mathcal{L}$ that violate the above condition until the actual list is $(L,\lambda)$ -separated. It is easy to see that

[TABLE]

where $w(\mathcal{L})$ and $w(\mathcal{L}^{\prime})$ denote the maximal width of an interval in $\mathcal{L}$ and $\mathcal{L}^{\prime}$ , respectively. Hence, by first computing a weak $(L^{\prime},[0,1+1/n])$ -covering $\mathcal{L}$ , with $L^{\prime}=L+k^{2}\cdot\log(2+\lambda)$ and $|\mathcal{L}|=O(k^{2})$ , and then recursively merging the intervals, we obtain a weak $(L,[0,1+1/n])-$ covering for $f$ that is also $(L,\lambda)$ -separating and whose intervals have width at most $2^{-L}$ . From Lemma 5, we thus conclude:

Corollary 13.

For any $\lambda,L\in\mathbb{N}_{0}$ , we can compute a $(L,\lambda)$ - separating weak $(L,[0,1+1/n])-$ covering for $f$ in $\tilde{O}(k^{7}(k+\log n)\cdot(k\log n+\tau+L+k^{2}\log(2+\lambda))\cdot\log n)$ bit operations.

6 $T_{l}$ -test

In the previous section, we have shown how to compute a weak $(L,[0,1+1/n))$ -covering of a given $(n,k,\tau)$ -nomial $f$ . Now, we aim to convert this weak covering to a covering of $f$ . For this, we need an algorithm to count the number of roots of $f(x)$ contained in a given disk. Recent work [3] introduces a simple corresponding algorithm, denoted $T_{l}$ -test, which is based on Pellet’s Theorem. More precisely, for an arbitrary polynomial $F\in\mathbb{C}[x]$ , a disk $\Delta=\Delta_{r}(m)\subset\mathbb{C}$ , and a parameter $K\geq 1$ , we consider the inequality

[TABLE]

Hence, we check whether the absolute value of the $l$ -th coefficient $a_{l}$ of $F_{\Delta}(x)=f(m+rx)=\sum_{i=0}^{n}a_{i}x^{i}$ dominates the sum of the absolute values of all remaining coefficients weighted by the parameter $K$ . We say that $T_{l}(\Delta,K,F)$ succeeds if the above inequality is fulfilled. Otherwise, we say that it fails. In case of success (for any $K\geq 1$ ), $\Delta$ contains exactly $l$ roots of $F$ counted with multiplicity, whereas we have no information in case of a failure. However, in [3], we derive sufficient conditions on the success of the $T_{l}$ -test:

Theorem 14.

[[3], Corollary 1] Let $F\in\mathbb{C}[x]$ be a polynomial of degree $n$ , and $\Delta_{r}(m)$ be a disk. If $\Delta_{r}(m)$ as well as the enlarged disk $\Delta_{256n^{5}r}(m)$ contain $l$ roots of $F$ counted with multiplicity, then $T_{l}(\Delta_{16nr}(m),\frac{3}{2},F)$ succeeds.**

Unfortunately, the above test has two major drawbacks when dealing with sparse polynomials. First, we need to compute the coefficients $F_{\Delta}$ exactly, which we cannot afford as the bitsize of each coefficient is at least linear in $n$ . Second, an even more severe, there are $n$ coefficients to be computed. Hence, using the above approach directly to count the number of roots of a sparse polynomial $f$ does not work. Instead, we propose two modifications to overcome these issues. The first modification, namely to use approximate (in a proper manner) instead of exact arithmetic, has already been considered in previous work. However, the second modification is more subtle. It exploits the fact that, for a suitably chosen disk centered at some admissible point, only the first $k^{2}$ coefficient are relevant for the outcome of the above test.

We first go into details with respect to our first modification. Let us define $E_{\ell}:=|a_{l}|$ and $E_{r}:=K\cdot\sum_{i\neq l}|a_{i}|$ the expressions on the left and right hand side of the inequality in (6.1). We aim to check whether $E_{\ell}-E_{r}>0$ or not. In general, if a predicate $\mathcal{P}$ is of the latter form $\mathcal{P}=(E_{\ell}-E_{r}>0)$ with two (computable) expressions $E_{\ell}$ and $E_{r}$ , you can compute approximations $\tilde{E}_{\ell}$ and $\tilde{E}_{r}$ of $E_{\ell}$ and $E_{r}$ with $|\tilde{E}_{\ell}-E_{\ell}|<2^{-L}$ and $|\tilde{E}_{r}-E_{r}|<2^{-L}$ for $L=1,2,4,\ldots$ For a certain $L$ , you may then try to compare $E_{\ell}$ and $E_{r}$ taking into account their corresponding approximations and the approximation error. Eventually (i.e. for a sufficiently large $L$ ), you either succeed, in which case you can return the sign, or assert that $E_{\ell}$ and $E_{r}$ are good approximations of each other. In the latter case, you just return a flag called Undecided. In short, this is the idea of so-called soft-predicates. For details, we refer to [3].

Notice that, in cases where $E_{\ell}$ considerably differs from $E_{r}$ , the soft predicate $\tilde{\mathcal{P}}$ allows us to compute the sign of $\mathcal{P}$ without the need of exact arithmetic. In all other cases (i.e. if it returns Undecided), we know at least that $E_{\ell}$ and $E_{r}$ are good approximations of each other. We remark that, in [3], the above soft predicate $\tilde{\mathcal{P}}$ was only described for $\delta=\frac{1}{2}$ , however, it easily generalizes to any constant $\delta$ . In [3, Lem. 2], it has been shown that, for any constant $\delta$ , Algorithm 2 needs an $L_{0}$ -bit approximation of $E_{\ell}$ and $E_{r}$ with $L_{0}$ bounded by

[TABLE]

In [3], we considered a soft-variant of the $T_{l}$ -test, where we compared the expressions $E_{\ell}:=|a_{l}|$ and $E_{r}:=\sum_{i\neq l}|a_{i}|$ . Now, we apply the above soft-predicate to the expressions $E_{\ell}:=a_{l}$ and $E_{r}:=\sum_{i\neq l}^{i\leq k^{2}}|a_{i}|$ , that is, we replace the entire sum $\sum_{i\neq l}|a_{i}|$ by its truncation after the first $k^{2}$ terms. However, we will make the assumption that the truncated sum is upper bounded by $\frac{|a_{0}|}{128}$ ; see Algorithm 3. This might look haphazardly at first sight, however, we will later see that the latter condition is always fulfilled for a $k$ -nomial $F$ and a suitable disk $\Delta_{r}(m)$ centered at an admissible point.

Lemma 6.

For a disk $\Delta:=\Delta_{r}(m)\subset\mathbb{C}$ , the $\tilde{T}_{l}$ -test needs to compute $L$ -bit approximations of $E_{\ell}$ and $E_{r}$ with $L\leq L(m,r,f):=2\cdot\left(5+\log n-\log\max_{i}|a_{i}|\right).$ If $T_{l}(\Delta,\frac{3}{2},f)$ succeeds, then the $\tilde{T}_{l}$ -test returns True. Running Algorithm 3 for all $l=0,\ldots,k$ uses a number of bit operations upper bounded by $\tilde{O}(k^{2}\cdot(k+\log n)(L(m,r,f)+\tau+n\log\max(1,m)+k^{2}\cdot(\log n+\log\max(1,r)))).$

Proof.

From the assumption, it follows that $\max_{i=0,\ldots,n}|a_{i}|=\max_{i=0,\ldots,k^{2}}|a_{i}|\leq\frac{1}{2}\cdot\max(|E_{\ell}|,|E_{r}|)$ . This yields the claimed bound on the absolute error to which $E_{\ell}$ and $E_{r}$ need to be computed. We now prove correctness. If the algorithm returns True, then $E_{\ell}>E_{r}$ , and thus $|a_{l}|>\frac{65}{64}\cdot\sum_{i\neq l}^{i\leq k^{2}}|a_{i}|.$ If $l=0$ , then $\sum_{i\neq 0}|a_{i}|<\frac{64}{65}\cdot|a_{0}|+\frac{1}{128}\cdot|a_{0}|<|a_{0}|$ . Otherwise, we have $|a_{l}|>\frac{65}{64}\cdot\sum_{i\neq l}^{i\leq k^{2}}|a_{i}|\geq\sum_{i\neq l}^{i\leq k^{2}}|a_{i}|+\frac{1}{64}\cdot|a_{0}|\geq\sum_{i\neq l}^{i\leq n}|a_{i}|$ . Hence, in both cases, $T_{l}(\Delta,1,f)$ succeeds, which implies that $\Delta$ contains exactly $l$ roots.

Now, suppose that $T_{l}(\Delta,\frac{3}{2},f)$ succeeds. If the $\tilde{T}_{l}$ -test returns Undecided, then $\frac{128}{129}\cdot E_{\ell}<E_{r}\leq\frac{129}{128}\cdot E_{\ell}$ . On the other hand, we have $|a_{l}|>\frac{3}{2}\sum_{i\neq l}^{\leq n}|a_{i}|\geq\frac{3}{2}\sum_{i\neq l}^{\leq k^{2}}|a_{i}|$ , and thus $E_{\ell}>\frac{3}{2}E_{r}$ , which contradicts the fact that $\frac{128}{129}\cdot E_{\ell}<E_{r}$ . If the $\tilde{T}_{l}$ -test returns False, a similar argument yields a contradiction as well. This shows that success of $T_{l}$ implies that $\tilde{T}_{l}$ returns True. It remains to show the claimed bounds on the bit complexity. It suffices to estimate the cost for computing an $L(m,r,f)$ -bit approximations of $E_{\ell}$ and $E_{r}$ . The $i$ -th coefficient $a_{i}$ , with $i\leq k^{2}$ , can be computed by evaluating the $(n,k,\tau+k^{2}\cdot(\log n+\log\max(1,r)))$ -nomial $g_{i}=f^{(i)}(x)r^{i}/i!$ at $x=m$ . In order to compute $L(m,r,f)$ -bit approximations of $E_{\ell}$ and $E_{r}$ , we need to compute an $(L(m,r,f)+2\log k)$ -bit approximation of each $g_{i}(m)$ , for $i=0,\ldots,k$ . According to Lemma 1, this can be done using $\tilde{O}(k^{2}\cdot(k+\log n)(L(m,r,f)+n\log\max(1,m)+\tau+k^{2}\cdot(\log n+\log\max(1,r)))$ bit operations. ∎

Notice that, in order to actually use the $\tilde{T}_{l}$ -test for counting the roots in a disk $\Delta$ , we need two conditions to be satisfied. First, we need the condition $\sum_{i>k^{2}}|a_{i}|\leq\frac{|a_{0}|}{128}$ to be true. Second, we need to satisfy the preconditions of the $T_{l}$ -test.

Theorem 15.

Let $f$ be a $(n,k,\tau)$ -nomial as in (1.1), let $\Delta:=\Delta_{r}(m)$ be a disk centered at some $m\in\mathbb{R}_{>0}$ with $\frac{m}{r}>n^{16}$ , and let $f_{\Delta}(x)=\sum_{i=0}^{n}a_{i}\cdot x^{i}$ . Further suppose that $\Delta_{\frac{r}{k^{4k+2}}}(m)$ does not contain any roots of $f$ . Then, it holds that $\sum_{i>k^{2}}|a_{i}|\leq\frac{|a_{0}|}{128}$ .

Proof.

Let $z_{1},z_{2},\ldots,z_{n}$ be the complex roots of $F(x)$ , then $\frac{a_{i}}{a_{0}}=\frac{F^{(i)}(m)}{F(m)\cdot i!}\cdot r^{i}=\frac{r^{i}}{i!}\cdot\sum_{(j_{1},j_{2},\ldots,j_{i})}\frac{1}{\prod_{\ell=1}^{i}(m-z_{j_{\ell}})},$ where we sum over all tuples $(j_{1},j_{2},\ldots,j_{i})$ with distinct entries $j_{s}$ , $1\leq j_{s}\leq n$ . For a fixed tuple $(j_{1},j_{2},\ldots,j_{i})$ , at most $k$ of the $i$ roots $z_{j_{1}},z_{j_{2}},\ldots,z_{j_{i}}$ can appear in the corresponding term of the above sum. At most $k$ of these roots are contained in the code $C_{n}$ as defined in Figure 2.1, whereas the remaining $i-k$ roots are located outside of $C_{n}$ . Since $\frac{m}{r}>n^{16}$ , the distance from $m$ to any of these roots is at least $n^{15}r$ . Also, since $\Delta_{\frac{r}{k^{4k}}}(m)$ does not contain any roots of $F(x)$ , distance of $m$ from the roots in $C_{n}$ is at least $\frac{r}{k^{4k}}$ . Thus, we get $\sum_{(j_{1},j_{2},\ldots,j_{i})}\frac{1}{\prod_{\ell=1}^{i}|m-z_{j_{k}}|}\leq{n\choose i}\cdot\frac{k^{4k^{2}}}{r^{k}\cdot(n^{5}r)^{i-k}}.$ Hence, for $i>k^{2}$ , we get

[TABLE]

Hence, summing up over all $i>k^{2}$ proves the claim. ∎

The following Corollary is now an immediate consequence of the above theorem and Lemma 15.

Corollary 16.

Let $f(x)\in\mathbb{R}[x]$ be as in $(n,k,\tau)$ -nomial as in (1.1). Let $m,r\in\mathbb{R}^{+}$ . Let $m^{*}$ be a $(M_{D_{f}},m[k^{2};\frac{r}{k^{2}}])$ -admissible point and $r^{*}=2r$ . Define $\Delta=\Delta_{r^{*}}(m^{*})\supseteq\Delta_{r}(m)$ and $f_{\Delta}(x)=\sum_{i=0}^{n}a_{i}\cdot x^{i}$ . Further assume that $\frac{m}{r}\geq 2(1+n^{16})$ , then $\sum_{i>k^{2}}|a_{i}|\leq\frac{|a_{0}|}{128}.$

In the next step, we show how to satisfy the precondition of the $T_{l}$ -test. Theorem 14 says that if $\Delta_{256n^{5}r}(m)$ does not contain any of the roots which are not contained in $\Delta_{r}(m)$ , then $T_{l}(\Delta_{16nr},f)$ succeeds for some $l$ . Let us define $M=256n^{5}r$ , and let $\Delta_{i}:=\Delta_{M^{i}r}(m)$ for $i=0,1,\ldots,k+1$ . Further assume that $r$ has been chosen sufficiently small enough such that each of disks is contained in the cone $C_{n}$ . Since $C_{n}$ contains at most $k$ roots, there must exist a $j$ with $0\leq j\leq k$ such that $\Delta_{j+1}-\Delta_{j}$ does not contain any root. Hence the $T_{l}$ -test will succeed on $\Delta_{16nM^{j}r}(m)$ . So instead of running the $T_{l}$ -test on some initial disk $\Delta_{r}(m)$ , we run it on all disks $\Delta_{16nM^{i}r}(m)$ for $i=0,1,\ldots,k$ , and return the first disk on which the $T_{l}$ -test succeeds; see Algorithm 4.

Correctness of the algorithm follows immediately from the above considerations. The condition on $m$ and $r$ guarantees that each of the disks $\Delta_{i}$ is contained in $C_{n}$ . Lemma 7 gives a bound on its running time.

Lemma 7.

Algorithm 4 returns a disk $\Delta_{r^{\prime}}(m^{\prime})$ , with $r^{\prime}\leq Rr$ and $m-r\leq m^{\prime}\leq m+r$ , together with the number of roots of $f(x)$ in $\Delta_{r^{\prime}}(m^{\prime})$ . Its bit complexity is bounded by $\tilde{O}(k^{5}\cdot(k+\log n)\cdot(k^{2}\log n+n\log\max(1,|m|)+\tau+\log\frac{1}{r}))$ .

Proof.

The condition $m\geq r+2Rnr$ with $R=2^{8k+4}n^{5k+16}$ implies that all the disks considered in the Algorithm 4 are contained in the cone $C_{n}$ . In addition, the condition of Corollary 16 is fulfilled.

One iteration of the inner for loop uses a number of bit operations bounded by $\tilde{O}(k^{2}\cdot(k+\log n)\cdot(L(m^{\prime},R^{\prime},f)+\tau+n\log\max(1,m^{\prime})+k^{2}(\log n+\log\max(1,R^{\prime}))$ ; see Lemma 6. Here, $R^{\prime}\leq R$ and $m-r\leq m^{\prime}\leq m+r$ . In addition, $L(m^{\prime},R^{\prime}):=L(m^{\prime},R^{\prime},f):=2\cdot\left(5+\log n-\log\|f_{\Delta}\|_{\infty}\right).$

If $f_{\Delta}(x)=\sum_{i=0}^{n}a_{i}\cdot x^{i}$ , then obviously $\|f_{\Delta}\|_{\infty}\geq|a_{0}|=|f(m^{\prime})|$ . Since $m^{\prime}$ is an $(M_{D_{f}},m[k^{2};\frac{r}{k^{2}}])$ -admissible point, Lemma 4 implies that

$|M_{D_{f}}(m^{\prime})|\geq 2^{-O(k(\tau+n\log\max(1,m^{\prime})+k\log n+\log\max(1,\frac{k^{2}}{r})))}$ . Thus, we conclude that $-\log\|f_{\Delta}\|_{\infty}\leq k(\tau+n\log\max(1,m^{\prime})+k\log n+\log\frac{1}{r})$ , and $L(m^{\prime},R^{\prime})\leq O(k(\tau+n\log\max(1,m^{\prime})+k\log n+\log\frac{1}{r}))$ . It follows that Algorithm 4 runs in time $\tilde{O}(k^{4}\cdot(k+\log n)(k(\tau+n\log\max(1,m^{\prime})+k\log n+\log\frac{1}{r})+\tau+k^{2}(\log\max(1,R)+\log n)+n\log\max(1,m^{\prime})))=\tilde{O}(k^{5}\cdot(k+\log n)\cdot(k^{2}\log n+n\log\max(1,m^{\prime})+\tau+\log\frac{1}{r}))$ . ∎

7 Computing a Covering

We now show to compute an $(L,[0,1+1/n])$ -covering from a weak $(L^{\prime},[0,1+\frac{1}{n}])$ -covering, For this, we apply Algorithm 4 to the one-circle regions of the intervals in the weak covering. The following Lemma shows that the requirements in Algorithm 4 are fulfilled if we choose $L^{\prime}$ large enough. In addition, by ensuring that the intervals in the weak covering are well separated from each other, we can ensure that the corresponding disks returned by Algorithm 4 are disjoint.

Lemma 8.

Algorithm 5 computes an $(L,[0,1+\frac{1}{n}])$ -covering $\mathcal{L^{\prime}}$ for $f$ using $\tilde{O}(k^{7}\cdot(k+\log n)(k^{3}\log n+\tau+L))$ bit operations. The distance between any two disks of $\mathcal{L^{\prime}}$ is at least $32\cdot 2^{-L}$ , and $\Delta\cap\mathbb{R}\subset(2^{-3\tau},2)$ for any disk $\Delta$ in $\mathcal{L}^{\prime}$ .

Proof.

The output $\mathcal{L}^{\prime}$ surely covers all the real roots of $f$ in the interval $[0,1+\frac{1}{n}]$ . Since the weak covering $\mathcal{L}$ computed in Algorithm 5 is $(L^{\prime},8R)$ -separated and since Algorithm 4 only blows up any disk by a factor of $R$ , we conclude that disks in $\mathcal{L^{\prime}}$ are still separated by at least $4R2^{-L^{\prime}}\geq 32\cdot 2^{-L}$ . In addition, the radius of each disks in $\mathcal{L^{\prime}}$ is at most $R2^{-L^{\prime}}\geq 2^{-L}$ .

Notice that the left endpoint of any interval in $\mathcal{L}$ is at least $2^{-2\tau-3}$ . Thus, for any disk $\Delta$ from $\mathcal{L}^{\prime}$ the left endpoint of the interval $\Delta\cap\mathbb{R}$ is at least $2^{-2\tau-3}-R2^{-L^{\prime}}\geq 2^{-2\tau-5}$ . A similar argument yields the claimed bound on the right end points of $\Delta\cap\mathbb{R}$ .

The running time bounds follow from the stated upper bound on $L^{\prime}$ and $R$ and the fact that $m^{\prime}\leq 1+O(\frac{1}{n}+2^{-L}))$ is always satisfied. ∎

It remains to show how to compute an $(L,[0,\infty))$ -covering for $f$ from an $(L,[0,1+\frac{1}{n}))$ -covering $\mathcal{L}_{1}$ for $f$ and an $(L,[0,1+\frac{1}{n}))$ -covering $\mathcal{L}_{2}$ for $x^{n}f(\frac{1}{x})$ . We first derive an $(L,[\frac{n}{n+1},\infty))$ -covering for $f$ from $\mathcal{L}_{2}$ by inverting the disks $\Delta$ in $\mathcal{L}_{2}$ . The proof of the following lemma is straight forward.

Lemma 9.

Let $\mathcal{L}$ be an $(L,[0,1+\frac{1}{n}])$ -covering of $x^{n}f(\frac{1}{x})$ as computed by Algorithm 5, and $\mathcal{L}^{\prime}:=\{(\Delta^{-1},\mu):(\Delta,\mu)\in\mathcal{L}\}$ be the list obtained from $\mathcal{L}$ by inverting the disks in $\mathcal{L}$ (i.e. $\Delta_{r}(m)^{-1}=\Delta_{r^{\prime}}(m^{\prime})$ with $r^{\prime}=\frac{2r}{m^{2}-r^{2}}$ and $m^{\prime}=\frac{m}{m^{2}-r^{2}}$ ). Then, $\mathcal{L}^{\prime}$ is an $(L^{\prime},[\frac{n}{n+1},\infty))$ -covering of $f$ with $L^{\prime}\geq L-6\tau$ and the distance between two disks in $L^{\prime}$ is at least $8\cdot 2^{-L}$ .

Finally, we merge an $(L,[0,1+1/n))$ -covering $\mathcal{L}_{1}$ and an $(L,[\frac{n}{n+1},\infty))$ -covering $\mathcal{L}_{2}$ for $f$ . Here, we assume that $L>3+\log n$ , and that the coverings are computed using Algorithm 5 and by inverting the $(L,(0,1+1/n))$ -covering for $x^{n}\cdot f(1/x)$ to obtain $\mathcal{L}_{2}$ . This guarantees that the distance between any two disks in either $\mathcal{L}_{1}$ or $\mathcal{L}_{2}$ is at least $8\cdot 2^{-L}$ . For the merging, we keep each disk from $\mathcal{L}_{1}$ that has no intersection with a disk from $\mathcal{L}_{1}$ , and vice versa. For each pair of elements $(\Delta_{1},\mu_{1})\in\mathcal{L}_{1}$ and $(\Delta_{2},\mu_{2})\in\mathcal{L}_{2}$ with $\Delta_{1}\cap\Delta_{2}\neq\emptyset$ , we keep $(\Delta_{1},\mu_{1})$ (and omit $(\Delta_{2},\mu_{2})$ ) if the center of $\Delta_{1}$ is not larger than $1$ . Otherwise, we keep $(\Delta_{2},\mu_{2})$ (and omit $(\Delta_{1},\mu_{1})$ ). Following this approach, we might loose some of the complex roots that are contained in the union of $\Delta_{1}$ and $\Delta_{2}$ , however, we will not loose any real root. Thus, the so obtained list constitutes an $(L,(0,\infty))$ -covering for $f$ .

Notice that any two $(L,(0,\infty))$ - and $(L,(-\infty,0))$ -coverings for $f$ can be trivially merged by taking their union. In addition, since the final covering contains a list of disjoint disks contained in the union of the cone $C_{n}$ and its reflection on the imaginary axis, and since the union of these two cones contains at most $2k-1$ roots of $f$ , the number of disks is also bounded by $2k-1$ . Hence, our main Theorem 3 follows.

Bibliography18

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Maria Emilia Alonso Garçia and André Galligo. A root isolation algorithm for sparse univariate polynomials. In ISSAC , pages 35–42, 2012.
2[2] Osbert Bastani, Christopher J. Hillar, Dimitar Popov, and J. Maurice Rojas. Randomization, Sums of Squares, Near-Circuits, and Faster Real Root Counting. Contemp. Mathematics , 556:145–166, 2011.
3[3] Ruben Becker, Michael Sagraloff, Vikram Sharma, and Chee-Keng Yap. A near-optimal subdivision algorithm for complex root isolation based on the pellet test and newton iteration. J. Symb. Comput. , 2015. In press.
4[4] George E. Collins and Rüdiger Loos. Polynomial real root isolation by differentiation. In SYMSAC , pages 15–25, 1976.
5[5] Michel Coste, Tomás Lajous-Loaeza, Henri Lombardi, and Marie-Francoise Roy. Generalized Budan-Fourier theorem and virtual roots. J. Complexity , 21(4):479 – 486, 2005.
6[6] F. Cucker, P. Koiran, and S. Smale. A polynomial time algorithm for diophantine equations in one variable. J. Symb. Comput. , 27(1):21 – 29, 1999.
7[7] Michael Kerber and Michael Sagraloff. Root refinement for real polynomials using quadratic interval refinement. Journal of Computational and Applied Mathematics , 280:377 – 395, 2015.
8[8] Hendrik W. Lenstra (Jr.). Finding small degree factors of lacunary polynomials. Number Theory in Progress , 1:267–276, 1999.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Efficiently Computing Real Roots of Sparse

Abstract

1 Introduction

1.1 Problem Definition and Contribution

Definition 1** ((L,I)(L,I)(L,I)-covering).**

Definition 2** (Weak (L,I)(L,I)(L,I)-covering).**

Theorem 3**.**

Theorem 4**.**

1.2 Overview of the Algorithm

Definition 5** (Fractional Derivatives).**

2 On the Geometry of Roots

Theorem 6**.**

3 Polynomial arithmetic

Lemma 1**.**

Definition 7** (Admissible point).**

Lemma 2**.**

Definition 8**.**

Lemma 3**.**

Corollary 9**.**

Lemma 4**.**

Theorem 10**.**

4 Refinement

Theorem 11**.**

Corollary 12**.**

5 Computing a Weak Covering

Lemma 5**.**

Corollary 13**.**

6 TlT_{l}Tl​-test

Theorem 14**.**

Lemma 6**.**

Theorem 15**.**

Corollary 16**.**

Lemma 7**.**

7 Computing a Covering

Lemma 8**.**

Lemma 9**.**

Definition 1 ( $(L,I)$ -covering).

Definition 2 (Weak $(L,I)$ -covering).

Theorem 3.

Theorem 4.

Definition 5 (Fractional Derivatives).

Theorem 6.

Lemma 1.

Definition 7 (Admissible point).

Lemma 2.

Definition 8.

Lemma 3.

Corollary 9.

Lemma 4.

Theorem 10.

Theorem 11.

Corollary 12.

Lemma 5.

Corollary 13.

6 $T_{l}$ -test

Theorem 14.

Lemma 6.

Theorem 15.

Corollary 16.

Lemma 7.

Lemma 8.

Lemma 9.