Real roots of random polynomials with coefficients of polynomial growth:   a comparison principle and applications

Yen Q. Do

arXiv:1905.02101·math.PR·October 15, 2021

Real roots of random polynomials with coefficients of polynomial growth: a comparison principle and applications

Yen Q. Do

PDF

Open Access

TL;DR

This paper introduces a comparison principle to analyze the distribution of real roots in random polynomials with coefficients of polynomial growth, extending results beyond zero-mean cases and applying to various polynomial classes.

Contribution

It develops a novel comparison principle that reduces the analysis of non-centered coefficients to the mean-zero case, enabling new results for diverse random polynomial models.

Findings

01

New estimates for the number of real roots in various polynomial classes

02

Logarithmic integrability estimates for random polynomials

03

Sharp local estimates for real zeros

Abstract

This paper seeks to further explore the distribution of the real roots of random polynomials with non-centered coefficients. We focus on polynomials where the typical values of the coefficients have power growth and count the average number of real zeros. Almost all previous results require coefficients with zero mean, and it is non-trivial to extend these results to the general case. Our approach is based on a novel comparison principle that reduces the general situation to the mean-zero setting. As applications, we obtain new results for the Kac polynomials, hyperbolic random polynomials, their derivatives, and generalizations of these polynomials. The proof features new logarithmic integrability estimates for random polynomials (both local and global) and fairly sharp estimates for the local number of real zeros.

Equations520

p_{n} (z) = a_{0} + a_{1} z + \dots + a_{n} z^{n}, z \in C,

p_{n} (z) = a_{0} + a_{1} z + \dots + a_{n} z^{n}, z \in C,

b_{j} = E [a_{j}] and ∣ c_{j} ∣ = V a r [a_{j}] .

b_{j} = E [a_{j}] and ∣ c_{j} ∣ = V a r [a_{j}] .

p_{ξ, L, n} (z)

p_{ξ, L, n} (z)

p_{n} (z) = m_{n} (z) + r_{n} (z)

p_{n} (z) = m_{n} (z) + r_{n} (z)

E N_{p_{n}} = \frac{( 1 + L ) lo g n}{2 π} + O (1),

E N_{p_{n}} = \frac{( 1 + L ) lo g n}{2 π} + O (1),

k \geq 1

k \geq 1

∣ b_{j + 1} - b_{j} ∣ = O ((j + 1)^{ρ_{1}}) .

∣ b_{j + 1} - b_{j} ∣ = O ((j + 1)^{ρ_{1}}) .

C > 0

C > 0

E N_{p_{n}} = \frac{1 + 2 ρ + 1}{2 π} lo g n + o (lo g n) .

E N_{p_{n}} = \frac{1 + 2 ρ + 1}{2 π} lo g n + o (lo g n) .

E N_{p_{n}} = \frac{1 + 2 ρ + 1}{2 π} lo g n + O (1) .

E N_{p_{n}} = \frac{1 + 2 ρ + 1}{2 π} lo g n + O (1) .

∣ b_{j + 1} - b_{j} ∣ = ∣ μ c_{j} ∣∣ (L + j) / (j + 1) - 1∣ = O ((j + 1)^{- 1} c_{j}) = O ((j + 1)^{ρ - 1}) .

∣ b_{j + 1} - b_{j} ∣ = ∣ μ c_{j} ∣∣ (L + j) / (j + 1) - 1∣ = O ((j + 1)^{- 1} c_{j}) = O ((j + 1)^{ρ - 1}) .

C_{1} lo g n + O (1) \leq E N_{p_{n}} \leq C_{2} lo g n + O (1) .

C_{1} lo g n + O (1) \leq E N_{p_{n}} \leq C_{2} lo g n + O (1) .

E N_{p_{n}} = O (1) .

E N_{p_{n}} = O (1) .

j = 1 \sum n j^{α} t^{j} ≲ (1 - t)^{- α - 1} .

j = 1 \sum n j^{α} t^{j} ≲ (1 - t)^{- α - 1} .

E_{n} (t)

E_{n} (t)

(1 - c / n)^{n} (c / n)^{α} n^{α} \leq e^{- c} c^{α}

(1 - c / n)^{n} (c / n)^{α} n^{α} \leq e^{- c} c^{α}

j = 1 \sum n /2 (n + 1 - j)^{β} j^{α} t^{j}

j = 1 \sum n /2 (n + 1 - j)^{β} j^{α} t^{j}

j = n /2 \sum n (n + 1 - j)^{β} j^{α} t^{j}

j = n /2 \sum n (n + 1 - j)^{β} j^{α} t^{j}

P (∣ ξ_{j} - α ∣ \leq δ_{0})

P (∣ ξ_{j} - α ∣ \leq δ_{0})

E ∣ ξ_{j} - α ∣^{2}

E ∣ ξ_{j} - α ∣^{2}

0 < 1 - δ_{0}^{2} \leq C_{1} x^{\frac{ϵ _{0}}{2 + ϵ _{0}}} - δ_{0}^{2} x

0 < 1 - δ_{0}^{2} \leq C_{1} x^{\frac{ϵ _{0}}{2 + ϵ _{0}}} - δ_{0}^{2} x

A_{k} := {∣ ξ_{j} + \frac{b _{j}}{c _{j}} ∣ \leq δ_{0}, \forall j_{0} \leq j \leq k - 1} \cap {∣ ξ_{k} + \frac{b _{k}}{c _{k}} ∣ > δ_{0}}

A_{k} := {∣ ξ_{j} + \frac{b _{j}}{c _{j}} ∣ \leq δ_{0}, \forall j_{0} \leq j \leq k - 1} \cap {∣ ξ_{k} + \frac{b _{k}}{c _{k}} ∣ > δ_{0}}

E [1_{A_{k}} N_{p_{n}} (- r_{1}, r_{1})] ≲ k (lo g k) P (A_{k}),

E [1_{A_{k}} N_{p_{n}} (- r_{1}, r_{1})] ≲ k (lo g k) P (A_{k}),

N_{p_{n}} (- r_{1}, r_{1})

N_{p_{n}} (- r_{1}, r_{1})

\frac{1}{P ( A _{k} )} E [1_{A_{k}} N_{p_{n}} (- r_{1}, r_{1})]

\frac{1}{P ( A _{k} )} E [1_{A_{k}} N_{p_{n}} (- r_{1}, r_{1})]

E ∣ z ∣ = r_{2} sup ∣ p^{*}_{n}^{(k)} (z) ∣ ≲_{r_{2}, ρ} (n + 1 - k)^{ρ} ((2 k + 1)!)^{1/2} (1 - r_{2}^{2})^{- (k + 1)},

E ∣ z ∣ = r_{2} sup ∣ p^{*}_{n}^{(k)} (z) ∣ ≲_{r_{2}, ρ} (n + 1 - k)^{ρ} ((2 k + 1)!)^{1/2} (1 - r_{2}^{2})^{- (k + 1)},

E ∣ z ∣ = r_{2} sup ∣ p^{*}_{n}^{(k)} (z) ∣ \leq i > (n - k) /2 \sum (n + 1 - k - i)^{ρ} (i + 1) \dots (i + k) r_{2}^{i},

E ∣ z ∣ = r_{2} sup ∣ p^{*}_{n}^{(k)} (z) ∣ \leq i > (n - k) /2 \sum (n + 1 - k - i)^{ρ} (i + 1) \dots (i + k) r_{2}^{i},

i > (n - k) /2 \sum

i > (n - k) /2 \sum

∣ m_{n}^{(k)} (t) ∣

∣ m_{n}^{(k)} (t) ∣

∣ m^{*}_{n}^{(k)} (t) ∣

C_{1} lo g n + O (1) \leq E N_{p_{n}} \leq C_{2} lo g n + O (1) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGeometry and complex manifolds

Full text

Real roots of random polynomials with coefficients of polynomial growth: a comparison principle and applications

Yen Q. Do

Department of Mathematics, The University of Virginia, Charlottesville, VA 22904-4137

[email protected]

Abstract.

This paper seeks to further explore the distribution of the real roots of random polynomials with non-centered coefficients. We focus on polynomials where the typical values of the coefficients have power growth and count the average number of real zeros. Almost all previous results require coefficients with zero mean, and it is non-trivial to extend these results to the general case. Our approach is based on a novel comparison principle that reduces the general situation to the mean-zero setting. As applications, we obtain new results for the Kac polynomials, hyperbolic random polynomials, their derivatives, and generalizations of these polynomials. The proof features new logarithmic integrability estimates for random polynomials (both local and global) and fairly sharp estimates for the local number of real zeros.

2000 Mathematics Subject Classification:

30B20

Y.D. partially supported by NSF grant DMS-1800855.

1 Introduction and statement of results
2 Sample applications of the comparison principle
3 Correlation functions: background and main estimates
4 Local anti-concentration inequalities
5 Logarithmic integrability of random polynomials
6 Counting local real roots
7 Lindeberg swapping and Tao-Vu replacement estimates
8 Proof of universality for complex correlation functions
9 Counting local non-real roots
10 Proof of universality for real correlation functions
11 Reduction of Theorem 1 to Gaussian polynomials
12 Proof of Theorem 1 for Gaussian polynomials

1. Introduction and statement of results

This paper seeks to further explore the distribution of the real roots of random algebraic polynomials

[TABLE]

where the coefficients $a_{0},\dots,a_{n}$ are independent real-valued random variables with finite means and finite variances. We are particularly interested in the average number of real roots of such polynomials. This problem has attracted many mathematicians’ attention since previous centuries, initially out of theoretical curiosity, but has recently found applications in statistical physics and finance [10, 29, 28, 30]. It was reported in [34] that during the $18$ th century Waring considered the distribution of the real roots for random polynomials of low degrees. It however took quite a while until the first (but rather crude) estimates for the number of real roots for random polynomials were established, in a result of Bloch and Polya at the beginning of the 20th century [1]. Various authors subsequently worked on this problem, leading to significant developments during 1940s-1970s, with seminal contributions of Kac [17], Littlewood and Offord [19, 20, 21], Ibragimov and Maslova [12, 14, 15, 13, 22, 23], among others. Recently, there has been a renewed interest in this problem [6, 16, 11, 2, 9, 31, 32, 5, 8, 27], in particular Tao and Vu [33] developed a new framework to study the real roots of random polynomials, adapting their methods from random matrix theory. See also [25, 3, 4, 26] for some further development of the methods in [33].

Despite the large number of prior studies, only a very few are about random polynomials with non-centered coefficients, namely when the coefficients may have nonzero means. Furthermore, these studies often require very restrictive assumptions of algebraic nature on the relationship between the mean, the variance, and the underlying index of the coefficients. Ibragimov and Maslova [14, 15] in 1970s considered random polynomials with iid coefficients of nonzero mean (these are known as Kac polynomials). They showed that the expected number of real roots for the Kac random polynomials is essentially reduced to a half if the iid coefficients have a (common) nonzero mean. In [4], a joint work with Oanh Nguyen and Van Vu, using different methods we strengthened and generalized this result to random polynomials where the mean and the variance of the coefficient $a_{j}$ are linearly dependent and furthermore they are algebraic polynomials of $j$ .

In this paper, we consider an innovative approach that circumvents the needs for algebraic constraints between the mean and the variance of the coefficients and does not require any algebraic dependence on the underlying index. In particular, this approach offers some explanation for the interaction between the mean and the variance of random polynomials. We focus on generalized Kac polynomials, an important class where the typical values of the coefficients are comparable to a fixed power of the underlying index. We will discuss below the technical details of our set up.111It may be possible that the current approach will be applicable to some other classes of random functions (such as those studied in [26]), however this will not be explored in this paper and left for further studies.

For convenience of notation, we write $a_{j}=b_{j}+c_{j}\xi_{j}$ where

[TABLE]

Note that we do not assume $c_{j}\geq 0$ and prefer to leave the setup in this generality for the convenience of the proof. Let $\rho\in{\mathbb{R}}$ . For the typical values of $|a_{j}|$ to be comparable to $(1+j)^{\rho}$ , it is natural to assume that ${\mathbb{E}}[a_{j}]=O((1+j)^{\rho})$ and $(Var[a_{j}])^{1/2}$ is comparable to $(1+j)^{\rho}$ , so that there is a significant range of values for $|a_{j}|$ about the size of $(1+j)^{\rho}$ . The following condition essentially describes these assumptions. For technical reasons, below we will need $\rho>-1/2$ .

Condition 1.

Assume that for some $\epsilon_{0},C_{0},N_{0}>0$ and $\rho>-1/2$ it holds that

(i) ${\mathbb{E}}|\xi_{j}|^{2+\epsilon_{0}}\leq C_{0}$ for all $0\leq j\leq n$ ;

(ii) $|b_{j}|,|c_{j}|\leq C_{0}(1+j)^{\rho}$ for all $j$ ;

(iii) $|c_{j}|\geq\frac{1}{C_{0}}(1+j)^{\rho}$ for $N_{0}\leq j\leq n-N_{0}$ .

We note that $b_{j}$ and $c_{j}$ may depend on $n$ . Without loss of generality, we may assume that $0<\epsilon_{0}\leq 1$ throughout the paper. The implicit constants in this paper are allowed to depend on the implicit constants in Condition 1, which include $\rho,\epsilon_{0},C_{0},N_{0}$ .

We now mention several examples that satisfy Condition 1. Via Stirling’s formula, it can be seen that the coefficients of hyperbolic random polynomials222For discussions about the importance of random hyperbolic polynomials in statistical physics, we refer the reader to the beautiful lecture notes [10].

[TABLE]

satisfy the above condition; here $L>0$ and $\xi_{j}$ ’s are independent with unit variance. In particular, if $L=1$ we recover the Kac random polynomials. In fact, we may generate other examples satisfying Condition 1 by taking finite linear combinations of hyperbolic polynomials and their derivatives. Now, while our approach works with more general polynomials, even for the polynomials considered in [4, 14, 15] we are also able to obtain significantly new results.

1.1. Notational conventions

Throughout the paper, for any function $q:{\mathbb{R}}\to{\mathbb{C}}$ we let $N_{q}$ denote the number of its real roots, and let $N_{q}(I)$ be the number of roots inside $I\subset{\mathbb{R}}$ . Note that these numbers could be $\infty$ , but they are never negative.

By $A\lesssim_{t_{1},\dots,}B$ we mean $A=O_{t_{1},\dots}(B)$ , in other words there is a finite constant $C$ such that $|A|\leq CB$ and the constant $C$ is allowed to depend on the parameters $t_{1},\dots$ . Sometimes we will simply write $A\lesssim B$ (without mentioning the parameters $t_{1},\dots$ ) when $C$ is an absolute consatnt or if it is clear from the context what $C$ could depend on. When both $A\lesssim B$ and $B\lesssim A$ hold we will write $A\approx B$ , and we use the same convention for $A\approx_{t_{1},\dots}B$ .

The reciprocal polynomial for a polynomial $p_{n}$ of degree $n$ is $p^{*}_{n}(z):=z^{n}p_{n}(1/z)$ .

1.2. Statement of results

To study $N_{p_{n}}$ , we write

[TABLE]

where $m_{n}(z)={\mathbb{E}}p_{n}(z)$ is a deterministic polynomial and $r_{n}=p_{n}-m_{n}$ is a random polynomial with zero mean. Our heuristics is the following idea: locally, between $m_{n}$ and $r_{n}$ , the dominant component will dictate the behavior of $p_{n}$ and hence will have a stronger influence on the number of real zeros of $p_{n}$ .

Our main result, Theorem 1 is an estimate for the number of real roots of $p_{n}$ inside an arbitrary interval, demonstrating the following comparison principle:

(i) if $m_{n}$ dominates $r_{n}$ then on average there are very few real roots for $p_{n}$ , as $|m_{n}|$ is typically bigger than $|r_{n}|$ .

(ii) if $m_{n}$ is dominated by $r_{n}$ then on average the number of real roots of $p_{n}$ is the same as the number of real roots of $r_{n}$ plus a bounded term.

In the statement of Theorem 1, we will be more precise about the meaning of “dominated” and “dominates”. Here we make some preliminary remarks. First, since $r_{n}$ is random with zero mean, it makes sense to use the standard deviation $(Var[r_{n}])^{1/2}$ as an indicator for the size of $r_{n}$ , and this heuristics is also used for derivatives of $r_{n}$ . For $t\geq 1$ , to compare $m_{n}$ and $r_{n}$ it turns out to be more convenient to work with the reciprocal polynomials $m^{*}_{n}$ and $r^{*}_{n}$ .

In the following, we say that $J$ is an enlargement for $I=(a,b)$ if it is obtained by extending $I$ to the left and to the right a little bit: generally speaking this means there is an absolute constant $c>0$ such that the added length to the right is bounded below by $c(\Big{|}1-|b|\Big{|}+\frac{1}{n})$ and the added length to the left is bounded below by $c(\Big{|}1-|a|\Big{|}+\frac{1}{n})$ .

There are special cases when the enlargement requirement could be made less stringent (without affecting our main results below): if $|1-|b||$ is bounded below by any positive absolute constant then there is no need to extend $I$ to the right and we may use $b$ as the right endpoint for $J$ , and similarly if $|1-|a||$ is bounded below by any positive absolute constant then we may take $a$ as the left endpoint for $J$ . These improvements are made possible with the aid of Lemma 2.

We note that the above notion of enlargement can also be similarly defined for half open/half closed/closed/infinite intervals. In all cases, the following will be true: if $J$ is an enlargement of $I$ then it also qualifies as an enlargement of any subintervals of $I$ .

Theorem 1 (Comparison principle).

There is a constant $0<C<\infty$ such that the following holds. Assume that the coefficients of $p_{n}$ satisfy Condition 1 and are real valued. Let $I\subset{\mathbb{R}}$ be an interval whose endpoints may depend on $n$ and assume that $J$ is an enlargement of $I$ .

Let $m_{n}^{*}(t)=t^{n}m_{n}(\frac{1}{t})$ and $r^{*}_{n}(t)=t^{n}r_{n}(\frac{1}{t})$ for $t\neq 0$ .

(1) Assume that

•

if $t\in J\cap[-1,1]$ then $\quad|m_{n}(t)|>C|\log(1-|t|+\frac{1}{n})|^{1/2}\sqrt{Var[r_{n}(t)]}$ ,

•

if $t\in J\setminus[-1,1]$ then $|m^{*}_{n}(\frac{1}{t})|>C|\log(1-\frac{1}{|t|}+\frac{1}{n})|^{1/2}\sqrt{Var[r^{*}_{n}(\frac{1}{t})]}$ .

Then ${\mathbb{E}}N_{p_{n}}(I)=O(1).$

(2) Let $\phi:[0,1]\to[0,1]$ such that $\displaystyle\int_{1/n}^{c}\frac{\phi(t)}{t}dt=O(1)$ for some $c>0$ .

Assume that for each $k=0,1$ we have the uniform estimates:

•

if $t\in J\cap[-1,1]$ then $\quad|m^{(k)}_{n}(t)|\lesssim\phi(1-|t|+\frac{1}{n})\sqrt{Var[r^{(k)}_{n}(t)]}$ ,

•

if $t\in J\setminus[-1,1]$ then $\quad|{(m^{*}_{n})}^{(k)}(\frac{1}{t})|\lesssim\phi(1-\frac{1}{|t|}+\frac{1}{n})\sqrt{Var[{(r_{n}^{*})}^{(k)}(\frac{1}{t})]}$ ,

and for $k=2$ the weaker estimates without $\phi$ also hold on $J\cap[-1,1]$ and $J\setminus[-1,1]$ .

Then ${\mathbb{E}}N_{p_{n}}(I)={\mathbb{E}}N_{r_{n}}(I)+O(1).$

We note that Theorem 1 is more useful for intervals near $\pm 1$ , since under Condition 1 it can be shown (using a standard argument of Ibragimov and Maslova) that ${\mathbb{E}}N_{p_{n}}(I)=O(1)$ if $I$ is bounded away from $\pm 1$ (see Lemma 2).

In Theorem 1, for technical reasons we need to assume that the domination relationship (between $m_{n}$ and $r_{n}$ ) is effective on an enlargement $J$ of $I$ , however if $p_{n}$ is a Gaussian random polynomial then the conclusions hold with $J=I$ and some of the conditions could be weakened, see Section 12. The proof of the Gaussian case in Section 12 will also shed more light on the motivation for the assumptions on $m_{n}$ and $r_{n}$ in the statement of Theorem 1. One of the main technical ingredients in our proof is a new result about universality for the correlation of the roots of $p_{n}$ , see Section 3.

Using Theorem 1, we could derive new results about the real roots of non-centered random polynomials (with coefficients of power growth) from analogous results for centered random polynomials, which in turn were studied extensively in [4]. Below, we summarize several sample results that can be obtained in this direction (although this list is by no means comprehensive).333A more thorough discussion about possible applications is included in Section 2, where these sample results will be derived from Theorem 1. The sample results will further demonstrate the following observation from [4]: we may extract asymptotic estimates for the number of real roots of a random polynomial from asymptotic information about its coefficients. This phenomenon was first observed in [4] for random polynomials with centered coefficients of polynomial growth.

Below, following [4], we define a generalized polynomial of $j\in\mathbb{Z}_{+}$ to be a finite linear combination of hyperbolic coefficients $h_{L}(j):=\frac{L(L+1)\dots(L+j-1)}{j!}$ , $L>0$ . Its degree is defined to be $L_{max}-1$ , where $L_{max}$ is the biggest $L$ in the combination. If we requires $L$ to be integer then this notion is the same as the classical notion of polynomials. Note that (via Stirling’s formula) a generalized polynomial of degree $\delta$ is asymptotically comparable to $j^{\delta}$ .

Our first sample result is about random hyperbolic polynomials (1.1).

Theorem 2.

Let $p_{n}$ be the hyperbolic random polynomial $p_{\xi,L,n}$ given by (1.1) where $\xi_{j}$ are independent with a common nonzero mean and variance $1$ and uniformly bounded $(2+\epsilon)$ moments for some $\epsilon>0$ .

[TABLE]

Theorem 2 is a special case of the following more general result.

Theorem 3.

Assume that the coefficients of $p_{n}$ satisfy Condition 1. Assume furthermore that there are $\rho_{1}<\rho-1/2<\rho_{2}$ such that $|b_{j}|\gtrsim j^{\rho_{2}}+O(1)$ and

[TABLE]

in particular ${\mathbb{E}}N_{p_{n}}$ grows like $\log n$ as $n\to\infty$ . Furthermore, if for some $C$ we have $c_{j}=(C+o(1))j^{\rho}$ as $j\to\infty$ then

[TABLE]

In particular, if $c_{j}^{2}$ is a generalized polynomial of $j$ then

[TABLE]

Theorem 2 may be derived from Theorem 3 as follows. Letting $\rho=(L-1)/2$ , we note that for the set up of Theorem 2 we will have $b_{j}=c_{j}\mu$ for some $\mu\neq 0$ , and by Stirling’s formula $c_{j}=\sqrt{\frac{L(L+1)\dots(L+j-1)}{j!}}=(C_{L}+o(1))(1+j)^{\rho}$ . On the other hand,

[TABLE]

Using Theorem 3, it follows that ${\mathbb{E}}N_{p_{n}}={\mathbb{E}}N_{r_{n}}(1-\frac{1}{C},1+\frac{1}{C})$ , and thus using [4] we obtain the desired conclusions. We may argue similarly to get the desired asymptotics for ${\mathbb{E}}N_{p^{(k)}_{n}}$ .

Below is a class of random polynomials where the deterministic component $m_{n}$ is dominated by the random component $r_{n}$ .

Theorem 4.

Assume Condition 1 and assume that for some $\rho^{\prime}<\rho-1/2$ we have $|b_{j}|=O((1+j)^{\rho^{\prime}})$ . Then there are finite positive constants $C_{1}$ and $C_{2}$ such that

[TABLE]

Furthermore if for some $C$ we have $c_{j}=(C+o(1))j^{\rho}$ as $j\to\infty$ then we could take $C_{1},C_{2}$ to be $\frac{1+\sqrt{2\rho+1}}{\pi}+o(1)$ . In particular, if $c_{j}^{2}$ is a generalized polynomial of $j$ then we could let $C_{1},C_{2}=\frac{1+\sqrt{2\rho+1}}{\pi}$ .

Finally, we mention a simple class of random polynomials where $m_{n}$ dominates $r_{n}$ , leading to very few real zeros for the random polynomial.

Theorem 5.

Assume Condition 1. Suppose furthermore that for some $\rho^{\prime}\in(\rho-\frac{1}{2},\rho]$ and some $\rho^{\prime\prime}<\rho^{\prime}$ the following holds: for odd $j$ we have $b_{j}=O((1+j)^{\rho^{\prime\prime}})$ and for even $j$ we have $b_{j}\gtrsim(1+j)^{\rho^{\prime}}-O(1)$ . Then

[TABLE]

Furthermore, the above estimate holds true if we interchange the role of odd and even $j$ ’s in the above assumptions.

1.3. Outline of the paper

In the next section, we discuss the applications of Theorem 1 and the proof for the sample results mentioned above. In the rest of the paper, we prove Theorem 1. Our proof of Theorem 1 uses universality estimates for the correlation functions of the real roots of $p_{n}$ , see Section 3. Using these estimates, we could reduce the proof of Theorem 1 to the Gaussian setting. The Gaussian case of Theorem 1 will be examined using the Kac-Rice formula, see Section 12.

2. Sample applications of the comparison principle

In this section, we discuss several applications of Theorem 1 and present the proofs for Theorem 3, Theorem 4, and Theorem 5. We will use the following basic computation about power series.

Lemma 1.

For any $\alpha>-1$ and $\beta>-1$ and any $c>0$ and $C>1$ the following holds:

(i) If $\frac{1}{C}\leq t\leq 1-\frac{c}{n}$ then $\sum_{j=1}^{n}(n+1-j)^{\beta}j^{\alpha}t^{j}\ \ \approx_{\alpha,\beta,c,C}\ \ n^{\beta}(1-t)^{-\alpha-1}$ .

(ii) If $|1-t|\leq c/n$ then $\sum_{j=1}^{n}(n+1-j)^{\beta}j^{\alpha}t^{j}\ \ \approx_{\alpha,\beta,c,C}\ \ n^{\alpha+\beta+1}$ .

Proof of Lemma 1.

Note that if $1-c/n\leq t\leq 1+c/n$ then $1,t,\dots,t^{n}$ are all comparable to $1$ , therefore $\sum_{j=1}^{n}(n+1-j)^{\beta}j^{\alpha}t^{j}\approx\sum_{j=1}^{n}(n+1-j)^{\beta}j^{\alpha}\approx n^{\alpha+\beta+1}$ . Here, to see the last estimate we may split the sum into $1\leq j\leq n/2$ and $n/2<j\leq n$ , and use the fact that for the first range $n+1-j\approx n$ and for the second range $j\approx n$ . This proves part (ii), and furthermore in part (i) we may assume that $1/C\leq t\leq 1-c/n$ where $c$ is sufficiently large. We now discuss the proof of part (i) under this assumption.

We consider first the case $\beta=0$ . By Taylor’s theorem, we have $(1-t)^{-\alpha-1}=1+(\alpha+1)t+\dots+\frac{(\alpha+1)\dots(\alpha+n)}{n!}t^{n}+E_{n}(t)$ , where the error term $E_{n}(t)$ is nonnegative. Now, note that $(\alpha+1)\dots(\alpha+j)/j!\approx j^{\alpha}$ , therefore

[TABLE]

For the other direction of the estimate, it suffices to establish that the error term $E_{n}(t)$ is smaller than fraction of $(1-t)^{-\alpha-1}$ when $c$ is sufficiently large. Here we use the Lagrange form of the error term, which says that for some $s\in(0,t)$ we have

[TABLE]

The desired estimate then follows from the fact that $(1-v)^{n}v^{\alpha}n^{\alpha}$ is a decreasing function for $v\in[\alpha/n,1]$ , and

[TABLE]

and $e^{-c}c^{\alpha}$ could be made arbitrarily small by choosing $c$ sufficiently large.

We now consider the general situation. We have

[TABLE]

Thus it remains to show that the remaining summation over $n/2<j\leq n$ is $O(n^{\beta}(1-t)^{-(\alpha+1)})$ (note that this summation is nonnegative). For these $j$ ’s we note that $j$ is comparable to $n$ . Since $\beta>-1$ we may choose $1<p<\infty$ depending on $\beta$ such that $\beta p>-1$ . Let $q=p/(p-1)$ be its conjugate exponent. Then using Hölder’s inequality we have

[TABLE]

This completes the proof of Lemma 1. ∎

Let $C>0$ be a sufficiently large constant and let $A_{C}=\{z\in{\mathbb{R}}:||z|-1|>1/C\}$ . In the applications of Theorem 1, we will need the following estimate.

Lemma 2.

For any $C>0$ we have ${\mathbb{E}}N_{p_{n}}(A_{C})=O_{C}(1)$ .

We include a proof of Lemma 2 using an argument of Ibragimov–Maslova [13] (see also [4] where a simpler version of Lemma 2 was proved). We’ll need the following estimate, which will also be used later in the proof of Theorem 1.

Lemma 3.

For any $\delta_{0}<1$ there is $p_{0}\in(0,1)$ such that for any $\alpha$ we have $\max_{j}{\mathbb{P}}(|\xi_{j}-\alpha|\leq\delta_{0})\leq 1-p_{0}$ .

Proof of Lemma 3.

Let $\delta_{0}<1$ and let $0\leq j\leq n$ .

We first consider $|\alpha|>3$ . Without loss of generality assume $\alpha>3$ , the case $\alpha<-3$ is can be treated similarly. Then

[TABLE]

Thus we may take any $p_{0}\leq 3/4$ for $|\alpha|>3$ .

We now consider $|\alpha|\leq 3$ . Then ${\mathbb{E}}|\xi_{j}-\alpha|^{2+\epsilon_{0}}=O_{C_{0},\epsilon_{0}}(1)$ . Therefore,

[TABLE]

Let $x={\mathbb{P}}(|\xi_{j}-\alpha|>\delta_{0})\geq 0$ . Since ${\mathbb{E}}|\xi_{j}-\alpha|^{2}=1+|\alpha|^{2}\geq 1$ , we obtain

[TABLE]

for some $C_{1}=C_{1}(C_{0},\epsilon_{0})$ where $C_{0}$ and $\epsilon_{0}$ are as in Condition 1. Thus by examining the function $C_{1}x^{\epsilon_{0}/(2+\epsilon_{0})}-\delta_{0}^{2}x$ of $x$ , it is follows that there is some $p_{0}=p_{0}(\delta_{0},C_{1},\epsilon_{0})\in(0,1)$ such that any $x\in[0,1]$ that satisfies the above inequality must be inside $[p_{0},\infty)$ . Consequently ${\mathbb{P}}(|\xi_{j}-\alpha|\leq\delta_{0})\geq p_{0}$ , as desired. ∎

Proof of Lemma 2.

It suffices to show that for $r_{1}<1$ we have $N_{p_{n}}(-r_{1},r_{1})=O_{r_{1}}(1)$ and $N_{p^{*}_{n}}(-r_{1},r_{1})=O_{r_{1}}(1)$ . We will show in detail the first estimate, and comment on the needed changes for the second estimate.

Take any $r_{2}\in(r_{1},1)$ . Let $\delta_{0},p_{0}$ be as in Lemma 3. From Condition 1, let $j_{0}$ be such that $c_{j}\approx(1+j)^{\rho}$ for $j_{0}\leq j\leq n-j_{0}$ . Define

[TABLE]

for each $j_{0}\leq k\leq n-j_{0}$ , and define $A_{n-j_{0}+1}=\{|\xi_{j}+\frac{b_{j}}{c_{j}}|\leq\delta_{0},\forall j_{0}\leq j\leq n-j_{0}\}$ .

For $k=n-j_{0}+1$ it is clear that we have ${\mathbb{E}}[1_{A_{k}}N_{p_{n}}(-r_{1},r_{1})]\leq np_{0}^{n-2j_{0}}=O(1)$ .

For $j_{0}\leq k\leq n-j_{0}$ , we have ${\mathbb{P}}(A_{k})\leq p_{0}^{k-j_{0}}$ , thus it suffices to show that

[TABLE]

On the event $A_{k}$ , we have $|p_{n}^{(k)}(0)|=k!|b_{k}+c_{k}\xi_{k}|\gtrsim k!|c_{k}|\ \gtrsim\ (k+1)^{\rho}$ , thus using Jensen’s formula we have

[TABLE]

Let $n_{0}$ be an integer larger than $\max(0,\rho)$ . Using convexity and Jensen’s inequality, we have

[TABLE]

To estimate ${\mathbb{E}}N_{p^{*}_{n}}(-r_{1},r_{1})$ , we proceed similarly, and the following estimate will be needed:

[TABLE]

where $r_{2}\in(r_{1},1)$ . To see this estimate, we note that

[TABLE]

then we split the sum into $i\leq(n-k)/2$ and $i>(n-k)/2$ and argue as in the proof of Lemma 1. The treatment of $i\leq(n-k)/2$ is entirely similar as before, but for $i>(n-k)/2$ we actually need to be more careful (than the proof of Lemma 1) about the dependence on $k$ of the implicit constant. We include the details below. By Cauchy–Schwartz we have

[TABLE]

∎

We now divide the discussion of the applications of Theorem 1 into three sections, corresponding to whether $m_{n}$ is always small, or always large, or mixed large/small, in comparison to $r_{n}$ .

2.0.1. Small mean

Here the mean $m_{n}$ will be completely dominated by $r_{n}$ . We first state a corollary of Theorem 1 in this direction, before proving Theorem 4.

Corollary 1.

Let $\phi:[0,1]\to[0,1]$ such that $\int_{1/n}^{c}\frac{\phi(t)}{t}dt=O(1)$ for some $c>0$ . Assume Condition 1 and assume that there is a constant $C>1$ such that for $1/C\leq|t|\leq 1$ and $0\leq k\leq 1$ we have

[TABLE]

and assume that the weaker estimates without $\phi$ also hold true for $k=2$ . Then there are finite positive constants $C_{1}$ and $C_{2}$ such that

[TABLE]

Furthermore if for some $C$ we have $c_{j}=(C+o(1))j^{\rho}$ as $j\to\infty$ then we could take $C_{1},C_{2}$ to be $\frac{1+\sqrt{2\rho+1}}{\pi}+o(1)$ . In particular, if $c_{j}^{2}$ is a generalized polynomial of $j$ then we could let $C_{1},C_{2}=\frac{1+\sqrt{2\rho+1}}{\pi}$ .

Thanks to [4], the zero-mean case (i.e. $b_{j}=0$ for all $j$ ) of the above corollary already holds true. Thus, using Lemma 2 and Theorem 1, Corollary 1 is a simple consequence of the following estimates

[TABLE]

which follows from elementary computations (see Lemma 1 for details).

We now prove Theorem 4. Since $\rho>-1/2$ , we may assume without loss of generality that $\rho^{\prime}>-1$ . Using Lemma 1, for $|t|\leq 1$ we then have

[TABLE]

which clearly implies (2.3). Thus Theorem 4 follows from Corollary 1.

2.0.2. Large mean

Here near $\pm 1$ the mean $m_{n}$ will always dominate $r_{n}$ . As before, we state a corollary of Theorem 1 before proving Theorem 5.

Corollary 2.

Let $\varphi:(0,\infty)\to[0,\infty)$ be such that $\varphi(t)\to\infty$ as $t\to 1/n$ . Assume Condition 1 and assume that there is a constant $C>1$ with the following properties: for $1-\frac{1}{C}\leq|t|\leq 1$ we have

[TABLE]

This corollary follows immediately from (2.2) and Theorem 1 and Lemma 2. We now apply this corollary with $\varphi(t)=t^{-\epsilon}$ to prove Theorem 5. By splitting $m_{n}=m_{n,odd}+m_{n,even}$ and using Lemma 1 to treat each of them individually, we obtain (for $1-1/C\leq|t|\leq 1$ )

[TABLE]

where $\epsilon=\rho^{\prime}+1/2-\rho>0$ . Thus Theorem 5 follows from Corollary 2.

2.0.3. Mixed case

Here we consider the mixed situation, where $m_{n}$ is dominated by $r_{n}$ on a part of the real line and dominates $r_{n}$ elsewhere. In our opinion this is the most interesting case. Here we describe a simple scenario, which applies to random Kac polynomials with non-centered coefficients (considered in [15]) as well as linear combination of derivatives of a random Kac polynomial (considered in [4]), and also hyperbolic random polynomials with non-centered coefficients (Theorem 2 of the current paper). In this scenario, $m_{n}$ is dominated by $r_{n}$ near $-1$ while being the dominant component near $1$ . (Note that due to symmetry we could also state a symmetric version where the roles of $1$ and $-1$ are interchanged.)

Corollary 3.

Let $\varphi:(0,\infty)\to[0,\infty)$ be such that $\varphi(t)\to\infty$ as $t\to 1/n$ . Let $\phi:[0,1]\to[0,1]$ such that $\int_{1/n}^{c}\frac{\phi(t)}{t}dt=O(1)$ for some $c>0$ . Assume Condition 1 and assume that there is a constant $C>1$ with the following properties:

(i) for $1-\frac{1}{C}\leq t\leq 1$ we have

[TABLE]

(ii) for $-1\leq t\leq-1+\frac{1}{C}$ and for each $k=0,1$ we have

[TABLE]

and the weaker estimates without $\phi$ also hold true for $k=2$ . Then

[TABLE]

and in particular there are constants $C_{1},C_{2}>0$ such that

[TABLE]

Furthermore if for some $C$ we have $c_{j}=(C+o(1))j^{\rho}$ as $j\to\infty$ then we could take $C_{1},C_{2}$ to be $\frac{1+\sqrt{2\rho+1}}{2\pi}+o(1)$ . In particular, if $c_{j}^{2}$ is a generalized polynomial of $j$ then we could take $C_{1}=C_{2}=\frac{1+\sqrt{2\rho+1}}{2\pi}$ .

Now, it was shown in [4] that ${\mathbb{E}}N_{r_{n}}(1-1/C,1+1/C)$ grows like $\log n$ , and furthermore if $c_{j}=(C+o(1))j^{\rho}$ then ${\mathbb{E}}N_{r_{n}}(1-1/C,1+1/C)=\frac{1+\sqrt{2\rho+1}}{2\pi}\log n+o(\log n)$ , and the error term could also be improved to $O(1)$ if $c_{j}^{2}$ is a generalized polynomial of $j$ . Thus, Corollary 3 is an immediate consequence of Theorem 1 and (2.2).

We now discuss the proof of Theorem 3. From the given assumption it follows that $b_{j}$ are of the same sign for $j\gtrsim 1$ , so without loss of generality we may assume that $b_{j}>0$ for $j\gtrsim 1$ . Now, using $b_{j}\gtrsim j^{\rho_{2}}$ and $\rho_{2}>\rho-1/2$ one may show that $m_{n}(t)$ dominates $r_{n}(t)$ near $1$ . Indeed, by elementary computations (see Lemma 1), for $t\in[1-1/C,1]$ we have

[TABLE]

We now show that $m_{n}$ is dominated by $r_{n}$ near $-1$ . To see this, let $k\geq 0$ and we use discrete integration by parts to write

[TABLE]

and uniformly over $j_{1}\leq j_{2}$ we have $t^{j_{1}}+\dots+t^{j_{2}}=O(1)$ for $-1\leq t\leq-1+\frac{1}{C}$ . On the other hand, using the given hypothesis we may estimate

[TABLE]

Without loss of generality we may assume $\rho_{1}>\rho-1$ . Since $|t|^{k}\sim 1$ , we obtain

[TABLE]

where $\epsilon=\rho-\rho_{1}-\frac{1}{2}>0$ .

Similarly, for $m^{*}_{n}$ we may estimate, with the assistance of Lemma 1,

[TABLE]

Thus Theorem 3 follows from Corollary 3.

3. Correlation functions: background and main estimates

In this section, we summarize our main results about correlation functions for $p_{n}$ and $p^{*}_{n}$ . These estimates are key ingredients in the proof of Theorem 1 and the proof for these estimates will be presented in subsequent sections.

We first recall some background about correlation functions, following [33, 4]. While there is a more general theory of correlation functions for random point processes, see for instance [10], our discussion will specialize to the context of the roots of random polynomials. Let $Z$ denote the multi-set of the (complex) roots of $p_{n}$ , where a root of multiplicity $m$ will be identified as $m$ different elements.

For $k\geq 1$ , we say that a Borel measure $d\sigma$ on ${\mathbb{C}}^{k}$ is the $k$ -point correlation measure for the (complex) roots of $p_{n}$ if the following equality holds for any continuous and compactly supported function $\phi:{\mathbb{C}}^{k}\to{\mathbb{C}}$ :

[TABLE]

Here, the summation on the left hand side (inside the expectation) is over all ordered $k$ -tuples of different elements of $Z$ . The existence of such a measure is a simple application of the Riesz representation theorem. In the literature, it is common (see e.g. [33]) to define the $k$ -point correlation function as the density of $d\sigma$ with respect to the Lebesque measure (which exists for instance in Gaussian settings [10] or more generally smooth distributions), here we will work with correlation measures to allow for more generality.

When $p_{n}$ is a real polynomial (i.e. with real-valued coefficients), the set of complex zeros for $p_{n}$ is symmetric with respect to the real line, and there may be a nontrivial probability that $p_{n}$ has at least one real root. Thus, for such polynomials we will define the mixed complex-real correlation measures for the roots as follows. Let $m\geq 1$ and $k\geq 0$ and let $d\sigma$ be a measure on ${\mathbb{R}}^{m}\times({\mathbb{C}}\setminus{\mathbb{R}})^{k}$ . We say $d\sigma$ is the $(m,k)$ -point correlation measure for $Z$ if the following two conditions hold:

(i) $d\sigma$ is symmetric under complex conjugations: for any measurable $A\subset{\mathbb{R}}^{m}\times({\mathbb{C}}\setminus{\mathbb{R}})^{k}$ , it holds that $\rho(A)=\rho(A^{\prime})$ where $A^{\prime}$ is one of the $k$ sets obtained from $A$ by taking conjugate in one fixed coordinate;

(ii) for any compactly supported continuous $\phi:{\mathbb{R}}^{m}\times{\mathbb{C}}^{k}\to{\mathbb{C}}$ we have

[TABLE]

Here, the summations on the left hand side are over ordered tuples of different elements of $Z$ . If $d\sigma$ has a density with respect to the Lebesgue measure, such density is classically called the $(m,k)$ -point correlation function [33], which will then be invariant under taking complex conjugation of any variable.

We now define the admissible local sets where comparison estimates for the correlation measures will be proved. These are sets where the expected number of complex roots for $p_{n}$ could be as small as a bounded constant $O(1)$ . For random polynomials with centered-coefficients, the structure of these sets is well-known and has been exploited by previous authors, here we will use the same structure for random polynomials with non-centered coefficients, following [4].

Let $\delta>0$ that may depend on $n$ . Define

[TABLE]

Define $I_{{\mathbb{R}}}(\delta)=I(\delta)\cap{\mathbb{R}}$ and define $I_{{\mathbb{C}}_{+}}(\delta)=I(\delta)\cap{\mathbb{C}}_{+}$ .

Let $p^{*}_{n}(z):=z^{n}p_{n}(1/z)$ be the reciprocal polynomial of $p_{n}$ .

Below, we say that two (possibly complex valued) random variables $\xi_{j}$ and $\widetilde{\xi}_{j}$ have matching moments to up to second order if

[TABLE]

for any $0\leq\alpha,\beta\leq 2$ such that $\alpha+\beta\leq 2$ . Note that if one of $\xi_{j}$ , $\widetilde{\xi}_{j}$ is real valued then this matching condition will force the other to be real-valued. The Gaussian analogue of $p_{n}(z)=\sum_{j}(b_{j}+c_{j}\xi_{j})z^{j}$ if $G_{j}$ is defined to be $p_{n,G}(z)=\sum_{j}(b_{j}+c_{j}G_{j})z^{j}$ where $G_{0},\dots,G_{n}$ are independent Gaussian and $G_{j}$ and $\xi_{j}$ have matching moments up to the second order.

Our main result about the mixed complex-real $(m,k)$ -point correlation functions for the roots of $p_{n}$ is stated below, here $m\geq 1$ and $k\geq 0$ . In Theorem 6, we consider a real random polynomials whose coefficients satisfy Condition 1, and we let $d\sigma$ and $d\sigma^{*}$ denote the $(m,k)$ -point correlation measures for the roots of $p_{n}$ and $p^{*}_{n}$ . The Gaussian analogues of these two correlation measures will be denoted by $d\sigma_{G}$ and $d\sigma^{*}_{G}$ .

In the following, it is understood that all implicit constants may depend on the implicit constants in Condition 1.

Theorem 6.

*Given $0<c<\widetilde{c}<1$ , we could find $C_{1},\alpha_{1}>0$ such that the following holds for any $\frac{1}{n}\lesssim\delta\leq\frac{1}{C_{1}}$ and any $(x,z)=(z_{1},\dots,z_{m},z_{m+1},\dots,z_{m+k})\in I_{{\mathbb{R}}}(\delta)^{m}\times I_{{\mathbb{C}}_{+}}(\delta)^{k}$ :

*Let $\phi_{\delta}$ be supported on $B_{{\mathbb{R}}}(0,c\delta)^{m}\times B_{{\mathbb{C}}}(0,c\delta)^{k}$ such that as a function on ${\mathbb{R}}^{m+2k}$ it is in $C^{3k+2}$ and furthermore $\sup|\partial^{\alpha}\phi_{\delta}|\leq\delta^{-|\alpha|}$ up to order $|\alpha|\leq 3k+2$ .

Let $J\subset I_{{\mathbb{R}}}(\delta)+(-\widetilde{c}\delta,\widetilde{c}\delta)$ be such that for any $1\leq j\leq m+k$ the following holds444Note that the interval $J=I_{{\mathbb{R}}}(\delta)+(-\widetilde{c}\delta,\widetilde{c}\delta)$ has this property, although in the applications we may work with much thinner intervals (which is allowed if $\widetilde{c}$ is small).:

•

if $sign(Re(z_{j}))\geq 0$ and $|Im(z_{j})|\leq\widetilde{c}\delta$ then $(|z_{j}|-\widetilde{c}\delta,|z_{j}|+\widetilde{c}\delta)\subset J$ .

•

*if $sign(Re(z_{j}))<0$ and $|Im(z_{j})|\leq\widetilde{c}\delta$ then $(-|z_{j}|-\widetilde{c}\delta,-|z_{j}|+\widetilde{c}\delta)\subset J$ . *

(i) Assume that $|m^{\prime\prime}_{n}|\lesssim\sqrt{Var[r^{\prime\prime}_{n}]}$ uniformly on $J$ , or $|m_{n}|>C_{1}|\log(1+\frac{1}{n}-|t|)|^{1/2}\sqrt{Var[r_{n}]}$ for all $t\in J$ . Then

[TABLE]

(ii) Assume that $|{m^{*}}^{\prime\prime}_{n}|\lesssim\sqrt{Var[{r_{n}^{*}}^{\prime\prime}]}$ uniformly on $J$ , or $|m^{*}_{n}|>C_{1}|\log(1+\frac{1}{n}-|t|)|^{1/2}\sqrt{Var[r^{*}_{n}]}$ for all $t\in J$ . Then

[TABLE]

Our proof will use the following result for the $k$ -point complex correlation functions, where $k\geq 1$ . In Theorem 7, we consider a (possibly complex valued) random polynomial $p_{n}$ whose coefficients satisfy Condition 1. Below we let $d\sigma$ and $d\sigma^{*}$ denote the $k$ -point correlation measures for the zeros of $p_{n}$ and $p^{*}_{n}$ , and let $d\sigma_{G}$ and $d\sigma^{*}_{G}$ be their Gaussian analogues.

Theorem 7.

*Given any $0<c<1$ , we could find constants $C_{1},\alpha_{1}>0$ such that the following holds for any $\frac{1}{n}\lesssim\delta\leq\frac{1}{C_{1}}$ and any $z\in I(\delta)^{k}$ :

Let $\phi_{\delta}$ be supported on $B_{{\mathbb{C}}}(0,c\delta)^{k}$ such that as a function on ${\mathbb{R}}^{2k}$ it is $C^{3k+2}$ and furthermore $\sup|\partial^{\alpha}\phi_{\delta}|\leq\delta^{-|\alpha|}$ up to order $|\alpha|\leq 3k+2$ .

Then

[TABLE]

Our Theorem 7 slightly generalizes [4, Theorem 2.3]. Here we point out an example outside the scope of [4]. Recall that in [4, Theorem 2.3] it is assumed that $p_{n}(z)=c_{0}\xi_{0}+c_{1}\xi_{1}z+\dots+c_{n}\xi_{n}z^{n}$ where $\xi_{j}$ are independent with unit variance (but could have nonzero means). In our setting, with $p_{n}(z)=a_{0}+a_{1}z+\dots+a_{n}z^{n}$ , if $a_{j}$ is a nonzero constant with probability $1$ (which is allowed to happen for $j=O(1)$ or $j\geq n-O(1)$ according to Condition 1) then it is not possible to write $a_{j}=c_{j}\xi_{j}$ where $\xi_{j}$ of variance $1$ .

We will prove Theorem 7 using an adaptation of the proof of [4, Theorem 2.3]. We take this as an opportunity to provide a more streamlined presentation of the argument in [4], in particular in the proof we will prove new estimates involving log integrability of random polynomials and bounds on the local number of roots, which could be of independent interests.

4. Local anti-concentration inequalities

In this section we will prove several anti-concentration inequalities for random polynomials whose coefficients satisfy Condition 1. We will use these estimates later in the proof of Theorem 7. Below, let $q_{n}=(n+1)^{-\rho}p^{*}_{n}$ be the normalized reciprocal polynomial for $p_{n}$ . Recall that

[TABLE]

Our first set of estimates is contained the following theorem:

Theorem 8.

Let $0\leq c<1$ . Then there are constants $C_{1},\alpha_{1}>0$ such that the following holds for any $\frac{1}{n}\lesssim\delta\leq\frac{1}{C_{1}}$ and any $|z|\in I(\delta)+(-c\delta,c\delta)$ and any $t>0$ :

[TABLE]

Now, if $\delta\approx 1/n$ then Theorem 8 does not give us much information: the right hand sides of (4.1) and (4.2) are now comparable to $1$ , therefore these estimates hold automatically. In this range of $\delta$ , the following set of estimates is more useful. Below, let $\log_{+}(x)=\max(0,\log x)$ .

Theorem 9.

Let $0\leq c<1$ . Then there is a constant $C_{1}>0$ such that the following holds for any $\frac{1}{n}\lesssim\delta\leq\frac{1}{C_{1}}$ and any $|z|\in I(\delta)+(-c\delta,c\delta)$ and any $t>0$ :

[TABLE]

As a corollary of Theorem 8 and Theorem 9, we obtain

Corollary 4.

Let $0\leq c<1$ . Then there is a constant $C_{1}>0$ such that the following holds for any $\frac{1}{n}\lesssim\delta\leq\frac{1}{C_{1}}$ and any $|z|\in I(\delta)+(-c\delta,c\delta)$ : for any $0<\alpha_{2}<\frac{1}{2}$ there is a constant $C_{2}$ such that

[TABLE]

Proof of Corollary 4.

Below we only prove the claimed estimate for $\log|p_{n}|$ , and the same argument specialized to the case $\rho=0$ can be applied to $\log|q_{n}|$ . Using Theorem 8 and Theorem 9, for any $\lambda>0$ we have

[TABLE]

Thus, for any $\delta\in[\frac{\alpha_{2}}{\alpha_{1}}\frac{\log n}{n},\frac{1}{C_{1}}]$ we have

[TABLE]

On the other hand, for any $\frac{1}{n}\lesssim\delta\leq\frac{\alpha_{2}}{\alpha_{1}}\frac{\log n}{n}$ we have

[TABLE]

∎

4.1. Proof of Theorem 8

Recall that $p_{n}(z)=\sum_{j}(b_{j}+c_{j}\xi_{j})z^{j}$ . Using Condition 1, we may find $j_{0}\geq 0$ and $M_{0}>0$ such that

[TABLE]

for all $j$ , while $|c_{j}|\geq M_{0}^{-1}(1+j)^{\rho}$ for $j_{0}\leq j\leq n-j_{0}$ .

We first prove (4.1). Since the left hand side of (4.1) is $O(1)$ , we may assume without loss of generality that $\delta>\frac{B}{n}$ for a large absolute constant $B$ . In particular, we will have $1-(2+c)\delta\leq|z|\leq 1-(1-c)\delta$ , thus $|z|^{N}\leq(1-(1-c)\delta)^{N}$ .

Now, there is a constant $c^{\prime}>0$ depending only on $c$ such that $(1-(1-c)\delta)^{1/\delta}<1-c^{\prime}$ for all $\delta>0$ . Therefore, we may choose $j_{0}\leq N\approx 1/\delta$ such that $|z|^{N}$ is very small. In particular, we may choose such $N$ so that $|z|^{N}<2^{-(\rho+2)}M_{0}^{-2}$ . Now, observe that, thanks to (4.5),

[TABLE]

for any $1\leq k\leq(n-j_{0})/N$ . Therefore,

[TABLE]

for any $1\leq\ell\leq[\frac{n-j_{0}}{N}]\approx n\delta$ .

We now recall the following anti-concentration bound:

Claim 1.

Let $\epsilon_{0},C_{0}>0$ . Then there are constants $\alpha_{2},C_{2}>0$ such that the following holds for any $\ell\geq 1$ : If $\xi_{1},\dots,\xi_{\ell}$ are independent with zero mean and unit variance satisfying ${\mathbb{E}}|\xi_{j}|^{2+\epsilon_{0}}<C_{0}$ , then for any lacunary sequence $|d_{1}|\geq 2|d_{2}|\geq\dots\geq 2^{\ell-1}|d_{\ell}|$ we have:

[TABLE]

For a proof of this now-standard bound, see e.g. [33, Lemma 9.2] or [4, Lemma 4.2]. We apply the above anti-concentration bound to $d_{j}=c_{jN}z^{jN}$ and to the random variables $\xi_{N},\dots,\xi_{(\ell-1)N}$ . By absorbing the remaining terms in $p_{n}(z)$ into the concentration point $u$ , it follows that

[TABLE]

for any $1\leq\ell\leq\ell_{N}:=[(n-j_{0})/N]$ . To obtain the desired estimate (4.1) from this inequality, we will choose $\ell$ to depend on $t$ , and this choice is explained below.

First, note that $|z|^{1/\delta}\geq(1-(2+c)\delta)^{1/\delta}$ , which is uniformly bounded away from [math] and since $N\approx 1/\delta$ , we may find a constant $\alpha_{3}>0$ such that $|z^{N}|\geq e^{-\alpha_{2}/2}$ . It follows that

[TABLE]

For convenience, let $C_{3}>0$ be such that $|c_{\ell N}z^{\ell N}|\geq\frac{1}{C_{3}}\delta^{-\rho}e^{-\alpha_{3}\ell}$ . We then let $\ell$ to be the integer such that

[TABLE]

Now, since the left hand side of (4.1) is $O(1)$ we may assume without loss of generality that $\ell\geq 1$ . To check that this $\ell$ will lead us to (4.1), we divide the consideration into two cases:

Case 1: $1\leq\ell\leq\ell_{N}$ .

It follows from the above constraint on $\ell$ that $e^{-\ell}=O((t\delta^{\rho})^{1/\alpha_{3}})$ . In this range of $\ell$ we may use (4.7), and obtain

[TABLE]

Thus by ensuring $\alpha_{1}\leq\alpha_{2}/\alpha_{3}$ we obtain (4.1).

Case 2: $\ell>\ell_{N}$ .

Here (4.7) is not available, however we observe that the LHS of (4.1) is nondecreasing with respect to $t$ . Therefore, using the case $\ell=\ell_{N}$ of Case 1, we obtain

[TABLE]

Since $\ell_{N}\approx n\delta$ , the last estimate can be bounded above by $O(e^{-\alpha_{1}n\delta})$ for some $\alpha_{1}>0$ . This completes the proof of (4.1).

We now discuss the proof of (4.2), which will follow the same argument. For convenience of notation, we let $q_{n}(x)=(e_{0}+d_{0}\widetilde{\xi}_{0})+(e_{1}+d_{1}\widetilde{\xi}_{1})x+\dots+(e_{n}+d_{n}\widetilde{\xi}_{n})x^{n}$ , where $e_{j}=b_{n-j}(n+1)^{-\rho}$ , $d_{j}=c_{n-j}(n+1)^{-\rho}$ and $\widetilde{\xi}_{j}=\xi_{n-j}$ . It is clear that $e_{j}\lesssim 1$ and $d_{j}\approx 1$ for $j_{0}\leq j\leq n/2$ , therefore we may apply the special case $\rho=0$ of (4.1) to the random polynomial $d_{0}\widetilde{\xi}_{0}+\dots+d_{[n/2]}\widetilde{\xi}_{[n/2]}x^{[n/2]}$ . The desired estimate for $q_{n}$ then follows by absorbing the other terms into the concentration point $u$ .

4.2. Proof of Theorem 9

Below we only prove (4.3), and (4.4) can be obtained from (4.3) by arguing as in the proof of Theorem 8 in the last section.

The proof uses the following generalization of a lemma of Erdös (for a proof see [4, Lemma 4.1]):

Claim 2.

Let $\epsilon_{0},C_{0}>0$ . Then there is a constant $C>0$ such that the following holds for any $m\geq 1$ : If $\xi_{1},\dots,\xi_{m}$ are independent and $\sup_{j}{\mathbb{E}}|\xi_{j}|^{2+\epsilon_{0}}<C_{0}$ then for any $d_{1},\dots,d_{m}\in{\mathbb{C}}$ we have

[TABLE]

Let $n-j_{0}\geq m\geq 2j_{0}$ , where $j_{0}=O(1)$ is such that $|c_{j}|$ is comparable to $(1+j)^{\rho}$ for $j_{0}\leq j\leq n-j_{0}$ (thanks to Condition 1). Applying the above estimate to $d_{j}=c_{j}z^{j}$ for $m/2\leq j\leq m$ , it follows that

[TABLE]

Now, we may choose $C\geq 1$ be sufficiently large such that $\delta\geq 1/(Cn)$ . For any $z\in I(\delta)+(-c\delta,c\delta)$ , it holds that $|z|\geq 1-2C\delta$ , therefore

[TABLE]

Collecting estimates, for $C>0$ large enough we will have

[TABLE]

for any integer $m\in[2j_{0},n-j_{0}]$ . To obtain the desired estimate (4.3) from this inequality, we will choose $m$ suitably depending on $t>0$ . We will choose $m$ to be the integer such that

[TABLE]

Now, since the LHS of (4.3) is $O(1)$ , we may assume without loss of generality that $m\geq 2j_{0}$ . To show that this choice would give us (4.3), we divide the consideration into two cases:

Case 1: $2j_{0}\leq m\leq n-j_{0}$ . For such $m$ we may use (4.8). We note that, as a consequence of the above constraint on $m$ , we will have $m\delta\gtrsim\log_{+}(\frac{1}{t\delta^{\rho}})$ . Consequently,

[TABLE]

Case 2: $m\geq n-j_{0}+1$ . Here we will use monotonicity of the left hand side of (4.3) (as a function of $t$ ). Since we now have $t<\frac{1}{C}\delta^{-\rho}e^{-C(n-j_{0})\delta}$ , it follows that

[TABLE]

This completes our proof of Theorem 9.

5. Logarithmic integrability of random polynomials

This section is devoted to establishing several estimates about the integrability of $\log|p_{n}|$ and $\log|p^{*}_{n}|$ , which will be used to prove bounds for the number of local real roots of $p_{n}$ in subsequent sections. Throughout this section, we’ll assume that the coefficients of $p_{n}$ satisfy Condition 1. For convenience, let $q_{n}:=(n+1)^{-\rho}p^{*}_{n}$ .

5.1. Logarithmic integrability on the unit disk

We start with an estimate about integrability on the unit disk $B(0,1)=\{|z|\leq 1\}$ . We view this as a global estimate.

Theorem 10.

There are absolute constants $C,c>0$ and an event $F$ of exponentially decaying probability ${\mathbb{P}}(F)=O(e^{-cn})$ such that the following holds:

[TABLE]

for all $q\geq 1$ , and the analogous estimate also holds for $q_{n}$ .

We note that the exclusion of an exceptional set of exponentially decaying probability is important. To see this, suppose that $b_{j}=0$ for all $j$ , then $p_{n}(x)\equiv 0$ on the event $F=\{\xi_{j}=0\ \ \forall j\}$ , which has an exponentially decaying probability ${\mathbb{P}}(F)=O(p^{n})$ if for some fixed $p\in(0,1)$ we have ${\mathbb{P}}(\xi_{j}=0)\geq p$ for all $j$ . Such event must be excluded to ensure any integrability for $|\log|p_{n}||$ on $B(0,1)$ .

Without loss of generality we may assume that $n\geq 3$ in the proof. Given such a condition, the right hand side of (5.1) is a strictly increasing function of the implicit constant $C$ , which will be convenient in the proof.

To start, we note that the estimate (5.1) follows from a slightly weaker estimate:

Proposition 1.

There is an event $F$ of exponentially decaying probability ${\mathbb{P}}(F)=O(e^{-cn})$ (for some fixed $c>0$ ) such that the following holds: for any $\epsilon>0$ , there is a constant $C=C(\epsilon)$ such that

[TABLE]

for all $q\geq 1$ , and the analogous estimate also holds for $q_{n}$ .

Indeed, the key observation here is that the the implicit constant $C$ does not depend on $q$ . If (5.2) holds, using Holder’s inequality we have, for any $p\geq 1$ :

[TABLE]

The desired conclusion (5.1) then follows by choosing $p=\log n$ .

The main ingredient in the proof of Proposition 1 is a result of Nazarov-Nishry-Sodin [24, Corollary 1.2] for random Fourier series, summarized below:

Proposition 2.

[24]** There is an absolute constant $C>0$ such that the following holds: Let $r_{\epsilon}(z)=\sum_{j}\epsilon_{j}d_{j}z^{j}$ where $d_{j}$ are deterministic with $\sum_{j}|d_{j}|^{2}=1$ and $\epsilon_{j}$ are independent Rademacher random variables. Then for any $p>0$

[TABLE]

Our proof will actually use the following simple extension of Proposition 2.

Lemma 4.

There is an absolute constant $C>0$ such that the following holds for any $m:B(0,1)\to\mathbb{C}$ measurable with $M:=\int_{0}^{2\pi}\int_{0}^{1}|m(re^{i\theta})|^{2}drd\theta<\infty$ : Let $r_{\epsilon}(z)=\sum_{j}\epsilon_{j}d_{j}z^{j}$ where $d_{j}$ are deterministic with $\sum_{j}|d_{j}|^{2}=1$ and $\epsilon_{j}$ are independent Rademacher random variables. Then for any $p>0$ we have

[TABLE]

In Lemma 4, we could in fact replace the constant $7$ by any constant bigger than $6$ (for our applications any absolute constant would suffice).

5.1.1. Proof of Lemma 4

To prove Lemma 4, we will use the following crude estimate. For convenience of notation, let $f(z)=m(z)+r_{\epsilon}(z)$ and let $|.|$ denote the Lebesgue measure of measurable subsets of $[0,1]\times[0,2\pi]$ .

Claim 3.

There is an absolute constant $C>0$ such that for any $p>0$ and $\lambda\geq 0$ we have

[TABLE]

Now, let $h\geq 1$ be integer such that $h-1<p\leq h$ , we then have

[TABLE]

This competes the proof of Claim 3.

In the proof of Lemma 4, we will use another estimate, which in turn is a consequence of Proposition 2.

Claim 4.

There is an absolute constant $C$ such that for any $p>0$ and $\lambda\geq 0$ we have

[TABLE]

Since the left hand side of the above estimate is always bounded above by $2\pi$ and since $p^{p}\geq e^{-1/e}$ for any $p>0$ , we may assume $\lambda>1$ without any loss of generality. For such $\lambda$ , it suffices to show that

[TABLE]

Let $\epsilon^{\prime}_{j}$ be iid copies of $\epsilon_{j}$ , such that $\epsilon^{\prime}_{0},\dots,\epsilon^{\prime}_{n},\epsilon_{0},\dots,\epsilon_{n}$ are independent Rademacher random variables. Let $\eta_{j}=(\epsilon_{j}-\epsilon_{j}^{\prime})/\sqrt{2}$ , which are also independent Rademacher random variables. We have

[TABLE]

This completes the proof of Claim 4.

We are now ready to start the proof of Lemma 4. We combine Claim 4 and Claim 3 and estimate

[TABLE]

This completes the proof of Lemma 4.

5.1.2. Proof of Proposition 1

We now start the proof of (5.2) for $\log|p_{n}|$ . For convenience of notation, we denote $p_{n,\xi}(w)=\sum_{j}(b_{j}+c_{j}\xi_{j})w^{j}$ to keep track of the dependence of $p_{n}$ on the vector of coefficients $\xi=(\xi_{0},\dots,\xi_{n})$ . Let

[TABLE]

We first show that ${\mathbb{P}}(F)=O(e^{-cn})$ for some $c>0$ . Since $\xi_{j}$ are independent and $|c_{j}|\approx j^{\rho}\gtrsim n^{-1/2}$ for $n-O(1)\geq j\geq O(1)$ , it suffices to show that that there are constants $\delta_{0}>0$ and $p_{0}>0$ such that ${\mathbb{P}}(|\xi_{j}|<\delta_{0})\leq 1-p_{0}$ for all $j$ . This was proved in Lemma 3.

We now divide the remaining of the proof into two cases: the simpler case when $\xi_{j}$ are symmetric for each $j$ , and the general case where no symmetry is assumed.

Case 1: Symmetric coefficients.

Assume that for each $j$ the distributions of $\xi_{j}$ and $-\xi_{j}$ are the same.

Let $\epsilon_{0},\dots,\epsilon_{n}$ be independent Rademacher random variables that are independent from $\xi_{0},\dots,\xi_{n}$ , and let $\widetilde{\xi}_{j}=\epsilon_{j}\xi_{j}$ . Thanks to symmetry, $p_{\xi,n}$ has the same distribution as $p_{\widetilde{\xi},n}$ . Note that $\sigma(\xi)=\sigma(\widetilde{\xi})$ , therefore $F_{\widetilde{\xi}}=F_{\xi}$ and is independent of $\epsilon_{j}$ . Thus it suffices to show that, for any $C>0$ large enough,

[TABLE]

Note that on the event $F^{c}_{\xi}$ we have $\sigma(\xi)\geq n^{-1}$ , which implies $|\log\sigma(\xi)|<\log(n^{2}\sigma(\xi))$ . Conditioning on this event and using Lemma 4, we obtain

[TABLE]

here $C$ depends on $\rho$ . Thus, it remains to show that

[TABLE]

for some $C>0$ (independent of $q$ ). This estimate in turn follows from concavity of $\log^{q}(x)$ on $(e^{q},\infty)$ and Jensen’s inequality:

[TABLE]

Case 2: General coefficients.

We now drop the assumption that the distribution of $\xi_{j}$ ’s are symmetric. To show (5.1), it suffices to show that, for $C=C(\epsilon)>0$ large enough,

[TABLE]

for any $\ell\geq 0$ and any $q\geq 1$ . Since the left hand side of (5.1.2) is $O(1)$ , this estimate holds trivially for $\ell=O(1)$ . Thus, we will assume below that $\ell\geq 1$ , in particular we may replace $(1+\ell)^{-q}$ by $\ell^{-q}$ on the right hand side without any loss of generality.

Now, let $c^{\prime}=c/(2q)$ . We divide the proof of (5.1.2) into two parts, depending on whether $\ell\leq e^{c^{\prime}n}$ or $\ell\geq e^{c^{\prime}n}$ .

Smaller $\ell$ ’s: For $\ell\leq e^{c^{\prime}n}$ , we have $\ell^{-q}\geq e^{-cn/2}$ , thus it suffices to show that

[TABLE]

Now, $\{|\log|p_{n,\xi}(w)||\geq\ell)\}=\{\log|p_{n,\xi}(w)|\geq\ell)\}\cup\{\log|p_{n,\xi}(w)|\leq-\ell)\}$ , and

[TABLE]

Thus, it remains to show that $\int{\mathbb{P}}(\log|p_{n,\xi}(w)|\leq-\ell)$ is bounded by the right hand side of (5.5).

Let $\widetilde{\xi}_{j}$ be iid copy of $\xi_{j}$ that are independent of each other and of other $\xi_{j}$ ’s. Let $\eta_{j}=\frac{1}{\sqrt{2}}(\xi_{j}-\widetilde{\xi}_{j})$ , then $\eta_{j}$ is symmetric with mean zero and variance $1$ . We also have ${\mathbb{E}}|\eta_{j}|^{2+\epsilon_{0}}=O(C_{0})$ uniform over $j$ , thanks to Condition 1. One could easily show that ${\mathbb{P}}(F_{\eta})=O(e^{-cn})$ (with the same $c$ as in the estimate for ${\mathbb{P}}(F_{\xi})$ , although this it not important - we could refine the constant $c$ for $F_{\xi}$ so that these two exceptional sets share the same constant from the beginning of the proof).

Now, using Hölder’s inequality, we obtain

[TABLE]

Let $C$ be sufficiently large, then using the known estimates for the symmetric case, which applies to $p_{\eta,n}$ and $2q$ , we may generously estimate the last display by

[TABLE]

This completes the proof of (5.1.2) for this range of $\ell$ .

Larger $j$ ’s: For $\ell\geq e^{c^{\prime}n}$ , we proceed as follows. Let $\epsilon_{0},\dots,\epsilon_{n}$ be independent Rademacher random variables that are independent from $\xi_{j}$ ’s. Let $\widetilde{\xi}_{j}=\epsilon_{j}\xi_{j}$ and consider the symmetrized variant of $p_{n,\xi}$ , namely

[TABLE]

Using Hölder’s inequality, for any $p,q\geq 1$ we have

[TABLE]

Here, in the last estimate we used the fact that $p_{n,\xi}$ is equal to $p_{n,\widetilde{\xi}}$ with probability $2^{-(n+1)}$ . Observe that $F_{\xi}=F_{\widetilde{\xi}}$ . Thus, using the (known) estimate for the symmetric case, we can further estimate the last display by

[TABLE]

Since $\ell^{\epsilon q}\geq e^{c^{\prime}nq\epsilon}=e^{cn\epsilon/2}$ , it follows that by taking $p\geq\max(1,(c\epsilon)^{-1}\ln 4)$ we have $\ell^{-\epsilon q}2^{n/p}\leq 1$ and we obtain the desired estimate.

This completes the proof of the desired estimate (5.2) for $\log|p_{n}|$ of Proposition 1.

We now discuss the proof for the analogous estimate for $\log|q_{n}|$ . For convenience of notation, let $p^{*}_{n}(x)=\sum_{j}(b^{*}j+c^{*}_{j}\widetilde{\xi}_{j})x^{j}$ where $b^{*}_{j}=b_{n-j}$ , $c^{*}_{j}=c_{n-j}$ , and $\widetilde{\xi}_{j}=\xi_{n-j}$ . In particular, $m^{*}_{n}(x)=\sum_{j}b^{*}_{j}x^{j}$ . We similarly let

[TABLE]

where $\sigma^{*}(\xi)=(\sum_{j}|c^{*}_{j}\widetilde{\xi}_{j}|^{2})^{1/2}$ . Using Condition 1, we have $|c^{*}_{j}|\approx(n+1)^{\rho}$ for $O(1)\leq j\leq n/2$ , therefore by the same argument as before we obtain ${\mathbb{P}}(F^{*}_{\xi})=O(e^{-cn})$ for some $c>0$ . Now, the proof of the symmetric case is entirely the same as before once we verify that on $F^{*}_{\xi}$ it holds that

[TABLE]

But this is clear using Condition 1. Finally, the proof of the general case follows from the symmetric case as long as we could verify that $\int_{B(0,1)}{\mathbb{E}}|q_{n}(w)|^{2}=O(n^{C})$ , which again is clear from Condition 1.

5.2. Logarithmic integrability on local sets

In this section we will prove a probabilistic upper bound regarding the local integrability of $\log|p_{n}|$ and $\log|q_{n}|$ where $q_{n}=(n+1)^{-\rho}p^{*}_{n}$ . This is an estimate on a ball of radius comparable to the scale $\delta$ with center near $I(\delta)$ . All implicit constants below may depend on the implicit constants in Condition 1.

Theorem 11.

Let $0\leq c,c^{\prime}<1$ be such that $c+c^{\prime}<1$ and let $C_{1}>0$ be big enough depending on $c,c^{\prime}$ . Then for any $\alpha_{0}\in(0,1/2)$ and $\frac{1}{n}\lesssim\delta\leq\frac{1}{C_{1}}$ and $z\in I(\delta)+(-c\delta,c\delta)$ there is an event $F$ with probability $O(\delta^{\alpha_{0}})$ such that the following estimate holds uniformly over $1\leq p<\infty$ :

[TABLE]

and the analogous estimate also holds if we replace $p_{n}$ by $q_{n}=(n+1)^{-\rho}p^{*}_{n}$ .

As a consequence Theorem 11, we obtain

[TABLE]

(and the analogous estimate for $q_{n}$ ), which is reminiscent of Theorem 10.

Using Lemma 1, we have the following probabilistic estimates for $\log|p_{n}|$ :

Lemma 5.

Let $0\leq c<1$ . For $\frac{1}{n}\lesssim\delta<\frac{1}{5}$ it holds for any $\epsilon>0$ and $s\in{\mathbb{R}}$ that

[TABLE]

Proof of Lemma 5.

Recall that $p_{n}(w)=a_{0}+a_{1}w+\dots+a_{n}w^{n}$ and ${\mathbb{E}}|a_{j}|^{2}=O((1+j)^{2\rho})$ thanks to Condition 1. Using Lemma 1 and Cauchy-Schwartz, for any $\epsilon>0$ and $w\in I(\delta)+(-c\delta,c\delta)$ we have

[TABLE]

Since ${\mathbb{E}}(\sum_{j=0}^{n}(1+j)^{-2\rho-1-2\epsilon}|a_{j}|^{2})=O(\sum_{j\geq 0}(1+j)^{-1-2\epsilon})=O(1)$ , we obtain

[TABLE]

The desired probabilistic estimate for $\log|p_{n}|$ then follows immediately.

Now, the proof of the claimed probabilistic estimate for $\log|q_{n}|$ is similar. For convenience of notation, let $M_{\epsilon}=(\sum_{j=0}^{n}(n+1-j)^{-2\rho}(1+j)^{-1-2\epsilon}|a_{n-j}|^{2})^{1/2}$ . Using Cauchy Schwarz and Lemma 1 we have, for $\epsilon>0$ and $w\in I(\delta)+(-c\delta,c\delta)$ :

[TABLE]

Again, ${\mathbb{E}}[M_{\epsilon}^{2}]\lesssim\sum_{j=0}^{n}(1+j)^{-1-2\epsilon}=O(1)$ , and the desired estimate follows immediately. ∎

5.2.1. Proof of Theorem 11

We will only show the proof for the claimed estimate for $\log|p_{n}|$ , and the same argument works for $\log|q_{n}|$ . Fix $z\in I(\delta)+(-c\delta,c\delta)$ . Let $C_{1}>0$ be big enough so that Corollary 4 holds.

Thanks Corollary 4, we may assume that

[TABLE]

for some $C_{2}>0$ large. Let $c^{\prime\prime}\in(c^{\prime},1-c)$ . Then for $w\in B(z,c^{\prime\prime}\delta)$ we have $|w|\in I(\delta)+(-(c+c^{\prime\prime})\delta,(c+c^{\prime\prime})\delta)$ , so thanks to Lemma 5, it holds with probability $1-O(\delta^{\alpha_{0}})$ that

[TABLE]

for $C_{3}>0$ large.

Below, we will condition on the event where (5.6) and (5.7) hold, on which we will show that

[TABLE]

Now, the integrand $|\log|p_{n}||$ will blowup near the zeros of $p_{n}$ , however only logarithmically. The above assumptions on $\log|p_{n}|$ will ensure that there are not many such zeros near $z$ , and the main part of the argument is to control the zero-free part of $p_{n}$ using properties of subharmonic functions.

More specifically, let $\ell:=N_{p_{n}}(B(z,c^{\prime\prime}\delta))$ be the number of zeros of $p_{n}$ in $B(z,c^{\prime}\delta)$ . As a consequence of Jensen’s formula, we have

[TABLE]

Now, let $u_{1},\dots,u_{\ell}$ be the zeros of $p_{n}$ in $B(z,c^{\prime}\delta)$ . Let $Q_{n}(w)=p_{n}(w)/((w-u_{1})\dots(w-u_{\ell}))$ , this is a (random) polynomial having no zeros inside $B(z,c^{\prime}\delta)$ , we view $Q_{n}$ as the zero-free part of $p_{n}$ . It follows that, for any $p\geq 1$ ,

[TABLE]

Since $\ell=O(|\log\delta|)$ , it remains to bound the integral involving $Q_{n}$ . In fact, we will show that $|\log|Q_{n}(w)||=O(|\log\delta|^{2})$ uniformly on $B(z,c^{\prime}\delta)$ , which is a stronger estimate. To see this, we first show that $\log|Q_{n}|$ satisfies inequalities similar to (5.6) and (5.7). Indeed, note that $\log|Q_{n}(w)|:B(0,c^{\prime\prime}\delta)\to{\mathbb{R}}\cup\{-\infty\}$ is a subharmonic function, and by the maximum principle it achieves its maximum on the boundary. It follows that

[TABLE]

On the other hand, since $|z-u_{i}|\leq c^{\prime}\delta\leq 1$ for all $i=1,\dots,\ell$ , we also have

[TABLE]

Thus we have verified that $Q_{n}$ satisfies inequalities similar to (5.6) and (5.7). Now, let $h(w):=C|\log\delta|^{2}-\log|Q_{n}(w)|$ for a big constant $C$ such that $h$ is nonnegative (and harmonic) on $B(z,c^{\prime\prime}\delta)$ . Note that

[TABLE]

Using Harnack’s inequality, for any $w\in B(z,c^{\prime}\delta)$ we have

[TABLE]

It follows that $|\log|Q_{n}(w)||\leq O(|\log\delta|^{2})+|h(w)|=O(|\log\delta|^{2})$ for any $w\in B(z,c^{\prime}\delta)$ , as desired.

6. Counting local real roots

In this section, we will use the log integrability estimates and the anti concentration estimates from previous sections to establish several estimates for the local number of real roots for $p_{n}$ .

For each $U\subset{\mathbb{C}}$ and any function $f$ analytic on a neighborhood of $U$ , let $N_{f}(U)$ denote the number of roots of $f$ inside $U$ .

In this section, we assume that the coefficients of $p_{n}$ satisfy Condition 1, and all implicit constants may depend on the implicit constants in Condition 1.

Theorem 12.

Let $0\leq c,c^{\prime}<1$ be such that $c+c^{\prime}<1$ . Then there are constants $C_{1},C_{2},C_{3}>0$ such that the following holds: for any $\frac{1}{n}\lesssim\delta\leq\frac{1}{C_{1}}$ and any $|z|\in I(\delta)+(-c\delta,c\delta)$ and any $M>0$ and any event $E$ we have

[TABLE]

The analogous estimate also holds for $N_{q_{n}}=N_{p^{*}_{n}}$ . Furthermore, for $\delta\geq C_{3}\log n/n$ we could take $C_{2}=1$ .

It follows from Theorem 12 that the number of roots of $p_{n}$ and $p^{*}_{n}$ on $I_{{\mathbb{R}}}(\delta)$ are at most logarithmic away from $O(1)$ . We state a useful corollary, when $E^{c}=\emptyset$ .

Corollary 5.

Let $0\leq c,c^{\prime}<1$ be such that $c+c^{\prime}<1$ . Then there are constant s $C_{1},C_{2},C_{3}>0$ such that for any $\frac{1}{n}\lesssim\delta\leq\frac{1}{C_{1}}$ and any $|z|\in I(\delta)+(-c\delta,c\delta)$ we have

[TABLE]

Furthermore, for $\delta\geq C_{3}\log n/n$ we could take $C_{2}=1$ .

We will divide the proof of Theorem 12 into two cases, depending on whether $\delta$ is small or large. More specifically, we will consider first $\delta\geq C_{3}\log n/n$ for some sufficiently large constant $C_{3}$ , this is the large scale setting. Then we will consider the case when $\frac{1}{n}\lesssim\delta\lesssim\log n/n$ and refer to this as the small scale setting.

6.1. Larger scales

We will use the following sublevel set estimate.

Lemma 6.

Let $0\leq c,c^{\prime}<1$ be such that $c+c^{\prime}<1$ . Let $C>0$ be sufficiently large. Let $\delta\in[\frac{C\log n}{n},\frac{1}{C}]$ and assume that $|z|\in I(\delta)+(-c\delta,c\delta)$ . Then uniformly over $\lambda>C|\log\delta|$ we have

[TABLE]

Let $C_{3}$ be large compared to the constant $C$ from Lemma 6. Using Lemma 6 , we will prove (6.1) for $\delta>C_{3}\log n/n$ . We will only show the details for $N_{p_{n}}$ , the same argument could be applied to $N_{q_{n}}$ . Now, for brevity let $N=N_{p_{n}}(B(z,c^{\prime}\delta))$ and $F=\{N\geq C_{3}|\log\delta|\}$ . Since $N\leq n$ trivially, we obtain

[TABLE]

if $C_{3}$ is sufficiently larger than $CM$ . It follows that

[TABLE]

6.1.1. Proof of Lemma 6

Let $c^{\prime\prime}\in(c^{\prime},1-c)$ . Using Jensen’s formula, we have

[TABLE]

For the first term on the right hand side of (6.3), we apply Lemma 5 with $s=\lambda/2$ and note that $e^{2s}$ is a lot larger than any given power of $(1/\delta)$ .

For the second term on the right hand side of (6.3), we use Theorem 8 with $t=e^{-\lambda/2}$ and use the assumption that $\lambda\geq C\log(1/\delta)$ (where $C$ is very large) to get the desired estimate.

The proof for $N_{q_{n}}$ is entirely similar.

6.2. Smaller scales

We now consider the smaller (and more critical) range $\frac{1}{n}\lesssim\delta\lesssim\frac{\log n}{n}$ . Here we will use Theorem 10 (from Section 5) about the log integrability of $p_{n}$ and $q_{n}$ , which shows that there is an event $F$ with probability ${\mathbb{P}}(F)=O(e^{-cn})$ such that for any $q\geq 1$ we have

[TABLE]

where $C$ is sufficiently large, and the analogous estimate also holds for $\log|q_{n}|$ . We will use these estimates to show the desired estimates for $\log|p_{n}|$ in this range of $\delta$ , and the argument for $\log|q_{n}|$ is entirely similar.

[TABLE]

which is $O_{M}(\delta^{M})$ for any $M>0$ . Thus, we may assume without loss of generality that $E\subset F^{c}$ . For convenience, denote $U=B(z,c^{\prime}\delta)$ and $\Omega:=B(z,c^{\prime\prime}\delta)$ where $c^{\prime\prime}\in(c^{\prime},1-c)$ . Let $\phi$ be a smooth function such that $1_{B(0,c^{\prime})}\leq\phi\leq 1_{B(0,c^{\prime\prime})}$ and let $\phi_{\delta}(.)=\phi(./\delta)$ denote the $L^{\infty}$ -preserving dilation of $\phi$ . We now use Green’s formula

[TABLE]

where $dw$ is the Lebesgue measure on ${\mathbb{C}}$ . It follows that

[TABLE]

Consequently, using Hölder’s inequality, the following holds for any $p\geq 1$

[TABLE]

Recall that $E\subset F^{c}$ and note that $\Omega\subset B(0,1)$ and $|\Omega|=O(\delta^{2})$ . Therefore, using (6.4) for $q=kp$ , we obtain

[TABLE]

Choosing $p=\log n\approx\log(1/\delta)$ , then $\delta^{-1/p}=O(1)$ , therefore

[TABLE]

Now, if ${\mathbb{P}}(E)\leq\delta^{2M}$ , then it is clear that the last right hand side is $O(\delta^{M})$ . If ${\mathbb{P}}(E)\geq\delta^{2M}$ then it is clear that ${\mathbb{P}}(E)^{1/p}\gtrsim 1$ , consequently

[TABLE]

This completes the proof of Theorem 12.

7. Lindeberg swapping and Tao-Vu replacement estimates

Our goal in this section is to establish the following result, which is a simple extension of a replacement estimate in Tao–Vu [33] to non-centered polynomials.

Lemma 7.

For any $C,\epsilon,C_{0}>0$ there is $0<C_{1}<\infty$ so that the following holds.

Let $\xi_{0},\dots,\xi_{n},G_{0},\dots,G_{n}$ be independent with ${\mathbb{E}}|\xi_{j}|^{2+\epsilon}<C$ and ${\mathbb{E}}|G_{j}|^{2+\epsilon}<C$ such that $\xi_{j}$ and $G_{j}$ have matching moments up to second order, for at least $n-C$ indices $j$ . Let $\delta\in(0,1)$ , $\alpha_{1}>0$ , $w_{1},\dots,w_{m}\in I(\delta)$ , and $F:{\mathbb{R}}^{m}\to{\mathbb{C}}$ be such that

(i) $m\lesssim\delta^{-\alpha_{1}}$ , and $|\partial^{\beta}F|\leq\delta^{-\alpha_{1}}$ for $|\beta|\leq 3$ ;

(ii) for all $1\leq i\leq m$ and $0\leq j\leq n$ it holds that $|c_{j}w_{i}^{j}|\lesssim\delta^{C_{1}\alpha_{1}}(\sum_{j}|c_{j}w_{i}^{j}|^{2})^{1/2}$ .

[TABLE]

where the implicit constant may depend on $\alpha_{1},C_{0},C_{1},\epsilon$ .

Without loss of generality we may assume that $G_{j}$ are Gaussian for all $j$ . Following [33], we will prove Lemma 7 using the Lindeberg swapping argument. The following basic estimate captures some ideas of this argument.

Lemma 8 (Basic Lindeberg swapping).

Let $\epsilon,C>0$ . Assume that $\xi_{1},\dots,\xi_{n}$ and $\widetilde{\xi}_{1},\dots,\widetilde{\xi}_{n}$ are independent such that $\max_{j}{\mathbb{E}}|\xi_{j}|^{2+\epsilon}\leq C$ and $\max_{j}{\mathbb{E}}|\widetilde{\xi}_{j}|^{2+\epsilon}\leq C$ .

Assume that $\xi_{j}$ and $\widetilde{\xi}_{j}$ have matching moments up to second order for any $j\not\in J_{0}$ . Here $J_{0}$ is a subset of $\{1,\dots,n\}$ .

Assume that $H:{\mathbb{C}}^{n}\to{\mathbb{C}}$ , such that, as a function on ${\mathbb{R}}^{2n}$ , $H\in C^{3}$ . Then for some $\widetilde{C}$ finite positive depending on $C$ and $\epsilon$ we have:

[TABLE]

Here viewing as a function on ${\mathbb{R}}^{2n}$ we let $M_{i}:=\sum_{j=1}^{n}\sum_{m=0}^{i}\|(\partial_{2j-1})^{i-m}(\partial_{2j})^{m}H\|_{sup}.$

Proof.

Let $H_{1}=H(\xi_{1},\dots,\xi_{n})$ , and let $H_{j+1}$ be obtained from $H_{j}$ by swapping $\xi_{j}$ with $\widetilde{\xi}_{j}$ . We then estimate the left hand side by $\sum_{j}|{\mathbb{E}}(H_{j}-H_{j-1})|$ .

Let $j\not\in J_{0}$ . We view $H(\dots,w_{j},\dots)$ as a function of $Re(w_{j})$ and $Im(w_{j})$ , denoted by $f_{j}$ . For convenience, let $M_{j,i}:=\sum_{m=0}^{i}\|(\partial_{1})^{i-m}(\partial_{2})^{m}f_{j}\|_{sup}$ .

We consider approximation of $f_{j}(x,y)$ using Taylor expansion around $(0,0)$ up to second order terms. By simple interpolation, the error term in this approximation is bounded above $O(\max(|x|^{2+\epsilon},|y|^{2+\epsilon})M_{j,3}^{\epsilon}M_{j,2}^{1-\epsilon})$ . Since $\xi_{j}$ and $\widetilde{\xi}_{j}$ are independent from the others and have matching moments up to second order and since ${\mathbb{E}}|\xi_{j}|^{2+\epsilon}\leq C$ , ${\mathbb{E}}|\widetilde{\xi}_{j}|^{2+\epsilon}\leq C$ , it follows from direct examination that

[TABLE]

Summing these estimates over $j\not\in J_{0}$ and using Hölder’s inequality, we obtain

[TABLE]

Now, let $j\in J_{0}$ . Again we view $H$ as a function $f_{j}$ of $Re(w_{j})$ and $Im(w_{j})$ and approximate it by Taylor expansion around $(0,0)$ up to first order terms. We similarly obtain $|{\mathbb{E}}[H_{j+1}-H_{j}]|\lesssim M_{j,1}({\mathbb{E}}|\xi_{j}|+{\mathbb{E}}|\widetilde{\xi}_{j}|)=O(M_{j,1})$ . Using Kolmogorov’s inequality [18] and a simple application of Hölder’s inequality we obtain

[TABLE]

∎

We now prove Lemma 7. Let $\sigma(z)=\sqrt{Var[p_{n,\xi}(z)]}=(\sum_{0\leq j\leq n}|c_{j}z^{j}|^{2})^{1/2}$ . Let $\widetilde{F}:{\mathbb{R}}^{m}\to{\mathbb{C}}$ be defined by $\widetilde{F}(u_{1},\dots,u_{m})=F(u_{1}+\log\sigma(w_{1}),\dots,u_{m}+\log\sigma(w_{m}))$ . Then we also have $|\partial^{\alpha}\widetilde{F}|\lesssim\delta^{-\alpha_{1}}$ for all partial derivatives of order $|\alpha|\leq 3$ .

Let $M=C_{2}\log(1/\delta)$ for some large constant $C_{2}>0$ to be chosen later.

We perform a decomposition of $\widetilde{F}=F_{1}+F_{2}$ where $F_{1}=\phi\widetilde{F}$ and $F_{2}=(1-\phi)\widetilde{F}$ , where $\phi$ is constructed below. Then $\phi:{\mathbb{R}}^{m}\to{\mathbb{R}}$ is a smooth function supported on $\{(x_{1},\dots,x_{m})\in{\mathbb{R}}^{m}:\min x_{j}\geq-(M+1)\}$ and equals $1$ on $\{(x_{1},\dots,x_{m})\in{\mathbb{R}}^{m}:\min x_{j}\geq-M\}$ , such that $\|\partial^{\alpha}\phi\|_{\infty}\lesssim m^{|\alpha|}$ for any multi-index $\alpha$ .

We plan to apply Lemma 8 to

[TABLE]

Now, $|\partial^{\alpha}F_{1}|\ \lesssim\ m^{3}\delta^{-\alpha_{1}}\ \lesssim\ \delta^{-4\alpha_{1}}$ for $|\alpha|\leq 3$ . Via explicit computations,

[TABLE]

Now, on the support of $F_{1}$ we have $\displaystyle|\frac{c_{k}w_{j}^{k}}{p_{n}(w_{j})}|\lesssim e^{M}\frac{|c_{k}w_{j}^{k}|}{\sigma(w_{j})}$ . Thus, for $x,y\in\{Re(\xi_{k}),Im(\xi_{k})\}$ we have

[TABLE]

Summing over $k$ and using Cauchy Schwartz, we obtain

[TABLE]

Similarly, we estimate the third partial derivatives for $H$ and use these estimates to bound $M_{3}$ . Here we will arrive at trilinear sums, so using the assumption $|c_{j}w_{k}^{j}/\sigma(w_{k})|=O(\delta^{C_{1}\alpha_{1}})$ we eventually obtain

[TABLE]

Now, we may assume $\epsilon\leq 1$ . Via Lemma 8, we have the generous bound

[TABLE]

We now reset $H(\xi_{0},\dots,\xi_{n}):=(1-\phi)(\log f(w_{1}),\dots,\log f(w_{m}))$ . The partial derivatives of $(1-\phi)$ are $O(1)$ and are supported in $\min(\log f(w_{1}),\dots,\log f(w_{m}))\geq-M-1$ . Consequently, via the same consideration as before, we obtain

[TABLE]

here we have used the fact that $p_{n,G}(w_{j})$ is Gaussian and $m=O(\delta^{-\alpha_{1}})$ . Collecting estimates, we obtain

[TABLE]

We choose $M=C_{2}\alpha_{1}\log(1/\delta)$ where $C_{2}\geq C_{0}+2$ , and $C_{1}>(11+3C_{2}+C_{0})/\epsilon$ , then it is clear that the last right hand side is $O(\delta^{C_{0}\alpha_{1}})$ , as desired. This completes the proof of Lemma 7.

8. Proof of universality for complex correlation functions

In this section we prove Theorem 7. Following the framework developed by Tao-Vu [33], we will use the Monte Carlo sampling method (summarized in Lemma 9) and the Lindeberg swapping argument (implemented in Lemma 7). Below, we will only prove the desired estimates for the correlation functions of $p_{n}$ . The same argument could be applied to $q_{n}=(n+1)^{-\rho}p^{*}_{n}$ to get the desired estimates for $p^{*}_{n}$ .

We will actually show the desired estimates when $\phi_{\delta}$ has the tensor structure, namely $\phi_{\delta}(w)=\phi_{1,\delta}(w_{1})\dots\phi_{k,\delta}(w_{k})$ , furthermore for such $\phi_{\delta}$ we will only need to assume that each $\phi_{j,\delta}$ , viewed as a function on ${\mathbb{R}}^{2}$ , is continuously differentiable up to second order and furthermore $|\partial^{\alpha}\phi_{j,\delta}|\leq O(\delta^{-|\alpha|})$ for $|\alpha|\leq 2$ . The reduction from general (i.e. non tensor) $\phi_{\delta}$ to this special set up could be carried out as follows: First, let $c^{\prime}\in(c,1)$ , and let $\phi_{j,\delta}$ be smooth and supported inside $B_{{\mathbb{C}}}(0,c^{\prime}\delta)$ such that $\phi_{j,\delta}=1$ on $B_{\mathbb{C}}(0,c\delta)$ , and as a function on ${\mathbb{R}}^{2}$ it is $C^{2}$ and satisfies the derivative bound $|\partial^{\alpha}\phi_{j,\delta}|\leq O(\delta^{-|\alpha|})$ up to order $2$ . We may write

[TABLE]

using the multiple Fourier series expansion of $\phi$ on the polydisk $B_{{\mathbb{C}}}(0,\delta)^{k}$ . By standard stationary phase estimates, if $\phi_{\delta}$ is $C^{m}$ then $|c_{n}|\lesssim_{m}(1+|n_{1}|+\dots+|n_{k}|)^{-m}$ , while $\partial^{\alpha}[\phi_{j}(w_{j})e^{4\pi in_{j}w_{j}/\delta}]=O(\delta^{-|\alpha|}(1+|n_{j}|)^{|\alpha|})$ , therefore if $m$ is large enough depending on $k$ , say $m\geq 3k+2$ , then we could write $\phi$ as a linear average of tensor-type functions with the properties mentioned earlier.

Thus, we may now assume that $\phi$ has the tensor structure. Let $z=(z_{1},\dots,z_{k})\in I(\delta)^{k}$ be fixed (no implicit constants will depend on $z_{j}$ ’s). Recall that $Z$ denotes the multi-set of zeros of $p_{n}$ . By definition,

[TABLE]

where the sum is over non repeated tuples of $k$ elements of the zero sets of $p_{n}$ . An application of the inclusion-exclusion formula will allow us to rewrite the last right hand side as a linear combination of terms, and each term is a product of finitely many sum of the following type

[TABLE]

where $1\leq j\leq k$ is fixed and $\phi_{j,\delta,X}$ is a function supported in $B_{{\mathbb{C}}}(0,c\delta)$ such that, as a function on ${\mathbb{R}}^{2}$ , it is $C^{2}$ and its partial derivatives up to order $2$ are bounded accordingly.

Consequently, it suffices to show that, for a sequence $X_{i_{1}},\dots,X_{i_{\ell}}$ of the above type,

[TABLE]

(uniform over all choices of $1\leq\ell\leq k$ and $1\leq i_{1}<\dots<i_{\ell}\leq k$ ), for some $c>0$ . Without loss of generality, we may assume that $\ell=k$ and $i_{1}=1$ ,… , $i_{k}=k$ , and for brevity we will omit the dependence on $X_{j}$ in the notation and simply write $X_{j}=\sum_{\alpha}\phi_{j,\delta}(z_{j}-\alpha)$ below.

Let $\alpha_{0}>0$ be a sufficiently small constant that may depend on the underlying implicit constants in Condition 1. By a standard construction, we could find $\varphi:{\mathbb{C}}^{k}\to{\mathbb{C}}$ such that $\phi$ supported on $B(0,2\delta^{-\alpha_{0}})$ and $\varphi(w_{1},\dots,w_{k})=w_{1}\dots w_{k}$ on $B(0,\delta^{-\alpha_{0}})$ , furthermore $|\varphi(w_{1},\dots,w_{k})|\leq|w_{1}\dots w_{k}|$ for any $w_{1},\dots,w_{k}$ , and (as a function on ${\mathbb{R}}^{2k}$ ) $\varphi$ will be in $C^{2}$ with $|\partial^{\alpha}\varphi(w)|\lesssim\delta^{-k\alpha_{0}}$ for any (partial) derivatives of order up to $2$ .

Let $C>0$ is sufficiently large and let $\frac{1}{n}\lesssim\delta\leq\frac{1}{C}$ . We first use Theorem 11 and Lemma 5 to conclude that for any $0<c^{\prime}<1/2$ there is an event $E=E(\delta,\alpha_{0},z_{1},\dots,z_{k})$ with probability ${\mathbb{P}}(E)=O_{c^{\prime},\alpha_{0}}(\delta^{c^{\prime}})$ such that on $T=E^{c}$ the following holds for each $j=1,2,\dots,k$ :

[TABLE]

We now use Green’s formula, which says that the following holds for any $\phi$ compactly supported in $C^{2}({\mathbb{R}}^{2})$

[TABLE]

where $dw$ is the Lebesgue measure. It follows that, for each $1\leq j\leq k$ , we have

[TABLE]

Thus, using Hölder’s inequality and using the above properties of $T$ , we obtain $|X_{j}|\lesssim|\log\delta|^{2}$ on the event $T$ . By ensuring that $\delta<1/C$ for $C$ sufficiently large, it follows that $|X_{j}|<\delta^{-\alpha_{0}}$ on the event $T$ . Now, outside $T$ we still have $|\phi(X_{1},\dots,X_{k})|\leq|X_{1}\dots X_{k}|$ , therefore

[TABLE]

We now use Monte Carlo sampling to approximate the integral form (8.1) of $X_{j}$ with a discrete sum.

Lemma 9 (Monte Carlo sampling).

Let $(X,\mu)$ be a probability space and let $f\in L^{2}(X,\mu)$ . Assume that $w_{1},\dots,w_{m}$ are drawn independently from $X$ using the distribution $\mu$ . Then for $S=\frac{1}{m}(f(w_{1})+\dots+f(w_{m}))$ we have ${\mathbb{E}}S=\int_{X}fd\mu$ and

[TABLE]

Now, $\Delta\phi_{j,\delta}$ is supported inside $B(0,c\delta)$ and is bounded above by $O(\delta^{-2})$ .

Let $w_{j,i}$ be uniformly chosen from $B(0,c\delta)$ (independent of each other and of the coefficients of $p_{n}$ ), here $1\leq i\leq m$ and $1\leq j\leq k$ . Using (8.1) and Lemma 9, it follows that

[TABLE]

where $a_{j,i}=-\frac{1}{2}c^{2}\delta^{2}\Delta\phi_{j,\delta}(z_{j}-w_{j,i})$ . Note that $|a_{j,i}|=O(1)$ .

Now, on the event $T$ , the right hand side in the last display is $O(m^{-1}\lambda^{-2}|\log\delta|^{4})$ . Using the above estimate, we now show that all $X_{j}$ ’s could be replaced by the corresponding averages at a total small cost:

Claim 5.

Let $w=(w_{11},\dots,w_{1m},\dots,w_{k1},\dots,w_{km})$ . Then

[TABLE]

where the expectation is taken over $w$ and $\xi=(\xi_{0},\dots,\xi_{n})$ .

To see this, let $\lambda=\delta^{(k+1)\alpha_{0}}$ . Then on the product probability space generated by $\xi=(\xi_{0},\dots,\xi_{n})$ and $w_{j}=(w_{j,1},\dots,w_{j,m})$ it holds with probability $1-{\mathbb{P}}(T^{c})-O_{k}(m^{-1}\delta^{-(2k+3)\alpha_{0}})$ that

[TABLE]

for all $j=1,\dots,k$ . Now, letting $m\approx\delta^{-(3k+4)\alpha_{0}}$ and choosing $\alpha_{0}$ sufficiently small (so that in particular $c>(k+1)\alpha_{0})$ ), it follows that the following inequality holds with probability $1-O(\delta^{(k+1)\alpha_{0}})$ :

[TABLE]

(Here we’ve used the assumption that the first order partial derivatives of $\varphi$ is bounded above by $O(\delta^{k\alpha_{0}}))$ .) On the event that this estimate does not hold (which has probability $O(\delta^{(k+1)\alpha_{0}})$ ), we have the crude bound $O(\delta^{-k\alpha_{0}})$ for the left hand side of the above display, here we have used the assumption that $|\phi(w_{1},\dots,w_{k})|\leq|w_{1}\dots w_{k}|$ and $\phi$ is supported on $B_{{\mathbb{C}}}(0,2\delta^{-\alpha_{0}})^{k}$ . Collecting estimates, the desired estimate of Claim 5 follows immediately.

On the event $E$ , we note that $X_{j}\lesssim N_{p_{n}}(B(z_{j},c\delta))$ and similarly $X_{j,G}\lesssim N_{p_{n,G}}(B(z_{j},c\delta))$ . Consequently, using (8.2) and Claim 5 we obtain

[TABLE]

Using Theorem 12, the two terms involving $N_{p_{n}}(B(z_{j},c\delta))$ and $N_{p_{n,G}}(B(z_{j},c\delta))$ are bounded by $O(|\log\delta|^{Ck}\delta^{\alpha_{0}})$ , which in turn is bounded by $O(\delta^{\alpha_{0}/2})$ .

Thus, it remains to bound the first term on the right hand side of (8.3). Here we use Lindeberg swapping, or more precisely Lemma 7. Below we only discuss swapping of $\frac{1}{m}\sum_{i=1}^{m}\log|p_{n}(w_{1,i})|$ with its Gaussian analogue $\frac{1}{m}\sum_{i=1}^{m}a_{1,i}\log|p_{n,G}(w_{1,i})|$ ; the swapping of the other $k-1$ averages can be done similarly. Now, by conditioning on other variables and treating them as parameters, we may let

[TABLE]

It remains to show that

[TABLE]

We can check that $|\partial^{\beta}F|\lesssim\frac{1}{m^{|\beta|}}\delta^{-k\alpha_{0}}$ for any partial derivatives up to order $3$ . Note that $m\approx\delta^{-(3k+4)\alpha_{0}}$ by choice and $\alpha_{0}$ could be chosen arbitrarily small. Therefore, in order to show the estimate in the last display via Lemma 7, it remains to show that for some uniform constant $c>0$ (independent of $\alpha_{0}$ ) the following holds

[TABLE]

for any $1\leq i\leq m$ and any $0\leq j\leq n$ . To see this, note that $1-|w_{i}|\approx\delta$ and $c_{j}$ ’s satisfy Condition 1, therefore

[TABLE]

while $|c_{j}w_{i}^{j}|\lesssim(1+j)^{\rho}(1-\delta)^{j}$ . Via examination of the function $x^{\rho}(1-\delta)^{x}$ over $x\in[0,\infty)$ , we could show that $|c_{j}w_{i}^{j}|/\sqrt{Var[p_{n}(w_{i})]}\lesssim\delta^{\rho+\frac{1}{2}}+\delta^{1/2}$ , thus we could take any $0<c\leq\min(\rho+\frac{1}{2},\frac{1}{2})$ . (Recall the assumption that $\rho>-1/2$ ).

9. Counting local non-real roots

In this section, we will prove several estimates for the local number of non-real roots of $p_{n}$ near the real line. These estimates play an essential role in the next section, where the proof of Theorem 6 will be presented. Recall that we write $p_{n}=m_{n}+r_{n}$ where $m_{n}(z)=\sum_{j}b_{j}z^{j}$ is the deterministic component and $r_{n}=\sum_{j}c_{j}\xi_{j}z^{j}$ is the random component. We divide the analysis into two scenarios.

Scenario 1: $m_{n}$ is “small” compared to $r_{n}$ . This scenario generalizes the special case $m_{n}=0$ considered in in [4], where it was shown that with high probability $r_{n}$ has no non-real local root. Here we will show that a similar conclusion holds even with the addition of a “small” deterministic component $m_{n}$ .

Lemma 10.

Let $\epsilon_{0}>0$ be sufficiently small and let $c\in[0,1)$ . Then for $C=C(\epsilon_{0},c)>0$ sufficiently large the following holds for any $\frac{1}{n}\lesssim\delta\leq\frac{1}{C}$ and $\eta:=\delta^{1+\epsilon_{0}}$ and any $x\in I_{{\mathbb{R}}}(\delta)+(-c\delta,c\delta)$ .

(i) Assume that on $B(x,2\eta)$ we have $|m^{\prime\prime}_{n}|\ \lesssim\ \sqrt{Var[r^{\prime\prime}_{n}]}$ .

[TABLE]

(ii) Assume that on $B(x,2\eta)$ we have $|{m^{*}}^{\prime\prime}_{n}|\ \lesssim\ \sqrt{Var[{r_{n}^{*}}^{\prime\prime}]}$ .

[TABLE]

Scenario 2: $m_{n}$ is “large” compared to $r_{n}$ . Here we will show that with high probability $p_{n}$ has no local roots in a neighborhood of the real line.

Lemma 11.

Let $\epsilon_{0}>0$ be sufficiently small and let $c\in[0,1)$ . Let $\kappa>0$ . Then for $C,C^{\prime}>0$ sufficiently large the following holds for any $\frac{1}{n}\lesssim\delta\leq\frac{1}{C}$ and $\eta:=\delta^{1+\epsilon_{0}}$ and any $x\in I_{{\mathbb{R}}}(\delta)+(-c\delta,c\delta)$ .

(i) Assume that on $B(x,2\eta)$ we have $|m_{n}|\ >\ C^{\prime}|\log\delta|^{1/2}\sqrt{Var[r_{n}]}$ .

[TABLE]

(ii) Assume that on $B(x,2\eta)$ we have $|{m^{*}}_{n}|\ >\ C^{\prime}|\log\delta|^{1/2}\sqrt{Var[r^{*}_{n}]}$ .

[TABLE]

9.1. Proof of Lemma 10

9.1.1. Proof of Lemma 10, part (i)

Here we prove part (i) and we will discuss the modifications for part (ii) later. For convenience, let

[TABLE]

Step 1. Reduction to Gaussian: We’ll use Theorem 7 in this step. Let $\widetilde{c}\in(c,1)$ .

Let $\eta_{1},\dots,$ be an enumeration of the (complex) roots of $p_{n}$ and let $\eta_{1,G},\dots$ be an enumeration of the (complex) roots of $p_{n,G}$ , both enumerated with multiplicity.

Let $\epsilon_{1}>0$ be small to be chosen later. Let $\varphi:{\mathbb{C}}\to[0,1]$ be smooth supported on $B(0,2)$ such that $\varphi(z)=1$ if $|z|\leq 1$ . We have

[TABLE]

We now discuss the set up required to apply Theorem 7. Since $x\in I_{{\mathbb{R}}}(\delta)+(-c\delta,c\delta)$ , we may write $x=x_{0}+\alpha$ where $x_{0}\in I_{{\mathbb{R}}}(\delta)$ and $|\alpha|\leq c\delta$ . We then let

[TABLE]

which is defined on ${\mathbb{C}}^{2}$ , and here $L=O(1)$ is a sufficiently large absolute constant (in particular independent of $\epsilon_{0}$ ) so that all required derivative bounds (from Theorem 7) for $\phi_{\delta}$ are satisfied. Now, $supp(\phi_{\delta})\subset B_{{\mathbb{C}}}(\alpha,2\eta)^{2}\subset B_{{\mathbb{C}}}(0,\widetilde{c}\delta)^{2}$ if we require $\delta<1/C$ with $C>0$ sufficiently large depending on $\epsilon_{0}$ , $c$ , and $\widetilde{c}$ . It then follows from Theorem 7 (and the definition of correlation functions) that for some $\alpha_{0}>0$ (independent of $L,\epsilon_{0}$ ) the following holds:

[TABLE]

Unraveling the notation, we obtain

[TABLE]

Using Corollary 5 and observing that $X_{G}\leq N_{p_{n,G}}(B(z,\delta/9))$ , we have

[TABLE]

for any $m\geq 1$ , so by choosing $m$ large we have a bound of $O_{\epsilon_{1},M}(\delta^{M})$ for any $M>0$ . Using Theorem 12, it follows that

[TABLE]

Collecting estimates, we obtain

[TABLE]

by choosing $\epsilon_{0}$ small. So it remains to show that $P(X_{G}\geq 2)\lesssim\delta^{\kappa\epsilon_{0}+2\epsilon_{1}}$ . Since $\epsilon_{1}$ could be chosen very small, it suffices to show that $P(X_{G}\geq 2)\lesssim\delta^{\kappa^{\prime}\epsilon_{0}}$ for some $\kappa^{\prime}\in(\kappa,2)$ , which is essentially the Gaussian analogue of the desired estimate.

Step 2. Proof for Gaussian. We will show that, with high probability $p_{G,n}$ is close to its linear approximation at $x$ , namely $\mathcal{L}(z):=p_{n,G}(x)+p^{\prime}_{n,G}(x)(z-x)$ .

[TABLE]

Using Rouché’s theorem and linearity of $\mathcal{L}$ , (9.1) implies the desired estimate. Now, to show (9.1), we will prove two estimates.

Claim 6.

The following holds uniformly over $t>0$ :

[TABLE]

Claim 7.

For some $\alpha_{0}>0$ the following holds uniformly over $t>0$ :

[TABLE]

The desired estimate (9.1) then follows from choosing $t=(\eta/\delta)^{\kappa}$ in Claim 6 and choosing $t=M|\log\delta|^{1/2}$ (with $M$ large) in Claim 7. Here we need $\kappa<2$ .

9.1.2. Proof of Claim 6

Since $\mathcal{L}$ is linear with real coefficients and since $x\in{\mathbb{R}}$ , $\min_{z\in\partial B(x,2\eta)}|\mathcal{L}(z)|$ is achieved at $z=x-2\eta$ or $z=x+2\eta$ . Consequently, for any $t>0$ we have

[TABLE]

here we have used the fact that $\mathcal{L}(x+2\eta)$ and $\mathcal{L}(x-2\eta)$ are Gaussian. Using Lemma 1 and Condition 1, we have $Var[r_{n}(x)]\approx\delta^{4}\sup_{\xi\in B(x,2\eta)}Var[r^{\prime\prime}_{n}(\xi)]$ . Therefore it remains to show that for any $s\in\{-2\eta,2\eta\}$ we have

[TABLE]

Now, since $\xi_{j}$ are independent, we have $\sqrt{Var[\mathcal{L}(x+s)]}=\|c_{j}(x^{j}+sjx^{j-1})_{j=0}^{n}\|_{l^{2}}$ .

If $\delta\geq\frac{1}{10n}$ then by definition we have $1-|x|\approx\delta$ . Therefore, using the triangle inequality and Lemma 1 and Condition 1 we obtain

[TABLE]

Using Lemma 1 and Condition 1, it follows that $Var[r^{\prime}_{n}(x)]\approx\delta^{-2}Var[r_{n}(x)]$ . Since $\eta\ll\delta$ , the desired estimate (9.2) follows immediately.

Now, if $\frac{1}{n}\lesssim\delta<\frac{1}{10n}$ we have $|s|\leq 2\eta<1/(2n)$ . Therefore, uniformly over $0\leq j\leq n$ we have $|x^{j}+sjx^{j-1}|\gtrsim|x|^{j}$ , which implies the desired estimate (9.2).

9.1.3. Proof of Claim 7.

To estimate $\max_{z\in\partial B(x,2\eta)}|\mathcal{E}(z)|$ , we first estimate the mean and the variance of $\mathcal{E}(z)$ . We will show that

[TABLE]

uniformly over $z\in B(x,2\eta)$ and $w\in B(x,3\eta)$ .

For (9.4), let $w\in B(x,3\eta)$ . By the mean value theorem, we have

[TABLE]

By ensuring $C=C(\epsilon_{0},c)$ is large, for any $\xi\in B(x,3\eta)$ we have $\xi\in I(\delta)+(-c^{\prime}\delta,c^{\prime}\delta)$ for $c^{\prime}=(1+c)/2<1$ . Using Lemma 1, it follows that

[TABLE]

For (9.3), again by the mean value theorem we have

[TABLE]

Now, we combine (9.3) and (9.4) to prove Claim 7. For convenience of notation, let $q(z):=\mathcal{E}(z)-{\mathbb{E}}\mathcal{E}(z)$ . Without loss of generality, we may assume that $t$ is much larger than the implicit constants in the last estimate for $|{\mathbb{E}}\mathcal{E}(z)|$ and in (9.3). It follows from (9.3) and (9.4) that

[TABLE]

Using Cauchy’s theorem, for $z\in\partial B(x,2\eta)$ we have

[TABLE]

where $d|w|$ is the arclength measure along the integration contour $\partial B(x,3\eta)$ . Note that $Var[q(z)]=Var[\mathcal{E}(z)]$ . It follows that, for some $c>0$ , we have

[TABLE]

9.1.4. Proof of Lemma 10, part (ii)

Our proof of part (ii) of Lemma 10 is entirely similar to that of the proof of part (i), where the key ingredients is the fact that uniformly over $\xi\in B(x,3\eta)$ we have $\sqrt{Var[{r^{*}}^{(m)}_{n}(\xi)]}\approx_{m}(1+n)^{\rho}\delta^{-(2m+1)/2}$ for any $m\geq 0$ , which in turn is a consequence of Condition 1 and Lemma 1.

9.1.5. Proof of Lemma 11, part (i)

We will proceed in a similar fashion as in the proof of Lemma 10. The reduction to the Gaussian setting can be done similarly by using universality estimates for the 1-point correlation function of the complex zeros of $p_{n}$ from Theorem 7 and estimates proved in Theorem 12 and Corollary 5.

We now discuss the proof for the Gaussian setting. The given assumption clearly implies that $m_{n}$ has no zero in $B(x,2\eta)$ . Thus, using Rouché’s theorem it suffices to show that

[TABLE]

Using Cauchy’s theorem and arguing as in the proof of Claim 7, we obtain

[TABLE]

for some $\alpha_{0}>0$ and any $\lambda>0$ . Using Lemma 1 and Condition 1, we also have

[TABLE]

Thus, using the given hypothesis we obtain, for some $c^{\prime}>0$ ,

[TABLE]

Let $t=1$ in the last estimate. Then for any $\kappa>0$ we could choose $C_{0}\approx\sqrt{\kappa}$ but large such that this estimate is bounded above by $O((\eta/\delta)^{\kappa})$ , as desired.

9.1.6. Proof of Lemma 11, part (ii)

The proof is entirely similar to part (i).

10. Proof of universality for real correlation functions

Below we prove part (i) of Theorem 6, and the same argument may be used to prove part (ii) of this theorem (details will be omitted).

Let $x=(x_{1},\dots,x_{m})\in I_{{\mathbb{R}}}(\delta)^{m}$ and $z=(z_{m+1},\dots,z_{m+k})\in I_{{\mathbb{C}}_{+}}(\delta)^{k}$ . For convenience of notation write $z_{j}=x_{j}+iy_{j}$ for all $j$ . Then for $j\leq m$ we have $y_{j}=0$ and $x_{j}\in I(\delta)$ , while for $j>m$ we have $y_{j}>0$ . Note that $x_{j}$ and $y_{j}$ may not be inside $I_{{\mathbb{C}}}(\delta)$ for $j>m$ .

Arguing as in the proof of Theorem 7, it suffices to show that

[TABLE]

( $X_{G,j}$ are Gaussian analogues), and $F_{j,\delta}$ and $H_{j,\delta}$ satisfy the following conditions:

(i) for each $j\leq m$ , $F_{j,\delta}$ is in $C^{2}({\mathbb{R}})$ , supported in $(-c\delta,c\delta)$ such that $|F^{(\ell)}_{j,\alpha}|\leq 1$ for $\ell=0,1,2$ .

(ii) for each $j>m$ , $H_{j,\delta}$ is supported on $B_{{\mathbb{C}}}(0,c\delta)$ and is also $C^{2}({\mathbb{R}}^{2})$ with $|\partial^{\alpha}H_{j,\delta}|\leq\delta^{-|\alpha|}$ for any $|\alpha|\leq 2$ .

Let $\epsilon_{0}>0$ be sufficiently small, as required by Lemma 10 and let $\eta=\delta^{1+\epsilon_{0}}$ .

Let $c^{\prime}\in(0,1)$ be small such that $c+c^{\prime}<\widetilde{c}$ .

Let $\Phi:{\mathbb{R}}\to{\mathbb{R}}$ be a bump function supported on $[-c^{\prime},c^{\prime}]$ with $\Phi(0)=1$ .

Let $\Psi:{\mathbb{R}}\to[0,1]$ be a smooth function supported on $\{x\geq c^{\prime}/2\}$ such that $\Psi(x)=1$ if $x\geq c^{\prime}$ .

Let $L=O(1)$ be sufficiently large. Let $K_{1,\delta},\dots,K_{m+k,\delta}:{\mathbb{C}}\to{\mathbb{C}}$ be defined by

[TABLE]

One could check that $K_{1,\delta},\dots,K_{m+k,\delta}$ are supported on $B(0,(c+c^{\prime})\delta)$ and are $C^{2}({\mathbb{R}}^{2})$ with $\partial^{\alpha}$ derivatives bounded by $O(\delta^{-|\alpha|})$ for any multi-index $|\alpha|\leq 2$ .

Applying Theorem 7 for test functions of tensor-product type, it follows that for some $\alpha_{0}>0$ (which does not depend on $\epsilon_{0}$ ) we have

[TABLE]

Letting $Z_{j}:=\delta^{-L\epsilon_{0}}Y_{j}$ and making sure $\epsilon_{0}<c_{0}/(Lm+Lk)$ , it remains to show

[TABLE]

for some $\alpha_{1}>0$ . Since $X_{j},Z_{j}\leq N_{p_{n}}(B(z_{j},c\delta))$ , using Corollary 5 we have ${\mathbb{E}}|X_{j}|^{m+k},{\mathbb{E}}|Z_{j}|^{m+k}\lesssim|\log\delta|^{O(m+k)}$ . Via Holder’s inequality, it therefore suffices to show that for some $c>0$ we have

[TABLE]

Now, for each $1\leq j\leq m+k$ let

[TABLE]

We first show that if $X_{j}-Z_{j}\neq 0$ then $|Im(z_{j})|\leq(c+c^{\prime})\delta$ and

[TABLE]

Indeed, we first consider $1\leq j\leq m$ . Then $z_{j}=x_{j}\in I_{{\mathbb{R}}}(\delta)$ . Therefore,

[TABLE]

Since both $F_{j,\delta}$ and $F_{j,\delta}$ are bounded, it suffices to show that any $\alpha$ that contributes to the sum must be in $S_{j}$ . Indeed, for such $\alpha$ we have $|Re(\alpha)-x_{j}|<c\delta$ and $|Im(\alpha)|<c^{\prime}\delta$ , which implies the desired claim.

We now consider $m+1\leq j\leq m+k$ . We have

[TABLE]

Since $\Psi$ is supported on $[c^{\prime}/2,\infty)$ in the second summation we could further assume that $\alpha\in{\mathbb{C}}_{+}$ . We obtain

[TABLE]

For any contributing $\alpha$ , it holds that $|Im(\alpha)|<c^{\prime}\eta$ , therefore

[TABLE]

In particular, $|Re(z_{j})|\geq|z_{j}|-|Im(z_{j})|\geq 1-O(\delta)$ and this can be made very large compared to $\delta$ . Now,

[TABLE]

therefore $Re(\alpha)$ has the same sign as $Re(z_{j})$ . Thus it remains to show that $||Re(\alpha)|-|z_{j}||\leq(c+c^{\prime})\delta$ . Now, using the triangle inequality this follows from

[TABLE]

This completes the proof of (10.1).

Now, the strip $S_{j}$ could be covered by $O(\delta^{-\epsilon_{0}})$ sets of the form $B(x,\eta)$ with center $x$ inside $(sign(Re(z_{j}))|z_{j}|-(c+c^{\prime})\delta,sign(Re(z_{j}))|z_{j}|+(c+c^{\prime})\delta)$ . Since $c+c^{\prime}<\widetilde{c}$ and since $Im(z_{j})|\leq(c+c^{\prime})\delta$ , it follows that for such $x$ the ball $B(x,2\eta)$ would be inside the interval $J$ where the given hypothesis on the relationship between $m_{n}$ and $r_{n}$ holds. Now, since $p_{n}$ is a real polynomials its complex roots are symmetric about the real axis. Thus, using the small ball estimates proved in Lemma 10 (if $m_{n}$ is small compared to $r_{n}$ ) or the small ball estimates proved in Lemma 11 (if $m_{n}$ is large compared to $r_{n}$ ) with $\kappa=3/2$ , together with an union bound, we obtain

[TABLE]

Now, since $|Z\cap(S_{j}\setminus{\mathbb{R}})|$ is a nonnegative integer, by Theorem 12 we have

[TABLE]

This completes the proof of Theorem 6.

11. Reduction of Theorem 1 to Gaussian polynomials

In this section, using Theorem 6 we will reduce Theorem 1 to Gaussian random polynomials. The proof of Theorem 1 for Gaussian polynomials will be discussed in the next section.

Let $B_{C}=\{1-\frac{1}{C}\leq|t|\leq 1+\frac{1}{C}\}$ . Using Lemma 2, to reduce Theorem 1 to the Gaussian setting, it suffices to show that

[TABLE]

Thus without loss of generality we may assume that $I\subset[1-1/C,1+1/C]$ or $I\subset[-1-1/C,-1+1/C]$ . Below, we will only consider the first case, and we may use the same argument for the other case.

Let $\epsilon>0$ be a very small absolute constant. Recall the definition of $I(\delta)$ from (3.1) and the paragraph after (3.1). Let $\widetilde{I}_{{\mathbb{R}}}(\delta)=\{z:1/z\in I_{{\mathbb{R}}}(\delta)\}$ .

Note that we may cover $I$ using intervals $I_{{\mathbb{R}}}(2^{m})$ and $\widetilde{I}_{{\mathbb{R}}}(2^{\ell})$ where $\frac{1}{n}\lesssim 2^{m}\lesssim\frac{1}{C}$ and $\frac{1}{n}\lesssim 2^{\ell}\lesssim\frac{1}{C}$ . Let $M$ and $L$ be respectively the sets of $m$ and $\ell$ such that $I(2^{m})$ and $\widetilde{I}(2^{\ell})$ intersect $I$ . Clearly, nearby covering intervals have comparable lengths. Thus, we may construct a sequence of functions $\varphi_{m},\psi_{\ell}$ (similar to a partition of unity) such that $\varphi_{m}$ is supported on $(1+\epsilon)I(2^{m})$ and $\psi_{\ell}$ is supported on $(1+\epsilon)\widetilde{I}(2^{\ell})$ , and furthermore

(i) $|\partial^{\alpha}\psi_{\ell}|\lesssim 2^{|\alpha|\ell}$ and $|\partial^{\alpha}\varphi_{m}|\lesssim 2^{|\alpha|m}$ for any partial derivatives, and

(ii) $\gamma(y):=\sum_{m\in M}\varphi_{m}(y)+\sum_{\ell\in L}\psi_{\ell}(y)$ is equal to $1$ for all $y\in I$ and is supported inside $I\cup I_{l}\cup I_{r}$ where $I_{l},I_{r}$ are two intervals from the covering that contain endpoints of $I$ .

Now, we could shrink the endpoint intervals $I_{l}$ and $I_{r}$ by factors comparable to $1$ (if necessary) so that $I$ remains covered by the new collection of intervals, and at the same time $(1+2\epsilon)I_{l},(1+2\epsilon)I_{r}$ are subsets of the assumed enlargement $J$ of $I$ . The given definition of enlargement ensures that the shrinking of these intervals could be done. We may redesign the bump functions $\phi_{m}$ and $\psi_{\ell}$ associated with $I_{l}$ and $I_{r}$ such that they will still be supported inside $(1+\epsilon)I_{l}$ and $(1+\epsilon)I_{r}$ , respectively.

It follows from Theorem 6 that, for some $\alpha_{1}>0$ ,

[TABLE]

Summing the last two estimates over $m$ and $\ell$ , we obtain

[TABLE]

Now, $|{\mathbb{E}}N_{n}(I)-{\mathbb{E}}\sum_{\alpha\in Z\cap{\mathbb{R}}}\gamma(\alpha)|=O({\mathbb{E}}N_{n}(I_{l}\cup I_{r}))$ . For the local intervals $I_{l}$ and $I_{r}$ , we will show that ${\mathbb{E}}N_{n}(I_{l})=O(1)$ and ${\mathbb{E}}N_{n}(I_{r})=O(1)$ . Since the details are entirely similar we will only discuss the estimate for ${\mathbb{E}}N_{n}(I_{l})$ . Since $(1+\epsilon)I_{l}\subset J$ the enlargement of $I$ , we may construct a bump function $\phi$ adapted to $I_{l}$ that equals $1$ on $I_{l}$ but vanishes outside $(1+\epsilon/2)I_{l}$ , in particular its support is strictly contained inside $J$ . Let $d\rho$ be the $1$ -point correlation measure for the real root of $p_{n}$ and $d\rho_{G}$ be its Gaussian analogue. By Theorem 6, we obtain

[TABLE]

Then assuming that the Gaussian case of Theorem 1 is known and using the fact that $J$ remains an enlargement of $(1+\epsilon/2)I_{l}$ , we obtain

[TABLE]

here in the last estimate we may use Proposition 3 in the next section (which is a consequence of explicit Gaussian computations in [4]).

This completes the proof of the reduction of Theorem 1 to Gaussian polynomials.

12. Proof of Theorem 1 for Gaussian polynomials

In this section we prove Theorem 1 for the Gaussian polynomial $p_{n}(t)=\sum_{j=0}^{n}(b_{j}+c_{j}\xi_{j})t^{j}$ where $\xi_{j}$ are iid normalized Gaussian, and throughout the section we will assume that $b_{j}$ and $c_{j}$ satisfy Condition 1.

Let $m_{n}={\mathbb{E}}[p_{n}]$ and $r_{n}(t)=\sum_{j}c_{j}\xi_{j}t^{j}$ and let $\mathcal{P}=Var[r_{n}(t)]$ , $\mathcal{Q}=Var[r^{\prime}_{n}(t)]$ , and $\mathcal{R}=Cov[r_{n}(t),r^{\prime}_{n}(t)]$ , and $\mathcal{S}=\mathcal{P}\mathcal{Q}-\mathcal{R}^{2}$ .

We recall the following Kac-Rice formula [7, Corollary 2.1]. Let $erf(x)=\int_{0}^{x}e^{-t^{2}}dt$ . Then ${\mathbb{E}}N_{n}(a,b)=I_{1}(a,b)+I_{2}(a,b)$ where

[TABLE]

We will also work with the normalized reciprocal polynomial $p^{*}_{n}(t)=m^{*}_{n}(t)+r^{*}_{n}(t)$ , and we will denote by $I_{1}^{*}$ , $I_{2}^{*}$ , $\mathcal{P}^{*},\mathcal{Q}^{*},\mathcal{R}^{*},\mathcal{S}^{*}$ the analogous quantities.

Using Lemma 2, we may assume without loss of generality that $I\subset\{1-c\leq|t|\leq 1+c\}$ for a (small) absolute constant $c>0$ . By breaking up $I$ into $I_{>1}$ and $I_{\leq 1}$ and notice that $N_{p_{n}}(I_{>1})=N_{p^{*}_{n}}(K)$ where $K=\{1/t,\ \ t\in I_{>1}\}$ we may reduce the consideration to $I\subset\{1-c\leq|t|\leq 1\}$ .

Now, using Lemma 1, we have

Corollary 6.

Assume that $b_{j}$ and $c_{j}$ satisfy Condition 1. Then for any $c\in(0,1)$ it holds uniformly over $1-c\leq|t|\leq 1$ that

[TABLE]

On the other hand, by the classical Kac formula, $\rho_{n}(t):=\frac{\mathcal{S}^{1/2}}{\pi\mathcal{P}}$ is the density for the real root distribution of $r_{n}(t)=\sum_{j}c_{j}\xi_{j}t^{j}$ , and similarly $\rho^{*}_{n}(t):=\frac{{\mathcal{S}^{*}}^{1/2}}{\pi{\mathcal{P}}^{*}}$ is the density for the real root distribution of $r^{*}_{n}(t)$ , and both of them can be easily bounded by $O(n)$ by elementary inspection. Note that the Gaussian density for $r_{n}(t)=\sum_{j}c_{j}\xi_{j}t^{j}$ (and for its reciprocal polynomial) was studied555In fact, in [4] it was required that $|c_{j}|\sim(1+j)^{\rho}$ for $O(1)\leq j\leq n$ , however the Gaussian computations in [4] can be easily modified to work with the weaker assumption $O(1)\leq j\leq n-O(1)$ in the current paper. in [4], and we summarize the known estimates for them from [4, Lemma 10.3, Lemma 10.6] in the following proposition.

Proposition 3.

Assume that $c_{j}$ satisfy Condition 1. Let $c>0$ be small. Then uniformly over $1-c\leq|t|\leq 1-c^{\prime}/n$ we have

[TABLE]

and uniformly over $1-c^{\prime}/n\leq|t|\leq 1+c^{\prime}/n$ we have

[TABLE]

In fact, in the original setting considered in [4] it was required that $c_{j}\approx(1+j)^{\rho}$ for all $O(1)\leq j\leq n$ , so it is a little stricter than our setting $O(1)\leq j\leq n-O(1)$ , however the computation in the Gaussian setting in [4] is not affected much with our slightly more relaxed assumption. We omit the details.

12.1. Estimates for $I_{2}$

We will show that, under the hypothesis of Theorem 1 about the relative relation between $m_{n}$ and $r_{n}$ on $I$ , we will always have $I_{2}(I)=O(1)$ . We separate the proof into two cases, depending on whether $m_{n}$ dominates $r_{n}$ or is dominated by $r_{n}$ .

First, we consider the situation when the deterministic component $m_{n}$ dominates the random component $r_{n}$ on $I$ .

Lemma 12.

Let $c>0$ . There is a constant $C>0$ such that the following holds. Let $I\subset\{1-c\leq|t|\leq 1\}$ be an interval whose endpoints may depend on $n$ .

[TABLE]

Proof.

Using Lemma 1 and Corollary 6 we have

[TABLE]

and by the given hypothesis $|m_{n}(t)|^{2}/\mathcal{P}\geq 2C^{\prime}|\log(1+\frac{1}{n}-|t|)|$ where $C^{\prime}$ is comparable to $C^{2}$ . Therefore

[TABLE]

so if $C$ is big enough then $C^{\prime}>5/2$ and the last integral is $O(1)$ , as desired.

The consideration for $I^{*}_{2}(I)$ is entirely similar. ∎

We now consider the situation when $m_{n}$ is dominated by $r_{n}$ .

Recall that $\phi:(0,1)\to[0,1]$ is such that the following holds for some $c>0$ :

[TABLE]

Lemma 13.

Let $c>0$ and let $\phi:(0,1)\to{\mathbb{R}}_{+}$ satisfy (12.3). Let $I\subset\{1-c\leq|t|\leq 1\}$ be an interval whose endpoints may depend on $n$ .

(i) Assume that the following holds uniformly over $t\in I$ .

[TABLE]

(ii) Under the analogous assumptions, we also have $I^{*}_{2}(I)=O_{\epsilon}(1)$ .

Proof.

Using the given hypothesis and using Corollary 6, we have

[TABLE]

Since $\exp(-m_{n}^{2}/\mathcal{P})\leq 1$ , we obtain

[TABLE]

This completes the proof of part (i). The second part (ii) can be proved similarly. ∎

12.2. Estimates for $I_{1}$

Here we will also divide the consideration into two cases, depending on whether $m_{n}$ is dominant or $r_{n}$ is dominant.

The following result addresses the situation when $m_{n}$ is dominated by $r_{n}$ .

Lemma 14.

Assume that $\phi:(0,1)\to{\mathbb{R}}_{+}$ satisfies (12.3). Let $c>0$ and let $I\subset\{1-c\leq|t|\leq 1\}$ be an interval whose endpoints may depend on $n$ .

(i) Assume that uniformly over $t\in I$ we have

[TABLE]

(ii) Under analogous assumptions, a similar estimate holds for $I^{*}_{1}(I)$ .

The following result deals with the situation when $m_{n}$ dominates $r_{n}$ .

Lemma 15.

Let $c>0$ and let $I\subset\{1-c\leq|t|\leq 1\}$ be an interval whose endpoints may depend on $n$ .

(i) Assume that uniformly over $t\in I$ we have

[TABLE]

(ii) Under analogous assumptions, a similar estimate holds for $I^{*}_{1}(I)$ .

The proof of these results are based on the following technical estimate. For convenience, let $\mathcal{T}(t)=\frac{m_{n}^{2}}{\mathcal{P}}+\frac{m_{n}^{\prime 2}}{\mathcal{Q}}$ , and define $\mathcal{T}^{*}(t)$ analogously. Recall that $\rho_{n}(t):=\frac{\mathcal{S}^{1/2}}{\pi\mathcal{P}}$ is the density for the real root distribution of $r_{n}(t)$ , and $\rho^{*}_{n}:=\frac{{\mathcal{S}^{*}}^{1/2}}{\pi\mathcal{P}^{*}}$ is the density for the real root distribution for $r^{*}_{n}$ .

Lemma 16.

Let $c>0$ be sufficiently small and let $c^{\prime}>0$ be sufficiently large. Then there are finite absolute constants $C_{1},C_{2}>0$ that may depend on $c,c^{\prime}$ such that the following holds for any interval $I$ whose endpoints may depend on $n$ .

(i) If $I\subset\{1-c^{\prime}/n\leq|t|\leq 1+c^{\prime}/n\}$ then $I_{1}(I)=O(1)$ and $I^{*}_{1}(I)=O(1)$ .

(ii) If $I\subset\{1-c\leq|t|\leq 1-c^{\prime}/n\}$ then

[TABLE]

and the analogous estimate holds for $I^{*}_{1}(I)$ .

Proof.

(i) Since $\mathcal{P}\mathcal{Q}\geq\mathcal{R}^{2}$ , it follows that $m_{n}^{2}\mathcal{Q}+m_{n}^{\prime 2}\mathcal{P}-2m_{n}m_{n}^{\prime}\mathcal{R}\geq 0$ , so

[TABLE]

The estimate for $I^{*}_{1}$ is proved similarly.

(ii) Let $1-c\leq|t|\leq 1-c^{\prime}/n$ . From Corollary 6 and Proposition 3, we obtain

[TABLE]

In other words for some $C>0$ we have $\mathcal{P}^{1/2}\mathcal{Q}^{1/2}\geq(1+C)|\mathcal{R}|$ . Consequently, by the geometric mean inequality we have

[TABLE]

Now, by Corollary 6 and Proposition 3 we have $\mathcal{S}\approx\mathcal{P}\mathcal{Q}$ . It follows that

[TABLE]

The desired estimate then follows from the definition (12.1) of $I_{1}$ .

The proof for $I^{*}_{1}(t)$ is completely analogous. ∎

We now use Lemma 16 to prove Lemma 14 and Lemma 15. Below we will show only the proof for the desired estimates for $I_{1}$ , the same argument works for $I^{*}_{1}$ . We start with the case when $m_{n}$ is dominated by $r_{n}$ : under the assumptions of Lemma 14 we have $\mathcal{T}(t)\lesssim\phi(1-|t|+\frac{1}{n})$ . Using $1\geq e^{-x}\geq 1-x$ for $x\geq 0$ and using Proposition 3, it follows that

[TABLE]

Now in the case when $m_{n}$ dominates $r_{n}$ : under the assumptions of Lemma 15 we have $\mathcal{T}(t)\gtrsim|\log(1+\frac{1}{n}-|t)|$ , while $\rho_{n}(t)\lesssim(1+\frac{1}{n}-|t|)^{-1}$ thanks to Proposition 3. Therefore, for some $c^{\prime\prime}>0$ we have

[TABLE]

Bibliography34

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A Bloch and György Pólya. On the roots of certain algebraic equations. Proceedings of the London Mathematical Society , 2(1):102–114, 1932.
2[2] Federico Dalmao. Asymptotic variance and CLT for the number of zeros of Kostlan Shub Smale random polynomials. Comptes Rendus Mathematique , 353(12):1141–1145, 2015.
3[3] Yen Do, Hoi Nguyen, and Van Vu. Real roots of random polynomials: expectation and repulsion. Proceedings of the London Mathematical Society (3) , 111(6):1231–1260, 2015.
4[4] Yen Do, Oanh Nguyen, and Van Vu. Roots of random polynomials with coefficients of polynomial growth. Annals of Probability (2018), Vol. 46, no. 5, 2407–2494 .
5[5] Yen Do and Van Vu. Central limit theorems for the real zeros of Weyl polynomials. American Journal of Mathematics (2020), vol. 142, issue 2, pp. 1327–1369. , 2017.
6[6] Alan Edelman and Eric Kostlan. How many zeros of a random polynomial are real? Bulletin of the American Mathematical Society , 32(1):1–37, 1995.
7[7] Kambiz Farahmand. Topics in random polynomials , volume 393. CRC Press, 1998.
8[8] Hendrik Flasche and Zakhar Kabluchko. Real zeros of random analytic functions associated with geometries of constant curvature. ar Xiv preprint ar Xiv:1802.02390 , 2018.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Real roots of random polynomials with coefficients of polynomial growth: a comparison principle and applications

Abstract.

2000 Mathematics Subject Classification:

Contents

1. Introduction and statement of results

Condition 1**.**

1.1. Notational conventions

1.2. Statement of results

Theorem 1** (Comparison principle).**

Theorem 2**.**

Theorem 3**.**

Theorem 4**.**

Theorem 5**.**

1.3. Outline of the paper

2. Sample applications of the comparison principle

Lemma 1**.**

Proof of Lemma 1.

Lemma 2**.**

Lemma 3**.**

Proof of Lemma 3.

Proof of Lemma 2.

2.0.1. Small mean

Corollary 1**.**

2.0.2. Large mean

Corollary 2**.**

2.0.3. Mixed case

Corollary 3**.**

3. Correlation functions: background and main estimates

Theorem 6**.**

Theorem 7**.**

4. Local anti-concentration inequalities

Theorem 8**.**

Theorem 9**.**

Corollary 4**.**

Proof of Corollary 4.

4.1. Proof of Theorem 8

Claim 1**.**

4.2. Proof of Theorem 9

Claim 2**.**

5. Logarithmic integrability of random polynomials

5.1. Logarithmic integrability on the unit disk

Theorem 10**.**

Proposition 1**.**

Proposition 2**.**

Lemma 4**.**

5.1.1. Proof of Lemma 4

Claim 3**.**

Claim 4**.**

5.1.2. Proof of Proposition 1

5.2. Logarithmic integrability on local sets

Theorem 11**.**

Lemma 5**.**

Proof of Lemma 5.

5.2.1. Proof of Theorem 11

6. Counting local real roots

Theorem 12**.**

Corollary 5**.**

6.1. Larger scales

Lemma 6**.**

6.1.1. Proof of Lemma 6

6.2. Smaller scales

7. Lindeberg swapping and Tao-Vu replacement estimates

Lemma 7**.**

Lemma 8** (Basic Lindeberg swapping).**

Proof.

8. Proof of universality for complex correlation functions

Lemma 9** (Monte Carlo sampling).**

Claim 5**.**

9. Counting local non-real roots

Lemma 10**.**

Lemma 11**.**

9.1. Proof of Lemma 10

9.1.1. Proof of Lemma 10, part (i)

Condition 1.

Theorem 1 (Comparison principle).

Theorem 2.

Theorem 3.

Theorem 4.

Theorem 5.

Lemma 1.

Lemma 2.

Lemma 3.

Corollary 1.

Corollary 2.

Corollary 3.

Theorem 6.

Theorem 7.

Theorem 8.

Theorem 9.

Corollary 4.

Claim 1.

Claim 2.

Theorem 10.

Proposition 1.

Proposition 2.

Lemma 4.

Claim 3.

Claim 4.

Theorem 11.

Lemma 5.

Theorem 12.

Corollary 5.

Lemma 6.

Lemma 7.

Lemma 8 (Basic Lindeberg swapping).

Lemma 9 (Monte Carlo sampling).

Claim 5.

Lemma 10.

Lemma 11.

Claim 6.

Claim 7.

Corollary 6.

Proposition 3.

12.1. Estimates for $I_{2}$

Lemma 12.

Lemma 13.

12.2. Estimates for $I_{1}$

Lemma 14.

Lemma 15.

Lemma 16.