Real roots of random polynomials with coefficients of polynomial growth: a comparison principle and applications
Yen Q. Do

TL;DR
This paper introduces a comparison principle to analyze the distribution of real roots in random polynomials with coefficients of polynomial growth, extending results beyond zero-mean cases and applying to various polynomial classes.
Contribution
It develops a novel comparison principle that reduces the analysis of non-centered coefficients to the mean-zero case, enabling new results for diverse random polynomial models.
Findings
New estimates for the number of real roots in various polynomial classes
Logarithmic integrability estimates for random polynomials
Sharp local estimates for real zeros
Abstract
This paper seeks to further explore the distribution of the real roots of random polynomials with non-centered coefficients. We focus on polynomials where the typical values of the coefficients have power growth and count the average number of real zeros. Almost all previous results require coefficients with zero mean, and it is non-trivial to extend these results to the general case. Our approach is based on a novel comparison principle that reduces the general situation to the mean-zero setting. As applications, we obtain new results for the Kac polynomials, hyperbolic random polynomials, their derivatives, and generalizations of these polynomials. The proof features new logarithmic integrability estimates for random polynomials (both local and global) and fairly sharp estimates for the local number of real zeros.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometry and complex manifolds
Real roots of random polynomials with coefficients of polynomial growth: a comparison principle and applications
Yen Q. Do
Department of Mathematics, The University of Virginia, Charlottesville, VA 22904-4137
Abstract.
This paper seeks to further explore the distribution of the real roots of random polynomials with non-centered coefficients. We focus on polynomials where the typical values of the coefficients have power growth and count the average number of real zeros. Almost all previous results require coefficients with zero mean, and it is non-trivial to extend these results to the general case. Our approach is based on a novel comparison principle that reduces the general situation to the mean-zero setting. As applications, we obtain new results for the Kac polynomials, hyperbolic random polynomials, their derivatives, and generalizations of these polynomials. The proof features new logarithmic integrability estimates for random polynomials (both local and global) and fairly sharp estimates for the local number of real zeros.
2000 Mathematics Subject Classification:
30B20
Y.D. partially supported by NSF grant DMS-1800855.
Contents
- 1 Introduction and statement of results
- 2 Sample applications of the comparison principle
- 3 Correlation functions: background and main estimates
- 4 Local anti-concentration inequalities
- 5 Logarithmic integrability of random polynomials
- 6 Counting local real roots
- 7 Lindeberg swapping and Tao-Vu replacement estimates
- 8 Proof of universality for complex correlation functions
- 9 Counting local non-real roots
- 10 Proof of universality for real correlation functions
- 11 Reduction of Theorem 1 to Gaussian polynomials
- 12 Proof of Theorem 1 for Gaussian polynomials
1. Introduction and statement of results
This paper seeks to further explore the distribution of the real roots of random algebraic polynomials
[TABLE]
where the coefficients are independent real-valued random variables with finite means and finite variances. We are particularly interested in the average number of real roots of such polynomials. This problem has attracted many mathematicians’ attention since previous centuries, initially out of theoretical curiosity, but has recently found applications in statistical physics and finance [10, 29, 28, 30]. It was reported in [34] that during the th century Waring considered the distribution of the real roots for random polynomials of low degrees. It however took quite a while until the first (but rather crude) estimates for the number of real roots for random polynomials were established, in a result of Bloch and Polya at the beginning of the 20th century [1]. Various authors subsequently worked on this problem, leading to significant developments during 1940s-1970s, with seminal contributions of Kac [17], Littlewood and Offord [19, 20, 21], Ibragimov and Maslova [12, 14, 15, 13, 22, 23], among others. Recently, there has been a renewed interest in this problem [6, 16, 11, 2, 9, 31, 32, 5, 8, 27], in particular Tao and Vu [33] developed a new framework to study the real roots of random polynomials, adapting their methods from random matrix theory. See also [25, 3, 4, 26] for some further development of the methods in [33].
Despite the large number of prior studies, only a very few are about random polynomials with non-centered coefficients, namely when the coefficients may have nonzero means. Furthermore, these studies often require very restrictive assumptions of algebraic nature on the relationship between the mean, the variance, and the underlying index of the coefficients. Ibragimov and Maslova [14, 15] in 1970s considered random polynomials with iid coefficients of nonzero mean (these are known as Kac polynomials). They showed that the expected number of real roots for the Kac random polynomials is essentially reduced to a half if the iid coefficients have a (common) nonzero mean. In [4], a joint work with Oanh Nguyen and Van Vu, using different methods we strengthened and generalized this result to random polynomials where the mean and the variance of the coefficient are linearly dependent and furthermore they are algebraic polynomials of .
In this paper, we consider an innovative approach that circumvents the needs for algebraic constraints between the mean and the variance of the coefficients and does not require any algebraic dependence on the underlying index. In particular, this approach offers some explanation for the interaction between the mean and the variance of random polynomials. We focus on generalized Kac polynomials, an important class where the typical values of the coefficients are comparable to a fixed power of the underlying index. We will discuss below the technical details of our set up.111It may be possible that the current approach will be applicable to some other classes of random functions (such as those studied in [26]), however this will not be explored in this paper and left for further studies.
For convenience of notation, we write where
[TABLE]
Note that we do not assume and prefer to leave the setup in this generality for the convenience of the proof. Let . For the typical values of to be comparable to , it is natural to assume that and is comparable to , so that there is a significant range of values for about the size of . The following condition essentially describes these assumptions. For technical reasons, below we will need .
Condition 1**.**
Assume that for some and it holds that
(i) for all ;
(ii) for all ;
(iii) for .
We note that and may depend on . Without loss of generality, we may assume that throughout the paper. The implicit constants in this paper are allowed to depend on the implicit constants in Condition 1, which include .
We now mention several examples that satisfy Condition 1. Via Stirling’s formula, it can be seen that the coefficients of hyperbolic random polynomials222For discussions about the importance of random hyperbolic polynomials in statistical physics, we refer the reader to the beautiful lecture notes [10].
[TABLE]
satisfy the above condition; here and ’s are independent with unit variance. In particular, if we recover the Kac random polynomials. In fact, we may generate other examples satisfying Condition 1 by taking finite linear combinations of hyperbolic polynomials and their derivatives. Now, while our approach works with more general polynomials, even for the polynomials considered in [4, 14, 15] we are also able to obtain significantly new results.
1.1. Notational conventions
Throughout the paper, for any function we let denote the number of its real roots, and let be the number of roots inside . Note that these numbers could be , but they are never negative.
By we mean , in other words there is a finite constant such that and the constant is allowed to depend on the parameters . Sometimes we will simply write (without mentioning the parameters ) when is an absolute consatnt or if it is clear from the context what could depend on. When both and hold we will write , and we use the same convention for .
The reciprocal polynomial for a polynomial of degree is .
1.2. Statement of results
To study , we write
[TABLE]
where is a deterministic polynomial and is a random polynomial with zero mean. Our heuristics is the following idea: locally, between and , the dominant component will dictate the behavior of and hence will have a stronger influence on the number of real zeros of .
Our main result, Theorem 1 is an estimate for the number of real roots of inside an arbitrary interval, demonstrating the following comparison principle:
(i) if dominates then on average there are very few real roots for , as is typically bigger than .
(ii) if is dominated by then on average the number of real roots of is the same as the number of real roots of plus a bounded term.
In the statement of Theorem 1, we will be more precise about the meaning of “dominated” and “dominates”. Here we make some preliminary remarks. First, since is random with zero mean, it makes sense to use the standard deviation as an indicator for the size of , and this heuristics is also used for derivatives of . For , to compare and it turns out to be more convenient to work with the reciprocal polynomials and .
In the following, we say that is an enlargement for if it is obtained by extending to the left and to the right a little bit: generally speaking this means there is an absolute constant such that the added length to the right is bounded below by c(\Big{|}1-|b|\Big{|}+\frac{1}{n}) and the added length to the left is bounded below by c(\Big{|}1-|a|\Big{|}+\frac{1}{n}).
There are special cases when the enlargement requirement could be made less stringent (without affecting our main results below): if is bounded below by any positive absolute constant then there is no need to extend to the right and we may use as the right endpoint for , and similarly if is bounded below by any positive absolute constant then we may take as the left endpoint for . These improvements are made possible with the aid of Lemma 2.
We note that the above notion of enlargement can also be similarly defined for half open/half closed/closed/infinite intervals. In all cases, the following will be true: if is an enlargement of then it also qualifies as an enlargement of any subintervals of .
Theorem 1** (Comparison principle).**
There is a constant such that the following holds. Assume that the coefficients of satisfy Condition 1 and are real valued. Let be an interval whose endpoints may depend on and assume that is an enlargement of .
Let and for .
(1) Assume that
- •
if then ,
- •
if then .
Then
(2) Let such that for some .
Assume that for each we have the uniform estimates:
- •
if then ,
- •
if then ,
and for the weaker estimates without also hold on and .
Then
We note that Theorem 1 is more useful for intervals near , since under Condition 1 it can be shown (using a standard argument of Ibragimov and Maslova) that if is bounded away from (see Lemma 2).
In Theorem 1, for technical reasons we need to assume that the domination relationship (between and ) is effective on an enlargement of , however if is a Gaussian random polynomial then the conclusions hold with and some of the conditions could be weakened, see Section 12. The proof of the Gaussian case in Section 12 will also shed more light on the motivation for the assumptions on and in the statement of Theorem 1. One of the main technical ingredients in our proof is a new result about universality for the correlation of the roots of , see Section 3.
Using Theorem 1, we could derive new results about the real roots of non-centered random polynomials (with coefficients of power growth) from analogous results for centered random polynomials, which in turn were studied extensively in [4]. Below, we summarize several sample results that can be obtained in this direction (although this list is by no means comprehensive).333A more thorough discussion about possible applications is included in Section 2, where these sample results will be derived from Theorem 1. The sample results will further demonstrate the following observation from [4]: we may extract asymptotic estimates for the number of real roots of a random polynomial from asymptotic information about its coefficients. This phenomenon was first observed in [4] for random polynomials with centered coefficients of polynomial growth.
Below, following [4], we define a generalized polynomial of to be a finite linear combination of hyperbolic coefficients , . Its degree is defined to be , where is the biggest in the combination. If we requires to be integer then this notion is the same as the classical notion of polynomials. Note that (via Stirling’s formula) a generalized polynomial of degree is asymptotically comparable to .
Our first sample result is about random hyperbolic polynomials (1.1).
Theorem 2**.**
Let be the hyperbolic random polynomial given by (1.1) where are independent with a common nonzero mean and variance and uniformly bounded moments for some .
[TABLE]
[TABLE]
Theorem 2 is a special case of the following more general result.
Theorem 3**.**
Assume that the coefficients of satisfy Condition 1. Assume furthermore that there are such that and
[TABLE]
[TABLE]
in particular grows like as . Furthermore, if for some we have as then
[TABLE]
In particular, if is a generalized polynomial of then
[TABLE]
Theorem 2 may be derived from Theorem 3 as follows. Letting , we note that for the set up of Theorem 2 we will have for some , and by Stirling’s formula . On the other hand,
[TABLE]
Using Theorem 3, it follows that , and thus using [4] we obtain the desired conclusions. We may argue similarly to get the desired asymptotics for .
Below is a class of random polynomials where the deterministic component is dominated by the random component .
Theorem 4**.**
Assume Condition 1 and assume that for some we have . Then there are finite positive constants and such that
[TABLE]
Furthermore if for some we have as then we could take to be . In particular, if is a generalized polynomial of then we could let .
Finally, we mention a simple class of random polynomials where dominates , leading to very few real zeros for the random polynomial.
Theorem 5**.**
Assume Condition 1. Suppose furthermore that for some and some the following holds: for odd we have and for even we have . Then
[TABLE]
Furthermore, the above estimate holds true if we interchange the role of odd and even ’s in the above assumptions.
1.3. Outline of the paper
In the next section, we discuss the applications of Theorem 1 and the proof for the sample results mentioned above. In the rest of the paper, we prove Theorem 1. Our proof of Theorem 1 uses universality estimates for the correlation functions of the real roots of , see Section 3. Using these estimates, we could reduce the proof of Theorem 1 to the Gaussian setting. The Gaussian case of Theorem 1 will be examined using the Kac-Rice formula, see Section 12.
2. Sample applications of the comparison principle
In this section, we discuss several applications of Theorem 1 and present the proofs for Theorem 3, Theorem 4, and Theorem 5. We will use the following basic computation about power series.
Lemma 1**.**
For any and and any and the following holds:
(i) If then .
(ii) If then .
Proof of Lemma 1.
Note that if then are all comparable to , therefore . Here, to see the last estimate we may split the sum into and , and use the fact that for the first range and for the second range . This proves part (ii), and furthermore in part (i) we may assume that where is sufficiently large. We now discuss the proof of part (i) under this assumption.
We consider first the case . By Taylor’s theorem, we have , where the error term is nonnegative. Now, note that , therefore
[TABLE]
For the other direction of the estimate, it suffices to establish that the error term is smaller than fraction of when is sufficiently large. Here we use the Lagrange form of the error term, which says that for some we have
[TABLE]
The desired estimate then follows from the fact that is a decreasing function for , and
[TABLE]
and could be made arbitrarily small by choosing sufficiently large.
We now consider the general situation. We have
[TABLE]
Thus it remains to show that the remaining summation over is (note that this summation is nonnegative). For these ’s we note that is comparable to . Since we may choose depending on such that . Let be its conjugate exponent. Then using Hölder’s inequality we have
[TABLE]
This completes the proof of Lemma 1. ∎
Let be a sufficiently large constant and let . In the applications of Theorem 1, we will need the following estimate.
Lemma 2**.**
For any we have .
We include a proof of Lemma 2 using an argument of Ibragimov–Maslova [13] (see also [4] where a simpler version of Lemma 2 was proved). We’ll need the following estimate, which will also be used later in the proof of Theorem 1.
Lemma 3**.**
For any there is such that for any we have .
Proof of Lemma 3.
Let and let .
We first consider . Without loss of generality assume , the case is can be treated similarly. Then
[TABLE]
Thus we may take any for .
We now consider . Then . Therefore,
[TABLE]
Let . Since , we obtain
[TABLE]
for some where and are as in Condition 1. Thus by examining the function of , it is follows that there is some such that any that satisfies the above inequality must be inside . Consequently , as desired. ∎
Proof of Lemma 2.
It suffices to show that for we have and . We will show in detail the first estimate, and comment on the needed changes for the second estimate.
Take any . Let be as in Lemma 3. From Condition 1, let be such that for . Define
[TABLE]
for each , and define .
For it is clear that we have .
For , we have , thus it suffices to show that
[TABLE]
On the event , we have , thus using Jensen’s formula we have
[TABLE]
Let be an integer larger than . Using convexity and Jensen’s inequality, we have
[TABLE]
To estimate , we proceed similarly, and the following estimate will be needed:
[TABLE]
where . To see this estimate, we note that
[TABLE]
then we split the sum into and and argue as in the proof of Lemma 1. The treatment of is entirely similar as before, but for we actually need to be more careful (than the proof of Lemma 1) about the dependence on of the implicit constant. We include the details below. By Cauchy–Schwartz we have
[TABLE]
∎
We now divide the discussion of the applications of Theorem 1 into three sections, corresponding to whether is always small, or always large, or mixed large/small, in comparison to .
2.0.1. Small mean
Here the mean will be completely dominated by . We first state a corollary of Theorem 1 in this direction, before proving Theorem 4.
Corollary 1**.**
Let such that for some . Assume Condition 1 and assume that there is a constant such that for and we have
[TABLE]
and assume that the weaker estimates without also hold true for . Then there are finite positive constants and such that
[TABLE]
Furthermore if for some we have as then we could take to be . In particular, if is a generalized polynomial of then we could let .
Thanks to [4], the zero-mean case (i.e. for all ) of the above corollary already holds true. Thus, using Lemma 2 and Theorem 1, Corollary 1 is a simple consequence of the following estimates
[TABLE]
which follows from elementary computations (see Lemma 1 for details).
We now prove Theorem 4. Since , we may assume without loss of generality that . Using Lemma 1, for we then have
[TABLE]
which clearly implies (2.3). Thus Theorem 4 follows from Corollary 1.
2.0.2. Large mean
Here near the mean will always dominate . As before, we state a corollary of Theorem 1 before proving Theorem 5.
Corollary 2**.**
Let be such that as . Assume Condition 1 and assume that there is a constant with the following properties: for we have
[TABLE]
[TABLE]
This corollary follows immediately from (2.2) and Theorem 1 and Lemma 2. We now apply this corollary with to prove Theorem 5. By splitting and using Lemma 1 to treat each of them individually, we obtain (for )
[TABLE]
where . Thus Theorem 5 follows from Corollary 2.
2.0.3. Mixed case
Here we consider the mixed situation, where is dominated by on a part of the real line and dominates elsewhere. In our opinion this is the most interesting case. Here we describe a simple scenario, which applies to random Kac polynomials with non-centered coefficients (considered in [15]) as well as linear combination of derivatives of a random Kac polynomial (considered in [4]), and also hyperbolic random polynomials with non-centered coefficients (Theorem 2 of the current paper). In this scenario, is dominated by near while being the dominant component near . (Note that due to symmetry we could also state a symmetric version where the roles of and are interchanged.)
Corollary 3**.**
Let be such that as . Let such that for some . Assume Condition 1 and assume that there is a constant with the following properties:
(i) for we have
[TABLE]
(ii) for and for each we have
[TABLE]
and the weaker estimates without also hold true for . Then
[TABLE]
and in particular there are constants such that
[TABLE]
Furthermore if for some we have as then we could take to be . In particular, if is a generalized polynomial of then we could take .
Now, it was shown in [4] that grows like , and furthermore if then , and the error term could also be improved to if is a generalized polynomial of . Thus, Corollary 3 is an immediate consequence of Theorem 1 and (2.2).
We now discuss the proof of Theorem 3. From the given assumption it follows that are of the same sign for , so without loss of generality we may assume that for . Now, using and one may show that dominates near . Indeed, by elementary computations (see Lemma 1), for we have
[TABLE]
We now show that is dominated by near . To see this, let and we use discrete integration by parts to write
[TABLE]
and uniformly over we have for . On the other hand, using the given hypothesis we may estimate
[TABLE]
Without loss of generality we may assume . Since , we obtain
[TABLE]
where .
Similarly, for we may estimate, with the assistance of Lemma 1,
[TABLE]
Thus Theorem 3 follows from Corollary 3.
3. Correlation functions: background and main estimates
In this section, we summarize our main results about correlation functions for and . These estimates are key ingredients in the proof of Theorem 1 and the proof for these estimates will be presented in subsequent sections.
We first recall some background about correlation functions, following [33, 4]. While there is a more general theory of correlation functions for random point processes, see for instance [10], our discussion will specialize to the context of the roots of random polynomials. Let denote the multi-set of the (complex) roots of , where a root of multiplicity will be identified as different elements.
For , we say that a Borel measure on is the -point correlation measure for the (complex) roots of if the following equality holds for any continuous and compactly supported function :
[TABLE]
Here, the summation on the left hand side (inside the expectation) is over all ordered -tuples of different elements of . The existence of such a measure is a simple application of the Riesz representation theorem. In the literature, it is common (see e.g. [33]) to define the -point correlation function as the density of with respect to the Lebesque measure (which exists for instance in Gaussian settings [10] or more generally smooth distributions), here we will work with correlation measures to allow for more generality.
When is a real polynomial (i.e. with real-valued coefficients), the set of complex zeros for is symmetric with respect to the real line, and there may be a nontrivial probability that has at least one real root. Thus, for such polynomials we will define the mixed complex-real correlation measures for the roots as follows. Let and and let be a measure on . We say is the -point correlation measure for if the following two conditions hold:
(i) is symmetric under complex conjugations: for any measurable , it holds that where is one of the sets obtained from by taking conjugate in one fixed coordinate;
(ii) for any compactly supported continuous we have
[TABLE]
Here, the summations on the left hand side are over ordered tuples of different elements of . If has a density with respect to the Lebesgue measure, such density is classically called the -point correlation function [33], which will then be invariant under taking complex conjugation of any variable.
We now define the admissible local sets where comparison estimates for the correlation measures will be proved. These are sets where the expected number of complex roots for could be as small as a bounded constant . For random polynomials with centered-coefficients, the structure of these sets is well-known and has been exploited by previous authors, here we will use the same structure for random polynomials with non-centered coefficients, following [4].
Let that may depend on . Define
[TABLE]
Define and define .
Let be the reciprocal polynomial of .
Below, we say that two (possibly complex valued) random variables and have matching moments to up to second order if
[TABLE]
for any such that . Note that if one of , is real valued then this matching condition will force the other to be real-valued. The Gaussian analogue of if is defined to be where are independent Gaussian and and have matching moments up to the second order.
Our main result about the mixed complex-real -point correlation functions for the roots of is stated below, here and . In Theorem 6, we consider a real random polynomials whose coefficients satisfy Condition 1, and we let and denote the -point correlation measures for the roots of and . The Gaussian analogues of these two correlation measures will be denoted by and .
In the following, it is understood that all implicit constants may depend on the implicit constants in Condition 1.
Theorem 6**.**
*Given , we could find such that the following holds for any and any :
*Let be supported on such that as a function on it is in and furthermore up to order .
Let be such that for any the following holds444Note that the interval has this property, although in the applications we may work with much thinner intervals (which is allowed if is small).:
- •
if and then .
- •
*if and then . *
(i) Assume that uniformly on , or for all . Then
[TABLE]
(ii) Assume that uniformly on , or for all . Then
[TABLE]
Our proof will use the following result for the -point complex correlation functions, where . In Theorem 7, we consider a (possibly complex valued) random polynomial whose coefficients satisfy Condition 1. Below we let and denote the -point correlation measures for the zeros of and , and let and be their Gaussian analogues.
Theorem 7**.**
*Given any , we could find constants such that the following holds for any and any :
Let be supported on such that as a function on it is and furthermore up to order .
Then
[TABLE]
Our Theorem 7 slightly generalizes [4, Theorem 2.3]. Here we point out an example outside the scope of [4]. Recall that in [4, Theorem 2.3] it is assumed that where are independent with unit variance (but could have nonzero means). In our setting, with , if is a nonzero constant with probability (which is allowed to happen for or according to Condition 1) then it is not possible to write where of variance .
We will prove Theorem 7 using an adaptation of the proof of [4, Theorem 2.3]. We take this as an opportunity to provide a more streamlined presentation of the argument in [4], in particular in the proof we will prove new estimates involving log integrability of random polynomials and bounds on the local number of roots, which could be of independent interests.
4. Local anti-concentration inequalities
In this section we will prove several anti-concentration inequalities for random polynomials whose coefficients satisfy Condition 1. We will use these estimates later in the proof of Theorem 7. Below, let be the normalized reciprocal polynomial for . Recall that
[TABLE]
Our first set of estimates is contained the following theorem:
Theorem 8**.**
Let . Then there are constants such that the following holds for any and any and any :
[TABLE]
Now, if then Theorem 8 does not give us much information: the right hand sides of (4.1) and (4.2) are now comparable to , therefore these estimates hold automatically. In this range of , the following set of estimates is more useful. Below, let .
Theorem 9**.**
Let . Then there is a constant such that the following holds for any and any and any :
[TABLE]
As a corollary of Theorem 8 and Theorem 9, we obtain
Corollary 4**.**
Let . Then there is a constant such that the following holds for any and any : for any there is a constant such that
[TABLE]
Proof of Corollary 4.
Below we only prove the claimed estimate for , and the same argument specialized to the case can be applied to . Using Theorem 8 and Theorem 9, for any we have
[TABLE]
Thus, for any we have
[TABLE]
On the other hand, for any we have
[TABLE]
∎
4.1. Proof of Theorem 8
Recall that . Using Condition 1, we may find and such that
[TABLE]
for all , while for .
We first prove (4.1). Since the left hand side of (4.1) is , we may assume without loss of generality that for a large absolute constant . In particular, we will have , thus .
Now, there is a constant depending only on such that for all . Therefore, we may choose such that is very small. In particular, we may choose such so that . Now, observe that, thanks to (4.5),
[TABLE]
for any . Therefore,
[TABLE]
for any .
We now recall the following anti-concentration bound:
Claim 1**.**
Let . Then there are constants such that the following holds for any : If are independent with zero mean and unit variance satisfying , then for any lacunary sequence we have:
[TABLE]
For a proof of this now-standard bound, see e.g. [33, Lemma 9.2] or [4, Lemma 4.2]. We apply the above anti-concentration bound to and to the random variables . By absorbing the remaining terms in into the concentration point , it follows that
[TABLE]
for any . To obtain the desired estimate (4.1) from this inequality, we will choose to depend on , and this choice is explained below.
First, note that , which is uniformly bounded away from [math] and since , we may find a constant such that . It follows that
[TABLE]
For convenience, let be such that . We then let to be the integer such that
[TABLE]
Now, since the left hand side of (4.1) is we may assume without loss of generality that . To check that this will lead us to (4.1), we divide the consideration into two cases:
Case 1: .
It follows from the above constraint on that . In this range of we may use (4.7), and obtain
[TABLE]
Thus by ensuring we obtain (4.1).
Case 2: .
Here (4.7) is not available, however we observe that the LHS of (4.1) is nondecreasing with respect to . Therefore, using the case of Case 1, we obtain
[TABLE]
Since , the last estimate can be bounded above by for some . This completes the proof of (4.1).
We now discuss the proof of (4.2), which will follow the same argument. For convenience of notation, we let , where , and . It is clear that and for , therefore we may apply the special case of (4.1) to the random polynomial . The desired estimate for then follows by absorbing the other terms into the concentration point .
4.2. Proof of Theorem 9
Below we only prove (4.3), and (4.4) can be obtained from (4.3) by arguing as in the proof of Theorem 8 in the last section.
The proof uses the following generalization of a lemma of Erdös (for a proof see [4, Lemma 4.1]):
Claim 2**.**
Let . Then there is a constant such that the following holds for any : If are independent and then for any we have
[TABLE]
Let , where is such that is comparable to for (thanks to Condition 1). Applying the above estimate to for , it follows that
[TABLE]
Now, we may choose be sufficiently large such that . For any , it holds that , therefore
[TABLE]
Collecting estimates, for large enough we will have
[TABLE]
for any integer . To obtain the desired estimate (4.3) from this inequality, we will choose suitably depending on . We will choose to be the integer such that
[TABLE]
Now, since the LHS of (4.3) is , we may assume without loss of generality that . To show that this choice would give us (4.3), we divide the consideration into two cases:
Case 1: . For such we may use (4.8). We note that, as a consequence of the above constraint on , we will have . Consequently,
[TABLE]
Case 2: . Here we will use monotonicity of the left hand side of (4.3) (as a function of ). Since we now have , it follows that
[TABLE]
This completes our proof of Theorem 9.
5. Logarithmic integrability of random polynomials
This section is devoted to establishing several estimates about the integrability of and , which will be used to prove bounds for the number of local real roots of in subsequent sections. Throughout this section, we’ll assume that the coefficients of satisfy Condition 1. For convenience, let .
5.1. Logarithmic integrability on the unit disk
We start with an estimate about integrability on the unit disk . We view this as a global estimate.
Theorem 10**.**
There are absolute constants and an event of exponentially decaying probability such that the following holds:
[TABLE]
for all , and the analogous estimate also holds for .
We note that the exclusion of an exceptional set of exponentially decaying probability is important. To see this, suppose that for all , then on the event , which has an exponentially decaying probability if for some fixed we have for all . Such event must be excluded to ensure any integrability for on .
Without loss of generality we may assume that in the proof. Given such a condition, the right hand side of (5.1) is a strictly increasing function of the implicit constant , which will be convenient in the proof.
To start, we note that the estimate (5.1) follows from a slightly weaker estimate:
Proposition 1**.**
There is an event of exponentially decaying probability (for some fixed ) such that the following holds: for any , there is a constant such that
[TABLE]
for all , and the analogous estimate also holds for .
Indeed, the key observation here is that the the implicit constant does not depend on . If (5.2) holds, using Holder’s inequality we have, for any :
[TABLE]
The desired conclusion (5.1) then follows by choosing .
The main ingredient in the proof of Proposition 1 is a result of Nazarov-Nishry-Sodin [24, Corollary 1.2] for random Fourier series, summarized below:
Proposition 2**.**
[24]** There is an absolute constant such that the following holds: Let where are deterministic with and are independent Rademacher random variables. Then for any
[TABLE]
Our proof will actually use the following simple extension of Proposition 2.
Lemma 4**.**
There is an absolute constant such that the following holds for any measurable with : Let where are deterministic with and are independent Rademacher random variables. Then for any we have
[TABLE]
In Lemma 4, we could in fact replace the constant by any constant bigger than (for our applications any absolute constant would suffice).
5.1.1. Proof of Lemma 4
To prove Lemma 4, we will use the following crude estimate. For convenience of notation, let and let denote the Lebesgue measure of measurable subsets of .
Claim 3**.**
There is an absolute constant such that for any and we have
[TABLE]
[TABLE]
Now, let be integer such that , we then have
[TABLE]
This competes the proof of Claim 3.
In the proof of Lemma 4, we will use another estimate, which in turn is a consequence of Proposition 2.
Claim 4**.**
There is an absolute constant such that for any and we have
[TABLE]
Since the left hand side of the above estimate is always bounded above by and since for any , we may assume without any loss of generality. For such , it suffices to show that
[TABLE]
Let be iid copies of , such that are independent Rademacher random variables. Let , which are also independent Rademacher random variables. We have
[TABLE]
[TABLE]
This completes the proof of Claim 4.
We are now ready to start the proof of Lemma 4. We combine Claim 4 and Claim 3 and estimate
[TABLE]
This completes the proof of Lemma 4.
5.1.2. Proof of Proposition 1
We now start the proof of (5.2) for . For convenience of notation, we denote to keep track of the dependence of on the vector of coefficients . Let
[TABLE]
We first show that for some . Since are independent and for , it suffices to show that that there are constants and such that for all . This was proved in Lemma 3.
We now divide the remaining of the proof into two cases: the simpler case when are symmetric for each , and the general case where no symmetry is assumed.
Case 1: Symmetric coefficients.
Assume that for each the distributions of and are the same.
Let be independent Rademacher random variables that are independent from , and let . Thanks to symmetry, has the same distribution as . Note that , therefore and is independent of . Thus it suffices to show that, for any large enough,
[TABLE]
Note that on the event we have , which implies . Conditioning on this event and using Lemma 4, we obtain
[TABLE]
here depends on . Thus, it remains to show that
[TABLE]
for some (independent of ). This estimate in turn follows from concavity of on and Jensen’s inequality:
[TABLE]
Case 2: General coefficients.
We now drop the assumption that the distribution of ’s are symmetric. To show (5.1), it suffices to show that, for large enough,
[TABLE]
for any and any . Since the left hand side of (5.1.2) is , this estimate holds trivially for . Thus, we will assume below that , in particular we may replace by on the right hand side without any loss of generality.
Now, let . We divide the proof of (5.1.2) into two parts, depending on whether or .
Smaller ’s: For , we have , thus it suffices to show that
[TABLE]
Now, , and
[TABLE]
Thus, it remains to show that is bounded by the right hand side of (5.5).
Let be iid copy of that are independent of each other and of other ’s. Let , then is symmetric with mean zero and variance . We also have uniform over , thanks to Condition 1. One could easily show that (with the same as in the estimate for , although this it not important - we could refine the constant for so that these two exceptional sets share the same constant from the beginning of the proof).
Now, using Hölder’s inequality, we obtain
[TABLE]
Let be sufficiently large, then using the known estimates for the symmetric case, which applies to and , we may generously estimate the last display by
[TABLE]
This completes the proof of (5.1.2) for this range of .
Larger ’s: For , we proceed as follows. Let be independent Rademacher random variables that are independent from ’s. Let and consider the symmetrized variant of , namely
[TABLE]
Using Hölder’s inequality, for any we have
[TABLE]
Here, in the last estimate we used the fact that is equal to with probability . Observe that . Thus, using the (known) estimate for the symmetric case, we can further estimate the last display by
[TABLE]
Since , it follows that by taking we have and we obtain the desired estimate.
This completes the proof of the desired estimate (5.2) for of Proposition 1.
We now discuss the proof for the analogous estimate for . For convenience of notation, let where , , and . In particular, . We similarly let
[TABLE]
where . Using Condition 1, we have for , therefore by the same argument as before we obtain for some . Now, the proof of the symmetric case is entirely the same as before once we verify that on it holds that
[TABLE]
But this is clear using Condition 1. Finally, the proof of the general case follows from the symmetric case as long as we could verify that , which again is clear from Condition 1.
5.2. Logarithmic integrability on local sets
In this section we will prove a probabilistic upper bound regarding the local integrability of and where . This is an estimate on a ball of radius comparable to the scale with center near . All implicit constants below may depend on the implicit constants in Condition 1.
Theorem 11**.**
Let be such that and let be big enough depending on . Then for any and and there is an event with probability such that the following estimate holds uniformly over :
[TABLE]
and the analogous estimate also holds if we replace by .
As a consequence Theorem 11, we obtain
[TABLE]
(and the analogous estimate for ), which is reminiscent of Theorem 10.
Using Lemma 1, we have the following probabilistic estimates for :
Lemma 5**.**
Let . For it holds for any and that
[TABLE]
Proof of Lemma 5.
Recall that and thanks to Condition 1. Using Lemma 1 and Cauchy-Schwartz, for any and we have
[TABLE]
Since , we obtain
[TABLE]
The desired probabilistic estimate for then follows immediately.
Now, the proof of the claimed probabilistic estimate for is similar. For convenience of notation, let . Using Cauchy Schwarz and Lemma 1 we have, for and :
[TABLE]
Again, , and the desired estimate follows immediately. ∎
5.2.1. Proof of Theorem 11
We will only show the proof for the claimed estimate for , and the same argument works for . Fix . Let be big enough so that Corollary 4 holds.
Thanks Corollary 4, we may assume that
[TABLE]
for some large. Let . Then for we have , so thanks to Lemma 5, it holds with probability that
[TABLE]
for large.
Below, we will condition on the event where (5.6) and (5.7) hold, on which we will show that
[TABLE]
Now, the integrand will blowup near the zeros of , however only logarithmically. The above assumptions on will ensure that there are not many such zeros near , and the main part of the argument is to control the zero-free part of using properties of subharmonic functions.
More specifically, let be the number of zeros of in . As a consequence of Jensen’s formula, we have
[TABLE]
Now, let be the zeros of in . Let , this is a (random) polynomial having no zeros inside , we view as the zero-free part of . It follows that, for any ,
[TABLE]
Since , it remains to bound the integral involving . In fact, we will show that uniformly on , which is a stronger estimate. To see this, we first show that satisfies inequalities similar to (5.6) and (5.7). Indeed, note that is a subharmonic function, and by the maximum principle it achieves its maximum on the boundary. It follows that
[TABLE]
On the other hand, since for all , we also have
[TABLE]
Thus we have verified that satisfies inequalities similar to (5.6) and (5.7). Now, let for a big constant such that is nonnegative (and harmonic) on . Note that
[TABLE]
Using Harnack’s inequality, for any we have
[TABLE]
It follows that for any , as desired.
6. Counting local real roots
In this section, we will use the log integrability estimates and the anti concentration estimates from previous sections to establish several estimates for the local number of real roots for .
For each and any function analytic on a neighborhood of , let denote the number of roots of inside .
In this section, we assume that the coefficients of satisfy Condition 1, and all implicit constants may depend on the implicit constants in Condition 1.
Theorem 12**.**
Let be such that . Then there are constants such that the following holds: for any and any and any and any event we have
[TABLE]
The analogous estimate also holds for . Furthermore, for we could take .
It follows from Theorem 12 that the number of roots of and on are at most logarithmic away from . We state a useful corollary, when .
Corollary 5**.**
Let be such that . Then there are constant s such that for any and any we have
[TABLE]
Furthermore, for we could take .
We will divide the proof of Theorem 12 into two cases, depending on whether is small or large. More specifically, we will consider first for some sufficiently large constant , this is the large scale setting. Then we will consider the case when and refer to this as the small scale setting.
6.1. Larger scales
We will use the following sublevel set estimate.
Lemma 6**.**
Let be such that . Let be sufficiently large. Let and assume that . Then uniformly over we have
[TABLE]
Let be large compared to the constant from Lemma 6. Using Lemma 6 , we will prove (6.1) for . We will only show the details for , the same argument could be applied to . Now, for brevity let and . Since trivially, we obtain
[TABLE]
if is sufficiently larger than . It follows that
[TABLE]
6.1.1. Proof of Lemma 6
Let . Using Jensen’s formula, we have
[TABLE]
[TABLE]
[TABLE]
For the first term on the right hand side of (6.3), we apply Lemma 5 with and note that is a lot larger than any given power of .
For the second term on the right hand side of (6.3), we use Theorem 8 with and use the assumption that (where is very large) to get the desired estimate.
The proof for is entirely similar.
6.2. Smaller scales
We now consider the smaller (and more critical) range . Here we will use Theorem 10 (from Section 5) about the log integrability of and , which shows that there is an event with probability such that for any we have
[TABLE]
where is sufficiently large, and the analogous estimate also holds for . We will use these estimates to show the desired estimates for in this range of , and the argument for is entirely similar.
[TABLE]
which is for any . Thus, we may assume without loss of generality that . For convenience, denote and where . Let be a smooth function such that and let denote the -preserving dilation of . We now use Green’s formula
[TABLE]
where is the Lebesgue measure on . It follows that
[TABLE]
[TABLE]
Consequently, using Hölder’s inequality, the following holds for any
[TABLE]
Recall that and note that and . Therefore, using (6.4) for , we obtain
[TABLE]
Choosing , then , therefore
[TABLE]
Now, if , then it is clear that the last right hand side is . If then it is clear that , consequently
[TABLE]
This completes the proof of Theorem 12.
7. Lindeberg swapping and Tao-Vu replacement estimates
Our goal in this section is to establish the following result, which is a simple extension of a replacement estimate in Tao–Vu [33] to non-centered polynomials.
Lemma 7**.**
For any there is so that the following holds.
Let be independent with and such that and have matching moments up to second order, for at least indices . Let , , , and be such that
(i) , and for ;
(ii) for all and it holds that .
[TABLE]
where the implicit constant may depend on .
Without loss of generality we may assume that are Gaussian for all . Following [33], we will prove Lemma 7 using the Lindeberg swapping argument. The following basic estimate captures some ideas of this argument.
Lemma 8** (Basic Lindeberg swapping).**
Let . Assume that and are independent such that and .
Assume that and have matching moments up to second order for any . Here is a subset of .
Assume that , such that, as a function on , . Then for some finite positive depending on and we have:
[TABLE]
Here viewing as a function on we let
Proof.
Let , and let be obtained from by swapping with . We then estimate the left hand side by .
Let . We view as a function of and , denoted by . For convenience, let .
We consider approximation of using Taylor expansion around up to second order terms. By simple interpolation, the error term in this approximation is bounded above . Since and are independent from the others and have matching moments up to second order and since , , it follows from direct examination that
[TABLE]
Summing these estimates over and using Hölder’s inequality, we obtain
[TABLE]
Now, let . Again we view as a function of and and approximate it by Taylor expansion around up to first order terms. We similarly obtain . Using Kolmogorov’s inequality [18] and a simple application of Hölder’s inequality we obtain
[TABLE]
∎
We now prove Lemma 7. Let . Let be defined by . Then we also have for all partial derivatives of order .
Let for some large constant to be chosen later.
We perform a decomposition of where and , where is constructed below. Then is a smooth function supported on and equals on , such that for any multi-index .
We plan to apply Lemma 8 to
[TABLE]
Now, for . Via explicit computations,
[TABLE]
Now, on the support of we have . Thus, for we have
[TABLE]
Summing over and using Cauchy Schwartz, we obtain
[TABLE]
Similarly, we estimate the third partial derivatives for and use these estimates to bound . Here we will arrive at trilinear sums, so using the assumption we eventually obtain
[TABLE]
Now, we may assume . Via Lemma 8, we have the generous bound
[TABLE]
We now reset . The partial derivatives of are and are supported in . Consequently, via the same consideration as before, we obtain
[TABLE]
here we have used the fact that is Gaussian and . Collecting estimates, we obtain
[TABLE]
We choose where , and , then it is clear that the last right hand side is , as desired. This completes the proof of Lemma 7.
8. Proof of universality for complex correlation functions
In this section we prove Theorem 7. Following the framework developed by Tao-Vu [33], we will use the Monte Carlo sampling method (summarized in Lemma 9) and the Lindeberg swapping argument (implemented in Lemma 7). Below, we will only prove the desired estimates for the correlation functions of . The same argument could be applied to to get the desired estimates for .
We will actually show the desired estimates when has the tensor structure, namely , furthermore for such we will only need to assume that each , viewed as a function on , is continuously differentiable up to second order and furthermore for . The reduction from general (i.e. non tensor) to this special set up could be carried out as follows: First, let , and let be smooth and supported inside such that on , and as a function on it is and satisfies the derivative bound up to order . We may write
[TABLE]
using the multiple Fourier series expansion of on the polydisk . By standard stationary phase estimates, if is then , while , therefore if is large enough depending on , say , then we could write as a linear average of tensor-type functions with the properties mentioned earlier.
Thus, we may now assume that has the tensor structure. Let be fixed (no implicit constants will depend on ’s). Recall that denotes the multi-set of zeros of . By definition,
[TABLE]
where the sum is over non repeated tuples of elements of the zero sets of . An application of the inclusion-exclusion formula will allow us to rewrite the last right hand side as a linear combination of terms, and each term is a product of finitely many sum of the following type
[TABLE]
where is fixed and is a function supported in such that, as a function on , it is and its partial derivatives up to order are bounded accordingly.
Consequently, it suffices to show that, for a sequence of the above type,
[TABLE]
(uniform over all choices of and ), for some . Without loss of generality, we may assume that and ,… ,, and for brevity we will omit the dependence on in the notation and simply write below.
Let be a sufficiently small constant that may depend on the underlying implicit constants in Condition 1. By a standard construction, we could find such that supported on and on , furthermore for any , and (as a function on ) will be in with for any (partial) derivatives of order up to .
Let is sufficiently large and let . We first use Theorem 11 and Lemma 5 to conclude that for any there is an event with probability such that on the following holds for each :
[TABLE]
We now use Green’s formula, which says that the following holds for any compactly supported in
[TABLE]
where is the Lebesgue measure. It follows that, for each , we have
[TABLE]
Thus, using Hölder’s inequality and using the above properties of , we obtain on the event . By ensuring that for sufficiently large, it follows that on the event . Now, outside we still have , therefore
[TABLE]
We now use Monte Carlo sampling to approximate the integral form (8.1) of with a discrete sum.
Lemma 9** (Monte Carlo sampling).**
Let be a probability space and let . Assume that are drawn independently from using the distribution . Then for we have and
[TABLE]
Now, is supported inside and is bounded above by .
Let be uniformly chosen from (independent of each other and of the coefficients of ), here and . Using (8.1) and Lemma 9, it follows that
[TABLE]
where . Note that .
Now, on the event , the right hand side in the last display is . Using the above estimate, we now show that all ’s could be replaced by the corresponding averages at a total small cost:
Claim 5**.**
Let . Then
[TABLE]
where the expectation is taken over and .
To see this, let . Then on the product probability space generated by and it holds with probability that
[TABLE]
for all . Now, letting and choosing sufficiently small (so that in particular ), it follows that the following inequality holds with probability :
[TABLE]
(Here we’ve used the assumption that the first order partial derivatives of is bounded above by .) On the event that this estimate does not hold (which has probability ), we have the crude bound for the left hand side of the above display, here we have used the assumption that and is supported on . Collecting estimates, the desired estimate of Claim 5 follows immediately.
On the event , we note that and similarly . Consequently, using (8.2) and Claim 5 we obtain
[TABLE]
[TABLE]
[TABLE]
Using Theorem 12, the two terms involving and are bounded by , which in turn is bounded by .
Thus, it remains to bound the first term on the right hand side of (8.3). Here we use Lindeberg swapping, or more precisely Lemma 7. Below we only discuss swapping of with its Gaussian analogue ; the swapping of the other averages can be done similarly. Now, by conditioning on other variables and treating them as parameters, we may let
[TABLE]
It remains to show that
[TABLE]
[TABLE]
We can check that for any partial derivatives up to order . Note that by choice and could be chosen arbitrarily small. Therefore, in order to show the estimate in the last display via Lemma 7, it remains to show that for some uniform constant (independent of ) the following holds
[TABLE]
for any and any . To see this, note that and ’s satisfy Condition 1, therefore
[TABLE]
while . Via examination of the function over , we could show that , thus we could take any . (Recall the assumption that ).
9. Counting local non-real roots
In this section, we will prove several estimates for the local number of non-real roots of near the real line. These estimates play an essential role in the next section, where the proof of Theorem 6 will be presented. Recall that we write where is the deterministic component and is the random component. We divide the analysis into two scenarios.
Scenario 1: is “small” compared to . This scenario generalizes the special case considered in in [4], where it was shown that with high probability has no non-real local root. Here we will show that a similar conclusion holds even with the addition of a “small” deterministic component .
Lemma 10**.**
Let be sufficiently small and let . Then for sufficiently large the following holds for any and and any .
(i) Assume that on we have .
[TABLE]
(ii) Assume that on we have .
[TABLE]
Scenario 2: is “large” compared to . Here we will show that with high probability has no local roots in a neighborhood of the real line.
Lemma 11**.**
Let be sufficiently small and let . Let . Then for sufficiently large the following holds for any and and any .
(i) Assume that on we have .
[TABLE]
(ii) Assume that on we have .
[TABLE]
9.1. Proof of Lemma 10
9.1.1. Proof of Lemma 10, part (i)
Here we prove part (i) and we will discuss the modifications for part (ii) later. For convenience, let
[TABLE]
Step 1. Reduction to Gaussian: We’ll use Theorem 7 in this step. Let .
Let be an enumeration of the (complex) roots of and let be an enumeration of the (complex) roots of , both enumerated with multiplicity.
Let be small to be chosen later. Let be smooth supported on such that if . We have
[TABLE]
We now discuss the set up required to apply Theorem 7. Since , we may write where and . We then let
[TABLE]
which is defined on , and here is a sufficiently large absolute constant (in particular independent of ) so that all required derivative bounds (from Theorem 7) for are satisfied. Now, if we require with sufficiently large depending on ,, and . It then follows from Theorem 7 (and the definition of correlation functions) that for some (independent of ) the following holds:
[TABLE]
Unraveling the notation, we obtain
[TABLE]
Using Corollary 5 and observing that , we have
[TABLE]
for any , so by choosing large we have a bound of for any . Using Theorem 12, it follows that
[TABLE]
Collecting estimates, we obtain
[TABLE]
by choosing small. So it remains to show that . Since could be chosen very small, it suffices to show that for some , which is essentially the Gaussian analogue of the desired estimate.
Step 2. Proof for Gaussian. We will show that, with high probability is close to its linear approximation at , namely .
[TABLE]
Using Rouché’s theorem and linearity of , (9.1) implies the desired estimate. Now, to show (9.1), we will prove two estimates.
Claim 6**.**
The following holds uniformly over :
[TABLE]
Claim 7**.**
For some the following holds uniformly over :
[TABLE]
The desired estimate (9.1) then follows from choosing in Claim 6 and choosing (with large) in Claim 7. Here we need .
9.1.2. Proof of Claim 6
Since is linear with real coefficients and since , is achieved at or . Consequently, for any we have
[TABLE]
here we have used the fact that and are Gaussian. Using Lemma 1 and Condition 1, we have . Therefore it remains to show that for any we have
[TABLE]
Now, since are independent, we have .
If then by definition we have . Therefore, using the triangle inequality and Lemma 1 and Condition 1 we obtain
[TABLE]
Using Lemma 1 and Condition 1, it follows that . Since , the desired estimate (9.2) follows immediately.
Now, if we have . Therefore, uniformly over we have , which implies the desired estimate (9.2).
9.1.3. Proof of Claim 7.
To estimate , we first estimate the mean and the variance of . We will show that
[TABLE]
uniformly over and .
For (9.4), let . By the mean value theorem, we have
[TABLE]
By ensuring is large, for any we have for . Using Lemma 1, it follows that
[TABLE]
For (9.3), again by the mean value theorem we have
[TABLE]
Now, we combine (9.3) and (9.4) to prove Claim 7. For convenience of notation, let . Without loss of generality, we may assume that is much larger than the implicit constants in the last estimate for and in (9.3). It follows from (9.3) and (9.4) that
[TABLE]
Using Cauchy’s theorem, for we have
[TABLE]
where is the arclength measure along the integration contour . Note that . It follows that, for some , we have
[TABLE]
9.1.4. Proof of Lemma 10, part (ii)
Our proof of part (ii) of Lemma 10 is entirely similar to that of the proof of part (i), where the key ingredients is the fact that uniformly over we have for any , which in turn is a consequence of Condition 1 and Lemma 1.
9.1.5. Proof of Lemma 11, part (i)
We will proceed in a similar fashion as in the proof of Lemma 10. The reduction to the Gaussian setting can be done similarly by using universality estimates for the 1-point correlation function of the complex zeros of from Theorem 7 and estimates proved in Theorem 12 and Corollary 5.
We now discuss the proof for the Gaussian setting. The given assumption clearly implies that has no zero in . Thus, using Rouché’s theorem it suffices to show that
[TABLE]
Using Cauchy’s theorem and arguing as in the proof of Claim 7, we obtain
[TABLE]
for some and any . Using Lemma 1 and Condition 1, we also have
[TABLE]
Thus, using the given hypothesis we obtain, for some ,
[TABLE]
Let in the last estimate. Then for any we could choose but large such that this estimate is bounded above by , as desired.
9.1.6. Proof of Lemma 11, part (ii)
The proof is entirely similar to part (i).
10. Proof of universality for real correlation functions
Below we prove part (i) of Theorem 6, and the same argument may be used to prove part (ii) of this theorem (details will be omitted).
Let and . For convenience of notation write for all . Then for we have and , while for we have . Note that and may not be inside for .
Arguing as in the proof of Theorem 7, it suffices to show that
[TABLE]
( are Gaussian analogues), and and satisfy the following conditions:
(i) for each , is in , supported in such that for .
(ii) for each , is supported on and is also with for any .
Let be sufficiently small, as required by Lemma 10 and let .
Let be small such that .
Let be a bump function supported on with .
Let be a smooth function supported on such that if .
Let be sufficiently large. Let be defined by
[TABLE]
One could check that are supported on and are with derivatives bounded by for any multi-index .
Applying Theorem 7 for test functions of tensor-product type, it follows that for some (which does not depend on ) we have
[TABLE]
Letting and making sure , it remains to show
[TABLE]
for some . Since , using Corollary 5 we have . Via Holder’s inequality, it therefore suffices to show that for some we have
[TABLE]
Now, for each let
[TABLE]
We first show that if then and
[TABLE]
Indeed, we first consider . Then . Therefore,
[TABLE]
Since both and are bounded, it suffices to show that any that contributes to the sum must be in . Indeed, for such we have and , which implies the desired claim.
We now consider . We have
[TABLE]
Since is supported on in the second summation we could further assume that . We obtain
[TABLE]
For any contributing , it holds that , therefore
[TABLE]
In particular, and this can be made very large compared to . Now,
[TABLE]
therefore has the same sign as . Thus it remains to show that . Now, using the triangle inequality this follows from
[TABLE]
This completes the proof of (10.1).
Now, the strip could be covered by sets of the form with center inside . Since and since , it follows that for such the ball would be inside the interval where the given hypothesis on the relationship between and holds. Now, since is a real polynomials its complex roots are symmetric about the real axis. Thus, using the small ball estimates proved in Lemma 10 (if is small compared to ) or the small ball estimates proved in Lemma 11 (if is large compared to ) with , together with an union bound, we obtain
[TABLE]
Now, since is a nonnegative integer, by Theorem 12 we have
[TABLE]
This completes the proof of Theorem 6.
11. Reduction of Theorem 1 to Gaussian polynomials
In this section, using Theorem 6 we will reduce Theorem 1 to Gaussian random polynomials. The proof of Theorem 1 for Gaussian polynomials will be discussed in the next section.
Let . Using Lemma 2, to reduce Theorem 1 to the Gaussian setting, it suffices to show that
[TABLE]
Thus without loss of generality we may assume that or . Below, we will only consider the first case, and we may use the same argument for the other case.
Let be a very small absolute constant. Recall the definition of from (3.1) and the paragraph after (3.1). Let .
Note that we may cover using intervals and where and . Let and be respectively the sets of and such that and intersect . Clearly, nearby covering intervals have comparable lengths. Thus, we may construct a sequence of functions (similar to a partition of unity) such that is supported on and is supported on , and furthermore
(i) and for any partial derivatives, and
(ii) is equal to for all and is supported inside where are two intervals from the covering that contain endpoints of .
Now, we could shrink the endpoint intervals and by factors comparable to (if necessary) so that remains covered by the new collection of intervals, and at the same time are subsets of the assumed enlargement of . The given definition of enlargement ensures that the shrinking of these intervals could be done. We may redesign the bump functions and associated with and such that they will still be supported inside and , respectively.
It follows from Theorem 6 that, for some ,
[TABLE]
[TABLE]
Summing the last two estimates over and , we obtain
[TABLE]
Now, . For the local intervals and , we will show that and . Since the details are entirely similar we will only discuss the estimate for . Since the enlargement of , we may construct a bump function adapted to that equals on but vanishes outside , in particular its support is strictly contained inside . Let be the -point correlation measure for the real root of and be its Gaussian analogue. By Theorem 6, we obtain
[TABLE]
Then assuming that the Gaussian case of Theorem 1 is known and using the fact that remains an enlargement of , we obtain
[TABLE]
here in the last estimate we may use Proposition 3 in the next section (which is a consequence of explicit Gaussian computations in [4]).
This completes the proof of the reduction of Theorem 1 to Gaussian polynomials.
12. Proof of Theorem 1 for Gaussian polynomials
In this section we prove Theorem 1 for the Gaussian polynomial where are iid normalized Gaussian, and throughout the section we will assume that and satisfy Condition 1.
Let and and let , , and , and .
We recall the following Kac-Rice formula [7, Corollary 2.1]. Let . Then where
[TABLE]
We will also work with the normalized reciprocal polynomial , and we will denote by , , the analogous quantities.
Using Lemma 2, we may assume without loss of generality that for a (small) absolute constant . By breaking up into and and notice that where we may reduce the consideration to .
Now, using Lemma 1, we have
Corollary 6**.**
Assume that and satisfy Condition 1. Then for any it holds uniformly over that
[TABLE]
On the other hand, by the classical Kac formula, is the density for the real root distribution of , and similarly is the density for the real root distribution of , and both of them can be easily bounded by by elementary inspection. Note that the Gaussian density for (and for its reciprocal polynomial) was studied555In fact, in [4] it was required that for , however the Gaussian computations in [4] can be easily modified to work with the weaker assumption in the current paper. in [4], and we summarize the known estimates for them from [4, Lemma 10.3, Lemma 10.6] in the following proposition.
Proposition 3**.**
Assume that satisfy Condition 1. Let be small. Then uniformly over we have
[TABLE]
and uniformly over we have
[TABLE]
In fact, in the original setting considered in [4] it was required that for all , so it is a little stricter than our setting , however the computation in the Gaussian setting in [4] is not affected much with our slightly more relaxed assumption. We omit the details.
12.1. Estimates for
We will show that, under the hypothesis of Theorem 1 about the relative relation between and on , we will always have . We separate the proof into two cases, depending on whether dominates or is dominated by .
First, we consider the situation when the deterministic component dominates the random component on .
Lemma 12**.**
Let . There is a constant such that the following holds. Let be an interval whose endpoints may depend on .
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Proof.
Using Lemma 1 and Corollary 6 we have
[TABLE]
and by the given hypothesis where is comparable to . Therefore
[TABLE]
so if is big enough then and the last integral is , as desired.
The consideration for is entirely similar. ∎
We now consider the situation when is dominated by .
Recall that is such that the following holds for some :
[TABLE]
Lemma 13**.**
Let and let satisfy (12.3). Let be an interval whose endpoints may depend on .
(i) Assume that the following holds uniformly over .
[TABLE]
[TABLE]
[TABLE]
(ii) Under the analogous assumptions, we also have .
Proof.
Using the given hypothesis and using Corollary 6, we have
[TABLE]
Since , we obtain
[TABLE]
This completes the proof of part (i). The second part (ii) can be proved similarly. ∎
12.2. Estimates for
Here we will also divide the consideration into two cases, depending on whether is dominant or is dominant.
The following result addresses the situation when is dominated by .
Lemma 14**.**
Assume that satisfies (12.3). Let and let be an interval whose endpoints may depend on .
(i) Assume that uniformly over we have
[TABLE]
[TABLE]
[TABLE]
(ii) Under analogous assumptions, a similar estimate holds for .
The following result deals with the situation when dominates .
Lemma 15**.**
Let and let be an interval whose endpoints may depend on .
(i) Assume that uniformly over we have
[TABLE]
[TABLE]
(ii) Under analogous assumptions, a similar estimate holds for .
The proof of these results are based on the following technical estimate. For convenience, let , and define analogously. Recall that is the density for the real root distribution of , and is the density for the real root distribution for .
Lemma 16**.**
Let be sufficiently small and let be sufficiently large. Then there are finite absolute constants that may depend on such that the following holds for any interval whose endpoints may depend on .
(i) If then and .
(ii) If then
[TABLE]
and the analogous estimate holds for .
Proof.
(i) Since , it follows that , so
[TABLE]
The estimate for is proved similarly.
(ii) Let . From Corollary 6 and Proposition 3, we obtain
[TABLE]
In other words for some we have . Consequently, by the geometric mean inequality we have
[TABLE]
Now, by Corollary 6 and Proposition 3 we have . It follows that
[TABLE]
The desired estimate then follows from the definition (12.1) of .
The proof for is completely analogous. ∎
We now use Lemma 16 to prove Lemma 14 and Lemma 15. Below we will show only the proof for the desired estimates for , the same argument works for . We start with the case when is dominated by : under the assumptions of Lemma 14 we have . Using for and using Proposition 3, it follows that
[TABLE]
Now in the case when dominates : under the assumptions of Lemma 15 we have , while thanks to Proposition 3. Therefore, for some we have
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A Bloch and György Pólya. On the roots of certain algebraic equations. Proceedings of the London Mathematical Society , 2(1):102–114, 1932.
- 2[2] Federico Dalmao. Asymptotic variance and CLT for the number of zeros of Kostlan Shub Smale random polynomials. Comptes Rendus Mathematique , 353(12):1141–1145, 2015.
- 3[3] Yen Do, Hoi Nguyen, and Van Vu. Real roots of random polynomials: expectation and repulsion. Proceedings of the London Mathematical Society (3) , 111(6):1231–1260, 2015.
- 4[4] Yen Do, Oanh Nguyen, and Van Vu. Roots of random polynomials with coefficients of polynomial growth. Annals of Probability (2018), Vol. 46, no. 5, 2407–2494 .
- 5[5] Yen Do and Van Vu. Central limit theorems for the real zeros of Weyl polynomials. American Journal of Mathematics (2020), vol. 142, issue 2, pp. 1327–1369. , 2017.
- 6[6] Alan Edelman and Eric Kostlan. How many zeros of a random polynomial are real? Bulletin of the American Mathematical Society , 32(1):1–37, 1995.
- 7[7] Kambiz Farahmand. Topics in random polynomials , volume 393. CRC Press, 1998.
- 8[8] Hendrik Flasche and Zakhar Kabluchko. Real zeros of random analytic functions associated with geometries of constant curvature. ar Xiv preprint ar Xiv:1802.02390 , 2018.
