Extreme value statistics for the roots of a complex Kac polynomial
Yacine Barhoumi-Andr\'eani

TL;DR
This paper studies the statistical behavior of the largest root in complex Gaussian Kac polynomials, focusing on fluctuations and large deviations, with results involving Fredholm determinants and integral series.
Contribution
It provides a detailed analysis of the fluctuations and large deviations of the largest root, extending previous work with explicit formulas involving Fredholm determinants.
Findings
Fluctuations characterized by Fredholm determinants.
Large deviations described by a series of multiple integrals.
Results deepen understanding of root distribution in random polynomials.
Abstract
We investigate the fluctuations and large deviations of the root of largest modulus in a model of random polynomial with independent complex Gaussian coefficients (Kac polynomials). The fluctuations were recently computed by R. Butez (arxiv 1704.02761) and involve a Fredholm determinant. The precise large deviations show a particular function defined by a series of mutiple integrals in the same vein.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometry and complex manifolds · Stochastic processes and statistical mechanics · Financial Risk and Volatility Modeling
Extreme value statistics for the roots of a complex Kac polynomial
Yacine Barhoumi-Andréani
Department of Statistics, University of Warwick, Coventry CV4 7AL, U.K.
Abstract.
We investigate the fluctuations and large deviations of the root of largest modulus in a model of random polynomial with independent complex Gaussian coefficients (Kac polynomials). The fluctuations were recently computed by R. Butez (arxiv 1704.02761) and involve a Fredholm determinant. The precise large deviations involve a function defined by a series of mutiple integrals similar to such an expansion and exhibit a link with moments of characteristic polynomials of truncated Haar-distributed random unitary matrices.
2010 Mathematics Subject Classification:
60B20, 60B99, 15B52
1. Introduction
Consider the following random polynomial
[TABLE]
where is a sequence of i.i.d. Gaussian random variables. These polynomials are commonly called Kac polynomials since their seminal study by Kac ([18] ; see also [9, ch. 1.1] for historical remarks). The study of their roots has given rise to numerous interesting questions, concerning for instance the number of roots in a given set (in the case where the ’s are real, a classical question concerns the number of real roots of such a polynomial, see [10]) or more generally the linear statistics of the roots, i.e. random variables of the form for a given function (see the introduction of [7] and cited references for a review of recent results).
In the case of complex Kac polynomials, a lot is known about the repartition of the roots, in particular, one knows that the roots cluster uniformly around the unit circle (see e.g. [16, 17]). Nevertheless, this does not preclude some particular roots to escape arbitrarily far away from this area and a natural question is thus to investigate if the maximum modulus of the roots is concentrated around . Such a behaviour was recently investigated by R. Butez ([7]) who addresses the question of fluctuations of the extreme value statistics of the roots and answers it in the following theorem :
Theorem 1.1** (Butez).**
Order the sequence of roots according to their modulus, so that a. s. Then, converges in distribution to a random variable a. s. inside the unit disk and converges in law to . Moroever, the point process converges in distribution in the space of Radon measures111We associate a point process to a point measure , see [7]. to the roots of the Gaussian analytic function whose law is given by a determinantal point process with Bergman kernel (the zeroes of such an analytic function are countable with no accumulation points a. s.).
The determinantal character of the roots of such a random analytic function is a direct consequence of a theorem of Peres and Viràg (see [23]), a corollary of which is the explicit computation of the law of when :
Theorem 1.2** (Peres-Viràg).**
Let be a sequence of i.i.d. random variables uniformly distributed on the interval . Then, one has the following equality in distribution
[TABLE]
As one can see, the root of maximum modulus does not converge to . One can check moreover that
[TABLE]
namely, its law is supported on . This fact raises the natural question of the large deviations of inside the unit circle. This is the main theorem of the paper.
Theorem 1.3** (Precise left large deviations).**
Let . Then
[TABLE]
with
[TABLE]
The plan of this article is the following : we start by recalling some classical facts about Kac polynomials, and we give an alternative proof to Butez’ theorem 1.1, we then treat in an elementary way the left large deviations (in the logarithmic setting, thus) and we finally prove theorem 1.3. The proof involves the characteristic polynomial of a model of truncated random unitary matrices distributed according to the Haar measure (the so-called Circular Unitary Ensemble). We conclude with questions of interest, perspectives and future work on the topic.
Notations and properties
We gather here some notations used throughout the paper. The unit circle will be denoted by , the unit disk by and the disk of radius by , i.e.
[TABLE]
When integrating on or any curve in the complex plane, we set
[TABLE]
The Stirling formula is
[TABLE]
We define the Vandermonde determinant, for , by
[TABLE]
In particular, for , one has
[TABLE]
For , we define the -th elementary symmetric polynomial by and
[TABLE]
We recall that for all , we have the following equality (see e.g. [21])
[TABLE]
The Cauchy determinant is given by
[TABLE]
Last, for a matrix , we will write for its -coefficient.
2. Fluctuations
2.1. Reminders on random Gaussian polynomials
From now on, we denote by in place of the roots of the complex Kac polynomial (1).
The correlation functions or joint intensities of the roots in (1) are defined by
[TABLE]
Let denote the Lebesgue-density of the random variable . Using the Hammersley formula (see e.g. [3, (3.3.1) p. 39]) that writes, for any random polynomial
[TABLE]
one gets the formula (see [3, (3.4.2) p. 40] and references cited)
[TABLE]
where and are the matrices of general term given by
[TABLE]
Let be a sequence of random variables with values in and with correlation functions . Then, if for a set one has
[TABLE]
then, using an inclusion-exclusion formula (see e.g. [4]), one gets
[TABLE]
We will use this formula to compute the limiting distribution of the maximum modulus of (1).
2.2. The limiting distribution of the maximum modulus
We define the maximum modulus of the roots by
[TABLE]
The following gap probability for
[TABLE]
is known to converge to (see [7])
[TABLE]
where is the operator of kernel (Bergman kernel) acting on and where the process is determinantal of kernel .
This is a consequence of the convergence of the polynomial to the Gaussian Analytic Function for all and the measurability of the map (see [7]). We moreover have for all
[TABLE]
where is a sequence of i.i.d. uniform random variables on .
We now prove this result in a more analytic way.
Lemma 2.1**.**
The correlation functions of the roots of the Gaussian polynomial (1) converge for all , when , uniformly on for all .
Proof.
Using the formula (6), one sees that it is enough to prove the convergence of the covariances given in (7) in the underlying domain, namely, to prove that , and defined in (7) converge inside the unit disc. One has, uniformly in with
[TABLE]
Moreover,
[TABLE]
and
[TABLE]
As a result, one has the convergence, uniformly in
[TABLE]
with
[TABLE]
∎
Corollary 2.2**.**
The following convergence is satisfied for all
[TABLE]
Proof.
The convergence of the extreme values of the roots of (1) is then a consequence of the following result from [23] : the limiting correlation functions (11) take the form
[TABLE]
This theorem uses the invariance of the limiting analytic function by the group of homographies of and the Borchardt’s identity (see [3, 5.1.12]).
Writing and due to the self-reciprocity of the polynomial, i.e. , one has, for all
[TABLE]
As a corollary, the set of modulus of the roots converge in distribution of an i.i.d. sequence where is a sequence of i.i.d. random variables uniformly ditributed on (see [3, th. 4.7.1 & cor. 5.1.7]).
In the case of the maximum modulus of the roots, one can give a more analytical proof of this last result by finding the eigenvectors of the Bergman kernel acting on with . Indeed, setting for the set of eigenvalues of the compact operator acting on . The Bergman kernel is a trace-class operator on since . Thus, one has (see e.g. [12])
[TABLE]
Consider the functions for . Then,
[TABLE]
hence the result. ∎
Remark 2.3*.*
As the correlation functions determine the process, this proof applies to other models of random Gaussian polynomials if one can prove the convergence of the covariances. In general, using the inclusion-exclusion (8) for , one gets
[TABLE]
2.3. A remark on the Tracy-Widom distribution
The Tracy-Widom distribution (see [26]) writes as a Fredholm determinant in the same vein as the previous probability and satisfies moreover
[TABLE]
where writes as
[TABLE]
with the Hastings-McLeod solution of the Painlevé II equation satisfying when , being the Airy function (see [15]) and designates the additive convolution .
We are interested in the equivalent form for the probability density (12). For this, we write for
[TABLE]
Define the sum of divisors of by
[TABLE]
Then, setting , one gets
[TABLE]
Note that one can write
[TABLE]
where designates the multiplicative convolution defined by .
We see that is the analogue of . This motivates the following question :
Question 2.4**.**
Can one find the same decomposition with the Tracy-Widom distribution, namely, can one find the eigenvalues of the Airy operator which is the analogue of the Bergman operator in the previous setting ?
3. Large deviations and precise large deviations
3.1. The density of the zeroes
The distribution of the roots vector of the polynomial (1) is given by (see e.g. [3, 5, 11, 13, 20])
[TABLE]
where we have set and .
The formula (13) is obtained using the transformation given by the well-known formula valid for all
[TABLE]
The Jacobian of this transformation is given by the Vandermonde determinant (see e.g. [3]). One then integrates on to get (13), namely
[TABLE]
Denote by the Lebesgue-density of :
[TABLE]
Using the Plancherel-Parseval formula for a polynomial, i.e. , and (4) we get
[TABLE]
This transforms (14) into
[TABLE]
Remark 3.1*.*
This form was used in [27] to prove a large deviations result for the empirical measure of the roots.
We are now interested in the tail of this last probability. Using (13) and (15), we have
[TABLE]
Using the scaling property of the Vandermonde determinant
[TABLE]
one gets
[TABLE]
A classical computation shows that
[TABLE]
One can thus define the probability measure
[TABLE]
There is a matrix model underlying such a probability measure given by a -block truncation of a random matrix, i.e. a random Haar-distributed unitary matrix of size (see [24, eq. (16)]).
Using the formula valid for all and
[TABLE]
we can write this last probability as
[TABLE]
where is the characteristic polynomial of the random matrix from the truncated given by
[TABLE]
3.2. Large deviations
In the limiting fluctuations of the largest modulus, the support of the limiting law given in (12) is defined on . We are interested in the the large deviations of inside the disk.
Theorem 3.2** (Left large deviations).**
Let . Then, satisfies a (left) large deviation principle at speed with rate function , namely, for all
[TABLE]
More precisely, one has, with a independent of
[TABLE]
Proof.
We have
[TABLE]
Define
[TABLE]
We have
[TABLE]
where are the eigenvalues of . This last inequality comes from the fact that . In particular, we have
[TABLE]
namely
[TABLE]
Taking the logarithm and dividing by , we get
[TABLE]
where we have used the Stirling formula (2). ∎
3.3. Precise large deviations
We now prove theorem 1.3.
Proof.
Let be a constant to be choosen later. Write
[TABLE]
Define
[TABLE]
Expanding the Laplace transform of the random variable
[TABLE]
and applying the Fubini theorem, one gets
[TABLE]
This last moment expansion is valid if the law of is defined by its moments. We now prove this fact. Using the Fubini theorem, we have
[TABLE]
An explicit formula is available to compute the integer moments of the characteristic polynomial (see e.g. [1, eq. (2.11)] ; for the reader’s convenience, this result is reminded in section 5), namely
[TABLE]
where is the function defined by
[TABLE]
We thus have
[TABLE]
hence
[TABLE]
Main contribution : Using (18), i.e. , we have, when , and
[TABLE]
Note that the implied constant in the is independent of and . We then get
[TABLE]
Using the Cauchy determinant (5) and the relation (Notations and properties), we get for
[TABLE]
It is moreover clear that the function is integrable on as . We thus get
[TABLE]
One can check that a possible choice for is given by . For the choice
[TABLE]
and using , we obtain
[TABLE]
where is a function that can be made explicit.
Remainder : Define
[TABLE]
so that
[TABLE]
Let be a random variable defined by (i.e. Gamma-distributed). Let us suppose that is independent of , the determinantal point process of kernel on the unit disk (namely the eigenvalues of the truncated random matrix whose (16) is the law). Then, we have
[TABLE]
We have moreover
[TABLE]
and, using (12) for , with , one gets for all
[TABLE]
In particular, using the fact that for all and for all , , one gets
[TABLE]
hence the result. ∎
Remark 3.3*.*
It is clear that since it writes as
[TABLE]
Remark 3.4*.*
The function is similar to a Fredholm expansion since it involves a series with integrals, but one can ask about a more “classical” expression, for instance, a series expansion of the form . Such an expression is possible if one writes the expansion of the Cauchy product in terms of Schur functions or power functions (see [21, ch. 1.3]), i.e.
[TABLE]
with (the notations are the ones in [14, ch. 2])
[TABLE]
Integrating these expressions and applying the Fubini theorem for double sums (wich is convergent due to the presence of the term and the fact that the number of terms in is equal to , the number of partitions of an integer , and the fact that for in virtue of a celebrated result of Hardy and Ramanujan), one gets the result. Nevertheless, the integrals or do not have a simple expression other than a combinatorial sum involving semi-standard tableaux or equivalent quantities. We leave to the interested reader the exercise to express such quantities.
4. Conclusion and perspectives
In view of the previous results, a natural continuation of the problem is the following one : compute the (precise) transition deviations around , in particular, one can conjecture that there exists a function such that for all
[TABLE]
and more generally that there exists a function such that
[TABLE]
Note that one can replace the term by . The problem of the precise transition deviations is then reminiscent of another problem of random polynomials, the computation of the persistence exponent ; in the case of the real Kac polynomial, when is replaced by the maximum absolute value of the real roots, see e.g. [2, 6, 9, 22] and references cited. The proof of theorem 1.3 can be adapted to this setting, but the rescaling of the integrals involve the moments of the characteristic polynomial of a random truncated matrix in the microscopic scaling (see [8, 19] for the case of the ) and requires additional care to extract its asymptotic behaviour. We plan to address this question in a subsequent publication.
5. Annex : Expectation of products of characteristic polynomials
We prove here (17). We want to compute
[TABLE]
One has
[TABLE]
hence
[TABLE]
We now compute
[TABLE]
This is, up to a multiplicative constant, the -joint intensity of a random variable of size with a joint law given by defined in (16) (see e.g. [25] or [3, ch. 3]).
A classical method expresses as for a certain “kernel” autoreproduced for the underlying scalar product, here , where
[TABLE]
Start by writing the square modulus of the Vandermonde determinant in the variable as
[TABLE]
where the are monic (i.e. with the higher degree coefficient equal to 1) and satisfy . The fact that comes from the fact that one can perform linear combinations of lines or columns without changing its value.
The function is said to be a kernel. As such a kernel satisfies the following properties
- (1)
Autoreproduction : , 2. (2)
Trace :
the following formula holds for all (see [25, Lemma 1] or [3, ex. 4.1.1])
[TABLE]
Iterating, and defining , we get for all
[TABLE]
In order to compute (19) with this last formula, we need to find the kernel associated to . One thus needs to norm the columns of the Vandermonde matrix since these polynomials are monic. For this, we compute
[TABLE]
In particular, for ,
[TABLE]
The normed polynomials are thus given for by
[TABLE]
and the kernel is
[TABLE]
where is defined in (18).
Finally, one can write
[TABLE]
which implies that
[TABLE]
Setting , we can write (19) as
[TABLE]
which is (17).
Acknowledgements
The author thanks Gernot Akemann, Olivier Hénard, Mario Kieburg, Igor Krasovski, Ashkan Nikeghbali, Dan Romik, Nick Simm, Oleg Zaboronski and Ofer Zeitouni for interesting discussions, questions and remarks concerning previous versions of this work.
A particular thanks is given to Michael Cranston and Elliot Paquette for technical discussions and computational help, and to Raphaël Butez for providing the author with an earlier version of [7] and interesting discussions that followed. The author acknowledges moreover the organisers of the conference “Random matrix theory and strongly correlated systems” held at the university of Warwick the 21-24 March 2016 that allowed him to meet with several of the aforementionned persons.
During the redaction of this work, the author was supported by the Schweizerischer Nationalfonds PDFMP2 134897/1 and by the EPSRC grant EP/L012154/1.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] G. Akemann, G Vernizzi, Characteristic polynomials of complex random matrix models, Nuclear Physics B, 660(3):532-556 ( 2003 ).
- 2[2] A. P. Aldous, Y. V. Fyodorov, Real roots of random polynomials: universality close to accumulation points, J. Phys. A: Mathematical and General 37(4):123 ( 2004 ).
- 3[3] J. Ben Hough, M. Krishnapur, Y. Peres, B. Viràg, Zeros of Gaussian Analytic Functions and Determinantal Point Processes, American Mathematical Society, Providence, available at http://research.microsoft.com/en-us/um/people/peres/gaf_book.pdf ( 2009 ).
- 4[4] A. Borodin, Determinantal point processes, Oxford Handbook of Random Matrix Theory, G. Akemann, J. Baik and Ph. Di Francesco eds. http://arxiv.org/abs/0911.1153 v 1 ( 2009 ).
- 5[5] E. Bogomolny, O. Bohigas, P. Leboeuf, Distribution of roots of random polynomials, Phys. Rev. Lett. 68:2726-2729 ( 1992 ).
- 6[6] A. J. Bray, S. N. Majumdar, G. Schehr, Persistence and first-passage properties in nonequilibrium systems, Advances in Physics, 62(3):225-361 ( 2013 ).
- 7[7] R. Butez, The largest root of random Kac polynomials is heavy tailed , https://arxiv.org/abs/1704.02761 ( 2017 ).
- 8[8] R. Chhaibi, J. Najnudel, A. Nikeghbali, The circular unitary ensemble and the Riemann Zeta function: the microscopic landscape and a new approach to ratios, Inventiones mathematicae, 207(1):23-113 http://arxiv.org/abs/1410.1440 ( 2017 ).
