Extremes of Chi triangular array from the Gaussian $\beta$-Ensemble at high temperature
Cambyse Pakzad

TL;DR
This paper investigates the extreme eigenvalues of the Gaussian beta-ensemble at high temperatures, showing convergence to a Poisson process and providing asymptotic estimates for the largest eigenvalue.
Contribution
It establishes the convergence of the extreme point process to a Poisson process in high temperature regimes and explicitly determines normalizing sequences.
Findings
Convergence to Poisson point process in high temperature regimes
Explicit normalizing sequences for different regimes
Asymptotic estimates for the largest eigenvalue
Abstract
We study the extreme point process associated to the off-diagonal components in the matrix representation of the Gaussian -Ensemble and prove its convergence to Poisson point process as when the inverse temperature scales with and tends to . We consider two main high temperature regimes: and . The normalizing sequences are explicitly given in each cases. As a consequence, we estimate the first order asymptotic of the largest eigenvalue of the Gaussian -Ensemble.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWireless Communication Networks Research
Extremes of Chi triangular array from the Gaussian -Ensemble at high temperature
Cambyse Pakzad
Abstract
We study the extreme point process associated to the off-diagonal components in the matrix representation [2] of the Gaussian -Ensemble and prove its convergence to Poisson point process as when the inverse temperature scales with and tends to [math]. We consider two main high temperature regimes: and . The normalizing sequences are explicitly given in each cases. As a consequence, we estimate the first order asymptotic of the largest eigenvalue of the Gaussian -Ensemble.
00footnotetext: 2010 Mathematics Subject Classification. 60B20 - 60F05 - 60G70
1 Introduction
Gaussian ensembles, namely Hermitian matrices with independent Gaussian entries whose joint distribution invariant is under conjugation by appropriate unitary matrices, play a central role in the realm of Random Matrix Theory. Regarding their spectrum, the joint distribution of the eigenvalues is available through the formula:
[TABLE]
According to the Dyson index , the density (1) describes the eigenvalues of the Gaussian Orthogonal Ensemble , the Gaussian Unitary Ensemble , or the Gaussian Symplectic Ensemble . Although it seemed that no natural matrix model could be associated to the Gaussian -Ensemble, a collection of points with distribution (1) and general , the gap is bridged many years later by Edelman and Dumitriu in [2]. Through the Househölder method, they constructed the following tridiagonal symmetric random matrix:
[TABLE]
with independence between all variables, such that for any and , the eigenvalues of have exactly joint distribution from (1).
Let us consider the off-diagonal sequence of , namely are independent random variables with distribution for .
The scope of this paper is to study the asymptotic behavior of the largest , namely of the extreme point process where , are suitably chosen normalizing sequences and when the inverse temperature scales with in such a way that . We consider two regimes in this context. In first place, we study the case which itself divides into two subregimes around . Then, the intermediate high temperature regime is analyzed for . Both cases lead to an inhomogeneous limiting Poisson point process on the half line with intensity measure of exponential kind. The point process approach is borrowed from the Extreme Value Theory (see [4, 6]) as it shows to be a convenient way to encompass many informations. In this vein, our results are stated in terms of Poisson convergence. One consequence of this fact is that one can derive the limiting distribution of the rescaled largest component to be Gumbel. We exploit here the quantitative growth rate of the largest entries in (2) to provide an estimate on the first order asymptotic of the largest eigenvalue in the high temperature regime.
The question of the normalizing (or scaling) sequences is fundamental. They indicate the precise growth rate of the maximum of the set of random variables considered, which, roughly speaking, provides the right way to look at the extreme values. Although they are not always explicit or with closed form, we furnish here exact formulae. The case has a nice interpretation: the scaling sequences for thoroughly match with those of the maximum of i.i.d. copies of .
We now state our main result in the case :
Theorem 1.1**.**
Let such that and , with , a triangular array of independent random variables.
Assume . Let the scaling sequences:
[TABLE]
Then the point process converges weakly to an inhomogeneous Poisson point process on with intensity measure .
Assume . Let the scaling sequences:
[TABLE]
Then the point process converges weakly to an inhomogeneous Poisson point process on with intensity measure .
In the intermediate high regime , we have the following result:
Theorem 1.2**.**
Let and . Let , with , a triangular array of independent random variables. Let the scaling sequences:
[TABLE]
Then the point process converges weakly to an inhomogeneous Poisson point process on with intensity measure .
Remark 1.3**.**
On one hand, we consider a size sample of growing in and on the other hand, the greatest chi parameter of is . Assuming that , one could expect that the maximum of the should converge to [math]. Our result shows the other way. In the regime , the centering is still diverging to infinity. Though, one can retrain our work to show it is converging to [math] when . We did not consider this special case here since the competition against the maxium of Gaussian variables becomes irrelevant.
Concerning the edge statistics of , we derive the following corollary:
Corollary 1.4**.**
Let the largest eigenvalue of defined in (2). Assume the hypothesis of Theorem 1.1 or 1.2. There exists constants such that with probability tending to , as ,
[TABLE]
Proof.
Consider the standard basis of . The characterization of the largest eigenvalue with the Rayleigh quotient (see Theorem 4.2.2 in [3]) allows us to write:
[TABLE]
Thus we get the lower bound .
On the other hand, we apply the Gerschgorin circles (see Theorem 6.1.1 in [3]) to get an upperbound:
[TABLE]
The claim yields by combining both bounds (6), (7) with the conclusions of Theorem 1.1 or 1.2, and the counterpart result for maximum of i.i.d. Gaussian variables, that is the point process converges weakly in probability to a Poisson point process, where the normalizing sequences are:
[TABLE]
∎
In Section 2, we establish a general tool for asymptotic Poissonian statistics of extreme point processes in the independent but non identically distributed context. Theorem 2.3 will be the framework of our method and is the analog of the classic Poisson limit theorem for i.i.d. extremes (see for instance Proposition 3.21 in [6]). Sections 3 and 4 are devoted to the proof of each case following the steps accordingly to Theorem 2.3, namely we prove appropriate convergence and uniform negligibility.
2 Multivariate Poisson limit
For completness, we recall the definition of a Poisson point process.
Definition 2.1**.**
Let a topological space endowed with a -finite measure . A Poisson point process on with intensity is a random Radon measure on such that for any and pairwise disjoint Borel sets of with finite measure, the random vector has law .
In order to reach the point process level, we need the following finite-dimensional Poisson approximation:
Proposition 2.2**.**
Let a triangular array of independent random variables with same interval support . Let an interval and an increasing homeomorphism. Let and and the random vector defined for by:
[TABLE]
For , we suppose that:
[TABLE]
[TABLE]
Then we have the convergence in distribution:
[TABLE]
Proof.
For convenience, we set .
Let , . We compute the Laplace transform of :
[TABLE]
Taking the logarithm and Taylor-expanding,
[TABLE]
From hypothesis (8) and (9), we deduce:
[TABLE]
[TABLE]
It follows that:
[TABLE]
∎
We are now ready to show the goal of this section:
Theorem 2.3**.**
Let a triangular array of independent random variables with same interval support . Assume there exists interval, càdlàg with finite limit at the right boundary of and increasing homeomorphism such that:
[TABLE]
and for such that ,
[TABLE]
Then the point process
[TABLE]
converges to a Poisson point process on the interval with intensity defined by
[TABLE]
Proof.
Let . By Definition 2.1, we need to show the following convergence in distribution, for any and ,
[TABLE]
where for ,
[TABLE]
First thing to notice is the following identity for ,
[TABLE]
Thus, we aim to apply Proposition 2.2 in order to complete the proof. Clearly, the condition (9) is fulfilled thanks to the uniform negligibility hypothesis (11). Besides, for any , one has because of the hypothesis on . Hence, using (10), we get:
[TABLE]
The assumption (8) readily follows with . ∎
3 Proof of Theorem 1.1
Before going in the details of the demonstration, let us fix our notation and recall some well-known facts about Gamma function and Gamma distribution.
Remark 3.1**.**
A real random variable follows a distribution, ie: , if its density is , . Also,
[TABLE]
For and , the following equalities holds in distribution:
[TABLE]
The main step in our proof is the fulfillment of condition (10) so we need a way to control summation of incomplete Gamma functions. Indeed, previous Remark 3.1 indicates how chi’s survival functions are related to incomplete Gamma functions. As we have to take limit on a sum, we require to work on a finite setting () and estimate each summand by the crucial bound:
Lemma 3.2**.**
For and ,
[TABLE]
Proof.
The upper bound is classic and can be found in [1]. One can use Padé approximants to prove the lower bound of (13). As a reference for this, we give [5]. ∎
We state two well-known facts which are going to be helpful to accuratly evaluate the whole sum.
Remark 3.3**.**
It is well-known that the Gamma function admits the Taylor expansion near :
[TABLE]
This combines well with the hypothesis since it implies uniformly in , ie: . Thus,
[TABLE]
For , let us give another useful identity:
[TABLE]
Lastly, we show a technical result which estimates an auxiliary term in the upcoming computations. Its outcome varies depending to regime and accurately shows the differences between the two regimes.
Lemma 3.4**.**
Let such that for some and .
We define:
[TABLE]
Assume . Then,
[TABLE]
Assume . Then,
[TABLE]
Proof.
We begin with the first case . We treat each three main terms (except ) by second order Taylor expansion, so that equals to:
[TABLE]
Expanding the product, we get:
[TABLE]
The first order terms vanishes. Cleaning the highest order quantities, it leads to:
[TABLE]
Thus, the result readily follows from the assumptions made on and .
Now, we assume . Likewise,
[TABLE]
∎
We can now turn to the actual proof of Theorem 1.1 which is in two stages accordingly to conditions (10) and (11). We prove the first condition.
Proposition 3.5**.**
Let such that and a triangular array of independent random variables with for any .
Assume . For , we set:
[TABLE]
Then for ,
[TABLE]
Assume . For , we set:
[TABLE]
Then for ,
[TABLE]
Proof.
For convenience of notation, let . By Remark 3.1, we write the quantity of interest in terms of incomplete Gamma functions:
[TABLE]
By hypothesis , we can apply Lemma 3.2 to estimate (20):
[TABLE]
Applying the Gamma expansion (14) gives:
[TABLE]
We analyze the last term with the identity (15):
[TABLE]
The decay hypothesis on and definitions (18), (19) imply that and hence,
[TABLE]
Combining everything in (21), we get:
[TABLE]
with
[TABLE]
We turn to the lower bound. For any and , we have:
[TABLE]
Applying the lower bound (13) on (20) and according to (22), we get:
[TABLE]
Repeating the same steps as before, it yields:
[TABLE]
Note that we have not made any additional assumptions on yet. The next step is to estimate the term. According to the regime considered, two different outcomes turn out, which is described in Lemma 3.4.
We begin with the first case and restrict to , ie: . Combining the asymptotic (16) with (23), it follows that:
[TABLE]
From the definition (18) of the scaling function, we compute the following asymptotics:
[TABLE]
[TABLE]
And finally, substituting in (24), the result yields:
[TABLE]
We consider the second regime case for and assume in the sequel that . The main difference lies in Lemma 3.4. We substitute the asymptotic (17) in (23):
[TABLE]
Again, we lastly compute from (19) the required asymptotics:
[TABLE]
[TABLE]
[TABLE]
Combining these with (25), it leads to the desired conclusion:
[TABLE]
∎
Finally, we show condition (11):
Lemma 3.6**.**
Let . Under the assumptions of Proposition 3.5, one has:
[TABLE]
Proof.
For such as in (18) or (19), and , let us write:
[TABLE]
There exists a constant such that:
[TABLE]
Assume that and as in (19). In this case, considering the asymptotics (26), (27) and the fact that
[TABLE]
a little computation shows that the latter quantity in (29) converges to [math] uniformly in as .
When and as in (18), the conclusion follows from (30) and . ∎
4 Proof of Theorem 1.2
From the normalization sequences in Theorem 1.2, we define the scaling function in the same way that (18) by:
[TABLE]
This is an increasing homeomorphism on with inverse . We note that it corresponds to the scaling function of i.i.d. copies of . In other words, for , it satisfies:
[TABLE]
Likewise Theorem 1.1, we follow the path of Theorem 2.3 and begin with the assumption (10). We control the sum of chi’s survival functions with the same approach. The regime does not allow an uniform negligibility on the chi’s random variables. In Theorem 1.1, one takes into consideration all summands, each underlying random variables contributing to the extreme value. This time, the terms with index negligible to , ie: , are inconsequential. In order to get meaninful at the limit, we need to explore the region of . We cut off at the appropriate section of the sum, hence identifying the significant stack of chi’s random variables significant to the maximum.
Lemma 4.1**.**
Let . Let such that and , with , a triangular array of independent random variables and let defined in (31).
Then for ,
[TABLE]
Proof.
For clarity, we will work with instead of , . By Remark 3.1,
[TABLE]
Since for any , one can estimate (32) with Lemma 3.2:
[TABLE]
The extra-term in the lower bound converges to because . Hence, it is sufficient to prove that converges to .
For this purpose, let such that , e.g., \displaystyle{\mu_{n}=\big{(}\log\log(n)\big{)}^{-\frac{1}{2}}}. We split with
[TABLE]
and show that:
[TABLE]
We begin with the main core term . Let us define:
[TABLE]
Consequently,
[TABLE]
We compute the geometric sum in the previous line:
[TABLE]
In order to estimate (35), we use Taylor expansion, the hypothesis on , the definition (31) and (33), which provide the asymptotics:
[TABLE]
Combining (33), (35) and (36) with the bound (34), it provides the asymptotic of :
[TABLE]
Finally, together with (31) and the fact that
[TABLE]
[TABLE]
the expression (37) yields the claim.
Regarding the first sum , there exists a numerical constant such that:
[TABLE]
By Taylor expansion, we find another constant such that the previous upper bound asymptoticaly equals to:
[TABLE]
Using (38) and (39), for some constant , we have:
[TABLE]
The hypothesis on is precisely that , thus the proof is complete. ∎
To conclude the proof, it only remains to show:
Lemma 4.2**.**
Given the assumptions of Theorem 1.2 and defined in (31), for any , one has:
[TABLE]
Proof.
Let and , then
[TABLE]
The latter term is independent of and converges to [math]. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Jonathan M Borwein, O Chan, et al. Uniform bounds for the complementary incomplete gamma function. Mathematical Inequalities and Applications , 12:115–121, 2009.
- 2[2] Ioana Dumitriu and Alan Edelman. Matrix models for beta ensembles. J. Math. Phys. , 43(11):5830–5847, 2002.
- 3[3] Roger A. Horn and Charles R. Johnson. Matrix Analysis . Cambridge University Press, New York, NY, USA, 2nd edition, 2012.
- 4[4] M. R. Leadbetter, Georg Lindgren, and Holger Rootzén. Extremes and related properties of random sequences and processes . Springer Series in Statistics. Springer-Verlag, New York-Berlin, 1983.
- 5[5] Frank WJ Olver, Daniel W Lozier, Ronald F Boisvert, and Charles W Clark. NIST handbook of mathematical functions hardback and CD-ROM . Cambridge University Press, 2010.
- 6[6] Sidney I. Resnick. Extreme values, regular variation and point processes . Springer Series in Operations Research and Financial Engineering. Springer, New York, 2008. Reprint of the 1987 original.
