Optimal soft edge scaling variables for the Gaussian and Laguerre even $\beta$ ensembles
Peter J. Forrester, Allan K. Trinh

TL;DR
This paper investigates the large N asymptotics of eigenvalue densities at the soft edge for Gaussian and Laguerre beta ensembles, revealing a universal correction term of order N^{-2/3} when properly centered.
Contribution
It introduces optimal soft edge scaling variables for Gaussian and Laguerre beta ensembles, demonstrating universal correction behavior in the asymptotic density.
Findings
Leading correction term is O(N^{-2/3}) when scaled variables are centered properly.
The correction behavior is consistent across different parameter regimes in Laguerre ensembles.
Numerical evidence supports the theoretical asymptotic results.
Abstract
The ensembles are a class of eigenvalue probability densities which generalise the invariant ensembles of classical random matrix theory. In the case of the Gaussian and Laguerre weights, the corresponding eigenvalue densities are known in terms of certain dimensional integrals. We study the large asymptotics of the density with a soft edge scaling. In the Laguerre case, this is done with both the parameter fixed, and with proportional to . It is found in all these cases that by appropriately centring the scaled variable, the leading correction term to the limiting density is . A known differential-difference recurrence from the theory of Selberg integrals allows for a numerical demonstration of this effect.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Optimal soft edge scaling variables for the Gaussian and Laguerre even ensembles
Peter J. Forrester and Allan K. Trinh
Department of Mathematics and Statistics, ARC Centre of Excellence for Mathematical & Statistical Frontiers, University of Melbourne, Victoria 3010, Australia
Abstract.
The ensembles are a class of eigenvalue probability densities which generalise the invariant ensembles of classical random matrix theory. In the case of the Gaussian and Laguerre weights, the corresponding eigenvalue densities are known in terms of certain dimensional integrals. We study the large asymptotics of the density with a soft edge scaling. In the Laguerre case, this is done with both the parameter fixed, and with proportional to . It is found in all these cases that by appropriately centring the scaled variable, the leading correction term to the limiting density is . A known differential-difference recurrence from the theory of Selberg integrals allows for a numerical demonstration of this effect.
1. Introduction
Universal results, whereby scaled large properties are independent of many local properties, abound in random matrix theory. Due to this, it is possible to talk of the statistical properties of large sized, bulk scaled complex Hermitian matrices, for example, without having to be more specific about the details of the ensemble. Thus the random Hermitian matrices may be formed entry-wise by sampling from a zero mean, finite standard deviation distribution – this is the Wigner class – or they may be defined by imposing a unitary invariant measure i.e. depending on the eigenvalues but not the eigenvectors; see e.g. [33].
Bulk scaling refers to choosing the origin in the support of the spectrum but away from the edges, then choosing for the length scale units of the mean spacing in the neighbourhood of this point. This is to be contrasted to the soft edge scaling, for which the origin is chosen to be in the neighborhood of the largest (or possibly smallest) eigenvalue, and the length scale is chosen to be of the order of the spacing between the first and the second eigenvalues.
Universality results apply in both the bulk and soft edge scaling regimes [12]. These two cases show themselves in some celebrated applications of random matrices. In the study of the statistical properties of the large Riemann zeros, the Montgomery-Odlyzko law asserts that upon scaling to have mean spacing unity, these coincide with the statistical properties of the bulk scaled eigenvalues of Hermitian matrices; see e.g. [25]. And for growth models on curved interfaces in the KPZ class, fluctuations have been predicted theoretically [32] and shown experimentally [34], to be identical to the fluctuations of the soft edge scaled largest eigenvalue of a random Hermitian matrix. Since many statistical quantities can be computed exactly for bulk and soft edge scaled Hermitian random matrices (see e.g. [14]), there are explicit functional forms available against which to compare data relating to the applications.
Data on the Montgomery-Odlyzko law comes by way of various data sets of Odlyzko – originally the high precision evaluation of the -th Riemann zero and over neighbours [29], and more recently high precision evaluation of the () zero and over neighbours as announced in [30]. The latter sample size is sufficiently large that the difference between the empirical statistical quantity of interest (e.g. the nearest neighbour spacing) as determined from the data, and the random matrix prediction implied by the Montgomery-Odlyzko law, exhibits a distinct functional form [30]. In keeping with the proposal of Keating and Snaith [26] that the eigenvalues of random unitary matrices chosen with Haar measure provides the correct statistical description of finite size effects, one can use knowledge of the leading correction to the bulk scaled distribution in this random matrix ensemble to predict the forms seen in the data [2, 17, 4].
Soft edge finite size effects can also be analysed [21, 5, 15, 11]. While different ensembles exhibit different functional forms for the leading correction, there is growing evidence that the optimal order – as controlled by the precise choice of the centring variable – has the leading soft edge correction for complex Hermitian matrices being proportional to [20]. Although not the subject of the present work, there is also a literature along these lines with respect to hard edge scaling; see [10, 3, 31].
In the case of real symmetric matrices, the work of Johnstone [23] and collaborators [24, 28] has shown that the optimal choice of the soft edge centring and scaling variables for the classical ensembles again leads to a correction proportional to . Here the term classical ensembles refers to the particular class of orthogonally invariant real symmetric random matrices with eigenvalue probability density (PDF) proportional to
[TABLE]
chosen with weight function one of
[TABLE]
(the notation denotes the function taking the value for true and [math] otherwise).
The purpose of the present paper is to study the optimal choice of the soft edge scaling for the -generalisation of the Gaussian and Laguerre orthogonal ensembles as specified by (1.1) and (1.2). Now the eigenvalue PDF is proportional to
[TABLE]
with weight function
[TABLE]
The matrix ensemble implied by (1.3) in the Gaussian case was first introduced by Dyson [9] as an interpolation of the classical Gaussian orthogonal (), unitary () and sympletic () ensembles. Later it was shown that in both the Gaussian and Laguerre cases, (1.3) for general can be realised as the eigenvalue PDF for particular real tridiagonal matrices [8].
The quantity to be considered is the eigenvalue density
[TABLE]
where depending on the weight (1.4) and
[TABLE]
is the normalisation. We note that (1.5) can be written as
[TABLE]
where denotes the average with respect to (1.3) in the case of eigenvalues . For even, the average is a polynomial in , and it is this feature which gives opportunity to analyse the soft edge scaling limit.
Earlier work of Desrosiers and Forrester [7] has, for even, computed the soft edge scaling limit of for and . This involves the multidimensional integral
[TABLE]
which extends the integral representation of the Airy function,
[TABLE]
It was shown in [7] that
[TABLE]
Here we extend these results to bound the leading correction, by identifying the optimal choice of the centring variable.
Theorem 1**.**
We have that
[TABLE]
and
[TABLE]
are, for large , equal to the RHS of (1.10) with a correction term .
An analogous result can also be established for a soft edge scaling in the Laguerre case with .
Theorem 2**.**
Consider the Laguerre case of (1.3) with . Let
[TABLE]
We have that
[TABLE]
is, for large , equal to the RHS of (1.10) with a correction term .
Moreover, our working below gives the explicit form of the leading order correction term in each case in terms of generalisations of (1.8). In the Gaussian case this is given by (3.20) while in the Laguerre case the relevant expressions are (4.11) with and (4.21) with a fixed.
In Section 2 we revise known results for the leading correction to not only the density, but also the general -point correlation function for the Gaussian and Laguerre unitary ensembles ( case of (1.3) and (1.4)). Specialised to the density, these results provide a check on our subsequent working. We proceed in Section 3 to establish the Gaussian case of Theorem 1. Section 4 establishes the Laguerre case of Theorem 1, as well as Theorem 2. A known differential-difference recurrence for computing the polynomial in (1.7) allows for a numerical and graphical demonstration, given in Section 5, that the identified scaling variables exhibit convergence at the claimed rate.
2. The case
A distinguishing feature of the case is that the general -point correlation is determinantal,
[TABLE]
for some correlation kernel independent of . This provides a strategy to establish the leading correction term to the limiting form of under optimal soft edge scaling of this correlation function: thus one should analyse the corresponding asymptotic properties of . This was done in the Gaussian and Laguerre cases (the latter with fixed) by Choup [5] and later extended to the Laguerre unitary ensemble with in [20], giving an correction in each case.
To present these results, define
[TABLE]
where in the final formula is given by (1.11). Also set
[TABLE]
The results of [5, 20] tell us that for large
[TABLE]
for and furthermore tell us the precise functional form of the term . To present the latter, we specialise to the case , when the LHS of (2.4) corresponds to the soft edge scaled density. Using the notation (2.3) we have
[TABLE]
3. The Gaussian ensemble
Key to analysing (1.7) in the Gaussian case for even is the duality formula [1]
[TABLE]
Here we have used the notation to refer to the PDF corresponding to (1.3). Note the exchange of roles between the two sides of (3.1). We see that the average in (1.7) is of the form of the LHS of (3.1) for even, telling us that then
[TABLE]
The normalisation and that implied by the average in (3.2) are known explicitly, see e.g. [1]. Introducing the notation of this latter reference allows (3.2) to be written, after minor manipulation,
[TABLE]
where
[TABLE]
and
[TABLE]
with
[TABLE]
The contour in (3.4) is equal to as read off from (3.2), however by first ordering the product of differences in the integrand can be interpreted as an analytic function and deformed into the complex plane by Cauchy’s theorem (see [7]).
Our interest is the large asymptotics of (3.3) after the replacement
[TABLE]
where the new scaled variable is . The choice (3.6) is consistent with the first case of (2.2). We begin by noting that upon simplification, then use of Stirling’s formula, one can check
[TABLE]
Also we have
[TABLE]
In relation to the exponent in (3.4), with we see from (3.5) that
[TABLE]
With (3.9) substituted in (3.4), the next step is to perform a saddle point analysis of (3.4). This has been done to leading order in [7] – here we extend the working therein to the next order but appealing to the latter reference for the formal justification of the methodology.
According to (3.9) the leading order term in the exponent of (3.4) is , which we can check has a double saddle point at . Expanding about this point by introducing the variable , then changing variables – this is in keeping with the direction of steepest descent; see [7] – gives
[TABLE]
Consequently
[TABLE]
where
[TABLE]
As an extension of (1.8), define
[TABLE]
Substituting (3.10) in (3.4), then substituting the result together with the expansion (3.7) in (3.3) with replaced by (3.6), we deduce that
[TABLE]
Our task now is to obtain a simplification of .
Proposition 1**.**
We have
[TABLE]
Proof.
In (3.12), let is regard as an operator acting on all terms in the integrand to the right.
It follows from the fundamental theorem of calculus that
[TABLE]
On the other hand, by direct differentiation
[TABLE]
where the second equality follows from the fact that the integrand implied by the final term in the line above is anti-symmetric upon the interchange of any two co-ordinates and thus vanishes. Hence
[TABLE]
It similarly follows from the fundamental theorem of calculus that
[TABLE]
while direct differentiation gives
[TABLE]
Taking the arithmetic mean of the final term with itself after interchanging shows
[TABLE]
Hence
[TABLE]
Recalling the definition of in (3.11), it follows from (3.15) and (3.16) that
[TABLE]
The result (3.14) now follows from the fact that
[TABLE]
∎
The results (3.13), (1.10) and (3.14) tell us that for even
[TABLE]
where
[TABLE]
The first statement in Theorem 1 now follows from (3.18) by recentring according to then expanding the first term according to Taylor’s theorem up to second order, and the second term up to first order; this process also gives the explicit correction term as
[TABLE]
Remark 1*.*
(A.) In the case we have from (3.20), (3.19) and (3.12) that the correction term (3.20) is equal to
[TABLE]
where is given by (3.11). Expanding and making use of (1.9) gives agreement with as specified in (2.3).
(B.) The factor in (3.14) is familiar as a factor in the corrections to the bulk smoothed densities in the Gaussian and Laguerre ensembles; see [22] and the recent works [35], [19].
4. The Laguerre ensemble
Introduce the normalisations
[TABLE]
Use the notation CE to denote the ensemble of eigen-angles specified by the PDF proportional to
[TABLE]
Combining [14, Prop. 13.2.5 & Exercises 13.1 (4.4)] gives the duality formula
[TABLE]
According to [7] the multi-dimensional integral implied by the RHS of (4.1) can be written in the form (3.4), but with
[TABLE]
and a counterclockwise closed contour about . Furthermore, as in the Gaussian case, by suitably ordering the contours the absolute value signs about can be removed and the contours deformed into the complex plane. Given this, after recalling (1.7), and using the notation as is specified in terms of and by (3.4), we have in keeping with [7, Eq. (22)] that
[TABLE]
The case
In keeping with the third case of (2.2), we want to expand for large to the first two orders (4.3) with and
[TABLE]
We begin by noting
[TABLE]
and
[TABLE]
In (4.6) use was made use of Stirling’s formula and the multiplication formula for the gamma function
[TABLE]
Next, we turn to a saddle point analysis of the integral in (4.3). After making the substitution (4.4) in (4.2) we see that up to terms of order , the latter quantity is equal to
[TABLE]
This possesses a double saddle point at . We expand about this point in the direction of steepest descent by the changing of variables . This gives
[TABLE]
Combining the results (4.5), (4.6), (4.7) with (4.3) shows
[TABLE]
where
[TABLE]
Analogous to the result of Proposition 1, the term in (4.8) can be identified as being proportional to the derivative of the leading term.
Proposition 2**.**
We have
[TABLE]
Proof.
This follows directly from (4.9), (3.15), (3.16) and (3.17). ∎
Combining (4.8) and (4.10) establishes Theorem 2. In addition, with the notation (3.19), we see that the explicit form of the correction term is
[TABLE]
In the case , proceeding as in the Gaussian case of Remark 1, we can check that this is consistent with as specified in (2.3).
The case fixed
We fix and make the replacement
[TABLE]
A simple calculation shows
[TABLE]
Further, by use of Stirling’s formula and the multiplication formula for the gamma function we deduce
[TABLE]
With fixed we read of that the leading large term of (4.2) is
[TABLE]
This has a double saddle point at . Expanding around this point in the direction of the steepest descent by the change of variables then shows
[TABLE]
By combining (4.13), (4.14), (4.15) with (4.3), for the Laguerre -ensemble with fixed, the large expansion of the soft edge scaled density as defined in the variable (4.12) is given by
[TABLE]
where
[TABLE]
Immediate from (4.17) and (3.17) is that
[TABLE]
Applying the translation to (4.16) then shows that the scaled soft edge density in the variable listed second in (2.2), up to order , is given by
[TABLE]
where
[TABLE]
This corroborates with Theorem 1 in the Laguerre case, and furthermore gives the leading correction as
[TABLE]
where is specified according to (3.19) with the substitution (4.20). Note that this correction is independent of . In the case this latter property is already evident from the appropriate case of (2.3). The procedure of Remark 1 can be used to reduce (4.21) with to this functional form.
5. Numerics
In this section we provide a method to determine the graphical form of the optimal correction term for the soft edge density of the Gaussian and Laguerre even -ensembles. Multiple integral forms have been given in (3.20), (4.11) and (4.21). However these are not well suited to accurate numerical evaluation. Instead we turn to known [13, 16, 18] recursive properties relating to the average in (1.7), recalling that for even the latter is a polynomial of degree . As such we are making use of the broader theory of the Selberg integral; see [14, Ch. 4].
For fixed parameters , introduce the auxiliary function as the multiple integral
[TABLE]
where
[TABLE]
denotes the elementary symmetric polynomials. In the case we know from [13] that these integrals satisfy the differential-difference equation
[TABLE]
where
[TABLE]
The immediate relevance of (5.1) to the average (1.7) comes from the fact that
[TABLE]
where the superscript indicates use of the Jacobi weight in (1.7). In the case , , (5.1) is independent of , which provides an initial condition to iterate (5.2) up to . The crucial property
[TABLE]
then gives the value of the polynomial in the case , , and the iteration can be continued.
Introduce the notation
[TABLE]
for the Gaussian and Laguerre cases of (5.1). A simple change of variables shows
[TABLE]
Applying this limiting procedure to (5.2) gives
[TABLE]
and
[TABLE]
where and . The constant (i.e. independent) values for , can be read off from the normalisations appearing in (3.3) and (4.1), which being limiting cases of the Selberg integral can be expressed in terms of products of gamma functions. Thus G_{0}\big{\rvert}_{\alpha=1}=G_{2\lambda,N} and L_{0}\big{\rvert}_{\alpha=1}=W_{a,2\lambda,N}.
Figures 1, 2, 3 and 4 were generated by substituting the appropriate soft edge variables
[TABLE]
into (5.5) and (5.6). With , defined by (5.7), (2.2) (with the exception for the Laguerre case with fixed, where we take instead ), Figures 1, 2 and 3 plot
[TABLE]
For some large values, the successive differences given by (5.8) admit, as seen graphically, the same functional form and are . (5.8) plots the corrections (3.20), (4.21), (4.11) up to some error, to leading order, proportional to Figure 4 plots
[TABLE]
where
[TABLE]
and shows the approximate functional form of the correctional term of order due to the scaling (2.2) which bears resemblance to the derivative of (1.7). This further emphasises the results found in Proposition 1, 2 and (4.18). The small displacement relative to the derivative in the displayed examples – which we take as a finite N artifact –- is not a feature of the previous figures (right captions) due to the fundamentally different nature of what is being plotted.
6. More general invariant ensembles
In our previous study [20], the question of the optimal soft edge scaling for complex Hermitian Wigner matrices was briefly addressed. We recall that an Hermitian random matrix is termed a Wigner ensemble if all its diagonal entries (which must be real) are independently chosen from the same zero mean, finite variance distribution, and all its upper triangular entries (which may be complex) chosen similarly. The Gaussian unitary ensemble, corresponding to the Gaussian case of (1.1) with , is both a unitary invariant ensemble, and a complex Wigner matrix: matrices from this ensemble can be generated by choosing the diagonal entries from the normal distribution N and the upper triangular entries from N. In [20] numerical simulations were performed on the particular complex Wigner matrices , where all entries of are chosen independently and uniformly from the set of four values . The off diagonal entries of then have mean zero and variance one half, as for the GUE, guaranteeing (see e.g. [33]) that to leading order the largest eigenvalue occurs at . By the use of simulation, evidence was presented that a scaled and centred variable can be identified such that the leading correction to the limiting distribution of the soft edge scaled largest eigenvalue is . Of course the natural question is to ask if this effect holds more generally.
What then for the case of general invariant ensembles beyond the classical cases? In the case , and for the family of weights , as noted in [27, Eq. (1.9)], results of [6] give the asymptotic formula for the soft edge scaled correlation kernel (2.1)
[TABLE]
where depend on . Thus once again there is a choice of scaling variables such that the correction to the correlations at the soft edge is .
Acknowledgements
This work is part of a research program supported by the Australian Research Council (ARC) through the ARC Centre of Excellence for Mathematical and Statistical frontiers (ACEMS). PJF also acknowledges partial support from ARC grant DP170102028, and AKT acknowledges the support of a Melbourne postgraduate award.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] T.H. Baker and P.J. Forrester, The Calogero-Sutherland model and generalized classical polynomials , Commun. Math. Phys. 188 (1997), 175–216.
- 2[2] E. Bogomolny, O. Bohigas, P. Leboeuf, and A.C. Monastra, On the spacing distribution of the Riemann zeros: corrections to the asymptotic result , J. Phys. A 39 (2006), 10743–10754.
- 3[3] F. Bornemann, A note on the expansion of the smallest eigenvalue distribution of the LUE at the hard edge , Ann. Appl. Prob. 26 (2016), 1942–1946.
- 4[4] F. Bornemann, P.J. Forrester, and A. Mays, Finite size effects for spacing distributions in random matrix theory: circular ensembles and Riemann zeros , Stud. Appl. Math. 138 (2017), 401–437.
- 5[5] L.N. Choup, Mesoscopic fluctuations for the thinned circular unitary ensemble , Int. Math. Res. Notices 2006 (2006), 61049.
- 6[6] P. Deift and D. Gioev, Universality at the edge of the spectrum for unitary, orthogonal, and symplectic ensembles of random matrices , Comm. Pure Appl. Math., 60 (2007), 867?910.
- 7[7] P. Desrosiers and P.J. Forrester, Hermite and Laguerre β 𝛽 \beta -ensembles: asymptotic corrections to the eigenvalue density , Nucl. Phys. B 743 (2006), 307–332.
- 8[8] I. Dumitriu and A. Edelman, Matrix models for beta ensembles , J. Math. Phys. 43 (2002), 5830–5847.
