Extreme eigenvalues of random matrices from Jacobi ensembles
B. Winn

TL;DR
This paper derives two-term asymptotic formulas for the distribution of the smallest and largest eigenvalues in Jacobi beta-ensembles, revealing explicit expressions and correction terms for large matrices.
Contribution
It provides new two-term asymptotic formulas for eigenvalue distributions in Jacobi beta-ensembles with explicit correction terms and special case formulas involving familiar functions.
Findings
Explicit two-term asymptotic formulas derived
First-order corrections proportional to distribution derivatives
Special cases with explicit formulas involving Bessel functions
Abstract
Two-term asymptotic formulae for the probability distribution functions for the smallest eigenvalue of the Jacobi -Ensembles are derived for matrices of large size in the r\'egime where is arbitrary and one of the model parameters is an integer. By a straightforward transformation this leads to corresponding results for the distribution of the largest eigenvalue. The explicit expressions are given in terms of multi-variable hypergeometric functions, and it is found that the first-order corrections are proportional to the derivative of the leading order limiting distribution function. In some special cases and/or small values of , explicit formulae involving more familiar functions, such as the modified Bessel function of the first kind, are presented.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Quantum Mechanics and Non-Hermitian Physics · Molecular spectroscopy and chirality
Extreme eigenvalues of random matrices from Jacobi ensembles
B. Winn
Department of Mathematical Sciences, Loughborough University, Loughborough, LE11 3TU, U.K.
(22ndJanuary 2024)
Abstract
Two-term asymptotic formulæ for the probability distribution functions for the smallest eigenvalue of the Jacobi -Ensembles are derived for matrices of large size in the régime where is arbitrary and one of the model parameters is an integer. By a straightforward transformation this leads to corresponding results for the distribution of the largest eigenvalue. The explicit expressions are given in terms of multi-variable hypergeometric functions, and it is found that the first-order corrections are proportional to the derivative of the leading order limiting distribution function.
In some special cases and/or small values of , explicit formulæ involving more familiar functions, such as the modified Bessel function of the first kind, are presented.
1 Introduction
A random matrix is a matrix whose entries are random variables. As eigenvalues of a matrix are continuous functions of its entries, so the eigenvalues of a random matrix are random variables. A random matrix has the Jacobi -Ensemble (JE) distribution if a joint probability density function of its eigenvalues is
[TABLE]
where
[TABLE]
and the Vandermonde determinant is defined by
[TABLE]
That (1.1) is a properly normalised probability density is a consequence of Selberg’s integral [1].
In many situations is a non-negative integer, and we will mostly be assuming that one of or is a non-negative integer, but (1.1) makes sense for arbitrary real values of these parameters subject to the constraints
[TABLE]
The naming of this ensemble reflects the presence in (1.1) of the factors which are a density with respect to which (a certain version of) the classical Jacobi polynomials form an orthogonal collection.
We label by the sorted eigenvalues, so that . This article is concerned with the distribution of the extreme eigenvalues and . In fact, since the change of variables , for , in (1.1) leaves the joint probability density invariant, save for the exchange , and reverses the order of the eigenvalues, it will not present a loss of generality to focus on the smallest eigenvalue .
The limiting empirical eigenvalue density for Jacobi random matrices was derived in [2]. For fixed , the large limiting density is
[TABLE]
This means that for large the number of eigenvalues in the interval is approximately , and it is natural to expect to have a non-trivial limiting distribution.
Our main objects of interest will be the (cumulative) probabilty distribution function , and the rescaled version
[TABLE]
Deferring to below a more comprehensive summary of previous work on this problem, we mention a result [3] of Moreno-Pozas, Morales-Jimenez, McKay in the case (the Jacobi Unitary Ensemble, JUE). They proved, for and ; and for , the two-term asymptotic result
[TABLE]
where the determinants appearing in (1.7) are of size , and is the -Bessel function—the modified Bessel function of the first kind,
[TABLE]
and for .
On the other hand, Borodin and Forrester have derived [4] the leading-order distribution of the smallest eigenvalue of the JE for any and :
[TABLE]
where is a multivariate hypergeometric function that will be defined precisely in Section 2.3, and . Our principal result is a version of the two-term asymptotic (1.7) valid for .
Theorem 1.1**.**
Let be the smallest eigenvalue of the Jacobi -Ensemble, , with and . For ,
[TABLE]
The error estimate can depend on but is uniform for in a compact set.
All our results for the distribution of the smallest eigenvalue can be re-cast to give an analogous result for the largest eigenvalue, as indicated earlier. We will not write down these analogues for every result, allowing just the following Corollary of Theorem 1.1.
Corollary 1.2**.**
Let be the largest eigenvalue of the Jacobi -Ensemble, , with and . For ,
[TABLE]
There are several known random matrix models that lead to JE eigenvalue distributions. Most famous are perhaps the double-Wishart (or Manova) models from Statistics: set , to be independent and matrices with independent standard normal real random variable entries, . If , , then the matrix has eigenvalues distributed according to the JE with , and [5, 6, 7, 8, 9]. Since our results rely on being integer, this requires to be an odd diference.
If we repeat the above construction, with complex normal random variables, then the eigenvalue distribution of is JE with , and [10, Section 8].
Another model leading to the joint probability density function (1.1) is the corners process of random matrices from classical compact groups: if is a random unitary or orthogonal matrix chosen with respect to Haar measure, , and is the principal submatrix of (the upper-left corner matrix), then, letting denote the eigenvalues of , the points are distributed according to (1.1) with and (unitary case) or (orthogonal case) [11, 12, §7.2]. In the latter case which is not an integer, so Theorem 1.1 does not apply, but Corollary 1.2 does apply for the distribution of the largest eigenvalue if is an odd number (whence is an integer).
Random matrix models that allow full exploration of the parameter space, including to arbitrary , are also known [13, 14, 15].
The JE exhibits two “hard edges” in the spectrum at and since the eigenvalues are strictly confined between these values, which furthermore coincides with the support of the limiting eigenvalue density (1.5). This is in contrast to some other random matrix models such as the Gaussian ensembles [16] which have compactly supported limiting eigenvalue density—the famous Wigner’s semi-circle law [17, 18]—but without any intrisic obstacle to having individual eigenvalues appearing at any point on the real line. Statistics such as the distribution of smallest eigenvalues are expected to be “universal” in the limit , in the sense that they ought not to depend on the precise features of the random matrix model in question. In our present context it means that the limiting distribution (1.9) will be valid for other matrix models with a hard spectral edge. Indeed, the same limiting distribution has been proven for a different set of matrix models exhibiting a hard edge—the Laguerre -Ensembles (LE; sometimes called Wishart random matrices) [19], as well as modifications of the JUE that preserve the hard edge [20].
The finite corrections to the leading order derived in Theorem 1.1 are not expected to be universal—indeed the presence of the parameter seems to rule that out—but they do exhibit an interesting feature that had already been conjectured for the Laguerre Unitary Ensemble at the hard edge [21] and proved for that model in [22, 23, 24]: the correction term is proportional to the derivative of the main term. This holds for our two-term asymptotic (1.10), although it may not seem immediately apparent: see (5.7) below. Forrester and Trinh [25] have investigated the eigenvalue density for the LE for , and found two-term asymptotics at the hard edge of the spectrum, and that the correction term is also proportional to the derivative of the leading term. It seems likely that the methods in the present work could also be adapted to study the hard-edge of the LE too.
JE random matrices have a number of known applications. The outage probability of multiple-input/multiple-output (MIMO) systems subject to interference, such as those used in cellular mobile radio networks, can be modelled in terms of the largest eigenvalue of JUE matrices [26]. The conductance eigenvalues in random matrix models for mesoscopic disordered quantum systems are known to be governed by the JE distribution with [27, 28]. In this context, expressions for the average spectral density have been derived in terms of multi-variable hypergeometric functions [29], somewhat similar to expressions for the smallest eigenvalue derived in Section 3. Finally, some tests in multivariate Statistics are based on the distributions of extreme eigenvalues of JE (generally the parameters and corresponding to the real and complex underlying fields are most relevant), see Roy [30]. Some of these statistical applications are reviewed in Section 2 of [31]. Roy’s test has practical applications in signal analysis in the presence of coloured noise, for which the distribution of the largest eigenvalue of the JUE is required [32].
Aside from the references [3, 4] mentioned above, theoretical work on the distribution of extreme eigenvalues for Jacobi ensembles goes back at least to [33] for and Constantine [34] for , motivated by the aforementioned applications in Statistics.
In [35] expressions were derived for distribution functions in terms of multivariate hypergeometric functions in variables, and corresponding formulae for density functions in variables given in [36] and [37]. Algorithms for a numerical evaluation of the distribution of the smallest eigenvalue in the JUE were given in [38] with methods applicable to arbitrary values of the parameters , and furthermore which extend even to non-integer values of .
Johnstone [31] and Jiang [39] have investigated statistics of extreme eigenvalues, and other quantities, in a setting where the parameter values and are not fixed, but vary as , leading to a soft edge in the spectrum. Scaling limits at the hard and soft-edge were treated together in [40].
Forrester and Li [41] have studied eigenvalue correlations for a broader class of unitary ensembles with a hard edge at the spectrum (which includes the JUE) and found -correction terms consistent with [3].
In Section 2 we introduce some of the analytic tools that will be used (multi-variable hypergeometric functions and Jacobi polynomials). In Section 3 we collect some exact formulæ for finite . In Section 4 we prove a two-term asymptotic formula for multivariate hypergeometric functions, that is then used to give the proof of Theorem 1.1 in Section 5. A few special cases are treated in Section 6.
2 Multi-variable hypergeometric functions and
Jacobi polynomials
Multi-variable analogues of classical hypergeometric functions and orthogonal polynomials are a relatively recently-developed area of study that have nevertheless proved very useful in Random Matrix Theory, see, e.g. [42, 43, 44, 45, 46, 47] as well as many other articles cited in the present work. They can be defined as series of Jack polynomials which we define first.
2.1 Jack polynomials
Let be a set of variables, be an integer partition111We can assume that the number of parts of is equal to the number of variables, since if there are more parts than variables the corresponding Jack polynomial is zero; on the other hand, any partition can be padded with [math]s to increase the number of parts to . of size , and let . The Jack polynomials [48] are certain homogeneous, symmetric polynomials of degree .
We define the operators
[TABLE]
and
[TABLE]
for
Jack polynomials are joint eigenfunctions of and [49]. In fact,
[TABLE]
(a relation satisfied by any homogeneous polynomial of degree ) and
[TABLE]
where
[TABLE]
The definition of is completed by triangularisation: if
[TABLE]
is the expansion of in the basis of monomial symmetric functions, then the coefficient unless in terms of dominance ordering of partitions [49]; and normalisation:
[TABLE]
(That such a normalisation exists is proved in [49, Prop. 2.3], although a different normalisation for the Jack polynomials is actually used throughout [49]. The normalisation leading to (2.7) is commonly-used for applications in Random Matrix Theory.)
2.2 Multi-variable Jacobi polynomials
As with multi-variable hypergeometric functions defined in the next subsection, multi-variable generalisations of the classical Jacobi polynomials were initially studied for Jack parameter [50] with applications in Statistics in mind. Later these were generalised to other values of , with a variety of conventions for normalisation and support of the orthogonality measure [51, 52, 53, 54]. They sometimes go by the name “Jacobi polynomials associated with the root system ” [55]. In our definitions, we follow [56] with a difference in the choice of normalisation.
For fixed, is a symmetric polynomial eigenfunction of the operator
[TABLE]
of the form
[TABLE]
for constants depending on and , and the notation means for . We normalise by requiring (the “monic” choice).
The multi-variable Jacobi polynomials are orthogonal with respect to the joint probability density (1.1) of the JE [56, Théorème 2].
2.3 Multivariate hypergeometric functions
Multivariate hypergeometric functions were introduced for general values of the Jack parameter by Kaneko [57] and Korányi [58], generalising the definition relevant to the case introduced by Herz [59], the Statistics applications of which being studied in [34, 10, 60]. Efficient numerical implementations of multi-variable hypergeometric functions are available [61].
They are defined as a sum over partitions as
[TABLE]
where is the generalised Pochhammer symbol defined by
[TABLE]
and the classical Pochhammer symbol is
[TABLE]
The reader familiar with hypergeometric functions of a single variable (recapitulated in (4.17) below) will recognise the generalisation (2.10).
For general values of the parameters the series in (2.10) converges absolutely for all if and for in some ball if [57]. However, if any of the “upper” parameters, say, is equal to a negative integer , , then the series contains only finitely-many terms and defines a multi-variable symmetric polynomial of degree .
2.4 Some useful identities
Yan undertook one of the first systematic studies of multi-variable hypergeometric functions for arbitrary and proved a number of formulæ and identities, including the Pfaff-like formula [62, eq. (35)]
[TABLE]
(The case was derived in [60, Theorem 7.4.3].)
A number of integral representations are also available. We mention here, and will use below, the formula due to Kaneko [57]:
[TABLE]
valid for , , and . In (2.14) and below we use as a shorthand for .
3 Calculations for finite-size matrices
In this section we collect some formulæ for the distribution and density of the smallest eigenvalue of a JE matrix of fixed finite size .
3.1 Probability distribution of the smallest eigenvalue
If is the smallest eigenvalue of the JE then, for any constant ,
[TABLE]
As all eigenvalues are between [math] and , we obviously have
[TABLE]
so it will be sufficient to find expressions for the probability in the range .
Proposition 3.1**.**
Let be the smallest eigenvalue of the joint distribution (1.1) with . Then, for ,
[TABLE]
Proof.XRecalling that are un-ordered eigenvalues, we integrate the joint probability density (1.1), to get
[TABLE]
If we make the substitution , , this maps each of the integrals to an integral over , and we have
[TABLE]
For this integral can be evaluated by means of Kaneko’s integral (2.14) to get
[TABLE]
Up to this point we have followed Borodin and Forrester’s paper [4] (our (3.6) is equation (3.16) therein). The only novel step in the proof is to simplify the argument of the multivariate hypergeometric function in (3.6) by applying the Pfaff-like identity (2.13) to give (3.3).
Based on (3.6), Borodin and Forrester proved the asymptotic scaling limit (1.9) for the smallest eigenvalue.
We have also a formula for in terms of multi-variable Jacobi polynomials.
Corollary 3.2**.**
With and as above, an alternative expression for the probability in Proposition 3.1 is, for ,
[TABLE]
where is the multi-variable Jacobi polynomial, and an explicit expression for the denominator in (3.7) is
[TABLE]
In these expressions is a shorthand for .
Corollary 3.2 will be proved in Section 5.2. We also note that explicitly computable recursions for coefficients in the series expansion in powers of for have been derived in [63].
3.2 Probability density of the smallest eigenvalue
Our main interest is in the probability distribution function of the smallest eigenvalue of the JE. However with little effort we can derive a formula for a probabilty density in terms of a multi-variable hypergeometric function, that will also be used to prove a key differentiation identity (Corollary 3.4 below).
Proposition 3.3**.**
If , a marginal probability density function for the smallest eigenvalue of the Jacobi -Ensemble (1.1) is given by
[TABLE]
for , where the normalisation constant is
[TABLE]
Proof.XThe joint probability density function of the ordered eigenvalues of the JE is
[TABLE]
This has the same functional form as (1.1), except for the factor in the numerator to account for the ordering of the variables. To derive the marginal density function for we integrate out all the other variables
[TABLE]
un-ordering the integrations. With the change of variables for , this multiple integral becomes
[TABLE]
where, in a slightly unusual notation . The -fold multiple integral may be evaluated by means of Kaneko’s integral (2.14) to give
[TABLE]
By an application of the Pfaff-like identity (2.13), this may be re-written as (3.9).
The formula (3.9) for the probability density function was first derived by Dumitriu [36], with a different method of proof. Slightly different, but equivalent, multivariable hypergeometric function representations for the probability density function have been given in [37].
Corollary 3.4**.**
For we have the derivative identity
[TABLE]
for all except possibly .
Proof.XFrom (3.1) and (3.3) above the probability distribution function of , the smallest eigenvalue, is
[TABLE]
for , and a probability density function is given by (3.9). The result (3.15) follows because the density function agrees with the derivative of the distribution function at points of continuity. By analytic continuation the identity persists outside of the interval .
We remark that the result (3.15) does not seem easy to prove in a direct way starting from the definition (2.10) of the multivariate hypergeometric functions. A similar observation was made by Forrester [19] who found the analogous identity at the level of multivariate hypergeometric functions.
Later, we will want to take the limit , so we record here the asymptotic behaviour of in this limit.
Lemma 3.5**.**
As we have
[TABLE]
Proof.XUsing the value (1.2) for the Selberg integrals, and cancelling common factors we get
[TABLE]
Re-writing the factors appearing in the product in (3.18) as
[TABLE]
we realise that many factors cancel in the product over and we are left with
[TABLE]
Reuniting the product with the prefactors in (3.18) and further cancellation results in
[TABLE]
The asymptotic (3.17) follows by applying the asymptotic formula
[TABLE]
to the -dependent factors.
4 Two-term asymptotic formula
Our main analytic tool is going to be a two-term asymptotic formula for the multi-variable hypergeometric function, stated below, and proved in the following subsections.
Theorem 4.1**.**
Let and be fixed, such that is not a negative integer for . Then with ,
[TABLE]
where the error estimate is uniform for in compact subsets of , but may depend on . The operator in (4.1) is defined in (2.2).
Our strategy will be to split the sum defining the multi-variable hypergeometric function as
[TABLE]
recalling that the Jack polynomials are homogeneous of order . It will turn out that the tail terms (the second sum) do not contribute significantly to the limit.
4.1 Preliminary results
It is a consequence of Stirling’s formula that
[TABLE]
as . We will need a form of this result with control on how the error depends on the parameters .
Lemma 4.2**.**
Suppose is a quantity such that as . Then
[TABLE]
as with .
Proof.XBy the classical Stirling formula [64, 6.1.37]
[TABLE]
as with . Therefore,
[TABLE]
using
[TABLE]
and
[TABLE]
provided .
Now,
[TABLE]
provided . Putting it together with (4.6) we get (4.4).
Corollary 4.3**.**
Suppose and satisfy and as . Then
[TABLE]
Proof.XApplying Lemma 4.2 and cancelling common factors,
[TABLE]
by the binomial theorem. Re-writing the second term gives (4.10).
We will also require some bounds on Pochhammer symbols (2.12).
Lemma 4.4**.**
Let be fixed, and .
If then ; 2. 2.
If then .
Proof.XWe have that
[TABLE]
If then using the second line of (4.12) and the triangle inequality
[TABLE]
If then re-ordering the product,
[TABLE]
However,
[TABLE]
so from (4.14) we end up with
[TABLE]
We shall test certain sums for convergence by comparison with the classical hypergeometric series
[TABLE]
For generic choice of parameters the power series (4.17) is known to have radius of convergence if and infinite radius of convergence if [65, §2.2]. The exceptions to these rules are when the series has only a finite number of terms and reduces to a polynomial in . This can happen when one of the parameters is a negative integer.
4.2 The main contribution
We start by analysing the first sum on the right-hand side of (4.2).
Proposition 4.5**.**
Let be fixed quantities. Then
[TABLE]
where the implied constant may depend on and but is uniform for in compact subsets of .
Making use of the representation
[TABLE]
we can write, using Corollary 4.3 for the ratios of gamma functions,
[TABLE]
where . This leads to
[TABLE]
We adopt here a convenient shorthand . Recalling the actions (2.3), (2.4) of the operators , on Jack polynomials,
[TABLE]
Substituting (4.22) for the numerator in (4.18) leads via cancellation of the factors to the sum on the right-hand side of (4.18). To prove the error estimate it will be sufficient to demonstrate that
[TABLE]
uniformly for in compact subsets of . We do this following a method elaborated by Kaneko [57]. Namely: there exists a constant depending only on such that
[TABLE]
where and [57, Lemma 1]. Thus there exists a constant depending only on such that . We also may observe that
[TABLE]
So
[TABLE]
using the definition (2.11) of the generalised Pochhammer symbol together with (4.25). It can be seen that (4.26) is bounded by comparing each factor to a convergent hypergeometric series.
4.3 Bounding the tail terms
Using Kaneko’s bound for Jack polynomials from the end of subsubsection 4.2 we get
[TABLE]
for some constant depending only on and depending only on and . If is a partition with then, as is the largest part, we must have . Factorising the generalised Pochhammer symbols according to (2.11) we achieve the inequality
[TABLE]
Proposition 4.6**.**
For every and compact set , there exists a constant (which may additionally depend on ) such that for every , and all sufficiently large, we have
[TABLE]
Proof.XFor brevity, let us define , , . The following three estimates give us the bound we need:
Using part 2. of Lemma 4.4,
[TABLE]
provided we additionally choose large enough so that , and the absolute convergence of a hypergeometric series in the unit disc. 2. 2.
Using part 1. of Lemma 4.4,
[TABLE]
by comparison with a hypergeometric series. 3. 3.
For the sum over between and we again use part 1. of Lemma 4.4, as in the previous step leading to
[TABLE]
where the series has been compared to a hypergeometric series.
We use 1. and 2. to bound each factor for in (4.28) by a constant. We then use 1. and 3. to deduce the rapid decay in of the remaining sum over .
With essentially the same method and calculations we can bound similarly the tail of two further series.
Proposition 4.7**.**
For every and compact set , there exists a constant (which may additionally depend on ) such that for every , and all sufficiently large, we have
[TABLE]
and
[TABLE]
Proof.XUsing the fact that is an eigenfunction of and with eigenvalues that depend only polynomially on the parts of , we may follow the proof of the preceding Proposition 4.6 making only trivial changes.
4.4 Asymptotic formula
Proposition 4.8**.**
For fixed , we have
[TABLE]
where the implied constant may depend on but is uniform for in compact subsets of .
Proof.XStarting from (4.2) and as a consequence of the bound (4.29) of Proposition 4.6, we have
[TABLE]
for any . By Proposition 4.5 this is
[TABLE]
By Proposition 4.7 we can complete the sums in (4.37) without affecting the error estimate to get
[TABLE]
This is (4.35), recognising
[TABLE]
Given that is continuous, this already proves
[TABLE]
locally uniformly in —the leading-order of Theorem 4.1.
Our final task in this section will be to put the “” term of (4.35) into a nicer form.
4.5 Partial Differential Equation satisfied by
It is a Theorem of Yan [62, Theorem 2.1] and Kaneko [57, Theorem 2] that if is not a negative integer for any then the unique solution to system of equations
[TABLE]
, subject to being symmetric in its variables and analytic at , is
[TABLE]
This result for was first proved by Muirhead [66], having been conjectured, apparently, by A. G. Constantine. Muirhead also shows how the system (4.41) can be degenerated to give the holonomic system of equations for multivariate hypergeometric functions, which can easily be generalised for arbitrary as follows.
Proposition 4.9**.**
Provided that is not a negative integer for any , the multivariate hypergeometric function is the unique solution of the differential equations
[TABLE]
, subject to the constraints that is symmetric in its variables, is analytic at and satisfies .
Proof.XSince we now know (4.40) that
[TABLE]
we set and make the change of variables in (4.41) to get
[TABLE]
Dividing through by and letting we recover (4.43).
Corollary 4.10**.**
Under the same condition on as in Proposition 4.9, the function is a solution to the partial differential equation
[TABLE]
where .
Proof.XWe multiply through the th equation (4.43), satisfied by , by :
[TABLE]
Since
[TABLE]
(4.47) is equivalent to
[TABLE]
Summing over (4.49) for , we arrive at
[TABLE]
Re-arranged, this is (4.46).
Proof of Theorem 4.1.X We use Corollary 4.10 to replace the term in (4.35) by
[TABLE]
The resultant cancellation of terms involving leads to (4.1).
5 Main Results
5.1 Proof of Theorem 1.1
Looking-back to (3.3) we had
[TABLE]
Standard asymptotic arguments give
[TABLE]
uniformly for in compact sets.
In the result of Theorem 4.1, taking to be a constant multiple of we have
[TABLE]
which may be applied in (5.1) with , , , and (so that is not a negative integer for , justifying the use of Proposition 4.9), to give
[TABLE]
Combining with (5.2) we get
[TABLE]
Setting in (3.15) of Corollary 3.4
[TABLE]
where was defined in (3.10). A further application of (the leading order of) Theorem 4.1 and equation (5.2), and Lemma 3.5 for the asymptotics of , brings (5.6) to
[TABLE]
We use (5.7) to remove the derivative term from (5.5), giving
[TABLE]
This completes the proof.
That the first-order correction term in (5.8) is proportional to the derivative of the leading term implies that the finite- behaviour can be interpreted, up to an error of order as a correction to the width: if we let
[TABLE]
then we may interpret Theorem 1.1 as saying
[TABLE]
By Taylor’s theorem, this is equivalent to the re-centring
[TABLE]
5.2 Connection with Jacobi polynomials
We now prove the formula (3.7) from Corollary 3.2 giving a formula for the distribution of the smallest JE eigenvalue in terms of multi-variable Jacobi polynomials.
Proof of Corollary 3.2.X We have for
[TABLE]
This is essentially Théorème 5 of [56], incorporating our different choice of normalisation of the Jacobi polynomials. It may be proved by observing that both sides of (5.12) are multivariate symmetric polynomials that satisfy the same partial differential equation (see [62] or [57] for the PDE satisfied by ) and that the leading term on both sides is proportional to with identical constant.
Comparing to (3.6) we find the appropriate parameters for the multivariate Jacobi polynomial are
[TABLE]
From (3.6) and the identity (5.12) with parameters as above we have
[TABLE]
Taking the limit we need to have and so
[TABLE]
Some of the quantities in (5.15) simplify: we have, starting from (2.11),
[TABLE]
and
[TABLE]
so that
[TABLE]
In (5.15) the ratio is of the form (5.18) and the factor is of the form (5.17). Thus we get
[TABLE]
5.3 Numerical simulations
In order to visualise our results better, we present in this subsection the results of some numerical simulations, and compare them against our theoretical predictions.
In Figures 1 and 2 we present numerical results for two different values of : (Figure 1) and (Figure 2). In both cases we plot the empirical distribution function for 1 000 samples of the scaled smallest eigenvalue of JE random matrices for and respectively, together with the same calculation at . We observe that for the smaller value of , the empirical distribution fits well to the two-term asymptotic prediction (1.10) of Theorem 1.1. At , the data follows the leading order term of (1.10) (which is the formula (1.9) derived by Borodin and Forrester [4]), the order corrections being negligble at such matrix size.
The implementation details differ somewhat between Figures 1 and 2. To construct random matrices from the Jacobi Orthogonal Ensemble () to generate the data for Figure 1, we used the double-Wishart matrix construction starting from independent and matrices as described in the Introduction, which results in samples from the JOE. For the theoretical prediction with we were able to use the simpler formula (6.54) from Section 6.3 below, rather than (1.10).
To generate the data for Figure 2 there is no double-Wishart construction available, so we instead implemented the algorithm of Killip and Nenciu [14] which can generate samples from the JE for arbitrary . For calculating the values of the multivariate hypergeometric functions needed for the theoretical prediction (1.10) we used the numerical routines of Koev and Edelman [61].
6 Explicit formulæ
In certain situations we are able to derive expressions for and asymptotics for that are more explicit, and these are expounded in the present Section.
To begin with we focus on the case corresponding to the Jacobi Unitary Ensemble. In this case we benefit from the fact that the multi-variable Jacobi polynomials enjoy a determinantal structure. In Section 6.3 we record some formulæ (for arbitrary ) for the special cases and .
6.1 Determinantal identities
In order to state the determinantal identities let us denote by the th monic Jacobi polynomial orthogonal with respect to the measure on the interval . In terms of the definition of Jacobi polynomials given by Szegő [67, Ch. IV], our definition satisfies
[TABLE]
As a hypergeometric function,
[TABLE]
We also need the hook-length for a partition, defined by
[TABLE]
with the number of non-zero parts.
If the following Lemma gives alternative expresions for the multivariate Jacobi polynomials.
Lemma 6.1**.**
Let . Then
[TABLE]
where is the hook-length of the partition and is the Vandermonde determinant (1.3). If, furthermore , , then we have
[TABLE]
Proof.XThat the multivariate Jacobi polynomial at has a determinant evaluation in terms of univariate Jacobi polynomials is known since [56, Théorème 10]:
[TABLE]
Lasalle uses a different version of the Jacobi polynomials to us which changes the numerical value of the constant in (6.6). We can fix the constant by observing that since our Jacobi polynomials are monic,
[TABLE]
where is a Schur polynomial. The Jack polynomials at are proportional to Schur polynomials [48], and in fact
[TABLE]
The implication is
[TABLE]
In our applications is a scalar multiple of . To take the confluent limit , we use the formula
[TABLE]
proved in [68, Lemma A.1], where denotes the Wronskian
[TABLE]
and the functions must be regular at . (A version of (6.10) valid for polynomials was proved in [69, Theorem 1], which would suffice to handle (6.4), but later we will have reason to apply (6.10) to non-polynomial functions.) In the application we presently have in mind, is the Jacobi polynomial and using the fact that
[TABLE]
which extends to
[TABLE]
we have
[TABLE]
Combining (6.4), (6.10) and (6.14) we get (6.5).
Corollary 6.2**.**
For , and fixed such that is not a negative integer for , we have
[TABLE]
where is the -Bessel function. If , we have a further formula:
[TABLE]
A special case (with ) of (6.16) was given in [70], where it was proved by using the relationships with Painlevé functions [71]. We further remark that (6.15) could be proved in a less direct way through the use of tau functions of hypergeometric type [72].
Proof.XWe know that if is not a negative integer for any then
[TABLE]
uniformly for in compact subsets of . From (5.12)
[TABLE]
Setting and applying (6.4),
[TABLE]
It is problematic to take the limit here directly. The determinant of Jacobi polynomials tends to [math] as but to find the rate and leading-term some further manipulations are necessary. These involve repeated use of the contiguous identity
[TABLE]
to add successively to each row a multiple of the row below, in a recursive fashion, to get that
[TABLE]
and we have
[TABLE]
We return to the hypergeometric function representation of Jacobi polynomials (6.2),
[TABLE]
and observe that
[TABLE]
so
[TABLE]
Using (4.3) and the fact that
[TABLE]
we have the asymptotic behaviour
[TABLE]
as . Putting this into the determinant from (6.22),
[TABLE]
as .
We already know (equation (5.16)) that
[TABLE]
and we have
[TABLE]
so
[TABLE]
Putting (6.29) and (6.31) together into the right-hand side of (6.19) we find that all the factors of cancel, and we recover the limit, which yields
[TABLE]
We can derive an expression involving the more familar Bessel functions by means of the identity [73, §7.8, eq. (1)]
[TABLE]
Inserting this into (6.32), we have
[TABLE]
which is equivalent to (6.15).
To take the confluent limit , we prefer to work with (6.32). Using, for a second time, identity (6.10),
[TABLE]
Since
[TABLE]
we have
[TABLE]
and
[TABLE]
Putting this into (6.35),
[TABLE]
reversing the order of the rows in the determinant. Using (6.33) this becomes
[TABLE]
As a final step we use the identity to reduce this to (6.16).
6.2 Smallest eigenvalue of the Jacobi Unitary Ensemble
Proposition 6.3**.**
Let be the probability distribution function of the smallest eigenvalue of the Jacobi -Ensemble with , and . For ,
[TABLE]
As we have, for ,
[TABLE]
where the determinants in (6.42) are of size .
Proof.XIn view of (3.7) we need to specialise (6.5) to the rectangular partition , to get
[TABLE]
We have already seen (equation (6.30)) that
[TABLE]
and, after cancellation, (6.43) becomes
[TABLE]
In the determinant in (6.45) there is a factor multiplying the th row. If we extract these factors, they cancel the factorials in the denominator. Finally, we reverse the order of the rows producing a factor and the end result
[TABLE]
So, if ,
[TABLE]
where, from (5.19),
[TABLE]
Note that since and too we get a simplified formula
[TABLE]
Equations (6.47) and (6.49) yield (6.41).
We now turn to the two-term asymptotic formula (6.42). With in (1.10),
[TABLE]
Using the representation (6.16) for the multi-variable hypergeometric functions this becomes
[TABLE]
and, upon cancellation of the gamma function factors,
[TABLE]
The formula (6.52) proves a conjecture made in [3]. (The result is also implicit in the recent work [41].) It seems likely that explicit formulæ along the lines of (6.52) will also be available in the other privileged cases and . The details will appear elsewhere.
6.3 Small values of
If , then most of the analysis above is quite unnecessary and we already recover from (3.5)
[TABLE]
recognising the value of the Selberg integral (or observing that both sides must be unity as ). So, for and ,
[TABLE]
referring-back to (5.2). This generalises, to arbitrary , Corollary 1 of [3].
If then the multivariate hypergeometric funtions of arguments become ordinary one-variable hypergeometric functions. In this case, for ,
[TABLE]
by (3.3). Using Corollary 3.2, and the fact that multi-variable Jacobi polynomials of a single variable coincide with classical single-variable ones, this may be expressed further as
[TABLE]
We apply Theorem 4.1 with to (6.55) to find that
[TABLE]
using (6.36) for the derivative of the hypergeometric function. We may further use (6.33) to replace hypergeometric functions with Bessel functions, yielding
[TABLE]
We note that
[TABLE]
an application of the Bessel function identity [73, §7.11, eq. (23)]
[TABLE]
Putting (6.59) into (6.58) gives
[TABLE]
This result is consistent with (6.52) when .
Acknowledgements
The author wishes to acknowledge helpful conversations about this work with J. P. Keating and D. Savin.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Selberg (1944) “Bemerkninger om et multipelt integral,” Norsk Mat. Tidsskr. 26 , pp. 71–78.
- 2[2] K. W. Wachter (1980) “The limiting empirical measure of multiple discriminant ratios,” Ann. Statist. 8 , pp. 937–957.
- 3[3] L. Moreno-Pozas, D. Morales-Jimenez, and M. R. Mc Kay (2019) “Extreme eigenvalue distributions of Jacobi ensembles: New exact representations, asymptotics and finite size corrections,” Nuclear Phys. B 947 , art. no. 114724.
- 4[4] A. Borodin and P. J. Forrester (2003) “Increasing subsequences and the hard-to-soft edge transition in matrix ensembles,” J. Phys. A 36 , pp. 2963–2981.
- 5[5] R. A. Fisher (1939) “The sampling distribution of some statistics obtained from non-linear equations,” Ann. Eugenics 9 , pp. 238–249.
- 6[6] P. L. Hsu (1939) “On the distribution of roots of certain determinantal equations,” Ann. Eugenics 9 , pp. 250–258.
- 7[7] S. N. Roy (1939) “ 𝒑 𝒑 p -Statistics and some generalizations in analysis of variance appropriate to multivariate problems,” Sankhyā 4 , pp. 381–396.
- 8[8] M. A. Girshick (1939) “On the sampling theory of roots of determinantal equations,” Ann. Math. Statistics 10 , pp. 203–224.
