Fluctuations of the overlap at low temperature in the 2-spin spherical SK model
Benjamin Landon, Philippe Sosoe

TL;DR
This paper analyzes the low-temperature fluctuations of the overlap in the 2-spin spherical SK model, revealing they are governed by eigenvalues of GOE matrices and follow the Airy1 distribution.
Contribution
It provides an explicit characterization of the overlap fluctuations in the low-temperature phase using random matrix theory, connecting spin glass behavior to GOE eigenvalues.
Findings
Overlap fluctuations are of order N^{-1/3}.
Fluctuations are described by eigenvalues of GOE matrices.
Limiting distribution is given by the Airy1 random point field.
Abstract
We describe the fluctuations of the overlap between two replicas in the 2-spin spherical SK model about its limiting value in the low temperature phase. We show that the fluctuations are of order and are given by a simple, explicit function of the eigenvalues of a matrix from the Gaussian Orthogonal Ensemble. We show that this quantity converges and describe its limiting distribution in terms of the Airy1random point field (i.e., the joint limit of the extremal eigenvalues of the GOE) from random matrix theory.
| Fluctuations of the overlap at low temperature |
| in the 2-spin spherical SK model |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Deformed GOE
1 Introduction
The 2-spin, spherical Sherrington-Kirkpatrick (SSK) model with zero magnetic field is defined by the random Hamiltonian
[TABLE]
Here is a function of , and the coefficients are iid standard normal random variables. This model was introduced in [16], by analogy with the standard SK model where the are Ising spins taking values in the hypercube [24, 20]. The interest of spherical spin glass models lies in the availability of more explicit computations due to the continuous nature of the state space for the spins. See for example the papers [27, 26, 28] by E. Subag for important work which takes advantage of the continuous geometry of spherical spin glasses.
The partition function of the SSK model is given by
[TABLE]
Here is a parameter corresponding to the inverse temperature. The limit of the free energy
[TABLE]
was determined in [10], and a rigorous justification appeared in [29]. The model exhibits a phase transition at , in the sense that the limit
[TABLE]
fails to be analytic in at this value.
In [3], J. Baik and J.O. Lee use a contour integral representation for the partition function which had previously appeared in [16] to compute the asymptotic fluctuations of around . They show that in the high temperature phase , the quantity
[TABLE]
converges to a normal random variable. This is the analogue for the SSK model of the classical central limit theorem for the SK model with Ising spins of M. Aizenman, D. Ruelle, and J. Lebowitz [1].
In the low temperature phase of the SSK, Baik and Lee proved that the free energy has asymptotic fluctuations given by the Tracy-Widom distribution associated to the Gaussian Orthogonal Ensemble (GOE):
[TABLE]
The convergence is in distribution as and denotes the asymptotic distribution of the top eigenvalue of the real symmetric GOE matrix [30]. The analog of the low temperature result (1.4) for the classical SK model seems out of reach of current methods. Moreover, we are not aware of a prediction concerning the limiting distribution of the free energy of the SK model in the low temperature phase.
In a parallel development, a model related to (1.1) and (1.2) has appeared in the context of high-dimensional statistics. Onatski, Moreira and Hallin [19] obtained an analog of the high-temperature CLT in the case that the random variables are associated with a Wishart ensemble (as opposed to the case under consideration where they are naturally associated with a symmetric matrix of normal random variables). In this context, the Gaussian fluctuations have implications for the asymptotic power of statistical tests in detecting the presence of an unknown signal in an otherwise isotropically distributed dataset. Here, the high temperature regime corresponds to the regime of low signal-to-noise ratio.
In addition to the intrinsic interest of computing the fluctuations of , the method in [3] offers a satisfying interpretation of the phase transition in the SSK in terms of random matrix theory. The argument in [3] reveals that in the the high temperature phase is dominated by linear statistics of eigenvalues of the matrix
[TABLE]
Linear statistics are quantities of the form
[TABLE]
where is a regular function, and denotes the semicircle distribution, defined below. This latter quantity is the asymptotic density of states of the GOE eigenvalues, and is known as Wigner’s semicircle law [31]. The Gaussian behavior (1.3) then follows from the well-known random matrix fact that the asymptotic fluctuations of (1.5) are Gaussian [23]. In the low temperature phase, instead depends to leading order only on the first eigenvalue . Thus the phase transition corresponds to a transition from a regime where all eigenvalues contribute to the limiting behavior, to one where only the leading eigenvalue does. Baik and Lee have applied their method to a number of variants of the SSK [3, 4, 5, 6], including the bipartite SSK and models incorporating a Curie-Weiss type interaction in addition to the spin glass couplings.
The phase transition in the classical and spherical SK model, and more generally in -spin models [20], can also be detected in the terms of the behavior of overlaps. To define these, we introduce the Gibbs measure defined by the expectation
[TABLE]
Let be two independent samples (“replicas”) from the Gibbs measure (1.6). The overlap between and is the normalized inner product:
[TABLE]
It is known that for , tends to zero as , while in the low temperature phase , it concentrates around the constant values [21]. In [18], V.-L. Nguyen and the second author used a contour integral representation related to that to that in [4] to show that has Gaussian fluctuations for temperatures corresponding to , for any .
In the present work, we describe the annealed fluctuations of the Gibbs expectation about the limiting value in the low temperature regime. More precisely, Theorem 2.1 below, provides an expansion for the overlap around these values down to order in terms of explicit quantities related to the Gaussian Orthogonal Ensemble from random matrix theory. The expansion we derive does not seem to appear in the physics literature, but we were informed by J. Baik that predictions close to the results we find were obtained using physics methods by Baik, Le Doussal and Wu [2].
The expansion of Theorem 2.1 states that the leading order to the contribution of the fluctuations of the overlap around its mean is given by the quantity
[TABLE]
where the are the eigenvalues of arranged in decreasing order. In our second main result, Theorem 2.2, we prove that (when renormalized by ) the quantity (1.7) converges and moreover describe its limit in terms of the Airy1 random point field. This latter point process arises in random matrix theory as the limits of the largest eigenvalues of the GOE.
2 Main results
We express the asymptotic distribution of the overlap between two replicas in terms of the eigenvalue distribution generated by the Gaussian Orthogonal Ensemble (GOE). To understand the connection between the GOE and our problem, define the symmetric random matrix defined by
[TABLE]
where the are the random variables appearing in the definition of the Hamiltonian (1.1). The distribution of is that of a normalized GOE (Gaussian Orthogonal Ensemble) matrix with the diagonal set to zero. We denote the ordered eigenvalues of by
[TABLE]
Next, note that the Hamiltonian equals
[TABLE]
For two vectors we define the overlap as the normalized inner product of and :
[TABLE]
where , are the components of . For a bounded, measurable function
[TABLE]
we denote the Gibbs expectation of by
[TABLE]
As a consequence of the representation (2.2), we derive the following integral formula for the Gibbs expectation in Section 4:
[TABLE]
where the integrals are over a vertical line in the complex plane to the right of all the and
[TABLE]
Our main result provides an expansion of the overlap in terms of , up to an error of size :
Theorem 2.1**.**
Let with . For any and large enough, there is an event such that
[TABLE]
on which the following estimate holds:
[TABLE]
Theorem 2.1 is a consequence of Theorem 5.9 below. The event is defined in Definition 5.3 below; it is a high probability event on which certain a-priori estimates on the eigenvalue locations hold (the rigidity and level repulsion estimates) - these are introduced in the next section.
Define
[TABLE]
Note that these quantities appear on the right side of equation (2.5). The exponent is assocaited to level repulsion, in that on (by definition of ). On the event the magnitude of , will be seen to be at most
[TABLE]
Theorem 2.1 thus identifies the overlap down to a term of order .
It is interesting to study the behavior of the leading order contribution to the fluctuations of which is the term
[TABLE]
in the context of the work [22] of Panchenko and Talagrand. They obtained an exponential estimate for the probability that but noted that the event that could not be ruled out at the level of large deviations.
Due to the rigidity estimates of random matrix theory which are reviewed in the next section, the quantity has a light upper tail; for example the probability that it exceeds goes to [math] superpolynomially, for any fixed . On the other hand, the asymptotic density of the random variable is expected to behave like near [math] and so (recall that the eigenvalues are ordered and so this is a positive quantity) has a relatively heavy lower tail. Due to the somewhat large probability of the event of Theorem 2.1, we do not attempt to make this comparison to the work [22] rigorous, settling for pointing out the heuristic agreement of our error term with the behavior observed in [22].
As we have noted above, the matrix is closely related to the GOE. The GOE is the matrix ensemble with entries
[TABLE]
and all the non-identical random variables are independent. We keep the dependence of on implicit. In Appendix A we show that the largest eigenvalues of agree with those of if we take for , up to errors of order for any . Alternatively, it is possible to appeal to the literature on edge universality in random matrix theory (e.g., [betaedge]), however the precise statement we require does not quite appear there. Instead, we have opted to carry out the calculations in Appendix A which are a relatively straightforward application of the resolvent method. The result derived here is in fact stronger than what could be deduced from the universality literature and may be of other application.
Related to this, we remark that the SSK Hamiltonian is for the most part defined as in (1.1), where we have excluded from the summation the diagonal terms . Of course in the usual SK model, whether or not the diagonal is included makes no difference since ; in the SSK model, including the diagonal would result in above. This would simplify our analysis somewhat as we could omit the calculations in Appendix A which compare the eigenvalues of directly to those of . To maintain consistency with the physics literature [16, crisanti] we have excluded the diagonal from the sum.
As can be seen by the expansion (2.5), the main contribution to the fluctuations of the overlap about its mean is from the extremal eigenvalues of . For any finite , the largest eigenvalues of , are known to converge in distribution, after a rescaling, to the first particles of the Airy1 random point field; we denote this latter quantity by . We will give a more precise defintion in Section 6 below. Due to our estimates proven in the appendix, the same joint convergence holds also for the largest eigenvalues of .
A natural conjecture is then that the rescaled fluctutations of the overlap converge in distribution to the random variable given by
[TABLE]
In Theorem 6.1, we show that this limit exists almost surely, and so is a well-defined random variable. The deterministic correction on the RHS of (2.10) represents the leading order term in the density of states of the Airy1 random point field. The expected location of the th particle of the Airy1 random point field is roughly and so neither the sum or the deterministic correction converge as .
Our main result on the limiting distribution of the fluctuations of the overlap is the following.
Theorem 2.2**.**
Let be the random variable in (2.10). We have the following convergence in distribution for :
[TABLE]
In Theorem 2.1 we introduced the square in order to study the overlap, due to the symmetry of the overlap distribution with respect to the Gibbs measure (i.e., ). An alternative would be to study ; if we knew that concentrated about on the scale then this of course could be deduced from Theorem 2.1. We prove the concentration by calculating the fourth moment ; this is the content of the following theorem which is proven in Section 7.
Theorem 2.3**.**
On the event of Theorem 2.1 we have,
[TABLE]
and furthermore on the event , the first two terms are . As a consequence,
[TABLE]
and so we have the convergence in distribution of
[TABLE]
where is as above.
We discuss the relation of our results to the forthcoming work of Baik, Le Doussal and Wu [2]. They predict that the fluctuations of should be governed by
[TABLE]
where the are independent standard normal random variables (in particular independent of the ). The Gibbs average corresponds to taking the expectation over the . It is a simple calculation to integrate out the and find quantities agreeing with the leading order contribution in (2.5) and (2.12).
2.1 Outline of the paper
In Section 3, we state some basic results on random matrix theory which we will use to control the eigenvalues of the matrix . In Section 4, we obtain the representation (2.3), along the lines of similar representations in [4], [18].
Most of the work is completed in Section 5, where we analyze the representation 2.3 by the method of steepest descent. As was already noticed in [4], in the case of interest here, the analysis is complicated by proximity of the saddle point to the branch point of the complex phase function in (4.2). Moreover, because the representation (2.3) involves a ratio of saddle point integrals, we must evaluate these to greater precision than was done in [4] and the subsequent papers. The key idea is to separate the contribution from to and use this to find the approximate steepest descent contours (see Lemma 5.1).
In Section 6, we show that the term appearing in (2.5) of order , in the sense that converges in distribution. This involves some establishing some preliminary estimates for the GOE as well as the Airy1 random point field, which we could not locate in previous literature. We deduce this using corresponding results for the GUE and Airy2 random point field proven by Gustavsson and Soshnikov [25, 14], respectively, and the Forrester-Rains coupling [12] between the GUE and GOE.
3 Random matrix results
In this section, we summarize the results from random matrix theory we use in the rest of the paper. A central role is played by the resolvent matrix
[TABLE]
where is the GOE matrix in (2.9), and is the matrix ensemble given by (2.1). The spectral parameter is is commonly denoted with and . In the recent literature, the resolvent has customarily been denoted by , a notation we reserve for the quantity (4.3) in this paper. We also introduce the Stieltjes transform of the empirical eigenvalue distribution:
[TABLE]
and similarly for . The classical semi-circle law is then equivalent to the approximation for fixed ,
[TABLE]
where the semi-circle law and its Stieltjes transform are
[TABLE]
We now state the local semi-circle law as it appears in [7, Theorem 2.6]. First, we introduce the notion of overwhelming probability.
Definition 3.1**.**
We say that an event or family of events hold with overwhelming probability if for all we have for large enough.
Theorem 3.2** (Local semi-circle law).**
Define the spectral domain by
[TABLE]
For any and all sufficiently large, we have for both and that the estimates,
[TABLE]
and for both and ,
[TABLE]
hold uniformly in with overwhelming probability.
A consequence of the semi-circle we will use several times is that the eigenvalues are close to the corresponding quantiles of the semi-circle distribution. These quantiles are known as the classical locations of the eigenvalues in random matrix theory:
[TABLE]
Theorem 3.3** (Eigenvalue rigidity).**
For each , we have that the estimates
[TABLE]
hold uniformly in with overwhelming probability, for the eigenvalues of or .
We will also need some finer information concerning level repulsion. The next result shows that, up to an error, the distribution of the spacing between and has a density on scale . While this will be sufficient for our purposes, one instead expects that there is level repulsion, i.e. on the right side of (3.3) should be replaced by . This has been established in great generality for the spacings where in [8, Theorem 3.7], but has not been proven for the eigenvalues at the edge.
The following result could be deduced from Remark 1.5 of [15]. A complete proof was not given in that work and relies on asymptotics of the Hermite polynomials. For the sake of completeness, we will give a different proof which relies only on the eigenvalue rigidity and the loop equations.
Lemma 3.4** (Existence of spacing density).**
Let . There is a constant such that for ,
[TABLE]
where the are the eigenvalues of or .
**Remark. **Inspecting the proof we see that the restriction enters only in proving the estimate for - i.e., it holds for all for the eigenvalues of .
Proof. By Proposition A.1 it suffices to prove the estimate for . We begin with the obvious estimate:
[TABLE]
As a consequence of Theorem 3.3 (see, for example [3, Eqn (6.3)]), we have with overwhelming probability,
[TABLE]
Combining this with the estimate for some , which is a consequence of Section 5 of [17] we obtain the same inequality in expectation,
[TABLE]
Then using Markov’s inequality we have,
[TABLE]
By [13, Lemma 3.7], we have
[TABLE]
where in the last step we used again Theorem 3.3. The result follows. ∎
4 Representation for the overlap
In this section, we derive a contour integral representation for Gibbs expectation . Throughout this section, we will denote the eigenvalues of by
[TABLE]
for notational simplicity. We now prove the following lemma.
Lemma 4.1**.**
The quantity is given by
[TABLE]
where
[TABLE]
for any so that .
**Remark. **Note that up to constants the quantity appearing in the denominator of (4.2) is the partition function and is the representation used by Baik and Lee [3].
Proof. Our starting point is the definition,
[TABLE]
Baik and Lee give the representation:
[TABLE]
Let be the unit sphere in . Let be the surface area measure on , so that is the uniform measure on . By a change of variables, we obtain
[TABLE]
Let . In order to compute the above integral, we consider
[TABLE]
We use polar coordinates, substituting and with and . We then set , to find that
[TABLE]
where
[TABLE]
On the other hand, direct integration shows that the function is given by
[TABLE]
Taking the inverse Laplace transform, we obtain
[TABLE]
where is any real number satisfying . Recalling that
[TABLE]
and letting , we obtain:
[TABLE]
Combining (4.12) and (4.5), we obtain (4.2). ∎
5 Steepest descent analysis
We proceed to the asymptotic evaluation of the integrals in (4.2). As in the previous section, we will continue to denote the eigenvalues of by
[TABLE]
As was already noticed in [3], in the low temperature regime, the dominant contribution to the integrals comes from an neighborhood of the saddle point which is itself distance from the largest eigenvalue of . The prescence of the branch point due to close to the saddle makes a steepest descent analysis via a direct expansion of the function untenable.
Compared to the computation in [3] and subsequent works, we must evaluate the numerator and denominator in (4.2) with greater precision. For the main result of [3], for example, it was sufficient to show that
[TABLE]
where satisfies for any . The contribution of to the free energy is then , automatically of lower order than the dominant Tracy-Widom fluctuations which are of size .
In order to evaluate the overlap, it is necessary to determine the leading order term of , not only up to multiplicative terms. Additionally, our computation involves a more precise localization of than .
We now give an overview of the saddle point analysis. The saddle point of the function is distance of order from Instead of working directly with the steepest descent contours of the function , we will consider the dominant contribution near the saddle which is, up to additive constants,
[TABLE]
This function is much simpler than , as it involves only the eigenvalue . The contributions from other eigenvalues are replaced by their deterministic leading order term using Theorem 3.3. The additional key input here is Lemma 3.4 which ensures that the eigenvalues are an order of magnitude further from than the distance between the saddle and . This allows for the localization of the function near its saddle, despite the prescence of the branch point due to the logarithmic singularity at .
The saddle point of the function (5.2) is clearly,
[TABLE]
where
[TABLE]
The advantage afforded by working with the approximation (5.2) is that the behavior of the steepest descent contours of this function are relatively explicit. For the contours, we make the ansatz , for . Setting the imaginary part of (5.2) to zero gives a parametrization of the approximate steepest descent contour we will use. We determine properties of this parameterization in Lemma 5.1. In Lemma 5.4 we analyze the behavior of along this approximate steepest descent contour.
Lemma 5.1**.**
For , let be the standard determination of the argument. For the equation
[TABLE]
has a unique strictly positive solution which we denote . Furthermore, there is a constant so that if ,
[TABLE]
For any there is a depending on so that if , we have
[TABLE]
Before proceeding to the proof of Lemma 5.1, we note that if is analytic with real and imaginary parts denoted by
[TABLE]
then, with , the Cauchy-Riemann equations imply
[TABLE]
and so
[TABLE]
using the notation .
Proof. For uniqueness we note that if , then the left side of (5.5) is increasing, whereas the right side is decreasing. If we calculate the derivative of the right side,
[TABLE]
This is a decreasing function of , and so the right side of (5.5) is a concave function of . Its derivative at is strictly greater than , so we get the uniqueness as the left side is a linear function with slope . Differentiating the equation (5.5), we find using (5.10)
[TABLE]
Note that the second factor on the left is positive (for it is the difference of the slopes of the tangent lines of the functions on either side of (5.5) at the point ), so
[TABLE]
and the lower bound of (5.7) will follow once we establish (5.6). The upper bound is immediate.
Let denote the principal determination of the logarithm. Expanding this function in a power series around , we have for some and ,
[TABLE]
where is an analytic function in the disc , obeying the estimates
[TABLE]
These estimates follow from the fact that all the coefficients in the power series expansion of the logarithm are real. The imaginary part of is the argument function appearing on the right side of (5.5). Taking imaginary parts on both sides of (5.14) and using (5.5), we find (denoting for brevity)
[TABLE]
Dividing by gives,
[TABLE]
after possibly making smaller. Using this to estimate the higher order terms in (5.16) yields (5.6) after solving (5.16) for as a function of . ∎
We also require the following elementary lemma.
Lemma 5.2**.**
Let be as above. For any , there is a so that if then,
[TABLE]
Proof. The case is trival so we may assume . Recall that is the unique positive solution to . If , then and so for such , we have . We have,
[TABLE]
The denominator is bounded above, and the numerator is bounded below by for . It remains to consider the case where for fixed . Consider the function,
[TABLE]
This function has zeros at and and is strictly positive in between these points. We need to prove that there is a constant depending on so that . By direct calculation, it has a local maximum in between these two points at We see that for a and that for and constants depending on . Since it follows that for some new depending on , and then that for some . Since is bounded, we then see that for some . Since we see that for for some . This then implies that for some , which is what we needed to prove. ∎
We now define the event of Theorem 2.1.
Definition 5.3**.**
Let and . Let be the following event:
[TABLE]
where are the classical eigenvalue locations defined in (3.2).
As stated above, this is the event in the statement of Theorem 2.1. By Theorem 3.3 and Lemma 3.4 we have that
[TABLE]
for any and large enough. Fix a sufficiently small . In particular, we take
[TABLE]
Define the contours
[TABLE]
and
[TABLE]
The contour is a U-shaped contour symmetric along the negative real axis, and are vertical lines at the ends of going to .
Lemma 5.4**.**
Assume that holds with sufficiently small. The following estimates hold. There is a so that
[TABLE]
For any there are and so that the following holds for and large enough :
[TABLE]
Finally, there is the following estimate for and ,
[TABLE]
Proof. We calculate some derivatives of along the contours. In this proof, will be restricted to lie on the various contours and so we will generally denote . First, along ,
[TABLE]
where we used that is negative. We write,
[TABLE]
We need to estimate the second term. We write,
[TABLE]
From the level repulsion assumption and choice of ,
[TABLE]
for all , and so
[TABLE]
For the second term in (5.31), rigidity gives
[TABLE]
Now, since ,
[TABLE]
Finally,
[TABLE]
Therefore,
[TABLE]
As observed in the proof of Lemma 5.1, the first term on the RHS (in the brackets) is positive. If for a then by Lemma 5.2 we conclude that there is a depending on so that
[TABLE]
We have therefore proven that for ,
[TABLE]
and for ,
[TABLE]
The estimate (5.27) follows from the previous two estimates and integration. We consider of the form , in order to prove (5.26). We consider the behavior of as varies. We calculate,
[TABLE]
This is decreasing, so we immediately get – using (5.27) – the estimate (5.26) in the region . For smaller , note that
[TABLE]
Hence,
[TABLE]
Since , we get (5.26) for the rest of the possible values of , integrating from to a smaller , using (5.41) and the above estimate on the derivative.
Finally, we turn to (5.28). We have,
[TABLE]
For ,
[TABLE]
and for the rest of ,
[TABLE]
This yields (5.28) (recall for this estimate). ∎
Now introduce the contour
[TABLE]
where
[TABLE]
Recall that is a U-shaped contour symmetric about the negative real axis. The contour connects the ends of the U to the real axis.
The following lemma shows that we can deform the contours in (4.2) into .
Lemma 5.5**.**
Suppose the event holds with sufficiently small. Then the following estimates hold. There is a so that
[TABLE]
and
[TABLE]
Proof. By analyticity and the absolute convergence guaranteed by (5.28), all of the contours can be moved from the vertical lines appearing in (4.2) to the contour . It just remains to replace by . This replacement for (5.50) is immediate from (5.26) and (5.28). For (5.5) we also have to deal with cross terms (e.g, the integral over or in the variable times the integral over in the variable) and the extra terms in the integrand. Note that along all of the contours under consideration we always have
[TABLE]
Note furthermore that for , the estimate (5.27) gives
[TABLE]
and that the arc length of satisfies
[TABLE]
as is monotonic. These observations together with (5.28) and (5.26) yield (5.5). ∎
In the following lemma we Taylor expand (or at least all of the terms appearing in its definition except the one with ) around the saddle , arriving at a form of the integrands which we will be able to calculate. We first define a few functions which naturally appear in the Taylor expansion. Let,
[TABLE]
and
[TABLE]
Lemma 5.6**.**
The following holds on the event for sufficiently small . First, we have
[TABLE]
Second,
[TABLE]
Finally,
[TABLE]
Proof. We write
[TABLE]
We Taylor expand the second term. For ,
[TABLE]
From the fact that for we first conclude that
[TABLE]
where we used Lemma 5.4 (i.e, the corresponding estimate for and the above two equalities which relate this quantity to ). Then, for ,
[TABLE]
Note that
[TABLE]
and
[TABLE]
The first and second estimates of the lemma follows from (5.62) and the fact that . For the third estimate, one first uses (5.63) and (5.62) to arrive at an integral in terms of and the (quantity inside the absolute value) on the LHS of (5.63). The error is . The next replacement uses (5.64), and one arrives at the final estimate of the lemma. ∎
We now rescale and shift the contour of integration to lie along the real axis. Let be the following keyhole contour around the point , for :
[TABLE]
Lemma 5.7**.**
On the event we have, for sufficiently small , the following estimates for some .
[TABLE]
and
[TABLE]
Proof. First we make the substitution . As the integrand is analytic on we see that all of the contours may be shifted from to . The rest of may be added to the integral as for some , at only an error exponentially small in . ∎
We collect some explicit integrals in the next lemma.
Lemma 5.8**.**
Let , and be a keyhole contour around as above. Then,
[TABLE]
All of the above calculations can be done by considering the contributions from the integral along the real axis and the circle around as . In the cases where these contributions are diverging, one treats the circular integral by Taylor expansion (i.e., expanding the exponential around ), and integrates by parts the integral along the real axis. One finds that the diverging quantities cancel, and is left with a real integral which can be calculated explicitly.
We finally arrive at the following, from which Theorem 2.1 follows.
Theorem 5.9**.**
On the event we have with sufficiently small that,
[TABLE]
Proof. We first use Lemma 4.1 to arrive at the formula (4.2) for the overlap. The results of the present section are used to analyze the contour integrals appearing in the numerator and denominator of (4.2). From (5.5), (5.57), (5.58) and Lemma 5.7 we arrive at the following expression for the numerator:
[TABLE]
for as above. For the integral in the denominator we use (5.50), (5.56) and Lemma 5.7 to find,
[TABLE]
We now use Lemma 5.8 with and . For the numerator, the two terms of (5.75) equal
[TABLE]
whereas the denominator equals
[TABLE]
We get the claim from these two calculations as well as,
[TABLE]
∎
6 Existence of limit
In this section we consider the limit of the random variables
[TABLE]
as . The limit will be characterized in terms of the Airy1 random point field. Our convention is so that if are the largest eigenvalues of the GOE, then for every finite ,
[TABLE]
so that the ensemble has finitely many particles located on the negative real line. We will prove the following theorem.
Theorem 6.1**.**
Let denote the largest eigenvalues of the GOE, and the Airy1 random point field. The sequences of random variables
[TABLE]
converges in distribution to a random variable which is given by
[TABLE]
where the limit on the RHS of (6.4) exists almost surely.
**Remark. **We do not determine whether or not the distribution of is non-trivial.
6.1 Preliminary estimates
We will need an estimate for the variance of the number of eigenvalues of the GOE in an interval as well as the corresponding estimate for the Airy1 random point field. We will deduce these from the corresponding results for the GUE and Airy2 random point field and the coupling of Forrester and Rains between the GUE and the GOE [12].
Theorem 6.2** (Soshnikov [25]).**
Let be the particles of the Airy2 point process and let . We have the following estimates for some and any .
[TABLE]
and
[TABLE]
**Remark. **Soshnikov states the variance asymptotics (6.6) with the constant instead of . This appears to be due to a mistake in the computation of the quanity in [25, Lemma 5]. Note also that the factor is consistent with the variance asymptotics for the counting function in the GUE, in (6.9) below. We will use only that the variance grows logarithmically in .
Theorem 6.3** (Gustavsson [14]).**
Let be the eigenvalues of the GUE. Let . There is a so that the following holds. For any , we have
[TABLE]
and so for ,
[TABLE]
Furthermore, there is a so that if , then
[TABLE]
**Remark. **Gustavsson only claimed the result (6.9) in the case that as at any arbitrarily slow rate (his interest was in the case that the variance tends to infinity, a necessary condition for applying a theorem of Costin and Lebowitz [9]). Inspecting his proof yields the estimate (6.9) for fixed but large enough . ∎
We need also the following result of Forrester and Rains.
Theorem 6.4** (Forrester, Rains, [12]).**
Let and denote the set formed by the union of the eigenvalues of the GOE and GUE, respectively. Then,
[TABLE]
where the RHS is the set formed by the second largest, fourth largest, sixth largest, etc. elements of .
From the above results we deduce the following.
Proposition 6.5**.**
Let denote the eigenvalues of the GOE and let . There is a so that the following holds. For any , we have
[TABLE]
There is a so that if , then
[TABLE]
Proof. Let and be the eigenvalues of two independent GOE matrices of dimension and respectively. For the course of this proof, let and be the number of eigenvalues of these matrices at least . Let be same but for the eigenvalues of an independent GUE matrix. The coupling of Theorem 6.4 implies that there is a random variable and a bounded random variable so that,
[TABLE]
Using that and are independent, that is bounded, and the estimate (6.9) yields (6.12). Taking expectations we see that
[TABLE]
We now estimate the difference of the two quantities on the RHS. Given a GOE matrix of dimension , its minor formed by removing the first row and column is a GOE matrix of dimension multiplied by the prefactor . Additionally, the eigenvalues of the minor interlace the eigenvalues of . Hence,
[TABLE]
Let . Applying now (6.14) twice, with and , and taking the difference we see that (note that the difference with has the same sign as it is just the number of eigenvalues between and )
[TABLE]
where we used (6.8) in the last step. This yields (6.11). ∎
Lemma 6.6**.**
Let and be two independent Airy1 random point fields. Let be an Airy2 random point field. Let . Then,
[TABLE]
is equal to the probability that there are either or particles from the superimposed point process below .
Proof. Let be the scaled eigenvalues of a GUE matrix, and and the scaled eigenvalues from independent GOE matrices of dimension and (for the latter, do the scaling with and not by ). By the convergence in distribution of the th eigenvalue of the GOE/GUE to the th particle of the Airy1/Airy2 process we have,
[TABLE]
By Theorem 6.4,
[TABLE]
By the independence,
[TABLE]
Taking the limit , we see
[TABLE]
This yields the claim. ∎
Proposition 6.7**.**
Let be the Airy1 random point field. Then,
[TABLE]
and
[TABLE]
Proof. Let and be two independent Airy1 random point fields. Let be the random variable that is if there are or particles in the superposition below . Then by the previous lemma, has the same distribution as the number of particles in an Airy2 random point field below . If , then . The claim now follows. ∎
6.2 Proof of Theorem 6.1
Let us denote by the random variable,
[TABLE]
Let where is the constant above. We have by (6.12),
[TABLE]
as long as . Note that if solves
[TABLE]
and then by (6.11),
[TABLE]
Assume that . Choosing now
[TABLE]
we see by (6.11) that
[TABLE]
as long as . Therefore, we have that
[TABLE]
as long as . In particular, we see that there is a so that for all ,
[TABLE]
A similar argument gives
[TABLE]
for and . From all of these estimates we find,
[TABLE]
Denote by the event on the left side of (6.33). Let and choose constants and so that
[TABLE]
Let denote the intersection of these two events. Choose so that . Fix also
[TABLE]
By the choice of and the definition of we have for any that on the event ,
[TABLE]
For any we then have,
[TABLE]
The outcome of all of this is that there for any , there is a so that for any we have the estimate,
[TABLE]
By Theorem 3.3,
[TABLE]
with overwhelming probability. We write, for ,
[TABLE]
A calculation shows that the term on the last line is by our assumption on . Hence, for any bounded Lipschitz we see that for any , there is a so that for any fixed ,
[TABLE]
With denoting the particles of the Airy1 process, much of the same calculations as above show that
[TABLE]
for some , and
[TABLE]
Arguing as above, we see that for any there is an event with probability at least and a so that for all , we have
[TABLE]
From this we see that,
[TABLE]
almost surely which proves that the limiting random variable exists. Moreover, we see that for any there is a so that for all and any bounded Lipschitz function we have,
[TABLE]
On the other hand we know that for any finite ,
[TABLE]
where the convergence is in distribution as . This yields the claim. ∎
6.3 Proof of Theorem 2.2
Let be the GOE matrix given by (2.9) and be the matrix (2.1). We choose a coupling so that for . By Proposition A.1, Theorem 3.3 and Lemma 3.4 there is an event of probability at least on which,
[TABLE]
for some . The result follows from this and Theorems 2.1 and 6.1. ∎
7 Extension to
In this section we extend our results to the quantity . Our starting point is the following elementary calculation,
[TABLE]
For the second term we have,
[TABLE]
Hence, if we can show that with probability , then the convergence of Theorem 2.2 extends to .
We expand,
[TABLE]
We already calculated in Theorem 5.9 down to . It remains to calculate the first term . The modification of the representation formula is,
[TABLE]
Since the function appearing in the exponential is identical to what we encountered in considering , the steepest descent analysis of the numerator is very similar to that in Section 5. In light of this, we will use the same notation as in Section 5.
Following along the argument of Section 5 we see that, analogously to Lemma 5.5, we can change the contour from the vertical line through to at an error exponential in , for some , on the event for sufficiently small .
For the analog of Lemma 5.6 we similarly derive the following estimates which all hold on the event for sufficiently small. First,
[TABLE]
Second,
[TABLE]
Third,
[TABLE]
Finally,
[TABLE]
In order to calculate the numerator of (7.4) up to errors that are we see that it suffices to compute the integrals in (7.5) and (7.7). We can proceed identically to Lemma 5.7 and pass to the rescaled variable being integrated over up to again an error exponential in for some . The integral resulting from (7.7) is identical to (5.7), whereas the integral coming from (7.5) is
[TABLE]
In order to calculate this, we note the identities
[TABLE]
and
[TABLE]
From all of this, we see that we have derived the following estimate for the numerator of (7.4), with ,
[TABLE]
whereas for the demoninator we have
[TABLE]
From this and (5.79) we see that
[TABLE]
and furthermore that
[TABLE]
On the event the first two terms are .
Appendix A Zero-diagonal GOE
Let be a GOE matrix as in (2.9), and let be its diagonal and let
[TABLE]
so that is as in (2.1). In this section we prove that with overwhelming probability, the extremal eigenvalues of and are close.
Proposition A.1**.**
Let . The following estimate holds for with overwhelming probability:
[TABLE]
The proof follows from the Helffer-Sjoestrand formula, which we recall in (A.29), and the following lemma providing control over the difference of Stieltjes transforms.
Lemma A.2**.**
Denote by and the empirical Stieltjes transforms of and . Let and . With overwhelming probability, for any , and , we have
[TABLE]
The proof of the above lemma is based on the following resolvent expansion as well as two moment estimates which are the content of Lemmas A.3 and A.4 below. We have,
[TABLE]
Denote,
[TABLE]
Note that is independent of . We first prove,
Lemma A.3**.**
Let be a constant. On the event
[TABLE]
we have for even
[TABLE]
where denotes the expectation over .
**Remark. **This estimate is sub-optimal for but we will not need a better estimate. The next lemma below deals with the error term in the resolvent expansion. ∎
Before embarking on the proof, we record here the Ward identity,
[TABLE]
for any self-adjoint matrix . This is a consequence of the spectral theorem (see Section 3 of [7]).
Proof. Denote by and multi-indices in and variables respectively, with even. We will use the -indices to denote the summation coming from the trace, and the -indices to be the summations coming from matrix multiplication. Roughly, we are writing the trace as,
[TABLE]
With this convention we then have,
[TABLE]
where denotes either or as appropriate, and is a monomial which contains all of the Green’s function elements that has indices only in (we separate out the matrix elements of that have an index in ). The choice of or or the form of will not be important for the calculation done here. The can be grouped into partitions of the indices into coincidences. That is,
[TABLE]
where the first sum is over partitions on elements, and the second summation means the sum over all so that if whenever and are in the same block of and whenever and are in distinct blocks of the partition. The independence of the implies that unless the size of each block of the partion is at least , then the expectation vanishes. Denote by the set of such partitions. Estimating (using the assumption (A.6)) we see that from this discussion,
[TABLE]
From the Ward identity (A.8), for any index , we have
[TABLE]
and so
[TABLE]
The summation over for has at most terms, and has cardinality bounded in terms of only and , so we get the claim. ∎
Lemma A.4**.**
Let . On the event that we have for even that,
[TABLE]
where denotes the expectation over .
Proof. We expand out the expectation similar to the proof of the above lemma. We estimate , and and obtain,
[TABLE]
where is the following monomial in Green’s function elements,
[TABLE]
i.e., it is the product of except when for any . Note that we have dropped any Green’s function elements that involve the index or , and kept the ones involving only the indices. We will use the estimate,
[TABLE]
and so we need to count how many off-diagonal entries appear in when is in a specific partition . Suppose that has blocks. Recall that means that if and only if and are in the same block in the partition . Note that contains Green’s function entries. Denote the size of the th block of by . There can be at most Green’s function entries in the monomial with whose indices and both appearin the th block. Therefore, there are at least
[TABLE]
off-diagonal Green’s function entries in the monomial . Hence, for where has blocks,
[TABLE]
The summation over has less than terms, and so
[TABLE]
This is the claim. ∎
Proof of Lemma A.2. By the local semi-circle law, we have the estimates for ,
[TABLE]
and
[TABLE]
with overwhelming probability. Choose large enough so that . Then by Lemma A.4, the final term in the resolvent expansion (A.4) is less than with overwhelming probability, as it involves at most terms of the form and . Finally, the other terms in the resolvent expansion are bounded using Lemma A.3. ∎
In order to prove Proposition A.1, we will use the estimate on the Stieltjes transforms that we have just proved to find an estimate on traces of smoothed out indicator functions using the Helffer-Sjöstrand formula. This is the content of the following lemma.
Lemma A.5**.**
Let and be arbitrary. There is an event such that the following holds with overwhelming probability. Suppose that is a smooth function so that on and outside of . Assume that,
[TABLE]
where . Assume that for . Assume
[TABLE]
Then,
[TABLE]
Proof. We work on the event that the estimate of Lemma A.2 holds with and . We can assume that . Fix,
[TABLE]
and let be a cut-off function so that for and for and for . Denote
[TABLE]
Recall the Helffer-Sjöstrand formula (see, e.g., [11]): for , define the almost analytic extension of by:
[TABLE]
Then, for , we have
[TABLE]
Using this formula for we obtain,
[TABLE]
Using Lemma A.2 and the assumed bounds for and , we see that the last term is bounded above by
[TABLE]
where we used as well as (see, e.g., Section 3 of [7]). Similarly, the second last term is bounded above by
[TABLE]
For the first term, we first estimate the contribution of . From the fact that is increasing (and the same for ) and the local semi-circle law, we find the estimate
[TABLE]
which gives
[TABLE]
For we get,
[TABLE]
We have proven,
[TABLE]
The claim follows from choosing . ∎
Proof of Proposition A.1. Let now and . Let us denote by the eigenvalues of and by the eigenvalues of . Applying Lemma A.5 to with , and , we see first that
[TABLE]
(or else for some we would have , contradicting the estimate proven in Lemma A.5 and the fact that for such ). Reversing the roles of and we then get that
[TABLE]
Let be the smallest index so that
[TABLE]
and let . Similarly, let be the smallest index so that
[TABLE]
and let , and define and so on. Let be the smallest integer so that
[TABLE]
By rigidity we have that for ,
[TABLE]
if . Therefore,
[TABLE]
First, using Lemma A.5 we see that there are no eigenvalues in the interval
[TABLE]
for , by taking and , and . Next, we apply Lemma A.5 with the choice and . Note that since , we see that the length of in this case is less than , and so the lemma applies, which gives
[TABLE]
Since we have already shown that there are no eigenvalues in the intervals and , it follows that the quantity is precisely the number of eigenvalues in the interval . This must be an integer, and so it equals . The claim follows. ∎
Acknowledgements. We wish to thank Jinho Baik for sharing some computations from work in preparation, and Hao Wu for pointing out an error in an earlier draft of the manuscript. P.S. thanks Vu Lan Nguyen for discussions on the spherical SK model at low temperature. B.L. thanks Amol Aggarwal for many illuminating and useful discussions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Aizenman, J. L. Lebowitz, and D. Ruelle. Some rigorous results on the Sherrington-Kirkpatrick spin glass model. Comm. Math. Phys. , 112(1):3–20, 1987.
- 2[2] J. Baik, P. Le Doussal, and H. Wu. In preparation , 2019.
- 3[3] J. Baik and J. O. Lee. Fluctuations of the free energy of the spherical Sherrington–Kirkpatrick model. J. Stat. Phys. , 165(2):185–224, 2016.
- 4[4] J. Baik and J. O. Lee. Fluctuations of the free energy of the spherical Sherrington–Kirkpatrick model with ferromagnetic interaction. Annales Henri Poincaré , 18(6):1867–1917, 2017.
- 5[5] J. Baik and J. O. Lee. Free energy of bipartite spherical Sherrington–Kirkpatrick model. preprint, ar Xiv:1711.06364 , 2017.
- 6[6] J. Baik, J. O. Lee, and H. Wu. Ferromagnetic to paramagnetic transition in spherical spin glass. preprint, ar Xiv:1805.05630 , 2018.
- 7[7] F. Benaych-Georges and A. Knowles. Lectures on the local semicircle law for Wigner matrices. To appear in SMF volume Panoramas et Syntheses , 2016.
- 8[8] P. Bourgade, L. Erdös, and H.-T. Yau. Edge universality of beta ensembles. Comm. Math. Phys. , 332(1):261–353, 2014.
