The generalized TAP free energy II
Wei-Kuo Chen, Dmitry Panchenko, Eliran Subag

TL;DR
This paper extends the generalized TAP approach for mixed p-spin models, providing a simplified energy representation linked to the Parisi measure and analyzing ground-state configurations at zero temperature.
Contribution
It introduces a simplified energy representation for TAP states and extends positive temperature results to zero temperature for ground states.
Findings
Energy of TAP states is uniform at a given distance from the origin.
Ground-state energy and configuration organization are characterized at zero temperature.
Results connect TAP states with the Parisi measure.
Abstract
In a recent paper [14], we developed the generalized TAP approach for mixed -spin models with Ising spins at positive temperature. Here we extend these results in two directions. We find a simplified representation for the energy of the generalized TAP states in terms of the Parisi measure of the model and, in particular, show that the energy of all states at a given distance from the origin is the same. Furthermore, we prove the analogues of the positive temperature results at zero temperature, which concern the ground-state energy and the organization of ground-state configurations in space.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The generalized TAP free energy II
Wei-Kuo Chen
,
Dmitry Panchenko
and
Eliran Subag
Abstract.
In a recent paper [14], we developed the generalized TAP approach for mixed -spin models with Ising spins at positive temperature. Here we extend these results in two directions. We find a simplified representation for the energy of the generalized TAP states in terms of the Parisi measure of the model and, in particular, show that the energy of all states at a given distance from the origin is the same. Furthermore, we prove the analogues of the positive temperature results at zero temperature, which concern the ground-state energy and the organization of ground-state configurations in space.
School of Mathematics, University of Minnesota. Email: [email protected]. Partially supported by NSF grant DMS-17-52184.
Department of Mathematics. University of Toronto. Email: [email protected]. Partially supported by NSERC
Courant Institute. Email: [email protected]. Supported by the Simons Foundation.
1. Introduction and main results
The TAP approach, named after Thouless, Anderson and Palmer, was originally introduced in [31], where their famous equations for the magnetization and representation for the free energy of the SK model were derived. In a recent paper [14], adopting ideas from [28], we defined the generalized TAP free energy using a geometric approach for mixed -spin models with Ising spins, at any positive temperature. Our first goal here will be to compute the energy of all generalized TAP states in terms of their distance to the origin. The main focus, however, will be on the zero temperature analogue of the analysis in [14]. Of course, as the temperature tends to zero the Gibbs measure concentrates on near maximal energies, hence this analysis deals with the ground state energy and configurations. In particular, the corresponding TAP representation at zero temperature expresses the ground state energy, and the location and structure of TAP states contain information about the organization of ground state configurations in space.
The first rigorous mathematical results concerning the TAP approach were derived by Talagrand [30] who established the TAP equations for the SK model at high temperature; see also the works of Chatterjee [10] and Bolthausen [8, 9]. Much more recently, an analogue of the TAP equations within pure states was proved for generic mixed -spin models at low temperature by Auffinger and Jagannath [4]. Moreover, in [13], the TAP representation for the free energy was proved for general mixed models by the first two authors. In the setting of the spherical models, the representation for the free energy was proved for the -spin model by Belius and Kistler [6], and at very low temperature, for the -spin model with by the third author [27] and for mixed models close to pure by Ben Arous, Zeitouni and the third author [7].
In all of those works, the analysis was done at the level of pure states. As the temperature tends to zero, they degenerate to a single point and the TAP correction converges to zero, leaving only the energy term in the representation for the free energy. As a result, the TAP approach at the level of pure states trivializes at zero temperature. In [14, 28] the generalized TAP free energy was defined based on geometric principles, inspired by structural properties of the Gibbs measure, consequent to the famous ultrametricity property [17, 18, 19] proved by the second author in [21] (see also [22]). In contrast to the above, in addition to the pure states, this approach also treats ancestral states and generalizes to zero temperature in a natural way, as we shall see below.
1.1. Previous results at positive temperature
Let us introduce the model and recall the results from our previous paper [14]. Since these results will be used to pass to the zero temperature limit, here we will also introduce an inverse temperature parameter . The pure -spin Hamiltonian indexed by is defined by
[TABLE]
where are i.i.d. standard Gaussian random variables. Given a sequence that decreases fast enough, for example,
[TABLE]
is called a mixed -spin Hamiltonian. Here the processes are independent of each other for The covariance of the Gaussian process equals
[TABLE]
where is called the overlap of and , and where
[TABLE]
Let us recall the Parisi formula [25, 26] for the free energy
[TABLE]
If is the space of probability measures on , for let be the solution on of the Parisi PDE
[TABLE]
with the boundary condition where Let
[TABLE]
Then, the limit of the free energy is given by the Parisi formula [25, 26],
[TABLE]
which was first proved by Talagrand in [29] (building on a breakthrough by Guerra [15]), and later generalized to models with odd spin interactions in [24]. The minimizer is unique [1] (see also [16]) and is called the Parisi measure.
Next, we recall the generalized TAP free energy at inverse-temperature . For and let us consider a narrow band of configurations close to the hyperplane perpendicular to
[TABLE]
Given and let us consider a set consisting of configurations in this narrow band such that all are almost orthogonal to each other,
[TABLE]
For real numbers and an integer number , let
[TABLE]
The motivation for this functional was given in [14], so we will not repeat it here.
We will denote the concave conjugate of the Parisi functional defined in (1.6) by
[TABLE]
For , the minimizer on the right-hand side exists and is denoted by \hbox to0.0pt{\hskip 1.38889pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\Psi}}_{\beta}(q,a,\zeta). Let denote the space of probability measures
[TABLE]
For such that , we define
[TABLE]
Notice that this functional depends only on the values of on the interval , so we can view it as a functional on the space of all cumulative distribution functions on Finally, define the TAP functional
[TABLE]
We will denote the minimizer to the right-hand side by It was proved in [14] that the minimizer is unique and that is a continuous functional on . Let us denote
[TABLE]
For , define the empirical measure
[TABLE]
The following were the main results in [14].
Theorem 1** (TAP correction).**
For any , if are small enough and is large enough then, for large ,
[TABLE]
Theorem 2** (TAP representation).**
For any and any ,
[TABLE]
Theorem 3** (TAP states are ancestral).**
For any and any ,
[TABLE]
Theorem 4** (Generalized TAP equations).**
For any
[TABLE]
where is the minimizer to (1.15) with
1.2. The energy of generalized TAP states
Our first main result computes the energy and TAP correction for all generalized TAP states at positive temperature in terms of their distance from the origin. Given , let us denote the TAP free energy functional by
[TABLE]
and, given , let
[TABLE]
be the set of -maximizers of For simplicity of notation, we keep the dependence of and on implicit. The elements of the set with and are called the generalized TAP states.
Theorem 5** (The energy of generalized TAP states).**
For any and any sequence going to zero, almost surely,
[TABLE]
and
[TABLE]
where
[TABLE]
Remark 6** (Classical case).**
By definition, and therefore, by Theorems 2 and 3, we must have for all the generalized TAP states. Classical TAP states correspond to , which is the largest point in the support of in which case the TAP correction simplifies to (see [14, Proposition 11])
[TABLE]
where
[TABLE]
In particular, (1.24) implies that the entropy of the classical TAP states is given by
[TABLE]
so both energy and entropy of classical TAP states are constant. ∎
There exists an asymptotic description of measures corresponding to ancestor states in the Parisi ansatz, and we will derive an asymptotic analogue of (1.24) directly from this description. Such description first appeared in the physics literature in [20]. Rigorously, an asymptotic distribution of spins (from which a description of can be extracted) in terms of the Parisi measure was derived in Chapter 4 in [22] under certain regularizing perturbations that were introduced in [23], and it was observed in [4] that for generic models the same proof works without perturbations. The results in [22] were written in terms of the discrete Ruelle probability cascades, whose overlap distribution approximates the Parisi measure , but one can write them directly in terms of the Parisi measure (without discretization) in terms of the solution of the SDE
[TABLE]
as was done, for example, in [4] and [5]. We will not describe all these results precisely here, but simply mention that, for , asymptotically the coordinates of an ancestor state with look like i.i.d. random variables with the distribution
[TABLE]
In other words, is an asymptotic analogue of . We will show the following.
Theorem 7**.**
For any and defined in (1.29),
[TABLE]
Moreover, for any
[TABLE]
The first equation is an asymptotic analogue of (1.24), and the second equation states that, in general, is an upper bound on the TAP correction for such measures.
1.3. TAP approach at zero temperature
Next, we will describe the analogue of the above results at zero temperature. Let us define
[TABLE]
Then we can write
[TABLE]
We will be interested in points where the above inequalities become approximate equalities, for large . In other words, we are interested to characterize points that have many near ground states orthogonal to each other relative to .
Let be the family of c.d.f.s induced by all measures on with
[TABLE]
For , consider the solution to the following PDE,
[TABLE]
on with the boundary condition It was shown in [1, Corollary 2] (see also [12, Section 2]) how such solution can be defined for all under the condition (1.35). For , we define
[TABLE]
We will see that, for , the minimizer is unique and finite (see Remark 19 below). We will denote this minimizer by \hbox to0.0pt{\hskip 1.38889pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\Psi}}(q,a,\gamma), so that
[TABLE]
Moreover, for , this infimum is well-defined and (see Remark 15 below)
[TABLE]
If with we define
[TABLE]
Again, notice that this functional depends only on the values of on the interval , so we can view it as a functional on the space of measures on such that
[TABLE]
Finally, we let
[TABLE]
We are now ready to state our main results on the generalized TAP free energy at zero temperature. The first is a uniform concentration result for the TAP free energy defined in (1.33) around the (non-random) functional we have just defined (1.41), applied to the empirical measure .
Theorem 8** (TAP correction at zero temperature).**
For any , if are small enough and is large enough then, for large
[TABLE]
Recall that the Parisi formula for the ground state energy of the mixed -spin model derived in [3] states that
[TABLE]
and this variational formula has a unique minimizer, denoted The next result is the TAP representation for the ground state energy, which is the zero-temperature analogue of the TAP representation for the free energy in Theorem 2 above.
Theorem 9** (TAP representation at zero temperature).**
For any and any
[TABLE]
Note that by combining the two theorems above, if is an approximate maximizer in (1.44), then the inequalities of (1.34) become approximate equalities. Namely,
[TABLE]
provided that and are small enough, and is large enough. In other words, any generalized TAP state contains many samples which approximately maximize the energy, and such that the centered samples are approximately orthogonal.
Recall that the functional was defined in (1.41) as an infimum over the space of c.d.f.s . The following theorem shows that the minimizer is unique.
Theorem 10**.**
For any with has a unique minimizer .
We think of the minimizer as the order parameter associated to a generalized TAP state with . It is related the order parameter of the original model through the following theorem.
Theorem 11** (Ancestral property of zero-temperature TAP states).**
For any and any ,
[TABLE]
Note that if is an approximate maximizer in (1.44), then it must also be an approximate maximizer of (1.46) and
[TABLE]
Next, in order to describe the critical point equations for the TAP states,
[TABLE]
we need to compute the gradient of The statement is somewhat more involved than what one would expect from the direct analogue of Theorem 4 above. Denote and let
[TABLE]
Theorem 12** (Gradient of TAP correction).**
For any with if we denote
[TABLE]
then
[TABLE]
If we combine (1.48) and (1.51), we can write
[TABLE]
If we plug both sides into and recall the definition of \overline{\hbox{}}$$\Psi, we get
[TABLE]
These are the TAP equations at zero temperature.
2. Passing to zero temperature
Some of the zero temperature results above can be proved by adapting the proofs from [14] to the zero-temperature setting. This, however, entails a rather involved and long analysis. Instead, the approach we shall take here is to relate the zero-temperature variants to the results proved for positive temperature in [14], and use those as much as possible. The main result of this section is Lemma 14 below, that bounds, for a given empirical measure , the difference between the functional (see (1.15)) at a given positive temperature and the zero-temperature functional (see (1.41)). It will allow us to reduce zero-temperature results to the positive temperature results in the previous section. We first prove the following simple consequence of Theorem 1, that bounds the difference of the functional at two different temperatures.
Lemma 13**.**
For any and ,
[TABLE]
Proof.
For a fixed , by (1.17) and Gaussian concentration,
[TABLE]
for for large enough . On the other hand,
[TABLE]
and, therefore,
[TABLE]
This implies that
[TABLE]
Choosing so that and using continuity of proves the same inequality for arbitrary . Since is arbitrary, we get (2.1). ∎
Let us denote an -distance on by
[TABLE]
It was proved in [1, Corollary 2] and [12, Proposition 2] that
[TABLE]
Since
[TABLE]
we get that
[TABLE]
Hence, is Lipschitz on , which will be useful in the proof of our next result.
Lemma 14**.**
For any and we have that
[TABLE]
Proof.
With , let . If we make the change of variables
[TABLE]
it is easy to check that
[TABLE]
with the boundary condition
[TABLE]
Standard properties of the Parisi functional extend to . For example, it is well-known that changing the boundary condition in the definition of by at most a constant changes the solution by at most this constant, so the same holds for . Observe that
[TABLE]
Since and in (1.36) only differ in the boundary conditions, which differ by at most we get
[TABLE]
Using this together with
[TABLE]
implies that
[TABLE]
Note also that
[TABLE]
Combining the last two displays, we get
[TABLE]
If we denote by the set of all measures on of total mass at most , taking infimum over all gives
[TABLE]
As by (2.4), the infimum over converges to the infimum over all and using (2.1) finishes the proof. ∎
Remark 15**.**
It was shown in the proof of Theorem 10 in [14] that
[TABLE]
which together with (2.7) implies that
[TABLE]
By (2.3), it follows that
[TABLE]
for all ∎
3. TAP correction and representation
In this section we combine Lemma 14 from the previous section and Theorems 1 and 2, which concern the positive temperature case, to prove their zero-temperature analogues, Theorems 8 and 9.
3.1. Proof of Theorem 8
Note that
[TABLE]
Together with Lemma 14 this implies that
[TABLE]
This implies that the probability on the left-hand side of (1.42) is bounded from below by
[TABLE]
If we take large enough so that then our claim follows from Theorem 1.∎
3.2. Proof of Theorem 9
Recall that it was proved in [1] that if we denote by the minimizer of the Parisi formula for the free energy of the original model, then vaguely, i.e., for any continuous function with compact support in ,
[TABLE]
Fix in the support of . Then there exists in the support of such that as Note that
[TABLE]
and, from (2.5),
[TABLE]
From these,
[TABLE]
To handle the second big bracket, observe that since is continuous, it follows from (2.5) that is uniformly continuous on , since is compact. Hence, for any there exists such that whenever satisfy From now on, we fix large enough so that Note that for any with , we can find with such that Furthermore, we can choose so that the absolute values of the coordinates of and are arranged in the same order,
[TABLE]
If then
[TABLE]
Hence, from the above uniform continuity,
[TABLE]
In a similar manner, for any with , we can find with so that and
[TABLE]
On the other hand, from the Dudley entropy integral formula, there exists a constant depending only on such that
[TABLE]
which, combined with the Gaussian concentration inequality, implies that
[TABLE]
with probability at least , where is a constant depending only on Hence, from this inequality and (3.2),
[TABLE]
with probability at least . Thus, from (LABEL:zero:ThmTAPrepZT:proof:eq4),
[TABLE]
Our result then follows by using Theorem 2. ∎
4. Ancestral property of TAP states
This section is dedicated to the proof of Theorem 11. Unlike in the previous section, here we work at zero-temperature directly. First, note that since , using Theorem 9 and Gaussian concentration, our proof will be complete if we can show that, whenever lies in the support of
[TABLE]
Let be the space of all cumulative distribution functions induced by positive measures on satisfying From Guerra’s RSB bound for the ground state energy,
[TABLE]
for any , where
[TABLE]
and where is the solution to
[TABLE]
on with the boundary condition
[TABLE]
If now we take and then from the conjugation, and thus, As a consequence,
[TABLE]
This finishes our proof. ∎
5. Continuity of the Parisi functional
In this section we will prove that the Parisi functional is continuous when defined on an extension of to measures that charge the point . Namely, we set \hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}_{q,1} to be the collection of all measures on of the form
[TABLE]
where , , and We equip \hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}_{q,1} with the topology of vague convergence.
Remark 16**.**
Note that if \nu_{n}\in\cup_{q\in[0,1]}{\hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}}_{q,1} converges vaguely to certain \nu_{0}\in\cup_{q\in[0,1]}{\hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}}_{q,1}, then converges to a.e. on , where for are the pairs associated to on Indeed, this can be seen by noting that for are convex functions on and that for all due to the vague convergence of and the fact that is continuous on Since for are almost surely differentiable, we see that at the points of simultaneous differentiability of for the Griffith lemma (see, e.g., [30]) implies
[TABLE]
We also mention that it is not necessarily true that instead the following limit is valid
[TABLE]
For each \nu\in\hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}_{q,1}, if is the pair associated with , we define
[TABLE]
In addition, we also define, for
[TABLE]
The main result of this section is the following proposition, which establishes the continuity of . It will be used in the proof of Theorems 10 and 12.
Proposition 17**.**
For any , if vaguely in \hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}_{q,1} then
[TABLE]
and
[TABLE]
We also prove the following corollary, which will be used in the proof of Theorem 12.
Corollary 18**.**
If for , and \nu_{n}\in\hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}_{q_{n},1}\to\nu_{0}\in\hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}_{q,1} vaguely on , then
[TABLE]
and
[TABLE]
The proof of Proposition 17 utilizes the stochastic optimal control representation for from [1, Corollary 2], which we now recall. For any let be the collection of all progressively measurable processes with respect to the filtration generated by the standard Brownian motion and with . Then we can express
[TABLE]
In particular, for any ,
[TABLE]
Remark 19**.**
Notice that the representation (5.7) shows that , which means that, for the minimizer in the definition of in (1.37) is unique and finite.
For any , denote
[TABLE]
We will need the following estimate on .
Lemma 20**.**
For any , , and , we have
[TABLE]
Proof.
Recall (5.6) for with Let Then, by (5.6) and Jensen’s inequality,
[TABLE]
To establish the upper bound, for any , write
[TABLE]
and, using bound the second term by
[TABLE]
By (5.6), this implies that
[TABLE]
Taking the supremum over gives the desired upper bound. ∎
5.1. Proof of Proposition 17
By the definition of , the assertion (5.3) evidently follows from (5.2), so we only focus on proving (5.2). Obviously this assertion holds if From now on, assume that
Let and be the pairs associated with and respectively. From the vague convergence, almost surely on Therefore, for any
[TABLE]
which yields, by the bounded convergence theorem,
[TABLE]
(However, of course, it is not necessarily true that )
Next, fix . For any and , set
[TABLE]
where Using (5.6) for with and Lemma 20,
[TABLE]
In addition, by the triangle inequality,
[TABLE]
For any , if we write then
[TABLE]
which, by the triangle inequality and \mathbb{E}\bigl{(}\int_{q}^{q^{\prime}}\!dB_{s}\bigr{)}^{2}=\xi^{\prime}(1)-\xi^{\prime}(q^{\prime}) implies that
[TABLE]
From this inequality, (5.10) and (5.11), we see that by taking maximum over and using (5.6) for with , it follows that
[TABLE]
Taking a limit and using (5.9) gives
[TABLE]
Note that, since
[TABLE]
we can rewrite the expression on the left-hand side of the above inequality as
[TABLE]
From the vague convergence and (5.9), the last term converges to
[TABLE]
and, therefore, (LABEL:add:lem1:proof:eq2) implies
[TABLE]
The right-hand side vanishes as , which completes the proof. ∎
5.2. Proof of Corollary 18
Let and Then converges to vaguely and from Proposition 17,
[TABLE]
On the other hand, by using the representation (5.6) for with and with , we see that
[TABLE]
From these, we see that
[TABLE]
Note that from the vague convergence of to , converges to a.s. Using the fact that are nondecreasing, we see that
[TABLE]
Consequently,
[TABLE]
This together with (5.13) completes our proof. ∎
6. Uniqueness of the minimizer
This section is devoted to the proof of Theorem 10. We begin with the following two lemmas which will be needed in the proof. For any fixed measure with , it was proved in [14] that the functional has a minimizer in and the restriction of this minimizer to (which can be viewed as an element of ) is unique.
Recall the stochastic optimal control representation for which states that for any , one can express
[TABLE]
where the supremum is taken over all progressively measurable processes on with respect to the standard Brownian motion In particular, the supremum here is attained by
[TABLE]
where is the strong solution of
[TABLE]
with the initial condition
Lemma 21**.**
For any and , we have that
[TABLE]
Proof.
Let be any nondecreasing function on with right-continuity for some For any continuously differentiable functions on , the following integration by parts is valid,
[TABLE]
where the first integral on the right-hand side should be understood as the Riemann-Stieltjes integral. Note that a direct differentiation of the Parisi PDE in gives
[TABLE]
From the Feynman-Kac formula,
[TABLE]
For convenience, from now on, we denote and Using the usual integration by part gives
[TABLE]
where the second equality used the fact that . In addition, from (6.3),
[TABLE]
These imply that
[TABLE]
Finally, our proof is completed by plugging the following equation (see [14, Lemma 37]) into this equation,
[TABLE]
∎
Lemma 22**.**
For any and we have that
[TABLE]
Furthermore, if is supported on for some , then
[TABLE]
Proof of Lemma 22.
If , the inequality (6.4), obviously, holds. From now on, we assume that so First, let us explain that it is enough to prove the assertion (6.4) for measures with the support in . On the one hand, we noted in the proof of Theorem 9 that is continuous in and, moreover, we can approximate any by measures with the support in while keeping fixed. On the other hand, it was shown in the proof of Theorem 10 in [14] that is continuous in for any fixed and, by the properties of the Parisi functional , it is -Lipschitz in uniformly over , which implies that is continuous. By the uniqueness of the minimizer restricted to , this implies that is also continuous in restricted by These observations imply that it is enough to prove Lemma 22 for with the support in . From now on, we suppose that for some .
Fix . For any
[TABLE]
and
[TABLE]
Note that, for any ,
[TABLE]
where is the minimizer of
[TABLE]
It was proved in Section 12.2 in [14] that is continuous and bounded on . Recall from Proposition 4 in [2] (with there) that
[TABLE]
where was defined in (6.1). If we denote
[TABLE]
then
[TABLE]
To handle this equation, for any and , set By a standard calculation (see e.g. [11]), one can compute the directional derivative of ,
[TABLE]
which must be non-negative by the minimality of Again, in a standard way one can readily see (by varying ) that this forces for any in the support of . This implies that
[TABLE]
Here, note that from (6.3),
[TABLE]
Plugging these two equations into the previous display leads to
[TABLE]
From this, (LABEL:eqTAPBHlower), (LABEL:eqTAPBHlower2), and (6.8) (together with our assumption that ) it follows that the left and right derivatives of (which exist from convexity in ) satisfy
[TABLE]
Now, since is a convex function in , this implies that
[TABLE]
From this and the convexity of in , the assertion (6.4) follows by noting that
[TABLE]
while the assertion (6.5) is validated by using the above inequality and
[TABLE]
This finishes the proof. ∎
Proof of Theorem 10
Let be fixed and set . In the case that , the space is a singleton and the theorem follows trivially. From now on, assume that . Denote by the minimizer associated to Note that, by Lemma 22 above,
[TABLE]
Denote and, for all measurable sets set
[TABLE]
Since is nondecreasing, (6.9) implies that
[TABLE]
On the other hand, from this inequality and (6.9), we also see that Because of these, we can choose a subsequence of so that converges to some vaguely on and is convergent. For notational clarity, we will assume that converges to vaguely on and converges without going to a subsequence. Note that since almost surely on , by Fatou’s lemma, , which means that Furthermore, if we denote
[TABLE]
and define by
[TABLE]
then converges to vaguely on Indeed, for any continuous function on with
[TABLE]
and, passing to the limit,
[TABLE]
where the right-hand side vanishes because, for any
[TABLE]
Next we prove that is a minimizer to From Proposition 17,
[TABLE]
Also, note that from the vague convergence of to
[TABLE]
Together these lead to
[TABLE]
Since, from (2.8),
[TABLE]
and, from Lemma 14,
[TABLE]
we conclude that Hence, is a minimizer to
Finally, we show that the minimizer to is unique. To see this, we recall from Lemma 5 in [12] that is a strictly convex functional in . This implies that for any , is strictly convex in and so is Hence, has a unique minimizer, ∎
Remark 23**.**
Recall the measures and in the above proof. From (6.5), we see that
[TABLE]
Moreover, we showed that converges to almost surely on as
7. Energy of TAP states
In this section, we will prove Theorem 5.
Proof of Theorem 5.
Let us denote
[TABLE]
Recall from Theorem 2 that for any in the support of the Parisi measure , the following limits exist almost surely (using Borell’s inequality and the concentration of the free energy),
[TABLE]
and, by [2, Remark 1], is differentiable with
[TABLE]
Since is convex in and, by Theorem 1, it converges to uniformly in , it follows that, for any , is convex in , which implies that and are convex in . Since
[TABLE]
for any and , we can write
[TABLE]
using convexity in the last inequality in each line, where the existence of is guaranteed by (6.5). Taking the supremum in the first line and infimum in the second line over and taking limits,
[TABLE]
Letting and using that is differentiable implies that
[TABLE]
By (6.5), denoting as before , for any ,
[TABLE]
By continuity of in both and and uniqueness of the minimizer, the order parameter is continuous in , so the same formula holds for all . Together with (7.3) this gives
[TABLE]
To finish the proof of (1.23), it remains to show that
[TABLE]
Also, (1.24) will follow simply by using (7.2) and the equality in (1.25) is valid directly from integration by parts. Note that, for any ,
[TABLE]
and These imply that
[TABLE]
where the a.s. convergence follows from Theorems 2 and 3 above, the concentration of the free energy, and the Borell inequality. Now, assume on the contrary that (7.5) is not true. From this and the above limit, we can choose so that (by passing to a subsequence if necessary) and for some and ,
[TABLE]
and, from the continuity of on ,
[TABLE]
The optimality of
[TABLE]
yields that
[TABLE]
This means that is a minimizer of . Recall that the minimizer is unique [14, Theorem 10], so, by (7.7), on This contradicts (7.6) and finishes the proof of (7.5). ∎
8. Energy of Ancestor Measure
In this section, we will prove Theorem 7.
Proof of Theorem 7.
Recall (6.2) and let for Denote
[TABLE]
Let be the distribution function of the random variable Note that
[TABLE]
where satisfies Since is strictly increasing, it follows that if then and hence,
[TABLE]
Here, the middle term can be computed through
[TABLE]
To handle this equation, note that and . These and (6.3) imply that
[TABLE]
which together with (6.3) leads to
[TABLE]
Since
[TABLE]
it follows that
[TABLE]
Consequently,
[TABLE]
Note that by the minimality of for any ,
[TABLE]
If, for , we take
[TABLE]
then this inequality implies that
[TABLE]
Hence,
[TABLE]
Finally, if is in the support of , then from [13, Equation ],
[TABLE]
which gives
[TABLE]
This finishes the proof. ∎
9. Gradient of
In this section we establish the proof of Theorem 12. Recall that by Lemma 14, converges to , uniformly in as Let be fixed. Let be any compact subset of For any define
[TABLE]
where and
[TABLE]
In the following, we will verify that
[TABLE]
If this is valid, this means that the gradient of exists for all and is equal to which finishes our proof. We now establish the above limit by three steps.
Step 1. Let and be two sequences with and so that
[TABLE]
If is the minimizer in the definition of , let us denote
[TABLE]
By the definition of \hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}_{q_{m_{n}},1} (see (5.1)), if we define a measure on by
[TABLE]
then from (6.4), it satisfies that
[TABLE]
From this upper bound, we can pass to a subsequence along which converges to some \nu_{0}\in\hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}_{q_{m_{0}},1} vaguely on , where
[TABLE]
for some and For notational clarity, we will assume throughout the rest of the proof that these hold without passing to a subsequence of We claim that
[TABLE]
where we recall (1.49) and that is the minimizer as in Theorem 10. Indeed, by the uniform convergence of to and continuity of ,
[TABLE]
On the other hand, by (2.8),
[TABLE]
For and , set
[TABLE]
so that
[TABLE]
If , then for large . It is clear from the representation (5.7) and the uniform control in (9.3) that the minimizer belongs to some cube , where depends only on and the upper bound in (9.3). Let us choose further subsequence along which . Then, using Proposition 17 exactly as in the argument leading to (6.10), we get
[TABLE]
By (9.5), this also equals to
[TABLE]
for some . By the strict convexity of the functional (9.6), we must have that and .
Note that for any , is strictly convex in and that is continuous in . From these, we see that is continuous in As a result, from Lemma 22, is continuous on for all Furthermore, this derivative is nondecreasing in and, as it converges to , which is a continuous function. Hence, from Dini’s theorem, converges to uniformly in From this, Remark 23, and the definition of , the limit \int_{0}^{1}\!\bigl{(}\xi^{\prime}(s)-\xi^{\prime}(q_{m_{n}})\bigr{)}d\nu_{n}(s) can be written in two ways,
[TABLE]
Since we showed that this implies that and finishes the proof of (9.4).
Step 2. Next, we handle the limit of the gradient of . Recall that from Theorem 4,
[TABLE]
Here, the second term on the right-hand side can be handled by using the fact that , the vague convergence of and (9.4), to obtain that
[TABLE]
Next, we treat the first term on the right-hand side of (9.9). Recall that for any , and , we have that
[TABLE]
Denote by
[TABLE]
Let us again assume without loss of generality that the following limits exist on the extended real line, for all Then from (2.6), (2.7), and Corollary 18,
[TABLE]
which means that x_{i}=\hbox to0.0pt{\hskip 1.38889pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\Psi}}(q_{m_{0}},m_{0,i},\gamma_{m_{0}}). Combining this with (9.4), (9.9), and (9.10), we arrive at
[TABLE]
Step 3. Finally, we show that in a similar manner as the first and second steps. Once this is verified, this and (9.11) together imply the desired uniform convergence and hence finish our proof. Recall from Remark 23 that for each , if we define the measure on by
[TABLE]
then
[TABLE]
Note that \nu_{n}^{\prime}\in\hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}_{q_{m_{n},1}}. As in Step 1, we can assume without loss of generality that vaguely converges to some \nu_{0}^{\prime}\in\hbox to0.0pt{\hskip 1.25pt\leavevmode\hbox{\overline{\hbox{}}}\hss}{\leavevmode\hbox{\mathcal{N}}}_{q_{m_{0}},1} defined as
[TABLE]
for some and We claim that
[TABLE]
By the argument in Step 1 above,
[TABLE]
Hence, the uniqueness of the minimizer forces On the other hand, the vague convergence of to and (9.12) imply that
[TABLE]
which means that These complete the proof of (9.13). Now, from (9.13),
[TABLE]
Furthermore, following a similar argument as we handled the first term on the right-hand side of (9.9) in the second step, it can also be obtained that
[TABLE]
This together with the above limit gives that and this completes our proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Auffinger and W.-K. Chen. The Parisi formula has a unique minimizer. Comm. Math. Phys. , 335(3):1429–1444, 2015.
- 2[2] A. Auffinger and W.-K. Chen. The Legendre structure of the Parisi formula. Communications in Mathematical Physics , 348(3):751–770, 2016.
- 3[3] A. Auffinger and W.-K. Chen. Parisi formula for the ground state energy in the mixed p 𝑝 p -spin model. Ann. Probab. , 45(6B):4617–4631, 2017.
- 4[4] A. Auffinger and A. Jagannath. Thouless-Anderson-Palmer equations for generic p 𝑝 p -spin glasses. ar Xiv:1612.06359 , 2016.
- 5[5] A. Auffinger and A. Jagannath. On spin distributions for generic p 𝑝 p -spin models. Journal of Statistical Physics , page 413, November 2018.
- 6[6] D. Belius and N. Kistler. The TAP-Plefka variational principle for the spherical SK model. ar Xiv:1802.05782 , 2018.
- 7[7] G. Ben Arous, E. Subag, and O. Zeitouni. Geometry and temperature chaos in mixed spherical spin glasses at low temperature – the perturbative regime. ar Xiv:1804.10573 , 2018.
- 8[8] E. Bolthausen. An iterative construction of solutions of the TAP equations for the Sherrington-Kirkpatrick model. Comm. Math. Phys. , 325(1):333–366, 2014.
