Convergence Time to Equilibrium of the Metropolis dynamics for the GREM
A. M. B. Nascimento, L. R. Fontes

TL;DR
This paper analyzes how quickly the Metropolis dynamics for the GREM reach equilibrium by deriving bounds on the spectral gap using advanced mathematical tools, applicable to models with multiple hierarchical levels.
Contribution
It provides new bounds on the convergence time of Metropolis dynamics for GREM with arbitrary hierarchy levels using Poincaré inequalities and convex analysis.
Findings
Bounds on the inverse spectral gap are established.
The convergence time depends on the hierarchical structure of the GREM.
Methodology applies to general cases of the model.
Abstract
We study the convergence time to equilibrium of the Metropolis dynamics for the Generalized Random Energy Model with an arbitrary number of hierarchical levels, a finite and reversible continuous-time Markov process, in terms of the spectral gap of its transition probability matrix. This is done by deducing bounds to the inverse of the gap using a Poincar\'e inequality and a path technique. We also apply convex analysis tools to give the bounds in the most general case of the model.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Convergence Time to Equilibrium of the Metropolis dynamics for the GREM
A. M. B. Nascimento 33footnotemark: 3 Partially supported by CNPq grant 140762/2016-7
L. R. Fontes Partially supported by CNPq grant 311257/2014-3, and FAPESP grant 2017/10555-0Instituto de Matemática e Estatística, Universidade de São Paulo, Rua do Matão 1010, Cidade Universitária, 05508-090 São Paulo SP, Brasil. Emails: amarcos, [email protected]
Abstract
We study the convergence time to equilibrium of the Metropolis dynamics for the Generalized Random Energy Model with an arbitrary number of hierarchical levels, a finite and reversible continuous-time Markov process, in terms of the spectral gap of its transition probability matrix. This is done by deducing bounds to the inverse of the gap using a Poincaré inequality and a path technique. We also apply convex analysis tools to give the bounds in the most general case of the model.
AMS 2010 Mathematics Subject Classification. 60K35, 82B44, 82C44, 82D30
Key words and phrases. spin glasses, GREM, Metropolis dynamics, convergence to equilibrium, spectral gap, Poincaré inequality
1 Introduction and Main Result
The Generalized Random Energy Model (GREM) is a mean field model for a spin glass in equilibrium, introduced in [6]. Let us describe it. Consider a system with configuration space being , the discrete hypercube in dimensions, equipped with the following hierarchical structure in levels. Fix a number , such that , to indicate the number of levels. Let be a sequence of positive real numbers such that and consider the following partition of the number into integers:
[TABLE]
With this notation, we represent as the product
[TABLE]
so that a spin configuration is labeled as where stands for the -th level of . We denote with and generic spin coordinates of and respectively.
Now, we will define GREM’s Hamiltonian on . Let
[TABLE]
be a family of independent (vectors of independent) Gaussian random variables of mean 0 and variance . We may view as a random environment for the spin model to be defined next. Let be a collection of strictly positive real numbers such that , and denote by the vector . The GREM Hamiltonian on is then defined by
[TABLE]
where for each , we denote by the vector , and is the usual inner product on . Then is a family of Gaussian random variables with marginal mean zero and variance , and we remark that and are independent if and only if differ on the first level, i.e., if and only if .
We denote by the Gibbs measure at inverse temperature associated to the GREM Hamiltonian that assigns to each the mass
[TABLE]
where denotes the usual normalizing factor. As usual, the function
[TABLE]
indicates the finite volume free energy. Notice that all those quantities are random variables on .
Existence of the Free Energy.
An important equilibrium feature of the GREM that will be needed here is the existence of the free energy: for all the limit
[TABLE]
exists -almost surely and coincides with — see [4], Theorem 2.1. Notice that is a nonrandom function.
For the sake of completeness, we recall here the explicit formula of . To get to that, we start by considering the -dimensional Euclidean space equipped with the norm . Let us denote by the following subset of ,
[TABLE]
where
[TABLE]
Now, set and recursively, define
[TABLE]
where for . Let be such that . Consider now the collection , where
[TABLE]
and and . From the definition of , it is clear that is strictly increasing in . Suppose for some , and let be such that
[TABLE]
With this terminology, is the point of at minimal distance from
[TABLE]
We finally have, for all , that
[TABLE]
if for some — see [4]. We remark that this function is once, but not twice, continuously differentiable with respect to . From a physical point of view, this means that there exist (possibly multiple) third-order phase transitions for the GREM. Let us also point out that for there exists a unique point , independent of , such that and
[TABLE]
The latter identity is shown in Appendix, Lemma A.1.
Dynamics.
Here, we consider a dynamics for the GREM, that is, we construct a continuous time Markov chain with state space , for which the Gibbs measure is invariant; indeed, the chain and the GREM are in detailed balance. In fact, we consider the Metropolis dynamics. Let us define it next. Let us consider the continuous-time Markov process , taking values in and having transition probability matrix P with entries given by
[TABLE]
where is the GREM Hamiltonian defined in (1.3); is the inverse of temperature parameter; denotes the usual Hamming distance on and , . This process is reversible, and therefore, both stationary and ergodic, with respect to the Gibbs measure .
Before discussing our results, let us recall the related results derived for the REM under Metropolis (which corresponds to the GREM with ).
The following result is implied by Theorem 1 in [11]. Let be the spectral gap of the generator of the dynamics (or, equivalently, of the one-step transition probability matrix). Then for all we have that
[TABLE]
Indeed Theorem 1 in [11] provides estimates for the errors of approximation that hold a.s. for all large enough , but we will not be concerned with those here.
In this paper we will derive upper bounds for the analogue in our dynamics of the quantity whose limit is taken in (1.15). These, as is well known, provide upper bounds for the time to reach equilibrium under the dynamics. Let us describe the relevant quantities more precisely.
Let denote the eigenvalues of the one-step transition probability matrix P whose entries are defined in (1.14); since P is reversible with respect to , we have that
[TABLE]
is its spectral gap. Notice that, in the case of the REM, . The main result of this paper is the following.
Theorem 1**.**
For all ,
[TABLE]
Some remarks follow:
First of all, notice that the bound in the right-hand side of (1.17), viewed as function of , is the function that describes the free energy of the GREM for . As expected, we get Proposition 4.2 in [11] as corollary of the Theorem 1 by taking . We still remark that Theorem 1 holds for *all * , for *all * and for any choice of parameters and satisfying and . 2. 2.
In view of Theorem 1, using the following well known bound (see [7] for a derivation): for all and ,
[TABLE]
together with (1.6) and Theorem 1.5(iii) of [2], one deduces that for any ,
[TABLE]
Here is the transition kernel of the dynamics. 3. 3.
There is reason to believe that the bound (1.17) is not sharp, based on the results of [10], where large volume limits for a hierarchical, simplified version of the present dynamics are derived for the 2 level GREM at low temperature (in the cascading phase), with time properly scaled. The limit dynamics are ergodic processes, and have the (infinite volume) Gibbs measure as equilibrium measure. The time scalings for those results are always below what is implied by (1.17), and this would indicate that the latter bound is not sharp (at least at low temperatures).
On the other hand, under the dynamics of [10], it may be proved that (1.17) is the best bound one gets (to leading order) by using the Poincaré inequality employed in the present work (at all temperatures). 4. 4.
A direct analysis of the Metropolis dynamics for the GREM at time scales where one would expect to see an ergodic large volume limiting dynamics, as has been done in [10] for a simpler dynamics, has not been undertaken yet; even for the case of the REM, this has been done only at smaller time scales, where aging takes place instead — see [5, 9] — and, indeed, spectral gap estimations are important elements in the derivations.
See also [1] for applications of spectral gap estimation on the study of a class of dynamics for a large family of mean field spin glasses.
The rest of the paper is devoted to prove Theorem 1. In Section 2, we develop our bound to the inverse of the spectral gap, in terms of the canonical path approach by Jerrum and Sinclair. This leads to the statement of two propositions which immediately lead to the proof of Theorem 1. The proof of the first of the propositions is done in Section 3, in several steps which take most of the remainder of the paper. Section 4 contains the similar, shortly presented proof of the second proposition, and an appendix is devoted to supporting results.
2 Proof of the Theorem 1 – Canonical set of paths
As mentioned above, the proof of Theorem 1 relies on a Poincaré inequality derived in [13]. To write this inequality in our context, the first step is to identify the Markovian process with an undirected graph with vertex set . Naturally, we identify it with the -dimensional hierarchical hypercube graph which we will also denote, with a little abuse, by . Let us denote the edge set of . Now, let be a complete set of self-avoiding canonical paths on , that is, for each , there exists exactly one path in connecting and using only valid transitions of the Markov chain , that is, only through edges of . Denote by the maximum length of paths (i.e. number of edges) in . Then, from Theorem 5 in [13] we have
[TABLE]
where the maximum is over all edges and the summation is over all pairs such that there exists a path in that contains edge . The expression is called the congestion associated with the set of paths . Recall (1.4) and (1.14). Using them, it is easy to check that
[TABLE]
Notice that to apply efficiently inequality in (2.1) we need now to construct a suitable set of paths that allows us to get a good upper bound to . By “good”, we mean that on the limit, in the very spirit of (1.17), such bound coincides -almost surely with .
When one tries to obtain a spectral gap estimate for the Metropolis dynamics of spin glass models using the canonical path technique, one of the first concerns is with edges where is large. A natural attempt to control these bad edges is to avoid them as much as possible in the trajectories. The completeness of implies that they cannot be avoided as extreme edges of paths, but we may try to avoid them in the interior of paths; as we will see below, we succeed in doing that with high probability, with a set of paths that is amenable enough to subsequent analysis. This approach was already used in [11]. Observe that with such set of paths, if is a bad edge, then we have that either and has the lowest energy, or has lowest energy and . Considering the first case — the other one follows by symmetry —, the term inside of the sign in (2.2) can be estimated by
[TABLE]
To construct our suitable set of paths , we need to introduce some notation. Let be arbitrary. We say that a configuration is good if
[TABLE]
otherwise, we will call it bad. We will call any set of configurations, in particular an edge of , good if all the configurations in it are good; otherwise, we will call the set bad. Then the set can be written as the following disjoint union: , where and denote the sets of good and bad edges, respectively.
For any path with , , let denote the set of interior edges of . A path with all interior edges good is called good; a set of paths with all elements good is also called good. At this point, it is clear that the set of paths that we aim to construct, a good one, will depend on the realization of the random environment which implies that will be a random set of paths.
One of the fundamental concepts we will need here is the notion of independent paths. Two paths and will be called independent if for all and , the random variables and are independent; equivalently, if . An extension of this concept for a finite family of paths in can be done in an obvious way. At last, let us denote by , resp. , the usual Hamming distance on , resp. .
With these concepts in hands, we have the following lemma where we specify one condition under which there exist independent paths connecting configurations in . This will also motivate our subsequent definition of .
Lemma 2.1**.**
Let and be two configurations in . If , then there exists a family containing independent paths connecting to .
Proof.
Consider, for each pair of distinct vertices , the set of paths
[TABLE]
where denotes the path from to defined as follows. Suppose ; then let be the positions where and disagree, . Let be the path starting at and ending at whose -th edge, , corresponds to flipping to .
For future reference, we set
[TABLE]
We will now argue that is a family of paths that satisfies the required property. Let be the positions where and disagree on the first level, and consider the set of paths . We claim that this set of paths is independent. Indeed, this is quite clear if the discrepancies between and are only in the first level. Otherwise, let us first notice that it is enough to consider the case where and differ in the first coordinates (where thus ); now it is just a matter of noticing that any interior configuration of is characterized by the condition that and (in this paragraph, should be understood as ).
∎
With the help of this lemma, we can now construct the random set of paths that we will consider in (2.2). Let be arbitrary:
For a given pair of distinct configurations and such that , if there exists a good path in , then we choose one such path, say the one with the smallest superscript, for ; otherwise, we choose ; 2. 2.
If , and there exists a good vertex such that , and there exist good paths, one in and another in , such that the concatenation of these two paths is a self-avoiding path with length less than , then we choose this concatenation as the path from to in (notice that this is a good path since is good); otherwise, we choose .
It is immediate that thus chosen is a complete set of self-avoiding paths, that is each pair is uniquely connected by a self-avoiding path . Moreover, we may readily check that , so we get the bound
[TABLE]
The following is a key fact about .
Proposition 2.1**.**
For any and any the following holds: with -probability there exists an such that for all the set of paths is good.
Proof.
For pairs of vertices such that , the -almost sure existence of good paths connecting them in is proved arguing as Proposition 4.1 in [11] using the help of Lemma 2.1.
For pairs of vertices such that , let us first denote by the set of positions where and differ on the first level and also introduce the set
[TABLE]
where the condition “” is not present if . Here, is just the restriction of to set . We may readily check that and for all .
For , let stands for the path starting at the vertex , constructed by flipping the sites whose positions belong to , in increasing order of coordinate. In case , we assume that and . By Lemma 2.2 below, there are at least such paths, which are independent by construction. Thus, since there exists a constant such that the probability of all visited vertices for a given such path to be bad can be bounded by when is large enough, for any and , we can found such that for all ,
[TABLE]
It then follows from the Borel-Cantelli Lemma that, for any and , with -probability , for all sufficiently large there exists at least one vertex, say , and its corresponding path, say , which is good. By construction we have that are more than distance apart, and so are ; as before, for any and any , we can find good paths and for all large enough. The conclusion of this case now follows by concatenating the (good) paths and , to get the path from to in . ∎
Lemma 2.2**.**
For any and any such that , let be as in (2.7). Then
[TABLE]
Proof.
We have that
[TABLE]
where the last inequality follows from the fact that , , and standard bounds for and . Now, since as , for any , we have that for any sufficiently large. This is enough to get the statement of the lemma. ∎
Having constructed the set of paths , we can now proceed with the spectral gap estimate. From now on we assume that, for all and all , for all large enough , is good. Recalling that , where and denote the sets of good and bad edges respectively, we can write
[TABLE]
where , respectively , is as the maximum term in (2.6) but with the sign restrict to edges in , respectively . From (2.3), it follows immediately that and, by Proposition 2.1, one readily concludes that , where
[TABLE]
Using these last bounds in (2.11), can be estimated by
[TABLE]
for all large enough .
Let now
[TABLE]
so we have .
In Sections 3 and 4, we prove the following two results, respectively.
Proposition 2.2**.**
For all ,
[TABLE]
Proposition 2.3**.**
For all ,
[TABLE]
These propositions, combined with (1.6), immediately yield Theorem 1.
3 Proof of Proposition 2.2
We follow the strategy in [11] (see Subsection 4.2 therein), with steps that are increasingly more involved than in the case of that reference; in particular, our last two steps depart considerably from the direct approach there.
Step 1 – Bound in terms of .
Since the set is constructed using paths in , if we denote
[TABLE]
for and , we get the estimate
[TABLE]
Since are identically distributed, it is sufficient to give an estimate for one of them with a relatively good probability estimate. Consider thus
[TABLE]
For a given edge , there exists a unique coordinate such that . So that, by construction, the set of all pairs such that is exactly
[TABLE]
Then, if we denote , ,
[TABLE]
and
[TABLE]
we obtain the bound
[TABLE]
Step 2 – Coarse graining.
Now we will focus on estimating the right-hand side of (3.7). Before turning to this, let us briefly describe our strategy. We partition the -dimensional Euclidean space into subsets , and analyse separately the contribution to and coming from each , by means of large deviation-type estimates, thus securing control over the exponentially many terms involved in the above maximization. It is enough to study in detail; the case of is entirely similar.
Let , and be fixed, and let be such that . Let be such that , and set such that
[TABLE]
Let , . We can thus write
[TABLE]
and
[TABLE]
where , and . We stress the relationship between , and established in this paragraph.
Remark 3.1**.**
Notice that if (cases equivalent to ), then we readily get that
[TABLE]
By Theorem 1.5(iii) in [2] and (1.6), we thus have that for all ,
[TABLE]
For convenience, we enumerate/represent
[TABLE]
as
[TABLE]
Set , . With this notation, can be written as
[TABLE]
Let and consider the following partition of :
[TABLE]
where for , we set
[TABLE]
Now we decompose in the following way:
[TABLE]
where .
First we consider the last sum in the right-hand side of (3.16); denote it by . We will show that this quantity is zero for all large enough Indeed, we note first that
[TABLE]
Now consider the event
[TABLE]
One readily checks that , so that, from Proposition 3.1 in [4], we have that the sum in (3.17), and thus , vanishes for all large
Step 3 – Large deviation estimate.
It remains to bound the first term in the right-hand side of (3.16). In order to do this, we need to introduce some notation. Given , define the canonical projection such that , where by convention . Set . Now, let be the functional defined by
[TABLE]
We remark that by compactness and convexity, admits a unique maximum on , at say ; set . We note that .
For each , let us set
[TABLE]
with the convention that .
Let now be such that , . With the above terminology, we have that for all large enough,
[TABLE]
where
[TABLE]
with the middle sum above being over all sequences of integers , and
[TABLE]
Remark 3.2**.**
Note that in (3.22) the point is such that for all .
Let , and be fixed. For , set
[TABLE]
where . With this notation, we write
[TABLE]
Now let us estimate . Let . We then have for all and large enough that
[TABLE]
Let now be a positive constant to be specified later, and define the following family of integers. For all , set . Let and recursively define
[TABLE]
until such that or for all . Put . We then have that . Moreover, for every ,
[TABLE]
At last, if , then put
[TABLE]
otherwise, that is, if , then put
[TABLE]
Lemma 3.1**.**
With the notation introduced above, for any and , the following holds for all large enough .
[TABLE]
In the proof of Lemma 3.1, we will use the following result.
Lemma 3.2**.**
For each , let . Set
[TABLE]
Then
[TABLE]
Proof.
From the definition of the and (3.28), it follows that
[TABLE]
and thus . ∎
Proof of the Lemma 3.1.
We will argue by induction on . In the case , since , by Chernoff’s inequality and Lemma 3.2 above, we have
[TABLE]
Assume that (3.31) is proved for . Introducing the random set
[TABLE]
and taking into account the independence of the Gaussian random variables, we may write
[TABLE]
where we also use on the second inequality the induction hypothesis (3.31) for ; here, is a relabeling of the random variables . It remains to bound the last term on the right-hand side of (3.35). Notice that . Since , it follows from Chernoff’s inequality and Lemma 3.2 above that
[TABLE]
This concludes the proof. ∎
Coming back to (3.7), we need to make a probability estimate which holds for all possible random variables involved in the max signs. Recall that there is an index for the chosen path family, the index , the configurations , , and the indices , and . Since there are not more than distinct such objects, it suffices to have a probability estimate in (3.31) to compensate for this factor. This suggests the choice of for the following result, which is immediate from Lemma 3.1 and the union bound.
Proposition 3.1**.**
Given , assume that . Then for all sufficiently large,
[TABLE]
In view of (3.22) and (3.25), one readily deduces from (3.37) that for any given , with probability , for all large enough,
[TABLE]
where
[TABLE]
The next step is to estimate (the non-random term on) the right hand side of (3.39).
Step 4 – Deterministic estimation.
It is worth noticing at this point that we have to make our estimation uniform with respect to all the indices involved.
Let and be fixed. We partition the support of the sum in (3.39) into the subsets
[TABLE]
If , then we have from (3.26) that
[TABLE]
for all large enough, where the exponential factor is not present if . (3.39) can thus be bounded above by
[TABLE]
For , let
[TABLE]
We will estimate by distinguishing the case from the case .
Case I: . From (3.20), (3.24) and Remark 3.2, we get that
[TABLE]
Now, from basic properties of inner product, we can also write
[TABLE]
Moreover, by construction, if , then , and thus
[TABLE]
Thus, by suitably using (3.44) and (3.46), we get that
[TABLE]
Now, for , set
[TABLE]
and note that is a nonempty closed convex subset of so, from Theorem 2 (see Appendix A), there exists a unique element of , say , such that
[TABLE]
Since we have , it is not difficult to see with the help of Remark 3.2 and (3.28) that ; it then follows from (3.49) that
[TABLE]
For each , let
[TABLE]
With this definition, (3.47) and (3.50) imply that for all large enough and for any ,
[TABLE]
Before going to the next case, let us point out that for any , we have
[TABLE]
Indeed, let be given by , and notice, from (3.51b), that
[TABLE]
According to Theorem 2 (see Appendix A), we have that for all . The claim follows.
Case II: . In this case, since
[TABLE]
it is immediate from (3.53) that can be estimated by
[TABLE]
for all large enough . Summarizing and coming back to (3.42), we get that
[TABLE]
for all sufficiently large, for some not depending on or , where .
Recall (3.38). In view of (3.57) and standard combinatorial estimates, we obtain that for any , with a probability for all large enough,
[TABLE]
for some constant . This concludes the estimation of .
Let us now recall (3.21). Since we have already estimated , it remains to estimate the term . From (3.53), we readily find that
[TABLE]
It follows from Proposition 3.1, (3.58) and (3.59) that for any , with a probability for all large enough,
[TABLE]
Symmetrically, we also have that
[TABLE]
Thus, letting
[TABLE]
we get that with probability for all large enough,
[TABLE]
Step 5 – Maximization.
As a final step, it remains to maximize over and . We do this in the following lemma.
Lemma 3.3**.**
For every and all ,
[TABLE]
Proof.
Recall the definitions (3.8) of , (3.20) of and (3.51a-b) of . Also recall, from the discussion at the beginning of Step 3 above, that denotes the maximum of over , attained at point . We claim that
[TABLE]
for any and any .
We check this for and . The other cases follows from similar arguments. Noting that is not present in the left-side of (3.65), from definition of , it is equal to
[TABLE]
Now, using “” to indicate vector concatenation, it is immediate to observe that , and so we find that the expression in (3.66) equals
[TABLE]
where the inequality follows from the the facts that (as noted right below (3.19)) and (by the maximality of the latter quantity). Convexity now implies that , and thus from Theorem 2 we may conclude that
[TABLE]
(3.65) is now just a matter of recalling (1.12).
From (3.65), we find that
[TABLE]
The lemma now follows readily from the fact that . ∎
Now, from (3.7), (3.63), Lemma 3.3, and the Borel-Cantelli Lemma, we get that
[TABLE]
Replacing this in (3.2), and since is arbitrary, Proposition 2.2 follows.
4 Proof of Proposition 2.3
Recall (2.15). Let us start by describing briefly the strategy we use to prove Proposition 2.3. In Proposition 2.1, we have showed that, for each pair of vertices such that , the path connecting them in has, with -probability , the form for all large enough , where the vertex , which we will refer to here as the intermediate point of the path , is such that and . Keeping this in mind, since the summation in the right-hand side of (2.15) is over a set of self-avoiding paths that go through the edge , we have that either , or . So, our plan is to proceed with the estimation of by considering these two cases separately.
Recall (2.7). Using the above arguments, we get that with -probability ,
[TABLE]
for all large enough where
[TABLE]
and
[TABLE]
Notice that, by our construction, and have no edge in common.
Let us first estimate the term . To do this, it is enough to notice that for a given edge the sum in the right-hand side of (4.3) is over a set of paths connecting pairs of vertices such that is in a hypercube of dimension at most around and is in a hypercube of dimension at most around . Using this, it follows that
[TABLE]
hence, by Theorem 1.5(iii) in [2] and (1.6),
[TABLE]
since is arbitrary.
To estimate the term , we use basically the same argument that we have applied to prove Proposition 2.2. Arguing as we did to get (3.2), we can write
[TABLE]
where is as in (4.2) but with the paths in . Again, it is sufficient to consider the variable . Now, using the fact that the set is equal to
[TABLE]
for a given edge , with respective ; using the same notation used in (3.7), we readily get
[TABLE]
where the power of 4 error factor arises due to condition . Arguing now as at the end of Section 3, since is arbitrary, we conclude that
[TABLE]
Hence, the claim of Proposition 2.3 holds.
Appendix A Appendix
Lemma A.1**.**
Let be the point of such that
[TABLE]
Then, and
[TABLE]
Proof.
The proof of Lemma A is inspired by a one in [8] and has as key tool the Cauchy-Schwarz inequality. The fact that is an immediate consequence of definition (1.9) and assumptions . Now, let . By Cauchy-Schwarz inequality, for all , we have
[TABLE]
Since , it follows that
[TABLE]
Hence,
[TABLE]
Set , , and consider the numbers . From what we have just seen, the sequences and satisfy the conditions of Lemma A in [8] so that we readily get
[TABLE]
This concludes the proof of Lemma A.1. ∎
Theorem 2** (Projection onto a closed convex set).**
Let be a nonempty closed convex set. Then for every there exists a unique element such that
[TABLE]
Moreover, is characterized by the property
[TABLE]
See [3], Theorem V.2, p. 79.
Acknowledgements
This work is part of the Ph.D. thesis of the second author at IME-USP and was supported in part by CNPq 140762/2016-7. We warmfully thank Pierre Picco for suggesting this problem and for innumerable discussions concerning it in many occasions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Arous, G. B. and Jagannath, A. Spectral gap estimates in mean field spin glasses. Communications in Mathematical Physics , 361(1), 1-52 (2018).
- 2[2] Bovier, A. and Kurkova, I. Derrida’s generalised random energy models 1: models with finitely many hierarchies. Ann. Inst. H. Poincaré Probab. Statist. 40(4), 439–480 (2004).
- 3[3] Brezis, H. Analyse fonctionnelle. Théorie et applications. Masson (1983).
- 4[4] Capocaccia, D., Cassandro, M. and Picco, P. On the existence of thermodynamics for the generalized random energy model. Journal of Statistical Physics 46(3-4), 493–505 (1987).
- 5[5] Cernỳ, J. and Wassmer, T. Aging of the metropolis dynamics on the random energy model. Probability Theory and Related Fields 167(1-2), 253–303 (2017).
- 6[6] Derrida, B. A generalization of the random energy model which includes correlations between energies. J. Phys. Lett. 46(9), 401–407 (1985).
- 7[7] Diaconis, P. and Stroock, D. Geometric bounds for eigenvalues of markov chains. The Annals of Applied Probability 1(1), 36–61 (1991).
- 8[8] Dorlas, T. C. and Dukes, W. M. B. Large deviation approach to the generalized random energy model. Journal of Physics A: Mathematical and General 35(20), 4385 (2002).
