Equi-Energy sampling does not converge rapidly on the mean-field Potts model with three colors close to the critical temperature
Mirko Ebbers, Matthias L\"owe

TL;DR
This paper demonstrates that Equi-Energy Sampling (EES) does not rapidly converge for the mean-field Potts model with three or more colors near the critical temperature, highlighting limitations of EES in certain regimes.
Contribution
The paper proves that EES is slowly mixing in the mean-field Potts model below the critical temperature, extending the understanding of EES limitations to models with multiple colors.
Findings
EES does not rapidly converge near the critical temperature.
EES is slowly mixing for the mean-field Potts model with three or more colors.
Results apply to any number of colors q ≥ 3 with appropriate temperature regimes.
Abstract
Equi-Energy Sampling (EES, for short) is a method to speed up the convergence of the Metropolis chain, when the latter is slow. We show that there are still models like the mean-field Potts model, where EES does not converge rapidly in certain temperature regimes. Indeed we will show that EES is slowly mixing on the mean-field Potts model, in a regime below the critical temperature. Though we will concentrate on the Potts model with three colors, our arguments remain valid for any number of colors , if we adapt the temperature regime. For the situation of the mean-field Potts model this answers a question posed in \cite{HuaKou}.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Equi-Energy sampling does not converge rapidly on the mean-field Potts model with three colors close to the critical temperature
Mirko Ebbers
Fachbereich Mathematik und Informatik, Universität Münster, Einsteinstraße 62, 48149 Münster, Germany
and
Matthias Löwe
Fachbereich Mathematik und Informatik, Universität Münster, Einsteinstraße 62, 48149 Münster, Germany
[email protected] missing
Abstract.
Equi-Energy Sampling (EES, for short) is a method to speed up the convergence of the Metropolis chain, when the latter is slow. We show that there are still models like the mean-field Potts model, where EES does not converge rapidly in certain temperature regimes. Indeed we will show that EES is slowly mixing on the mean-field Potts model, in a regime below the critical temperature. Though we will concentrate on the Potts model with three colors, our arguments remain valid for any number of colors , if we adapt the temperature regime. For the situation of the mean-field Potts model this answers a question posed in Hua and Kou, (2011).
Key words and phrases:
Equi-Energy Algorithm, Potts model, Swapping Algorithm, Metropolis Algorithm, Markov Chain Monte Carlo methods, Curie-Weiss model
2010 Mathematics Subject Classification:
60J10,60K35, 82B05
Research of the second author was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany ’s Excellence Strategy EXC 2044 –390685587, Mathematics Münster: Dynamics –Geometry -Structure
1. Introduction
Sampling methods are of utmost importance in applied mathematics, e.g. in Bayesian statistics, computational physics, econometrics, or computational biology. In many cases one wants to sample a random element drawn from a finite set according to a probability distribution on . But even this problem may be less trivial than it sounds. Sometimes may be finite, yet very large. E.g. when modeling a ferromagnetic material on atoms, is of the form and for real size systems, is of the order , thus is of the order . Hence a straight-forward Monte-Carlo simulation would take exponentially long in the system size and thus would be much too expensive. In other situations, the size of may not even be known, as e.g. in the so-called knapsack problem (Löwe and Meise, (2001)).
One potential solution of this problem lies in the use of Markov Chain Monte Carlo (MCMC, for short) algorithms. They rely on an aperiodic and irreducible Markov chain on that has as its invariant (i.e. stationary) distribution. One runs this Markov chain, stops it after some long enough time, and takes the current state as a sample element. The ergodic Theorem for Markov chains ensures that this element is almost distributed according to , given one has waited long enough. This method immediately raises two questions:
- (1)
Can one find for each a Markov chain that converges in distribution to ? I.e. can one find for each an irreducible and aperiodic Markov chain that has as its invariant measure? This question is answered in the affirmative by the Metropolis-Hastings chain (see e.g. Häggström, (2002)). 2. (2)
How long do we need to wait to get a sample with a distribution that is reasonably close to ? If this waiting time is polynomial in the problem instance we speak about fast or rapid convergence, otherwise, in particular, if the mixing time is exponential in the problem instance, we will say the algorithm converges torpidly or slowly.
It is, however, well known that like the Glauber dynamics the Metropolis-Hastings algorithm usually converges slowly, when the target distribution is multi-modal. Such situations occur e.g. in statistical physics in the presence of a phase transition. Hence slow convergence applies to a number of interesting situations, among them the low temperature phase of the Curie-Weiss model (see e.g. the discussion in Mossel and Sly, (2013)). In the next section we will introduce a close relative of the Curie-Weiss model, the three state mean-field Potts model. This will be our test model for the EES to be introduced in Section 3. In Section 4 we will show that this sampler mixes slowly when applied to the (three state) mean-field Potts model in a certain temperature regime. Here, a key argument is based on a property of the mean-field Potts model that is closely related to its first order phase transition (see Lemma 4.3 and the remark below it): The limit point of the order parameter (cf. (2.1)) at high temperatures remains a local maximum of its distribution also in a certain part of the low temperature regime. Hence a Metropolis-Hastings chain started in a neighborhood of this high temperature limit point will typically not escape from this neighborhood in polynomial time (in the part of the low temperature regime described above). But then also EES cannot improve the performance of the Metropolis-Hastings algorithm, because there are simply no observations of the global maxima of the distribution of the algorithm could jump at. The proof is completed by combining this observation with the very powerful technical argument of conductance, also known as Cheeger’s inequality (see Theorem 4.4).
2. The mean-field Potts model
Let us now introduce the mean-field Potts model. Consider the space , where , , , and (to avoid some complications in the future, we can think of being a multiple of ). The elements of are sometimes referred to as colors.
For convenience, in this note, we will restrict to the case of , i.e. the mean-field Potts model with three states or colors taken from the set . This from now on will be our standing assumption for the rest of the note. However, we remark that our argument remains valid for general , if one changes the regime of temperatures appropriately. We will come back to this remark later.
On we construct an energy function given by
[TABLE]
Here denotes the indicator function for an event (which is formally the Dirac measure for the event , to stress that our notation is consistent with our later use of ). Note that can be written as a function of the vector
[TABLE]
Indeed, one easily checks that
[TABLE]
is therefore called an order parameter of the model. With we associate a Gibbs measure at inverse temperature , i.e.
[TABLE]
Here is the partition function. Note that conventionally in statistical physics the energy function would carry an additional minus sign, and the Gibbs measure (as well as the partition function) would be defined in terms of the exponential of minus times that energy. Since the two minus signs would cancel and lead to the same definition of the Gibbs measure our Gibbs measure does not carry these conventional minus signs.
The mean-field Potts model was studied in a variety of papers. We refer to Ellis and Wang, (1990) and Kesten and Schonmann, (1989), who showed that there is a critical inverse temperature . This critical inverse temperature in the 3 states mean-field Potts model equals (cf. Cuff et al., (2012), which discusses the very interesting phenomenon of a temperature-dependent cut-off effect for the Glauber dynamics of the model).
At the model undergoes a first order phase transition. More precisely, the order parameter of the model converges in distribution to , when . At smaller temperatures one observes the following: For there is and vectors such that the vector has in its i’th component and all other components are equal such that they sum up to one. For the distribution of converges to . Here denotes the Dirac-measure. Finally, for there are, moreover, weights that sum up to 1, such that the distribution of converges to .
The phase transition is of first order, since , i.e. the jump is discontinuous. Moreover, the vector remains a local maximum of the distribution of for some temperatures below the critical temperature. Such a behavior can also be observed for general values of at other values for . We will come back to this fact in Section 4, Lemmas 4.2 and 4.3, because it is of utmost importance for the proof of Theorem 4.1 to be given in Section 4.
3. Equi-Energy Sampling
Various modifications of the Metropolis-Hastings algorithm have been proposed to speed up its convergence. Among them the so-called swapping algorithm (see Geyer, (1991)), the exchange Monte Carlo method (see Hukushima and Nemoto, (1996)), parallel tempering (see Orlandini, (1998)) and the simulated tempering algorithm (see Marinari and Parisi, (1992), Geyer and Thompson, (1995), and Madras, (1998)) are very popular in applications. Another variant are Multicanonical Monte Carlo Simulations, introduced by Berg and Neuhaus, (1992), also see Berg, (2000). It is related to umbrella sampling (see Torrie and Valleau, (1977)) and is close in spirit to the swapping algorithm, simulated tempering, as well as EES. A major difference, however, is, how an a priori estimate of the probability distribution of interest is produced. Therefore, we have not yet been able to analyze so far, whether Multicanonical Monte Carlo Simulations suffer from the same shortcomings as swapping, parallel tempering or EES (see next paragraph).
In many situations the algorithms described in the previous paragraph seem indeed to be able to improve the convergence of the Metropolis chain, however, there are also negative theoretical results about these algorithms. Madras and Zheng, (2003) show that the swapping chain converges quickly for the Curie–Weiss model. On the other hand, Bhatnagar and Randall, (2004) and Bhatnagar and Randall, (2016) prove that both, the swapping algorithm and simulated tempering, are slowly mixing for the 3-state Potts model and conjecture that this is caused by the first order phase transition in the Potts model (also see our discussion in the remark following Lemma 4.3). Qualitative properties of the swapping algorithm and parallel tempering were studied in Doll et al., (2018). A first rapid convergence result for the Swapping Algorithm in an disordered situation was proved in Löwe and Vermet, (2009). Ebbers and Löwe, (2009) show that in disordered models the conjecture by Bhatnagar and Randall is not correct. They prove that the Swapping Algorithm mixes slowly on the Random Energy Model, even though this model has only a third order phase transition. In the Blume-Emery-Griffiths model both, rapid or torpid mixing may occur as was shown in Ebbers et al., (2014).
Another idea to improve the performance of the Metropolis chain is the so called Equi-energy sampling algorithm (see e.g. Kou et al., (2006)). This model was tested on the Ising model in Hua and Kou, (2011) and the question, how fast it converges, was posed. For the Potts model, we will answer this question in the next section. In particular, we will show that EES may be slowly mixing on relevant models from statistical mechanics. Variants of EES were studied, among others, in Baragatti et al., (2013).
The principle observation to motivate EES is that a main obstacle to fast mixing is the presence of a phase transition in the model. This, in turn, may be characterized by a multi-modal distribution of a macroscopic observable. Usually, then the (projected) Metropolis chain enters one of the modes rapidly and stays there for an exponentially long time. The EES tries to avoid this behavior by introducing shortcuts in the state space. These shortcuts are created by the observations of Metropolis chains at higher temperatures where the above mentioned modes are less pronounced or possibly not even present. More precisely, additionally to the Metropolis steps one allows also for jumps to points of the same or a similar energy as the present one, given one has observed these points already at higher temperatures (otherwise, the algorithm would require the exact structure of the energy function, in which case simulations would probably be pointless). The EES has been discussed in Kou et al., (2006), its convergence was shown in the same article, and, using a different technique, in Andrieu et al., (2008). We will now give an exact description of a version of this algorithm.
Let us first briefly recall the Metropolis-Hastings algorithm, which is the basis of the EES. To define the first let denote the following aperiodic, symmetric and irreducible Markov chain on :
[TABLE]
Here is the Hamming distance and is a Markov chain, because every has neighbors. Define the corresponding Metropolis-Hastings chain for the probability as :
[TABLE]
Note that sometimes is slow in natural situations, e.g. when sampling from the low temperature distribution of the Curie-Weiss model (see e.g. Madras and Piccioni, (1999), of course, the has to be adapted to the situation of the Curie-Weiss model). To speed up its convergence, we consider the EES. To define it, we first introduce a sequence of energy levels:
[TABLE]
In our context, it is easily checked that
[TABLE]
which will be used later. Moreover, introduce a sequence of inverse temperature levels
[TABLE]
where we assume that is the temperature we want to sample from. It will often be convenient to take . Note that may and will depend on , which is not made explicit in Kou et al., (2006), otherwise our construction, so far, agrees with the construction in Kou et al., (2006). We will make an explicit choice for and give reasons for this choice, after the description of the algorithm
For this, we will also need a dummy state and define Let be an matrix over , which is initially filled with , only.
The EES consists of alternations between two steps. One is a usual Metropolis step at a (random) temperature level . The other one is an equi-energy jump at the same temperature, if . At inverse temperature 0 there are only Metropolis moves. We store the resulting energies of the states we see at temperature by entering them into the ’th row of the matrix , if the state has not been seen before. In this case it replaces one of the ’s (in a pre-described order). To explain the equi-energy step assume that the chain is at temperature and in state . We determine the energy level , such that and choose (with equal probabilities) a state from all states with , which we have already seen at temperature level . This new state is accepted with probability . Otherwise, especially, if we have not seen any state in the same energy band in the st row of , the chain stays where it is. We denote the corresponding transition matrix (on ) by . Note that in general depends on time. We will not make this explicit, because we will just analyze the algorithm in the ”best case scenario”, where the matrix does not contain any ’s anymore. However, under this assumption, we will still be able to show that EES is slowly converging on the three state mean-field Potts model in a certain temperature regime. Formally
[TABLE]
Here is the time variable and is the set of states with , which we have already seen at temperature level by time .
One might expect, that we indeed use all states we have seen previously, rather than the ones we explored with the chain at temperature . However, there is hardly any difference between the two chains, because if temperatures are very different the chains will typically also see states of very different energies. Our choice has the advantage that it is easy to see that the global chain to be described below is reversible and moreover, it agrees with the choice in the literature, see Kou et al., (2006).
Based on this, we build a matrix that describes the movement of all particles simultaneously. This operator will be a matrix on , of course. We lift the movement of the i’th particle to by building
[TABLE]
where is the identity matrix. Similarly, we consider the matrix that lifts the Metropolis step to , i.e. we consider
[TABLE]
Combining these operators the EES is defined by
[TABLE]
Note that the versions of the EES given in Kou et al., (2006) and Andrieu et al., (2008) differ from each other and also our version is slightly different from those. However, the spirit of the algorithms is the same.
In the sequel, we will only consider a number of energy levels that depends linearly on , such that . We will furthermore assume that are equi-distant. Indeed, this choice of is somewhat arbitrary, allowing for a polynomial dependence between and would not alter the algorithm much. However, choosing , e.g. exponentially large in , would lead to empty, or almost empty energy bands which would make the equi-energy step obsolete. Moreover, it would obviously lead to exponential relaxation times (in ), because exponentially many temperatures have to be simulated. On the other hand, having too small, e.g. constant, leads to almost non-interactive components (i.e. an equi-energy jump is almost never accepted) and EES stands no chance of increasing the speed of convergence compared to the standard Metropolis algorithm.
Of course, eventually we will only be interested in the ’st coordinate of this Markov chain. However, studying it entirely, seems easier. First of all, let us note that indeed, the distribution of the ’st coordinate converges to .
Theorem 3.1**.**
The distribution of the ’st coordinate converges to as time tends to infinity.
Proof.
This is the content of Andrieu et al., (2008) for their version of the EES. For our version the assertion follows from the ergodic theorem for Markov chains. Indeed, denote by the Markov chain on , where is the space of all matrices. will behave in its first component like while in the second component we keep record of the filling of . Observe that each is reversible with respect to and does not play any role for it. On the other hand, once we reach a situation where is entirely filled with states different from (we denote this state of by in the second coordinate of , i.e. we have seen all states at all temperatures, also all the equi-energy steps are reversible with respect to . This is, because , if and only if, and lie in the same energy band and follows from the construction of the transition probabilities. Thus, once is reached – which happens almost surely in finite time – is reversible with respect to
[TABLE]
Then the convergence follows from the convergence theorem for Markov chains. This, in particular, yields the assertion of the theorem. ∎
The proof is somewhat misleading, as it seems to indicate, that for exponentially large state spaces there is no hope that EES may converge in polynomial time, since first the state has to be reached. However, if we consider the high temperature situation in the Potts model the Metropolis-Hastings chain converges to its invariant distribution in polynomial time, even without any equi-energy steps.
On the other hand, we will see in the next section that in part of the low temperature regime the situation is even worse. Even, when we start in the optimal state in its second component, i.e. when we assume the second component is already in , the mixing time may be exponential.
4. Torpid mixing of EES on the low temperature mean-field Potts model
We now come to the central result of the note.
Theorem 4.1**.**
EES is slowly mixing on the 3-state mean-field Potts model, when , even when the second component of the Markov chain introduced in the proof of Theorem 3.1 above is in state .
We will prepare the proof of the theorem by explaining the ideas and stating some lemmas. In the proof of the theorem we will exploit one of the main differences between the mean-field Potts model and the Curie-Weiss model (i.e. when ) at low temperatures. This difference lies in the fact, that in the Curie-Weiss model the state where both colors occur equally often is a local minimum of the Gibbs measure at all low temperatures, while it is a local maximum of the Gibbs measure in the mean-field Potts model for some temperatures in the low temperature regime (also see Lemma 4.3). In particular, in the Curie-Weiss model, the Gibbs measure is flat in this state at the critical temperature while it exposes a local maximum in this state at the critical temperature in the Potts model. Thus, in the latter, EES will be very reluctant to move far away from a state with . This is the core idea, even though the technical steps are somewhat more involved.
Let be numbers in that add up to 1 and such that is an integer for each . Then for such that we have that
[TABLE]
[TABLE]
where
[TABLE]
and is . Note that we used Stirling’s formula to derive the second equality in (4.1) and the fact that we can rewrite as , if .
Letting to be the domain of (and the set of all probabilities on the space ), Gore and Jerrum show:
Lemma 4.2**.**
(cf.(Gore and Jerrum,, 1999, Proposition 1)) Let be a local maximum of . Then satisfies:
- (1)
* lies in the interior of .* 2. (2)
Either for all , or there are , such that for all . In the latter case there is exactly one equal to , while all the other are equal to .
Analyzing the function around the point we find that (in accordance with Lemma 4.2) it might be a local but not a global maximum of , if is not too large (a similar observation was already made in Kesten and Schonmann, (1989)):
Lemma 4.3**.**
If then is a local mode of , if is large enough, i.e. is a local maximum point of .
Proof.
In view of (4.1) is suffices to analyze . For , and consider . It is easy matter to check that and . The assertion follows. ∎
- Remark
Lemma 4.3 is a main reason why Theorem 4.1 is true. It is not difficult to check that the same behavior is true for general in an appropriately chosen temperature regime (depending on ). Therefore, also Theorem 4.1 could be proven for general . Indeed, the property shown in Lemma 4.3 is intrinsically related to the first order phase transition of the mean-field Potts model. Such a phase transition can be characterized by the discontinuous transition of the accumulation point(s) of an order parameter of the model at the critical inverse temperature . In the Potts model this order parameter is the variable . However, in most natural models, these new accumulation point(s) are already local maxima of the distribution of the order parameter for some smaller values of . Similarly, the old accumulation point(s) remain local maxima of the distribution of the order parameter for some larger values of . This is exactly the statement of Lemma 4.3.
Another key ingredient of the proof is a conductance argument (also known as Cheeger’s inequality in Diaconis and Stroock, (1991))
Theorem 4.4**.**
(Sinclair and Jerrum, (1989)) Let be a Markov chain on a finite set . Assume it is reversible with respect to . For all , define
[TABLE]
The conductance given by
[TABLE]
Then the following holds true for the spectral gap of :
[TABLE]
As follows e.g. from Diaconis and Stroock, (1991) a spectral gap that is the inverse of a polynomial in the problem instance results in fast mixing of the Markov chain. On the other hand, if the spectral gap is the inverse of an exponential in the problem instance, the Markov chain mixes slowly. An immediate consequence of Theorem 4.4 is that the Metropolis algorithm alone is slowly mixing on the low temperature Potts model.
Proposition 4.5**.**
The Metropolis algorithm mixes slowly on the Potts model, if .
Proof.
Take the macro-state , i.e. the maximum point of , where . This point exists according to Lemma 4.2 and because we are in the low temperature region. Since is a maximum of , there is such that is decreasing on the ball of radius centered in , , when we walk from the center to the boundary on a straight line. is one of the three points in which the distribution of concentrates for large and that are equally likely. Thus for
[TABLE]
we obtain that
[TABLE]
when is large enough and is fixed and small enough. Moreover, due to the exponential structure of , i.e.
[TABLE]
and the behavior of on (on , the function decreases like a multiple of the square of the two norms) we obtain that
[TABLE]
for a suitably chosen constant . But this implies that the set constitutes a ”bad cut”. Indeed with the notation of the previous theorem we see that
[TABLE]
Thus mixes slowly, when . ∎
As a consequence, if EES is fast on the low temperature Potts model, this will have to be caused by the equi-energy steps. However, the following important observation is that we will not be able to switch between two states that are at very different distances from the center mode by an equi-energy step. More precisely:
Lemma 4.6**.**
For each and each there is a number of spins such that for all and whenever and satisfy and (where denotes the 1-norm on ) then
[TABLE]
Here is defined in (3.4).
Proof.
The proof mainly shows that under the given conditions the energies of and are too far apart from each other. Indeed, observe that requires and to be in the same energy band. Thus there is such that . Now each with minimizes the energy to . On the other hand, the states where all spins point into the same direction have maximal energy cf. (3.3).
Thus, recalling that , the width of the energy bands is
[TABLE]
Therefore, and are only in the same energy band, if
[TABLE]
i.e. if the two norms and satisfy
[TABLE]
Since 1-norm and 2-norm are equivalent on this proves the assertion. ∎
We will again use a conductance argument to prove Theorem 4.1. In order to prepare it let us lift the balls to : For let
[TABLE]
From now on we will assume that . Recall that then still is a local (but not a global) maximum of the function . Let us fix so small, that still is decreasing on when we move away from the center (in particular, is the only mode of on ). Moreover, let us fix and so large that even with two equi-energy steps and a Metropolis step in between, a with cannot be reached from a with .
This can be constructed as in Lemma 4.6. Indeed, we will need the following: For given with there is , such that if an equi-energy jump started in will not leave . The subsequent Metropolis step can only increase the 1-distance of to by at most , hence is still in, say, , for some . Finally, there is , such that if an equi-energy jump started in will not leave . We will from now on always take .
All this is necessary because the chains and possibly comprise two such jumps. Next we prove
Lemma 4.7**.**
Let and and be chosen as above. Then, there exists such that for , , we have
[TABLE]
Proof.
According to our above analysis is a local (but not a global) maximum point of the distribution of under , if . Therefore
[TABLE]
for chosen appropriately. The proof of this statement follows the concepts of the proof of Proposition 4.5. This fact is easily transferred to the measure due to its product structure. ∎
With the help of this lemma we will be able to establish that the set constitutes a ”bad cut” for the Markov chain .
Proposition 4.8**.**
Consider the Markov chain on the state space where again is the space of all matrices. Here the first coordinate keeps record of the current state of chains, while the second coordinate tracks the filling of the matrix .
If and the second coordinate of is equal to its conductance satisfies
[TABLE]
for some , if is large enough.
Proof.
Since clearly , when is large enough. Thus also
[TABLE]
for large enough. Thus
[TABLE]
Here we used, of course, our previous estimates together with our construction of . Starting in the combination of an equi-energy jump, a Metropolis move, and another equi-energy jump will not leave according to Lemma 4.6 and the construction of and . ∎
Now we have prepared everything to prove Theorem 4.1.
Proof of Theorem 4.1.
Just note, that conditioned on the event that the second coordinate of is in (which it cannot leave anymore), is reversible with respect to . Hence we can apply Theorem 4.4 together with the conductance estimate of Proposition 4.8 to obtain the desired result. ∎
- Remark
Note that a similar proof would not work in the Curie-Weiss model, because there the ”center point” , i.e. the ’s where both directions for the spins occur equally often, is always a local minimum of the Gibbs measure at low temperatures.
Moreover, note that we could adapt the proof to different values of as mentioned above.
Finally, a similar argument should work for ”more disordered” models, as Potts models on sufficiently dense Erdös-Rényi graphs, as e.g. analyzed in Kabluchko et al., (2019) for .
- Remark
We have just seen that EES mixes slowly on the 3-state Potts model at , even when we know the energies of the entire set of states. We also argued that at high temperatures these temperature steps are not necessary, because already the Metropolis chain itself converges rapidly. However, one may doubt that there are reasonable models, at all, in which EES converges rapidly while the Metropolis algorithm does not. The point is, that, if we have not filled almost entirely, a temperature jump may provide the desired tunneling effect, but to a rather unfavorable point of the target distribution.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Andrieu et al., (2008) Andrieu, C., Jasra, A., Doucet, A., and Del Moral, P. (2008). A note on convergence of the equi-energy sampler. Stoch. Anal. Appl. , 26(2):298–312.
- 2Baragatti et al., (2013) Baragatti, M., Grimaud, A., and Pommeret, D. (2013). Parallel tempering with equi-energy moves. Stat. Comput. , 23(3):323–339.
- 3Berg, (2000) Berg, B. A. (2000). Introduction to multicanonical Monte Carlo simulations. In Monte Carlo methods (Toronto, ON, 1998) , volume 26 of Fields Inst. Commun. , pages 1–24. Amer. Math. Soc., Providence, RI.
- 4Berg and Neuhaus, (1992) Berg, B. A. and Neuhaus, T. (1992). Multicanonical ensemble: A new approach to simulate first-order phase transitions. Phys. Rev. Lett. , 68:9–12.
- 5Bhatnagar and Randall, (2004) Bhatnagar, N. and Randall, D. (2004). Torpid mixing of simulated tempering on the Potts model. In Proceedings of the Fifteenth Annual ACM-SIAM Symposium on Discrete Algorithms , pages 478–487 (electronic), New York. ACM.
- 6Bhatnagar and Randall, (2016) Bhatnagar, N. and Randall, D. (2016). Simulated tempering and swapping on mean-field models. J. Stat. Phys. , 164(3):495–530.
- 7Cuff et al., (2012) Cuff, P., Ding, J., Louidor, O., Lubetzky, E., Peres, Y., and Sly, A. (2012). Glauber dynamics for the mean-field Potts model. J. Stat. Phys. , 149(3):432–477.
- 8Diaconis and Stroock, (1991) Diaconis, P. and Stroock, D. (1991). Geometric bounds for eigenvalues of Markov chains. Ann. Appl. Probab. , 1(1):36–61.
