Exponentially slow mixing in the mean-field Swendsen-Wang dynamics
Reza Gheissari, Eyal Lubetzky, Yuval Peres

TL;DR
This paper proves that the Swendsen-Wang dynamics for the mean-field Potts model with three or more colors exhibits exponential mixing time in the number of vertices at the critical window, confirming slow mixing behavior.
Contribution
It establishes a tight exponential lower bound on the mixing time of Swendsen-Wang dynamics in the mean-field setting at criticality, improving previous subexponential bounds.
Findings
Mixing time is at least exponential in the number of vertices.
The result applies to the mean-field Potts model with q≥3.
The same exponential bound holds for related FK model samplers.
Abstract
Swendsen-Wang dynamics for the Potts model was proposed in the late 1980's as an alternative to single-site heat-bath dynamics, in which global updates allow this MCMC sampler to switch between metastable states and ideally mix faster. Gore and Jerrum (1999) found that this dynamics may in fact exhibit slow mixing: they showed that, for the Potts model with colors on the complete graph on vertices at the critical point , Swendsen-Wang dynamics has . The same lower bound was extended to the critical window around by Galanis et al. (2015), as well as to the corresponding mean-field FK model by Blanca and Sinclair (2015). In both cases, an upper bound of was known. Here we show that the mixing time is truly exponential in : namely, $t_{\mathrm{mix}} \geq \exp…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Exponentially slow mixing in the
mean-field Swendsen–Wang dynamics
Reza Gheissari
R. Gheissari Courant Institute
New York University
251 Mercer Street
New York, NY 10012, USA.
,
Eyal Lubetzky
E. Lubetzky Courant Institute
New York University
251 Mercer Street
New York, NY 10012, USA.
and
Yuval Peres
Y. Peres Microsoft Research
1 Microsoft Way
Redmond, WA 98052, USA.
Abstract.
Swendsen–Wang dynamics for the Potts model was proposed in the late 1980’s as an alternative to single-site heat-bath dynamics, in which global updates allow this MCMC sampler to switch between metastable states and ideally mix faster. Gore and Jerrum (1999) found that this dynamics may in fact exhibit slow mixing: they showed that, for the Potts model with colors on the complete graph on vertices at the critical point , Swendsen–Wang dynamics has . The same lower bound was extended to the critical window around by Galanis *et al. *(2015), as well as to the corresponding mean-field FK model by Blanca and Sinclair (2015). In both cases, an upper bound of was known. Here we show that the mixing time is truly exponential in : namely, for Swendsen–Wang dynamics when and , and the same bound holds for the related MCMC samplers for the mean-field FK model when .
1. Introduction
The mean-field -state Potts model is a canonical statistical physics model extending the Curie–Weiss Ising model () to possible states; for , it is one of the simplest models to exhibit a discontinuous (first-order) phase transition. Formally, the mean-field -state Potts model with parameter is a probability distribution over , given by , where . The model exhibits a phase transition at from a disordered phase (), where the sizes of all color classes concentrate around , to an ordered phase (), where there is typically one color class of size for (see §2).
As a means of overcoming low-temperature bottlenecks in the energy landscape (dominant color classes), Swendsen and Wang [19] introduced a non-local reversible Markov chain, relying on the random cluster (FK) representation of the Potts model. The mean-field FK model is the generalization of —the Erdős–Rényi random graph—parametrized by , in which the probability of a graph , identified with , is given by , where is the number of connected components of (clusters of ).
Via the Edwards–Sokal coupling [8] of the -state Potts model at inverse temperature and the FK model with parameters with , the mean-field Swendsen–Wang dynamics can be formulated as follows: consider a mean-field Potts configuration with being the sets of vertices . An update of the dynamics, started from , first samples, independently for every , a configuration on the subgraph of , forming an FK configuration as the union of the ’s; then, it assigns an i.i.d. color to each cluster in , and for every , sets in the new state of the Markov chain.
As apparent from the second (coloring) stage of the Swendsen–Wang algorithm, it can seamlessly jump between the ordered low-temperature metastable states where one color is dominant. It was thus expected that this MCMC sampler would converge quickly to equilibrium at all temperatures; e.g., its total variation mixing time , formally defined in §2, would be at most polynomial in the system size for all .
Indeed, at (the Ising model) Cooper, Dyer, Frieze and Rue [6] proved that, on the complete graph, Swendsen–Wang has at all (it was later shown in [17] that at while at ), and Guo and Jerrum [14] recently showed that for any -vertex graph and all , Swendsen–Wang has (this is in contrast to single-site dynamics, where at low temperature [7]).
Countering this intuition, however, Gore and Jerrum [13] found in 1999 that, for any , the Swendsen–Wang dynamics for the mean-field -state Potts model has for some at its critical point . This is a consequence of the discontinuity of the phase transition of the mean-field Potts model for , where at , both the ordered phases (with one dominant color class) and the disordered phase (with all color classes having roughly sites) are metastable.
On the lattice , the Potts model exhibits a discontinuous phase transition for some choices of (depending on ); there it was shown in [4], following [3], that Swendsen–Wang dynamics in fact has for all sufficiently large, suggesting that an exponential lower bound in should also hold in mean-field, believed to approximate high-dimensional tori. (The matching upper bound of [4] applies to general graphs and translates to on the complete graph.) On , this lower bound was extended [11] to where the phase transition is first-order (all ).
For the Glauber dynamics of the mean-field Potts model, when , the mixing time for all was characterized in [7], where it was shown that, in discrete-time, has order at , order at , and finally at , where is the spinodal point corresponding to the onset of ordered metastable phases. Recently, Galanis, Štefankovic and Vigoda [9] analyzed the mixing time of the analogous mean-field Swendsen–Wang dynamics, finding it to mix in polynomial time111It was shown in that work that for , whereas at . both at high temperature and—unlike Glauber dynamics—at low temperatures, for all outside a critical window around , where the critical point (mirroring the spinodal point ) marks the disappearance of metastability of the disordered phase.
For , Swendsen–Wang was shown in [9] to slow down to (extending the lower bound at due to Gore and Jerrum). Analogously, for the related Glauber dynamics for the mean-field FK model (see §2 for precise definitions) with , Blanca and Sinclair [1] proved that whenever is in the critical window . The fact that three significant papers, over a period of almost twenty years, all presented a lower bound of the form , left open the possibility that this is the true order of the mixing time inside the critical window.
Our main result is that the mixing time of the mean-field Swendsen–Wang dynamics is truly exponential in at criticality, similar to the single-site Glauber dynamics.
Theorem 1**.**
Let be a fixed integer, and consider the Swendsen–Wang dynamics for the -state mean-field Potts model on vertices at inverse temperature . There exists some such that, for all large enough, .
The case of non-integer (the mean-field FK model) is more delicate: the analogue of Swendsen–Wang in this setting is Chayes–Machta dynamics [5], which we analyze via a recursive application of the fundamental lemma of Bollobás, Grimmett and Janson [2]. As in [1], comparison results of [20] extend the result to heat-bath Glauber dynamics.
Theorem 2**.**
Fix , and consider Glauber dynamics for the mean-field FK model on vertices with parameters where . There exists such that for large enough . The same holds for Chayes–Machta dynamics.
To outline our approach for proving Theorems 1–2, we first sketch the argument of [13], thereafter adapted to in [9] and to the FK model in [1]. Starting from a Potts configuration where each color class has vertices, since , for small enough , this corresponds to a subcritical Erdős-Rényi random graph in the first stage of the Swendsen–Wang dynamics. The exponential tail of component sizes in this regime shows that, for a sequence , with probability at least , no cluster in the edge configuration we obtain is larger than ; on this event, the component sizes satisfy , thus by Hoeffding’s inequality, with probability , every new color class will have vertices, and in particular no dominant color class would emerge. In this argument, choosing balances the two probability estimates to . However, at , the Potts model does admit a dominant color class with positive (uniformly bounded away from 0) probability, thus the mixing time is at least .
In order to improve this lower bound into per Theorem 1, instead of looking at the size of the largest component after the stage of the dynamics, we consider , the set of vertices in connected components of size larger than . We show that, whenever the stage is subcritical and is sufficiently large, the probability that is at most . Moreover, given , Hoeffding’s inequality implies that, following the second stage of the dynamics, all the new color classes will have vertices except with probability , yielding . The proof of Theorem 2 follows a similar path, yet involves additional equilibrium estimates on the conditional probabilities under , as the Chayes–Machta dynamics resamples a strict subset of the configuration in each step.
2. Preliminaries
Throughout this paper, we use the notation for two sequences to denote , and let denote . We re-parametrize the FK and Potts models by instead of and via the relations and , to allow us to treat the FK and Potts models in a unified manner. We will consider these models on the complete graph on vertices, .
Denote by , the Potts measure (with such that ) and by the corresponding FK measure with on the complete graph on vertices. The FK model with corresponds precisely to the Erdős–Rényi random graph and we use the shortened notation . We occasionally use to denote the mean-field FK model given by .
For any FK configuration , enumerate the clusters of in decreasing size and let . For a vertex let, denote the cluster to which belongs.
For all define the critical points and for , define
[TABLE]
so that for , we have (see e.g., [9, 1]). The critical points correspond to the parameters of emergence and disappearance of metastability, where at , the ordered and disordered metastable states have the same free energy. These two critical points can also have the following alternative interpreation [9]: corresponds to the first uniqueness/non-uniqueness threshold of the -regular infinite tree, and should correspond to a second uniqueness/non-uniqueness threshold of the -regular tree with periodic boundary conditions.
The FK and Potts phase transitions
The following give a description of the static phase transition undergone by the mean-field FK and Potts models respectively. Let be the largest solution of so when .
Proposition 2.1** ([2, Thms. 2.1–2.2],[18, Thm. 19]).**
Consider the -vertex mean-field FK model with parameters with ; if , for every , we have whereas if , for every , we have . If , there exists so that for all , and .
Corollary 2.2**.**
Consider the mean-field Potts model parametrized by and . If , for any ,
[TABLE]
and if , then there exists such that for sufficiently small ,
[TABLE]
If and , there exists so that for all sufficiently small ,
[TABLE]
Cluster dynamics
Swendsen–Wang dynamics for the -state Potts model on with parameter such that is the following discrete-time reversible Markov chain. From a Potts configuration on , generate a new state as follows.
- (1)
Introduce auxiliary edge variables and for set if on each of the sets of vertices of of the same color, , independently sample . 2. (2)
For every connected component of the resulting , reassign the cluster, collectively, an i.i.d. color in , to obtain the new configuration .
Chayes–Machta dynamics for the FK model on with parameters , for and , is the following discrete-time reversible Markov chain: From an FK configuration on , generate a new state as follows.
- (1)
Assign each cluster of an auxiliary i.i.d. variable . 2. (2)
Resample every such that and belong to active clusters () via i.i.d. random variables , yielding a new configuration .
Variants of Chayes–Machta dynamics with “active colors” have also been studied, with numerical evidence for being the most efficient choice; see [10].
Glauber dynamics for the FK model
Swendsen–Wang dynamics is closely related to the FK model; much of the analysis of Swendsen–Wang dynamics on general graphs has been via the Glauber dynamics for the corresponding FK model. Discrete-time Glauber dynamics [12] for the FK model on with is as follows: select an edge in uniformly at random and update according to .
Size of largest component and drift functions
For , let be the unique positive root of . Recall the following tail estimates for in .
Fact 2.3** (e.g., cf. [15, p. 109]).**
Consider with . Then for any ,
[TABLE]
In particular, \pi_{n,\lambda,1}({\mathscr{L}}_{1}\geq k)\leq n\exp\big{(}-(1-\lambda)^{2}k/2\big{)}.
Proposition 2.4** ([17, Lemma 5.4]).**
Consider with . There exists such that for every ,
[TABLE]
For the proof of Theorem 1, following [13, 9] define the drift function for the average size of the largest color class of the Swendsen–Wang dynamics
[TABLE]
The function has, for some values of a second fixed point besides , which we denote by , which solves
[TABLE]
Proposition 2.5** ([9, Lemma 5]).**
If , the fixed point is such that and moreover if , we have . Moreover, if and , is a Jacobian attractive fixed point of so that .
Similarly to the above, we can define the function given by
[TABLE]
which governs the mean drift of the size of the giant component in Chayes–Machta dynamics. We can also define to be the largest solution to . Following [1], let , observe that if , , and define the drift function .
Proposition 2.6** ([1, Lemma 2.14]).**
When and , the drift function has two roots, in ; moreover, is strictly positive on .
Mixing time and spectral gap
In this section, we introduce the quantities of interest regarding the time for the Swendsen–Wang and Glauber dynamics to reach equilibrium. Consider a Markov chain with finite state space and transition matrix reversible with respect to . For two measures , define their total variation distance by
[TABLE]
Then the mixing time of is defined as
[TABLE]
A related quantity that is sometimes easier to work with is the spectral gap of ; Since is reversible with respect to , we can enumerate its spectrum from largest to smallest as ; then the spectral gap of is defined as . The following is a standard comparison between the inverse spectral gap and the mixing time of a Markov chain with transition matrix (see e.g., [16]):
[TABLE]
Spectral gap comparisons
The following comparison inequalities between the aforementioned Markov chains are due to Ullrich.
Proposition 2.7** ([20]).**
Let be integer. Let be the spectral gap of Glauber dynamics FK model on a graph and let be the spectral gap of Swendsen–Wang. Then
[TABLE]
The proof of (2.2) further extends to all real , whence
[TABLE]
as was observed (and further generalized) by Blanca and Sinclair [1, §5], where is the spectral gap of Chayes–Machta dynamics.
3. Slow mixing of Swendsen–Wang dynamics
Towards the proof of Theorem 1, we first establish some preliminary estimates. For , we will frequently be interested in bounding the following quantity:
[TABLE]
The bottlenecks in the proofs of Theorems 1–2 both rely on the following estimate.
Lemma 3.1**.**
Consider with fixed. There exists such that for every , there exists such that for every ,
[TABLE]
Proof.
Recall that by Fact 2.3, there exists such that for all . Moreover, conditioned on other clusters, the remaining graph is distributed as for , so that for any vertices ,
[TABLE]
Let be the number of clusters with at least vertices; then so that by Azuma–Hoeffding inequality,
[TABLE]
Now let
[TABLE]
plugging into the Azuma-Hoeffding bound, we obtain
[TABLE]
In order to bound the right-hand side above, fix vertices ; the joint law of is dominated by the sum of i.i.d. random variables , where, for some (independent of and ), is sub-exponential with parameters and has mean . By the definition of , for any sufficiently large (depending on ), . By a union bound and symmetry, we have
[TABLE]
Moreover, is also sub-exponential with parameters . Therefore, there exists so that for all , there exists such that for all ,
[TABLE]
Plugging this bound in to (3.1) concludes the proof. ∎
In the coloring stage of the Swendsen–Wang dynamics, the following simple application of a Chernoff-Hoeffding inequality proves useful.
Lemma 3.2**.**
Consider an FK realization on vertices and suppose for some . Independently color each cluster of collectively red with probability , and let be the set of all red vertices. For all ,
[TABLE]
Proof.
We consider and separately. To bound the former, it suffices to prove an upper bound on
[TABLE]
which by Hoeffding’s inequality satisfies
[TABLE]
Similarly bounding by Hoeffding’s inequality and combining the two via a union bound concludes the proof. ∎
We prove Theorem 1 for separately for that is below, above and at .
3.1. The supercritical regime: proof of Theorem 1 for the case
To prove Theorem 1 for , let , and define the set of configurations,
[TABLE]
Now consider the Markov chain and let be the corresponding vector counting the number of sites in each state in . We need the following claim.
Claim 3.3**.**
Consider Swendsen–Wang dynamics with for ; there exists such that that for every
[TABLE]
Proof.
Consider a fixed . In the step of the Swendsen–Wang dynamics, we consider the color components separately. For each of the colored components a new edge configuration is sampled according to where ; call the edge configuration we obtain and note that by definition of Swendsen–Wang dynamics, the clusters of will all be disconnected. Then since and , if , every colored component is sub-critical in the step. Thus, for all , by Lemma 3.1, for some , if , for every and every ,
[TABLE]
Union bounding over the different such components, we obtain
[TABLE]
In that case, if and is the edge configuration induced on the whole graph after the step of the dynamics, there exists so that for ,
[TABLE]
We can then split up
[TABLE]
and consider the coloring step of the Swendsen–Wang dynamics. Then we obtain
[TABLE]
By an application of Lemma 3.2 with and a union bound, the above is, for and , bounded above by
[TABLE]
Since all the above estimates were uniform in , we obtain the desired. ∎
By Corollary 2.2, since is such that , for every small , we have . If is such that , clearly , and by Claim 3.3 and a union bound, since , there exists such that for every ,
[TABLE]
The definition of total variation mixing time then implies as desired. ∎
3.2. The subcritical regime: proof of Theorem 1 for
We first prove the following consequence of Lemma 3.1.
Lemma 3.4**.**
Consider with . There exist such that for every and sufficiently small and for every , we have
[TABLE]
Proof.
By a union bound, rewrite the left-hand side above as
[TABLE]
Since , by Fact 2.4, we have that for some . Now suppose and note that conditioning on , since the remaining graph is disconnected from , it must be distributed as which, since , is subcritical for small . In that case, by Lemma 3.1, given , there exists such that for every , there exists so that for ,
[TABLE]
combined with the union bound, this implies the desired. ∎
The proof of Theorem 1 for is a slight modification of the proof for . Recall the definitions of and from §2. Fix . In decreasing order, let the number of vertices in each color class of be and let
[TABLE]
By Corollary 2.2, since , for sufficiently small , we have . Therefore, it suffices by definition of total variation mixing to prove the following.
Claim 3.5**.**
Consider Swendsen–Wang dynamics with for ; there exist such that for every ,
[TABLE]
Proof.
Fix any and let be its corresponding color class vector. By definition of , for some there exists such that if , we have . From now on we take .
Consider the step of the Swendsen–Wang dynamics. Since , and , so that for sufficiently small, the first colored class of will be supercritical in the step and the other will all be subcritical; call the random graph configurations we obtain in this step for . Now fix such a and let . By Fact 2.4, we obtain that for some ,
[TABLE]
Moreover, by Lemma 3.1, we also have for some , for every ,
[TABLE]
On the complement of the above event, has a single giant component of size for , and . By Lemma 3.2, with probability , the largest color class of will be the one containing so without loss, we also assume that is the case.
At that stage, observe that and for . Then, first assigning the giant component a color, then using Lemma 3.2, we obtain that for some , for every ,
[TABLE]
By a similar bound on the other coloring steps and the choice ,
[TABLE]
By the choice of and the triangle inequality, this implies
[TABLE]
which by uniformity of the estimates over , concludes the proof. ∎
3.3. The critical point: proof of Theorem 1 for
In Corollary 2.2, for every , either in which case Claim 3.5 concludes the proof, or in which case Claim 3.3 concludes the proof. ∎
4. Slow mixing of Glauber dynamics for the FK model
Since for noninteger, Chayes–Machta dynamics activates a strict subset of the vertices at a time, we will need to use a modified argument to prove Theorem 2. We instead construct a bottleneck set and bound its bottleneck ratio. For , let
[TABLE]
for a chain with stationary distribution and kernel ; the Cheeger constant of is
[TABLE]
In order to prove the lower bound of Theorem 2, we prove such a lower bound on the inverse spectral gap of the Chayes–Machta dynamics, then using Proposition 2.7 and a standard comparison between the spectral gap and mixing time (2.1), we obtain the desired for the Glauber dynamics. Before the proof of Theorem 2, we prove some preliminary equilibrium bottleneck estimates for the mean-field FK model.
The following lemma that was fundamental to the understanding of the distribution in [2] is very useful for the proof of Theorem 2.
Lemma 4.1** ([2, Lemma 3.1]).**
Fix ; consider a mean-field FK realization . Independently color each cluster of red with probability and let be the collection of all red vertices. Conditional on , the subgraph is distributed according to and the subgraph is distributed according to .
The following corollary follows from iterating the process of Lemma 4.1 times.
Corollary 4.2**.**
Consider a mean-field FK realization . Independently color each cluster of color with probability each and otherwise. Then letting be the sets of vertices colored each of , the subgraph restricted to for is distributed according to . The subgraph restricted to is distributed according to . Moreover, the distributions of the color classes are (conditionally on ) independent.
(Note that when is integer, the set is deterministically empty.) Via Lemma 4.1, we prove the following analogues of Lemmas 3.1 and 3.4 when .
Lemma 4.3**.**
Consider the mean-field FK model on vertices with parameters with and . There exists such that for all sufficiently small, there exists such that for all ,
[TABLE]
Proof.
We prove the desired using Lemma 4.1. Consider the random graph with the choice of ; applying Lemma 4.1 to with , by [2, Lemma 9.1], for all , we have , for some . Then, we can write for any event ,
[TABLE]
where is the distribution over colorings of , averaged over realizations of . Letting , for every the probability on the right-hand side is bounded above by which, by Lemma 3.1, satisfies
[TABLE]
for some and for every and every . But by Lemma 4.1,
[TABLE]
which combined with implies
[TABLE]
Lemma 4.4**.**
Consider the mean-field FK model on vertices with parameters with and . There exists such that for all sufficiently small, there exists such that for all ,
[TABLE]
Proof.
As before, consider with ; by Lemma 4.1 with and [2, Lemma 9.1], . Let in (4.2). Then observe that and by Lemma 3.4, . Altogether, plugging the above bounds in to (4.2) implies that there exists such that for all and all ,
[TABLE]
4.1. The supercritical/critical regime,
We first prove the desired mixing time lower bound for , using the following bottleneck estimate.
Lemma 4.5**.**
Consider the mean-field FK model on vertices with parameters where and ; there exists such that for all sufficiently small , there exists such that for every ,
[TABLE]
Proof.
For define the sets
[TABLE]
In order to bound , use the coloring scheme described in Corollary 4.2. Let be the set of all possible partitions of into sets, i.e., the set of all possible colorings of FK configurations. Denote by the probability measure over colorings averaged over , and the probability measure over such colorings, averaged over . For every ,
[TABLE]
Then we can write, by Corollary 4.2,
[TABLE]
By Lemma 3.2, since implies , for every ,
[TABLE]
If for all , we are left with a remainder set satisfying
[TABLE]
Define the event over colorings of the mean-field FK model as
[TABLE]
so that the above conclusion can be written as
[TABLE]
Combined with the expression for , this implies that
[TABLE]
By a union bound, the first term on the right-hand side is bounded above by
[TABLE]
We lower bound the numerator and upper bound the denominator simultaneously as they entail similar estimates.
Since , there exists such that for all , the random graph is subcritical and the FK model is also subcritical. In other words, if , for every , the distributions for and are all subcritical. As such, by Lemma 3.1, there exists such that for every ,
[TABLE]
Similar bounds under follow immediately for a different from Lemma 4.3. Altogether, this implies that for every and every , there exists such that
[TABLE]
Proof of Theorem 2: the case .
For , recall the definitions of and . By Proposition 2.1, for , for sufficiently small and large , there exists such that . Then by (4.1) it suffices to prove an exponentially decaying upper bound on
[TABLE]
where are the transition matrix and edge measure, respectively, of the Chayes–Machta dynamics. We first bound the first term in the right-hand side of (4.4).
Consider some . In the activation stage of the Chayes–Machta dynamics, clusters are activated with probability ; denote by the set of activated vertices in this stage of the dynamics. Since , by Lemma 3.2 with the choice of ,
[TABLE]
Since , for , the random graph is subcritical. In that case, by Lemma 3.1, there exists such that for every ,
[TABLE]
Union bounding over the event and its complement, there exists such that for every , for every ,
[TABLE]
Lemma 4.5 yields a similar exponentially decaying upper bound on the second term on the right-hand side of (4.4), concluding the proof. ∎
4.2. The subcritical/critical regime,
Recall the definitions of and corresponding to the drift function . When , we will need the following intermediate lemma, before proceeding to the analogue of Lemma 4.5. This is a straightforward adaptation of an argument of [1].
Lemma 4.6**.**
Consider the mean-field FK model on vertices with parameters with ; let . Color red and independently color each cluster in red with probability ; let be the set of all red vertices. Resample and let be the resulting configuration on vertices; there exists so that for sufficiently small , for every , uniformly in ,
[TABLE]
Proof.
Fix any and let for . Then
[TABLE]
so that by Lemma 3.2, for all ,
[TABLE]
Therefore, we can write for every ,
[TABLE]
For all , for sufficiently small , using , since ,the random graph is supercritical. By continuity of , for any , there exists sufficiently small such that ; moreover, by Fact 2.4, for every
[TABLE]
for some . Thus, for sufficiently small , we have, for some ,
[TABLE]
It remains to argue that for sufficiently small, there exists such that for all , we have . If , then by [1, Lemma 2.14], and for small enough letting yields the desired. If , since is positive on , for small, . By continuity of , for , letting , we obtain
[TABLE]
Together, for sufficiently small, there exists such that
[TABLE]
The following is the analogue of Lemma 4.5 in the presence of a giant component.
Lemma 4.7**.**
Consider the mean-field FK model on vertices with parameters with and ; for every let
[TABLE]
There exists such that for sufficiently small , for ,
[TABLE]
Proof.
Fix and for , define the sets
[TABLE]
We prove the lemma similarly to Lemma 4.5, after treating the giant component separately. Using the coloring scheme of Corollary 4.2, with and defined as before, by considering the color class to which belongs, and using symmetry, we obtain
[TABLE]
Call the two sums on the right hand side I and II respectively and consider them separately. Conditional on and , if ,
[TABLE]
where we used Lemma 3.2 with . Following the proof of Lemma 4.5, let
[TABLE]
By Lemma 3.2 and a union bound, .
Using the fact that for every , , we can write
[TABLE]
If , for sufficiently small , the definition of and implies is supercritical, and both and are subcritical. By a union bound we can expand the numerator above as at most
[TABLE]
and analogously, the denominator as at least
[TABLE]
(In both of the above, we paid a cost of for the assumption .) By Lemma 3.4, (for every and , is subcritical) there exists such that for sufficiently small and every ,
[TABLE]
Moreover, as in the proof of Lemma 4.5, by Lemmas 3.1 and 4.3, we also have that for that there exists such that for every ,
[TABLE]
Clearly, analogous bounds hold for all of the above when replacing with . Combining all of the above bounds and plugging them in to the right-hand side of
[TABLE]
yields an exponentially decaying upper bound on the sum I. The bound on the sum II is very similar. Letting , we define
[TABLE]
As before, by Lemma 3.2, we can write
[TABLE]
and observe that for every , since , for sufficiently small , the FK model is supercritical and the random graphs are subcritical for all . By Lemmas 3.1 and 4.4, there exists such that for every sufficiently small and every ,
[TABLE]
and by the same reasoning, analogous bounds hold when replacing with . Then expanding the fraction in the upper bound on II as done in the bound on I implies there exists such that for sufficiently small and every ,
[TABLE]
We are now in position to complete the proof of Theorem 2.
Proof of Theorem 2: the case .
The proof when is similar to the extension of slow mixing for the Swendsen–Wang dynamics when to . Recall that for fixed , the two zeros of were denoted so that is positive on . We again use a conductance estimate to lower bound the inverse gap of the Chayes–Machta dynamics. Define for every ,
[TABLE]
As in (4.4), by (4.1) it suffices to show an exponentially decaying upper bound on
[TABLE]
for sufficiently small and large ; this is because by Proposition 2.1, for all small enough , we have . We bound the two terms above separately as in the proof for . First of all, note by Lemma 4.7 that the second term on the right-hand side is bounded above by for some for every sufficiently small and every .
Now consider any and bound under the Chayes–Machta dynamics. We split the transition probability of the Chayes–Machta dynamics into the case when is activated and is not activated; let denote the set of activated vertices. If , we have and since , by Lemma 3.2, if , then
[TABLE]
If , for sufficiently small , since , the random graph is subcritical, in which case with probability at least , . By Lemma 3.1, there exists such that for sufficiently small and every ,
[TABLE]
Thus, for some , for small enough , and every ,
[TABLE]
Now suppose that ; then one step of Chayes–Machta dynamics is described precisely by the set up of Lemma 4.6, with replaced by , yielding
[TABLE]
for some for all sufficiently small and . On the complement of that event, deterministically . By Lemma 3.4, for some , for small and large ,
[TABLE]
Combining the above, we deduce that there exists such that for all sufficiently small , for every , we have , concluding the proof of Theorem 2 when . ∎
Acknowledgment
R.G. thanks the theory group of Microsoft Research Redmond for its hospitality during the time some of this work was carried out. E.L. was supported in part by NSF grant DMS-1513403.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Blanca and A. Sinclair. Dynamics for the mean-field random-cluster model. In Proc. of the 19th International Workshop on Randomization and Computation (RANDOM 2015) , pages 528–543, 2015.
- 2[2] B. Bollobás, G. Grimmett, and S. Janson. The random-cluster model on the complete graph. Probab. Theory Related Fields , 104(3):283–317, 1996.
- 3[3] C. Borgs, J. Chayes, A. Frieze, J. H. Kim, P. Tetali, E. Vigoda, and V. H. Vu. Torpid mixing of some monte carlo markov chain algorithms in statistical physics. In Proc. of the 40th Annual Symposium on Foundations of Computer Science (FOCS 1999) , pages 218–229, 1999.
- 4[4] C. Borgs, J. T. Chayes, and P. Tetali. Tight bounds for mixing of the Swendsen-Wang algorithm at the Potts transition point. Probab. Theory Related Fields , 152(3-4):509–557, 2012.
- 5[5] L. Chayes and J. Machta. Graphical representations and cluster algorithms i. discrete spin systems. Physica A: Statistical Mechanics and its Applications , 239(4):542–601, 1997.
- 6[6] C. Cooper, M. E. Dyer, A. M. Frieze, and R. Rue. Mixing properties of the Swendsen-Wang process on the complete graph and narrow grids. J. Math. Phys. , 41(3):1499–1527, 2000. Probabilistic techniques in equilibrium and nonequilibrium statistical physics.
- 7[7] P. Cuff, J. Ding, O. Louidor, E. Lubetzky, Y. Peres, and A. Sly. Glauber dynamics for the mean-field Potts model. J. Stat. Phys. , 149(3):432–477, 2012.
- 8[8] R. G. Edwards and A. D. Sokal. Generalization of the Fortuin-Kasteleyn-Swendsen-Wang representation and Monte Carlo algorithm. Phys. Rev. D (3) , 38(6):2009–2012, 1988.
