Independent sets in the hypercube revisited
Matthew Jenssen, Will Perkins

TL;DR
This paper improves the asymptotic understanding of independent sets in the hypercube by combining combinatorial and statistical physics methods, providing sharper results and detailed structural insights.
Contribution
It introduces a novel combination of graph container methods with cluster expansion techniques to refine asymptotics and structural descriptions of independent sets in the hypercube.
Findings
Sharper asymptotic formulas for independent sets
Detailed probabilistic structure of typical independent sets
Answers to open questions posed by Galvin
Abstract
We revisit Sapozhenko's classic proof on the asymptotics of the number of independent sets in the discrete hypercube and Galvin's follow-up work on weighted independent sets. We combine Sapozhenko's graph container methods with the cluster expansion and abstract polymer models, two tools from statistical physics, to obtain considerably sharper asymptotics and detailed probabilistic information about the typical structure of (weighted) independent sets in the hypercube. These results refine those of Korshunov and Sapozhenko and Galvin, and answer several questions of Galvin.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Independent sets in the hypercube revisited
Matthew Jenssen
School of Mathematics, University of Birmingham, Birmingham, UK
and
Will Perkins
Department of Mathematics, Statistics, and Computer Science
University of Illinois at Chicago
851 S. Morgan, Chicago, IL
Abstract.
We revisit Sapozhenko’s classic proof on the asymptotics of the number of independent sets in the discrete hypercube and Galvin’s follow-up work on weighted independent sets. We combine Sapozhenko’s graph container methods with the cluster expansion and abstract polymer models, two tools from statistical physics, to obtain considerably sharper asymptotics and detailed probabilistic information about the typical structure of (weighted) independent sets in the hypercube. These results refine those of Korshunov and Sapozhenko and Galvin, and answer several questions of Galvin.
Key words and phrases:
independent sets, hypercube, cluster expansion
1991 Mathematics Subject Classification:
05C30, 05C31, 82B20
1. Introduction
Let denote the discrete hypercube of dimension : the graph with vertex set with edges between vectors that differ in exactly one coordinate. An independent set in a graph is a set of vertices that induce no edges. Let denote the number of independent sets of .
Korshunov and Sapozhenko proved the following result on the number of independent sets of the hypercube.
Theorem 1** (Korshunov and Sapozhenko [17]).**
[TABLE]
as .
A beautiful and influential proof of Theorem 1 was later given by Sapozhenko in [22]. See [8] for an exposition of this proof.
One of our main results in this paper will be to reinterpret Sapozhenko’s proof in terms of the cluster expansion from statistical physics. This allows us to compute additional terms in the asymptotic expansion of among other things. For instance, we can compute the asymptotics to the third order in .
Theorem 2**.**
[TABLE]
as .
More generally, we give a formula and an algorithm for computing the asymptotics to arbitrary order in .
Theorem 1 (along with Sapozhenko’s techniques) provided the first glimpse of a rich landscape of phenomena concerning independent sets in . To describe the phenomena we take the perspective of statistical physics. The independence polynomial of the hypercube is
[TABLE]
where is the set of all independent sets of . In particular, . The independence polynomial is the partition function of the hard-core model from statistical physics: a probability distribution on independent sets weighted by the fugacity parameter . This distribution is defined by
[TABLE]
The hard-core model (or hard-core lattice gas) is a simple model of a gas, and in statistical physics it is most commonly studied on the integer lattice . As is common in the literature, we will refer to vertices contained in an independent set drawn from the hard-core model as ‘occupied’.
Let be the set of ‘even’ vertices of the hypercube whose coordinates sum to an even number and let be the ‘odd’ vertices whose coordinates sum to an odd number. We note that is a bipartite graph with bipartition . Kahn [15] showed that for constant , typical independent sets drawn from contain either mostly even vertices or mostly odd vertices, and thus the hard-core model on exhibits a kind of ‘phase coexistence’ in the language of statistical physics.
By generalizing Sapozhenko’s techniques, Galvin [7] was able to describe the typical structure of independent sets drawn from in greater detail and for a wider range of parameters . We need two definitions to describe these results.
Definition 3**.**
For an independent set , we say is the minority side of the bipartition if and the majority side otherwise. If is the minority side, then is the majority side and vice versa.
Definition 4**.**
A set (or ), is -linked if the subgraph of induced by the vertex set is connected; in other words, is connected in the graph (the square of the graph ).
Galvin showed that for the hard-core model on at fugacity with constant, the number of occupied vertices on the minority side is asymptotically distributed as a Poisson random variable with mean and with high probability (whp) all -linked components of occupied vertices on the minority side are of size [7, Theorem 1.4]. He conjectured that there is in fact a series of thresholds at which -linked components of size emerge in a Poisson fashion and asked as an open problem for the distribution of occupied -linked components of size on the minority side for all .
Here we prove his conjecture and answer his question in a strong form. We show that the emergence of a -linked occupied component of size on the minority side has a sharp threshold at and we identify precisely the scaling window about this threshold. We also essentially determine the asymptotic joint distribution of the number of such components (see Theorem 6).
Theorem 5**.**
For fixed, let
[TABLE]
Then for the hard-core model on at fugacity :
- •
if as then whp there are no -linked occupied components of size on the minority side;
- •
if then whp there are -linked occupied components of size on the minority side;
- •
if tends to a constant then the distribution of the number of -linked occupied components of size on the minority side converges to a Poisson distribution with mean
[TABLE]
where the sum is over all trees on vertices and denotes the automorphism group of the tree .
In fact we prove much more detailed probabilistic results. Define the defect type of a -linked component of or to be the isomorphism class of the induced subgraph . In particular there is a unique defect type of size (an isolated vertex), a unique defect type of size (two vertices at distance in ), but two defect types of size : vertices whose distance- graph forms a clique and vertices whose distance- graph forms a path. For a given defect type , let be the random variable that counts the number of -linked occupied components of type on the minority side in the hard-core model on . Let and (for the asymptotics of and see Lemma 19 and Corollary 21 below).
We determine the limiting distribution of the number of each type of defect and show that the number of defects of different types are asymptotically independent.
Theorem 6**.**
There is a constant such that if and is a defect type then the following holds. If and are such that as for some constant , then
[TABLE]
where ‘’ denotes convergence in distribution. If and are such that as , then
[TABLE]
Moreover, suppose we have two finite sets of defect types so that for each , there exists so that , and for each , . Then the collection of random variables converges in distribution to a collection of independent Poisson and standard normal random variables.
We remark that the condition that is a technical requirement of a container lemma due to Galvin which is a key ingredient in our proofs (see Lemma 11 below). We expect that Theorem 6 in fact extends to the range .
There is a close connection between computing accurate estimates of the partition function and deriving probabilistic information about the hard-core model. As a key step in proving his probabilistic results, Galvin gave a significant generalization of Theorem 1 to counting weighted independent sets in the hypercube; that is, computing the asymptotics of for general .
Theorem 7** (Galvin [7]).**
For ,
[TABLE]
Moreover, there is a constant so that for ,
[TABLE]
The formula (1) generalises Theorem 1, and determines the asymptotics of for , while the formula (2) finds the asymptotics of for .
Our techniques based on the cluster expansion will allow us to sharpen Theorem 7 considerably: we find a formula that can be used not only to determine the asymptotics of for all constant but also to give an expansion of to arbitrary order in .
To write the formula we need some notation that comes from polymer models in the statistical mechanics of lattice systems [18]. Before we introduce these notions formally, we describe some of the intuition underlying the proof of Theorem 2 and the results to come. An immediate lower bound on of comes by considering the contribution from independent sets which lie entirely in one side of the bipartition of . We call the collection of independent sets which lie entirely in (or ) the even (odd) ground state. Taking , for example, there is a constant factor gap between this trivial lower bound and the correct asymptotics of Theorem 1. Therefore a constant proportion of independent sets do not belong to a ground state. However, almost all independent sets are very close to a ground state independent set. Thus it is natural to describe independent sets in terms of their deviations from a ground state: given a subset , let denote the probability that an independent set chosen according to satisfies . When is small, we think of it as a deviation from the odd ground state and note that the relative ‘cost’ of such a deviation is
[TABLE]
We denote this cost, or weight, of a deviation by . Crucially, the weight factorises over the -linked components of , and so we define an even polymer to be any -linked subset of , and define its weight by (3). We define odd polymers similarly.
The language of polymer models allows us to relate the partition function to the partition function of a (multivariate) hard-core model on an auxiliary graph whose vertices are polymers and each polymer has its associated weight as its fugacity. A key feature of this transformation is that while at large an independent set drawn from is typically very structured, the corresponding deviations on the minority side are typically unstructured and behave almost independently. Using the cluster expansion, we are then able to extract almost complete probabilistic information from our model. In particular it allows us to precisely quantify the contribution to from small deviations, and allows us to compute to essentially arbitrary accuracy.
The cluster expansion is a powerful and classical tool in the rigorous study of statistical mechanics. In our context, it is the multivariate Taylor expansion of the logarithm of the partition function of our auxiliary hard-core model. Studying this infinite series naturally leads to the question of convergence. Verifying the convergence of the cluster expansion amounts to showing that the number of polymers of a given weight is not too large. This is where the container method of Sapozhenko comes in. In fact, all of the ingredients needed to show that this polymer model has a convergent cluster expansion are already present in Sapozhenko’s work and Galvin’s extensions. In some sense Sapozhenko rediscovered the concept of a polymer model and computed the smallest order terms of the cluster expansion by hand. Certainly the intuition behind the specific polymer model is clear in his work.
We now venture to make some of the above mentioned notions more concrete. Recall that an even/odd polymer is a -linked subset of respectively. The size of a polymer , , is the number of vertices in . Since exhibits symmetry between and we will restrict our attention to even polymers. We say two even polymers are compatible if ; that is if is not -linked. Otherwise and are incompatible (and note that each polymer is incompatible with itself). For a tuple of even polymers, the incompatibility graph, , is the graph with vertex set and an edge between any two incompatible polymers. An even cluster is an ordered tuple of even polymers so that is connected. The size of a cluster is . Let be the set of all even clusters and the set of all even clusters of size .
Recall that for a polymer , we define its weight to be For a cluster we define
[TABLE]
where is the Ursell function of a graph , defined by
[TABLE]
Finally for we define
[TABLE]
Note that by symmetry would be identical if we had considered odd polymers and odd clusters instead.
We can now state our formula for .
Theorem 8**.**
Suppose and is bounded as . Then for all fixed ,
[TABLE]
where for each fixed , as . Moreover, can be computed in time .
In fact, it is not essential that remain bounded as : a similar formula holds for all values of with an addition of to , but for simplicity here we focus on the more interesting cases when is bounded or tends to [math].
As a quick check, note that at , since there are polymers of size and each has weight . Moreover, and so Theorem 8 implies that , recovering Theorem 1.
More generally, Theorem 8 extends Theorem 7. For instance, we can give a closed-form formula for the asymptotics of for any constant .
Corollary 9**.**
For any fixed and for ,
[TABLE]
For example, if , then
[TABLE]
We find it rather remarkable how well the two tools from statistical physics, polymer models and the cluster expansion, work with the graph container method, and we expect many further applications of this combination of methods. See [21] for a survey of the graph container method. In forthcoming work, Keevash and the first author [13] apply this combination of methods to resolve conjectures of Galvin and Engbers [5] and Kahn and Park [16] on the number of -colourings of . As a future research direction, we ask whether these statistical physics tools can be used in conjunction with the method of hypergraph containers [1, 23] to derive finer asymptotics and probabilistic information in some of the many extremal combinatorics problems in which hypergraph containers have been deployed.
The paper is organised as follows: We introduce abstract polymer models and the cluster expansion in Section 2, and then specialise to the hypercube and prove Theorem 8 in Section 3. We prove the probabilistic results of Theorems 5 and 6 in Section 4. We explicitly compute , and and prove Theorem 2 in Section 5.
Related work
As Galvin remarked in [7], only a few properties of the hypercube are needed in deriving Theorem 7; the same is true for Theorems 6 and 8. The essential properties are that the graph be bipartite and that some isoperimetric estimates hold (of the form of Lemma 12 below). In fact, using an approach to approximate counting based on the cluster expansion [12, 14], one could obtain efficient algorithms to approximate the partition function and to sample from the hard-core model for a class of graphs with these properties. The polymer models used in [14, 20, 3] to sample from the hard-core model on random regular bipartite graphs are very similar to the ones used here. For a similar class of bipartite graphs Galvin and Tetali [10] showed that the Glauber dynamics Markov chain for sampling from the hard-core model exhibits slow mixing; that proof is also based on extending the ideas of Sapozhenko.
2. Polymer models and the cluster expansion
Here we introduce the main tools we will use, abstract polymer models [11, 18] and the cluster expansion, both tools from statistical physics that have been used extensively to study phase diagrams of lattice spin models. We have already encountered the terms ‘polymer’ and ‘cluster’ in the previous section. Indeed, the polymers from the introduction are concrete examples of a more general notion which we introduce now.
Let be a finite set whose elements we call ‘polymers’. We equip with a complex-valued weight for each polymer as well as a symmetric and reflexive incompatibility relation between polymers. We write if polymers and are incompatible. Let be the collection of pairwise compatible sets of polymers from , including the empty set of polymers. Then the polymer model partition function is
[TABLE]
where the contribution from the empty set is .
A cluster is an ordered tuple of polymers whose incompatibility graph is connected. Let be the set of all clusters. The cluster expansion is the formal power series in the weights
[TABLE]
where
[TABLE]
and is the Ursell function as defined in (4). In fact the cluster expansion is simply the multivariate Taylor series for in the variables , as observed by Dobrushin [4]. See also Scott and Sokal [24] for a derivation of the cluster expansion and much more.
A sufficient condition for the convergence of the cluster expansion is given by a theorem of Kotecký and Preiss.
Theorem 10** ([18]).**
Let and be two functions. Suppose that for all polymers ,
[TABLE]
then the cluster expansion converges absolutely. Moreover, if we let and write if there exists so that , then for all polymers ,
[TABLE]
We remark that one could simply take in (8) in order to establish convergence of the cluster expansion. However, allowing to take non-zero values (thus strengthening (8)) allows us to give strong tail bounds on the cluster expansion via (9). This will allow us to show that certain truncations of the cluster expansion serve as good approximations to the logarithm of the partition function.
3. Polymers in the hypercube
We now return to our specific setting with polymers derived from the hard-core model on . These polymers will essentially be the same as those defined in Section 1. Here we will study the cluster expansion of this polymer model in depth.
3.1. Preliminaries
We begin with some notation and lemmas from [7].
For a set (and analogously for ), let denote the number of vertices of , be the set of neighbours of , and the bipartite closure of . Clearly . Let
[TABLE]
The following lemma of Galvin is based on the graph container method of Sapozhenko [22]. This is a key technical ingredient in [22, 7] and in the results of this paper.
Lemma 11** ([7]).**
There exist constants , so that for all , all ,
[TABLE]
In what follows, we will always assume that to allow us to apply Lemma 11.
We will also use the following isoperimetric estimates, which come from [6, 17] but can also be found in [7].
Lemma 12**.**
Suppose (or ). Then
- (1)
If , then . 2. (2)
If , then . 3. (3)
If , then .
We also make use of the following, from, e.g. [9].
Lemma 13**.**
The number of -linked subsets of size which contain a given vertex is at most .
3.2. The defect polymer model
We begin by fixing a side of the bipartition which we call the defect side. Let us suppose this side is (the case where is the defect side will be identical).
We define a polymer to be a -linked subset of the defect side in so that . Let be the set of all such polymers (we will make use of a subscript, as in or , if we want to indicate which is the defect side). Two polymers are compatible if is not -linked. Let be the set of all pairwise compatible sets of polymers from . The weight functions are defined as
[TABLE]
Let denote the resulting polymer model partition function (and note that by symmetry is the same regardless of the defect side).
The partition function is the normalizing constant of a probability distribution on defined by
[TABLE]
Using we can define a probability measure on as follows:
- (1)
With probability choose or to be the defect side. 2. (2)
Choose a polymer configuration from and assign all vertices of to be occupied on the defect side . 3. (3)
For each vertex on the non-defect side that is not blocked by an occupied vertex on the defect side, include in the independent set independently with probability .
The resulting distribution is not exactly the hard-core model on , but we will show that the two distributions are very close in total variation distance. Moreover, we will show that a scaling of the partition function is a very good approximation of the hard-core partition function . Note that the defect side need not be the minority side: in step we may choose no vertices to be occupied opposite the defect side. Nevertheless, we will show below that with very high probability the defect side is in fact the minority side of an independent set sampled according to (Lemma 17 below).
Lemma 14**.**
We have
[TABLE]
Moreover,
[TABLE]
We will prove Lemma 14 after showing that the polymer model satisfies the Kotecký–Preiss condition. Lemma 14 allows us to work with and to prove Theorems 6 and 8. In particular, to prove Theorem 8 we will approximate by truncating the cluster expansion for and exponentiating. To prove Theorem 6 we will prove the probabilistic statements for polymer configurations sampled from and then use Lemmas 14 and 17 to transfer these results to results about the minority side of an independent set drawn from .
We define the truncated cluster expansion of as
[TABLE]
We now show that condition (8) holds for the defect polymer model with appropriate choices of functions and , and thus gives a good approximation to .
Lemma 15**.**
For integers , let
[TABLE]
Then for sufficiently large
[TABLE]
In particular, for fixed and sufficiently large,
[TABLE]
Proof.
Let be defined by and define by . We will show that the Kotecký–Preiss condition (8) holds. That is, for every ,
[TABLE]
To prove this we will show that for all ,
[TABLE]
and this will suffice since if and only if for some and . We will break up the sum according to the different cases of .
First we sum over with . We use the fact that for such , by Lemma 12, and that there are at most -linked sets of size that contain a fixed vertex by Lemma 13.
[TABLE]
which is at most for large enough.
We next sum over with . We use the fact that for such , by Lemma 12.
[TABLE]
and so if and is large enough, then this sum is at most .
Now turning to with , we have that , and so
[TABLE]
where the last inequality comes from applying Lemma 11. In the sum, we have and , and so
[TABLE]
for large enough , and so
[TABLE]
for large enough. Putting the three bounds together gives (14).
To prove the lemma we now apply Theorem 10, applying (9) for the polymer containing the single vertex to obtain:
[TABLE]
Summing over all gives
[TABLE]
Since is non-increasing in , we have, recalling that and ,
[TABLE]
Keeping only terms in inequality (16) corresponding to clusters of size at least , we have
[TABLE]
as desired. ∎
The Kotecký–Preiss condition also allows us to prove a simple large deviation result for the total size of all polymers in a random polymer configuration drawn from .
Suppose is a random variable whose moment generating function is defined for in a neighbourhood of [math]. We will make extensive use of the cumulant generating function of , defined as
[TABLE]
that is, the logarithm of the moment generating function.
Lemma 16**.**
Let be a random configuration drawn from the distribution . Then with probability at least , we have
[TABLE]
where .
Proof.
We introduce an auxiliary polymer model with modified polymer weights:
[TABLE]
Let be the associated polymer model partition function. Then at where is a random polymer configuration from the original polymer model.
In the proof of Lemma 15, all of the estimates hold if we were to replace by . Therefore the proof shows that the Kotecký–Preiss condition holds for the polymer weights , and the functions as above. Applying (9) and summing over all polymers of size gives
[TABLE]
Then since , we have
[TABLE]
By Markov’s inequality we have
[TABLE]
and setting gives
[TABLE]
for large enough since for , grows faster than any fixed polynomial in . ∎
This large deviation bound allows us to show that with very high probability over an independent set drawn from , the defect side is the minority side.
Lemma 17**.**
With probability at least over the random independent set drawn from , the minority side of the bipartition is the defect side.
Proof.
Let and denote the defect and minority side respectively selected under . By Lemma 16 we have
[TABLE]
Let us fix an element in the sample space of with and let . Let denote the size of the intersection of with the non-defect side conditioned on the event that . Note that has a distribution where . We then have
[TABLE]
for sufficiently large. For the penultimate inequality we applied the Chernoff bound. The result follows. ∎
Now we can prove Lemma 14.
Proof of Lemma 14.
We say an independent set is captured by the odd polymer model if every -linked component of has and captured by the even polymer model if every -linked component of has . If we view as the sum of for representing the odd polymer model and for representing the even polymer model, then each that is captured by the odd polymer model contributes to the first summand and each that is captured by the even polymer model contributes to the second summand.
Observe first that every is captured by either the odd or the even polymer model. Indeed suppose not, then there exists which contains a set with and a set with . It follows that (since, for example, contains a perfect matching). However then and so , contradicting the fact that is an independent set.
It remains to bound the contribution to from independent sets that are counted twice. That is, bound where denotes the collection of independent sets that are captured by both the odd and even polymer models. However, any such independent set can be selected by conditioned on the event that (using the notation of Lemma 17). Letting denote the independent set selected by we have by Lemma 17 that
[TABLE]
All together this gives the inequalities
[TABLE]
and so
[TABLE]
which gives (10). Recall one formula for the total variation distance between discrete probability measures:
[TABLE]
The total variation distance bound is then immediate from (18) as the only independent sets that have higher probability under than are those that are counted twice. ∎
Now we can prove Theorem 8.
Proof of Theorem 8.
First we prove the estimate for fixed. Let be a cluster with . Since is a 2-linked set of size at most , there are possibilities for by Lemma 13. Given a set of size at most , there are at most a constant number of clusters of size such that . It follows that the number of clusters of size is . By Lemma 12, the weight of any cluster of size is (note that the Ursell function of a cluster of size is simply a constant). The claimed estimate on follows.
Let be fixed. By (13) we have that
[TABLE]
for sufficiently large and so
[TABLE]
It follows from Lemma 14 that
[TABLE]
where (it is here we use that is bounded as ).
Finally we show that can be computed in time . Let be the family of all -linked subsets of of size at most which contain the vertex . Given , we call a coordinate active for if for some . We note that every has at most active coordinates. For , we let denote the set of elements in whose set of active coordinates is precisely .
For , we will construct the list of all the elements with and whose set of active coordinates are a subset of . We do so iteratively. Suppose we have constructed the list . For a vertex , and , let denote the vertex of obtained by flipping the th and th coordinate of . For each pair , and , add to the list if . This procedure generates the whole list and shows that and so . For and , let denote the subset of consisting of those sets whose active coordinates are precisely . Note that we can generate the list in time by checking the elements of one by one.
For a cluster , we define the active coordinates of to be the active coordinates of the set . For fixed and , we generate the list of all clusters of size containing with active coordinates and . To do this we run through each and create the list of clusters of size with . We claim that this can be done in time . Recall that a cluster of size is an ordered set of polymers such that . Let us fix . Since there are at most ordered integer partitions of , it suffices to show that for a fixed such partition (so that ) we may find, in time , all clusters for which for all and . To do this we can simply check each element of (a set of size at most ) to see if it constitutes a legitimate cluster.
By symmetry of coordinates and vertex transitivity of we have
[TABLE]
Finally we note that by using an algorithm of Björklund, Husfeldt, Kaski, and Koivisto [2, Theorem 1], we may calculate the Ursell function of a cluster in time . Moreover for a set where , we can calculate in time . We can therefore calculate in time . ∎
4. Probabilistic properties via the cluster expansion
Here we use the cluster expansion to prove Theorems 5 and 6 and Corollary 9. Using Lemmas 14 and 17 we see that up to total variation error, we may replace the minority side of an independent set drawn from with the defect side of an independent set drawn from ; or in other words, a polymer configuration drawn from . Thus in this section we will let denote the (random) number of polymers of type in a random polymer configuration drawn from , and prove the conclusions of Theorems 6 and 5 for these random variables. We will also assume throughout this section that . Theorem 6 is vacuous if since for all types in that case; the formula (6) in Corollary 9 holds for by Theorem 7.
We begin with some preliminaries on cumulants of random variables. Recall the cumulant generating function of a random variable , . The th cumulant of is defined by taking derivatives of and evaluating at [math]:
[TABLE]
In fact the cumulants of are related to the moments of by a non-linear change of basis (see e.g. [19]). In particular, and . Moreover, if a random variable has a distribution determined by its moments, and if for a sequence of random variables we have for all , then converges to in distribution (denoted ). We will use this fact in conjunction with the following fact.
Fact 18**.**
If has a Poisson distribution with mean , then for all . If has a standard normal distribution (mean [math], variance ) then , , and for all .
We also need a few preliminaries about defect types. First, for fixed the number of defect types of size is bounded independent of . Let denote the type of a polymer . The weight of a polymer is determined by , since is determined by the number of edges of in the graph . Let denote for of type . Using Lemma 12, we have the simple bounds
[TABLE]
for a type of size and large enough. Note that for any fixed and any type , we have as ; that is, each polymer weight decays super-polynomially fast in . We denote by the number of polymers of type .
Lemma 19**.**
Let be a defect type of a fixed size . Then
[TABLE]
Moreover, if is a tree defect type then
[TABLE]
where and if is not a tree then
[TABLE]
Proof.
By the vertex transitivity of , every vertex of (or ) is contained in the same number of polymers of type . Let us denote this number by and note that . The lower bound in (23) follows from the fact that if there exists a polymer with type , then certainly . The upper bound follows from the fact that every vertex of is contained in at most -linked sets of size by Lemma 13.
Since is a connected graph we may fix an ordering of the vertices of so that is connected for all . We let denote the degree of the vertex in the graph .
We will construct an injective graph homomorphism recursively as follows. Suppose that we have constructed an injective graph homomorphism for some and let denote the number of such homomorphisms. We now extend to an injective graph homomorphism . We consider two cases.
If , then must lie in the joint neighbourhood of and for some . For any pair of vertices their codegree in is at most and so there are at most choices for whence
[TABLE]
Suppose now that and let denote the set of possible choices for . We note that if and only if is adjacent to and non-adjacent to for in . Again using the fact that the maximum codegree in is it follows that . We then have that
[TABLE]
If is not a tree then for some . It follows by (24) and the upper bound of (25) that . The bound follows from the fact that where denotes the automorphism group of the graph (recall that is a constant).
If is a tree then for all and so by (25) . The result follows. ∎
Now fix a defect type and let be the number of polymers of type in . We introduce modified polymer weights , given by
[TABLE]
Let be the corresponding polymer model partition function. Then we have
[TABLE]
and so
[TABLE]
If the cluster expansion for converges absolutely, we can write
[TABLE]
where , the number of polymers of type in the cluster .
The following lemma gives bounds on cluster weights using the Kotecký–Preiss condition. Theorem 6 will then follow in a series of corollaries.
Lemma 20**.**
Consider a fixed defect type , and let be a fixed integer. Then
[TABLE]
as .
Moreover if is a fixed set of distinct defect types, and are fixed positive integers, then
[TABLE]
Proof.
In the sum in (27), if we consider only clusters made up of a single polymer of type then we get a contribution of exactly , and so it remains to show that the contribution of all other terms is . Let denote the number of vertices in a graph of type . We first consider the contribution to the sum (27) from clusters with and . By (12), we may bound this contribution by
[TABLE]
[TABLE]
Consider now the contribution to the sum (27) from clusters with . For such a cluster, recalling that for a polymer and , we have
[TABLE]
Thus, using (16) we may bound this contribution by
[TABLE]
where the above inequality holds for large enough (independent of ). The result follows since
[TABLE]
Next we turn to (28). Consider a cluster with , where . Then we have
[TABLE]
where is the size of a polymer of type . By (16) and (22), the contribution of such clusters to the sum in (28) is therefore at most
[TABLE]
Finally, let , so that summing over all positive integer vectors , we have
[TABLE]
Putting these estimates together yields (28). ∎
An immediate corollary of Lemma 20 gives the asymptotics of , for a given type .
Corollary 21**.**
Let be a defect type. Then
[TABLE]
Proof.
These formulae follow from (26) and (27) by taking and respectively. ∎
We can also use Lemma 20 to prove Poisson convergence.
Corollary 22**.**
Suppose for a given type and fugacity we have as . Then .
Proof.
Using Fact 18, it is enough to show that for all . By (26) and our assumption we have
[TABLE]
and therefore using (26) again,
[TABLE]
for all . ∎
In a similar fashion, we obtain asymptotic normality if .
Corollary 23**.**
Fix a type . If is such that as , then .
Proof.
By Fact 18, it suffices to show that , , and for all . By the definition of , we have and . By translation invariance and scaling of higher cumulants, for we have
[TABLE]
by (26). By Lemmas 20 and 21 we have , and so for ,
[TABLE]
as since our assumption on implies . ∎
To study the joint distribution of the counts of different defect types, it is convenient to work with the joint cumulants of a collection of random variables. Given a set of random variables and non-negative integers , we define the joint cumulant
[TABLE]
In particular, with this notation
[TABLE]
We will use the fact that the joint cumulants of independent random variables vanish; that is, if , are independent random variables, and are positive integers, then
[TABLE]
Generalizing formula (26) to collections of random variables, we can express the joint cumulants of defect type counts via a modified cluster expansion. Let be a set of distinct defect types and let be non-negative integers. Then
[TABLE]
Corollary 24**.**
Consider two fixed sets and of distinct defect types so that for each , for some , and for each , as . Then the collection of random variables converges in distribution to a collection of independent Poisson and standard normal random variables.
Proof.
We will use the fact that the distribution of a collection of Poisson and normal random variables is determined by its joint moments, or equivalently, by its joint cumulants. Here, working with cumulants instead of moments will simplify calculations considerably. From Corollaries 22 and 23 we know that the cumulants of each of the individual random variables in the collection converge to the corresponding cumulants of the corresponding Poisson or normal random variable. Therefore it is enough to show convergence of the joint cumulants involving at least two of the random variables, and from (32), we must show that these converge to [math]. In particular, for , and , we will show
[TABLE]
as as long as least two of the ’s are positive. Since for , it will suffice to show (33) when we center and normalise all of the random variables, that is, for ,
[TABLE]
as long as at least two of the ’s are positive. WLOG we can assume that , for all , and that . By scaling and translation invariance, we have
[TABLE]
Then using (28) from Lemma 20 we have
[TABLE]
First suppose that . Then since for all and , we have
[TABLE]
since tends to [math] faster than any fixed polynomial in for any type . On the other hand if we have , then
[TABLE]
∎
Theorem 6 follows from Corollaries 22, 23, and 24. We now prove Theorem 5.
Proof of Theorem 5.
We can assume in what follows that , since if whp there are no occupied vertices on the minority side (Theorem 1.2 of [7]).
First we show that if , then whp there are no -linked components of size on the defect side.
Let the type of a polymer of size . We have the bounds
[TABLE]
where the upper bound uses Lemma 12, and so for this range of . By Lemma 13, , and so by Corollary 21,
[TABLE]
Now plugging in for some , we have
[TABLE]
and so as , . This is true for any type of size , and since there are a constant number of such types, Markov’s inequality shows that whp there are no polymers of size in if .
Now suppose . Consider a type where is isomorphic to a tree on vertices. In this case we have by Lemma 19, and so for the previous calculation gives
[TABLE]
In particular if as then . By Corollary 21, , and so by the second-moment method (Paley-Zygmund inequality), whp.
To prove the second part of Theorem 5, suppose that where converges to a constant as . Then for any type of size that is not a tree, by Lemma 19 we have as , and since there is a constant number of such types, we know that whp there are no non-tree -linked components of size on the minority side. Let be the defect types of size that are trees. The proof of Lemma 19 shows that in fact every tree on vertices is a defect type. Note that for each we have that . Then by Lemma 19 and Corollary 21 we have that for each , and so by a similar calculation as above we have that as where
[TABLE]
By Corollary 24, the collection of random variables converges to a collection of independent Poisson random variables with mean , and therefore their sum is distributed as Poisson with mean , completing the proof of Theorem 5. Calculating this mean explicitly amounts to calculating for every tree on vertices, a task whose running time depends only on (a constant). ∎
The proof of Corollary 9 involves a similar calculation.
Proof of Corollary 9.
We may again assume that since for larger , by Theorem 7. Now fix and take . We then can apply Theorem 8 with to obtain
[TABLE]
But by the same calculation as above in the proof of Theorem 5 we have
[TABLE]
and
[TABLE]
and so
[TABLE]
The example formula (7) follows from the computation of given below in Section 5. ∎
5. Computation of the cluster weights
Here we compute explicitly to use in Theorem 2 and Corollary 9.
Proposition 25**.**
We have
[TABLE]
[TABLE]
At , this is
[TABLE]
Polymers
There is a single type of polymer of size . There are of these, and each has weight .
There is a single type of polymer of size . There are of these and each has weight .
There are two types of polymers of size : those that form a clique in the distance graph and those that form a path on vertices. There are of the first type and each has weight ; there are of the second type and each has weight .
Clusters
There is a single cluster type of size , each consisting of single polymer of size , with Ursell function . Thus
[TABLE]
There are two types of clusters of size : an ordered pair of incompatible polymers of size , of which there are , with Ursell function and weight , and one polymer of size with Ursell function and count and weight given above.
All together this gives:
[TABLE]
At this is
[TABLE]
There are five types of clusters of size :
- (1)
One polymer of size , first type: of weight , Ursell function . 2. (2)
One polymer of size , second type: of weight , Ursell function . 3. (3)
Three polymers of size , incompatibility graph is a triangle: of weight , Ursell function . 4. (4)
Three polymers of size , incompatibility graph is a path on vertices: of weight , Ursell function . 5. (5)
One polymer of size , one of size : of weight , Ursell function .
All together this gives:
[TABLE]
At this is
[TABLE]
Proof of Theorem 2.
Theorem 8 tells us that
[TABLE]
since . If we write , then we have
[TABLE]
Since the Taylor series for around is
[TABLE]
we have
[TABLE]
which gives Theorem 2. ∎
Acknowledgements
WP supported in part by NSF Career award 1847451. Part of this work was done while WP visiting the Simons Institute for the Theory of Computing. We thank Lina Li for some very helpful comments on this paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Balogh, R. Morris, and W. Samotij. Independent sets in hypergraphs. Journal of the American Mathematical Society , 28(3):669–709, 2015.
- 2[2] A. Björklund, T. Husfeldt, P. Kaski, and M. Koivisto. Computing the Tutte polynomial in vertex-exponential time. In Proceedings of the Forty-ninth Annual Symposium on Foundations of Computer Science , FOCS 2008, pages 677–686. IEEE, 2008.
- 3[3] S. Cannon and W. Perkins. Counting independent sets in unbalanced bipartite graphs. In Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA) , pages 1456–1466. SIAM, 2020.
- 4[4] R. Dobrushin. Estimates of semi-invariants for the Ising model at low temperatures. Translations of the American Mathematical Society-Series 2 , 177:59–82, 1996.
- 5[5] J. Engbers and D. Galvin. H-coloring tori. Journal of Combinatorial Theory, Series B , 102(5):1110–1133, 2012.
- 6[6] D. Galvin. On homomorphisms from the Hamming cube to ℤ ℤ \mathbb{Z} . Israel Journal of Mathematics , 138(1):189–213, 2003.
- 7[7] D. Galvin. A threshold phenomenon for random independent sets in the discrete hypercube. Combinatorics, probability and computing , 20(1):27–51, 2011.
- 8[8] D. Galvin. Independent sets in the discrete hypercube. ar Xiv preprint ar Xiv:1901.01991 , 2019.
