This paper develops efficient algorithms to approximately count independent sets and colorings on large random regular bipartite graphs, extending previous methods and confirming open questions in graph counting.
Contribution
It introduces an FPTAS for counting independent sets and q-colorings on almost all large regular bipartite graphs, based on recent techniques.
Findings
01
FPTAS for independent sets when Δ ≥ 53
02
FPTAS for weighted independent sets with large Δ and λ
03
FPTAS for q-colorings on large Δ-regular bipartite graphs
Abstract
We give a fully polynomial-time approximation scheme (FPTAS) to count the number of independent sets on almost every Δ-regular bipartite graph if Δ≥53. In the weighted case, for all sufficiently large integers Δ and weight parameters λ=Ω~(Δ1), we also obtain an FPTAS on almost every Δ-regular bipartite graph. Our technique is based on the recent work of Jenssen, Keevash and Perkins (SODA, 2019) and we also apply it to confirm an open question raised there: For all q≥3 and sufficiently large integers Δ=Δ(q), there is an FPTAS to count the number of q-colorings on almost every Δ-regular bipartite graph.
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Full text
Counting independent sets and colorings on random regular bipartite graphs
We give a fully polynomial-time approximation scheme (FPTAS) to count the number of independent sets on almost every Δ-regular bipartite graph if Δ≥53.
In the weighted case, for all sufficiently large integers Δ and weight parameters λ=Ω(Δ1), we also obtain an FPTAS on almost every Δ-regular bipartite graph.
Our technique is based on the recent work of Jenssen, Keevash and Perkins (SODA, 2019) and we also apply it to confirm an open question raised there:
For all q≥3 and sufficiently large integers Δ=Δ(q), there is an FPTAS to count the number of q-colorings on almost every Δ-regular bipartite graph.
1. Introduction
Counting independent sets on bipartite graphs (#BIS) plays a significant role in the field of approximate counting.
A wide range of counting problems in the study of counting CSPs [DGJ10, BDG*+*13, GGY17] and spin systems [GJ12, GJ15, GŠVY16, CGG*+*16], have been proved to be #BIS-equivalent or #BIS-hard under approximation-preserving reductions (AP-reductions) [DGGJ04].
Despite its great importance, it is still unknown whether #BIS admits a fully polynomial-time approximation scheme (FPTAS) or it is as hard as counting the number of satisfying assignments of Boolean formulas (#SAT) under AP-reduction.
In this paper, we consider the problem of approximating #BIS (and its weighted version) on random regular biparite graphs.
Random regular bipartite graphs frequently appear in the analysis of hardness of counting independent sets [MWW09, DFJ02, Sly10, SS12, GŠVY16].
Therefore, understanding the complexity of #BIS on such graphs is potentially useful for gaining insights into the general case.
Let Z(G,λ)=∑I∈I(G)λ∣I∣ where I(G) is the set of all independent sets of a graph G and λ>0 is the weight parameter.
This function also arises in the study of the hardcore model of lattice gas systems in statistical mechanics.
Hence we usually call Z(G,λ) the partition function of the hardcore model with fugacity λ.
In the case where input graphs are allowed to be nonbipartite, the approximability for counting the number of independent sets (#IS) is well understood.
Exploiting the correlation decay properties of Z(G,λ), Weitz [Wei06] presented an FPTAS for graphs of maximum degree Δ at fugacity λ<λc(Δ)=(Δ−2)Δ(Δ−1)Δ−1.
On the hardness side, Sly [Sly10] proved that, unless NP=RP, there is a constant ε=ε(Δ) that no polynomial-time approximation scheme exists for Z(G,λ) on graphs of maximum degree Δ at fugacity λc(Δ)<λ<λc(Δ)+ε(Δ).
Later, this result was improved at any fugacity λ>λc(Δ) [SS12, GŠV16].
In particular, these results state that if Δ≤5, there is an FPTAS for #IS on graphs of maximum degree Δ, otherwise there is no efficient approximation algorithm unless NP=RP.
The situation is different on bipartite graphs.
To the best of our knowledge, no NP-hardness result is known even on graphs with unbounded degree.
Surprisingly, Liu and Lu [LL15] designed an FPTAS for #BIS which only requires one side of the vertex partition to be of
maximum degree Δ≤5.
On the other hand, it is #BIS-hard to approximate Z(G,λ) at fugacity λ>λc(Δ) on biparite graphs of maximum degree Δ≥3 [CGG*+*16].
Recently, Helmuth, Perkins, and Regts [HPR18] developed a new approach via the polymer model and gave efficient counting and sampling algorithms for the hardcore model at high fugacity on certain finite regions of the lattice Zd and on the torus (Z/nZ)d.
Their approach is based on a long line of work [PS75, PS76, KP86, Bar16, BS16, PR17].
Shortly after that, Jessen, Keevash, and Perkins [JKP19] designed an FPTAS for the hardcore model at high fugacity on bipartite expander graphs of bounded degree. And they further extended the result to random Δ-regular bipartite graphs with Δ≥3 at fugacity λ>(2e)250. This is the first efficient algorithm for the hardcore model at fugacity λ>λc(Δ) on random regular bipartite graphs. A natural question is, can we design FPTAS for lower fugacity and in particular the problem #BIS on random regular bipartite graphs?
Indeed, we obtain such results.
Let Gn,Δbip denote the set of all Δ-regular bipartite graphs with n vertices on both sides.
Theorem 1**.**
For Δ≥53 and fugacity λ≥1, with high probability (tending to 1 as n→∞) for a graph G chosen uniformly at random from Gn,Δbip, there is an FPTAS for the partition function Z(G,λ).
Theorem 2**.**
For all sufficiently large integers Δ and fugacity λ=Ω(Δ1), with high probability (tending to 1 as n→∞) for a graph G chosen uniformly at random from Gn,Δbip, there is an FPTAS for the partition function Z(G,λ).
For notational convenience, we use the term “on almost every Δ-regular bipartite graph” to denote that a property holds with high probability (tending to 1 as n→∞) for randomly chosen graphs from Gn,Δbip.
Counting proper q-colorings on a graph is another extensively
studied problem in the field of approximate counting [Jer95, BD97, BDGJ99, DF03, HV03, Hay03, Mol04, DFFV06, HV06, GK12, DFHV13, LY13, GLLZ18], which is also shown to be #BIS-hard but unknown to be #BIS-equivalent [DGGJ04].
In general graphs, if the number of colors q is no more than the maximum degree Δ, there may not be any proper coloring over the graph. Therefore, approximate counting is studied in the range that q≥Δ+1. It was conjectured that there is an FPTAS if q≥Δ+1, but the current best result is q≥αΔ+1 with a constant α slightly below 611 [Vig00, CDM*+*19]. The conjecture was only confirmed for the special case Δ=3 [LYZZ17].
On bipartite graphs, the situation is quite different.
For any q≥2, we know that there always exist proper q-colorings for every bipartite graph.
So it is natural to wonder under which relations between q and Δ there is an FPTAS to count the number of q-colorings on biparite graphs.
Using a technique analogous to that for #BIS,
we obtain an FPTAS to count the number of q-colorings on random Δ-regular bipartite graphs for all sufficiently large integers Δ=Δ(q) for any q≥3.
Theorem 3**.**
For q≥3 and Δ≥100q10 where q=⌈q/2⌉, with high probability (tending to 1 as n→∞) for a graph chosen uniformly at random from Gn,Δbip, there is an FPTAS to count the number of q-colorings.
The classical approach to designing approximate counting algorithms is random sampling
via Markov chain Monte Carlo (MCMC). However, it is known that the Markov chains are slowly mixing on random
bipartite graphs for both independent set and coloring if the degree Δ is not too small.
Taking #BIS as an example, a typical independent set of a random regular bipartite graph of degree at least 6 is
unbalanced: it either chooses most of its vertices from the left side or the right side. Thus, starting from an independent set with most vertices from the left side, a Markov chain is unlikely to reach an independent set with most of its vertices from the right side in polynomial time.
Even so, a recent beautiful work exactly makes use of the above separating property to design approximately counting algorithm [JKP19]. By making the fugacity λ>(2e)250 sufficiently large, they proved that most contribution of the partition function comes from extremely unbalanced independent sets, those which occupy almost no vertices on one side and almost all vertices on the other side. In particular, for a bipartite graph G=(L,R,E) with n vertices on both sides, they identified two independent sets I=L and I=R as ground states as they have the largest weight λn among all the independent sets.
They proved that one only needs to sum up the weights of states which are close to one of the ground states, for no state is close to both ground states and the contribution from the states which are far away from both ground states is exponentially small.
However, the ground state idea cannot be directly applied to counting independent sets and counting colorings since each valid configuration is of the same weight. We extend the idea of ground states to ground clusters, which is not a single configuration but a family of configurations.
For example, we identify two ground clusters for independent sets, those which are entirely chosen from vertices on the left side and those which are entirely chosen entirely from vertices on the right side.
If a set of vertices is entirely chosen from vertices on one side, it is obviously an independent set.
Thus each cluster contains 2n different independent sets.
Similarly, we want to prove that we can count the configurations which are close to one of the ground clusters and then add them up.
For counting colorings, there are multiple ground clusters indexed by a subset of colors ∅⊊X⊊[q]: colorings which color L only with colors from X and color R only with colors from [q]∖X.
Unlike the ground states in [JKP19], our ground clusters may overlap with each other and some configurations are close to more than one ground clusters.
In addition to proving that the number of configurations which are far away from all ground clusters are exponentially small, we also need to prove that the number of double counted configurations are small.
After identifying ground states and with respect to a fixed ground state, Jessen, Keevash, and
Perkins [JKP19] defined a polymer model representing deviations from the ground state and rewrote the original partition function as a polymer partition function. We follow this idea and define a polymer model representing deviations from a ground cluster. However, deviation from a ground cluster is much subtler than deviation from a single ground state. For example, if we define polymer as connected components from the deviated vertices in the graph, we cannot recover the original partition function from the polymer partition function.
We overcome this by defining polymer as connected components in graph G2, where an edge of G2 corresponds to a path of length at most 2 in the original graph.
Here, a compatible set of polymers also corresponds to a family of configurations in the original problem, while it corresponds to a single configuration in [JKP19].
It is much more common in counting problems that most contribution is from a neighborhood of some clusters rather than a few isolated states. So, we believe that our development of the technique makes it suitable for a much broader family of problems.
Independent work
Towards the end of this project, we learned that the authors of [JKP19] obtained similar results in their upcoming journal version submission.
2. Preliminaries
In this section, we review some basic definitions and concepts, introduce necessary notations and set up some facts and tools.
2.1. Independent sets and colorings
All graphs considered in this paper are unweighted, undirected, with no loops but may have multiple edges111There is no essential difference from keeping the graphs simple. We allow multiple edges just for writing convenience..
Let G=(V,E) be a graph.
We use dG(u,w) to denote the distance between two vertices u,w in the graph G.
For ∅⊊U,W⊆V, define dG(U,W)=minu∈U,w∈WdG(u,w).
Let U⊆V be a nonempty set.
We define NG(U)={v∈V:dG({v},U)=1} to be the neighborhood of U and emphasize that NG(U)∩U=∅.
We use G[U] to denote the induced subgraph of G on U.
Let E2 be the set of unordered pairs (u,v) such that u=v and dG(u,v)≤2.
We define G2 to be the graph (V,E2). It is clear that if the maximum degree of G is at most Δ, then the maximum degree of G2 is at most Δ2.
An independent set of the graph G is a subset U⊆V such that (u,w)∈E for any u,w∈U.
We use I(G) to denote the set of all independent sets of G.
The weight of an independent set I is λ∣I∣ where λ>0 is a paramter called fugacity.
We use Z(G,λ)=∑I∈I(G)λ∣I∣ to denote the partition function of the graph G. Clearly, Z(G,1) is the number of indepndent sets of G.
For any positive integer i, we use [i] to denote the set {1,2,…,i}.
Let q≥3 be an integer.
Define q=⌊q/2⌋ and q=⌈q/2⌉.
A coloring σ:V→[q] over the graph G is a mapping which assigns to each vertex of G a color from [q].
We say σ is proper if σ(u)=σ(v) for any edge (u,v)∈E.
We use C(G) to denote the set of all proper colorings over G.
Sometimes we need to consider the rescriction of a coloring and we use σ∣U to denote the coloring obtained by restricting σ over a subset U⊆V.
Whenever G=(L,R,E) is a bipartite graph and σ is coloring over G, we simply write σX instead of σ∣X for all X∈{L,R}.
For a number of disjoint sets S1,S2,…,Sk, we use ⊔i=1kSi to denote their union and stress the fact that they are disjoint.
For a number of colorings σ1:V1→[q],σ2:V2→[q],…,σk:Vk→[q], if Vi and Vj are disjoint for any 1≤i=j≤k, then ∪i=1kσi is the coloring over ⊔i=1kVi such that its resctriction over Vi is σi for any 1≤i≤k.
For two positive real numbers a and b, we say a is an ε-relative approximation to b for some ε>0 if exp(−ε)b≤a≤exp(ε)b, or equivalently exp(−ε)a≤b≤exp(ε)a.
A fully polynomial-time approximation scheme (FPTAS) is an algorithm that for every ε>0 outputs an ε-relative approximation to Z(G) in time (∣G∣/ε)C for some constant C>0, where Z(G) is some quantity, like the number of independent sets, of graphs G that we would like to compute.
2.2. Random regular bipartite graphs
We follow the model of random regular bipartite graphs in [MWW09].
Let Δ be a positive integer.
We use G∼Gn,Δbip to denote sampling a bipartite graph G in the following way.
At the beginning, the two sides of G both have exactly n vertices and there are no edges between them.
In the i-th round, we sample a perfect matching Mi over the complete bipartite graph Kn,n uniformly at random and independently of previous rounds.
We repeat this process for Δ rounds and add the edges in M1,M2,…,MΔ to the graph G.
We do not merge multiple edges in G to keep it Δ-regular.
We remark that this distribution of random graphs is contiguous with a uniformly random Δ-regular simple (without multiple edges) bipartite graph, which implies that Lemma 4 and similar results also apply to the latter distribution.
See [MRRW97] for more information.
In the following, we discuss the property of random regular bipartite graphs.
We say a Δ-regular bipartite graph G=(L,R,E) with n vertices on both sides is an (α,β)-expander if for all subsets U⊆L or U⊆R with ∣U∣≤αn, ∣N(U)∣≥β∣U∣.
This property is called the expansion property of G.
We use Gα,βΔ to denote the set of all Δ-regular bipartite (α,β)-expander.
The following lemma states that under certain conditions almost every Δ-regular graph is an (α,β)-expander.
In addition to the expansion property, random regular graphs may also have the following property.
For 0<a,b<1, we say a bipartite graph G=(L,R,E) with n vertices on both sides has the (a,b)-cover property if ∣NG(U)∣>(1−b)n for all U⊆L or U⊆R with ∣U∣≥an.
2.3. The polymer model
We follow the way in [HPR18] to introduce the polymer model and related tools.
For a complete introduction to this model, see this wonderful book [FV17].
Let G be a graph and Ω be a finite set.
A polymer γ=(γ,ω) consists of a support γ which is a connected subgraph of G and a mapping ω which assigns to each vertex in γ some value in Ω.
We use ∣γ∣ to denote the number of vertices of γ.
There is also a weight function w(γ,⋅):C→C for each polymer γ.
There can be many polymers defined on the graph G and we use Γ∗=Γ∗(G) to denote the set of all polymers defined on it.
However, at the moment we do not give a constructive definition of polymers.
Such definitions are presented when they are needed, see Section 3.2 and Section 5.2.
We say two polymers γ1 and γ2 are compatible if dG(γ1,γ2)>1 and we use γ1∼γ2 to denote that they are compatible.
For a subset Γ⊆Γ∗ of polymers, it is compatible if any two different polymers in this set are compatible.
We define \mathcal{S}(\Gamma^{*})=\left\{\Gamma\subseteq\Gamma^{*}:\,\text{\Gamma is compatible}\right\} to be the collection of all compatible subsets of polymers.
For any Γ∈S(Γ∗), we define Γ to be the the subgraph of G by putting together the support of all polymers in Γ.
It is well defined since Γ is compatible.
We also define Γ to be the number of vertices of the subgraph Γ and ωΓ=∪γ∈Γω.
We say (Γ∗,w) is a polymer model defined on the graph G and the partition function of this polymer model is
[TABLE]
where z is a complex variable and ∏γ∈∅w(γ,z)=1 by convention. The following theorem222Here we only need a special case of the original theorem. states conditions that Ξ(G,z) can be approximated efficiently.
For any graph G=(V,E) with maximum degree Δ and v∈V, the number of connected induced subgraphs of size k≥2 containing v is at most (eΔ)k−1/2.
As a corollary, the number of connected induced subgraphs of size k≥1 containing v is at most (eΔ)k−1.
2.4. Some useful lemmas
Throughout this paper, we use H(x) to denote the binary entropy function
[TABLE]
Moreover, we extend this function to the interval [0,1] by defining H(0)=H(1)=0. This is reasonable since limx→0+H(x)=limx→1−H(x)=0.
Lemma 8**.**
It holds that H(x)≤2x(1−x)≤2x for all 0≤x≤1.
Proof.
Let f(x)=H(x)2x(1−x).
Since f(x)=f(1−x) and f(1/2)=1, it suffices to show that ∂f/∂x≥0 for any 1/2≤x<1.
It holds that
[TABLE]
for all 1/2≤x<1, since g(1/2)=0,limx→1−g(x)=0 and g is concave over [1/2,1).
The concavity of g follows from
[TABLE]
for 1/2≤x<1.
∎
Lemma 9**.**
It holds that H(x)≤−2xlog2x for all 0<x≤1/2.
Proof.
Let f(x)=H(x)+2xlog2x=xlog2x−(1−x)log2(1−x), it suffices to show that f(x)≤0 for x∈(0,1/2]. In fact, limx→0+f(x)=0,f(1/2)=0 and f is convex over (0,1/2]. The convexity of f follows from
[TABLE]
for 0<x≤1/2.
∎
Lemma 10**.**
For all a≥1, H(x)−1/aH(ax)≥x(lna−x)log2e for all 0≤x≤1/a.
Proof.
Recall that 1−x−x≤ln(1−x)≤−x for any 0<x<1. Thus for any 0<x<1/a,
[TABLE]
And the inequality holds trivially for x=0 and x=1/a.
∎
Lemma 11**.**
It holds that H(1−yx)(1−y)−H(x)≤−xylog2e for all 0≤x,y<1 with x+y<1.
Proof.
It holds that for any 0≤x,y<1 with x+y<1,
[TABLE]
Thus it suffices to show that f(x,y)≤0 for 0≤x,y<1 with x+y<1.
Fix 0≤x<1.
We verify that f(x,0)=0 and
Suppose that n is a positive integer and k∈[0,1] is a number such that kn is an integer. Then
[TABLE]
Lemma 13**.**
For b>a>0, the function f(λ)=λa/(λ+1)b is monotonically increasing on [0,b−aa] and monotonically decreasing on [b−aa,+∞).
Proof.
It holds that
[TABLE]
for all λ>0.
∎
3. Counting independent sets for λ≥1
Throughout this section, we consider integers Δ≥53, fugacity λ≥1 and set parameters ζ,α,β to be
[TABLE]
Lemma 14**.**
For Δ≥53, n→∞limG∼Gn,ΔbipPr[G∈Gα,βΔ]=1.
Proof.
We verify that the conditions in Lemma 4 are satisfied.
Recall that ζ=1.28,α=2.9/Δ,β=Δ/(2.9ζ) and Δ≥53.
Clearly 0<α<1/β<1.
Let f(Δ)=Δ−H(α)−αβH(1/β)H(α)+H(αβ).
It follows from Lemma 10 that
[TABLE]
for any Δ≥1000.
Then
[TABLE]
for Δ≥1000.
For 53≤Δ<1000, we can use computers to verify that f(Δ)>0.
Actually, in the current setting of parameters, f(52)≈−0.06<0<f(53)≈0.11.
∎
In the rest of this section, whenever possible, we will simplify notations by omitting superscripts, subscripts and brackets with the symbols between (but this will not happen in the statement of lemmas and theorems).
For example, Z(G,λ) may be written as Z if G and λ are clear from context.
3.1. Approximating Z(G,λ)
For all G=(L,R,E)∈Gα,βΔ,X∈{L,R} and λ≥1, we define
[TABLE]
The main result in this part is that we can use ZL(G,λ)+ZR(G,λ) to approximate Z(G,λ).
Lemma 15**.**
For Δ≥53 and λ≥1, there are constants C=C(Δ)>1 and N=N(Δ) so that for all G∈Gα,βΔ with n>N vertices on both sides, ZL(G,λ)+ZR(G,λ) is a C−n-relative approximation to Z(G,λ).
Proof.
Let N1,C1,N2,C2 be the constants in Lemma 16 and Lemma 17, respectively.
It follows from these lemmas that
[TABLE]
for all n>max(N1,N2).
It is clear that C1−n+C2−n≤2min(C1,C2)−n=(min(C1,C2)/21/n)−n<C−n for another constant C=C(Δ)>1 and for all n>N≥max(N1,N2) where N=N(Δ) is another sufficiently large constant.
Therefore we obtain
[TABLE]
for all n>N.
∎
Lemma 16**.**
For Δ≥3 and λ≥1, there are constants C=C(Δ)>1 and N=N(Δ) so that for all G∈Gα,βΔ with n>N vertices on both sides, ∑I∈IL(G)∪IR(G)λ∣I∣ is a C−n-relative approximation to Z(G,λ).
Proof.
It is clear that
[TABLE]
Let B=I∖(IL∪IR).
For any I∈B, it follows from the definition of B that ∣I∩L∣≥αn and ∣I∩R∣≥αn.
Using the expansion property, we obtain ∣N(I∩L)∣≥β⌊αn⌋ and thus ∣I∩R∣≤n−∣N(I∩L)∣≤(1−1/ζ)n where 1/ζ=β⌊αn⌋/n≥αβ−β/n.
Analogously, it holds that ∣I∩L∣≤(1−1/ζ)n.
In the following, we assume n≥N1 for some N1=N1(Δ)>0, such that
[TABLE]
We obtain an upper bound of ∑I∈Bλ∣I∣ as follows:
a)
Consider an independent set I∈B.
Recall that αn≤∣I∩L∣≤(1−1/ζ)n.
We first enumerate a subset U⊆L with αn≤∣U∣≤(1−1/ζ)n and then enumerate all independent sets I with I∩L=U.
Since 1−1/ζ<1/2, there are at most
[TABLE]
ways to enumerate such a set U, where the inequality follows from Lemma 12.
2. b)
Now fix a set U⊆L.
Recall that every independent set I∈B satisfies ∣I∩R∣≤(1−1/ζ)n.
Therefore
for all λ≥1. So there exists some constant C>1 such that
[TABLE]
for all n>N≥N1 where N=N(Δ) is another sufficiently large constant.
Using the upper bound on Equation 4 and 1+x≤exp(x) for any x∈R we obtain
[TABLE]
for all n>N.
∎
Lemma 17**.**
For Δ≥53 and λ≥1, there are constants C>1 and N so that for all G∈Gα,βΔ with n>N vertices on both sides, ZL(G,λ)+ZR(G,λ) is a C−n-relative approximation to ∑I∈IL(G)∪IR(G)λ∣I∣.
Proof.
For any I∈IL∩IR, it holds that ∣I∩L∣<αn and ∣I∩R∣<αn.
Clearly ∑I∈IL∪IRλ∣I∣≥(λ+1)n.
Therefore
[TABLE]
where the last inequality follows from Lemma 12.
Recall that α=2.9/Δ and Δ≥53.
Then
[TABLE]
It follows from Lemma 13 that 4H(α)λ2α/(λ+1) is monotonically decreasing in λ on [1,∞) for all fixed Δ≥53.
Thus
[TABLE]
for some constant C>1 and for all n>N where N is a sufficiently large constant.
Using the upper bound on Equation 5 and 1+x≤exp(x) for any x∈R we obtain
[TABLE]
for all n>N.
∎
3.2. Approximating ZX(G,λ)
In this subsection, we discuss how to approximate ZX(G,λ) for any graph G∈Gα,βΔ,X∈{L,R} and λ≥1.
We will use the polymer model (see Section 2.3).
First we constructively define the polymers we need.
For any I∈IX(G), we can partition the graph (G2)[I∩X] into connected components U1,U2,…,Uk for some k≥0 (trivially k=0 if I∩X=∅). There are no edges in G2 between Ui and Uj for any 1≤i=j≤k.
If k>0, let p(I)={(U1,1U1),(U2,1U2),…,(Uk,1Uk)} where 1Ui is the unique mapping from Ui to {1}.
If k=0, let p(I)=∅.
We define the set of all polymers to be
[TABLE]
and each element in this set is called a polymer. When the graph G and X are clear from the context, we simply denote by Γ∗ the set of polymers.
Clearly, p is a mapping from IX(G) to the set {Γ∈S(ΓX∗(G)):Γ<αn} since p(I)=∣I∩X∣<αn for all I∈IX(G).
For each polymer γ, define its weight function w(γ,⋅) as
[TABLE]
where z is a complex variable. The weight function can be computed in polynomial time in ∣γ∣.
The partition function of the polymer model (Γ∗,w) on the graph G2 is the following sum:
[TABLE]
Recall that two polymers γ1 and γ2 are compatible if dG2(γ1,γ2)>1 and this condition is equivalent to dG(γ1,γ2)>2.
Lemma 18**.**
For all bipartite graphs G=(L,R,E) with n vertices on both sides, X∈{L,R} and λ≥0,
[TABLE]
Proof.
Recall that in the definition of polymers, p is a mapping from IX to {Γ∈S(Γ∗):Γ<αn}.
Thus
[TABLE]
Fix Γ∈S(Γ∗) with Γ<αn.
It holds that
[TABLE]
where the last equality follows from Γ<αn.
Since Γ is compatible, NG(Γ)=⊔γ∈ΓNG(γ) and (L⊔R)∖(X⊔NG(Γ))=n−∑γ∈Γ∣NG(γ)∣.
Thus
[TABLE]
This completes the proof.
∎
Lemma 19**.**
For Δ≥53 and λ≥1, there are constants C>1 and N so that for all G=(L,R,E)∈Gα,βΔ with n>N vertices on both sides and X∈{L,R},
[TABLE]
is a C−n-relative approximation to ZX(G,λ).
Proof.
It is clear that ZX(G,λ)≥(λ+1)n.
Then using Lemma 18 and Lemma 21 we obtain
[TABLE]
To enumerate each Γ∈S(Γ∗) with Γ≥αn at least once, we first enumerate an integer αn≤k≤n, then since Γ⊆X, we choose k vertices from X.
Therefore
[TABLE]
where the inequalities follow from Lemma 12 and Lemma 8.
Recall that ζ=1.28, α=2.9/Δ,β=Δ/(2.9ζ) and Δ≥53.
Let f(Δ)=21/α−β=2Δ/2.9−Δ/(2.9ζ).
We obtain
[TABLE]
It follows from Lemma 20 that f(Δ) is monotonically decreasing in Δ on [53,+∞).
Thus
[TABLE]
for some constant C>1 and for all n>N where N is a sufficiently large constant.
Using the upper bound on Equation 7 and 1+x≤exp(x) for any x∈R we obtain
[TABLE]
for all n>N.
∎
Lemma 20**.**
The function f(Δ)=21/α−β is monotonically decreasing on [53,+∞).
Proof.
Recall that ζ=1.28, α=2.9/Δ,β=Δ/(2.9ζ). It holds that
[TABLE]
for all Δ≥53.
∎
Lemma 21**.**
For all polymers γ∈Γ∗ defined by G=(L,R,E)∈Gα,βΔ, X∈{L,R} and λ≥1,
[TABLE]
As a corollary, w(γ,1)≤2−β∣γ∣ and for all compatible Γ⊆Γ∗(G),
[TABLE]
Proof.
Let n=∣L∣=∣R∣ and let γ be any polymer. It follows from the definition of polymers that ∣γ∣≤αn and by the expansion property, ∣N(γ)∣≥β∣γ∣.
Thus we have
[TABLE]
where the last inequality follows from Lemma 13 since β>1 and λ≥1. In particular, w(γ,1)≤2−β∣γ∣. For any compatible Γ, it holds that Γ=∑γ∈Γ∣γ∣. Thus
∏γ∈Γw(γ,1)≤∏γ∈Γ2−β∣γ∣=2−β∣Γ∣.
∎
3.3. Approximating the partition function of the polymer model
Lemma 22**.**
For Δ≥53 and λ≥1, there is an FPTAS for Ξ(1) for all G=(L,R,E)∈Gα,βΔ and X∈{L,R}.
Proof.
We use the FPTAS in Theorem 5 to design the FPTAS we need.
To this end, we generate a graph G2 in polynomial time in ∣G∣ for any G∈Gα,βΔ.
We use this new graph G2 as input to the FPTAS in Theorem 5.
It is straightforward to verify the first three conditions in Theorem 5, only with the exception that the information of G2 may not be enough because certain connectivity information in G is discarded in G2.
Nevertheless, we can use the original graph G whenever needed and thus the first three conditions are satisfied.
For the last condition, Lemma 23 verifies it.
∎
Lemma 23**.**
There is a constant R>1 so that for Δ≥53 and λ≥1, Ξ(z)=0 for all G∈Gα,βΔ, X∈{L,R} and z∈C with ∣z∣<R, .
Proof.
Set R=1.001.
For any γ∈Γ∗, let a(γ)=t∣γ∣ where t=(−1+1+8e)/(4e)≈0.346.
We will verify that the KP-condition
[TABLE]
holds for any γ∗∈Γ∗ and any ∣z∣<R. It then follows from Lemma 6 that Ξ(z)=0 for any ∣z∣<R.
Recall that dG2(γ,γ∗)≤1 for all γ∼γ∗.
Thus there is always a vertex v∈γ⊆X such that v∈γ∗⊔NG2(γ∗).
The number of such vertices v is at most Δ2γ∗.
So to enumerate each γ∼γ∗ at least once, we can
a)
first enumerate a vertex v in X∩(γ∗∪NG2(γ∗));
2. b)
then enumerate an integer k from 1 to ⌊αn⌋;
3. c)
finally enumerate γ with v∈γ and ∣γ∣=k.
Since γ is connected in G2, applying Lemma 7 and using Lemma 21 to bound ∣w(γ,z)∣ we obtain
[TABLE]
Let x=et+1Δ22−βR. Since ∣z∣<R, we obtain
[TABLE]
Recall that ζ=1.28, β=Δ/(2.9ζ) and Δ≥53.
It follows from Lemma 24 that Δ22−β is monotonically decreasing in Δ on [53,+∞).
Thus it holds that
[TABLE]
and hence
[TABLE]
This completes the proof.
∎
Lemma 24**.**
The function f(Δ)=Δ22−β is monotonically decreasing on [53,+∞).
Proof.
Recall that ζ=1.28, β=Δ/(2.9ζ).
It is equivalent to show that ∂lnf/∂Δ<0 for all Δ≥53.
It holds that
[TABLE]
for all Δ≥53.
∎
3.4. Putting things together
Using the results from previous parts, we obtain our main result for counting independent sets.
Theorem 1.
For Δ≥53 and fugacity λ≥1, with high probability (tending to 1 as n→∞) for a graph G chosen uniformly at random from Gn,Δbip, there is an FPTAS for the partition function Z(G,λ).
For Δ≥53 and λ≥1, there is an FPTAS for Z(G,λ) for all G∈Gα,βΔ.
Proof.
First we state our algorithm.
See Algorithm 1 for a pseudocode description.
The input is a graph G=(L,R,E)∈Gα,βΔ and an approximation parameter ε>0.
The output is a number Z to approximate Z(G,λ). We use ΞX(z) to denote the partition function of the polymer model (ΓX∗(G),w) for X∈{L,R}.
Let N1,C2,N2,C2 be the constants in Lemma 15 and Lemma 19, respectively.
These two lemmas show that (λ+1)n(ΞL(1)+ΞR(1)) is a C1−n+C2−n≤2min(C1,C2)−n≤C−n-relative approximation to Z(G,λ) for another constant C>1 and all n>N≥max(N1,N2) where N is another sufficiently large constant.
If n≤N or ε≤2C−n, we use the brute-force algorithm to compute Z(G,λ).
If ε>2C−n, we apply the FPTAS in Lemma 22 with approximation parameter ε′=ε−C−n to obtain outputs ZL and ZR which approximate ΞL(1) and ΞR(1) , respectively.
Let Z=(λ+1)n(ZL+ZR) be the output.
It is clear that exp(−ε)Z≤Z(G,λ)≤exp(ε)Z.
Then we show that Algorithm 1 is indeed an FPTAS.
It is required that the running time of our algorithm is bounded by (n/ε)C3 for some constant C3 and for all n>N3 where N3 is a constant.
Let N3=N.
If ε≤2C−n, the running time of the algorithm would be 2.1n≤(nCn/2)C3≤(n/ε)C3 for sufficient large C3.
If ε>2C−n, the running time of the algorithm would be (n/ε′)C4=(n/(ε−C−n))C4≤(2n/ε)C4≤(n/ε)C3 for sufficient large C3, where C4 is a constant from the FPTAS in Lemma 22.
∎
4. Counting independent sets for λ=Ω(Δ1)
Let λl=Δ(lnΔ)4=Ω(Δ1).
Throughout this section, we consider sufficiently large integers Δ, fugacity λ>λl and set parameters α,β to be
[TABLE]
We define a set Gα,α,βΔ of graphs as
[TABLE]
Lemma 26**.**
For all sufficiently large integers Δ,
n→∞limG∼Gn,ΔbipPr[G∈Gα,α,βΔ]=1.
Proof.
In this proof we only consider sufficiently large integers Δ.
Recall that α=Δ(lnΔ)2 and β=3α1.
It suffices to show that
[TABLE]
First we verify that the conditions in Lemma 4 are satisfied and then Equation 10 follows.
Clearly, 0<α<1/β<1.
Let f(Δ)=Δ−H(α)−αβH(1/β)H(α)+H(αβ).
Recall that Δ is sufficiently large.
Thus α can be sufficiently small.
Using Lemma 10 we obtain
[TABLE]
Hence
[TABLE]
Then we show that Equation 11 is satisfied.
It is equivalent to show that
[TABLE]
Assume that a Δ-regular bipartite graph G=(L,R,E) with n vertices on both sides does not have this property.
Then there is a pair (U,V) with U⊆L,V⊆R or U⊆R,V⊆L that ∣U∣=⌈αn⌉,∣V∣=⌈αn⌉ and N(U)∩V=∅.
Applying union bound we obtain
[TABLE]
Using Lemma 12 and the perfect matching generation procedure of the distribution Gn,Δbip, we obtain
as n→∞.
Recall that Δ is sufficiently large.
Using Lemma 9 and Lemma 11 we obtain
[TABLE]
for some constant C=C(Δ)<0 as n→∞.
Therefore
[TABLE]
as n→∞.
∎
Putting together Theorem 1 and the result in this section, we obtain the following.
Theorem 2.
For all sufficiently large integers Δ and fugacity λ=Ω(Δ1), with high probability (tending to 1 as n→∞) for a graph G chosen uniformly at random from Gn,Δbip, there is an FPTAS for the partition function Z(G,λ).
Proof.
Let α′,β′ be the parameters in Section 3.
Let G=Gα′,β′Δ∩Gα,α,βΔ.
It then follows from Lemma 14 and Lemma 26 that n→∞limG∈Gn,ΔbipPr[G∈G]=1.
For λ≥1, we apply the algorithm from Theorem 1.
For λl<λ<1, we apply the algorithm from Lemma 33.
∎
Therefore, in the rest of this section, we only consider fugacity λl<λ<1.
The notations and definitions in the rest of this section would be identical to those in Section 3.
So we only review needed materials briefly and state results different from those in Section 3.
4.1. Approximating Z(G,λ)
Recall that
[TABLE]
The main result in this part is that we can use ZL(G,λ)+ZR(G,λ) to approximate Z(G,λ) for all λl<λ<1.
Lemma 27**.**
For all sufficiently large integers Δ, there are constants C=C(Δ)>1 and N=N(Δ) so that for all G∈Gα,α,βΔ with n>N vertices on both sides and λl<λ<1,
ZL(G,λ)+ZR(G,λ) is a C−n-relative approximation to Z(G,λ).
Proof.
In this proof we only consider sufficiently large integers Δ.
Applying Lemma 28, it suffices to show that ZL(G,λ)+ZR(G,λ) is a C−n-relative approximation to ∑I∈IL∪IRλ∣I∣.
For any I∈IL∩IR, it holds that ∣I∩L∣<αn and ∣I∩R∣<αn.
Clearly ∑I∈IL∪IRλ∣I∣≥(λ+1)n.
Using α≤1/2, Lemma 12 and λl<λ<1 we obtain
[TABLE]
Recall that Δ is sufficiently large, α=Δ(lnΔ)2 and λl=Δ(lnΔ)4.
Using Lemma 9 and ln(x+1)≥x/2 for any 0≤x≤1 we obtain
[TABLE]
for some constant C1=C1(Δ)<0.
Therefore
[TABLE]
for another constant C=C(Δ)>1 and for all n>N where N=N(Δ) is a sufficiently large constant.
Using the upper bound on Equation 13 and 1+x≤exp(x) for any x∈R we obtain
[TABLE]
for all n>N.
∎
Lemma 28**.**
For Δ≥3, G∈Gα,α,βΔ and λ∈R,
∑I∈IL(G)∪IR(G)λ∣I∣=Z(G,λ).
Proof.
Let B=I∖(IL∪IR).
If suffices to show that B=∅.
Suppose B is not empty.
Then there is an independent set I∈B such that ∣I∩L∣≥αn and ∣I∩R∣≥αn.
Applying the cover property, we obtain that ∣I∩R∣≤∣R∖N(I∩L)∣<αn, which contradicts that ∣I∩R∣≥αn.
Thus B=∅.
∎
4.2. Approximating ZX(G,λ)
Recall that for all G=(L,R,E)∈Gα,α,βΔ with n vertices on both sides and X∈{L,R}, we defined a polymer model (ΓX∗(G),w) of the graph G2. The partition function of this model is denoted by
[TABLE]
where z is a complex variable and w(γ,1)=λ∣γ∣(λ+1)−∣N(γ)∣z∣γ∣.
Lemma 29**.**
For all sufficiently large integers Δ,
there are constants C=C(Δ)>1 and N=N(Δ) so that for all G=(L,R,E)∈Gα,α,βΔ with n>N vertices on both sides, X∈{L,R} and λl<λ<1,
[TABLE]
is a C−n-relative approximation to ZX(G,λ).
Proof.
In this proof we only consider sufficiently large integers Δ.
It is clear that ZX(G,λ)≥(λ+1)n.
Then using Lemma 18 and the cover property we obtain
[TABLE]
For any γ, since ∣γ∣<αn, it follows from the expansion property that ∣NG(γ)∣≥β∣γ∣.
The compatibility of Γ states that dG(γ1,γ2)>2 for any γ1=γ2 in Γ, implying NG(γ1)∩NG(γ2)=∅.
Using these two facts, for any Γ∈S(Γ∗),
[TABLE]
implying that Γ≤n/β.
To enumerate each Γ∈S(Γ∗) with Γ≥αn at least once, we first enumerate an integer αn≤k≤n/β, then since Γ⊆X, we choose k vertices from X.
Recall that Δ is sufficiently large.
Using Lemma 12, α<1/β≤1/2, αβ=1/3 and λl<λ<1 we obtain
[TABLE]
Recall that α=Δ(lnΔ)2 and λl=Δ(lnΔ)4.
Using Lemma 9 and ln(x+1)≥x/2 for any 0≤x≤1 we obtain
[TABLE]
for some constant C1=C1(Δ)<0.
Therefore
[TABLE]
for some constant C=C(Δ)>1 and for all n>N where N=N(Δ) is a sufficiently large constant.
Using the upper bound on Equation 14 and 1+x≤exp(x) for any x∈R we obtain
[TABLE]
for all n>N.
∎
Lemma 30**.**
For all polymers γ∈ΓX∗(G) defined by G=(L,R,E)∈Gα,α,βΔ,X∈{L,R} and λl<λ<1,
w(γ,1)≤(λ+1)−β∣γ∣.
Proof.
For every γ∈Γ∗, it follows from the definition of polymers that ∣γ∣<αn.
Using the expansion property we obtain
[TABLE]
4.3. Approximating the partition function of the polymer model
Lemma 31**.**
For all sufficiently large integers Δ and λl<λ<1, there is an FPTAS for Ξ(1) for all G=(L,R,E)∈Gα,α,βΔ and X∈{L,R}.
Proof.
We use the FPTAS in Theorem 5 to design the FPTAS we need.
To this end, we generate a graph G2 in polynomial time in ∣G∣ for any G∈Gα,α,βΔ.
We use this new graph G2 as input to the FPTAS in Theorem 5.
It is straightforward to verify the first three conditions in Theorem 5, only with the exception that the information of G2 may not be enough because certain connectivity information in G is discarded in G2.
Nevertheless, we can use the original graph G whenever needed and thus the first three conditions are satisfied.
For the last condition, Lemma 32 verifies it.
∎
Lemma 32**.**
There is a constant R>1 so that for all sufficiently large integers \Delta,G=(\mathcal{L},\mathcal{R},E)$$\in{\mathcal{G}^{\Delta}_{\alpha,\alpha,\beta}},\mathcal{X}\in\left\{\mathcal{L},\mathcal{R}\right\} and z∈C with ∣z∣<R, Ξ(z)=0.
Proof.
In this proof we only consider sufficiently large integers Δ.
Set R=2.
For any γ, let a(γ)=∣γ∣.
We will verify that the KP-condition
[TABLE]
holds for any γ∗ and any ∣z∣<R.
It then follows from Lemma 6 that Ξ(z)=0 for any ∣z∣<R.
Recall that dG2(γ,γ∗)≤1 for all γ∼γ∗.
Thus there is always a vertex v∈γ⊆X such that v∈γ∗⊔NG2(γ∗).
The number of such vertices v is at most Δ2γ∗.
So to enumerate each γ∼γ∗ at least once, we can
a)
first enumerate a vertex v in X∩(γ∗∪NG2(γ∗));
2. b)
then enumerate an integer k from 1 to ⌊αn⌋;
3. c)
finally enumerate γ with v∈γ and ∣γ∣=k.
Since γ is connected in G2, using Lemma 7 and Lemma 30 and λl<λ<1 we obtain
[TABLE]
Recall that Δ is sufficiently large, β=3α1=3(lnΔ)2Δ and λl=Δ(lnΔ)4.
Using ln(x+1)≥x/2 for any 0≤x≤1 we obtain
For all sufficiently large integers Δ and λl<λ<1, there is an FPTAS for Z(G,λ) for all G∈Gα,α,βΔ.
Proof.
This can be readily obtained by replacing facts used in the proof of Lemma 25 with corresponding results obtained in this section.
∎
5. Counting colorings
Throughout this section, we consider integers q≥3,Δ≥100q10 and set parameters s,α,β to be
[TABLE]
We define a set Gq,s,α,βΔ of graphs as
[TABLE]
Lemma 34**.**
For q≥3 and Δ≥100q10, n→∞limG∼Gn,ΔbipPr[G∈Gq,s,α,βΔ]=1.
Proof.
Recall that s=18q51,α=Δ1/21 and β=3Δ1/2.
It suffices to show that
[TABLE]
First we verify that the conditions in Lemma 4 are satisfied and then Equation 17 follows.
Let f(Δ)=Δ−H(α)−αβH(1/β)H(α)+H(αβ).
It follows from Lemma 10 that
[TABLE]
for any Δ≥4.
Then
[TABLE]
for any Δ≥100.
Then we show that Equation 18 is satisfied. It is equivalent to show that
[TABLE]
Assume that a Δ-regular bipartite graph G=(L,R,E) with n vertices on both sides does not have the (s,α/q)-cover property.
Then there is a pair (U,V) with U⊆L,V⊆R or U⊆R,V⊆L that ∣U∣=⌈sn⌉,∣V∣=⌈α/qn⌉ and N(U)∩V=∅.
Thus
[TABLE]
Using Lemma 12 and the perfect matching generation procedure of the distribution Gn,Δbip, we obtain
[TABLE]
Recall that s=18q51 and α=Δ1/21.
It then follows from Lemma 8 that
[TABLE]
for all sufficiently large n.
Using Lemma 11 we obtain
[TABLE]
for some constant C>1 and for all sufficiently large n.
Therefore
[TABLE]
as n→∞.
∎
In the rest of this section, whenever possible, we will simplify notations by omitting superscripts, subscripts and brackets with the symbols between (but this will not happen in the statement of lemmas and theorems).
For example, C(G) may be written as C if G is clear from context.
5.1. Approximating ∣C(G)∣
For all q≥3,Δ≥3,G=(L,R,E)∈Gq,s,α,βΔ and ∅⊊X⊊[q], we define
[TABLE]
where dX(σ)=σL−1([q]∖X)+σR−1(X) (recall that σL=σ∣L and σR=σ∣R).
The main result of this subsection is that we can use ∑X:∣X∣∈{q,q}∣C(G)∣ to approximate ∣C(G)∣.
Lemma 35**.**
For q≥3 and Δ≥100q10, there are constants C=C(q)>1 and N=N(q) such that for all G∈Gq,s,α,βΔ with n>N vertices on both sides,
Z is a C−n-relative approximation to ∣C(G)∣, where Z=(qq)C[q](G) if q is even, otherwise Z=(qq)(C[q](G)+C[q](G)).
Proof.
Let N1,C1,N2,C2 and N3,C3 be the constants in Lemma 36, Lemma 37 and Lemma 38, respectively.
It follows from these lemmas that
[TABLE]
for all n>max(N1,N2,N3).
It is clear that
[TABLE]
for another constant C=C(q)>1 and for all n>N≥max(N1,N2,N3) where N=N(q) is another sufficiently large constant.
Therefore we obtain
[TABLE]
for all n>N.
∎
Lemma 36**.**
For q≥3 and Δ≥100q10, there are constants C=C(q)>1 and N=N(q) such that for all G∈Gq,s,α,βΔ with n>N vertices on both sides,
⋃X:∅⊊X⊊[q]C(G) is a C−n-relative approximation to ∣C(G)∣.
Proof.
For any coloring ω, let
[TABLE]
Fix σ∈C.
If maj(σL)∩maj(σR)=∅, then there exists a color c∈[q] that σL−1(c)≥sn and σR−1(c)≥sn.
Since σL−1(c)≥sn, it follows from the cover property that N(σL−1(c))>(1−α/q)n.
Since σ is proper, then σR−1(c)≤n−N(σL−1(c))<α/qn<sn, which contradicts that σR−1(c)≥sn.
Therefore, maj(σL)∩maj(σR)=∅ for any σ∈C.
Let B={σ∈C:σ∈∪XCX}.
We claim that ∣maj(σL)∣+∣maj(σR)∣≤q−1 for any σ∈B. Suppose that ∣maj(σL)∣+∣maj(σR)∣=q for some σ. Let X=maj(σL). Then we have
[TABLE]
By definition σ∈CX(G) and thus σ∈/B.
We give an upper bound of ∣B∣ via the following procedure which enumerates each σ∈B at least once.
a)
Recall that ∣maj(σL)⊔maj(σR)∣≤q−1 for any σ∈B.
Thus we enumerate two sets A,B⊆[q] such that ∣A⊔B∣=q−1.
Clearly, there are at most q2q ways to enumerate such sets.
2. b)
Assume that A and B have been enumerated out.
Then we enumerate colorings σ∈B with maj(σL)⊆A and maj(σR)⊆B.
To this end, we can enumerate σL and σR independently and combine them together.
3. c)
Consider σL with maj(σL)⊆A.
Clearly, σL−1([q]∖A)≤(q−∣A∣)sn.
Thus we enumerate a set Lminor⊆L with size ⌊(q−∣A∣)sn⌋.
Since qs≤1/2, there are at most (⌊(q−∣A∣)sn⌋n)≤(⌊qsn⌋n) ways to enumerate such a set.
4. d)
Assume that Lminor has been enumerated out.
Then we count colorings σ∈B with σL−1([q]∖A)⊆Lminor.
The number of such colorings is upper bounded by q(q−∣A∣)sn∣A∣n.
5. e)
Putting c) and d) together, there are at most (⌊qsn⌋n)q(q−∣A∣)sn∣A∣n ways to enumerate colorings σL with maj(σL)⊆A.
Analogously, there are at most (⌊qsn⌋n)q(q−∣B∣)sn∣B∣n ways to enumerate colorings σR with maj(σR)⊆B.
6. f)
Recall that s=18q51.
It holds that qs≤9q41.
Using Lemma 8, ln(1+x)≤x for any x>−1 and q≥2 we obtain
[TABLE]
for some constant C1=C1(q)>1.
Therefore,
[TABLE]
for another constant C=C(q)>1 and n>N where N=N(q) is a sufficiently large constant.
Using the upper bound on Equation 20 and 1+x≤exp(x) for any x∈R we obtain
[TABLE]
for all n>N.
∎
Lemma 37**.**
For q≥3 and Δ≥100q10, there are constants C=C(q)>1 and N=N(q) such that for all G∈Gq,s,α,βΔ with n>N vertices on both sides,
∑X:∅⊊X⊊[q]∣C(G)∣ is a C−n-relative approximation to ⋃X:∅⊊X⊊[q]C(G).
Proof.
Fix two sets ∅⊊X=Y⊊[q].
Clearly, ∣X∩Y∣+∣[q]∖(X∪Y)∣≤(max(∣X∣,∣Y∣)−1)+(q−max(∣X∣,∣Y∣))=q−1.
For any σ∈CX∩CY, it holds that
[TABLE]
This shows that for σ∈CX∩CY most of the vertices in L are colored using colors from X∩Y and most of the vertices in R are colored using colors from [q]∖(X∪Y).
According to this,
we can upper bound ∣CX∩CY∣ via the following procedure which enumerates each σ∈CX∩CY at least once.
First we enumerate a set B⊆L∪R with ∣B∣=⌊2αn⌋.
Then the vertices in B can be colored arbitrarily, but the vertices in L∖B can only be colored with colors from X∩Y and the vertices in R∖B can only be colored with colors from [q]∖(X∪Y).
Thus we obtain
[TABLE]
where the inequality follows from Lemma 12 and ∣X∩Y∣+∣[q]∖(X∪Y)∣≤q−1.
It is clear that ∣∪XCX∣≥qnqn and we obtain
[TABLE]
Recall that s=18q51 and α=Δ1/21≤10q51.
Since α≤qs≤1/2 and 2α≤(q+1)s, it follows from the upper bound on Equation 21 that
[TABLE]
for some constant C1=C1(q)>1.
Therefore
[TABLE]
for another constant C=C(q)>1 and n>N where N=N(q) is a sufficiently large constant.
Using the upper bound on Equation 22 and 1+x≤exp(x) for any x∈R we obtain
[TABLE]
for all n>N.
∎
Lemma 38**.**
For q≥3 and Δ≥100q10, there are constants C=C(q)>1 and N=N(q) such that for all G∈Gq,s,α,βΔ with n>N vertices on both sides,
Z is a C−n-relative approximation to ∑X:∅⊊X⊊[q]∣C(G)∣, where Z=(qq)C[q](G) if q is even, otherwise Z=(qq)(C[q](G)+C[q](G)).
Proof.
It follows from the symmetry of colors that ∣CX∣=∣CY∣ for any X and Y with ∣X∣=∣Y∣.
Fix Y with ∣Y∣<q or ∣Y∣>q.
We upper bound ∣CY∣ via the following procedure which enumerates each coloring σ∈CY at least once.
For each σ∈CY, it holds that dY(σ)<αn.
Thus we can enumerate a set B⊆L∪R with ∣B∣=⌊αn⌋.
The vertices in B can be colored arbitrarily, but the colors of the vertices in L∖B can only be chosen from Y and the vertices in R∖B can only be colored with colors from [q]∖Y.
Thus we obtain
[TABLE]
where the inequality follows from Lemma 12 and ∣Y∣⋅∣[q]∖Y∣≤(q−1)(q+1).
Clearly Z≥qnqn and we obtain
[TABLE]
Recall that α=Δ1/21≤10q51.
Using Lemma 8, ln(1+x)≤x for any x>−1 and q≥2 we obtain
[TABLE]
for some constant C1=C1(q)>1.
Therefore
[TABLE]
for another constant C=C(q)>1 and n>N where N=N(q) is a sufficiently large constant.
Using the upper bound on Equation 23 and 1+x≤exp(x) for any x∈R we obtain
[TABLE]
for all n>N.
∎
5.2. Approximating ∣C(G)∣
In this subsection, we discuss how to approximate ∣C(G)∣ for G=(L,R,E)∈Gq,s,α,βΔ and X⊆[q] with ∣X∣∈{q,q}.
We will use the polymer model (see Section 2.3).
First we constructively define the polymers we need.
For any σ∈C(G), let U={v∈L:σ(v)∈X}∪{v∈R:σ(v)∈[q]∖X}.
We can partition the graph (G2)[U] into connected components U1,U2,…,Uk for some k≥0. There are no edges in G2 between Ui and Uj for any 1≤i=j≤k.
If k>0, let p(σ)={(U1,σ∣U1),(U2,σ∣U2),…,(Uk,σ∣Uk)}.
If k=0, let p(σ)=∅.
We define the set of all polymers to be
[TABLE]
and each element in this set is called a polymer.
When the graph G and X are clear from the context, we simply denote by Γ∗ the set of polymers.
For each polymer γ∈Γ∗, define its weight function w(γ,⋅) as
[TABLE]
where z is a complex variable and
[TABLE]
The number of colorings in Cγ(G) can be computed in polynomial time in ∣γ∣ since ∣N(γ)∣≤β∣γ∣ and
[TABLE]
where V(γ) is the set of vertices of the subgraph γ.
The partition function of the polymer model (Γ∗,w) on the graph G2 is the following sum:
[TABLE]
Recall that two polymers γ1 and γ2 are compatible if dG2(γ1,γ2)>1 and this condition is equivalent to dG(γ1,γ2)>2.
We also extend the definition of Cγ(G) to Γ∈S(Γ∗(G)):
[TABLE]
Lemma 39**.**
For q≥3, all bipartite graphs G=(L,R,E) with n vertices on both sides and ∅⊊X⊊[q],
Proof.
Rewrite the right hand side of LABEL:eq:exact-rep as
[TABLE]
where the last step follows from Lemma 41. It is now sufficient to show that the set
[TABLE]
is a partition of CX.
It follows from the definition of CΓ that CΓ1∩CΓ2=∅ if Γ1=Γ2.
For any σ∈CX, it follows from the definition of p(σ) that p(σ) is compatible and p(σ)<αn, which shows that p(σ)∈P and thus CX⊆∪CΓ∈PCΓ.
For any σ∈CΓ∈P, it follows from the definition of CΓ that dX(σ)<αn, which implies that σ∈CX and thus ∪CΓ∈PCΓ⊆CX.
∎
Lemma 40**.**
For q≥3 and Δ≥100q10, there are constants C=C(q)>1 and N=N(q) such that for all G∈Gq,s,α,βΔ with n>N vertices on both sides and X⊆[q] with ∣X∣∈{q,q},
[TABLE]
is a C−n-relative approximation to ∣C(G)∣.
Proof.
Clearly ∣CX∣≥qnqn.
Then using Lemma 39 and Lemma 42 we obtain
[TABLE]
To enumerate each Γ∈S(Γ∗) with Γ≥αn at least once, we first enumerate an integer αn≤k≤2n, then we choose k first vertices from L∪R and enumerate all possible colorings over these k vertices.
Therefore
[TABLE]
where the inequalities follow from Lemma 12 and Lemma 8.
Recall that α=Δ1/21 and β=3Δ1/2.
Let f(Δ)=42/αq(1−1/q)β−1.
Using Δ≥100q10, q≥2, and the inequality ln(1+x)≤x for any x>−1, we obtain
[TABLE]
Since 2ln4−3220≈−1.02<−1, we obtain
[TABLE]
Therefore, we have
[TABLE]
for some constant C=C(q)>1 and for all n>N where N=N(q) is a sufficiently large constant.
Using the upper bound on Equation 24 and 1+x≤exp(x) for any x∈R we obtain
[TABLE]
for all n>N.
∎
Lemma 41**.**
For q≥3, all bipartite graphs G=(L,R,E) with n vertices on both sides, ∅⊊X⊊[q] and Γ∈S(ΓX∗(G)),
Proof.
For any γ∈Γ, let Vγ=γ⊔NG(γ). It holds that
where Cγ(G[Vγ]) is the set of colorings σ∈[q]Vγ that is proper in the graph G[Vγ], σγ=ω, σ(N(γ)∩L)⊆X and σ(N(γ)∩R)⊆[q]∖X.
Since Γ is compatible, for any different γ1∈Γ and γ2∈Γ, it holds that dG(γ1,γ2)>2 and thus Vγ1∩Vγ2=∅.
Let l=n−∣(⊔γ∈ΓVγ)∩L∣ and r=n−∣(⊔γ∈ΓVγ)∩R∣. Then we have
[TABLE]
where the first step follows from the definition of Cγ(G[Vγ]), the second step follows from that Vγ1∩Vγ2=∅ for any different γ1,γ2∈Γ and the last step follows from LABEL:eq:midstep-2.
∎
Lemma 42**.**
For q≥3,Δ≥100q10,G∈Gq,s,α,βΔ,∅⊊X⊊[q] with ∣X∣∈{q,q} and γ∈Γ∗(G),
[TABLE]
As a corollary, for any compatible Γ⊆Γ∗(G),
[TABLE]
Proof.
With out loss of generality, we fix ∅⊊X⊊[q] with ∣X∣=q and the other case (if exist) is symmetric.
Fix γ∈Γ∗.
Since G is an (α,β)-expander and ∣γ∣≤αn, it follows from Lemma 43 that ∣N(γ)∣≥(β−1)∣γ∣.
Let l=∣N(γ)∩L∣ and r=∣N(γ)∩R∣. Then
[TABLE]
For any compatible Γ, it holds that Γ=∑γ∈Γ∣γ∣. Thus
[TABLE]
Lemma 43**.**
For Δ≥3 and G=(L,R,E)∈Gα,βΔ with n vertices on both sides, ∣NG(U)∣≥(β−1)∣U∣ for all U⊆L∪R with ∣U∣≤αn.
Proof.
It follows from the expansion property that
[TABLE]
5.3. Approximating the partition function of the polymer model
Lemma 44**.**
For q≥3 and Δ≥100q10, there is an FPTAS for Ξ(1) for all G∈Gq,s,α,βΔ and X⊆[q] with ∣X∣∈{q,q}.
Proof.
We use the FPTAS in Theorem 5 to design the FPTAS we need.
To this end, we generate a graph G2 in polynomial time in ∣G∣ for any G∈Gq,s,α,βΔ.
We use this new graph G2 as input to the FPTAS in Theorem 5.
It is straightforward to verify the first three conditions in Theorem 5, only with the exception that the information of G2 may not be enough because certain connectivity information in G is discarded in G2.
Nevertheless, we can use the original graph G whenever needed and thus the first three conditions are satisfied.
For the last condition, Lemma 45 verifies it.
∎
Lemma 45**.**
There is a constant R>1 such that for all q≥3, Δ≥100q10, G∈Gq,s,α,βΔ and X⊆[q] with ∣X∣∈{q,q}, Ξ(z)=0 for all z∈C with ∣z∣<R.
Proof.
Set R=2.
For any γ∈Γ∗, let a(γ)=∣γ∣.
We will verify that the KP-condition
[TABLE]
holds for any γ∗∈Γ∗ and any ∣z∣<R.
It then follows from Lemma 6 that Ξ(z)=0 for any ∣z∣<R.
Fix γ∗∈Γ∗.
Recall that dG2(γ,γ∗)≤1 for all γ∼γ∗.
Thus there is always a vertex v∈γ such that v∈γ∗⊔NG2(γ∗).
The number of such vertices v is at most (Δ2+1)γ∗.
So to enumerate each γ=γ∗ at least once, we can
a)
first enumerate a vertex v∈γ∗⊔NG2(γ∗);
2. b)
then enumerate an integer k from 1 to ⌊αn⌋;
3. c)
finally enumerate γ with v∈γ and γ=k.
Since γ is connected in G2, applying Lemma 7 and using Lemma 42 to bound ∣w(γ,z)∣ we obtain
[TABLE]
Adding some extra nonnegative terms and using ∣z∣<R, we obtain
[TABLE]
Recall that β=3Δ1/2 and Δ≥100q10.
It holds that
[TABLE]
where the inequalities follow from the monotonicity of corresponding functions.
Therefore
Using the results from previous parts, we obtain our main result for counting colorings.
Theorem 3.
For q≥3 and Δ≥100q10, with high probability (tending to 1 as n→∞) for a graph chosen uniformly at random from Gn,Δbip, there is an FPTAS to count the number of q-colorings.
For q≥3 and Δ≥100q10, there is an FPTAS for ∣C(G)∣ for all G∈Gq,s,α,βΔ.
Proof.
First we state our algorithm.
See Algorithm 2 for a pseudocode description.
Fix q≥3 and Δ≥100q10.
The input is a graph G=(L,R,E)∈Gq,s,α,βΔ and an approximation parameter ε>0.
The output is a number Z to approximate ∣C(G)∣.
We use Ξ1(z) and Ξ2(z) to denote the partition functions of the polymer models (Γ[q]∗(G),w) and (Γ[q]∗(G),w), respectively.
Let N1,C2,N2,C2 be the constants in Lemma 35 and Lemma 40, respectively.
Let Z=(qq)C[q](G) if q is even, otherwise Z=(qq)C[q](G)+C[q](G).
These two lemmas show that Z is a C1−n+C2−n≤2min(C1,C2)−n≤C−n-relative approximation to ∣C(G)∣ for another constant C>1 and all n>N≥max(N1,N2) where N is another sufficiently large constant.
If n≤N or ε≤2C−n, we use the brute-force algorithm to compute ∣C(G)∣.
If ε>2C−n, we apply the FPTAS in Lemma 44 with approximation parameter ε′=ε−C−n to obtain Z1, an ε′-relative approximation to Ξ1(1). If q is even, then Z=(qq)q2nZ1 is the output of the algorithm. Otherwise, we apply again the FPTAS in Lemma 44 with approximation parameter ε′=ε−C−n to obtain Z2, an ε′-relative approximation to Ξ2(1). And the output is Z=(qq)(qq)n(Z1+Z2). It is clear that exp(−ε)Z≤∣C(G)∣≤exp(ε)Z.
Then we show that Algorithm 2 is indeed an FPTAS.
It is required that the running time of our algorithm is bounded by (n/ε)C3 for some constant C3 and for all n>N3 where N3 is a constant.
Let N3=N.
If ε≤2C−n, the running time of the algorithm would be qn≤(nCn/q)C3≤(n/ε)C3 for sufficient large C3.
If ε>2C−n, the running time of the algorithm would be (n/ε′)C4=(n/(ε−C−n))C4≤(2n/ε)C4≤(n/ε)C3 for sufficient large C3, where C4 is a constant from the FPTAS in Lemma 44.
∎
Bibliography44
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[Bar 16] Alexander I. Barvinok. Combinatorics and Complexity of Partition Functions , volume 30 of Algorithms and combinatorics . Springer, 2016.
2[Bas 81] Leonid A. Bassalygo. Asymptotically optimal switching circuits. Problems of Information Transmission , 17(3):206–211, 1981.
3[BCKL 13] Christian Borgs, Jennifer T. Chayes, Jeff Kahn, and László Lovász. Left and right convergence of graphs with bounded degree. Random Struct. Algorithms , 42(1):1–28, 2013.
4[BD 97] Russ Bubley and Martin Dyer. Path coupling: A technique for proving rapid mixing in Markov chains. In Proceedings of the 38th Annual IEEE Symposium on Foundations of Computer Science (FOCS’97) , pages 223–231. IEEE, 1997.
5[BDG + 13] Andrei A. Bulatov, Martin E. Dyer, Leslie Ann Goldberg, Mark Jerrum, and Colin Mc Quillan. The expressibility of functions on the Boolean domain, with applications to counting CS Ps. J. ACM , 60(5):32:1–32:36, 2013.
6[BDGJ 99] Russ Bubley, Martin Dyer, Catherine Greenhill, and Mark Jerrum. On approximately counting colorings of small degree graphs. SIAM Journal on Computing , 29(2):387–400, 1999.
7[BS 16] Alexander I. Barvinok and Pablo Soberón. Computing the partition function for graph homomorphisms with multiplicities. J. Comb. Theory, Ser. A , 137:1–26, 2016.
8[CDM + 19] Sitan Chen, Michelle Delcourt, Ankur Moitra, Guillem Perarnau, and Luke Postle. Improved bounds for randomly sampling colorings via linear programming. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2019, San Diego, California, USA, January 6-9, 2019 , pages 2216–2234, 2019.