Lower bounds on the chromatic number of random graphs
Peter Ayre, Amin Coja-Oghlan, Catherine Greenhill

TL;DR
This paper establishes a rigorous lower bound on the chromatic number of sparse random graphs using physics-inspired methods, improving understanding of graph coloring thresholds.
Contribution
It provides the first rigorous proof of a physics-predicted lower bound on the chromatic number of sparse random graphs, using the interpolation method.
Findings
Lower bounds match physics predictions for certain graph models
Explicit bounds for small average degrees are derived
Simplified derivation of asymptotic formulas for large degrees
Abstract
We prove that a formula predicted on the basis of non-rigorous physics arguments [Zdeborova and Krzakala: Phys. Rev. E (2007)] provides a lower bound on the chromatic number of sparse random graphs. The proof is based on the interpolation method from mathematical physics. In the case of random regular graphs the lower bound can be expressed algebraically, while in the case of the binomial random we obtain a variational formula. As an application we calculate improved explicit lower bounds on the chromatic number of random graphs for small (average) degrees. Additionally, show how asymptotic formulas for large degrees that were previously obtained by lengthy and complicated combinatorial arguments can be re-derived easily from these new results.
| 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| - | 6 | 11 | 16 | 21 | 27 | 33 | 39 | 46 | 52 | 59 | 66 | 73 | 81 | 88 | 96 | 104 | 111 | |
| 6 | 10 | 15 | 20 | 25 | 31 | 37 | 44 | 50 | 57 | 64 | 71 | 78 | 86 | 93 | 101 | 109 | 117 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Lower bounds on the chromatic number of random graphs
Peter Ayre, Amin Coja-Oghlan, Catherine Greenhill
Peter Ayre, [email protected], School of Mathematics and Statistics, UNSW Sydney, NSW 2052, Australia.
Amin Coja-Oghlan, [email protected], TU Dortmund, Faculty for Computer Science, 12 Otto Hahn St, Dortmund 44227, Germany.
Catherine Greenhill, [email protected], School of Mathematics and Statistics, UNSW Sydney, NSW 2052, Australia.
Abstract.
We prove that a formula predicted on the basis of non-rigorous physics arguments [Zdeborová and Krzakala: Phys. Rev. E (2007)] provides a lower bound on the chromatic number of sparse random graphs. The proof is based on the interpolation method from mathematical physics. In the case of random regular graphs the lower bound can be expressed algebraically, while in the case of the binomial random we obtain a variational formula. As an application we calculate improved explicit lower bounds on the chromatic number of random graphs for small (average) degrees. Additionally, we show how asymptotic formulas for large degrees that were previously obtained by lengthy and complicated combinatorial arguments can be re-derived easily from these new results. MSC: 05C80
Research of the first author supported in part by DFG CO 646. Research of the third author supported by the Australian Research Council Discovery Project DP190100977.
1. Introduction
1.1. Motivation and background
A most fascinating feature of combinatorics is how easy-to-state problems sometimes lead to deep and difficult mathematical challenges. The random graph colouring problem is a case in point. First mentioned in the seminal paper of Erdős and Rényi that started the theory of random graphs [23], the problem of finding the chromatic number of the binomial random graph with a fixed average degree remains open to this day. It is, in fact, the single open problem posed in that seminal paper that still awaits a complete solution. Nor is the chromatic number of the random -regular graph, a conceptually simpler object, known for all values of . Nevertheless, the quest for the chromatic number has contributed tremendously to the development of new techniques, some of which now count among the standard tools of probabilistic combinatorics [39].
A series of important papers contributed ever tighter bounds on the chromatic number of random graphs. Straightforward first moment calculations show that for any and for either the binomial111Sometimes is referred to as the Erdős-Rényi model. This is a slight misnamer as Erdős and Rényi [23] actually worked with a uniformly random graph with a given number of edges. random graph or the random regular graph ,
[TABLE]
To be precise, (1.1) is obtained by computing the expected number of -colourings222Throughout the paper the term -colouring or just colouring refers to proper vertex colourings. That is, colours are assigned to the vertices of a graph such that no two adjacent vertices receive the same colour. , which tends to zero as if . A celebrated contribution of Achlioptas and Naor [4] shows that for ,
[TABLE]
The proof hinges on the computation of the second moment of the number of -colourings, which involves a delicate analytical optimisation task. Following up on work of Achlioptas and Moore [3], Kemkes, Pérez-Giménez and Wormald [28] showed that (1.2) holds for the random regular graph as well. Expanding (1.1)–(1.2) asymptotically for large , we find if , while if , with vanishing as . A series of papers [9, 10, 16] improved these asymptotic bounds to
[TABLE]
for both the binomial and the random regular graph. But in the absence of explicit estimates of the error term (1.3) fails to render improved bounds for any specific value of . Finally, several articles have been dedicated to the special case . For the binomial random graph the best bounds read [2, 22]
[TABLE]
For the random regular graph Diaz, Kaporis, Kemkes, Kirousis, Pérez and Wormald [18] showed that w.h.p. if a certain optimisation problem attains its maximum at a specific point, for which they provided numerical evidence. Moreover, Shi and Wormald [40, 41] proved analytically that , while (1.1) implies that . The proofs of all of the above lower bounds rely upon the first moment method, in some cases applied to cleverly designed random variables [9, 10, 22]. Similarly, all of the upper bounds derive from second moment arguments, with the exception of the upper bound from (1.4) and [40, 41], which are algorithmic.
Additionally, physicists brought to bear a canny but non-rigorous technique called the ‘1RSB cavity method’ on the random graph colouring problem [44]. In the case of the random regular graph, the physics calculations predict an elegant formula. Let
[TABLE]
and let be the solution to the algebraic equation
[TABLE]
that minimises . (If there is more than one such value , choose one arbitrarily.) Then [44] predicts that
[TABLE]
There is a similarly precise, albeit more complicated, prediction as to the chromatic number of the binomial random graph; see Section 1.3 below.
The aim of this paper is to rigorously establish the lower bounds on the chromatic number predicted by the cavity method. In contrast to prior lower bound arguments, we do not rely on the first moment method. Instead, we adapt a technique from the mathematical physics of spin glasses known as the ‘interpolation method’ [24, 27, 38] to the graph colouring problem. In a combinatorial context, the interpolation method has previously been applied to establish a tight lower bound on the random -SAT threshold [21], to the independence number of random graphs [30] and other optimisation problems on random (hyper-)graphs [15, 37, 42] as well as to estimate the rank of random matrices over finite fields [11]. So it may not be surprising that the method can be made to work for graph colouring. However, the interpolation method remains relatively unknown in combinatorics, where, we believe, it may potentially improve over first moment bounds in many more applications as well. We therefore endeavour to explain the method at leisure in combinatorial terms to facilitate future applications of the interpolation method.
We proceed to state the results for the random regular graph precisely, followed by the lower bound on the chromatic number of the binomial random graph. Section 1.4 contains references to related work. An outline of the proof strategy follows in Section 2.
1.2. The random regular graph
Given and with from (1.5) define
[TABLE]
Then we have the following lower bound on the chromatic number of the random regular graph.
Theorem 1.1**.**
If and then w.h.p.
The function is differentiable and . Furthermore, the calculations performed towards [44, eq. (35)] show that . Hence, whenever the minimum value is negative, the minimiser must be a zero of . It follows, after some algebra, that the minimiser is a solution to (1.6). Thus, Theorem 1.1 verifies the lower bound from (1.7), which [44] conjectures to be tight for all .
Of course, we can evaluate numerically and calculate for any given . The first few values are displayed in Table 1. For those where is displayed in boldface, the new bound strictly improves over the first moment bound (1.1); in the other cases the bounds coincide. In addition, Table 1 shows the value up to which (1.2) implies that w.h.p.(here “smm” is short for “second moment method”).
The asymptotic lower bound (1.3) on , which was derived in [10] via an extremely laborious first moment argument, follows from Theorem 1.1 at the expense of just a brief calculation. (See Section 5.) That said, in the limit of large we do not improve over (1.3), which is conjectured to be optimal up to the precise value of the error term.
Corollary 1.2**.**
If then w.h.p.
1.3. The binomial random graph
Locally the random regular graph is as ‘deterministic’ as it gets: for all but a bounded number of exceptional vertices, any bounded-depth neighbourhood is just a -regular tree w.h.p. By contrast, in the binomial random graph the neighbourhoods are random, distributed as the trees generated by a Galton-Watson process with offspring distribution . The value of the chromatic number predicted by the cavity method mirrors this local non-uniformity. Indeed, while in the case of random regular graphs we obtained the scalar optimisation problem (1.8), in the binomial case we face an optimisation problem over a probability measure on the unit interval. To be precise, let be a probability distribution on . Moreover, let be a family of independent random variables with distribution . Additionally, let be independent of the . Then we define
[TABLE]
Setting
[TABLE]
we obtain the following lower bound on the chromatic number.
Theorem 1.3**.**
If and then w.h.p.
Zdeborová and Krzakala predict that this bound is tight for all [44].
Due to the optimisation over distributions , the value may be hard to evaluate. The physics literature relies upon a numerical heuristic called population dynamics [34] to tackle such optimisation problems, but of course there is no general guarantee that the true optimiser will be found. Yet fortunately Theorem 1.3 shows that any distribution yields an upper bound on , and thus a lower bound on the chromatic number. In particular, we could try atoms with . For instance, we find that , whence ; see Figure 1. Even this quick bound significantly improves over the best prior bound (1.4) from [22], based on a tricky first moment calculation, and comes within a whisker of the value obtained via population dynamics [44]. In principle, the bound could be sharpened by optimising over distributions with a (small) finite support, but such a calculation seems to require computer assistance.
Similarly, substituting a suitable atom into (1.10) suffices to rederive the large- asymptotic lower bound on the chromatic number from (1.3), originally established in [9] via a complicated first moment argument. As in the regular case we do not improve over (1.3) asymptotically for large ; again, (1.3) is conjectured to be optimal up to the precise value of the term hidden in the .
Corollary 1.4**.**
If then w.h.p.
1.4. Related work
The history of the random graph colouring problem is long and distinguished. Improving a prior result of Matula [32], Bollobás [8] determined the chromatic number of the dense binomial random for fixed , up to a multiplicative error of . Kučera and Matula obtained the same result via a different proof [33]. Łuczak [31] extended the approach from [32, 33] to sparse random graphs. His main result shows that w.h.p. for ,
[TABLE]
Particularly for small edge probabilities the bound (1.11) is not quite satisfactory, as a result of Alon and Krivelevich [5] shows that the chromatic number of is concentrated on two consecutive integers if . Seizing upon techniques from [4, 5], Coja-Oghlan, Panagiotou and Steger [14] determined a set of three consecutive integers on which the chromatic number concentrates for . Furthermore, the aforementioned result of Achlioptas and Naor [4] determines the two integers on which concentrates, when is fixed. Yet in this case it is widely conjectured that there exists a sharp threshold for -colourability, i.e., that for each there exists such that if while if . Clearly, if such a exists then the chromatic number would actually concentrate on a single integer for almost all . Towards the sharp threshold conjecture, Achlioptas and Friedgut [1] established the existence of a non-uniform threshold sequence for every . Physics predictions [44] assert that the -colourability threshold coincides with from (1.10) for all .
Concerning the random regular graph , Frieze and Łuczak [25] obtained an asymptotic bound akin to (1.11) for , which Cooper, Frieze, Reed and Riordan [17] extended to . Further, Krivelevich, Sudakov, Vu and Wormald [29] obtained an asymptotic formula akin to Bollobás’ result [8] for degrees . The best prior bounds on with fixed were stated in Section 1.1.
The physicists’ cavity method has inspired a great deal of rigorous work. Perhaps the most prominent example is the proof of the -SAT threshold conjecture for large by Ding, Sly and Sun [21]. The proof of the lower bound on the -SAT threshold is based on an impressive second moment argument, while the proof of the upper bound relies on the interpolation method. The way we use the interpolation method here is reminiscent of its application in [21]. Further problems in which the 1RSB cavity method has been vindicated include the independent set problem on random regular graphs [20], the regular -NAESAT problem [19] and the regular -SAT problem [13].
As for the history of the interpolation method itself, Guerra [27] invented the technique in order to study the free energy of the Sherrington-Kirkpatrick spin glass model. The interpolation method went on to become a mainstay of the mathematical physics of spin glasses (see, e.g., [36]). Franz and Leone [24] pioneered the use of the interpolation method for combinatorial problems. The approach was further elaborated and generalised by Panchenko and Talagrand [38], and their version of the interpolation method was applied to the -SAT problem in [21]. We will use (and adapt) the Panchenko–Talagrand version as well. Moreover, an important contribution of Bayati, Gamarnik and Tetali [7] applied a different variant of the interpolation method to prove, e.g., the existence of the limit of the normalised independence number of the random graph or . This version of the interpolation method does not provide estimates of the value of such limits. Sly, Sun and Zhang [42] combined the combinatorial interpolation scheme from [7] with the interpolation arguments from [24, 38] to derive bounds on the partition functions of random regular (and uniform) hypergraphs. For instance, [42, Theorem E.3] shows that the formula provided by the 1RSB cavity method yields an upper bound on the partition function for a variety of models. These models include the Potts antiferromagnet on the random regular graph, which plays a prominent role in the present paper as well. In particular, for the random regular graph Corollary 2.11 below, an important intermediate step towards the proof of Theorem 1.1, is a special case of [42, Theorem E.3]. Furthermore, building upon [37], Coja-Oghlan and Perkins [15] recently used the interpolation method to derive precise variational formulas for the partition functions of random regular (hyper-)graph models. The models studied in that paper include the Potts antiferromagnet as well, and the random regular graph version of Corollary 2.11 could be derived from [15, Theorem 7.6] with little effort. But since the expositions of the 1RSB interpolation method for random regular graphs in [15, 42] and for binomial random graphs in [38] are rather brisk, and since, strictly speaking, [38] does not cover the Potts model, we present the interpolation method from scratch, with a view to facilitating future uses of the method in combinatorics.
1.5. Preliminaries and notation
In order to avoid replications and case distinctions, throughout the paper we use the shorthand to denote either the random regular graph or the binomial random graph . Most of the statements and arguments in the following sections are generic and apply to either model. There are just a few steps where we will need to treat the two models separately. If is the binomial random graph then we let be a Poisson variable, while in the case of the random regular graph we let deterministically. In either case we let be independent copies of .
As per common practice, we use the -notation to refer to the limit . In our calculations we tacitly assume that is sufficiently large for the various estimates to be valid. In addition, in Section 5 we use -notation to refer to the limit of large as in Corollaries 1.2 and 1.4.
For a finite set we denote by the set of probability distributions on . We identify with the standard simplex in . Accordingly, inherits its topology from . Further, we write for the space of probability measures on . We endow with the weak topology, thus obtaining a Polish space. Additionally, denotes the space of probability measures on .
For a probability measure on a discrete probability space we denote by independent samples drawn from . Where the reference to is apparent we omit from the superscripts and just write , , etc. For a function we denote the expectation of by . Thus,
[TABLE]
Finally, we need the following version of a Markov random field. A factor graph
[TABLE]
consists of
- •
a finite set of variable nodes,
- •
a finite set of constraint nodes,
- •
a finite or countable range for each ,
- •
a subset for each ,
- •
a weight function for each .
A factor graph can be represented by a bipartite graph with vertex sets and where the neighbourhood of is just . We further define the function by
[TABLE]
for all , where denotes the restriction of to . Finally, the partition function of is defined by
[TABLE]
If then gives rise to a probability distribution
[TABLE]
that is called the Boltzmann distribution of .
2. Outline
We proceed to survey the proofs of the main results, deferring most technical details to the following sections.
2.1. The Potts antiferromagnet
The goal is to derive a lower bound on the chromatic number of the random graph or . We tackle this problem indirectly by way of a weighted version of the -colourability problem. To be precise, the -spin Potts antiferromagnet at inverse temperature on a multigraph is the probability distribution on defined by
[TABLE]
Here it is understood that each edge of contributes to the products in (2.1) and (2.2) according to its multiplicity. The strictly positive quantity , known as the partition function, ensures that is a probability measure. Moreover, we observe that the probability mass is governed by the number of edges that renders monochromatic. Indeed, the product in (2.1) imposes an ‘penalty factor’ for every monochromatic edge. Thus, larger values of deliver higher penalties to monochromatic edges. In particular, if is a -colouring of then the product evaluates to one. Therefore, the partition function is lower-bounded by the total number of -colourings of and equals the number of -colourings. Hence, if there exists such that .
Thus, our approach is to show that there exists such that if exceeds the thresholds stated in Theorems 1.1 and 1.3 then w.h.p. . To facilitate the analysis of we will work with slightly modified and (for our purposes) more amenable random graph models. Specifically, fixing , we let
[TABLE]
be a Poisson variable conditioned on not exceeding . Define as the random multigraph on the vertex set obtained by inserting independent random edges chosen uniformly out of all possible edges. Similarly, let be the random multigraph obtained from the following version of the configuration model: choose a matching of size of the complete graph on uniformly at random. Then obtain by inserting one -edge for every matching edge . In order to avoid case distinctions, we use the symbol to denote either or .
Working with the Potts antiferromagnet rather than directly with the graph colouring problem offers two advantages. First, the partition function is always positive and enjoys a Lipschitz property with respect to edge additions/deletions. Indeed, adding or deleting a single edge can change by an additive term of at most in absolute value. (See Section 3.1 below.) Second, as a consequence of this Lipschitz property it is easy to prove that is tightly concentrated about its expectation. Although similar statements already appear in the literature (e.g., [6, 15]), we include the proof for completeness.
Proposition 2.1**.**
For any there is such that for sufficiently large we have
[TABLE]
Proposition 2.1 implies that the partition functions of and do not differ too much.
Corollary 2.2**.**
For any we have
Finally, thanks to the following corollary it suffices to bound to show that fails to be -chromatic.
Corollary 2.3**.**
If there is such that then w.h.p.
Proof.
If then for all . Hence,
[TABLE]
Now, assume that for some . Then Proposition 2.1 implies that w.h.p., and thus . Thus, (2.5) shows that w.h.p.∎
The proofs of Proposition 2.1 and Corollary 2.2 can be found in Section 3. At the end of Section 2 we show how these results are used to prove our main theorems.
2.2. The interpolation scheme
The study of the partition function is closely intertwined with the study of the probability distribution from (2.1). What turns the latter task into a challenge is the possible presence of extensive stochastic dependencies amongst the colours that , drawn from , assigns to the different vertices. While there are short range dependencies between the colour of a vertex and the colours of vertices in its proximity, the expansion properties of are apt to cause long-range dependencies as well.
To cope with this issue, we are going to compare with another random graph model in which the dependencies between the vertices are more manageable. Specifically, we will upper-bound in terms of . To this end we will construct an interpolating family of random graphs such that essentially coincides with the random graph from Section 2.1. To compare and we will show that is non-negative. This general proof strategy is known as the interpolation method. The specific interpolation scheme that we use is an adaptation of the construction that Panchenko and Talagrand [38] used to study binary problems on binomial random hypergraphs (e.g., random -SAT formulas). In the case of random regular graphs, the present construction can actually be viewed as a special case of the interpolation scheme from [15]. But since we need to perform the analysis for the binomial random graph anyway, a unified treatment of both models incurs little overhead.
The elements of the interpolation scheme will not be plain random graphs but random factor graphs. To construct the interpolating family, fix a probability measure as well as parameters and a probability distribution on . Let be mutually independent random variables with distribution ; thus, . Next, given let
[TABLE]
be a set of mutually independent random variables such that the have distribution , the have distribution , the have distribution , the have distribution and the have distribution . Thus, all random variables in are mutually independent given . Additionally, let
[TABLE]
be mutually independent and independent of everything else. Define the event
[TABLE]
and write for given .
Remark 2.4**.**
Although the above description of the random variables is complete and correct, now seems to be a propitious moment to dwell on the measure-theoretic basis of the construction. It can be implemented on a standard Borel space. To this end we identify the space with the standard simplex in . Thus, inherits the Euclidean topology and the corresponding Borel algebra. Let
[TABLE]
be a measurable function and let
[TABLE]
be mutually independent random variables that are uniformly distributed on the unit interval , all defined on a common standard Borel space. Then induces a distribution as for a given we naturally obtain a distribution , namely the distribution of the -valued random variable . Consequently, the distribution of the -valued random variable belongs to the space . Indeed, since is a complete separable metric space, any distribution can be represented by a map in this manner. Now, the above can be identified with the -valued random variables , and similarly for . Moreover, the can be identified with
All the factor graphs have variable nodes
[TABLE]
with ranging over (that is, ), and ranging over . The constraint nodes are
[TABLE]
How constraint and variable nodes are connected depends on whether is the binomial or the regular random graph.
Definition 2.5** (binomial case).**
The connections between the constraint and variable are as follows.
- •
Each , , is adjacent to a random pair of two distinct variable nodes from ; these pairs are drawn uniformly and independently of everything else.
- •
Each , , is adjacent to and one random variable node from drawn uniformly and independently of everything else.
- •
The constraint nodes are adjacent to the variable node only.
The construction in the random regular case resembles the ‘configuration model’.
Definition 2.6** (regular case).**
Let be a uniformly random maximal matching of the complete bipartite graph with vertex classes
[TABLE]
this matching covers the left vertex set completely because .
- •
Each constraint node is adjacent to the variable nodes for which contains edges between and and and .
- •
Each is adjacent to and to the variable node for which contains an edge between and .
- •
The constraints are adjacent to only.
Finally, we need to define the weight functions of the constraint nodes: let
[TABLE]
Thus, simply weighs the value according to the given probability distribution . Moreover, the constraint nodes simulate the effect of the edges of the original graph as in the definition (2.1) of the Potts model. Indeed, if the variable nodes adjacent to are coloured the same then the weight is ; otherwise it is one. Moreover, weighs the colour of the adjacent variable node from according to . Further, is determined by the probability that two colours chosen independently from coincide. The total weight , partition function and the Boltzmann distribution are defined by the general formulas (1.13)–(1.15). In the physics literature the are called external fields [34]. A similar construction involving an extra -valued variable node was used in [42].
At ‘time’ (2.7) ensures that . Thus, the only constraints present are the . Each of them is connected to the variable node and to one other variable node. Hence, the factor graph is star-shaped with constraint node at the centre. In effect, the variable nodes are dependent only through .
By contrast, at (2.7) yields . Thus, the factor graph contains only constraints of type and of type . In effect, decomposes into two parts. The connected component of contains all the constraint nodes , none of which is connected with . Thus, once more there is a star structure with at the centre, and it is not too difficult to write out the partition function of this component. Furthermore, the factor graph induced on and is essentially identical to the original graph . More specifically, the Boltzmann distribution mimics that of the Potts antiferromagnet from (2.1). The only, for our purposes negligible, difference is that typically has slightly fewer than constraint nodes of the type . Thus, we can relate the partition functions and ; see Figure 2 for an illustration.
We observe that the distribution of the degrees of remains essentially the same for . Specifically, in the regular case most variables have degree exactly throughout the interpolation, and in the binomial case the degrees are approximately distributed. Additionally, the total number of constraints remains (essentially) constant throughout the interpolation as well. Indeed, at there are about constraints of type and about the same number of constraints , while at we have about constraints of type .
As mentioned above, the idea behind the construction is to compare with the partition function of a simpler model where correlations amongst are amenable to a precise analysis. The following two propositions spell out this relationship precisely.
Proposition 2.7**.**
Let
[TABLE]
Then for any there exists such that for all and for all we have .
Furthermore, the following proposition shows that dominates . The proof is based on estimating the derivative .
Proposition 2.8**.**
We have
Finally, we introduce a convenient proxy for the partition function of : let
[TABLE]
Corollary 2.9**.**
For any we have .
The proofs of Propositions 2.7 and 2.8 and Corollary 2.9 can be found in Section 4. We are thus left to study , which are approximations to the partition function of the factor graph and the partition function of the -component of , respectively.
Let us wrap up by dwelling on the intended combinatorial semantics of construction. The nodes and clearly mimic the orignial Potts antiferromagnet. But as we move along we replace more and more by external fields . These are meant to capture the physics intuition as to the nature of the interactions between variable nodes in the random graph ; Corollary 2.9 corroborates the physics picture to the extent that it yields an upper bound on the partition function. Specifically, the impact that an actual edge of has on an incident vertex is thought to be governed by the local graph structure around the other vertex in the graph obtained by removing [34]. Since short cycles are scarce, the local graph structure will likely be a tree. Indeed, it will just be a -ary tree in the random regular graph, and a Galton-Watson tree in the binomial case. In the binomial case, the specific tree structure is apt to impact the influence that exerts on . For example, if the Galton-Watson tree dies out quickly, then it will be easy to colour the entire tree properly regardless of the colour of . Thus, the edge will be of little consequence. By contrast, in the event of a relatively dense tree, choosing a specific colour for might have repercussions on a large number of other vertices. The random variables are meant to capture the randomness of the tree structure pending on vertex . But for the sake of simplicity, we do not incorporate an actual distribution on trees into our construction. Instead, we make do with the distribution that is meant to just capture the ensuing impact that has on .
Furthermore, the variable node is intended to represent the conjectured structure of the Boltzmann distribution . To elaborate: according to physics intuition, the distribution partitions the phase space into an unbounded number of ‘clusters’ for close to the -colourability threshold and large [35, 44]. Inside a cluster, i.e., under the conditional distribution , most vertices are strongly polarised towards a particular colour. In other words, the conditional marginals for are typically either fairly close to zero or to one, while of course overall the marginal of the colour of each vertex is just uniform. The variable node is intended to represent the choice of the cluster . Thus, the distribution , which mimics the local graph structure, determines how the marginal of is distributed given a cluster index, and then the sample represents the actual realisation of the distribution of the colour inside cluster number . Finally, models the distribution of the relative cluster volumes .
2.3. Poisson-Dirichlet weights
While the expression from Corollary 2.9 already bears a certain resemblance to (1.9), an important difference remains. Namely, the expressions inside the logarithm still contain , the number of vertices. If the probability distribution is an atom, that is, for some , then we can produce the same joint distribution on by deleting and from the factor graphs and and replacing by in the expressions for and . This causes and to factorise:
[TABLE]
In particular, long-range correlations are completely absent in the target of the interpolation. (The modified with and deleted consists of connected components, each containing exactly one .) In physics jargon the bound on that can be obtained from (2.10) is called the replica symmetric bound. While the replica symmetric bound easily implies the first moment bound (1.1), it does not appear sufficient to prove Theorems 1.1 and 1.3 for any .
Fortunately there is another choice of the distribution that leads to a simple formula. Recall that the Poisson-Dirichlet distribution with parameter is defined as follows. Let be the countable point set generated by a Poisson point process on with density , independent of all other sources of randomness that have been introduced thus far. Further, let be the sequence that comprises the points from in decreasing order, i.e., for all . Since , we have almost surely. Therefore,
[TABLE]
defines a probability measure on , the Poisson-Dirichlet law. To be precise, is a random probability measure which depends on . This distribution is used in the following lemma, which enables us to simplify .
Lemma 2.10** ([38, Proposition 1] and [43, Proposition 6.5.15]).**
Suppose that and that are positive identically distributed random variables with bounded second moments, mutually independent and independent of . Then
[TABLE]
In the physics literature, the Poisson-Dirichlet distribution has been postulated as the correct distribution of the relative cluster sizes [34, 35]. Moreover, Panchenko and Talagrand [38] used Lemma 2.10 to bound the partition function of the random -SAT model. We apply Lemma 2.10 in a similar manner to upper bound . Specifically, let be the -algebra generated by . Thanks to Lemma 2.10, we can simplify the bound from Corollary 2.9 as follows.
Corollary 2.11**.**
For any and we have
[TABLE]
Proof.
Choose small enough and assume that is sufficiently large. Moreover, for all let
[TABLE]
Applying Corollary 2.9 to the random distribution , we obtain
[TABLE]
Hence, Lemma 2.10 yields
[TABLE]
clearly, since do not depend on , the outer in (2.13) is on only. Further, because the are mutually independent given , we obtain
[TABLE]
The assertion follows from (2.13)–(2.15). ∎
2.4. The zero temperature limit
To actually deduce a bound on the chromatic number from Proposition 2.11 we need to fix the three remaining parameters . Since the Potts model approaches the graph colouring problem in the limit of large , it seems natural to take the limit . In physics jargon, we take the ‘temperature’ to zero. Moreover, physics intuition suggests sending the ‘Parisi parameter’ to zero as well. Ding et al. [21] took similar limits to derive the upper bound on the -SAT threshold from the formula for the -SAT partition function from [38].
With respect to , we make two different choices, depending on whether is regular or binomial. Let us begin with the regular case. For , let be the atom on colour . Moreover, let be the uniform distribution on the colours. Then for a given we define
[TABLE]
Geometrically, we can think of as a discrete distribution on the standard simplex that places mass on the centre and distributes the remaining mass equally amongst the vertices of the simplex. Let
[TABLE]
be the atom on . Further, the expression (1.10) for the binomial random graph involves a probability distribution on . Given any , we define
[TABLE]
Observe that the integrand is the distribution from (2.17), and thus . Plugging or into Proposition 2.11, we finally obtain the expressions from (1.8) and (1.10).
Proposition 2.12**.**
If is the random regular graph then
[TABLE]
Moreover, if is the binomial model then
[TABLE]
The proof of Proposition 2.12 can be found in Section 4.4.
Now we have all the pieces in place to complete the proofs of the main theorems.
Proof of Theorem 1.1.
Fix and assume that for some . (This holds when , for example.) Then Proposition 2.12 yields and such that . In particular we can take to be the value which minimises . Consequently, Corollary 2.11 implies that . Therefore, Corollary 2.3 implies that
[TABLE]
We are left to prove that also fails to be -chromatic w.h.p. for all . To see this, we observe that the property of being -colourable is decreasing; that is, if a graph is -colourable then so is every subgraph of . Now, [26, Theorem 9.36] shows that if and if a decreasing property is satisfied for w.h.p. then enjoys w.h.p., too. Thus, (2.19) implies that for all . ∎
Proof of Theorem 1.3.
Once more we fix and suppose that for some . (This holds when , for example.) Then by Proposition 2.12 there exist and such that and thus by Corollary 2.11. Hence, Corollary 2.3 yields
[TABLE]
Finally, due to monotonicity, (2.20) implies that w.h.p. for all . ∎
Given Theorems 1.1 and 1.3 the asymptotic formulas detailed in Corollary 1.2 and Corollary 1.4 follow from routine calculations, which we defer to Section 5.
3. Concentration
After proving Proposition 2.1 in Section 3.1, we prove Corollary 2.2 in Section 3.2.
3.1. Proof of Proposition 2.1
The proof is based on Azuma’s inequality and the Lipschitz property of the random variable . Indeed, (2.2) shows that if a multigraph is obtained from by adding one single edge then , and hence
[TABLE]
We pick a small enough and a smaller . We treat the binomial random graph and the random regular graph separately, tacitly assuming in either case that is sufficiently large.
3.1.1. The binomial random graph
Writing for the number of edges of and invoking the Chernoff bound, we obtain
[TABLE]
Further, let be the random multigraph on vertices comprising edges chosen uniformly and independently out of all possible edges. Let be the event that is simple. It is well known that
[TABLE]
Moreover, providing is chosen small enough, Azuma’s inequality and (3.1) imply that
[TABLE]
The estimates (3.3)–(3.4) imply that for all ,
[TABLE]
Since
[TABLE]
the bound (3.5) shows that for all ,
[TABLE]
Further, because and are identically distributed, (3.5) and (3.7) show that
[TABLE]
Moreover, combining (3.2), (3.6) and (3.8), we obtain (2.4).
To prove the second assertion, we recall that . We thus obtain the tail bound
[TABLE]
for sufficiently small . Since and are identically distributed, (3.4) yields
[TABLE]
Finally, providing that is chosen small enough, (3.1) and (3.9) imply that
[TABLE]
Therefore, the second part of (2.4) follows from (3.9) and (3.10).
3.1.2. The random regular graph
We recall that the random regular graph can be constructed via the configuration model by drawing a perfect matching of the complete graph on the vertex set uniformly at random. To be precise, the sequence is constructed by successively drawing a uniformly random edge that connects two distinct vertices of the complete graph on that are not incident with . Let be the random multigraph on obtained by inserting for each matching edge an edge between and and let denote the event that is simple. It is well known that
[TABLE]
see, e.g., [26, Corollary 9.7]. Moreover, is distributed as given .
To prove the first inequality we consider the filtration with generated by . Then the sequence is a Doob martingale. Moreover, (3.1) implies that
[TABLE]
Therefore, Azuma’s inequality yields
[TABLE]
The first assertion thus follows from (3.11) and (3.13). Further, we can think of as the multigraph obtained by inserting the edges induced by only. Hence, arguing as for (3.13) but stopping after steps gives
[TABLE]
Finally, the second assertion follows from (3.1), (3.9) and (3.14).
3.2. Proof of Corollary 2.2
Given we choose small enough , , and assume that is sufficiently large. Once more we treat the binomial and the regular models separately.
3.2.1. The binomial random graph
We continue to denote the total number of edges of the binomial graph by and by the random multigraph obtained by including uniformly and independently chosen edges. Due to (3.2) and (3.9), with probability , we can obtain from by adding or removing no more than edges. Hence, provided is small enough, (3.1) ensures that
[TABLE]
Furthermore, with the event that is simple, is distributed as given . Therefore, (3.3) and (3.4) imply that
[TABLE]
Finally, the assertion follows from (3.15) and (3.16).
3.2.2. The random regular graph
As in Section 3.1.2 we denote by the random multigraph with edges drawn from the configuration model. By the principle of deferred decisions we can think of as being obtained from by adding the missing edges. Hence, provided that is sufficiently small, (3.1) implies that
[TABLE]
Furthermore, as is distributed as given the event , (3.11) and (3.13) yield
[TABLE]
The assertion follows from (3.17) and (3.18).
4. Interpolation
In this section we carry out the technical details of the interpolation argument. Section 4.1 contains the proof of Proposition 2.7 while Section 4.2 deals with the proof of Proposition 2.8. Subsequently, Section 4.3 contains the proof of Proposition 2.9 and finally, in Section 4.4 we prove Proposition 2.12.
4.1. Proof of Proposition 2.7
A glimpse at (2.7) reveals that the random factor graph consists of constraint nodes and only. (See also the left side of Figure 2.) The constraints are adjacent to the variables but not to , while are adjacent to but not to . Consequently, the partition function factorises:
[TABLE]
Hence
[TABLE]
and by construction we have
[TABLE]
Additionally, is distributed as given . Hence, since , we can couple and such that
[TABLE]
Since for any we have for all , by (2.7) and (4.3) and applying Poisson tail bounds gives
[TABLE]
Combining (4.1), (4.2) and (4.4), we obtain
[TABLE]
Finally, the assertion follows from (4.5) and Corollary 2.2.
4.2. Proof of Proposition 2.8
We begin by defining a set of variable nodes of , along with a probability distribution on . In the binomial case ( is the binomial random graph) let and let be the uniform distribution on . In the regular case ( is the random regular graph) let be the set of all vertices of degree strictly less than in , and providing that we define, for all ,
[TABLE]
In both the binomial and the regular case we refer to the elements of as cavities. Assuming that , we denote by cavities drawn independently from . Note that .
The proof of Proposition 2.8 relies on coupling arguments. Specifically, we will couple with three random factor graphs obtained by adding one more constraint of each of the three types of constraints:
- •
assuming that , we obtain from by adding one more constraint as per Definition 2.5 or 2.6, respectively; if then we let .
- •
assuming that , we obtain from by adding one more constraint in accordance with Definition 2.5 or 2.6, respectively; if then we let .
- •
finally, obtain from by adding one more constraint .
The following lemma expresses the derivative of in terms of these three enhanced factor graphs. Let us observe for future reference that
[TABLE]
which follows from the fact that has at most constraint nodes, and that the weight functions of the constraint nodes satisfy .
Lemma 4.1**.**
We have
[TABLE]
Proof.
Recalling that are distributed as the independent Poisson variables from (2.7) given the event from (2.8), we see that
[TABLE]
The conditional expectation on the right hand side is independent of . But the means of are governed by . Hence, we need to differentiate . For we obtain
[TABLE]
The product rule yields
[TABLE]
which simplifies to
[TABLE]
Moreover, differentiating gives
[TABLE]
Combining (4.8)–(4.12) and using , we obtain
[TABLE]
By the principle of deferred decisions, if then we can think of given as resulting from given via the insertion of one more constraint . Therefore,
[TABLE]
The definition (2.7) of the Poisson variables ensures that . Hence, (4.6) and (4.14) yield
[TABLE]
Similarly,
[TABLE]
Thus, the assertion follows from (4.13), (4.15), (4.16) and (4.17). ∎
Let be the event that . The choice of the parameters (2.7) ensures that
[TABLE]
We proceed to calculate the three expressions on the r.h.s. of (4.7). Recall the function and the Boltzmann distribution which correspond to , defined as in (1.13) and (1.15) with from (2.11). Also recall the bracket notation from (1.12).
Lemma 4.2**.**
We have
[TABLE]
Proof.
Since (4.6) shows that , (4.18) implies that
[TABLE]
Moreover, conditioned on the event , the factor graph results from via the addition of a single constraint . Denoting by the variable nodes that joins, we obtain
[TABLE]
(Here the sum is over all , recalling that .) In particular, since , we have . Further, conditioned on , the probability that two cavities chosen independently with distribution coincide is . Hence, recalling the construction of the probability distribution on the set of cavities, we notice that the distribution of the pair and the distribution of the pair have total variation distance . Consequently, (4.20) yields
[TABLE]
The assertion follows from (4.19) and (4.21). ∎
Lemma 4.3**.**
We have
[TABLE]
Proof.
Just as in the proof of Lemma 4.2 we have
[TABLE]
Denote by the variable node adjacent to the new constraint of . Then conditioned on we have
[TABLE]
By construction, the variable node is distributed according to , the law of . Hence,
[TABLE]
Combining (4.22) and (4.23) completes the proof. ∎
Lemma 4.4**.**
We have
[TABLE]
Proof.
This follows from similar manipulations as in the proofs of Lemmas 4.2 and 4.3. ∎
Proof of Proposition 2.8.
Let
[TABLE]
Combining Lemmas 4.1–4.4, we see that
[TABLE]
We are going to show that for all ; then the assertion follows from (4.24).
Thus, fix and let denote independent samples from . Since the expectation of the product of independent random variables equals the product of their expectations, we can rewrite as
[TABLE]
To simplify the last expression, we introduce for ,
[TABLE]
Further, recall the family of distributions drawn from from (2.6). Writing for the expectation over only, let
[TABLE]
Since and in (4.25) are mutually independent, we can interchange the order in which expectations are taken and rewrite (4.25) as
[TABLE]
Finally, the assertion follows from (4.24) and (4.26). ∎
4.3. Proof of Corollary 2.9
We begin by estimating the partition function of .
Lemma 4.5**.**
For any there is such that for all large enough we have .
Proof.
Let denote the degrees of the variable nodes in . Each of the constraints is adjacent to only one of the variable nodes from . For each , suppose the constraints are adjacent to and let denote the distribution associated with , for . (In the definition of the interpolation scheme, this distribution is denoted .) Then we can write
[TABLE]
Suppose first that is the binomial random graph and let be independent random variables. The construction of ensures that is distributed as given . Since this event occurs with probability , we conclude that . Therefore, (4.27) yields
[TABLE]
To compare this last expression with from (2.9), let be independent random variables for . Then we can couple the from (2.9) and the from (4.28) by letting . Thus, since each factor in (4.28) lies in the interval , we obtain the estimate
[TABLE]
whence the assertion follows. Second, if is the random regular graph then deterministically. Hence, letting , we obtain (4.29) in this case as well. ∎
Proof of Corollary 2.9.
The corollary follows from Proposition 2.7, Proposition 2.8 and Lemma 4.5 by taking to zero. ∎
4.4. Proof of Proposition 2.12
We will calculate the limits of the two terms appearing in (2.11) separately. To facilitate a unified treatment, let be the given probability distribution in the binomial case and let for in the case of the random regular graph. Also let be independent samples from .
Lemma 4.6**.**
We have
[TABLE]
Proof.
For let and let . Then
[TABLE]
(The lower bound in the first line is trivial, while the upper bound follows since fails for each . The upper bound in the second line is trivial, while the lower bound follows by taking the term corresponding to some colour where holds.) Consequently, we obtain
[TABLE]
pointwise. Furthermore, by inclusion/exclusion,
[TABLE]
Since are mutually independent given , for any set of size we find that
[TABLE]
using (2.16)–(2.18). Hence, (4.31) yields
[TABLE]
Finally, the assertion follows from (4.30) and (4.32). ∎
Lemma 4.7**.**
We have
[TABLE]
Proof.
Let be the event that there exists such that . Then
[TABLE]
(The sum over equals 0 if and are atoms on two different colours, and equals otherwise.) Therefore, we have pointwise convergence
[TABLE]
Since , using (2.16)–(2.18), the assertion follows from (4.33). ∎
Proof of Proposition 2.12.
The proposition follows from Lemmas 4.6 and 4.7 immediately. ∎
5. Asymptotics
We perform asymptotic expansions of , in the limit of large to prove Corollaries 1.2 and 1.4. In this section, the notation suppresses polynomials in , and both and refer to the limit .
5.1. Proof of Corollary 1.2
Write , where
[TABLE]
We will let
[TABLE]
with , and expand and asymptotically in the limit . Substituting for in gives
[TABLE]
Observe that
[TABLE]
and so, for the term we have the expansion
[TABLE]
Moreover, for we have
[TABLE]
Plugging (5.4) and (5.5) into (5.2) gives
[TABLE]
Similarly, substituting for and in gives
[TABLE]
Hence,
[TABLE]
Consequently, if where slowly enough then for large enough . This completes the proof of Corollary 1.2.
5.2. Proof of Corollary 1.4
With as in (5.1) we consider the distribution . With let
[TABLE]
First,
[TABLE]
To see this, we interpret the sum in the middle as an inclusion/exclusion formula. Namely, choose independently such that the probability of drawing [math] equals and the probability of drawing equals . Then the sum equals the probability of the event , which is clearly lower bounded by Poisson tail bounds show that and combining this with (5.7) gives
[TABLE]
Hence, let be distributed as given . Then
[TABLE]
For the term, using (5.1) and (5.3), we have the expansion
[TABLE]
Moreover, for , using the fact that , we obtain
[TABLE]
Plugging (5.10) and (5.11) into (5.8), we obtain
[TABLE]
Since and since conditioning on does not shift the mean of by more than , we obtain . Using this and (5.12) yields
[TABLE]
Combining (5.13) with the expansion (5.1) of , we finally obtain
[TABLE]
Thus, setting with slowly, we see that for large enough . This completes the proof of Corollary 1.4.
Acknowledgment.
We thank Viktor Harangi and an anonymous reviewer for their very careful reading of our manuscript and their extremely accurate and helpful comments, which led to several improvements and corrections.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. Achlioptas, E. Friedgut: A sharp threshold for k 𝑘 k -colorability. Random Struct. Algorithms 14 (1999) 63–70.
- 2[2] D. Achlioptas, C. Moore: Almost all graphs with average degree 4 are 3-colorable. Journal of Computer and System Sciences 67 (2003) 441–471.
- 3[3] D. Achlioptas, C. Moore: The chromatic number of random regular graphs. Proc. 8th RANDOM (2004) 219–228.
- 4[4] D. Achlioptas, A. Naor: The two possible values of the chromatic number of a random graph. Annals of Mathematics 162 (2005) 1333–1349.
- 5[5] N. Alon, M. Krivelevich: The concentration of the chromatic number of random graphs. Combinatorica 17 (1997) 303–313
- 6[6] V. Bapst, A. Coja-Oghlan, S. Hetterich, F. Rassmann, D. Vilenchik: The condensation phase transition in random graph coloring. Communications in Mathematical Physics 341 (2016) 543–606.
- 7[7] M. Bayati, D. Gamarnik, P. Tetali: Combinatorial approach to the interpolation method and scaling limits in sparse random graphs. Annals of Probability 41 (2013) 4080–4115.
- 8[8] B. Bollobás: The chromatic number of random graphs. Combinatorica 8 (1988) 49–55
