Lower bounds on the chromatic number of random graphs

Peter Ayre; Amin Coja-Oghlan; Catherine Greenhill

arXiv:1812.09691·math.CO·October 20, 2021


TL;DR

This paper establishes a rigorous lower bound on the chromatic number of sparse random graphs using physics-inspired methods, improving understanding of graph coloring thresholds.

Contribution

It provides the first rigorous proof of a physics-predicted lower bound on the chromatic number of sparse random graphs, using the interpolation method.

Findings

1. Lower bounds match physics predictions for certain graph models

2. Explicit bounds for small average degrees are derived

3. Simplified derivation of asymptotic formulas for large degrees

Abstract

We prove that a formula predicted on the basis of non-rigorous physics arguments [Zdeborová and Krzakala: Phys. Rev. E (2007)] provides a lower bound on the chromatic number of sparse random graphs. The proof is based on the interpolation method from mathematical physics. In the case of random regular graphs the lower bound can be expressed algebraically, while in the case of the binomial random graph we obtain a variational formula. As an application we calculate improved explicit lower bounds on the chromatic number of random graphs for small (average) degrees. Additionally, we show how asymptotic formulas for large degrees that were previously obtained by lengthy and complicated combinatorial arguments can be re-derived easily from these new results.

Tables

Table 1. Bounds on the chromatic number of the random regular graph $\mathbb{G}(n,d)$ for small $d$.

$q$	3	4	5	6	7	8	9	10	11	12	13	14	15	16	17	18	19	20
$d_{q, smm}$	-	6	11	16	21	27	33	39	46	52	59	66	73	81	88	96	104	111
$d_{q}$	6	10	15	20	25	31	37	44	50	57	64	71	78	86	93	101	109	117

Equations

$\chi(\mathbb{G})$

$\chi(\mathbb{G})\begin{cases}\leq q&\text{if }d\leq(2q-1)\log q-2\log 2+o_{q}(1),\\>q&\text{if }d>(2q-1)\log q-1+o_{q}(1)\end{cases}\quad\text{w.h.p.}$

$\chi(\mathbb{G}(n,d/n))$

$\Sigma_{d,q}(\alpha)$

$\sum_{i=0}^{q-1}(-1)^{i}\left(1-(i+1)(1-\alpha)/q\right)^{d-1}\left(\binom{q-1}{i}-\frac{1-\alpha}{q}\binom{q}{i+1}\right)=0$

$\chi(\mathbb{G}(n,d))\begin{cases}\leq q&\text{if }\Sigma_{d,q}(\alpha^{*})\geq 0,\\>q&\text{if }\Sigma_{d,q}(\alpha^{*})<0\end{cases}\quad\text{w.h.p.}$

$\Sigma_{d,q}$

$\Sigma_{d,q}^{*}(\mathfrak{a})$

$\Sigma_{d,q}^{*}$

$\chi(\mathbb{G}(n,p))$

$\left\langle X,\mu\right\rangle$

$\mathcal{G}=(V,C,(\Omega_{v})_{v\in V},(\partial a)_{a\in C},(\psi_{a})_{a\in C})$

$\sigma\mapsto\prod_{a\in C}\psi_{a}(\sigma_{\partial a})$

$Z(\mathcal{G})$

$\mu_{\mathcal{G}}(\sigma)$

$\mu_{G,\beta}(\sigma)$

$Z_{\beta}(G)$

$\boldsymbol{m}\sim{\rm Po}_{\leq dn/2}((1-\varepsilon)dn/2)$

$\mathbb{P}\left[\left|\log Z_{\beta}(\mathbb{G})-\mathbb{E}[\log Z_{\beta}(\mathbb{G})]\right|>\delta n\right]$

$\limsup_{n\to\infty}\mathbb{P}\left[\chi(\mathbb{G})\leq q\right]>0$

$\{\boldsymbol{\rho}_{h,i},\,\boldsymbol{\rho}_{h,i,j},\,\boldsymbol{\rho}_{h,i}^{\prime},\,\boldsymbol{\rho}_{h,i}^{\prime\prime},\,\hat{\boldsymbol{\rho}}_{h}\,\mid\,i,j,h\geq 1\}$

$\boldsymbol{M}_{t}$

$\mathcal{M}=\{2\boldsymbol{M}_{t}+\boldsymbol{M}_{t}^{\prime}\leq dn,\ \boldsymbol{M}_{t}+\boldsymbol{M}_{t}^{\prime}+\boldsymbol{M}_{t}^{\prime\prime}\leq dn\}$

$R:[0,1]^{2}\to\mathcal{P}([q]),\ (x,s)\mapsto R_{x,s}$

$(x_{i},x_{i,j},x_{i}^{\prime},x_{i}^{\prime\prime},\hat{x},y_{h,i},y_{h,i,j},y_{h,i}^{\prime},y_{h,i}^{\prime\prime},\hat{y}_{h})_{h,i,j\geq 1}$

$s,v_{1},\dots,v_{n},$

$e_{1},\dots,e_{m_{t}},a_{1},\dots,a_{m_{t}^{\prime}},b_{1},\dots,b_{m_{t}^{\prime\prime}},g.$

$\left(\bigcup_{i=1}^{m_{t}}\{e_{i}\}\times\{1,2\}\right)\cup\bigcup_{i=1}^{m_{t}^{\prime}}\{a_{i}\}\quad\text{and}\quad\bigcup_{i=1}^{n}\{v_{i}\}\times[d];$

$\psi_{g}(\sigma_{s})$

$\psi_{e_{i}}(\sigma_{v},\sigma_{w})$

Full text


Peter Ayre, School of Mathematics and Statistics, UNSW Sydney, NSW 2052, Australia.

Amin Coja-Oghlan, TU Dortmund, Faculty for Computer Science, 12 Otto Hahn St, Dortmund 44227, Germany.

Catherine Greenhill, School of Mathematics and Statistics, UNSW Sydney, NSW 2052, Australia.

Abstract.

We prove that a formula predicted on the basis of non-rigorous physics arguments [Zdeborová and Krzakala: Phys. Rev. E (2007)] provides a lower bound on the chromatic number of sparse random graphs. The proof is based on the interpolation method from mathematical physics. In the case of random regular graphs the lower bound can be expressed algebraically, while in the case of the binomial random graph we obtain a variational formula. As an application we calculate improved explicit lower bounds on the chromatic number of random graphs for small (average) degrees. Additionally, we show how asymptotic formulas for large degrees that were previously obtained by lengthy and complicated combinatorial arguments can be re-derived easily from these new results. MSC: 05C80

Research of the first author supported in part by DFG CO 646. Research of the third author supported by the Australian Research Council Discovery Project DP190100977.

1. Introduction

1.1. Motivation and background

A most fascinating feature of combinatorics is how easy-to-state problems sometimes lead to deep and difficult mathematical challenges. The random graph colouring problem is a case in point. First mentioned in the seminal paper of Erdős and Rényi that started the theory of random graphs [23], the problem of finding the chromatic number of the binomial random graph $\mathbb{G}(n,d/n)$ with a fixed average degree $d$ remains open to this day. It is, in fact, the single open problem posed in that seminal paper that still awaits a complete solution. Nor is the chromatic number of the random $d$ -regular graph, a conceptually simpler object, known for all values of $d$ . Nevertheless, the quest for the chromatic number has contributed tremendously to the development of new techniques, some of which now count among the standard tools of probabilistic combinatorics [39].

A series of important papers contributed ever tighter bounds on the chromatic number of random graphs. (The binomial random graph $\mathbb{G}(n,d/n)$ is sometimes referred to as the Erdős-Rényi model. This is a slight misnomer, as Erdős and Rényi [23] actually worked with a uniformly random graph with a given number of edges.) Straightforward first moment calculations show that for any $q\geq 3$ and for $\mathbb{G}$ either the binomial random graph $\mathbb{G}(n,d/n)$ or the random regular graph $\mathbb{G}(n,d)$,

[TABLE]

To be precise, (1.1) is obtained by computing the expected number of $q$-colourings, which tends to zero as $n\to\infty$ if $\log q+d\log(1-1/q)/2<0$. (Throughout the paper the term $q$-colouring, or just colouring, refers to proper vertex colourings. That is, colours are assigned to the vertices of a graph such that no two adjacent vertices receive the same colour.) A celebrated contribution of Achlioptas and Naor [4] shows that for $\mathbb{G}=\mathbb{G}(n,d/n)$,

[TABLE]

The proof hinges on the computation of the second moment of the number of $q$-colourings, which involves a delicate analytical optimisation task. Following up on work of Achlioptas and Moore [3], Kemkes, Pérez-Giménez and Wormald [28] showed that (1.2) holds for the random regular graph $\mathbb{G}=\mathbb{G}(n,d)$ as well. Expanding (1.1)–(1.2) asymptotically for large $q$, we find $\chi(\mathbb{G})>q$ if $d>(2q-1)\log q+o_{q}(1)$, while $\chi(\mathbb{G})\leq q$ if $d\leq(2q-2)\log q-2+o_{q}(1)$, with $o_{q}(1)$ vanishing as $q\to\infty$. A series of papers [9, 10, 16] improved these asymptotic bounds to

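$$\chi(\mathbb{G})\begin{cases}\leq q&\text{if }d\leq(2q-1)\log q-2\log 2+o_{q}(1),\\>q&\text{if }d>(2q-1)\log q-1+o_{q}(1)\end{cases}\qquad\text{w.h.p.}\tag{1.3}$$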

for both the binomial and the random regular graph. But in the absence of explicit estimates of the $o_{q}(1)$ error term (1.3) fails to render improved bounds for any specific value of $q$ . Finally, several articles have been dedicated to the special case $q=3$ . For the binomial random graph the best bounds read [2, 22]

[TABLE]

For the random regular graph Diaz, Kaporis, Kemkes, Kirousis, Pérez and Wormald [18] showed that $\chi(\mathbb{G}(n,5))=3$ w.h.p. if a certain optimisation problem attains its maximum at a specific point, for which they provided numerical evidence. Moreover, Shi and Wormald [40, 41] proved analytically that $\chi(\mathbb{G}(n,4))=3$ , while (1.1) implies that $\chi(\mathbb{G}(n,6))>3$ . The proofs of all of the above lower bounds rely upon the first moment method, in some cases applied to cleverly designed random variables [9, 10, 22]. Similarly, all of the upper bounds derive from second moment arguments, with the exception of the upper bound from (1.4) and [40, 41], which are algorithmic.
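The large-$q$ expansion of the first moment bound quoted above can be checked in a few lines. The sketch below assumes that (1.1) amounts to the condition $\log q+d\log(1-1/q)/2<0$ stated in the text; the rearranged threshold $2\log q/(-\log(1-1/q))$ is our own rewriting of that condition rather than a formula copied from the paper.

```latex
\begin{align*}
\log q+\frac{d}{2}\log\Big(1-\frac{1}{q}\Big)<0
  &\iff d>\frac{2\log q}{-\log(1-1/q)},\\
-\log\Big(1-\frac{1}{q}\Big)
  &=\frac{1}{q}+\frac{1}{2q^{2}}+O(q^{-3})
   =\frac{1}{q}\Big(1+\frac{1}{2q}+O(q^{-2})\Big),\\
\frac{2\log q}{-\log(1-1/q)}
  &=2q\log q\,\Big(1-\frac{1}{2q}+O(q^{-2})\Big)
   =(2q-1)\log q+O\Big(\frac{\log q}{q}\Big).
\end{align*}
```

Since $\log q/q\to 0$, the error term is $o_{q}(1)$, in line with the bound $d>(2q-1)\log q+o_{q}(1)$. For $q=3$ the threshold is $2\log 3/\log(3/2)\approx 5.42$, which is consistent with the remark that (1.1) gives $\chi(\mathbb{G}(n,6))>3$.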

Additionally, physicists brought to bear a canny but non-rigorous technique called the ‘1RSB cavity method’ on the random graph colouring problem [44]. In the case of the random regular graph, the physics calculations predict an elegant formula. Let

[TABLE]

and let $\alpha^{*}\in[0,1]$ be the solution to the algebraic equation

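$$\sum_{i=0}^{q-1}(-1)^{i}\left(1-(i+1)(1-\alpha)/q\right)^{d-1}\left(\binom{q-1}{i}-\frac{1-\alpha}{q}\binom{q}{i+1}\right)=0\tag{1.6}$$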

that minimises $\Sigma_{d,q}(\,\cdot\,)$. (If there is more than one such value $\alpha^{*}$, choose one arbitrarily.) Then [44] predicts that

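$$\chi(\mathbb{G}(n,d))\begin{cases}\leq q&\text{if }\Sigma_{d,q}(\alpha^{*})\geq 0,\\>q&\text{if }\Sigma_{d,q}(\alpha^{*})<0\end{cases}\qquad\text{w.h.p.}\tag{1.7}$$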

There is a similarly precise, albeit more complicated, prediction as to the chromatic number of the binomial random graph; see Section 1.3 below.

The aim of this paper is to rigorously establish the lower bounds on the chromatic number predicted by the cavity method. In contrast to prior lower bound arguments, we do not rely on the first moment method. Instead, we adapt a technique from the mathematical physics of spin glasses known as the ‘interpolation method’ [24, 27, 38] to the graph colouring problem. In a combinatorial context, the interpolation method has previously been applied to establish a tight lower bound on the random $k$ -SAT threshold [21], to the independence number of random graphs [30] and other optimisation problems on random (hyper-)graphs [15, 37, 42] as well as to estimate the rank of random matrices over finite fields [11]. So it may not be surprising that the method can be made to work for graph colouring. However, the interpolation method remains relatively unknown in combinatorics, where, we believe, it may potentially improve over first moment bounds in many more applications as well. We therefore endeavour to explain the method at leisure in combinatorial terms to facilitate future applications of the interpolation method.

We proceed to state the results for the random regular graph precisely, followed by the lower bound on the chromatic number of the binomial random graph. Section 1.4 contains references to related work. An outline of the proof strategy follows in Section 2.

1.2. The random regular graph

Given $d,q\geq 3$ and with $\Sigma_{d,q}(\alpha)$ from (1.5) define

[TABLE]

Then we have the following lower bound on the chromatic number of the random regular graph.

Theorem 1.1.

If $q\geq 3$ and $d\geq d_{q}$ then $\chi(\mathbb{G}(n,d))>q$ w.h.p.

The function $\alpha\mapsto\Sigma_{d,q}(\alpha)$ is differentiable and $\Sigma_{d,q}(1)=0$. Furthermore, the calculations performed towards [44, eq. (35)] show that $\Sigma_{d,q}^{\prime}(0)<0$. Hence, whenever the minimum value of $\Sigma_{d,q}$ is negative, it is attained neither at $\alpha=1$ (where the function vanishes) nor at $\alpha=0$ (where the function is strictly decreasing), so the minimiser $\alpha$ lies in the open interval $(0,1)$ and must be a zero of $\Sigma_{d,q}^{\prime}(\,\cdot\,)$. It follows, after some algebra, that the minimiser is a solution to (1.6). Thus, Theorem 1.1 verifies the lower bound from (1.7), which [44] conjectures to be tight for all $q\geq 3$.
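To make the numerical evaluation of $\Sigma_{d,q}$ concrete, here is a minimal Python sketch. It rests on two assumptions that are not confirmed by the text as reproduced here: that (1.5) has the form $\Sigma_{d,q}(\alpha)=\log\big(\sum_{i=0}^{q-1}(-1)^{i}\binom{q}{i+1}(1-(i+1)(1-\alpha)/q)^{d}\big)-\frac{d}{2}\log\big(1-(1-\alpha)^{2}/q\big)$, and that $d_{q}$ is the smallest $d\geq 3$ for which the minimum over $\alpha\in[0,1]$ is negative. The assumed form satisfies $\Sigma_{d,q}(1)=0$, and setting its derivative to zero leads to the algebraic equation (1.6), but it should be checked against the paper before being relied upon. The function names, the grid search and the tolerance are illustrative choices.

```python
import math


def sigma(d: int, q: int, alpha: float) -> float:
    """ASSUMED form of Sigma_{d,q}(alpha); see the caveat in the lead-in."""
    eta = (1.0 - alpha) / q
    inner = sum(
        (-1) ** i * math.comb(q, i + 1) * (1.0 - (i + 1) * eta) ** d
        for i in range(q)
    )
    # Second term: -(d/2) * log(1 - (1 - alpha)^2 / q), using q * eta^2 = (1 - alpha)^2 / q.
    return math.log(inner) - 0.5 * d * math.log(1.0 - q * eta * eta)


def min_sigma(d: int, q: int, grid: int = 2000) -> float:
    """Crude grid minimum of alpha -> sigma(d, q, alpha) over [0, 1]."""
    return min(sigma(d, q, k / grid) for k in range(grid + 1))


def smallest_degree(q: int, d_max: int = 200, tol: float = 1e-9) -> int:
    """Smallest d >= 3 whose grid minimum is below -tol (assumed reading of d_q)."""
    for d in range(3, d_max + 1):
        if min_sigma(d, q) < -tol:
            return d
    raise ValueError("no degree up to d_max gives a negative minimum")
```

The tolerance guards against rounding noise at $\alpha=1$, where the function vanishes. A finer grid or a proper one-dimensional minimiser could be substituted for `min_sigma`; agreement with Table 1 beyond $q=3$ has not been verified here.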

Of course, we can evaluate $\Sigma_{d,q}$ numerically and calculate $d_{q}$ for any given $q$. The first few values are displayed in Table 1. For those $q$ where $d_{q}$ is displayed in boldface, the new bound strictly improves over the first moment bound (1.1); in the other cases the bounds coincide. In addition, Table 1 shows the value $d_{q,\mathrm{smm}}$ up to which (1.2) implies that $\chi(\mathbb{G}(n,d))\leq q$ w.h.p. (here “smm” is short for “second moment method”).
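Because the boldface is not visible in the table as reproduced here, the comparison with the first moment bound can be redone by hand. The following Python sketch copies the $d_{q}$ row of Table 1 and compares each entry with the smallest integer $d$ satisfying $\log q+d\log(1-1/q)/2<0$, which is the first moment condition stated in Section 1.1. Reading (1.1) as exactly this condition is an assumption, and the function names are ours.

```python
import math

# d_q for q = 3, ..., 20, copied from the d_q row of Table 1.
TABLE_D_Q = dict(zip(
    range(3, 21),
    [6, 10, 15, 20, 25, 31, 37, 44, 50, 57, 64, 71, 78, 86, 93, 101, 109, 117],
))


def first_moment_threshold(q: int) -> float:
    """Real d at which log q + d * log(1 - 1/q) / 2 changes sign."""
    return 2.0 * math.log(q) / -math.log1p(-1.0 / q)


def first_moment_degree(q: int) -> int:
    """Smallest integer d with log q + d * log(1 - 1/q) / 2 < 0."""
    return math.floor(first_moment_threshold(q)) + 1


def improved_values() -> list:
    """Values of q whose tabulated d_q is strictly below the first moment degree."""
    return [q for q, d_q in TABLE_D_Q.items() if d_q < first_moment_degree(q)]


if __name__ == "__main__":
    for q, d_q in TABLE_D_Q.items():
        print(q, d_q, first_moment_degree(q))
```

Under this reading, the entries returned by `improved_values()` are the candidates for the boldface entries mentioned above; every other entry coincides with the first moment degree.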

The asymptotic lower bound (1.3) on $\chi(\mathbb{G}(n,d))$ , which was derived in [10] via an extremely laborious first moment argument, follows from Theorem 1.1 at the expense of just a brief calculation. (See Section 5.) That said, in the limit of large $q$ we do not improve over (1.3), which is conjectured to be optimal up to the precise value of the $o_{q}(1)$ error term.

Corollary 1.2.

If $d>(2q-1)\log q-1+o_{q}(1)$ then $\chi(\mathbb{G}(n,d))>q$ w.h.p.

1.3. The binomial random graph

Locally the random regular graph is as ‘deterministic’ as it gets: for all but a bounded number of exceptional vertices, any bounded-depth neighbourhood is just a $d$ -regular tree w.h.p. By contrast, in the binomial random graph $\mathbb{G}(n,d/n)$ the neighbourhoods are random, distributed as the trees generated by a Galton-Watson process with offspring distribution ${\rm Po}(d)$ . The value of the chromatic number predicted by the cavity method mirrors this local non-uniformity. Indeed, while in the case of random regular graphs we obtained the scalar optimisation problem (1.8), in the binomial case we face an optimisation problem over a probability measure on the unit interval. To be precise, let $\mathfrak{a}$ be a probability distribution on $[0,1]$ . Moreover, let $(\boldsymbol{\alpha}_{i})_{i\geq 1}$ be a family of independent random variables with distribution $\mathfrak{a}$ . Additionally, let $\boldsymbol{D}\sim{\rm Po}(d)$ be independent of the $\boldsymbol{\alpha}_{i}$ . Then we define

[TABLE]

Setting

[TABLE]

we obtain the following lower bound on the chromatic number.

Theorem 1.3.

If $q\geq 3$ and $d>d_{q}^{*}$ then $\chi(\mathbb{G}(n,d/n))>q$ w.h.p.

Zdeborová and Krzakala predict that this bound is tight for all $q\geq 3$ [44].

Due to the optimisation over distributions $\mathfrak{a}$ , the value $d_{q}^{*}$ may be hard to evaluate. The physics literature relies upon a numerical heuristic called population dynamics [34] to tackle such optimisation problems, but of course there is no general guarantee that the true optimiser will be found. Yet fortunately Theorem 1.3 shows that any distribution $\mathfrak{a}$ yields an upper bound on $d_{q}^{*}$ , and thus a lower bound on the chromatic number. In particular, we could try atoms $\mathfrak{a}=\delta_{\alpha}$ with $\alpha\in[0,1]$ . For instance, we find that $\Sigma_{4.697,3}(\delta_{0.25})<0$ , whence $d_{3}^{*}\leq 4.697$ ; see Figure 1. Even this quick bound significantly improves over the best prior bound (1.4) from [22], based on a tricky first moment calculation, and comes within a whisker of the value $d_{3}^{*}\approx 4.687$ obtained via population dynamics [44]. In principle, the $4.697$ bound could be sharpened by optimising over distributions with a (small) finite support, but such a calculation seems to require computer assistance.

Similarly, substituting a suitable atom $\mathfrak{a}=\delta_{\alpha}$ into (1.10) suffices to rederive the large- $q$ asymptotic lower bound on the chromatic number from (1.3), originally established in [9] via a complicated first moment argument. As in the regular case we do not improve over (1.3) asymptotically for large $q$ ; again, (1.3) is conjectured to be optimal up to the precise value of the term hidden in the $o_{q}(1)$ .

Corollary 1.4.

If $d>(2q-1)\log q-1+o_{q}(1)$ then $\chi(\mathbb{G}(n,d/n))>q$ w.h.p.

1.4. Related work

The history of the random graph colouring problem is long and distinguished. Improving a prior result of Matula [32], Bollobás [8] determined the chromatic number of the dense binomial random graph $\mathbb{G}(n,p)$ for fixed $p\in(0,1)$, up to a multiplicative error of $1+o(1)$. Kučera and Matula obtained the same result via a different proof [33]. Łuczak [31] extended the approach from [32, 33] to sparse random graphs. His main result shows that w.h.p. for $p=o(1)$,

[TABLE]

Particularly for small edge probabilities $p$ the bound (1.11) is not quite satisfactory, as a result of Alon and Krivelevich [5] shows that the chromatic number of $\mathbb{G}(n,p)$ is concentrated on two consecutive integers if $p\leq n^{-1/2-\Omega(1)}$ . Seizing upon techniques from [4, 5], Coja-Oghlan, Panagiotou and Steger [14] determined a set of three consecutive integers on which the chromatic number concentrates for $p\leq n^{-3/4-\Omega(1)}$ . Furthermore, the aforementioned result of Achlioptas and Naor [4] determines the two integers on which $\chi(\mathbb{G}(n,d/n))$ concentrates, when $d>0$ is fixed. Yet in this case it is widely conjectured that there exists a sharp threshold for $q$ -colourability, i.e., that for each $q\geq 3$ there exists $d_{q}^{\star}>0$ such that $\chi(\mathbb{G}(n,d/n))\leq q$ if $d<d_{q}^{\star}$ while $\chi(\mathbb{G}(n,d/n))>q$ if $d>d_{q}^{\star}$ . Clearly, if such a $d_{q}^{\star}$ exists then the chromatic number would actually concentrate on a single integer for almost all $d\in(0,\infty)$ . Towards the sharp threshold conjecture, Achlioptas and Friedgut [1] established the existence of a non-uniform threshold sequence for every $q\geq 3$ . Physics predictions [44] assert that the $q$ -colourability threshold $d_{q}^{\star}$ coincides with $d_{q}^{*}$ from (1.10) for all $q\geq 3$ .

Concerning the random regular graph $\mathbb{G}(n,d)$ , Frieze and Łuczak [25] obtained an asymptotic bound akin to (1.11) for $d=o(n^{1/3})$ , which Cooper, Frieze, Reed and Riordan [17] extended to $d\leq n^{1-\Omega(1)}$ . Further, Krivelevich, Sudakov, Vu and Wormald [29] obtained an asymptotic formula akin to Bollobás’ result [8] for degrees $n^{6/7+\Omega(1)}\leq d\leq 0.9n$ . The best prior bounds on $\chi(\mathbb{G}(n,d))$ with fixed $d$ were stated in Section 1.1.

The physicists’ cavity method has inspired a great deal of rigorous work. Perhaps the most prominent example is the proof of the $k$ -SAT threshold conjecture for large $k$ by Ding, Sly and Sun [21]. The proof of the lower bound on the $k$ -SAT threshold is based on an impressive second moment argument, while the proof of the upper bound relies on the interpolation method. The way we use the interpolation method here is reminiscent of its application in [21]. Further problems in which the 1RSB cavity method has been vindicated include the independent set problem on random regular graphs [20], the regular $k$ -NAESAT problem [19] and the regular $k$ -SAT problem [13].

As for the history of the interpolation method itself, Guerra [27] invented the technique in order to study the free energy of the Sherrington-Kirkpatrick spin glass model. The interpolation method went on to become a mainstay of the mathematical physics of spin glasses (see, e.g., [36]). Franz and Leone [24] pioneered the use of the interpolation method for combinatorial problems. The approach was further elaborated and generalised by Panchenko and Talagrand [38], and their version of the interpolation method was applied to the $k$ -SAT problem in [21]. We will use (and adapt) the Panchenko–Talagrand version as well. Moreover, an important contribution of Bayati, Gamarnik and Tetali [7] applied a different variant of the interpolation method to prove, e.g., the existence of the limit $\lim_{n\to\infty}\alpha(\mathbb{G})/n$ of the normalised independence number of the random graph $\mathbb{G}=\mathbb{G}(n,d)$ or $\mathbb{G}=\mathbb{G}(n,d/n)$ . This version of the interpolation method does not provide estimates of the value of such limits. Sly, Sun and Zhang [42] combined the combinatorial interpolation scheme from [7] with the interpolation arguments from [24, 38] to derive bounds on the partition functions of random regular (and uniform) hypergraphs. For instance, [42, Theorem E.3] shows that the formula provided by the 1RSB cavity method yields an upper bound on the partition function for a variety of models. These models include the Potts antiferromagnet on the random regular graph, which plays a prominent role in the present paper as well. In particular, for the random regular graph $\mathbb{G}(n,d)$ Corollary 2.11 below, an important intermediate step towards the proof of Theorem 1.1, is a special case of [42, Theorem E.3]. Furthermore, building upon [37], Coja-Oghlan and Perkins [15] recently used the interpolation method to derive precise variational formulas for the partition functions of random regular (hyper-)graph models. 
The models studied in that paper include the Potts antiferromagnet as well, and the random regular graph version of Corollary 2.11 could be derived from [15, Theorem 7.6] with little effort. But since the expositions of the 1RSB interpolation method for random regular graphs in [15, 42] and for binomial random graphs in [38] are rather brisk, and since, strictly speaking, [38] does not cover the Potts model, we present the interpolation method from scratch, with a view to facilitating future uses of the method in combinatorics.

1.5. Preliminaries and notation

In order to avoid replications and case distinctions, throughout the paper we use the shorthand $\mathbb{G}$ to denote either the random regular graph $\mathbb{G}(n,d)$ or the binomial random graph $\mathbb{G}(n,d/n)$ . Most of the statements and arguments in the following sections are generic and apply to either model. There are just a few steps where we will need to treat the two models separately. If $\mathbb{G}=\mathbb{G}(n,d/n)$ is the binomial random graph then we let $\boldsymbol{D}\sim{\rm Po}(d)$ be a Poisson variable, while in the case of the random regular graph we let $\boldsymbol{D}=d$ deterministically. In either case we let $(\boldsymbol{D}_{i})_{i\geq 1}$ be independent copies of $\boldsymbol{D}$ .

As per common practice, we use the $O(\,\cdot\,)$ -notation to refer to the limit $n\to\infty$ . In our calculations we tacitly assume that $n$ is sufficiently large for the various estimates to be valid. In addition, in Section 5 we use $O_{q}(\,\cdot\,)$ -notation to refer to the limit of large $q$ as in Corollaries 1.2 and 1.4.

For a finite set $\Omega\neq\emptyset$ we denote by $\mathcal{P}(\Omega)$ the set of probability distributions on $\Omega$ . We identify $\mathcal{P}(\Omega)$ with the standard simplex in $\mathbb{R}^{\Omega}$ . Accordingly, $\mathcal{P}(\Omega)$ inherits its topology from $\mathbb{R}^{\Omega}$ . Further, we write $\mathcal{P}^{2}(\Omega)$ for the space of probability measures on $\mathcal{P}(\Omega)$ . We endow $\mathcal{P}^{2}(\Omega)$ with the weak topology, thus obtaining a Polish space. Additionally, $\mathcal{P}^{3}(\Omega)$ denotes the space of probability measures on $\mathcal{P}^{2}(\Omega)$ .

For a probability measure $\mu$ on a discrete probability space $\mathcal{X}$ we denote by $\boldsymbol{\sigma}^{\mu},\boldsymbol{\sigma}^{\mu,1},\boldsymbol{\sigma}^{\mu,2},\ldots\in\mathcal{X}$ independent samples drawn from $\mu$ . Where the reference to $\mu$ is apparent we omit $\mu$ from the superscripts and just write $\boldsymbol{\sigma}$ , $\boldsymbol{\sigma}^{1}$ , etc. For a function $X:\Omega^{\ell}\to\mathbb{R}$ we denote the expectation of $X(\boldsymbol{\sigma}^{1},\ldots,\boldsymbol{\sigma}^{\ell})$ by $\left\langle{{X},{\mu}}\right\rangle$ . Thus,

[TABLE]

Finally, we need the following version of a Markov random field. A factor graph

$$\mathcal{G}=(V,C,(\Omega_{v})_{v\in V},(\partial a)_{a\in C},(\psi_{a})_{a\in C})$$

consists of

• a finite set $V$ of variable nodes,

• a finite set $C$ of constraint nodes,

• a finite or countable range $\Omega_{v}$ for each $v\in V$,

• a subset $\partial a\subset V$ for each $a\in C$,

• a weight function $\psi_{a}:\prod_{v\in\partial a}\Omega_{v}\to[0,\infty)$ for each $a\in C$.

A factor graph can be represented by a bipartite graph with vertex sets $V$ and $C$ where the neighbourhood of $a\in C$ is just $\partial a$ . We further define the function $\psi_{\mathcal{G}}:\prod_{v\in V}\Omega_{v}\to[0,\infty)$ by

$$\psi_{\mathcal{G}}:\sigma\mapsto\prod_{a\in C}\psi_{a}(\sigma_{\partial a})$$

for all $\sigma=(\sigma_{v})_{v\in V}\in\prod_{v\in V}\Omega_{v}$, where $\sigma_{\partial a}$ denotes the restriction of $\sigma$ to $\partial a$. Finally, the partition function $Z(\mathcal{G})$ of $\mathcal{G}$ is defined by

[TABLE]

If $0<Z(\mathcal{G})<\infty$ then $\mathcal{G}$ gives rise to a probability distribution

[TABLE]

that is called the Boltzmann distribution of $\mathcal{G}$ .
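The factor graph definitions can be illustrated with a small brute-force implementation. The Python sketch below assumes the standard readings $Z(\mathcal{G})=\sum_{\sigma}\psi_{\mathcal{G}}(\sigma)$ and $\mu_{\mathcal{G}}(\sigma)=\psi_{\mathcal{G}}(\sigma)/Z(\mathcal{G})$, restricts itself to finite ranges, and stores each neighbourhood $\partial a$ as an ordered tuple so that $\psi_{a}$ can be called positionally. The class and helper names are hypothetical, and the enumeration is only feasible for tiny instances.

```python
import itertools
import math
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, List, Sequence, Tuple


@dataclass
class FactorGraph:
    # ranges[v] is the (finite) range Omega_v of variable node v.
    ranges: Dict[Hashable, Sequence]
    # Each constraint node a is a pair (ordered neighbourhood of a, weight function psi_a).
    constraints: List[Tuple[Tuple[Hashable, ...], Callable[..., float]]]

    def weight(self, sigma: Dict[Hashable, object]) -> float:
        """psi_G(sigma): product of psi_a(sigma restricted to the neighbourhood of a)."""
        w = 1.0
        for neighbours, psi in self.constraints:
            w *= psi(*(sigma[v] for v in neighbours))
        return w

    def assignments(self):
        """Enumerate all sigma in the product of the ranges."""
        variables = list(self.ranges)
        for values in itertools.product(*(self.ranges[v] for v in variables)):
            yield dict(zip(variables, values))

    def partition_function(self) -> float:
        """Z(G): sum of psi_G(sigma) over all assignments."""
        return sum(self.weight(s) for s in self.assignments())

    def boltzmann(self):
        """List of (sigma, mu_G(sigma)); requires 0 < Z(G) < infinity."""
        z = self.partition_function()
        if not 0.0 < z < math.inf:
            raise ValueError("Boltzmann distribution undefined for this Z")
        return [(s, self.weight(s) / z) for s in self.assignments()]


def colouring_factor_graph(vertices, edges, q: int) -> FactorGraph:
    """Hard-constraint q-colouring model: weight 1 if endpoints differ, else 0."""
    def not_equal(x, y):
        return 1.0 if x != y else 0.0
    return FactorGraph(
        ranges={v: range(q) for v in vertices},
        constraints=[((v, w), not_equal) for v, w in edges],
    )
```

For a triangle with $q=3$, `colouring_factor_graph` gives a partition function equal to the number of proper 3-colourings, and the Boltzmann distribution is uniform on them.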

2. Outline

We proceed to survey the proofs of the main results, deferring most technical details to the following sections.

2.1. The Potts antiferromagnet

The goal is to derive a lower bound on the chromatic number of the random graph $\mathbb{G}=\mathbb{G}(n,d)$ or $\mathbb{G}=\mathbb{G}(n,d/n)$ . We tackle this problem indirectly by way of a weighted version of the $q$ -colourability problem. To be precise, the $q$ -spin Potts antiferromagnet at inverse temperature $\beta>0$ on a multigraph $G=(V,E)$ is the probability distribution $\mu_{G,\beta}$ on $[q]^{V}$ defined by

[TABLE]

Here it is understood that each edge of $G$ contributes to the products in (2.1) and (2.2) according to its multiplicity. The strictly positive quantity $Z_{\beta}(G)$ , known as the partition function, ensures that $\mu_{G,\beta}$ is a probability measure. Moreover, we observe that the probability mass $\mu_{G,\beta}(\sigma)$ is governed by the number of edges that $\sigma$ renders monochromatic. Indeed, the product in (2.1) imposes an $\exp(-\beta)$ ‘penalty factor’ for every monochromatic edge. Thus, larger values of $\beta$ deliver higher penalties to monochromatic edges. In particular, if $\sigma$ is a $q$ -colouring of $G$ then the product evaluates to one. Therefore, the partition function is lower-bounded by the total number of $q$ -colourings of $G$ and $\lim_{\beta\to\infty}Z_{\beta}(G)$ equals the number of $q$ -colourings. Hence, $\chi(G)>q$ if there exists $\beta>0$ such that $Z_{\beta}(G)<1$ .
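The relation between $Z_{\beta}(G)$ and colourability can be checked by brute force on very small graphs. The Python sketch below assumes, following the description above, that $Z_{\beta}(G)$ is the sum over $\sigma\in[q]^{V}$ of $\exp(-\beta\cdot\#\{\text{monochromatic edges of }\sigma\})$, with repeated entries in the edge list counted according to multiplicity. Function names and the example graphs are illustrative.

```python
import itertools
import math


def monochromatic_edges(edges, sigma) -> int:
    """Number of edges (with multiplicity) whose endpoints share a colour."""
    return sum(1 for v, w in edges if sigma[v] == sigma[w])


def potts_partition_function(n: int, edges, q: int, beta: float) -> float:
    """Z_beta(G): sum over sigma in [q]^n of exp(-beta * #monochromatic edges)."""
    return sum(
        math.exp(-beta * monochromatic_edges(edges, sigma))
        for sigma in itertools.product(range(q), repeat=n)
    )


def count_colourings(n: int, edges, q: int) -> int:
    """Number of proper q-colourings, i.e. assignments with no monochromatic edge."""
    return sum(
        1
        for sigma in itertools.product(range(q), repeat=n)
        if monochromatic_edges(edges, sigma) == 0
    )


TRIANGLE = [(0, 1), (0, 2), (1, 2)]
K4 = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
```

For the complete graph on four vertices and $q=3$ there is no proper colouring, and grouping the $81$ assignments by their number of monochromatic edges gives $Z_{\beta}(K_{4})=36\mathrm{e}^{-\beta}+18\mathrm{e}^{-2\beta}+24\mathrm{e}^{-3\beta}+3\mathrm{e}^{-6\beta}$. This is below $1$ at $\beta=4$, which certifies $\chi(K_{4})>3$ in the sense of the last sentence above.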

Thus, our approach is to show that there exists $\beta>0$ such that if $d$ exceeds the thresholds stated in Theorems 1.1 and 1.3 then w.h.p. $\log Z_{\beta}(\mathbb{G})<0$ . To facilitate the analysis of $Z_{\beta}$ we will work with slightly modified and (for our purposes) more amenable random graph models. Specifically, fixing $\varepsilon>0$ , we let

$$\boldsymbol{m}\sim{\rm Po}_{\leq dn/2}((1-\varepsilon)dn/2)$$

be a Poisson variable conditioned on not exceeding $dn/2$ . Define $\boldsymbol{G}(n,d/n)$ as the random multigraph on the vertex set $V_{n}=\{v_{1},\ldots,v_{n}\}$ obtained by inserting $\boldsymbol{m}$ independent random edges $e_{1},\ldots,e_{\boldsymbol{m}}$ chosen uniformly out of all ${\binom{n}{2}}$ possible edges. Similarly, let $\boldsymbol{G}(n,d)$ be the random multigraph obtained from the following version of the configuration model: choose a matching $\boldsymbol{\Gamma}$ of size $\boldsymbol{m}$ of the complete graph on $V_{n}\times[d]$ uniformly at random. Then obtain $\boldsymbol{G}(n,d)$ by inserting one $vw$ -edge for every matching edge $\{(v,i),(w,j)\}\in\boldsymbol{\Gamma}$ . In order to avoid case distinctions, we use the symbol $\boldsymbol{G}$ to denote either $\boldsymbol{G}(n,d/n)$ or $\boldsymbol{G}(n,d)$ .
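The two auxiliary multigraph models can be sampled as follows. This Python sketch assumes that $\boldsymbol{m}$ is a ${\rm Po}((1-\varepsilon)dn/2)$ variable conditioned on not exceeding $dn/2$, realised here by rejection sampling, and that a uniformly random matching of size $\boldsymbol{m}$ on $V_{n}\times[d]$ can be obtained by shuffling the $dn$ clones and pairing the first $2\boldsymbol{m}$ of them consecutively. Vertices are labelled $0,\ldots,n-1$, and all function names are hypothetical.

```python
import random


def truncated_poisson(mean: float, upper: float, rng: random.Random) -> int:
    """Sample Po(mean) conditioned on being at most `upper`, by rejection."""
    while True:
        # The number of unit-rate exponential arrivals in [0, mean] is Po(mean).
        count, t = 0, rng.expovariate(1.0)
        while t <= mean:
            count += 1
            t += rng.expovariate(1.0)
        if count <= upper:
            return count


def sample_binomial_model(n: int, d: float, eps: float, rng: random.Random):
    """m independent edges, each uniform among the binom(n, 2) vertex pairs."""
    m = truncated_poisson((1 - eps) * d * n / 2, d * n / 2, rng)
    return [tuple(rng.sample(range(n), 2)) for _ in range(m)]


def sample_regular_model(n: int, d: int, eps: float, rng: random.Random):
    """Partial configuration model: a random matching of size m on V_n x [d]."""
    m = truncated_poisson((1 - eps) * d * n / 2, d * n / 2, rng)
    clones = [(v, i) for v in range(n) for i in range(d)]
    rng.shuffle(clones)
    # Pair clones 0-1, 2-3, ...; each matching edge {(v,i),(w,j)} becomes a vw-edge.
    return [(clones[2 * k][0], clones[2 * k + 1][0]) for k in range(m)]
```

Both samplers return edge lists that may contain repeated edges, and the configuration-model variant may also contain loops, in keeping with the multigraph setting. In the second model every vertex has degree at most $d$, since each clone is used at most once.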

Working with the Potts antiferromagnet rather than directly with the graph colouring problem offers two advantages. First, the partition function $Z_{\beta}(G)$ is always positive and $\log Z_{\beta}(G)$ enjoys a Lipschitz property with respect to edge additions/deletions. Indeed, adding or deleting a single edge can change $\log Z_{\beta}(G)$ by an additive term of at most $\beta$ in absolute value. (See Section 3.1 below.) Second, as a consequence of this Lipschitz property it is easy to prove that $\log Z_{\beta}(\mathbb{G})$ is tightly concentrated about its expectation. Although similar statements already appear in the literature (e.g., [6, 15]), we include the proof for completeness.
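The Lipschitz bound follows directly from the definition of the partition function. In the short derivation below, $G+e$ denotes the multigraph obtained from $G=(V,E)$ by adding one further edge $e=vw$ (this notation is ours), and $Z_{\beta}(G)$ is assumed to be the sum over $\sigma\in[q]^{V}$ of the product of the factors $\exp(-\beta\mathbb{1}\{\sigma_{x}=\sigma_{y}\})$ over the edges $xy$ of $G$, as described in the text.

```latex
\begin{align*}
Z_{\beta}(G+e)
  &=\sum_{\sigma\in[q]^{V}}\exp\big(-\beta\,\mathbb{1}\{\sigma_{v}=\sigma_{w}\}\big)
    \prod_{xy\in E}\exp\big(-\beta\,\mathbb{1}\{\sigma_{x}=\sigma_{y}\}\big),\\
\mathrm{e}^{-\beta}Z_{\beta}(G)
  &\leq Z_{\beta}(G+e)\leq Z_{\beta}(G),\\
-\beta
  &\leq\log Z_{\beta}(G+e)-\log Z_{\beta}(G)\leq 0.
\end{align*}
```

The middle line holds because the extra factor lies in $[\mathrm{e}^{-\beta},1]$ for every $\sigma$ and all summands are positive. Deleting an edge is the same computation read in reverse.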

Proposition 2.1.

For any $\varepsilon,\delta,\beta>0$ there is $\xi>0$ such that for sufficiently large $n$ we have

[TABLE]

Proposition 2.1 implies that the partition functions of $\mathbb{G}$ and $\boldsymbol{G}$ do not differ too much.

Corollary 2.2.

For any $\beta>0$ we have $\limsup_{\varepsilon\to 0}\limsup_{n\to\infty}\frac{1}{n}\left|{\mathbb{E}\left[{\log Z_{\beta}(\mathbb{G})}\right]-\mathbb{E}\left[{\log Z_{\beta}(\boldsymbol{G})}\right]}\right|=0.$

Finally, thanks to the following corollary it suffices to bound $\mathbb{E}[\log Z_{\beta}(\mathbb{G})]$ to show that $\mathbb{G}$ fails to be $q$-colourable.

Corollary 2.3.

If there is $\beta>0$ such that $\limsup_{n\to\infty}\frac{1}{n}\mathbb{E}\left[{\log Z_{\beta}(\mathbb{G})}\right]<0,$ then $\chi(\mathbb{G})>q$ w.h.p.

Proof.

If $\chi(G)\leq q$ then $Z_{\beta}(G)\geq 1$ for all $\beta>0$ . Hence,

[TABLE]

Now, assume that $\limsup_{n\to\infty}\frac{1}{n}\mathbb{E}[\log Z_{\beta}(\mathbb{G})]<-\delta<0$ for some $\beta>0$. Then Proposition 2.1 implies that $\log Z_{\beta}(\mathbb{G})\leq-\delta n/2$ w.h.p., and thus $\limsup_{n\to\infty}\mathbb{P}\left[{\log Z_{\beta}(\mathbb{G})\geq 0}\right]=0$. Thus, (2.5) shows that $\chi(\mathbb{G})>q$ w.h.p. ∎

The proofs of Proposition 2.1 and Corollary 2.2 can be found in Section 3. At the end of Section 2 we show how these results are used to prove our main theorems.

2.2. The interpolation scheme

The study of the partition function $Z_{\beta}(\mathbb{G})$ is closely intertwined with the study of the probability distribution $\mu_{\mathbb{G},\beta}$ from (2.1). What turns the latter task into a challenge is the possible presence of extensive stochastic dependencies amongst the colours that $\boldsymbol{\sigma}\in[q]^{V_{n}}$ , drawn from $\mu_{\boldsymbol{G},\beta}$ , assigns to the different vertices. While there are short range dependencies between the colour of a vertex $v$ and the colours of vertices in its proximity, the expansion properties of $\mathbb{G}$ are apt to cause long-range dependencies as well.

To cope with this issue, we are going to compare $\mathbb{G}$ with another random graph model $\boldsymbol{G}_{1}$ in which the dependencies between the vertices are more manageable. Specifically, we will upper-bound $\mathbb{E}[\log Z_{\beta}(\mathbb{G})]$ in terms of $\mathbb{E}[\log Z_{\beta}(\boldsymbol{G}_{1})]$ . To this end we will construct an interpolating family of random graphs $(\boldsymbol{G}_{t})_{t\in[0,1]}$ such that $\boldsymbol{G}_{0}$ essentially coincides with the random graph $\boldsymbol{G}$ from Section 2.1. To compare $\boldsymbol{G}_{0}$ and $\boldsymbol{G}_{1}$ we will show that $\frac{\partial}{\partial t}\mathbb{E}[\log Z_{\beta}(\boldsymbol{G}_{t})]$ is non-negative. This general proof strategy is known as the interpolation method. The specific interpolation scheme $(\boldsymbol{G}_{t})_{t\in[0,1]}$ that we use is an adaptation of the construction that Panchenko and Talagrand [38] used to study binary problems on binomial random hypergraphs (e.g., random $k$ -SAT formulas). In the case of random regular graphs, the present construction can actually be viewed as a special case of the interpolation scheme from [15]. But since we need to perform the analysis for the binomial random graph anyway, a unified treatment of both models incurs little overhead.
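The way the sign of the derivative translates into a comparison of the two endpoints can be spelled out in one line. The display below assumes that $t\mapsto\mathbb{E}[\log Z_{\beta}(\boldsymbol{G}_{t})]$ is continuously differentiable on $[0,1]$, a regularity condition that is our assumption here and is not stated in the paragraph above.

```latex
\begin{align*}
\mathbb{E}[\log Z_{\beta}(\boldsymbol{G}_{1})]-\mathbb{E}[\log Z_{\beta}(\boldsymbol{G}_{0})]
  =\int_{0}^{1}\frac{\partial}{\partial t}\,\mathbb{E}[\log Z_{\beta}(\boldsymbol{G}_{t})]\,\mathrm{d}t
  \;\geq\;0 .
\end{align*}
```

Hence $\mathbb{E}[\log Z_{\beta}(\boldsymbol{G}_{0})]\leq\mathbb{E}[\log Z_{\beta}(\boldsymbol{G}_{1})]$, and an upper bound on the more tractable model $\boldsymbol{G}_{1}$ carries over to $\boldsymbol{G}_{0}$.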

The elements $\boldsymbol{G}_{t}$ of the interpolation scheme will not be plain random graphs but random factor graphs. To construct the interpolating family, fix a probability measure $\mathfrak{r}\in\mathcal{P}^{3}([q])$ as well as parameters $\varepsilon,\beta>0$ and a probability distribution $\gamma$ on $\mathbb{N}$ . Let $(\boldsymbol{r}_{i},\boldsymbol{r}_{i,j},\boldsymbol{r}_{i}^{\prime},\boldsymbol{r}_{i}^{\prime\prime},\hat{\boldsymbol{r}})_{i,j\geq 1}$ be mutually independent random variables with distribution $\mathfrak{r}$ ; thus, $\boldsymbol{r}_{i},\boldsymbol{r}_{i,j},\boldsymbol{r}_{i}^{\prime},\boldsymbol{r}_{i}^{\prime\prime},\hat{\boldsymbol{r}}\in\mathcal{P}^{2}([q])$ . Next, given $(\boldsymbol{r}_{i},\boldsymbol{r}_{i,j},\boldsymbol{r}_{i}^{\prime},\boldsymbol{r}_{i}^{\prime\prime},\hat{\boldsymbol{r}})_{i,j}$ let

[TABLE]

be a set of mutually independent random variables such that the ${\boldsymbol{\rho}}_{h,i}$ have distribution $\boldsymbol{r}_{i}$ , the ${\boldsymbol{\rho}}_{h,i,j}$ have distribution $\boldsymbol{r}_{i,j}$ , the ${\boldsymbol{\rho}}_{h,i}^{\prime}$ have distribution $\boldsymbol{r}_{i}^{\prime}$ , the ${\boldsymbol{\rho}}_{h,i}^{\prime\prime}$ have distribution $\boldsymbol{r}_{i}^{\prime\prime}$ and the $\hat{\boldsymbol{\rho}}_{h}$ have distribution $\hat{\boldsymbol{r}}$ . Thus, all random variables in $\{{\boldsymbol{\rho}}_{h,i},\,{\boldsymbol{\rho}}_{h,i,j},\,{\boldsymbol{\rho}}_{h,i}^{\prime},\,{\boldsymbol{\rho}}_{h,i}^{\prime\prime},\,\hat{\boldsymbol{\rho}}_{h}\,\mid\,i,j,h\geq 1\}$ are mutually independent given $(\boldsymbol{r}_{i},\boldsymbol{r}_{i,j},\boldsymbol{r}_{i}^{\prime},\boldsymbol{r}_{i}^{\prime\prime},\hat{\boldsymbol{r}})_{i,j\geq 1}$ . Additionally, let

[TABLE]

be mutually independent and independent of everything else. Define the event

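$$\mathcal{M}=\left\{2\boldsymbol{M}_{t}+\boldsymbol{M}_{t}^{\prime}\leq dn,\ \boldsymbol{M}_{t}+\boldsymbol{M}_{t}^{\prime}+\boldsymbol{M}_{t}^{\prime\prime}\leq dn\right\}\tag{2.8}$$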

and write $(\boldsymbol{m}_{t},\boldsymbol{m}_{t}^{\prime},\boldsymbol{m}_{t}^{\prime\prime})$ for $(\boldsymbol{M}_{t},\boldsymbol{M}_{t}^{\prime},\boldsymbol{M}_{t}^{\prime\prime})$ given $\mathcal{M}$ .

Remark 2.4.

Although the above description of the random variables is complete and correct, now seems to be a propitious moment to dwell on the measure-theoretic basis of the construction. It can be implemented on a standard Borel space. To this end we identify the space $\mathcal{P}([q])$ with the standard simplex in $\mathbb{R}^{q}$ . Thus, $\mathcal{P}([q])$ inherits the Euclidean topology and the corresponding Borel algebra. Let

[TABLE]

be a measurable function and let

[TABLE]

be mutually independent random variables that are uniformly distributed on the unit interval $[0,1]$ , all defined on a common standard Borel space. Then $\mathfrak{R}$ induces a distribution $\mathfrak{r}\in\mathcal{P}^{3}([q])$ as for a given $x\in[0,1]$ we naturally obtain a distribution $\mathfrak{R}_{x}\in\mathcal{P}^{2}([q])$ , namely the distribution of the $\mathcal{P}([q])$ -valued random variable $\mathfrak{R}_{x,\boldsymbol{y}_{1,1}}$ . Consequently, the distribution $\mathfrak{r}$ of the $\mathcal{P}^{2}([q])$ -valued random variable $\mathfrak{R}_{\boldsymbol{x}_{1}}$ belongs to the space $\mathcal{P}^{3}([q])$ . Indeed, since $\mathcal{P}([q])$ is a complete separable metric space, any distribution $\mathfrak{r}\in\mathcal{P}^{3}([q])$ can be represented by a map $\mathfrak{R}$ in this manner. Now, the above $\boldsymbol{r}_{i}$ can be identified with the $\mathcal{P}^{2}([q])$ -valued random variables $\mathfrak{R}_{\boldsymbol{x}_{i}}$ , and similarly for $\boldsymbol{r}_{i,j},\boldsymbol{r}_{i}^{\prime},\boldsymbol{r}_{i}^{\prime\prime},\hat{\boldsymbol{r}}_{i}$ . Moreover, the ${\boldsymbol{\rho}}_{h,i},{\boldsymbol{\rho}}_{h,i,j},{\boldsymbol{\rho}}_{h,i}^{\prime},{\boldsymbol{\rho}}_{h,i}^{\prime\prime},\hat{\boldsymbol{\rho}}_{h}$ can be identified with $\mathfrak{R}_{\boldsymbol{x}_{i},\boldsymbol{y}_{h,i}},\mathfrak{R}_{\boldsymbol{x}_{i,j},\boldsymbol{y}_{h,i,j}},\mathfrak{R}_{\boldsymbol{x}_{i}^{\prime},\boldsymbol{y}_{h,i}^{\prime}},\mathfrak{R}_{\boldsymbol{x}_{i}^{\prime\prime},\boldsymbol{y}_{h,i}^{\prime\prime}},\mathfrak{R}_{\hat{\boldsymbol{x}},\hat{\boldsymbol{y}}_{h}}.$

All the factor graphs $\boldsymbol{G}_{t}$ have variable nodes

[TABLE]

with $s$ ranging over $\mathbb{N}$ (that is, $\Omega_{s}=\mathbb{N}$ ), and $v_{1},\ldots,v_{n}$ ranging over $[q]$ . The constraint nodes are

[TABLE]

How constraint and variable nodes are connected depends on whether $\mathbb{G}$ is the binomial or the regular random graph.

Definition 2.5 (binomial case).

The connections between the constraint nodes and the variable nodes are as follows.

• Each $e_{i}$ , $i\in[\boldsymbol{m}_{t}]$ , is adjacent to a random pair of two distinct variable nodes from $V_{n}$ ; these pairs are drawn uniformly and independently of everything else.

• Each $a_{i}$ , $i\in[\boldsymbol{m}_{t}^{\prime}]$ , is adjacent to $s$ and one random variable node from $V_{n}$ drawn uniformly and independently of everything else.

• The constraint nodes $g,b_{1},\ldots,b_{\boldsymbol{m}_{t}^{\prime\prime}}$ are adjacent to the variable node $s$ only.
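The wiring rules of Definition 2.5 can be sketched in a few lines of Python. This is only an illustration for fixed constraint counts: the function name, the dictionary layout and the 0-based vertex labels are assumptions made for the sketch, and the counts `m`, `m_prime`, `m_dprime` stand in for $\boldsymbol{m}_{t},\boldsymbol{m}_{t}^{\prime},\boldsymbol{m}_{t}^{\prime\prime}$ .

```python
import random


def sample_binomial_factor_graph(n, m, m_prime, m_dprime, rng=None):
    """Sample the adjacency structure of Definition 2.5 for fixed counts.

    Variable nodes are 0..n-1 (standing for v_1..v_n) plus the string "s".
    Constraint nodes are keyed as ("e", i), ("a", i), ("b", i) and ("g",).
    All names and the data layout are hypothetical, chosen for illustration.
    """
    rng = rng or random.Random()
    adjacency = {}
    for i in range(1, m + 1):
        # e_i: a uniformly random pair of two distinct variable nodes from V_n
        v, w = rng.sample(range(n), 2)
        adjacency[("e", i)] = (v, w)
    for i in range(1, m_prime + 1):
        # a_i: the node s together with one uniformly random node from V_n
        adjacency[("a", i)] = ("s", rng.randrange(n))
    # g and b_1..b_{m''} are adjacent to s only
    adjacency[("g",)] = ("s",)
    for i in range(1, m_dprime + 1):
        adjacency[("b", i)] = ("s",)
    return adjacency
```

Calling `sample_binomial_factor_graph(10, 7, 5, 4, random.Random(0))` returns a dictionary with 17 constraint nodes; the pairs attached to the `("e", i)` keys are always distinct, as the definition requires.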

The construction in the random regular case resembles the ‘configuration model’.

Definition 2.6 (regular case).

Let $\boldsymbol{\Gamma}_{t}$ be a uniformly random maximal matching of the complete bipartite graph with vertex classes

[TABLE]

this matching covers the left vertex set completely because $2\boldsymbol{m}_{t}+\boldsymbol{m}_{t}^{\prime}\leq dn$ .

• Each constraint node $e_{i}$ is adjacent to the variable nodes $v,w$ for which $\boldsymbol{\Gamma}_{t}$ contains edges between $(e_{i},1)$ and $\{v\}\times[d]$ and $(e_{i},2)$ and $\{w\}\times[d]$ .

• Each $a_{i}$ is adjacent to $s$ and to the variable node $w$ for which $\boldsymbol{\Gamma}_{t}$ contains an edge between $a_{i}$ and $\left\{{w}\right\}\times[d]$ .

• The constraints $g,b_{1},\ldots,b_{\boldsymbol{m}_{t}^{\prime\prime}}$ are adjacent to $s$ only.

Finally, we need to define the weight functions of the constraint nodes: let

[TABLE]

Thus, $\psi_{g}$ simply weighs the value $s$ according to the given probability distribution $\gamma$ . Moreover, the constraint nodes $e_{i}$ simulate the effect of the edges of the original graph $\boldsymbol{G}$ as in the definition (2.1) of the Potts model. Indeed, if the variable nodes adjacent to $e_{i}$ are coloured the same then the weight is $\exp(-\beta)$ ; otherwise it is one. Moreover, $\psi_{a_{i}}$ weighs the colour $\sigma$ of the adjacent variable node from $V_{n}$ according to ${\boldsymbol{\rho}}_{\sigma_{s},i}$ . Further, $\psi_{b_{i}}(\sigma_{s})$ is determined by the probability that two colours chosen independently from ${\boldsymbol{\rho}}_{\sigma_{s},i}^{\prime},{\boldsymbol{\rho}}_{\sigma_{s},i}^{\prime\prime}\in\mathcal{P}([q])$ coincide. The total weight $\psi_{\boldsymbol{G}_{t}}$ , partition function $Z(\boldsymbol{G}_{t})$ and the Boltzmann distribution $\mu_{\boldsymbol{G}_{t}}$ are defined by the general formulas (1.13)–(1.15). In the physics literature the $a_{i}$ are called external fields [34]. A similar construction involving an extra $\mathbb{N}$ -valued variable node $s$ was used in [42].
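To make the verbal description of the weight functions concrete, here is a minimal Python sketch. The closed forms used for `psi_a` and `psi_b` are assumptions: they are the natural expressions matching the description above and the bounds $\exp(-\beta)\leq\psi\leq 1$ , but the displayed definitions should be taken as authoritative. Colours are indexed from 0, and the function names are hypothetical.

```python
import math


def psi_e(beta, sigma_v, sigma_w):
    """Edge constraint: weight exp(-beta) if both endpoints share a colour, else 1."""
    return math.exp(-beta) if sigma_v == sigma_w else 1.0


def psi_a(beta, rho, sigma_v):
    """External field (assumed form): rho is the probability vector rho_{sigma_s, i}.

    The colour sigma_v of the adjacent node from V_n is penalised in proportion
    to the mass that rho places on it.
    """
    return 1.0 - (1.0 - math.exp(-beta)) * rho[sigma_v]


def psi_b(beta, rho1, rho2):
    """Constraint on s only (assumed form): rho1, rho2 play the roles of rho', rho''.

    The penalty is driven by the probability that two independent colours,
    one drawn from rho1 and one from rho2, coincide.
    """
    coincide = sum(p * r for p, r in zip(rho1, rho2))
    return 1.0 - (1.0 - math.exp(-beta)) * coincide
```

With these forms, an external field whose `rho` is an atom on one colour acts on its neighbour exactly like an edge to a vertex frozen to that colour: `psi_a(beta, [0, 1, 0], 1)` equals `math.exp(-beta)` up to rounding.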

At ‘time’ $t=1$ , (2.7) ensures that $\boldsymbol{m}_{t}=\boldsymbol{m}_{t}^{\prime\prime}=0$ . Thus, the only constraints present are the $a_{i}$ . Each of them is connected to the variable node $s$ and to one other variable node. Hence, the factor graph is star-shaped with the variable node $s$ at the centre. In effect, the variable nodes $v_{1},\ldots,v_{n}$ are dependent only through $s$ .
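The statement that $v_{1},\ldots,v_{n}$ interact only through $s$ can be checked by brute force on a toy instance. The sketch below rests on several assumptions: $s$ is truncated to finitely many values, the external fields use the assumed form $1-(1-e^{-\beta})\rho(\sigma)$ , and all names are hypothetical. It compares full enumeration over $(\sigma_{s},\sigma_{v_{1}},\ldots,\sigma_{v_{n}})$ with the product formula that results once $\sigma_{s}$ is fixed.

```python
import itertools
import math
import random


def random_simplex_point(q, rng):
    """A random probability vector on q colours (illustrative helper)."""
    x = [rng.random() for _ in range(q)]
    total = sum(x)
    return [a / total for a in x]


def z_brute_force(q, n, gamma, fields, attach, beta):
    """Sum over all (s, sigma) of gamma[s] times the product of the field weights.

    fields[s][i] is the probability vector of field a_i when sigma_s = s,
    attach[i] is the vertex of V_n (0-based) that a_i is adjacent to.
    """
    lam = 1.0 - math.exp(-beta)
    total = 0.0
    for s, weight_s in enumerate(gamma):
        for sigma in itertools.product(range(q), repeat=n):
            w = weight_s
            for i, v in enumerate(attach):
                w *= 1.0 - lam * fields[s][i][sigma[v]]
            total += w
    return total


def z_factorised(q, n, gamma, fields, attach, beta):
    """Same quantity, using that the vertices decouple once sigma_s is fixed."""
    lam = 1.0 - math.exp(-beta)
    total = 0.0
    for s, weight_s in enumerate(gamma):
        product_over_vertices = 1.0
        for v in range(n):
            product_over_vertices *= sum(
                math.prod(
                    1.0 - lam * fields[s][i][c]
                    for i, u in enumerate(attach)
                    if u == v
                )
                for c in range(q)
            )
        total += weight_s * product_over_vertices
    return total
```

The factorised version costs a number of operations polynomial in $n$ for each value of $\sigma_{s}$ , whereas the enumeration is exponential in $n$ ; this is the sense in which the star-shaped model is tractable.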

By contrast, at $t=0$ , (2.7) yields $\boldsymbol{m}_{t}^{\prime}=0$ . Thus, the factor graph contains only constraints of type $e_{i}$ and of type $b_{i}$ . In effect, $\boldsymbol{G}_{0}$ decomposes into two parts. The connected component of $s$ contains all the constraint nodes $b_{i}$ , none of which is connected with $v_{1},\ldots,v_{n}$ . Thus, once more there is a star structure with $s$ at the centre, and it is not too difficult to write out the partition function of this component. Furthermore, the factor graph induced on $v_{1},\ldots,v_{n}$ and $e_{1},\ldots,e_{\boldsymbol{m}_{0}}$ is essentially identical to the original graph $\boldsymbol{G}$ . More specifically, the Boltzmann distribution $\mu_{\boldsymbol{G}_{0}}$ mimics that of the Potts antiferromagnet $\mu_{\boldsymbol{G},\beta}$ from (2.1). The only difference, negligible for our purposes, is that $\boldsymbol{G}_{0}$ typically has slightly fewer than $dn/2$ constraint nodes of the type $e_{i}$ . Thus, we can relate the partition functions $Z_{\beta}(\boldsymbol{G})$ and $Z(\boldsymbol{G}_{0})$ ; see Figure 2 for an illustration.

We observe that the distribution of the degrees of $v_{1},\ldots,v_{n}$ remains essentially the same for $0\leq t\leq 1$ . Specifically, in the regular case most variables have degree exactly $d$ throughout the interpolation, and in the binomial case the degrees are approximately ${\rm Po}((1-\varepsilon)d)$ distributed. Additionally, the total number of constraints remains (essentially) constant throughout the interpolation as well. Indeed, at $t=0$ there are about $(1-\varepsilon)dn/2$ constraints of type $e_{i}$ and about the same number of constraints $b_{i}$ , while at $t=1$ we have about $(1-\varepsilon)dn$ constraints of type $a_{i}$ .
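The bookkeeping in this paragraph can be illustrated numerically. The sketch below assumes that the Poisson means interpolate linearly in $t$ between the endpoint values quoted above (about $(1-\varepsilon)dn/2$ constraints of each of the types $e_{i}$ and $b_{i}$ at $t=0$ , about $(1-\varepsilon)dn$ constraints $a_{i}$ at $t=1$ ); the linear form and the function names are assumptions, and (2.7) remains the authoritative definition.

```python
def poisson_means(t, d, n, eps):
    """Assumed linear interpolation of the expected constraint counts at time t."""
    base = (1.0 - eps) * d * n
    return {
        "e": (1.0 - t) * base / 2.0,  # edge-type constraints e_i
        "a": t * base,                # external fields a_i
        "b": (1.0 - t) * base / 2.0,  # constraints b_i attached to s only
    }


def expected_degree_sum(means):
    """Expected total degree of v_1..v_n: each e_i contributes 2, each a_i contributes 1."""
    return 2.0 * means["e"] + means["a"]
```

Under this assumption both `expected_degree_sum(poisson_means(t, d, n, eps))` and the total expected number of constraints equal $(1-\varepsilon)dn$ for every $t\in[0,1]$ , which matches the claim that degrees and constraint counts stay essentially constant along the interpolation.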

As mentioned above, the idea behind the construction is to compare $\mathbb{E}[\log Z_{\beta}(\mathbb{G})]$ with the partition function of a simpler model where correlations amongst $v_{1},\ldots,v_{n}$ are amenable to a precise analysis. The following two propositions spell out this relationship precisely.

Proposition 2.7.

Let

[TABLE]

Then for any $\delta,\beta>0$ there exists $\varepsilon_{0}(d,\delta,\beta)>0$ such that for all $0<\varepsilon<\varepsilon_{0}(d,\delta,\beta)$ and for all $n>1/\varepsilon_{0}(d,\delta,\beta)$ we have $\mathbb{E}[\log Z(\boldsymbol{G}_{0})]\geq{\mathbb{E}[\log Z_{\beta}(\mathbb{G})]}+\mathbb{E}[\log Y^{\prime}]-\delta n$ .

Furthermore, the following proposition shows that $Z(\boldsymbol{G}_{1})$ dominates $Z(\boldsymbol{G}_{0})$ . The proof is based on estimating the derivative $\frac{\partial}{\partial t}\mathbb{E}[\log Z(\boldsymbol{G}_{t})]$ .

Proposition 2.8.

We have $\mathbb{E}[\log Z(\boldsymbol{G}_{0})]\leq\mathbb{E}[\log Z(\boldsymbol{G}_{1})]+o(n).$

Finally, we introduce a convenient proxy for the partition function of $\boldsymbol{G}_{1}$ : let

[TABLE]

Corollary 2.9.

For any $\beta>0$ we have ${\mathbb{E}[\log Z_{\beta}(\mathbb{G})]}\leq\mathbb{E}[\log Y]-\mathbb{E}[\log Y^{\prime}]+o(n)$ .

The proofs of Propositions 2.7 and 2.8 and Corollary 2.9 can be found in Section 4. We are thus left to study $Y,Y^{\prime}$ , which are approximations to the partition function of the factor graph $\boldsymbol{G}_{1}$ and the partition function of the $s$ -component of $\boldsymbol{G}_{0}$ , respectively.

Let us wrap up by dwelling on the intended combinatorial semantics of the construction. The nodes $v_{1},\ldots,v_{n}$ and $e_{1},\ldots,e_{\boldsymbol{m}_{t}}$ clearly mimic the original Potts antiferromagnet. But as we move along we replace more and more $e_{i}$ by external fields $a_{j}$ . These are meant to capture the physics intuition as to the nature of the interactions between variable nodes in the random graph $\mathbb{G}$ ; Corollary 2.9 corroborates the physics picture to the extent that it yields an upper bound on the partition function. Specifically, the impact that an actual edge $e=vw$ of $\mathbb{G}$ has on an incident vertex $v$ is thought to be governed by the local graph structure around the other vertex $w$ in the graph $\mathbb{G}-e$ obtained by removing $e$ [34]. Since short cycles are scarce, the local graph structure will likely be a tree. Indeed, it will just be a $(d-1)$ -ary tree in the random regular graph, and a ${\rm Po}(d)$ Galton-Watson tree in the binomial case. In the binomial case, the specific tree structure is apt to impact the influence that $w$ exerts on $v$ . For example, if the Galton-Watson tree dies out quickly, then it will be easy to colour the entire tree properly regardless of the colour of $v$ . Thus, the edge $e=vw$ will be of little consequence. By contrast, in the event of a relatively dense tree, choosing a specific colour for $v$ might have repercussions on a large number of other vertices. The random variables $\boldsymbol{r}_{i}$ are meant to capture the randomness of the tree structure pending on vertex $w$ . But for the sake of simplicity, we do not incorporate an actual distribution on trees into our construction. Instead, we make do with the distribution $\boldsymbol{r}_{i}\in\mathcal{P}^{2}([q])$ that is meant to just capture the ensuing impact that $w$ has on $v$ .

Furthermore, the variable node $s$ is intended to represent the conjectured structure of the Boltzmann distribution $\mu_{\mathbb{G},\beta}$ . To elaborate: according to physics intuition, the distribution $\mu_{\mathbb{G},\beta}$ partitions the phase space $[q]^{V_{n}}$ into an unbounded number of ‘clusters’ $(S_{i})_{i}$ for $d$ close to the $q$ -colourability threshold and $\beta$ large [35, 44]. Inside a cluster, i.e., under the conditional distribution $\mu_{\mathbb{G},\beta}(\,\cdot\,\mid S_{i})$ , most vertices $w$ are strongly polarised towards a particular colour. In other words, the conditional marginals $\mu_{\mathbb{G},\beta}(\{\boldsymbol{\sigma}_{w}=c\}\mid S_{i})$ for $c\in[q]$ are typically either fairly close to zero or to one, while of course overall the marginal of the colour of each vertex is just uniform. The variable node $s$ is intended to represent the choice of the cluster $S_{i}$ . Thus, the distribution $\boldsymbol{r}_{i}\in\mathcal{P}^{2}([q])$ , which mimics the local graph structure, determines how the marginal of $w$ is distributed given a cluster index, and then the sample ${\boldsymbol{\rho}}_{\sigma_{s},i}(c)$ represents the actual realisation of the distribution of the colour inside cluster number $\sigma_{s}$ . Finally, $\gamma(s)$ models the distribution of the relative cluster volumes $\mu_{\mathbb{G},\beta}(S_{i})$ .

2.3. Poisson-Dirichlet weights

While the expression $\mathbb{E}[\log Y]-\mathbb{E}[\log Y^{\prime}]$ from Corollary 2.9 already bears a certain resemblance to (1.9), an important difference remains. Namely, the expressions $Y,Y^{\prime}$ inside the logarithm still contain $n$ , the number of vertices. If the probability distribution $\gamma$ is an atom, that is, $\gamma(h)=1$ for some $h\in\mathbb{N}$ , then we can produce the same joint distribution on $\sigma_{v_{1}},\ldots,\sigma_{v_{n}}$ by deleting $s$ and $g$ from the factor graphs $G_{0}$ and $G_{1}$ and replacing $\sigma_{s}$ by $h$ in the expressions for $Y$ and $Y^{\prime}$ . This causes $Y$ and $Y^{\prime}$ to factorise:

[TABLE]

In particular, long-range correlations are completely absent in the target $\boldsymbol{G}_{1}$ of the interpolation. (The modified $\boldsymbol{G}_{1}$ with $s$ and $g$ deleted consists of $n$ connected components, each containing exactly one $v_{i}$ .) In physics jargon the bound on $\mathbb{E}[\log Z_{\beta}(\boldsymbol{G})]$ that can be obtained from (2.10) is called the replica symmetric bound. While the replica symmetric bound easily implies the first moment bound (1.1), it does not appear sufficient to prove Theorems 1.1 and 1.3 for any $q\geq 3$ .

Fortunately there is another choice of the distribution $\gamma$ that leads to a simple formula. Recall that the Poisson-Dirichlet distribution with parameter $y\in(0,1)$ is defined as follows. Let $\boldsymbol{P}\subset(0,\infty)$ be the countable point set generated by a Poisson point process on $(0,\infty)$ with density $x^{-1-y}{\mathrm{d}}x$ , independent of all other sources of randomness that have been introduced thus far. Further, let $(\boldsymbol{p}_{h})_{h\geq 1}$ be the sequence that comprises the points from $\boldsymbol{P}$ in decreasing order, i.e., $\boldsymbol{p}_{h}\geq\boldsymbol{p}_{h+1}$ for all $h$ . Since $0<y<1$ , we have $\sum_{h=1}^{\infty}\boldsymbol{p}_{h}<\infty$ almost surely. Therefore,

$${\boldsymbol{\gamma}}(h)=\frac{\boldsymbol{p}_{h}}{\sum_{j\geq 1}\boldsymbol{p}_{j}}\qquad(h\geq 1)$$

defines a probability measure on $\mathbb{N}$ , the Poisson-Dirichlet law. To be precise, ${\boldsymbol{\gamma}}$ is a random probability measure which depends on $\boldsymbol{P}$ . This distribution is used in the following lemma, which enables us to simplify $\mathbb{E}[\log Y],\ \mathbb{E}[\log Y^{\prime}]$ .
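For intuition, the Poisson-Dirichlet weights can be simulated approximately. The sketch below is a truncated sampler under stated assumptions: it keeps only the `num_points` largest points of the process and normalises by their sum, so it approximates ${\boldsymbol{\gamma}}$ rather than sampling it exactly. It uses the standard fact that the number of points of $\boldsymbol{P}$ exceeding $x$ is Poisson with mean $\int_{x}^{\infty}u^{-1-y}{\mathrm{d}}u=x^{-y}/y$ , so the $h$ -th largest point can be written as $(y\Gamma_{h})^{-1/y}$ with $\Gamma_{h}$ the $h$ -th arrival time of a unit-rate Poisson process. The function name is hypothetical.

```python
import random


def poisson_dirichlet_weights(y, num_points, rng=None):
    """Approximate Poisson-Dirichlet weights with parameter 0 < y < 1.

    Returns the num_points largest points of a Poisson point process on
    (0, infinity) with density x^(-1-y) dx, in decreasing order and normalised
    by their (truncated) sum.
    """
    if not 0.0 < y < 1.0:
        raise ValueError("the parameter y must satisfy 0 < y < 1")
    rng = rng or random.Random()
    points = []
    arrival = 0.0
    for _ in range(num_points):
        # arrival times of a unit-rate Poisson process: sums of Exp(1) variables
        arrival += rng.expovariate(1.0)
        # invert x -> x^(-y) / y to map the arrival time back to a point of P
        points.append((y * arrival) ** (-1.0 / y))
    total = sum(points)
    return [p / total for p in points]
```

Smaller values of $y$ concentrate almost all of the mass on the first few weights, whereas for $y$ close to one the mass is spread over many small weights and the truncation error grows.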

Lemma 2.10 ([38, Proposition 1] and [43, Proposition 6.5.15]).

Suppose that $0<y<1$ and that $(X_{h})_{h\geq 1}$ are positive identically distributed random variables with bounded second moments, mutually independent and independent of ${\boldsymbol{\gamma}}$ . Then

[TABLE]

In the physics literature, the Poisson-Dirichlet distribution has been postulated as the correct distribution of the relative cluster sizes [34, 35]. Moreover, Panchenko and Talagrand [38] used Lemma 2.10 to bound the partition function of the random $k$ -SAT model. We apply Lemma 2.10 in a similar manner to upper bound $\mathbb{E}[\log Z_{\beta}(\mathbb{G})]$ . Specifically, let ${\mathcal{R}}$ be the $\sigma$ -algebra generated by $(\boldsymbol{r}_{i},\boldsymbol{r}_{i,j},\boldsymbol{r}_{i}^{\prime},\boldsymbol{r}_{i}^{\prime\prime},\boldsymbol{D}_{i})_{i,j\geq 1}$ . Thanks to Lemma 2.10, we can simplify the bound from Corollary 2.9 as follows.

Corollary 2.11.

For any $y,\beta>0$ and $\mathfrak{r}\in\mathcal{P}^{3}([q])$ we have

[TABLE]

Proof.

Choose $\varepsilon>0$ small enough and assume that $n$ is sufficiently large. Moreover, for all $h\in\mathbb{N}$ let

[TABLE]

Applying Corollary 2.9 to the random distribution ${\boldsymbol{\gamma}}$ , we obtain

[TABLE]

Hence, Lemma 2.10 yields

[TABLE]

clearly, since $X_{1},X_{1}^{\prime}$ do not depend on $\boldsymbol{P}$ , the outer $\mathbb{E}\left[{\,\cdot\,}\right]$ in (2.13) is on $(\boldsymbol{r}_{i},\boldsymbol{r}_{i,j},\boldsymbol{r}_{i}^{\prime},\boldsymbol{r}_{i}^{\prime\prime},\boldsymbol{D}_{i})_{i\geq 1}$ only. Further, because the ${\boldsymbol{\rho}}_{h,i,j},{\boldsymbol{\rho}}_{h,i}^{\prime},{\boldsymbol{\rho}}_{h,i}^{\prime\prime}$ are mutually independent given ${\mathcal{R}}$ , we obtain

[TABLE]

The assertion follows from (2.13)–(2.15). ∎

2.4. The zero temperature limit

To actually deduce a bound on the chromatic number from Corollary 2.11 we need to fix the three remaining parameters $\beta,y,\mathfrak{r}$ . Since the Potts model approaches the graph colouring problem in the limit of large $\beta$ , it seems natural to take the limit $\beta\to\infty$ . In physics jargon, we take the ‘temperature’ $1/\beta$ to zero. Moreover, physics intuition suggests sending the ‘Parisi parameter’ $y$ to zero as well. Ding et al. [21] took similar limits to derive the upper bound on the $k$ -SAT threshold from the formula for the $k$ -SAT partition function from [38].

With respect to $\mathfrak{r}$ , we make two different choices, depending on whether $\mathbb{G}$ is regular or binomial. Let us begin with the regular case. For $i\in[q]$ , let $\eta_{i}\in\mathcal{P}([q])$ be the atom on colour $i$ . Moreover, let $\eta_{0}=q^{-1}\boldsymbol{1}\in\mathcal{P}([q])$ be the uniform distribution on the $q$ colours. Then for a given $\alpha\in[0,1]$ we define

[TABLE]

Geometrically, we can think of $r_{\alpha}$ as a discrete distribution on the standard simplex $\mathcal{P}([q])\subset\mathbb{R}^{q}$ that places mass $\alpha$ on the centre and distributes the remaining mass $1-\alpha$ equally amongst the $q$ vertices of the simplex. Let
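The geometric description translates directly into a small data structure. The following Python sketch builds $r_{\alpha}$ as a list of (mass, point) pairs exactly as described in words above: mass $\alpha$ on the centre $\eta_{0}$ and mass $(1-\alpha)/q$ on each vertex $\eta_{i}$ . The function names and the list representation are assumptions made for illustration. Exact rational arithmetic makes it easy to confirm that the mean of $r_{\alpha}$ is the uniform distribution for every $\alpha$ .

```python
from fractions import Fraction


def r_alpha(alpha, q):
    """Atoms of r_alpha as (mass, point) pairs; points are probability vectors on q colours."""
    centre = tuple(Fraction(1, q) for _ in range(q))  # eta_0, the uniform distribution
    atoms = [(alpha, centre)]
    for i in range(q):
        # eta_{i+1}: the atom on a single colour, i.e. a vertex of the simplex
        vertex = tuple(Fraction(int(j == i)) for j in range(q))
        atoms.append(((1 - alpha) / q, vertex))
    return atoms


def mean_point(atoms, q):
    """Barycentre of the discrete distribution given by atoms."""
    return tuple(sum(mass * point[c] for mass, point in atoms) for c in range(q))
```

For instance, `mean_point(r_alpha(Fraction(1, 3), 4), 4)` has all four coordinates equal to `Fraction(1, 4)` , in line with the remark that the overall marginal of each vertex colour is uniform even though individual clusters may be strongly polarised.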

[TABLE]

be the atom on $r_{\alpha}$ . Further, the expression (1.10) for the binomial random graph involves a probability distribution $\mathfrak{a}$ on $[0,1]$ . Given any $\mathfrak{a}\in\mathcal{P}([0,1])$ , we define

[TABLE]

Observe that the integrand is the distribution $\mathfrak{r}_{\alpha}\in\mathcal{P}^{3}([q])$ from (2.17), and thus $\mathfrak{r}_{\mathfrak{a}}\in\mathcal{P}^{3}([q])$ . Plugging $\mathfrak{r}_{\alpha}$ or $\mathfrak{r}_{\mathfrak{a}}$ into Corollary 2.11, we finally obtain the expressions from (1.8) and (1.10).

Proposition 2.12.

If $\mathbb{G}=\mathbb{G}(n,d)$ is the random regular graph then

[TABLE]

Moreover, if $\mathbb{G}=\mathbb{G}(n,d/n)$ is the binomial model then

[TABLE]

The proof of Proposition 2.12 can be found in Section 4.4.

Now we have all the pieces in place to complete the proofs of the main theorems.

Proof of Theorem 1.1.

Fix $q\geq 3$ and assume that $\Sigma_{d,q}<0$ for some $d\geq 3$ . (This holds when $d=d_{q}$ , for example.) Then Proposition 2.12 yields $y,\beta>0$ and $\alpha\in[0,1]$ such that $\phi_{\beta,y}(\mathfrak{r}_{\alpha})<0$ . In particular we can take $\alpha$ to be the value which minimises $\Sigma_{d,q}(\cdot)$ . Consequently, Corollary 2.11 implies that $\limsup_{n\to\infty}\frac{1}{n}\mathbb{E}[\log Z_{\beta}(\mathbb{G})]<0$ . Therefore, Corollary 2.3 implies that

[TABLE]

We are left to prove that $\mathbb{G}(n,d^{\prime})$ also fails to be $q$ -colourable w.h.p. for all $d^{\prime}>d$ . To see this, we observe that the property of being $q$ -colourable is decreasing; that is, if a graph $G$ is $q$ -colourable then so is every subgraph $G^{\prime}$ of $G$ . Now, [26, Theorem 9.36] shows that if $d^{\prime}>d$ and if a decreasing property $\mathcal{A}$ is satisfied for $\mathbb{G}(n,d^{\prime})$ w.h.p. then $\mathbb{G}(n,d)$ enjoys $\mathcal{A}$ w.h.p., too. Thus, (2.19) implies that $\chi(\mathbb{G}(n,d^{\prime}))>q$ w.h.p. for all $d^{\prime}>d$ . ∎

Proof of Theorem 1.3.

Once more we fix $q\geq 3$ and suppose that $\Sigma^{*}_{d,q}<0$ for some $d>0$ . (This holds when $d=d_{q}^{*}$ , for example.) Then by Proposition 2.12 there exist $y,\beta>0$ and $\mathfrak{a}\in\mathcal{P}([0,1])$ such that $\phi_{\beta,y}(\mathfrak{r}_{\mathfrak{a}})<0$ and thus $\limsup_{n\to\infty}\frac{1}{n}\mathbb{E}[\log Z_{\beta}(\mathbb{G})]<0$ by Corollary 2.11. Hence, Corollary 2.3 yields

[TABLE]

Finally, due to monotonicity, (2.20) implies that $\chi(\mathbb{G}(n,d^{\prime}/n))>q$ w.h.p. for all $d^{\prime}>d$ . ∎

Given Theorems 1.1 and 1.3 the asymptotic formulas detailed in Corollary 1.2 and Corollary 1.4 follow from routine calculations, which we defer to Section 5.

3. Concentration

After proving Proposition 2.1 in Section 3.1, we prove Corollary 2.2 in Section 3.2.

3.1. Proof of Proposition 2.1

The proof is based on Azuma’s inequality and the Lipschitz property of the random variable $\log Z_{\beta}(\,\cdot\,)$ . Indeed, (2.2) shows that if a multigraph $G^{\prime}$ is obtained from $G$ by adding one single edge then $e^{-\beta}\leq Z_{\beta}(G^{\prime})/Z_{\beta}(G)\leq 1$ , and hence

$$\left|\log Z_{\beta}(G^{\prime})-\log Z_{\beta}(G)\right|\leq\beta.\tag{3.1}$$
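The Lipschitz property is easy to confirm by exhaustive enumeration on a small multigraph. The sketch below assumes the Potts partition function in the form $Z_{\beta}(G)=\sum_{\sigma}\prod_{vw\in E(G)}\exp(-\beta\boldsymbol{1}\{\sigma_{v}=\sigma_{w}\})$ , which matches the description of the edge weights given in Section 2.2; the function name and the 0-based vertex labels are hypothetical.

```python
import itertools
import math


def potts_partition_function(n, edges, q, beta):
    """Brute-force Z_beta of a multigraph on vertices 0..n-1 with q colours.

    Each monochromatic edge contributes a factor exp(-beta); repeated edges
    in the list count with multiplicity.
    """
    total = 0.0
    for sigma in itertools.product(range(q), repeat=n):
        monochromatic = sum(1 for v, w in edges if sigma[v] == sigma[w])
        total += math.exp(-beta * monochromatic)
    return total
```

Adding an edge multiplies every summand by either $e^{-\beta}$ or $1$ , so the ratio of the two partition functions lies in $[e^{-\beta},1]$ term by term. For example, comparing the path on four vertices with the 4-cycle obtained by adding the edge `(0, 3)` gives a log-difference of absolute value at most `beta`.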

We pick a small enough $\zeta=\zeta(\varepsilon,\delta,\beta)>0$ and a smaller $\xi=\xi(\varepsilon,\delta,\beta,\zeta)>0$ . We treat the binomial random graph and the random regular graph separately, tacitly assuming in either case that $n$ is sufficiently large.

3.1.1. The binomial random graph

Writing $\boldsymbol{M}\sim{\rm Bin}(\binom{n}{2},d/n)$ for the number of edges of $\mathbb{G}$ and invoking the Chernoff bound, we obtain

[TABLE]

Further, let $\boldsymbol{G}_{n,m}$ be the random multigraph on $n$ vertices comprising $m$ edges chosen uniformly and independently out of all ${\binom{n}{2}}$ possible edges. Let $\mathcal{S}$ be the event that $\boldsymbol{G}_{n,m}$ is simple. It is well known that

[TABLE]

Moreover, providing $\xi$ is chosen small enough, Azuma’s inequality and (3.1) imply that

[TABLE]
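To see why a bound of this type decays exponentially in $n$ , it helps to evaluate the generic Azuma-Hoeffding estimate numerically. The sketch below is not the paper's exact inequality; it assumes that the $m$ edges of $\boldsymbol{G}_{n,m}$ are revealed one at a time and that resampling a single edge changes $\log Z_{\beta}$ by at most $\beta$ (removing an edge raises it by at most $\beta$ , adding one lowers it by at most $\beta$ ). The function names are hypothetical.

```python
import math


def azuma_tail_bound(num_steps, lipschitz, deviation):
    """Azuma-Hoeffding bound 2 * exp(-deviation^2 / (2 * num_steps * lipschitz^2)).

    Bounds P[|X - E X| >= deviation] for a martingale with num_steps increments,
    each bounded in absolute value by lipschitz.
    """
    return 2.0 * math.exp(-deviation ** 2 / (2.0 * num_steps * lipschitz ** 2))


def log_z_concentration_bound(n, m, beta, xi):
    """Bound on P[|log Z - E log Z| >= xi * n] for m independently chosen edges."""
    return azuma_tail_bound(num_steps=m, lipschitz=beta, deviation=xi * n)
```

With $m$ linear in $n$ the exponent is of order $-\xi^{2}n/(2(m/n)\beta^{2})$ , that is, linear in $n$ . For instance, `log_z_concentration_bound(1000, 2000, 1.0, 0.5)` evaluates to `2 * exp(-62.5)` .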

The estimates (3.3)–(3.4) imply that for all $m\leq dn/2+\zeta n$ ,

[TABLE]

Since

[TABLE]

the bound (3.5) shows that for all $m\leq dn/2+\zeta n$ ,

[TABLE]

Further, because $\mathbb{G}\mid(\boldsymbol{M}=m)$ and $\boldsymbol{G}_{n,m}\mid\mathcal{S}$ are identically distributed, (3.5) and (3.7) show that

[TABLE]

Moreover, combining (3.2), (3.6) and (3.8), we obtain (2.4).

To prove the second assertion, we recall that $\boldsymbol{m}\sim{\rm Po}_{\leq dn/2}((1-\varepsilon)dn/2)$ . We thus obtain the tail bound

[TABLE]

for sufficiently small $\xi$ . Since $\boldsymbol{G}\mid(\boldsymbol{m}=m)$ and $\boldsymbol{G}_{n,m}$ are identically distributed, (3.4) yields

[TABLE]

Finally, providing that $\zeta$ is chosen small enough, (3.1) and (3.9) imply that

[TABLE]

Therefore, the second part of (2.4) follows from (3.9) and (3.10).

3.1.2. The random regular graph

We recall that the random regular graph $\mathbb{G}$ can be constructed via the configuration model by drawing a perfect matching $\boldsymbol{e}_{1},\ldots,\boldsymbol{e}_{dn/2}$ of the complete graph on the vertex set $V_{n}\times[d]$ uniformly at random. To be precise, the sequence $\boldsymbol{e}_{1},\ldots,\boldsymbol{e}_{dn/2}$ is constructed by successively drawing a uniformly random edge $\boldsymbol{e}_{i+1}$ that connects two distinct vertices of the complete graph on $V_{n}\times[d]$ that are not incident with $\boldsymbol{e}_{1},\ldots,\boldsymbol{e}_{i}$ . Let $\boldsymbol{G}^{\prime}$ be the random multigraph on $[n]$ obtained by inserting for each matching edge $\boldsymbol{e}_{i}=\{(v,h),(w,j)\}$ an edge between $v$ and $w$ and let $\mathcal{S}$ denote the event that $\boldsymbol{G}^{\prime}$ is simple. It is well known that

[TABLE]

see, e.g., [26, Corollary 9.7]. Moreover, $\mathbb{G}$ is distributed as $\boldsymbol{G}^{\prime}$ given $\mathcal{S}$ .
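The configuration model described above is straightforward to simulate. In the sketch below, shuffling the clones $V_{n}\times[d]$ and pairing consecutive entries yields a uniformly random perfect matching, which has the same distribution as the sequential construction of $\boldsymbol{e}_{1},\ldots,\boldsymbol{e}_{dn/2}$ ; the function names and 0-based labels are assumptions made for illustration.

```python
import random


def configuration_model(n, d, rng=None):
    """Return (matching, edges) for a random d-regular multigraph on 0..n-1.

    matching lists pairs of clones (v, h); edges lists the projected
    multigraph edges (v, w), possibly including loops and repeated edges.
    """
    if (n * d) % 2:
        raise ValueError("n * d must be even")
    rng = rng or random.Random()
    clones = [(v, h) for v in range(n) for h in range(d)]
    rng.shuffle(clones)
    # consecutive entries of a uniform shuffle form a uniform perfect matching
    matching = [(clones[2 * i], clones[2 * i + 1]) for i in range(len(clones) // 2)]
    edges = [(a[0], b[0]) for a, b in matching]
    return matching, edges


def is_simple(edges):
    """True if the multigraph has neither loops nor repeated edges (the event S)."""
    seen = set()
    for v, w in edges:
        key = (min(v, w), max(v, w))
        if v == w or key in seen:
            return False
        seen.add(key)
    return True
```

Rejection sampling, that is, calling `configuration_model` until `is_simple(edges)` holds, produces a sample of $\mathbb{G}$ , since $\mathbb{G}$ is distributed as $\boldsymbol{G}^{\prime}$ given $\mathcal{S}$ .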

To prove the first inequality we consider the filtration $({\mathcal{E}}_{t})_{t\in[dn/2]}$ with ${\mathcal{E}}_{t}$ generated by $\boldsymbol{e}_{1},\ldots,\boldsymbol{e}_{t}$ . Then the sequence $(\mathbb{E}\left[{\log Z_{\beta}(\boldsymbol{G}^{\prime})\mid{\mathcal{E}}_{t}}\right])_{t\in[dn/2]}$ is a Doob martingale. Moreover, (3.1) implies that

[TABLE]

Therefore, Azuma’s inequality yields

[TABLE]

The first assertion thus follows from (3.11) and (3.13). Further, we can think of $\boldsymbol{G}$ as the multigraph obtained by inserting the edges induced by $\boldsymbol{e}_{1},\ldots,\boldsymbol{e}_{\boldsymbol{m}}$ only. Hence, arguing as for (3.13) but stopping after $\boldsymbol{m}$ steps gives

[TABLE]

Finally, the second assertion follows from (3.1), (3.9) and (3.14).

3.2. Proof of Corollary 2.2

Given $\delta,\beta>0$ we choose small enough $\varepsilon=\varepsilon(\delta,\beta)>0$ , $\zeta=\zeta(\delta,\beta,\varepsilon)$ , $\xi=\xi(\delta,\beta,\varepsilon,\zeta)$ and assume that $n$ is sufficiently large. Once more we treat the binomial and the regular models separately.

3.2.1. The binomial random graph

We continue to denote the total number of edges of the binomial graph $\mathbb{G}=\mathbb{G}(n,d/n)$ by $\boldsymbol{M}$ and by $\boldsymbol{G}_{n,\boldsymbol{M}}$ the random multigraph obtained by including $\boldsymbol{M}$ uniformly and independently chosen edges. Due to (3.2) and (3.9), with probability $1-\exp(-\Omega(n))$ , we can obtain $\boldsymbol{G}_{n,\boldsymbol{M}}$ from $\boldsymbol{G}$ by adding or removing no more than $2\varepsilon dn$ edges. Hence, provided $\varepsilon$ is small enough, (3.1) ensures that

[TABLE]

Furthermore, with $\mathcal{S}$ the event that $\boldsymbol{G}_{n,\boldsymbol{M}}$ is simple, $\mathbb{G}$ is distributed as $\boldsymbol{G}_{n,\boldsymbol{M}}$ given $\mathcal{S}$ . Therefore, (3.3) and (3.4) imply that

[TABLE]

Finally, the assertion follows from (3.15) and (3.16).

3.2.2. The random regular graph

As in Section 3.1.2 we denote by $\boldsymbol{G}^{\prime}$ the random multigraph with $dn/2$ edges drawn from the configuration model. By the principle of deferred decisions we can think of $\boldsymbol{G}^{\prime}$ as being obtained from $\boldsymbol{G}$ by adding the missing $dn/2-\boldsymbol{m}$ edges. Hence, provided that $\varepsilon$ is sufficiently small, (3.1) implies that

[TABLE]

Furthermore, as $\mathbb{G}$ is distributed as $\boldsymbol{G}^{\prime}$ given the event $\mathcal{S}$ , (3.11) and (3.13) yield

[TABLE]

The assertion follows from (3.17) and (3.18).

4. Interpolation

In this section we carry out the technical details of the interpolation argument. Section 4.1 contains the proof of Proposition 2.7 while Section 4.2 deals with the proof of Proposition 2.8. Subsequently, Section 4.3 contains the proof of Corollary 2.9 and finally, in Section 4.4 we prove Proposition 2.12.

4.1. Proof of Proposition 2.7

A glimpse at (2.7) reveals that the random factor graph $\boldsymbol{G}_{0}$ consists of constraint nodes $e_{1},\ldots,e_{\boldsymbol{m}_{0}}$ and $b_{1},\ldots,b_{\boldsymbol{m}_{0}^{\prime\prime}}$ only. (See also the left side of Figure 2.) The constraints $e_{1},\ldots,e_{\boldsymbol{m}_{0}}$ are adjacent to the variables $V_{n}$ but not to $s$ , while $b_{1},\ldots,b_{\boldsymbol{m}_{0}^{\prime\prime}}$ are adjacent to $s$ but not to $V_{n}$ . Consequently, the partition function factorises:

[TABLE]

Hence

[TABLE]

and by construction we have

[TABLE]

Additionally, $Y^{\prime}$ is distributed as $\mathcal{Y}$ given $\boldsymbol{m}_{0}^{\prime\prime}=\lfloor dn/2\rfloor$ . Hence, since $\mathbb{P}[\boldsymbol{m}_{0}^{\prime\prime}>dn/2]=e^{-\Omega(n)}$ , we can couple $\mathcal{Y}$ and $Y^{\prime}$ such that

[TABLE]

Since for any $s\in\mathbb{N}$ we have $\exp(-\beta)\leq\psi_{b_{i}}(s)\leq 1$ for all $i$ , (2.7), (4.3) and Poisson tail bounds give

[TABLE]

Combining (4.1), (4.2) and (4.4), we obtain

[TABLE]

Finally, the assertion follows from (4.5) and Corollary 2.2.

4.2. Proof of Proposition 2.8

We begin by defining a set $\mathcal{C}_{t}$ of variable nodes of $\boldsymbol{G}_{t}$ , along with a probability distribution $P_{t}$ on $\mathcal{C}_{t}$ . In the binomial case ( $\mathbb{G}$ is the binomial random graph) let $\mathcal{C}_{t}=V_{n}$ and let $P_{t}$ be the uniform distribution on $\mathcal{C}_{t}$ . In the regular case ( $\mathbb{G}$ is the random regular graph) let $\mathcal{C}_{t}$ be the set of all vertices $v\in V_{n}$ of degree $d_{\boldsymbol{G}_{t}}(v)$ strictly less than $d$ in $\boldsymbol{G}_{t}$ , and providing that $\mathcal{C}_{t}\neq\emptyset$ we define, for all $v\in\mathcal{C}_{t}$ ,

[TABLE]

In both the binomial and the regular case we refer to the elements of $\mathcal{C}_{t}$ as cavities. Assuming that $\mathcal{C}_{t}\neq\emptyset$ , we denote by $\boldsymbol{c}_{1},\boldsymbol{c}_{1}^{\prime},\boldsymbol{c}_{2},\boldsymbol{c}_{2}^{\prime},\ldots\in\mathcal{C}_{t}$ cavities drawn independently from $P_{t}$ . Note that $\mathbb{P}(\mathcal{C}_{t}=\emptyset)=e^{-\Omega(n)}$ .

The proof of Proposition 2.8 relies on coupling arguments. Specifically, we will couple $\boldsymbol{G}_{t}$ with three random factor graphs obtained by adding one more constraint of each of the three types of constraints:

• assuming that $2\boldsymbol{m}_{t}+\boldsymbol{m}_{t}^{\prime}\leq dn-2$ , we obtain $\boldsymbol{G}_{t}^{\prime}$ from $\boldsymbol{G}_{t}$ by adding one more constraint $e_{\boldsymbol{m}_{t}+1}$ as per Definition 2.5 or 2.6, respectively; if $2\boldsymbol{m}_{t}+\boldsymbol{m}_{t}^{\prime}>dn-2$ then we let $\boldsymbol{G}_{t}^{\prime}=\boldsymbol{G}_{t}$ .

• assuming that $2\boldsymbol{m}_{t}+\boldsymbol{m}_{t}^{\prime}<dn$ , we obtain $\boldsymbol{G}_{t}^{\prime\prime}$ from $\boldsymbol{G}_{t}$ by adding one more constraint $a_{\boldsymbol{m}_{t}^{\prime}+1}$ in accordance with Definition 2.5 or 2.6, respectively; if $2\boldsymbol{m}_{t}+\boldsymbol{m}_{t}^{\prime}=dn$ then we let $\boldsymbol{G}_{t}^{\prime\prime}=\boldsymbol{G}_{t}$ .

• finally, we obtain $\boldsymbol{G}_{t}^{\prime\prime\prime}$ from $\boldsymbol{G}_{t}$ by adding one more constraint $b_{\boldsymbol{m}_{t}^{\prime\prime}+1}$ .

The following lemma expresses the derivative of $\mathbb{E}\left[{\log Z_{\beta}(\boldsymbol{G}_{t})}\right]$ in terms of these three enhanced factor graphs. Let us observe for future reference that

[TABLE]

which follows from the fact that $\boldsymbol{G}_{t}$ has at most $1+\boldsymbol{m}_{t}+\boldsymbol{m}_{t}^{\prime}+\boldsymbol{m}_{t}^{\prime\prime}\leq dn+1$ constraint nodes, and that the weight functions of the constraint nodes $e_{i},a_{i},b_{i}$ satisfy $\exp(-\beta)\leq\psi_{e_{i}},\psi_{a_{i}},\psi_{b_{i}}\leq 1$ .

Lemma 4.1.

We have

[TABLE]

Proof.

Recalling that $\boldsymbol{m}_{t},\boldsymbol{m}_{t}^{\prime},\boldsymbol{m}_{t}^{\prime\prime}$ are distributed as the independent Poisson variables $\boldsymbol{M}_{t},\boldsymbol{M}_{t}^{\prime},\boldsymbol{M}_{t}^{\prime\prime}$ from (2.7) given the event $\mathcal{M}=\{2\boldsymbol{M}_{t}+\boldsymbol{M}_{t}^{\prime}\leq dn,\,\boldsymbol{M}_{t}+\boldsymbol{M}_{t}^{\prime}+\boldsymbol{M}_{t}^{\prime\prime}\leq dn\}$ from (2.8), we see that

[TABLE]

The conditional expectation on the right hand side is independent of $t$ . But the means of $\boldsymbol{M}_{t},\boldsymbol{M}_{t}^{\prime},\boldsymbol{M}_{t}^{\prime\prime}$ are governed by $t$ . Hence, we need to differentiate $\mathbb{P}\left[{\boldsymbol{M}_{t}=m,\,\boldsymbol{M}_{t}^{\prime}=m^{\prime},\,\boldsymbol{M}_{t}^{\prime\prime}=m^{\prime\prime}\mid\mathcal{M}}\right]$ . For $(m,m^{\prime},m^{\prime\prime})\in\mathcal{M}$ we obtain

[TABLE]

The product rule yields

[TABLE]

which simplifies to

[TABLE]

Moreover, differentiating $-\mathbb{P}[\mathcal{M}]=\mathbb{P}[\mathcal{M}^{c}]-1$ gives

[TABLE]

Combining (4.8)–(4.12) and using $\mathbb{P}\left[{\mathcal{M}}\right]=1-\exp(-\Omega(n))$ , we obtain

[TABLE]

By the principle of deferred decisions, if $(m,m^{\prime},m^{\prime\prime})\in\mathcal{M}$ then we can think of $\boldsymbol{G}_{t}$ given $\boldsymbol{m}_{t}=m,\boldsymbol{m}_{t}^{\prime}=m^{\prime},\boldsymbol{m}_{t}^{\prime\prime}=m^{\prime\prime}$ as resulting from $\boldsymbol{G}_{t}$ given $\boldsymbol{m}_{t}=m-1,\boldsymbol{m}_{t}^{\prime}=m^{\prime},\boldsymbol{m}_{t}^{\prime\prime}=m^{\prime\prime}$ via the insertion of one more constraint $e_{\boldsymbol{m}_{t}}$ . Therefore,

[TABLE]

The definition (2.7) of the Poisson variables ensures that $\mathbb{P}[(\boldsymbol{m}_{t}+1,\boldsymbol{m}_{t}^{\prime},\boldsymbol{m}_{t}^{\prime\prime})\in\mathcal{M}]=1-\exp(-\Omega(n))$ . Hence, (4.6) and (4.14) yield

[TABLE]

Similarly,

[TABLE]

Thus, the assertion follows from (4.13), (4.15), (4.16) and (4.17). ∎
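The differentiation step in the proof above rests on a standard fact about Poisson variables: the derivative of $\mathbb{P}[{\rm Po}(\lambda)=m]$ with respect to $\lambda$ equals $\mathbb{P}[{\rm Po}(\lambda)=m-1]-\mathbb{P}[{\rm Po}(\lambda)=m]$ , so that differentiating an expectation amounts to inserting one more constraint. The following Python sketch checks this numerically for a single unconditioned Poisson variable; the conditioning on $\mathcal{M}$ is ignored, and the function names and the test function `f` are illustrative assumptions, not objects from the paper.

```python
import math

def poisson_pmf(m, lam):
    """P[Po(lam) = m], evaluated in log-space; zero for negative m."""
    if m < 0:
        return 0.0
    return math.exp(-lam + m * math.log(lam) - math.lgamma(m + 1))

def expectation(f, lam, cutoff=400):
    """E[f(M)] for M ~ Po(lam), truncated at `cutoff` (negligible mass beyond it for small lam)."""
    return sum(f(m) * poisson_pmf(m, lam) for m in range(cutoff))

def derivative_by_difference(f, lam, h=1e-5):
    """Central finite difference of lam -> E[f(Po(lam))]."""
    return (expectation(f, lam + h) - expectation(f, lam - h)) / (2 * h)

def derivative_by_insertion(f, lam):
    """E[f(M + 1) - f(M)]: the expected effect of adding one more unit to M."""
    return expectation(lambda m: f(m + 1) - f(m), lam)

# Hypothetical test function standing in for m -> E[log Z | M = m].
f = lambda m: math.log(1 + m)
lam = 7.5
numeric = derivative_by_difference(f, lam)
exact = derivative_by_insertion(f, lam)
```

Here `numeric` and `exact` agree up to the finite-difference error, which mirrors how the derivative of $\mathbb{E}[\log Z_{\beta}(\boldsymbol{G}_{t})]$ is expressed through the enhanced factor graphs.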

Let $\mathfrak{C}$ be the event that $|\mathcal{C}_{t}|\geq n^{2/3}$ . The choice of the parameters (2.7) ensures that

[TABLE]

We proceed to calculate the three expressions on the r.h.s. of (4.7). Recall the function $\psi_{\boldsymbol{G}_{t}}$ and the Boltzmann distribution $\mu_{\boldsymbol{G}_{t}}$ which correspond to $\boldsymbol{G}_{t}$ , defined as in (1.13) and (1.15) with $\boldsymbol{\gamma}$ from (2.11). Also recall the bracket notation from (1.12).

Lemma 4.2.

We have

[TABLE]

Proof.

Since (4.6) shows that $\log Z_{\beta}(\boldsymbol{G}_{t}),\log Z_{\beta}(\boldsymbol{G}_{t}^{\prime})=O(n)$ , (4.18) implies that

[TABLE]

Moreover, conditioned on the event $\mathfrak{C}$ , the factor graph $\boldsymbol{G}_{t}^{\prime}$ results from $\boldsymbol{G}_{t}$ via the addition of a single constraint $e_{\boldsymbol{m}_{t}+1}$ . Denoting by $\boldsymbol{u},\boldsymbol{v}$ the variable nodes that $e_{\boldsymbol{m}_{t}+1}$ joins, we obtain

[TABLE]

(Here the sum is over all $\sigma=(\sigma_{s},\sigma_{v_{1}},\ldots,\sigma_{v_{n}})\in\mathbb{N}\times[q]^{n}$ , recalling that $V_{n}=\{v_{1},\ldots,v_{n}\}$ .) In particular, since $\exp(-\beta)\leq\psi_{e_{\boldsymbol{m}_{t}+1}}(\boldsymbol{\sigma})\leq 1$ , we have $-\beta\leq\log Z_{\beta}(\boldsymbol{G}_{t}^{\prime})-\log Z_{\beta}(\boldsymbol{G}_{t})\leq 0$ . Further, conditioned on $\mathfrak{C}$ , the probability that two cavities $\boldsymbol{c}_{1},\boldsymbol{c}_{2}$ chosen independently with distribution $P_{t}$ coincide is $o(1)$ . Hence, recalling the construction of the probability distribution $P_{t}$ on the set $\mathcal{C}_{t}$ of cavities, we notice that the distribution of the pair $(\boldsymbol{u},\boldsymbol{v})$ and the distribution of the pair $(\boldsymbol{c}_{1},\boldsymbol{c}_{2})$ have total variation distance $o(1)$ . Consequently, (4.20) yields

[TABLE]

The assertion follows from (4.19) and (4.21). ∎
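The key identity in this proof is that adding one constraint multiplies the partition function by the Boltzmann average of the new weight function, so the change in $\log Z_{\beta}$ lies in $[-\beta,0]$ . A minimal brute-force sketch for a plain Potts antiferromagnet on a small graph illustrates this. It is a simplification: the extra coordinate $\sigma_{s}$ and the unary constraints of the interpolation scheme are omitted, and the graph, $q$ and $\beta$ below are arbitrary illustrative choices.

```python
import itertools
import math

def weight(sigma, edges, beta):
    """Product of edge weights: exp(-beta) for a monochromatic edge, 1 otherwise."""
    return math.prod(
        math.exp(-beta) if sigma[u] == sigma[v] else 1.0 for u, v in edges
    )

def partition_function(n, q, edges, beta):
    """Brute-force sum of the weight over all q^n colour assignments."""
    return sum(
        weight(sigma, edges, beta)
        for sigma in itertools.product(range(q), repeat=n)
    )

def mean_new_edge_weight(n, q, edges, new_edge, beta):
    """Boltzmann average (w.r.t. the old graph) of the weight of the new edge."""
    z = partition_function(n, q, edges, beta)
    total = sum(
        weight(sigma, edges, beta) * weight(sigma, [new_edge], beta)
        for sigma in itertools.product(range(q), repeat=n)
    )
    return total / z

# Illustrative instance: a 5-cycle with q = 3 colours, plus the chord (0, 2).
n, q, beta = 5, 3, 2.0
cycle = [(i, (i + 1) % n) for i in range(n)]
log_ratio = math.log(partition_function(n, q, cycle + [(0, 2)], beta)) - math.log(
    partition_function(n, q, cycle, beta)
)
bracket = mean_new_edge_weight(n, q, cycle, (0, 2), beta)
```

In this toy setting `log_ratio` coincides with `math.log(bracket)`, and since the new weight takes values in $[\exp(-\beta),1]$ the difference is confined to $[-\beta,0]$ .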

Lemma 4.3.

We have

[TABLE]

Proof.

Just as in the proof of Lemma 4.2 we have

[TABLE]

Denote by $\boldsymbol{v}\in V_{n}$ the variable node adjacent to the new constraint $a_{\boldsymbol{m}_{t}^{\prime}+1}$ of $\boldsymbol{G}_{t}^{\prime\prime}$ . Then conditioned on $\mathfrak{C}$ we have

[TABLE]

By construction, the variable node $\boldsymbol{v}$ is distributed according to $P_{t}$ , the law of $\boldsymbol{c}_{1}$ . Hence,

[TABLE]

Combining (4.22) and (4.23) completes the proof. ∎

Lemma 4.4.

We have

[TABLE]

Proof.

This follows from similar manipulations as in the proofs of Lemmas 4.2 and 4.3. ∎

Proof of Proposition 2.8.

Let

[TABLE]

Combining Lemmas 4.1–4.4, we see that

[TABLE]

We are going to show that $\Delta_{\ell}\geq 0$ for all $\ell\geq 1$ ; then the assertion follows from (4.24).

Thus, fix $\ell\geq 1$ and let $\boldsymbol{\sigma}^{(1)},\boldsymbol{\sigma}^{(2)},\ldots,\boldsymbol{\sigma}^{(\ell)}$ denote independent samples from $\mu_{\boldsymbol{G}_{t}}$ . Since the expectation of the product of independent random variables equals the product of their expectations, we can rewrite $\Delta_{\ell}$ as

[TABLE]

To simplify the last expression, we introduce for $\tau\in[q]^{\ell}$ ,

[TABLE]

Further, recall the family $(\hat{\boldsymbol{\rho}}_{i})_{i\geq 1}$ of distributions $\hat{\boldsymbol{\rho}}_{i}\in\mathcal{P}([q])$ drawn from $\hat{\boldsymbol{r}}\in\mathcal{P}^{2}([q])$ from (2.6). Writing $\mathbb{E}^{\prime}$ for the expectation over $(\hat{\boldsymbol{\rho}}_{i})_{i\geq 1}$ only, let

[TABLE]

Since $\boldsymbol{c}_{1},\boldsymbol{c}_{2}$ and $({\boldsymbol{\rho}}_{s,\boldsymbol{m}_{t}^{\prime}+1},{\boldsymbol{\rho}}_{s,\boldsymbol{m}_{t}^{\prime\prime}+1}^{\prime},{\boldsymbol{\rho}}_{s,\boldsymbol{m}_{t}^{\prime\prime}+1}^{\prime\prime})_{s\geq 1}$ in (4.25) are mutually independent, we can interchange the order in which expectations are taken and rewrite (4.25) as

[TABLE]

Finally, the assertion follows from (4.24) and (4.26). ∎

4.3. Proof of Corollary 2.9

We begin by estimating the partition function of $\boldsymbol{G}_{1}$ .

Lemma 4.5.

For any $\delta>0$ there is $\varepsilon>0$ such that for all large enough $n$ we have $\mathbb{E}[\log Z(\boldsymbol{G}_{1})]\leq\mathbb{E}[\log Y]+\delta n$ .

Proof.

Let $\boldsymbol{d}_{1},\boldsymbol{d}_{2},\ldots,\boldsymbol{d}_{n}$ denote the degrees of the variable nodes $v_{1},\ldots,v_{n}$ in $\boldsymbol{G}_{1}$ . Each of the constraints $a_{j}$ is adjacent to only one of the variable nodes from $V_{n}$ . For each $v_{i}$ , suppose the constraints $a_{i_{1}},\ldots,a_{i_{\boldsymbol{d}_{i}}}$ are adjacent to $v_{i}$ and let ${\boldsymbol{\rho}}_{\sigma_{s},i,h}$ denote the distribution associated with $a_{i_{h}}$ , for $h\in[\boldsymbol{d}_{i}]$ . (In the definition of the interpolation scheme, this distribution is denoted ${\boldsymbol{\rho}}_{\sigma_{s},i_{h}}$ .) Then we can write

[TABLE]

Suppose first that $\mathbb{G}$ is the binomial random graph and let $\boldsymbol{d}_{1}^{\prime},\ldots,\boldsymbol{d}_{n}^{\prime}\sim{\rm Po}((1-\varepsilon)d)$ be independent random variables. The construction of $\boldsymbol{G}_{1}$ ensures that $(\boldsymbol{d}_{1},\ldots,\boldsymbol{d}_{n})$ is distributed as $(\boldsymbol{d}_{1}^{\prime},\ldots,\boldsymbol{d}_{n}^{\prime})$ given $\boldsymbol{d}_{1}^{\prime}+\cdots+\boldsymbol{d}_{n}^{\prime}\leq dn$ . Since this event occurs with probability $1-\exp(-\Omega(n))$ , we conclude that $d_{\mathrm{TV}}((\boldsymbol{d}_{1},\ldots,\boldsymbol{d}_{n}),(\boldsymbol{d}_{1}^{\prime},\ldots,\boldsymbol{d}_{n}^{\prime}))=\exp(-\Omega(n))$ . Therefore, (4.27) yields

[TABLE]

To compare this last expression with $Y$ from (2.9), let $\boldsymbol{\Delta}_{i}\sim{\rm Po}(\varepsilon d)$ be independent random variables for $i\in[n]$ . Then we can couple the $\boldsymbol{D}_{i}$ from (2.9) and the $\boldsymbol{d}_{i}^{\prime}$ from (4.28) by letting $\boldsymbol{D}_{i}=\boldsymbol{d}_{i}^{\prime}+\boldsymbol{\Delta}_{i}$ . Thus, since each factor in (4.28) lies in the interval $[\exp(-\beta),1]$ , we obtain the estimate

[TABLE]

whence the assertion follows. Second, if $\mathbb{G}$ is the random regular graph then $\boldsymbol{D}_{i}=d$ deterministically. Hence, letting $\boldsymbol{\Delta}_{i}=d-\boldsymbol{d}_{i}$ , we obtain (4.29) in this case as well. ∎
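The coupling $\boldsymbol{D}_{i}=\boldsymbol{d}_{i}^{\prime}+\boldsymbol{\Delta}_{i}$ in the binomial case uses the superposition property of Poisson variables: the sum of independent ${\rm Po}((1-\varepsilon)d)$ and ${\rm Po}(\varepsilon d)$ variables has distribution ${\rm Po}(d)$ . The sketch below checks this by convolving the two probability mass functions; the helper names and the parameter values are illustrative assumptions.

```python
import math

def poisson_pmf(k, mean):
    """P[Po(mean) = k] for mean > 0, evaluated in log-space; zero for negative k."""
    if k < 0:
        return 0.0
    return math.exp(-mean + k * math.log(mean) - math.lgamma(k + 1))

def convolved_pmf(k, d, eps):
    """P[d' + Delta = k] for independent d' ~ Po((1 - eps) d) and Delta ~ Po(eps d)."""
    return sum(
        poisson_pmf(j, (1 - eps) * d) * poisson_pmf(k - j, eps * d)
        for j in range(k + 1)
    )

# Illustrative parameters: average degree d = 5 and eps = 0.1.
d, eps = 5.0, 0.1
max_gap = max(abs(convolved_pmf(k, d, eps) - poisson_pmf(k, d)) for k in range(30))
```

Under this coupling each variable node carries $\varepsilon d$ extra factors on average, and since each factor lies in $[\exp(-\beta),1]$ , every extra factor changes the logarithm by at most $\beta$ .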

Proof of Corollary 2.9.

The corollary follows from Proposition 2.7, Proposition 2.8 and Lemma 4.5 by taking $\delta$ to zero. ∎

4.4. Proof of Proposition 2.12

We will calculate the limits of the two terms appearing in (2.11) separately. To facilitate a unified treatment, let $\mathfrak{a}\in\mathcal{P}([0,1])$ be the given probability distribution in the binomial case and let $\mathfrak{a}=\delta_{\alpha}$ for $\alpha\in[0,1]$ in the case of the random regular graph. Also let $(\boldsymbol{\alpha}_{i})_{i\geq 1}$ be independent samples from $\mathfrak{a}$ .

Lemma 4.6.

We have

[TABLE]

Proof.

For $c\in[q]$ let $U_{c}=\{\forall h\in[\boldsymbol{D}_{1}]:{\boldsymbol{\rho}}_{1,1,h}\neq\delta_{c}\}$ and let $U=\bigcup_{c\in[q]}\,U_{c}$ . Then

[TABLE]

(The lower bound in the first line is trivial, while the upper bound follows since $U_{c}$ fails for each $c\in[q]$ . The upper bound in the second line is trivial, while the lower bound follows by taking the term corresponding to some colour $c$ where $U_{c}$ holds.) Consequently, we obtain

[TABLE]

pointwise. Furthermore, by inclusion/exclusion,

[TABLE]

Since ${\boldsymbol{\rho}}_{1,1,1},\ldots,{\boldsymbol{\rho}}_{1,1,\boldsymbol{D}_{1}}$ are mutually independent given ${\mathcal{R}}$ , for any set $Q\subseteq[q]$ of size $k$ we find that

[TABLE]

using (2.16)–(2.18). Hence, (4.31) yields

[TABLE]

Finally, the assertion follows from (4.30) and (4.32). ∎

Lemma 4.7.

We have

[TABLE]

Proof.

Let $U$ be the event that there exists $c\in[q]$ such that ${\boldsymbol{\rho}}_{1,1}^{\prime}={\boldsymbol{\rho}}_{1,1}^{\prime\prime}=\delta_{c}$ . Then

[TABLE]

(The sum over $\tau$ equals 0 if ${\boldsymbol{\rho}}_{1,1}^{\prime}$ and ${\boldsymbol{\rho}}_{1,1}^{\prime\prime}$ are atoms on two different colours, and equals $1/q$ otherwise.) Therefore, we have pointwise convergence

[TABLE]

Since $\mathbb{P}\left[{U\mid{\mathcal{R}}}\right]=(1-\boldsymbol{\alpha}_{1})(1-\boldsymbol{\alpha}_{2})/q$ , using (2.16)–(2.18), the assertion follows from (4.33). ∎

Proof of Proposition 2.12.

The proposition follows from Lemmas 4.6 and 4.7 immediately. ∎

5. Asymptotics

We perform asymptotic expansions of $\Sigma_{d,q}(\,\cdot\,)$ , $\Sigma_{d,q}^{*}(\,\cdot\,)$ in the limit of large $q$ to prove Corollaries 1.2 and 1.4. In this section, the notation $\tilde{O}_{q}(\cdot)$ suppresses polynomials in $\log q$ , and both $O_{q}(\cdot)$ and $\tilde{O}_{q}(\cdot)$ refer to the limit $q\to\infty$ .

5.1. Proof of Corollary 1.2

Write $\Sigma_{d,q}(\alpha)=S-T$ , where

[TABLE]

We will let

[TABLE]

with $c=O_{q}(1)$ , and expand $S$ and $T$ asymptotically in the limit $q\to\infty$ . Substituting for $d$ in $S$ gives

[TABLE]

Observe that

[TABLE]

and so, for the $i=0$ term we have the expansion

[TABLE]

Moreover, for $i\geq 1$ we have

[TABLE]

Plugging (5.4) and (5.5) into (5.2) gives

[TABLE]

Similarly, substituting for $\alpha$ and $d$ in $T$ gives

[TABLE]

Hence,

[TABLE]

Consequently, if $c\leq 1-\varepsilon_{q}$ where $\varepsilon_{q}\to 0$ slowly enough then $\Sigma_{d,q}(\alpha)<0$ for large enough $q$ . This completes the proof of Corollary 1.2.

5.2. Proof of Corollary 1.4

With $\alpha,d$ as in (5.1) we consider the distribution $\mathfrak{a}=\delta_{\alpha}$ . With $\boldsymbol{D}\sim{\rm Po}(d)$ let

[TABLE]

First,

[TABLE]

To see this, we interpret the sum in the middle as an inclusion/exclusion formula. Namely, choose $\boldsymbol{c}_{1},\ldots,\boldsymbol{c}_{\boldsymbol{D}}\in\{0,1,\ldots,q\}$ independently such that the probability of drawing $0$ equals $\alpha$ and the probability of drawing $i\in[q]$ equals $(1-\alpha)/q$ . Then the sum equals the probability of the event $[q]\setminus\left\{{\boldsymbol{c}_{1},\ldots,\boldsymbol{c}_{\boldsymbol{D}}}\right\}\neq\emptyset$ , which is clearly lower bounded by the probability $\alpha^{\boldsymbol{D}}=(2q)^{-\boldsymbol{D}}$ that every draw equals $0$ . Poisson tail bounds show that $\mathbb{P}[\,|\boldsymbol{D}-d|\geq 10\sqrt{q}\log q]=O_{q}(q^{-4})$ and combining this with (5.7) gives

[TABLE]
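The inclusion/exclusion interpretation used above can be checked on a small instance. The sketch below computes the probability that some colour in $[q]$ is missed by $D$ independent draws from $\{0,1,\ldots,q\}$ , once via the standard inclusion/exclusion sum and once by exhaustive enumeration. The explicit form of the sum is the textbook one for this event and is an assumption about how it matches the paper's expression; the function names and the values of $q$ and $D$ are illustrative.

```python
import itertools
import math

def miss_probability_inclusion_exclusion(q, D, alpha):
    """P[some colour in {1..q} is never drawn], via inclusion/exclusion.

    A fixed set of k colours is avoided by all D draws
    with probability (1 - k (1 - alpha) / q)^D.
    """
    return sum(
        (-1) ** (k + 1) * math.comb(q, k) * (1 - k * (1 - alpha) / q) ** D
        for k in range(1, q + 1)
    )

def miss_probability_enumeration(q, D, alpha):
    """The same probability by summing over all (q + 1)^D draw sequences."""
    weights = [alpha] + [(1 - alpha) / q] * q  # index 0 is the extra symbol 0
    colours = set(range(1, q + 1))
    total = 0.0
    for draws in itertools.product(range(q + 1), repeat=D):
        if colours - set(draws):  # at least one colour is missing
            total += math.prod(weights[c] for c in draws)
    return total

# Illustrative instance with alpha = 1 / (2 q), as in the lower bound (2q)^(-D).
q, D = 3, 4
alpha = 1 / (2 * q)
by_formula = miss_probability_inclusion_exclusion(q, D, alpha)
by_enumeration = miss_probability_enumeration(q, D, alpha)
```

The two values agree, and both are at least `alpha ** D`, the probability that every draw equals $0$ .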

Hence, let $\boldsymbol{\Delta}$ be distributed as $\boldsymbol{D}-d$ given $\left|{\boldsymbol{D}-d}\right|\leq 10\sqrt{q}\log q$ . Then

[TABLE]

For the $i=0$ term, using (5.1) and (5.3), we have the expansion

[TABLE]

Moreover, for $i\geq 1$ , using the fact that $\boldsymbol{\Delta}/q=\tilde{O}_{q}(q^{-1/2})$ , we obtain

[TABLE]

Plugging (5.10) and (5.11) into (5.8), we obtain

[TABLE]

Since $d=\mathbb{E}[\boldsymbol{D}]$ and since conditioning on $\left|{\boldsymbol{D}-d}\right|\leq 10\sqrt{q}\log q$ does not shift the mean of $\boldsymbol{D}$ by more than $O_{q}(1/q)$ , we obtain $\mathbb{E}[\boldsymbol{\Delta}]=O_{q}(1/q)$ . Using this and (5.12) yields

[TABLE]
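The two Poisson estimates used in this argument, the tail bound for $|\boldsymbol{D}-d|\geq 10\sqrt{q}\log q$ and the small shift of the mean under conditioning, can be sanity-checked numerically at a fixed $q$ . The sketch below is only a finite-$q$ illustration of asymptotic statements. It assumes $d=(2q-1)\log q-c$ with $c=1$ , in line with the threshold stated in the introduction; the exact parametrisation (5.1) may differ, and the function names are hypothetical.

```python
import math

def poisson_log_pmf(k, mean):
    """log P[Po(mean) = k]."""
    return -mean + k * math.log(mean) - math.lgamma(k + 1)

def window_statistics(q, c=1.0):
    """Return (tail probability, conditional mean of D - d) for D ~ Po(d).

    Assumes d = (2q - 1) log q - c. The window is |D - d| <= 10 sqrt(q) log q.
    Mass far beyond the window is cut off at `upper` and is negligible.
    """
    d = (2 * q - 1) * math.log(q) - c
    width = 10 * math.sqrt(q) * math.log(q)
    upper = int(d + width + 60 * math.sqrt(d)) + 1
    inside_mass = inside_mean = tail = 0.0
    for k in range(upper + 1):
        p = math.exp(poisson_log_pmf(k, d))
        if abs(k - d) <= width:
            inside_mass += p
            inside_mean += (k - d) * p
        else:
            tail += p
    return tail, inside_mean / inside_mass

q = 50
tail, mean_shift = window_statistics(q)
```

For this choice of $q$ the tail mass is far below $q^{-4}$ and the conditional mean of $\boldsymbol{D}-d$ is far below $1/q$ in absolute value.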

Combining (5.13) with the expansion (5.1) of $T$ , we finally obtain

[TABLE]

Thus, setting $c\leq 1-\varepsilon_{q}$ with $\varepsilon_{q}\to 0$ slowly, we see that $\Sigma^{*}_{d,q}<0$ for large enough $q$ . This completes the proof of Corollary 1.4.

Acknowledgment.

We thank Viktor Harangi and an anonymous reviewer for their very careful reading of our manuscript and their extremely accurate and helpful comments, which led to several improvements and corrections.

Bibliography

[1] D. Achlioptas, E. Friedgut: A sharp threshold for $k$-colorability. Random Struct. Algorithms 14 (1999) 63–70.
[2] D. Achlioptas, C. Moore: Almost all graphs with average degree 4 are 3-colorable. Journal of Computer and System Sciences 67 (2003) 441–471.
[3] D. Achlioptas, C. Moore: The chromatic number of random regular graphs. Proc. 8th RANDOM (2004) 219–228.
[4] D. Achlioptas, A. Naor: The two possible values of the chromatic number of a random graph. Annals of Mathematics 162 (2005) 1333–1349.
[5] N. Alon, M. Krivelevich: The concentration of the chromatic number of random graphs. Combinatorica 17 (1997) 303–313.
[6] V. Bapst, A. Coja-Oghlan, S. Hetterich, F. Rassmann, D. Vilenchik: The condensation phase transition in random graph coloring. Communications in Mathematical Physics 341 (2016) 543–606.
[7] M. Bayati, D. Gamarnik, P. Tetali: Combinatorial approach to the interpolation method and scaling limits in sparse random graphs. Annals of Probability 41 (2013) 4080–4115.
[8] B. Bollobás: The chromatic number of random graphs. Combinatorica 8 (1988) 49–55.
