Efficiently list-edge coloring multigraphs asymptotically optimally

Fotis Iliopoulos; Alistair Sinclair

arXiv:1812.10309·cs.DM·December 8, 2021


TL;DR

This paper develops polynomial-time algorithms for asymptotically optimal list-edge coloring of multigraphs, transforming non-constructive probabilistic proofs into constructive algorithms using advanced local search and correlation decay techniques.

Contribution

It introduces a constructive approach to Kahn's asymptotic results on edge coloring multigraphs by leveraging local search analysis and correlation decay methods.

Findings

01. Algorithms match Kahn's asymptotic bounds

02. Use of local search analysis for constructive algorithms

03. Correlation decay exploited for efficiency

Abstract

We give polynomial time algorithms for the seminal results of Kahn, who showed that the Goldberg-Seymour and List-Coloring conjectures for (list-)edge coloring multigraphs hold asymptotically. Kahn's arguments are based on the probabilistic method and are non-constructive. Our key insight is to show that the main result of Achlioptas, Iliopoulos and Kolmogorov for analyzing local search algorithms can be used to make constructive applications of a powerful version of the so-called Lopsided Lovász Local Lemma. In particular, we use it to design algorithms that exploit the fact that correlations in the probability spaces on matchings used by Kahn decay with distance.

Equations

$$b_{i}\leq x_{i}\prod_{j\in L(i)}(1-x_{j})\quad\text{for all }i\in[m],$$

$$\gamma_{i}=\max_{\tau\in\Omega}\sum_{\sigma\in f_{i}}\frac{\mu(\sigma)}{\mu(\tau)}\rho_{i}(\sigma,\tau).$$

$$\gamma_{i}\leq(1-\epsilon)\,x_{i}\prod_{j\in\Gamma(i)}(1-x_{j})\quad\text{for every }i\in[m]$$

$$T_{0}=\log_{2}\left(\max_{\sigma\in\Omega}\frac{\theta(\sigma)}{\mu(\sigma)}\right)+\sum_{j\in[m]}\log_{2}\left(\frac{1}{1-x_{j}}\right).$$

$$\nu(M)=\frac{\lambda(M)}{\sum_{M^{\prime}\in\mathcal{M}(G)}\lambda(M^{\prime})}.$$

$$\Pr(e\in M\mid Q)\in(1\pm\epsilon)\Pr(e\in M),$$

$$d_{i}:=\max_{\tau\in\Omega}\frac{\nu_{i}(\tau)}{\mu(\tau)}\geq 1,$$

$$\gamma_{i}=\max_{\tau\in\Omega}\frac{1}{\mu(\tau)}\sum_{\sigma\in f_{i}}\mu(\sigma)\rho_{i}(\sigma,\tau)=d_{i}\cdot\mu(f_{i}).$$

$$\mu\Bigl(f_{i}\mid\bigcap_{j\in S}\overline{f_{j}}\Bigr)\leq\gamma_{i},$$

$$\mu(f_{i}\mid F_{S})$$

$$\frac{\sum_{\tau\in F_{S}}\sum_{\sigma\in f_{i}\cap F_{S}}\mu(\sigma)\rho_{i}(\sigma,\tau)}{\mu(F_{S})}=\frac{\sum_{\tau\in F_{S}}\mu(\tau)\sum_{\sigma\in f_{i}\cap F_{S}}\frac{\mu(\sigma)}{\mu(\tau)}\rho_{i}(\sigma,\tau)}{\mu(F_{S})}\leq\frac{\sum_{\tau\in F_{S}}\mu(\tau)\left(\max_{\tau^{\prime}\in\Omega}\sum_{\sigma\in f_{i}}\frac{\mu(\sigma)}{\mu(\tau^{\prime})}\rho_{i}(\sigma,\tau^{\prime})\right)}{\mu(F_{S})}=\gamma_{i}.$$

$$f_{v}=\left\{\sigma\in\Omega:d_{G_{\sigma}}(v)>c^{*}-\frac{\epsilon}{4}N\right\}.$$

$$f_{H}=\{\sigma\in\Omega:H\subseteq G_{\sigma}\}.$$

$$|E(H)|\leq\frac{|V(H)|-1}{2}c^{*}.$$

$$|E(H)|=|E(F)|+|E(H^{\prime})|\leq\frac{|V(F)|c^{*}}{2}+|E(H^{\prime})|.$$

$$|E(H)|\leq(|V(F)|+|V(H^{\prime})|-1)c^{*}/2=(|V(H)|-1)c^{*}/2.$$

$$Q_{H}(d,\sigma)=(M_{1}-S_{<d}(H),\dots,M_{N}-S_{<d}(H)),$$

$$t=8(K+1)^{2}\delta^{-1}+2.$$

$$\gamma_{f}\cdot\Bigl(1+\max_{f^{\prime}\in F}|\Gamma(f^{\prime})|\Bigr)\cdot\mathrm{e}\leq 3/4\quad\text{for every flaw }f,$$

$$\gamma_{f_{v}}=\max_{\tau\in\Omega}\mu\bigl(\sigma\in f_{v}\mid Q_{v}(t,\sigma)=Q_{v}(t,\tau)\bigr).$$

$$\sum_{\omega\in f_{v}}\frac{\mu(\omega)}{\mu(\tau)}\rho_{f_{v}}(\omega,\tau)=\mu\bigl(\sigma\in f_{v}\mid Q_{v}(t,\sigma)=Q_{v}(t,\tau)\bigr).$$

$$\frac{\mu(\omega)}{\mu(\tau)}=\prod_{i=1}^{N}\frac{\nu(M_{i}^{\omega})}{\nu(M_{i})}=\prod_{i=1}^{N}\frac{\lambda(M_{i}^{\omega}\cap E(G_{<t+1}(v)))}{\lambda(M_{i}\cap E(G_{<t+1}(v)))}.$$

$$\rho_{f_{v}}(\omega,\tau)=\prod_{i=1}^{N}\frac{\lambda(M_{i}\cap E(G_{<t+1}(v)))}{\sum_{M\in\mathcal{M}_{t+1}^{i}(v,\tau)}\lambda(M)}$$

$$\frac{\mu(\omega)}{\mu(\tau)}\rho_{f_{v}}(\omega,\tau)=\prod_{i=1}^{N}\frac{\lambda(M_{i}^{\omega}\cap E(G_{<t+1}(v)))}{\sum_{M\in\mathcal{M}_{t+1}^{i}(v,\tau)}\lambda(M)}.$$

$$\prod_{i=1}^{N}\frac{\lambda(M_{i}^{\omega}\cap E(G_{<t+1}(v)))}{\sum_{M\in\mathcal{M}_{t+1}^{i}(v,\tau)}\lambda(M)}=\prod_{i=1}^{N}\nu\bigl(M_{i}^{\sigma}=M_{i}^{\omega}\mid Q_{v}^{i}(t,\sigma)=Q_{v}^{i}(t,\tau)\bigr)=\mu\bigl(\sigma=\omega\mid Q_{v}(t,\sigma)=Q_{v}(t,\tau)\bigr).$$

$$\sum_{\omega\in f_{v}}\frac{\mu(\omega)}{\mu(\tau)}\rho_{f_{v}}(\omega,\tau)=\sum_{\omega\in f_{v}}\mu\bigl(\sigma=\omega\mid Q_{v}(t,\sigma)=Q_{v}(t,\tau)\bigr)=\mu\bigl(\sigma\in f_{v}\mid Q_{v}(t,\sigma)=Q_{v}(t,\tau)\bigr),$$

$$T_{\Delta}(s):=\frac{\binom{\Delta s}{s}}{(\Delta-1)s+1},$$

$$T_{\Delta}(s)\leq\frac{\left(\frac{\Delta s\cdot\mathrm{e}}{s}\right)^{s}}{(\Delta-1)s+1}=\frac{(\Delta\cdot\mathrm{e})^{s}}{(\Delta-1)s+1}\leq(\mathrm{e}\Delta)^{s-1}$$


Full text

Efficiently list-edge coloring multigraphs asymptotically optimally

Fotis Iliopoulos

University of California Berkeley

Research supported by NSF grant CCF-1514434 and the Onassis Foundation.

Alistair Sinclair

University of California Berkeley

Research supported by NSF grants CCF-1514434 and CCF-1815328.

Abstract

We give polynomial time algorithms for the seminal results of Kahn [22, 23], who showed that the Goldberg-Seymour and List-Coloring conjectures for (list-)edge coloring multigraphs hold asymptotically. Kahn’s arguments are based on the probabilistic method and are non-constructive. Our key insight is that we can combine sophisticated techniques due to Achlioptas, Iliopoulos and Kolmogorov [2] for the analysis of local search algorithms with correlation decay properties of the probability spaces on matchings used by Kahn in order to construct efficient edge-coloring algorithms.

Keywords: edge-coloring, multigraphs, Goldberg-Seymour conjecture, list-edge-coloring conjecture

1 Introduction

In graph edge coloring one is given a (multi)graph $G=(V,E)$ and the goal is to find an assignment of one of $q$ colors to each edge $e\in E$ so that no pair of adjacent edges share the same color. The chromatic index, $\chi_{e}(G)$ , of $G$ is the smallest integer $q$ for which this is possible. In the more general list-edge coloring problem, a list of $q$ allowed colors is specified for each edge. A graph is $q$ -list-edge colorable if it has a list-coloring no matter how the lists are assigned to each edge. The list chromatic index, $\chi^{\ell}_{e}(G)$ , is the smallest $q$ for which $G$ is $q$ -list-edge colorable.

Edge coloring is one of the most fundamental and well-studied coloring problems with various applications in computer science (e.g., [7, 12, 21, 22, 23, 34, 36, 38, 39, 40, 42]). To give just one representative example, if edges represent data packets then an edge coloring with $q$ colors specifies a schedule for exchanging the packets directly and without node contention. In this paper we are interested in designing algorithms for efficiently edge coloring and list-edge coloring multigraphs. To formally describe our results, we need some notation.

For a multigraph $G$ , let $\mathcal{M}(G)$ denote the set of matchings of $G$ . A fractional edge coloring is a set $\{M_{1},\ldots,M_{\ell}\}$ of matchings and corresponding positive real weights $\{w_{1},\ldots,w_{\ell}\}$ , such that the sum of the weights of the matchings containing each edge is one, i.e., $\forall e\in E$ , $\sum_{M_{i}:e\in M_{i}}w_{i}=1$ . A fractional edge coloring is a fractional edge $c$ -coloring if $\sum_{M\in\mathcal{M}(G)}w_{M}=c$ . The fractional chromatic index of $G$ , denoted by $\chi_{e}^{*}(G)$ , is the minimum $c$ such that $G$ has a fractional edge $c$ -coloring.

Let $\Delta=\Delta(G)$ be the maximum degree of $G$ and define $\Gamma:=\max_{H\subseteq V,|H|\geq 2}\frac{|E(H)|}{\lfloor|H|/2\rfloor}$ , where $E(H)$ is the set of edges of the induced subgraph $H$ . Both of these quantities are obvious lower bounds for the chromatic index and it is known [9] that $\chi_{e}^{*}(G)=\max(\Delta,\Gamma)$ . Furthermore, Padberg and Rao [35] show that the fractional chromatic index of a multigraph, and indeed an optimal fractional edge coloring, can be computed in polynomial time.
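To make the identity $\chi_{e}^{*}(G)=\max(\Delta,\Gamma)$ concrete, the following is a minimal brute-force sketch that evaluates both lower bounds on a tiny multigraph. It enumerates all vertex subsets, so it is exponential and only meant for illustration; it is not the polynomial-time method of Padberg and Rao. The function name and the edge-list representation are assumptions made for this example.

```python
from fractions import Fraction
from itertools import combinations

def fractional_chromatic_index(vertices, edges):
    """Return (Delta, Gamma, max(Delta, Gamma)) for a small multigraph.

    `edges` is a list of (u, v) pairs; repeated pairs model parallel edges.
    Hypothetical helper: enumerates every vertex subset, so exponential time.
    """
    degree = {v: 0 for v in vertices}
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    delta = max(degree.values())

    gamma = Fraction(0)
    for size in range(2, len(vertices) + 1):
        for subset in combinations(vertices, size):
            inside = set(subset)
            # |E(H)|: edges with both endpoints in the subset H
            induced = sum(1 for u, v in edges if u in inside and v in inside)
            gamma = max(gamma, Fraction(induced, size // 2))
    return delta, gamma, max(Fraction(delta), gamma)

triangle = [(0, 1), (1, 2), (0, 2)]
result = fractional_chromatic_index([0, 1, 2], triangle)
```

For the triangle, $\Delta=2$ while the full vertex set gives $|E(H)|/\lfloor|H|/2\rfloor=3/1$, so the bound $\Gamma$ is the binding one.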

Goldberg and Seymour independently stated the now famous conjecture that every multigraph $G$ satisfies $\chi_{e}(G)\leq\max\left(\Delta+1,\lceil\chi_{e}^{*}(G)\rceil\right)$ . In a seminal paper [22], Kahn showed that the Goldberg-Seymour conjecture holds asymptotically:

Theorem 1.1 ([22]).

The chromatic index of a multigraph $G$ satisfies $\chi_{e}(G)\leq(1+o(1))\chi_{e}^{*}(G).$

(Here $o(1)$ denotes a term that tends to zero as $\chi_{e}(G)\to\infty$ .) Later Kahn proved the analogous result for list-edge coloring [23], establishing that the List Coloring Conjecture, which asserts that $\chi_{e}^{\ell}(G)=\chi_{e}(G)$ for any multigraph $G$ , also holds asymptotically:

Theorem 1.2 ([23]).

The list chromatic index of a multigraph $G$ satisfies $\chi_{e}^{\ell}(G)\leq(1+o(1))\chi_{e}^{*}(G).$

The proofs of Kahn use the probabilistic method and are not constructive. The main contribution of this paper is to provide polynomial time algorithms for the above results, as follows:

Theorem 1.3.

There exists a randomized algorithm that, given a multigraph $G$ on $n$ vertices, constructs a $(1+o(1))\chi_{e}^{*}(G)$ -edge coloring of $G$ in expected polynomial time.

Theorem 1.4.

There exists a randomized algorithm that, given a multigraph $G$ on $n$ vertices and an arbitrary list of $(1+o(1))\chi_{e}^{*}(G)$ colors for each edge, constructs a valid list-edge coloring of $G$ in expected polynomial time.

Clearly, Theorem 1.4 subsumes Theorem 1.3. Moreover, in a very recent breakthrough [6], Chen, Jing and Zang proved the (non-asymptotic) Goldberg-Seymour conjecture without exploiting the arguments of Kahn. Even before this work, the results of Sanders and Steurer [38] and Scheide [40] already give deterministic polynomial time algorithms for edge coloring multigraphs asymptotically optimally, again without exploiting the arguments of Kahn. Nonetheless, we choose to present the proof of Theorem 1.3 for three reasons. First and most importantly, its proof is significantly easier than that of Theorem 1.4, while it contains many of the key ideas required for proving Theorem 1.4. Second, our algorithms and techniques are very different from those of [6, 38, 40]. Finally, we show that the algorithm of Theorem 1.3 is commutative, a notion introduced by Kolmogorov [27]. This fact may be of independent interest as we discuss in Remark 2.1 in Section 2.2.

To the best of our knowledge, Theorem 1.4 is the first result to give an asymptotically optimal polynomial time algorithm for list-edge coloring multigraphs.

1.1 Technical Overview

The proofs of Theorems 1.1 and 1.2 are based on a very sophisticated variation of what is known as the semi-random method (also known as the “naive coloring procedure”), which is the main technical tool behind some of the strongest graph coloring results, e.g., [20, 21, 25, 30]. The idea is to gradually color the graph in iterations, until we reach a point where we can finish the coloring using a greedy algorithm. In its most basic form, each iteration consists of the following simple procedure: assign to each edge a color chosen uniformly at random; then uncolor any edge which receives the same color as one of its neighbors. Using the Lovász Local Lemma (LLL) [10] and concentration inequalities, one typically shows that, with positive probability, the resulting partial proper coloring has useful properties that allow for the continuation of the argument in the next iteration. For a nice exposition of both the method and the proofs of Theorems 1.1 and 1.2, the reader is referred to [31].
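As an illustration of the most basic form of an iteration described above (and not of Kahn's matching-based variant), one round might be sketched as follows. The edge-list representation, the adjacency test and all names are assumptions of this example.

```python
import random

def naive_coloring_round(edges, num_colors, rng):
    """One round of the basic procedure: give every edge a uniformly random
    color, then uncolor each edge that shares its color with an adjacent edge.

    `edges` is a list of (u, v) pairs; parallel edges count as adjacent.
    Returns {edge index: color} for the edges that keep their color.
    """
    tentative = [rng.randrange(num_colors) for _ in edges]
    kept = {}
    for i, (u, v) in enumerate(edges):
        conflict = any(
            j != i
            and tentative[j] == tentative[i]
            and bool({u, v} & set(edges[j]))   # edges i and j share an endpoint
            for j in range(len(edges))
        )
        if not conflict:
            kept[i] = tentative[i]
    return kept

example_edges = [(0, 1), (1, 2), (2, 0), (0, 1)]
partial = naive_coloring_round(example_edges, 3, random.Random(0))
```

By construction the surviving assignment is a partial proper coloring; the analysis in the text concerns the properties of the uncolored remainder, which this sketch does not track.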

The key new ingredient in Kahn’s arguments is the method of assigning colors to edges. For each color $c$ , we choose a matching $M_{c}$ from some hard-core distribution on $\mathcal{M}(G)$ and assign the color $c$ to the edges in $M_{c}$ . The idea is that, by assigning each color exclusively to the edges of one matching, we avoid conflicting color assignments and the resulting uncolorings.

The existence of such hard-core distributions is guaranteed by the characterization of the matching polytope due to Edmonds [9] and a result by Lee [28] (also shown independently by Rabinovich et al. [37]). The crucial fact about them is that they are endowed with very useful approximate stochastic independence properties, as was shown by Kahn and Kayll in [24]. In particular, for every edge $e$ , conditioning on events that are determined by edges far enough from $e$ in the graph does not effectively alter the probability of $e$ being in the matching.

This property is important because it enables the application of a sophisticated version of what is known as the Lopsided Lovász Local Lemma. Recall that the original statement of the LLL asserts, roughly, that, given a family of “bad” events in a probability space, if each bad event individually is not very likely and, in addition, is independent of all but a small number of other bad events, then the probability of avoiding all bad events is strictly positive. The Lopsided LLL used by Kahn generalizes this criterion as follows. For each bad event $B$ , we fix a positive real number $\mu_{B}$ and require that conditioning on all but a small number of other bad events doesn’t make the probability of $B$ larger than $\mu_{B}$ . Then, provided the $\mu_{B}$ are small enough, the conclusion of the LLL still holds. In other words, one replaces the “probability of a bad event” in the original LLL statement with the “boosted” probability of the event, and the notion of “independence” by the notion of “sufficiently mild negative correlation”.

Notably, the breakthrough result of Moser and Tardos [32, 33] that made the LLL constructive for the vast majority of its applications does not apply in this case, mainly for two reasons. First, the algorithm of Moser and Tardos applies only when the underlying probability measure of the LLL application is a product over explicitly presented variables. Second, it relies on a particular type of dependency (defined by shared variables). The lack of an efficient algorithm for Lopsided LLL applications is the primary obstacle to making the arguments of Kahn constructive.

Our main technical contribution is the design and analysis of such algorithms. Towards this goal, we use the flaws-actions framework introduced in [1] and further developed in [2, 3, 4, 16, 18]. In particular, we use the algorithmic LLL criterion for the analysis of stochastic local search algorithms developed by Achlioptas, Iliopoulos and Kolmogorov in [2]. We start by showing that there is a connection between this criterion and the version of the Lopsided LLL used by Kahn, in the sense that the former can be seen as the constructive counterpart of the latter. However, this observation by itself is not sufficient, since the result of [2] is a tool for analyzing a given stochastic local search algorithm. Thus, we are still left with the task of designing the algorithm before using it. Nonetheless, this connection provides valuable intuition on how to realize this task. Moreover, we believe it is of independent interest as it provides an explanation for the success of various algorithms (such as [29]) inspired by the techniques of Moser and Tardos, which were not tied to a known form of the LLL.

To get a feeling for the nature of our algorithms, it is helpful to have some intuition for the criterion of [2]. There, the input is the algorithm to be analyzed and a probability measure $\mu$ over the state space of the algorithm. The goal of the algorithm is to reach a state that avoids a family of bad subsets of the space which we call flaws. It does this by focusing on a flaw that is currently present at each step, and taking a (possibly randomized) action to address it. At a high level, the role of the measure is to gauge how efficiently the algorithm rids the state of flaws, by quantifying the trade-off between the probability that a flaw is present at some inner state of the execution of the algorithm and the number of other flaws each flaw can possibly introduce when the algorithm addresses it. In particular, the quality of the convergence criterion is affected by the compatibility between the measure and the algorithm.

Roughly, the states of our algorithm will be matchings in a multigraph (corresponding to color classes) and the goal will be to construct matchings that avoid certain flaws. To that end, our algorithm will locally modify each flawed matching by (re)sampling matchings in subgraphs of $G$ according to distributions induced by the hard-core distributions used in Kahn’s proof. The fact that correlations decay with distance in these distributions allows us to prove that, while the changes are local, and hence not many new flaws are introduced at each step, the compatibility of our algorithms with these hard-core distributions is high enough to allow us to successfully apply the criterion of [2].

1.2 Organization of the Paper

In Section 2 we present the necessary background. In Section 3 we show a useful connection between the version of the Lopsided LLL used by Kahn and the algorithmic LLL criterion of [2]. In Section 4 we present the proof of Theorem 1.3. In Section 5, we sketch the proof of Theorem 1.2 and then prove Theorem 1.4.

2 Background and Preliminaries

2.1 The Lopsided Lovász Local Lemma

Erdős and Spencer [11] noted that independence in the LLL can be replaced by positive correlation, yielding the original version of what is known as the Lopsided LLL, more sophisticated versions of which have also been established in [5, 8]. Below we state the Lopsided LLL in one of its most powerful forms.

Theorem 2.1 (General Lopsided LLL).

Let $(\Omega,\mu)$ be a probability space and $\mathcal{B}=\{B_{1},B_{2},\ldots,B_{m}\}$ be a set of $m$ (bad) events. For each $i\in[m]$ , let $L(i)\subseteq[m]\setminus\{i\}$ be such that $\mu(B_{i}\mid\bigcap_{j\in S}\overline{B_{j}})\leq b_{i}$ for every $S\subseteq[m]\setminus(L(i)\cup\{i\})$ . If there exist positive real numbers $\{x_{i}\}_{i=1}^{m}$ such that

$$b_{i}\leq x_{i}\prod_{j\in L(i)}(1-x_{j})\quad\text{for all }i\in[m],\tag{1}$$

then the probability that none of the events in $\mathcal{B}$ occurs is at least $\prod_{i=1}^{m}(1-x_{i})>0$ .

Note that in most applications of the Lopsided LLL the definition of sets $\{L(i)\}_{i\in[m]}$ is “symmetric”, in the sense that if $j\in L(i)$ then $i\in L(j)$ for every $i,j\in[m]$ . With that in mind, any (undirected) graph on $[m]$ that includes every edge $(i,j)$ such that $j\in L(i)$ or $i\in L(j)$ is called a lopsidependency graph.

2.2 An Algorithmic LLL Criterion

Let $\Omega$ be a discrete state space, and let $F=\{f_{1},f_{2},\ldots,f_{m}\}$ be a collection of subsets (which we call flaws) of $\Omega$ . We define $\bigcup_{i\in[m]}f_{i}=\Omega^{*}$ . Our goal is to find a state $\sigma\in\Omega\setminus\Omega^{*}$ ; we refer to such states as flawless.

For a state $\sigma$ , we denote by $U(\sigma)=\{f_{j}\in F\text{ s.t. }f_{j}\ni\sigma\}$ the set of flaws present in $\sigma$ . We consider local search algorithms working on $\Omega$ which, in each flawed state $\sigma\in\Omega^{*}$ , choose a flaw $f_{i}$ in $U(\sigma)$ and randomly move to a nearby state in an effort to fix $f_{i}$ . We will assume that, for every flaw $f_{i}$ and every state $\sigma\in f_{i}$ , there is a non-empty set of actions $a(i,\sigma)\subseteq\Omega$ such that addressing flaw $f_{i}$ in state $\sigma$ amounts to selecting the next state $\tau$ from $a(i,\sigma)$ according to some probability distribution $\rho_{i}(\sigma,\tau)$ . Note that potentially $a(i,\sigma)\cap f_{i}\neq\emptyset$ , i.e., addressing a flaw does not necessarily imply removing it. We write $\sigma\xrightarrow{i}\tau$ to denote the fact that the algorithm addresses flaw $f_{i}$ at $\sigma$ and moves to $\tau$ .

Throughout the paper we consider algorithms that start from a state $\sigma\in\Omega$ picked from an initial distribution $\theta$ , and then repeatedly pick a flaw that is present in the current state and address it. The algorithm always terminates when it encounters a flawless state.
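The generic loop just described can be sketched in a few lines. This is an illustrative skeleton under stated assumptions: flaws are given as predicates, the selection rule is "lowest index first" (the rule Theorem 2.4 assumes), and the toy instance (properly 3-coloring the vertices of a 4-cycle) is hypothetical and unrelated to the algorithms of this paper.

```python
import random

def local_search(initial_state, flaws, address, rng, max_steps=10_000):
    """Repeatedly address the lowest-indexed flaw present in the current state.

    flaws   : list of predicates; flaws[i](state) is True when f_i contains state
    address : address(i, state, rng) samples the next state from rho_i(state, .)
    """
    state = initial_state
    for step in range(max_steps):
        present = [i for i, f in enumerate(flaws) if f(state)]   # U(sigma)
        if not present:
            return state, step            # flawless state reached
        state = address(present[0], state, rng)
    raise RuntimeError("step budget exhausted before reaching a flawless state")

# Toy instance (hypothetical): flaw i says edge i of a 4-cycle is monochromatic.
CYCLE = [(0, 1), (1, 2), (2, 3), (3, 0)]
flaws = [lambda s, u=u, v=v: s[u] == s[v] for u, v in CYCLE]

def address(i, state, rng):
    # rho_i: resample the colors of both endpoints of edge i uniformly at random
    new = list(state)
    for vertex in CYCLE[i]:
        new[vertex] = rng.randrange(3)
    return tuple(new)

rng = random.Random(0)
start = tuple(rng.randrange(3) for _ in range(4))   # theta: uniform initial state
final, steps = local_search(start, flaws, address, rng)
```

Note that addressing a flaw here may leave it in place or create a flaw on a neighboring edge, which is exactly the behavior the causality relation defined next is meant to capture.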

Definition 2.2 (Causality).

We say that flaw $f_{i}$ causes $f_{j}$ if there exists a transition $\sigma\xrightarrow{i}\tau$ such that (i) $f_{j}\ni\tau$ ; (ii) either $f_{i}=f_{j}$ or $f_{j}\not\ni\sigma$ .

Definition 2.3 (Causality Graph).

Any (undirected) graph $C=C(\Omega,F)$ on $[m]$ that includes every edge $(i,j)$ such that either $f_{i}$ causes $f_{j}$ or $f_{j}$ causes $f_{i}$ is called a causality graph. We write $\Gamma(i)$ for the set of neighbors of $i$ in this graph. We also write $i\sim j$ to denote that $j\in\Gamma(i)$ (or equivalently, $i\in\Gamma(j)$ ).

For a given probability measure $\mu$ supported on the state space $\Omega$ , and for each flaw $f_{i}$ , we define the charge

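$$\gamma_{i}=\max_{\tau\in\Omega}\sum_{\sigma\in f_{i}}\frac{\mu(\sigma)}{\mu(\tau)}\rho_{i}(\sigma,\tau).\tag{2}$$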
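For a small state space the charge $\gamma_{i}=\max_{\tau\in\Omega}\sum_{\sigma\in f_{i}}\frac{\mu(\sigma)}{\mu(\tau)}\rho_{i}(\sigma,\tau)$ can be evaluated by direct enumeration. The sketch below does this for a hypothetical toy instance: two uniform bits, a single flaw "both bits are 1", and an action that resamples both bits uniformly. All names are assumptions of this example.

```python
from fractions import Fraction
from itertools import product

def charge(states, mu, flaw, rho):
    """Brute-force charge: max over tau of sum_{sigma in f_i} mu(sigma)/mu(tau) * rho_i(sigma, tau)."""
    return max(
        sum(mu[s] / mu[t] * rho(s, t) for s in states if flaw(s))
        for t in states
    )

states = list(product([0, 1], repeat=2))
mu = {s: Fraction(1, 4) for s in states}       # uniform measure on {0,1}^2
flaw = lambda s: s == (1, 1)                   # f_i = {(1, 1)}
rho = lambda s, t: Fraction(1, 4)              # resample both bits uniformly
gamma = charge(states, mu, flaw, rho)          # equals mu(f_i) = 1/4 here
```

In this toy case the action resamples from $\mu$ itself, so the charge coincides with the probability $\mu(f_{i})$ of the flaw; actions that are less compatible with $\mu$ would yield a larger charge.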

In Section 3 we give the intuition behind the definition of charges and also draw a connection with the parameters $b_{i}$ in Theorem 2.1. We are now ready to state the main result of [2].

Theorem 2.4 ([2]).

Assume that, at each step, the algorithm chooses to address the lowest indexed flaw according to an arbitrary, but fixed, permutation of $[m]$ . If there exist positive real numbers $x_{i}\in(0,1)$ for $1\leq i\leq m$ such that

$$\gamma_{i}\leq(1-\epsilon)\,x_{i}\prod_{j\in\Gamma(i)}(1-x_{j})\quad\text{for every }i\in[m]\tag{3}$$

for some $\epsilon\in(0,1)$ , then the algorithm reaches a flawless object within $(T_{0}+s)/\epsilon$ steps with probability at least $1-2^{-s}$ , where

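$$T_{0}=\log_{2}\left(\max_{\sigma\in\Omega}\frac{\theta(\sigma)}{\mu(\sigma)}\right)+\sum_{j\in[m]}\log_{2}\left(\frac{1}{1-x_{j}}\right).$$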
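The hypothesis and the quantity $T_{0}$ of Theorem 2.4 are simple arithmetic once the charges, the neighborhoods $\Gamma(i)$ and the $x_{i}$ are known. A minimal sketch, assuming these are supplied as plain Python lists and dictionaries (the function names and the numerical instance are hypothetical):

```python
import math

def check_criterion(gamma, neighbors, x, eps):
    """True if gamma[i] <= (1 - eps) * x[i] * prod_{j in Gamma(i)} (1 - x[j]) for all i.

    `neighbors[i]` lists Gamma(i); whether i itself belongs to Gamma(i)
    depends on the causality graph and is left to the caller.
    """
    return all(
        gamma[i] <= (1 - eps) * x[i] * math.prod(1 - x[j] for j in neighbors[i])
        for i in range(len(gamma))
    )

def t_zero(max_ratio, x):
    """T_0 = log2(max_sigma theta(sigma)/mu(sigma)) + sum_j log2(1 / (1 - x_j))."""
    return math.log2(max_ratio) + sum(math.log2(1 / (1 - xj)) for xj in x)

def step_bound(max_ratio, x, eps, s):
    """(T_0 + s) / eps: number of steps sufficing with probability at least 1 - 2**(-s)."""
    return (t_zero(max_ratio, x) + s) / eps

# Hypothetical instance: three flaws whose causality graph is the path 0 - 1 - 2.
neighbors = {0: [1], 1: [0, 2], 2: [1]}
x = [0.5, 0.5, 0.5]
ok = check_criterion([0.05, 0.05, 0.05], neighbors, x, eps=0.5)
```

With $\theta=\mu$ the first term of $T_{0}$ vanishes, so in this instance $T_{0}=3$ and the bound for $s=7$ is $(3+7)/0.5=20$ steps.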

We also describe another theorem that can be used to show convergence in a polynomial number of steps, even when the number of flaws is super-polynomial, assuming that the algorithm has a nice “commutativity” property which we describe next.

Definition 2.5.

For $i\in[m]$ , let $A_{i}$ denote the $|\Omega|\times|\Omega|$ matrix defined by $A_{i}[\sigma,\sigma^{\prime}]=\rho_{i}(\sigma,\sigma^{\prime})$ if $\sigma\in f_{i}$ , and $A_{i}[\sigma,\sigma^{\prime}]=0$ otherwise. An algorithm defined by matrices $A_{i}$ , $i\in[m]$ , is commutative with respect to a causality relation $\sim$ if for every $i,j\in[m]$ such that $i\nsim j$ we have $A_{i}A_{j}=A_{j}A_{i}$ .

Remark 2.1.

As shown in [27, 18, 16], commutative algorithms have several additional nice properties: they are often parallelizable, their output distribution approximates the so-called “LLL-distribution”, etc. Here we use the fact that commutative algorithms converge in polynomial time even in the presence of superpolynomially many flaws, assuming that the causality graph can be covered by a polynomial number of cliques (see Theorem 2.6 below). It is also worth noting that, if there were an efficient parallel algorithm for sampling matchings in multigraphs, namely a parallel version of the MCMC algorithm of Theorem 2.10 which we discuss in the next section and which we use in our algorithm for Theorem 1.3, then our proof directly implies a parallel algorithm for Theorem 1.3. The study of parallel versions of MCMC sampling algorithms has been initiated recently in [13, 14].

We note that Definition 2.5 was introduced in [16], as a generalization of the combinatorial definition of commutativity introduced in [27]. While the latter would suffice for our purposes, we choose to work with Definition 2.5 due to its compactness.

Theorem 2.6.

Let $\mathcal{A}$ be a commutative algorithm with respect to a causality relation $\sim$ . Assume there exist positive real numbers $\{x_{i}\}_{i\in[m]}$ in $(0,1)$ such that condition (3) holds. Assume further that the causality graph induced by $\sim$ can be covered by $n$ cliques with potentially further edges between them. Setting $\delta:=\min_{i\in[m]}x_{i}\prod_{j\in\Gamma(i)}(1-x_{j})$ , the expected number of steps performed by $\mathcal{A}$ is at most $t=O\left(\max_{\sigma\in\Omega}\frac{\theta(\sigma)}{\mu(\sigma)}\cdot\frac{n}{\epsilon}\log\frac{n\log(1/\delta)}{\epsilon}\right)$ , and for any parameter $\lambda\geq 1$ , $\mathcal{A}$ terminates within $\lambda t$ resamplings with probability $1-\mathrm{e}^{-\lambda}$ .

As shown in [18, Theorem 3.2], the proof of Theorem 2.6 reduces to that of the analogous result of Haeupler, Saha and Srinivasan [15] for the Moser-Tardos algorithm, and hence we omit it.

2.3 Hard-Core Distributions on Matchings

A probability distribution $\nu$ on the matchings of a multigraph $G$ is hard-core if it is obtained by associating to each edge $e$ a positive real $\lambda(e)$ (called the activity of $e$ ) so that the probability of any matching $M$ is proportional to $\prod_{e\in M}\lambda(e)$ . Thus, recalling that $\mathcal{M}(G)$ denotes the set of matchings of $G$ , and setting $\lambda(M)=\prod_{e\in M}\lambda(e)$ for each $M\in\mathcal{M}(G)$ , we have

$$\nu(M)=\frac{\lambda(M)}{\sum_{M^{\prime}\in\mathcal{M}(G)}\lambda(M^{\prime})}.$$

The characterization of the matching polytope due to Edmonds [9] and a result of Lee [28] (which was also shown independently by Rabinovich et al. [37]) imply the following connection between fractional edge colorings and hard-core probability distributions on matchings. Before describing it, we need a definition.

For any probability distribution $\nu$ on the matchings of a multigraph $G$ , we refer to the probability that a particular edge $e$ is in the random matching as the marginal of $\nu$ at $e$ . We write $(\nu_{e_{1}},\ldots,\nu_{e_{|E(G)|}})$ for the collection of marginals of $\nu$ at all the edges $e_{i}\in E(G)$ .
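On very small multigraphs, a hard-core distribution and its marginals can be computed exactly by listing all matchings. The following is a minimal sketch under that assumption (exponential time, illustrative names), intended only to make the definitions concrete.

```python
from fractions import Fraction
from itertools import combinations

def matchings(edges):
    """Yield every matching as a tuple of edge indices (parallel edges stay distinct)."""
    for k in range(len(edges) + 1):
        for subset in combinations(range(len(edges)), k):
            endpoints = [v for i in subset for v in edges[i]]
            if len(endpoints) == len(set(endpoints)):   # no shared endpoint
                yield subset

def hard_core(edges, activity):
    """Return {matching: nu(matching)} with nu(M) proportional to prod_{e in M} lambda(e)."""
    weight = {}
    for m in matchings(edges):
        w = Fraction(1)
        for i in m:
            w *= activity[i]
        weight[m] = w
    z = sum(weight.values())                            # normalizing constant
    return {m: w / z for m, w in weight.items()}

def marginal(nu, e):
    """Probability that edge index e belongs to the random matching."""
    return sum(p for m, p in nu.items() if e in m)

path = [(0, 1), (1, 2)]
nu = hard_core(path, [1, 1])
```

For the two-edge path with unit activities the matchings are the empty set and the two single edges, each with probability $1/3$, so both marginals equal $1/3$.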

Theorem 2.7 ([28, 37]).

There is a hard-core probability distribution $\nu$ with marginals $(\frac{1}{c},\ldots,\frac{1}{c})$ if and only if there is a fractional $c^{\prime}$ -edge coloring of $G$ with $c^{\prime}<c$ , i.e., if and only if $\chi_{e}^{*}<c$ .

Kahn and Kayll [24] proved that the probability distribution promised by Theorem 2.7 is endowed with very useful approximate stochastic independence properties.

Definition 2.8.

Suppose we choose a random matching $M$ from some probability distribution. We say that an event $Q$ is $t$ -distant from a vertex $v$ if $Q$ is completely determined by the choice of all matching edges at distance at least $t$ from $v$ . We say that $Q$ is $t$ -distant from an edge $e$ if it is $t$ -distant from both endpoints of $e$ .

Theorem 2.9 ([24]).

For any $\delta>0$ , there exists a $K=K(\delta)$ such that for any multigraph $G$ with fractional chromatic index $c$ there is a hard-core distribution $\nu$ with marginals $(\frac{1-\delta}{c},\ldots,\frac{1-\delta}{c})$ such that:

(a) for every $e\in E(G)$, $\lambda(e)\leq\frac{K}{c}$ and hence $\forall v\in V(G)$, $\sum_{e\ni v}\lambda(e)\leq K$;

(b) for every $\epsilon\in(0,1)$, if we choose a matching $M$ according to $\nu$ then, for any edge $e$ and event $Q$ which is $t$-distant from $e$,

$$\Pr(e\in M\mid Q)\in(1\pm\epsilon)\Pr(e\in M),$$

where $t=t(\epsilon)=8(K+1)^{2}\epsilon^{-1}+2$.

We conclude this subsection by stating the result of Jerrum and Sinclair [19] for sampling from hard-core distributions on matchings. We also describe a few of its applications that will be helpful in our proofs. The algorithm of [19] works by simulating a rapidly mixing Markov chain on matchings, whose stationary distribution is the desired hard-core distribution $\nu$ , and outputting the final state.

Theorem 2.10 ([19], Corollary 4.3).

Let $G$ be a multigraph, $\{\lambda(e)\}_{e\in E(G)}$ a vector of activities associated with the edges of $G$ , and $\nu$ the corresponding hard-core distribution. Let $n=|V(G)|$ be the number of vertices of $G$ and define $\lambda^{\prime}=\max\{\max_{u,v\in V(G)}\sum_{e\ni\{u,v\}}\lambda(e),1\}$ . There exists an algorithm that, for any $\epsilon>0$ , runs in time ${\rm poly}(n,\lambda^{\prime},\log\epsilon^{-1})$ and outputs a matching in $G$ drawn from a distribution $\nu^{\prime}$ such that $\|\nu-\nu^{\prime}\|_{\mathrm{TV}}\leq\epsilon$ .

Remark 2.2.

[19] establishes this result for matchings in (simple) graphs. However, the extension to multigraphs is immediate: make the graph simple by replacing each set of multiple edges $e_{1},\ldots,e_{\ell}$ between a pair of vertices $u,v$ by a single edge $e$ of activity $\lambda(e)=\sum_{i}\lambda(e_{i})$ ; then use the algorithm to sample a matching from the hard-core distribution in the resulting simple graph; finally, for each edge $e=\{u,v\}$ in this matching, select one of the corresponding multiple edges $e_{i}\ni\{u,v\}$ with probability $\lambda(e_{i})/\sum_{i}\lambda(e_{i})$ . Note that the running time will depend polynomially on the maximum activity $\lambda^{\prime}$ in the simple graph, as claimed.
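The reduction in this remark has two mechanical parts, merging parallel edges before sampling and lifting the sampled matching back afterwards, which can be sketched as below. The sampler for the simple graph (the Markov chain of [19]) is deliberately left out; the function names and data layout are assumptions of this example.

```python
import random
from collections import defaultdict

def collapse_parallel_edges(edges, activity):
    """Group parallel edges by endpoint pair and sum their activities.

    Returns (groups, merged): groups maps frozenset({u, v}) to the indices of
    the parallel edges, merged maps it to the activity of the single edge.
    """
    groups = defaultdict(list)
    for i, (u, v) in enumerate(edges):
        groups[frozenset((u, v))].append(i)
    merged = {pair: sum(activity[i] for i in idx) for pair, idx in groups.items()}
    return dict(groups), merged

def lift_matching(simple_matching, groups, activity, rng):
    """For each simple edge of the matching, pick one parallel copy e_i
    with probability lambda(e_i) / (sum of the activities of its copies)."""
    lifted = []
    for pair in simple_matching:
        idx = groups[pair]
        lifted.append(rng.choices(idx, weights=[activity[i] for i in idx], k=1)[0])
    return lifted

edges = [(0, 1), (1, 0), (1, 2)]
activity = [1.0, 2.0, 0.5]
groups, merged = collapse_parallel_edges(edges, activity)
lifted = lift_matching([frozenset((0, 1))], groups, activity, random.Random(0))
```

Here the two copies of the edge between vertices 0 and 1 merge into one edge of activity $3$, and lifting selects the second copy with probability $2/3$.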

Note that, via a standard argument, the algorithm of Theorem 2.10 can be used to design a fully-polynomial randomized approximation scheme (f.p.r.a.s.) for the partition function of a hard-core probability distribution on the matchings of a multigraph $G$ — namely, for the quantity $Z_{\lambda}(G)=\sum_{M\in\mathcal{M}(G)}\lambda(M)$ .

Theorem 2.11 ([19], Corollary 4.4).

Let $G$ be a multigraph, $\{\lambda(e)\}_{e\in E(G)}$ a vector of activities associated with the edges of $G$ , and $Z_{\lambda}(G)$ the corresponding partition function. Let $n=|V(G)|$ be the number of vertices of $G$ and define $\lambda^{\prime}=\max\{\max_{u,v\in V(G)}\sum_{e\ni\{u,v\}}\lambda(e),1\}$ . There exists an algorithm that, for any $\epsilon>0$ , runs in time $\mathrm{poly}(n,\lambda^{\prime},1/\epsilon)$ and outputs a quantity $\widetilde{Z}_{\lambda}(G)$ such that $\Pr\left((1-\epsilon)Z_{\lambda}(G)\leq\widetilde{Z}_{\lambda}(G)\leq(1+\epsilon)Z_{\lambda}(G)\right)\geq 3/4.$

Remark 2.3.

The estimate in Theorem 2.11 could be arbitrarily bad with probability $1/4$ . However, this probability can be reduced to any desired $\delta>0$ by performing $O(\log\delta^{-1})$ trials and taking the median.
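The median amplification in this remark can be sketched as follows. The constant $8$ in the trial count is an assumption taken from a Hoeffding bound (the median is bad only if at least half of the trials are bad, which has probability at most $\mathrm{e}^{-k/8}$ when each trial is bad with probability at most $1/4$); the function names are hypothetical.

```python
import math
import statistics

def num_trials(delta):
    """An odd number of trials k with exp(-k / 8) <= delta."""
    k = math.ceil(8 * math.log(1 / delta))
    return max(1, k) | 1  # force k to be odd so the median is a sample

def median_boost(estimate_once, delta):
    """Median of independent runs of an estimator that is good w.p. >= 3/4."""
    runs = [estimate_once() for _ in range(num_trials(delta))]
    return statistics.median(runs)
```

Here `estimate_once` is any zero-argument callable that returns one estimate, for instance one run of the algorithm of Theorem 2.11.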

Theorem 2.11 allows us to design a f.p.r.a.s. for the edge-marginals of a hard-core probability distribution on the matchings of a multigraph $G$ .

Corollary 2.12.

Let $G$ be a multigraph, $\{\lambda(e)\}_{e\in E(G)}$ a vector of activities associated with the edges of $G$ , and $\nu$ the corresponding hard-core distribution. Let $n=|V(G)|$ be the number of vertices of $G$ and define $\lambda^{\prime}=\max\{\max_{u,v\in V(G)}\sum_{e\ni\{u,v\}}\lambda(e),1\}$ . There exists an algorithm that, for any edge $e$ , $\epsilon>0$ and $\delta>0$ , runs in time $\mathrm{poly}(n,\lambda^{\prime},1/\epsilon,\log\delta^{-1})$ and outputs a quantity $\widetilde{\nu}_{e}$ such that $\Pr\left((1-\epsilon)\nu_{e}\leq\widetilde{\nu}_{e}\leq(1+\epsilon)\nu_{e}\right)\geq 1-\delta$ , where $\nu_{e}$ is the marginal of $\nu$ at $e$ .

Proof.

Let $G_{e}$ be the multigraph obtained by removing $e$ along with every other edge of $G$ adjacent to it. Let $Z_{\lambda}(G)$, $Z_{\lambda}(G_{e})$ denote the partition functions corresponding to multigraphs $G,G_{e}$ with respect to $\{\lambda(e)\}_{e\in E(G)}$, respectively. Observe now that $\nu_{e}=\lambda(e)\cdot Z_{\lambda}(G_{e})/Z_{\lambda}(G)$. Using the f.p.r.a.s. promised by Theorem 2.11 (and Remark 2.3) to get appropriately accurate estimates for $Z_{\lambda}(G),Z_{\lambda}(G_{e})$, we directly obtain an estimate for $\nu_{e}$ that satisfies the guarantees of Corollary 2.12.

∎
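The identity $\nu_{e}=\lambda(e)\cdot Z_{\lambda}(G_{e})/Z_{\lambda}(G)$ used in the proof can be checked by brute force on a small multigraph. The sketch below uses exact enumeration in place of the f.p.r.a.s. of Theorem 2.11; all function names are illustrative.

```python
from itertools import combinations

def matchings(edges):
    """Yield every matching as a tuple of edge indices (brute force)."""
    for r in range(len(edges) + 1):
        for subset in combinations(range(len(edges)), r):
            endpoints = [x for i in subset for x in edges[i][:2]]
            if len(endpoints) == len(set(endpoints)):
                yield subset

def weight(edges, matching):
    """lambda(M): product of the activities of the edges of M."""
    w = 1.0
    for i in matching:
        w *= edges[i][2]
    return w

def partition_function(edges):
    return sum(weight(edges, m) for m in matchings(edges))

def marginal_via_ratio(edges, k):
    """nu_e = lambda(e) * Z(G_e) / Z(G), with G_e as in the proof."""
    u, v, lam = edges[k]
    rest = [e for e in edges if e[0] not in (u, v) and e[1] not in (u, v)]
    return lam * partition_function(rest) / partition_function(edges)

def marginal_direct(edges, k):
    """Probability that edge k belongs to a hard-core random matching."""
    z = partition_function(edges)
    return sum(weight(edges, m) for m in matchings(edges) if k in m) / z
```

For edges `(0, 1, 1.0)`, `(0, 1, 3.0)`, `(1, 2, 2.0)`, `(2, 3, 0.5)` both functions give $4.5/9.5$ for the second edge.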

Finally, one can use Theorem 2.11 as a subroutine in the algorithm of Singh and Vishnoi [41] to obtain the following result.

Corollary 2.13.

Let $G$ be a multigraph on $n$ vertices and let $\delta\in(0,1)$ be a parameter. Let $\nu=\nu_{\delta}$ be the hard-core probability distribution over the matchings of $G$ promised by Theorem 2.9. For every $\eta>0$ , there exists a $\mathrm{poly}(n,\log\eta^{-1},\log\delta^{-1})$ -time algorithm that computes a set of edge activities $\{\lambda^{\prime}(e)\}_{e\in E(G)}$ such that the corresponding hard-core distribution $\nu^{\prime}$ satisfies $\|\nu-\nu^{\prime}\|_{\mathrm{TV}}\leq\eta$ .

Proof.

Corollary 2.13 follows in a straightforward way from the main results of Singh and Vishnoi [41] and Jerrum and Sinclair [19]. Briefly, the main result of [41] states that finding a distribution that approximates $\nu$ can be seen as the solution of a max-entropy distribution estimation problem which can be efficiently solved given a “generalized counting oracle” for $\nu$ . The latter oracle is provided by Theorem 2.11. ∎

3 Causality, Lopsidependency and Approximate Resampling Oracles

In this section we show a connection between Theorem 2.1 and Theorem 2.4. While this section is not essential to the proof of our main results, it does provide useful intuition since it implies the following natural approach to making applications of the Lopsided LLL algorithmic: we start designing a local search algorithm for addressing the flaws that correspond to bad events by considering the family of probability distributions $\{\rho_{i}(\sigma,\cdot)\}_{i\in[m],\sigma\in f_{i}}$ whose supports induce a causality graph that coincides with the lopsidependency graph of the Lopsided LLL application of interest. This is typically a straightforward task. The key to successful implementation is our ability to make the way in which the algorithm addresses flaws sufficiently compatible with the underlying probability measure $\mu$ . To make this precise, we first recall an algorithmic interpretation of the notion of charges defined in (2).

As shown in [2], the charge $\gamma_{i}$ captures the compatibility between the actions of the algorithm for addressing flaw $f_{i}$ and the measure $\mu$ . To see this, consider the probability, $\nu_{i}(\tau)$ , of ending up in state $\tau$ after (i) sampling a state $\sigma\in f_{i}$ according to $\mu$ , and then (ii) addressing $f_{i}$ at $\sigma$ . Define the distortion associated with $f_{i}$ as

$$d_{i}:=\max_{\tau\in\Omega}\frac{\nu_{i}(\tau)}{\mu(\tau)}\geq 1,$$

i.e., the maximum possible inflation of a state probability incurred by addressing $f_{i}$ (relative to its probability under $\mu$ , and averaged over the initiating state $\sigma\in f_{i}$ according to $\mu$ ). Now observe from (2) that

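$$\gamma_{i}=\max_{\tau\in\Omega}\frac{1}{\mu(\tau)}\sum_{\sigma\in f_{i}}\mu(\sigma)\rho_{i}(\sigma,\tau)=d_{i}\cdot\mu(f_{i}).$$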

An algorithm for which $d_{i}=1$ is called a resampling oracle [17] for $f_{i}$ , and notice that it perfectly removes the conditional of the addressed flaw. However, designing resampling oracles for sophisticated measures can be impossible by local search. This is because small, but non-vanishing, correlations can travel arbitrarily far in $\Omega$ . Thus, allowing for some distortion can be very helpful, especially in cases where correlations decay with distance.
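A toy computation may help to fix ideas. The sketch below takes $\nu_{i}(\tau)=\frac{1}{\mu(f_{i})}\sum_{\sigma\in f_{i}}\mu(\sigma)\rho_{i}(\sigma,\tau)$, reads the distortion as $\max_{\tau}\nu_{i}(\tau)/\mu(\tau)$, and compares it with the charge from (2). The state space, the flaw and the function name are illustrative assumptions.

```python
from fractions import Fraction
from itertools import product

def distortion_and_charge(mu, flaw, rho):
    """mu: state -> probability; flaw: set of states; rho[s]: state -> probability."""
    mu_f = sum(mu[s] for s in flaw)
    # nu(t): sample s from mu conditioned on the flaw, then address the flaw at s.
    nu = {t: sum(mu[s] * rho[s].get(t, 0) for s in flaw) / mu_f for t in mu}
    d = max(nu[t] / mu[t] for t in mu)
    gamma = max(sum(mu[s] * rho[s].get(t, 0) for s in flaw) / mu[t] for t in mu)
    return d, gamma, mu_f

states = list(product((0, 1), repeat=2))
mu = {s: Fraction(1, 4) for s in states}   # uniform measure on two bits
flaw = {(1, 1)}                            # the flaw: both bits equal 1

# Resampling both bits uniformly is a resampling oracle: d = 1.
oracle = {(1, 1): {s: Fraction(1, 4) for s in states}}
# Always moving to (0, 0) inflates that state's probability: d = 4.
greedy = {(1, 1): {(0, 0): Fraction(1)}}
```

With these inputs, `distortion_and_charge(mu, flaw, oracle)` returns distortion $1$ and charge $1/4$, while the `greedy` rule gives distortion $4$ and charge $1$; in both cases the charge equals the distortion times $\mu(f)$.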

Theorem 3.1 below shows that Theorem 2.4 is the algorithmic counterpart of Theorem 2.1.

Theorem 3.1.

Let $F=\{f_{1},\ldots,f_{m}\}$ be a family of flaws over a state space $\Omega$, let $\mathcal{A}$ be an algorithm with causality graph $C$ with neighborhoods $\Gamma(\cdot)$, and let $\mu$ be a measure over $\Omega$. Then for each $i\in[m]$ and each $S\subseteq F\setminus\Gamma(i)$ we have

$$\mu\Bigl(f_{i}\Bigm|\bigcap_{j\in S}\overline{f_{j}}\Bigr)\leq\gamma_{i},$$

where the $\gamma_{i}$ are the charges of the algorithm as defined in (2).

Proof.

Let $F_{S}:=\bigcap_{j\in S}\overline{f_{j}}$ . Observe that

[TABLE]

where the second equality holds because each $\rho_{i}(\sigma,\cdot)$ is a probability distribution, and the third by the definition of causality and the fact that $S\subseteq F\setminus\Gamma(i)$ . Now notice that changing the order of summation in (7) gives

[TABLE]

∎

In words, Theorem 3.1 shows that causality graph $C$ is a lopsidependency graph with respect to measure $\mu$ with $b_{i}=\gamma_{i}$ for all $i\in[m]$ . Thus, when designing an algorithm for an application of Theorem 2.1 using Theorem 3.1, we have to make sure that the induced causality graph coincides with the lopsidependency graph, and that the measure distortion induced when addressing flaw $f_{i}$ is sufficiently small so that the resulting charge $\gamma_{i}$ is bounded above by $b_{i}$ .

4 Edge Coloring Multigraphs: Proof of Theorem 1.3

We follow the exposition of the proof of Kahn in [31]. Note that throughout the proof we assume that the maximum degree $\Delta$ of the input multigraph $G$ satisfies $\Delta\geq\Delta_{0}$ for some appropriately large constant $\Delta_{0}$ .

The key to the proof of Theorem 1.3 is the following lemma.

Lemma 4.1.

For all $\epsilon>0$ , there exists $\chi_{0}=\chi_{0}(\epsilon)$ such that if $\chi_{e}^{*}(G)\geq\chi_{0}$ then we can find $N=\lfloor\chi_{e}^{*}(G)^{\frac{3}{4}}\rfloor$ matchings in $G$ whose deletion leaves a multigraph $G^{\prime}$ with $\chi_{e}^{*}(G^{\prime})\leq\chi_{e}^{*}(G)-(1+\epsilon)^{-1}N$ in expected $\mathrm{poly}(n,\ln\frac{1}{\epsilon})$ time.

Remark 4.1.

Since $\chi_{e}^{*}(G)=\mathrm{poly}(n)$, we may assume that $\epsilon\geq\frac{1}{\mathrm{poly}(n)}$ without loss of generality. Therefore, the expected running time of the algorithm promised by Lemma 4.1 is $\mathrm{poly}(n)$.

Using the algorithm of Lemma 4.1 recursively, for every $\epsilon>0$ we can efficiently find an edge coloring of $G$ using at most $(1+\epsilon)\chi_{e}^{*}+\chi_{0}$ colors as follows. First, we compute $\chi_{e}^{*}(G)$ using the algorithm of Padberg and Rao [35]. If $\chi_{e}^{*}\geq\chi_{0}$ , then we apply Lemma 4.1 to get a multigraph $G^{\prime}$ with $\chi_{e}^{*}(G^{\prime})\leq\chi_{e}^{*}(G)-(1+\epsilon)^{-1}N$ . We can now color $G^{\prime}$ recursively using at most $(1+\epsilon)\chi_{e}^{*}(G^{\prime})+\chi_{0}\leq(1+\epsilon)\chi_{e}^{*}(G)-N+\chi_{0}$ colors. Using one extra color for each of the $N$ matchings promised by Lemma 4.1, we can then complete the coloring of $G$ , proving the claim. In the base case where $\chi_{e}^{*}(G)<\chi_{0}$ , we color $G$ greedily using $2\Delta-1$ colors. The fact that $2\Delta-1\leq 2\chi_{e}^{*}-1<\chi_{e}^{*}+\chi_{0}$ concludes the proof of Theorem 1.3 as the number of recursive calls is at most $n$ .
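The base case of the recursion relies on the standard fact that a greedy pass properly colors the edges of any multigraph with at most $2\Delta-1$ colors, since each edge shares an endpoint with at most $2(\Delta-1)$ others. A minimal sketch, assuming a loop-free multigraph given as a list of vertex pairs (the function name is illustrative):

```python
def greedy_edge_coloring(edges):
    """Color edges in order, giving each the smallest color free at both ends."""
    used = {}      # vertex -> set of colors already present at that vertex
    colors = []
    for u, v in edges:
        taken = used.setdefault(u, set()) | used.setdefault(v, set())
        c = 0
        while c in taken:
            c += 1
        colors.append(c)
        used[u].add(c)
        used[v].add(c)
    return colors
```

Parallel edges are handled automatically, because both copies see the colors already placed at their common endpoints.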

In the following sections, we prove Lemma 4.1. In Section 4.1 we describe the local search algorithm behind Lemma 4.1, and in Section 4.2 we prove its convergence. In Sections 4.3, 4.4 we prove two important auxiliary lemmas that are used in our convergence proof.

4.1 The Algorithm

Observe that we only need to prove Lemma 4.1 for $\epsilon<\frac{1}{10}$ since, clearly, if it holds for $\epsilon$ then it holds for all $\epsilon^{\prime}>\epsilon$ . So we fix $\epsilon\in(0,0.1)$ and let $c^{*}=\chi_{e}^{*}(G)-(1+\epsilon)^{-1}N$ . Our goal will be to delete $N$ matchings from $G$ to get a multigraph $G^{\prime}$ which has fractional chromatic index at most $c^{*}$ .

The flaws.

Let $\Omega=\mathcal{M}(G)^{N}$ be the set of possible $N$ -tuples of matchings of $G$ . For a state $\sigma=(M_{1},\ldots,M_{N})\in\Omega$ let $G_{\sigma}$ denote the multigraph obtained by deleting the $N$ matchings $M_{1},\ldots,M_{N}$ from $G$ . For a vertex $v\in V(G)$ we define $d_{G_{\sigma}}(v)$ to be the degree of $v$ in $G_{\sigma}$ . We now define the following flaws. For every vertex $v\in V(G)$ let

$$f_{v}=\Bigl\{\sigma\in\Omega:d_{G_{\sigma}}(v)>c^{*}-\frac{\epsilon}{4}N\Bigr\}.$$

For every connected subgraph $H$ of $G$ with an odd number of vertices and such that (i) $|V(H)|\leq\frac{8\Delta}{\epsilon N}$ , and (ii) $|E(H)|>\left(\frac{|V(H)|-1}{2}\right)c^{*}$ , let

$$f_{H}=\{\sigma\in\Omega:H\subseteq G_{\sigma}\}.$$

The following lemma implies that it suffices to find a flawless state. (This lemma was proved in [22], but we include a proof here for completeness.)

Lemma 4.2 ([22]).

Any flawless state $\sigma$ satisfies $\chi_{e}^{*}(G_{\sigma})\leq c^{*}$ .

Proof.

Edmonds’ characterization [9] of the matching polytope implies that the fractional chromatic index of $G_{\sigma}$ is at most $c^{*}$ if

1. $\forall v:d_{G_{\sigma}}(v)\leq c^{*}$; and

2. $\forall H\subseteq G_{\sigma}$ with an odd number of vertices: $|E(H)|\leq\frac{|V(H)|-1}{2}c^{*}$.

Now clearly, addressing every flaw of the form $f_{v}$ establishes condition $1$ . By summing degrees it also implies that, for every subgraph $F$ , $|E(F)|\leq\frac{|V(F)|(c^{*}-\epsilon N/4)}{2}\leq\ \frac{|V(F)|}{2}c^{*}$ .

Moreover, any odd subgraph $H$ can be decomposed into a connected component $H^{\prime}$ with an odd number of vertices, and a (possibly empty) subgraph $F$ with an even number of vertices. Since there are no edges between $F$ and $H^{\prime}$ , in the absence of $f_{v}$ flaws we obtain

[TABLE]

Thus it suffices to prove condition 2 for the connected odd subgraph $H^{\prime}$ , for if $|E(H^{\prime})|\leq(|V(H^{\prime})|-1)c^{*}/2$ then we have

[TABLE]

Now, again by summing degrees, we see that if no $f_{v}$ flaw is present then condition $2$ can fail only for $H$ with fewer than $\frac{8\Delta}{\epsilon N}$ vertices, concluding the proof. Indeed, in the absence of $f_{v}$ flaws, we have $|E(H)|\leq|V(H)|(c^{*}-\epsilon N/4)/2$ and, since $c^{*}\leq\chi_{e}^{*}(G)\leq 2\Delta$, if $|V(H)|(c^{*}-\epsilon N/4)/2\geq(|V(H)|-1)c^{*}/2$ then $|V(H)|\leq c^{*}/((\epsilon/4)N)\leq 8\Delta/(\epsilon N)$. ∎
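The two conditions in this proof amount to the formula $\chi_{e}^{*}(G)=\max\{\Delta(G),\max_{H}2|E(H)|/(|V(H)|-1)\}$, where $H$ ranges over induced subgraphs on an odd number (at least three) of vertices. The brute-force sketch below evaluates this formula on tiny multigraphs. It is exponential in $n$ and is only an illustration, not the Padberg and Rao algorithm; the function name is hypothetical.

```python
from fractions import Fraction
from itertools import combinations

def fractional_chromatic_index(vertices, edges):
    """Smallest c satisfying the degree and odd-set constraints (brute force)."""
    deg = {v: 0 for v in vertices}
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    best = Fraction(max(deg.values(), default=0))
    # Induced subgraphs suffice: adding edges only tightens the constraint.
    for size in range(3, len(vertices) + 1, 2):
        for subset in combinations(vertices, size):
            inside = set(subset)
            m = sum(1 for u, v in edges if u in inside and v in inside)
            best = max(best, Fraction(2 * m, size - 1))
    return best
```

For the triangle this returns $3$ and for the $5$-cycle it returns $5/2$; in both cases the odd-set constraint, not the maximum degree, is the binding one.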

To describe an efficient algorithm for finding flawless states we need to (i) determine the initial distribution of the algorithm and show that it is efficiently samplable; (ii) show how to address each flaw efficiently; (iii) show that the expected number of steps of the algorithm is polynomial; and finally (iv) show that we can search for flaws in polynomial time, so that each step is efficiently implementable.

The initial distribution.

Apply Theorem 2.9 with $\delta=\frac{\epsilon}{4}$ . Let $\nu$ be the promised hard-core probability distribution, $\lambda=\{\lambda(e)\}$ the vector of activities associated with it, and $K$ the corresponding constant. Note that the activities $\lambda(e)$ defining $\nu$ are not readily available. However, recalling Corollary 2.13 we see that we can efficiently compute a set of activities that gives an arbitrarily good approximation to the desired distribution $\nu$ .

For a parameter $\eta>0$ and a distribution $p$ , we say that we $\eta$ -approximately sample from $p$ to express the fact that we sample from a distribution $\tilde{p}$ such that $\|p-\tilde{p}\|_{\mathrm{TV}}\leq\eta$ . Set $\eta=\frac{1}{n^{\beta}}$ , where $\beta$ is a sufficiently large constant to be specified later, and let $\nu^{\prime}$ be the distribution promised by Corollary 2.13. The initial distribution of our algorithm, $\theta$ , is obtained by $\eta$ -approximately sampling $N$ random matchings (independently) from $\nu^{\prime}$ . Observe that $\|\theta-\mu\|_{\mathrm{TV}}\leq 2\eta N$ , where $\mu$ denotes the probability distribution over $\Omega$ induced by taking $N$ independent samples from $\nu$ .

Addressing flaws.

For an integer $d>0$ and a connected subgraph $H$ , let $S_{<d}(H)$ be the set of vertices within distance strictly less than $d$ of a vertex $u\in V(H)$ . Given a state $\sigma=(M_{1},\ldots,M_{N})$ , a subgraph $H$ , and $d>0$ let

$$Q_{H}(d,\sigma)=\bigl(M_{1}-S_{<d}(H),\ldots,M_{N}-S_{<d}(H)\bigr),$$

where we define $M-X=M\cap E(G-X)$. Moreover, let $Q_{H}^{i}(d,\sigma)=M_{i}-S_{<d}(H)$ denote the $i$-th entry of $Q_{H}(d,\sigma)$. (In words, $Q_{H}^{i}(d,\sigma)$ is the set of edges of $M_{i}$ with the property that both their endpoints are at distance at least $d$ from $H$.) Finally, let $G_{<d+1}(H)$ be the multigraph induced by $S_{<d+1}(H)$ and $\mathcal{M}_{d+1}^{i}=\mathcal{M}_{d+1}^{i}(H,\sigma)$ be the set of matchings of $G_{<d+1}(H)$ that are “compatible” with $Q_{H}^{i}(d,\sigma)$. That is, for any matching $M$ in $\mathcal{M}_{d+1}^{i}$ we have that $M\cup Q_{H}^{i}(d,\sigma)$ is also a matching of $G$. More specifically, note that $\mathcal{M}^{i}_{d+1}(H,\sigma)$ corresponds to the set of matchings of the following multigraph $G_{i,<d+1}(H)$. Let $V_{i,d}$ denote the set of vertices of $S_{<d+1}(H)$ that belong to edges in $Q_{H}^{i}(d,\sigma)$. Multigraph $G_{i,<d+1}(H)$ is induced by $S_{<d+1}(H)\setminus V_{i,d}$.
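The sets $S_{<d}(H)$ and the restricted matchings $M_{i}-S_{<d}(H)$ are straightforward to compute with a breadth-first search. A minimal sketch, assuming the graph is given as an adjacency dictionary and a matching as a set of vertex pairs (both representations and the function names are illustrative):

```python
from collections import deque

def ball(adj, sources, d):
    """S_{<d}: vertices at distance strictly less than d from some source (d >= 1)."""
    dist = {s: 0 for s in sources}
    queue = deque(sources)
    while queue:
        x = queue.popleft()
        if dist[x] + 1 >= d:
            continue  # neighbors of x would be at distance >= d
        for y in adj[x]:
            if y not in dist:
                dist[y] = dist[x] + 1
                queue.append(y)
    return set(dist)

def restrict_matching(matching, removed):
    """M - X: the edges of M with both endpoints outside X."""
    return {(u, v) for (u, v) in matching
            if u not in removed and v not in removed}

def far_part(adj, sources, d, matching):
    """Q^i_H(d, sigma) for one matching M_i, with H given by its vertex set."""
    return restrict_matching(matching, ball(adj, sources, d))
```

On the path $0-1-\cdots-5$ with $H=\{0\}$ and $d=3$, the ball is $\{0,1,2\}$, so the matching $\{(0,1),(2,3),(4,5)\}$ restricts to $\{(4,5)\}$.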

We consider the procedure Resample below which takes as input a connected subgraph $H$ , a state $\sigma$ and a positive integer $d\leq n$ , and which will be used to address flaws.

Throughout the proof, we fix the parameter

[TABLE]

To address $f_{v},f_{H}$ in state $\sigma$ , we invoke procedures Resample $(\{v\},\sigma,t)$ and Resample $(H,\sigma,t)$ , respectively.

Searching for flaws.

Notice that we can compute $c^{*}$ in polynomial time using the algorithm of Padberg and Rao [35]. Therefore, given a state $\sigma\in\Omega$ , we can search for flaws of the form $f_{v}$ in polynomial time. However, the flaws of the form $f_{H}$ are potentially exponentially many, so a brute-force search does not suffice for our purposes.

Fortunately, the result of Padberg and Rao provides a polynomial time oracle for this problem as well. Recall Edmonds’ characterization used in the proof of Lemma 4.2. The constraints over odd subgraphs $H$ are called matching constraints. Recall further that in the proof of Lemma 4.2 we showed that, in the absence of $f_{v}$ -flaws, the only matching constraints that could possibly be violated correspond to $f_{H}$ flaws. On the other hand, the oracle of Padberg and Rao can decide in polynomial time whether $G$ has a fractional $c$ -coloring or return a violated matching constraint, for every $c\geq 0$ . Hence, if our algorithm prioritizes $f_{v}$ flaws over $f_{H}$ flaws, this oracle can be used to detect the latter in polynomial time.

4.2 Proof of Lemma 4.1

We are left to show that the expected number of steps of the algorithm is polynomial and that each step can be executed in polynomial time. To that end, we will show that both of these statements are true assuming that the initial distribution $\theta$ is $\mu$ instead of approximately $\mu$ , and that in Lines 4, 5 of the procedure Resample $(H,\sigma,d)$ we perfectly sample from the hard-core probability distribution induced by activities $\{\lambda(e)\}_{e\in E(G_{i,<d}(H))}$ instead of $\eta$ -approximately sampling from $p$ . We can maximally couple the approximate and ideal distributions, and then take the constant $\beta$ in the definition of the approximation parameter $\eta$ to be sufficiently large. The latter implies that the probability that the coupling will fail during the execution of the algorithm is negligible (i.e., at most $\frac{1}{n^{c}}$ ). Since the fractional chromatic index of a multigraph can be computed in polynomial time, we can absorb the probability that the coupling fails into the polynomial expected running time by executing our algorithm sufficiently many times. That is, we execute our algorithm for a number of steps that is twice its expected running time, and if the edge coloring it produces is not a desirable one, we repeat the process.

For an integer $d>0$ and a vertex $v$ , let $F_{d}(v)$ be the set of flaws indexed by a vertex of $S_{<d}(v)$ or a subgraph $H$ intersecting $S_{<d}(v)$ . For each set $H$ for which we have defined $f_{H}$ we let $F_{d}(H)=\bigcup_{v\in V(H)}F_{d}(v)$ . For each flaw $f_{v}$ we define the causality neighborhood $\Gamma(f_{v})=F_{t+2}(v)$ , and for each flaw $f_{H}$ we define $\Gamma(f_{H})=F_{t+2}(H)$ , where $t$ is as defined in the previous subsection. Notice that this is a valid choice because flaw $f_{v}$ can only cause flaws in $F_{t+1}(v)$ and flaw $f_{H}$ can only cause flaws in $F_{t+1}(H)$ . The reason why we choose these neighborhoods to be larger than seemingly necessary is because, as we will see, with respect to this causality graph our algorithm is commutative, allowing us to apply Theorem 2.6.

Lemma 4.3.

Let $f\in\{f_{v},f_{H}\}$ for a vertex $v$ and a connected subgraph $H$ of $G$ with an odd number of vertices and let $D=\Delta^{t+2\Delta^{\frac{1}{3}}+4}$ . We have:

(a) $\gamma_{f}\leq\frac{1}{2\mathrm{e}D}$;

(b) $|\Gamma(f)|\leq D$,

where the charges are computed with respect to the measure $\mu$ and the algorithm that samples from the ideal distributions.

Lemma 4.4.

For each pair of flaws $f\nsim g$ , the matrices $A_{f},A_{g}$ commute.

The proof of Lemma 4.3 can be found in Section 4.3. Lemma 4.4 establishes that our algorithm is commutative with respect to the causality relation $\sim$ induced by neighborhoods $\Gamma(\cdot)$ . Its proof can be found in Section 4.4.

Now, setting $x_{f}=\frac{1}{1+\max_{f^{\prime}\in F}|\Gamma(f^{\prime})|}$ for each flaw $f$ , we see that condition (3) with $\epsilon=1/4$ is implied by

[TABLE]

which is true for large enough $\Delta$ according to Lemma 4.3. Notice further that the causality graph induced by $\sim$ can be covered by $n$ cliques, one for each vertex of $G$ , with potentially further edges between them. Indeed, flaws indexed by subgraphs that contain a certain vertex of $G$ form a clique in the causality graph. Combining Lemma 4.4 with the latter observation, we are able to apply Theorem 2.6, which implies that our algorithm terminates after an expected number of at most $O\bigl{(}\max_{\sigma\in\Omega}\frac{\theta(\sigma)}{\mu(\sigma)}\cdot n\log(n\log(1/\delta))\bigr{)}=O(n\log n)$ steps. (This is because we assume that $\theta=\mu$ per our discussion above.)
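As a sanity check on the parameter choice, the sketch below evaluates the slack in condition (3) with $\epsilon=1/4$ in the worst case permitted by Lemma 4.3, namely $\gamma_{f}=\frac{1}{2\mathrm{e}D}$, every neighborhood of size exactly $D$, and $x_{f}=\frac{1}{1+D}$. Treating all neighborhoods as having size $D$ is a simplifying assumption, and the function name is hypothetical.

```python
import math

def lll_slack(D):
    """(3/4) * x * (1 - x)^D - 1 / (2 e D) with x = 1 / (1 + D)."""
    x = 1.0 / (1 + D)
    # (1 - x)^D computed through log1p for numerical stability at large D.
    rhs = 0.75 * x * math.exp(D * math.log1p(-x))
    lhs = 1.0 / (2 * math.e * D)
    return rhs - lhs
```

The slack is positive because $(1-\frac{1}{1+D})^{D}\geq\mathrm{e}^{-1}$ and $\frac{3}{4(D+1)}\geq\frac{1}{2D}$ whenever $D\geq 2$.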

This completes the proof of Lemma 4.1 and hence, as explained at the beginning of Section 4, Theorem 1.3 follows. It remains, however, to go back and prove Lemmas 4.3 and 4.4, which we do in the next two subsections.

4.3 Proof of Lemma 4.3

Proof of part (a).

We will need the following key lemma, which was essentially proved in [22]. Its proof can be found in Appendix A. Recall that $\mu$ is the distribution over $\Omega$ induced by taking $N$ independent samples from $\nu$ .

Lemma 4.5.

For any random state $\sigma$ distributed according to $\mu$ :

(i) for every flaw $f_{v}$ and state $\tau\in\Omega$: $\mu(\sigma\in f_{v}\mid Q_{v}(t,\sigma)=Q_{v}(t,\tau))\leq\frac{1}{2\mathrm{e}D}$; and

(ii) for every flaw $f_{H}$ and state $\tau\in\Omega$: $\mu(\sigma\in f_{H}\mid Q_{H}(t,\sigma)=Q_{H}(t,\tau))\leq\frac{1}{2\mathrm{e}D}$.

We show the proof of part (a) of Lemma 4.3 only for the case of $f_{v}$-flaws, as the proof for $f_{H}$-flaws is very similar. Specifically, our goal will be to prove that

[TABLE]

Lemma 4.5 then concludes the proof.

Recalling the definition of $\gamma_{f}$ from (2) we see that, in order to prove (9), it suffices to show that, for $\sigma$ distributed according to $\mu$ and any state $\tau\in\Omega$ ,

[TABLE]

Indeed, maximizing (10) over $\tau\in\Omega$ yields (9) and completes the proof.

Fix $\tau=(M_{1},M_{2},\ldots,M_{N})\in\Omega$. To compute the sum on the left-hand side of (10) we need to determine the set of states $\mathrm{In}_{v}(\tau)\subseteq f_{v}$ for which $\rho_{f_{v}}(\omega,\tau)>0$. To do this, recall that given as input a state $\omega=(M_{1}^{\omega},M_{2}^{\omega},\ldots,M_{N}^{\omega})\in f_{v}$, procedure Resample $(v,\omega,t)$ modifies one by one each matching $M_{i}^{\omega}$, $i\in[N]$, “locally” around $v$. In particular, observe that the support of the distribution for updating $M_{i}^{\omega}$ is exactly the set $\mathcal{M}_{t+1}^{i}(v,\omega)$, and hence it must be the case that $Q_{v}^{i}(t,\omega)=Q_{v}^{i}(t,\tau)$ for every $i\in[N]$ and state $\omega\in\mathrm{In}_{v}(\tau)$. This also implies that, for every such $\omega$,

[TABLE]

Recall now that we have assumed that the hard-core distribution in Lines 4, 5 of Resample $(v,\omega,t)$ is induced by the ideal vector of activities $\lambda$ . In particular, we have

[TABLE]

since $Q_{v}^{i}(t,\omega)=Q_{v}^{i}(t,\tau)$ , which combined with (11) yields

[TABLE]

Letting $\sigma=(M_{1}^{\sigma},\ldots,M_{N}^{\sigma})$ be a random state distributed according to $\mu$ we see that, by definition, the right-hand side of (13) equals:

[TABLE]

Finally, combining (13) and (14), we obtain that

[TABLE]

concluding the proof of the first part of Lemma 4.3.

∎

Proof of part (b).

For this proof we will use the following well-known proposition, which we also prove here for completeness.

Proposition 4.6.

For every vertex $v$ there are at most $(\mathrm{e}\Delta)^{s-1}$ sets of vertices $S$ such that (i) $v\in S$ ; (ii) $|S|=s$ ; and (iii) $G[S]$ is connected.

Proof.

The number of such sets is bounded by the number of distinct $s$ -vertex trees which are rooted at $v$ . The latter quantity is bounded by the number of distinct $\Delta$ -ary rooted trees with $s$ vertices, which is

$$T_{\Delta}(s):=\frac{1}{(\Delta-1)s+1}\binom{\Delta s}{s};$$

see e.g. [26]. It is not hard to see that $T_{\Delta}(s)\leq(\mathrm{e}\Delta)^{s-1}$ for $s\in\{1,2\}$ and $\Delta\geq 1$ . For $s\geq 3$ , we obtain

[TABLE]

for sufficiently large $\Delta$ , concluding the proof. Note that in deriving the first inequality we used that ${a\choose b}\leq(a\cdot\mathrm{e}/b)^{b}$ for positive integers $b\leq a$ .

∎
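Proposition 4.6 is easy to test by brute force on small graphs. The sketch below counts the connected vertex sets of size $s$ that contain a given vertex and compares the count with $(\mathrm{e}\Delta)^{s-1}$. The adjacency-dictionary representation and function names are illustrative, and the enumeration is exponential, so it is meant for tiny examples only.

```python
import math
from itertools import combinations

def is_connected(adj, vertex_set):
    """Check that the subgraph induced by vertex_set is connected."""
    vertex_set = set(vertex_set)
    start = next(iter(vertex_set))
    seen, stack = {start}, [start]
    while stack:
        x = stack.pop()
        for y in adj[x]:
            if y in vertex_set and y not in seen:
                seen.add(y)
                stack.append(y)
    return seen == vertex_set

def count_connected_sets(adj, v, s):
    """Number of sets S with v in S, |S| = s and G[S] connected."""
    others = [u for u in adj if u != v]
    return sum(1 for rest in combinations(others, s - 1)
               if is_connected(adj, (v,) + rest))

def bound(max_degree, s):
    """The bound (e * Delta)^(s - 1) of Proposition 4.6."""
    return (math.e * max_degree) ** (s - 1)
```

On the $6$-cycle ($\Delta=2$) there are exactly three connected $3$-sets through any vertex, well below the bound $(2\mathrm{e})^{2}\approx 29.6$.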

To prove part (b) of Lemma 4.3 it suffices to show that

[TABLE]

for every vertex $v$. Indeed, (16) clearly suffices if $f=f_{v}$. If $f=f_{H}$, notice that every $H$ for which we define $F_{t+2}(H)$ has fewer than $\Delta$ vertices (assuming $\Delta$ is sufficiently large) and, therefore, every $F_{t+2}(H)$ has fewer than $D=\Delta^{t+2\Delta^{1/3}+4}$ elements.

Towards proving (16), first notice that every set $S_{<t+2}(v)$ has at most $\Delta^{t+2}$ elements. Moreover, using Proposition 4.6 we obtain that, for sufficiently large $\Delta$, every vertex $u$ is in at most

[TABLE]

sets $H$ corresponding to a flaw $f_{H}$ . Note that in deriving the second inequality above we used the fact that $N=\lfloor\chi_{e}^{*}(G)^{3/4}\rfloor=\Theta(\Delta^{3/4})$ , which in turn implies that $\frac{8\Delta}{\epsilon N}\leq 2\Delta^{1/3}$ for sufficiently large $\Delta$ . Overall:

[TABLE]

for sufficiently large $\Delta$ , concluding the proof. ∎

4.4 Proof of Lemma 4.4

Fix states $\sigma_{1}=(M_{1},M_{2},\ldots,M_{N})\in f$ and $\sigma_{2}=(M_{1}^{\prime},M_{2}^{\prime},\ldots,M_{N}^{\prime})\in g$ such that $f\not\sim g$ . To prove that the matrices $A_{f},A_{g}$ commute, we need to show that for every such pair

[TABLE]

To that end, let $H_{f},H_{g}$ be the subgraphs (which may consist only of a single vertex) associated with flaws $f$ and $g$ , respectively. Since $f\nsim g$ we have $\min_{u\in V(H_{f}),v\in V(H_{g})}\mathrm{dist}(u,v)\geq t+2$ , where $\mathrm{dist}(u,v)$ denotes the length of the shortest path between $u$ and $v$ . Notice that this implies $S_{<t+2}(H_{f})\cap S_{<t+2}(H_{g})=\emptyset$ .

Consider a pair of transitions $\sigma_{1}\xrightarrow{f}\tau$, $\tau\xrightarrow{g}\sigma_{2}$, where $\tau=(M_{1}^{\prime\prime},\ldots,M_{N}^{\prime\prime})$, and so that $\rho_{f}(\sigma_{1},\tau)>0$, $\rho_{g}(\tau,\sigma_{2})>0$. The facts that procedure Resample $(H_{f},\sigma_{1},t)$ only modifies the input set of matchings locally within $S_{<t+1}(H_{f})$, that $\rho_{g}(\tau,\sigma_{2})>0$, and that $S_{<t+2}(H_{f})\cap S_{<t+2}(H_{g})=\emptyset$ imply that (i) $\sigma_{1}\in g$; and (ii) for every $i\in[N]$, $M_{i}\cap(S_{<t+2}(H_{g}))=M_{i}^{\prime\prime}\cap(S_{<t+2}(H_{g}))$. Notice now that the probability distribution $\rho_{g}(\tau,\cdot)$ depends only on $(M_{1}^{\prime\prime}\cap S_{<t+2}(H_{g}),\ldots,M_{N}^{\prime\prime}\cap S_{<t+2}(H_{g}))$. Hence, (i) and (ii) imply that the probability distribution $\rho_{g}(\sigma_{1},\cdot)$ is well defined and, in addition, there exists a natural bijection $b_{g}$ between the action set $a(g,\tau)$ and the action set $a(g,\sigma_{1})$ so that $\rho_{g}(\tau,\tau^{\prime})=\rho_{g}(\sigma_{1},b_{g}(\tau^{\prime}))$ for every $\tau^{\prime}\in a(g,\tau)$. This is because both distributions are implemented by sampling from the set of matchings of the same multigraph according to the same probability distribution.

Now let $\tau^{\prime}=b_{g}(\sigma_{2})$ . A symmetric argument implies that $\tau^{\prime}\in f$ and that there exists a natural bijection $b_{f}$ between $a(f,\sigma_{1})$ and $a(f,\tau^{\prime})$ so that $\rho_{f}(\sigma_{1},\sigma)=\rho_{f}(\tau^{\prime},b_{f}(\sigma))$ for every $\sigma\in a(f,\sigma_{1})$ . In particular, notice that $\sigma_{2}=b_{f}(\tau)$ and that

[TABLE]

Overall, what we have shown is a bijective mapping that sends any pair of transitions $\sigma_{1}\xrightarrow{f}\tau,\tau\xrightarrow{g}\sigma_{2}$ to a pair of transitions $\sigma_{1}\xrightarrow{g}\tau^{\prime},\tau^{\prime}\xrightarrow{f}\sigma_{2}$ and which satisfies (18). This establishes (17), concluding the proof. $\square$
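The mechanism behind this proof, namely that flaws addressed by resampling disjoint parts of the state give commuting matrices, can be seen in a two-bit toy model. The sketch assumes the convention that $A_{f}[\sigma,\tau]=\rho_{f}(\sigma,\tau)$ for $\sigma\in f$ and $0$ otherwise. The model is purely illustrative: flaw $f$ is "bit 0 equals 1" and is addressed by resampling bit 0, and similarly for $g$ and bit 1.

```python
from fractions import Fraction
from itertools import product

STATES = list(product((0, 1), repeat=2))

def flaw_matrix(coord):
    """A[s][t] = rho(s, t) if bit `coord` of s is 1 (flaw present), else 0."""
    half = Fraction(1, 2)
    A = {s: {t: Fraction(0) for t in STATES} for s in STATES}
    for s in STATES:
        if s[coord] == 1:
            for new_bit in (0, 1):   # resample only the flawed coordinate
                t = list(s)
                t[coord] = new_bit
                A[s][tuple(t)] = half
    return A

def matmul(A, B):
    """Product of two matrices indexed by STATES."""
    return {s: {t: sum(A[s][u] * B[u][t] for u in STATES) for t in STATES}
            for s in STATES}

A_f, A_g = flaw_matrix(0), flaw_matrix(1)
commute = matmul(A_f, A_g) == matmul(A_g, A_f)
```

Both products have a single nonzero row, the one indexed by $(1,1)$, with every entry equal to $1/4$, so `commute` evaluates to `True`.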

5 List-Edge Coloring Multigraphs: Proof of Theorem 1.4

In this section we review the proof of Theorem 1.2 and then prove its constructive version, Theorem 1.4. Again, throughout the proof we assume that the maximum degree $\Delta$ of the input multigraph $G$ satisfies $\Delta\geq\Delta_{0}$ for some appropriately large constant $\Delta_{0}$ .

In Section 5.1 we give a high-level sketch of the existential proof of Kahn, and we state the key technical results from that paper (Theorems 5.1, 5.2, and Lemma 5.3). As we will see, our main contribution is to make Theorem 5.1 constructive. Towards this end, we describe our local search algorithm in Section 5.2, where we also prove its correctness assuming it converges (Lemma 5.4), as well as an important property of the flaws we consider (Lemma 5.5). Finally, in Section 5.3 we prove that our search algorithm has expected polynomial running time, concluding the proof of Theorem 1.4.

5.1 A High Level Sketch of the Existential Proof

As we explained in the introduction, the non-constructive proof of Theorem 1.2 is a sophisticated version of the semi-random method and proceeds by partially coloring the edges of the multigraph in iterations, until at some point the coloring can be completed greedily. (More accurately, the method establishes the existence of such a sequence of desirable partial colorings.)

We will follow the exposition in [31]. In each iteration, we have a list $L_{e}$ of acceptable colors for each edge $e$. We assume that each $L_{e}$ originally has $C$ colors for some $C\geq(1+\epsilon)\chi_{e}^{*}(G)$, where $\epsilon>0$ is an arbitrarily small constant. For each color $i$, we let $G_{i}$ be the subgraph of $G$ formed by the edges for which $i$ is acceptable. Since $G_{i}\subseteq G$, we have $\chi_{e}^{*}(G_{i})\leq\chi_{e}^{*}(G)$. Thus, Theorem 2.9 implies that we can find a hard-core distribution on the matchings of $G_{i}$ with marginals $(\frac{1}{C},\ldots,\frac{1}{C})$ whose activity vector $\lambda_{i}$ satisfies $\lambda_{i}(e)\leq\frac{K}{C}$ for all $e$, where $K=K(\epsilon)$ is a constant.

In each iteration, we will use the same activity vector $\lambda_{i}$ to generate the random matchings assigned to color $i$ . Of course, in each iteration we restrict our attention to the subgraph of $G_{i}$ obtained by deleting the set $E^{*}$ of edges colored (with any color) in previous iterations, and the endpoints of the set of edges $E_{i}^{*}$ colored $i$ in previous iterations. (Thus, although we use the same activity vector for each color in each iteration, the induced hard-core distributions may vary significantly.) Further, we will make sure that our distributions have the property that for each edge $e$ , the expected number of matchings containing $e$ is very close to $1$ . (In other words, the sum over $i$ of the probabilities that edge $e$ is a part of the matching corresponding to color $i$ is close to $1$ .)

We apply the Lopsided LLL in the following probability space. For each color $i$, independently, we choose a matching $M_{i}\in\mathcal{M}(G_{i})$ from the corresponding distribution. Next, we activate each edge in $M_{i}$ independently with probability $\alpha:=\frac{1}{\log\Delta(G)}$; we assign colors only to activated edges in order to ensure that very few edges are assigned more than one color. We then update the multigraph by deleting the colored edges, and update the lists $L_{e}$ by deleting any color assigned to an edge incident on $e$. We give a more detailed description below.

Notice that our argument needs to ensure that (i) at the beginning of each iteration the induced hard-core distributions are such that, for each uncolored edge $e$ , the expected number of random matchings containing $e$ is very close to $1$ ; and (ii) after some number of iterations, we can complete the coloring greedily.

As far as the latter condition is concerned, notice that if (i) holds throughout then, in each iteration, the probability that an edge retains a color remains close to the activation probability $\alpha$. This allows us to prove that the maximum degree in the uncolored multigraph drops by a factor of about $1-\alpha$ in each iteration. Hence, after $\log_{\frac{1}{1-\alpha}}3K$ iterations, the maximum degree in the uncolored multigraph will be less than $\frac{\Delta}{2K}$. Furthermore, for each $e$ and $i$, the probability that $e$ is in the random matching of color $i$ is at most $\lambda_{i}(e)\leq\frac{K}{C}$. Since (i) continues to hold, this implies there are at least $\frac{C}{K}>\frac{\Delta}{K}$ colors available for each edge, and so the coloring can be completed greedily. (Recall that $C>\chi_{e}^{*}(G)\geq\Delta$.)
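The iteration count in this paragraph is simple arithmetic: if the maximum degree shrinks by a factor of exactly $1-\alpha$ per iteration (an idealization of "about $1-\alpha$"), then $k\geq\log_{\frac{1}{1-\alpha}}3K$ iterations bring it down to at most $\frac{\Delta}{3K}<\frac{\Delta}{2K}$. A small sketch with a hypothetical function name:

```python
import math

def iterations_needed(alpha, K):
    """Smallest integer k with (1 - alpha)^k <= 1 / (3K)."""
    return math.ceil(math.log(3 * K) / -math.log1p(-alpha))
```

For example, `iterations_needed(0.1, 2)` returns `18`, since $0.9^{17}\approx 0.1668>\frac{1}{6}$ while $0.9^{18}\approx 0.1501\leq\frac{1}{6}$.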

An Iteration.

1. For each color $i$, pick a matching $M_{i}$ according to a hard-core probability distribution $\mu_{i}$ on $\mathcal{M}(G_{i})$ with activities $\lambda_{i}$ such that for some constant $K$:

(a) $\forall e\in E(G),\sum_{i}\mu_{i}(e\in M_{i})\approx 1$; and

(b) $\forall i,\forall e\in E(G),\lambda_{i}(e)\leq\frac{K}{C}$ and hence $\forall v\in V(G),\sum_{L_{e}\ni i}\lambda_{i}(e)\leq K$.

2. For each $i$, activate each edge of $M_{i}$ independently with probability $\alpha=\frac{1}{\log\Delta(G)}$, to obtain a new matching $F_{i}\subseteq M_{i}$. We color the edges of $F_{i}$ with color $i$ and delete $V(F_{i})$ from $G_{i}$. We also delete from $G_{i}$ every edge not in $M_{i}$ which is in $F_{j}$ for some $j\neq i$. We do not delete edges of $(M_{i}-F_{i})\cap F_{j}$ from $G_{i}$. (Note that this may result in edges receiving more than one color, which is not a problem since we can always pick one of them arbitrarily at the end of the iterative procedure.)

3. Perform an equalizing coin flip for each edge $e$ of $G_{i}$ so that the probability that $e$ is both colored and removed from $G_{i}$ in either Step 2 or Step 3 is exactly $\alpha$. (See also Remark 5.1 below.)
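The sampling and activation steps of an iteration can be sketched on a toy instance as follows. This is a minimal illustration under stated assumptions, not the procedure analysed in the paper: matchings are sampled by brute-force enumeration (exponential time, usable only on tiny graphs), the graph is simple rather than a multigraph, the deletions from $G_{i}$ and the equalizing flips of Step 3 are omitted, and all function names are hypothetical.

```python
import itertools
import math
import random

def matchings(edges):
    """All matchings (sets of pairwise vertex-disjoint edges), including the empty one."""
    result = []
    for r in range(len(edges) + 1):
        for subset in itertools.combinations(edges, r):
            endpoints = [v for e in subset for v in e]
            if len(endpoints) == len(set(endpoints)):
                result.append(frozenset(subset))
    return result

def sample_hard_core(edges, lam, rng):
    """Sample a matching M with probability proportional to prod_{e in M} lam[e]."""
    ms = matchings(edges)
    weights = [math.prod(lam[e] for e in m) for m in ms]
    return rng.choices(ms, weights=weights, k=1)[0]

def run_iteration(graphs, lams, alpha, rng):
    """graphs[i]: edge list of G_i; lams[i]: activities for color i.
    Returns a dict mapping each colored edge to the set of colors it retained."""
    colors = {}
    for i, edges in graphs.items():
        M = sample_hard_core(edges, lams[i], rng)            # Step 1
        F = [e for e in sorted(M) if rng.random() < alpha]   # Step 2: activation
        for e in F:
            colors.setdefault(e, set()).add(i)
    return colors

rng = random.Random(0)
triangle = [(0, 1), (1, 2), (0, 2)]
graphs = {i: triangle for i in range(3)}
lams = {i: {e: 1.0 for e in triangle} for i in range(3)}
colors = run_iteration(graphs, lams, alpha=0.5, rng=rng)
```

Because $F_{i}\subseteq M_{i}$, each color class returned by `run_iteration` is a matching, while a single edge may retain several colors, as noted in Step 2.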

Remark 5.1.

Note that the expected number of edges that are both colored and removed from $G_{i}$ in Step 2 is less than $\alpha|E(G_{i})|$ because, although the expected number of colors retained by an edge is very close to $\alpha$ , some edges may be assigned more than one color. Performing “equalizing coin flips” in Step 3 is a standard idea that helps in avoiding several technical difficulties that stem from the latter fact.
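One way to realise an equalizing coin flip is sketched below. The formula is an assumption made for illustration: the text only requires that the overall probability of being colored and removed equals $\alpha$, and does not spell out how $\mathrm{Eq}_{i}(e)$ is computed. Here `q` stands for the (hypothetical) probability that the edge is colored and removed in Step 2, and the flip is assumed to matter only when that did not happen.

```python
from fractions import Fraction

def equalizing_probability(alpha, q):
    """Success probability of the Step 3 flip so that the total probability is alpha.

    q is the probability of being colored and removed in Step 2; we need q <= alpha < 1.
    Assumes the flip only takes effect if Step 2 did not already remove the edge.
    """
    if not 0 <= q <= alpha < 1:
        raise ValueError("need 0 <= q <= alpha < 1")
    return (alpha - q) / (1 - q)

alpha, q = Fraction(1, 10), Fraction(2, 25)   # hypothetical sample values
eq = equalizing_probability(alpha, q)
total = q + (1 - q) * eq                      # overall removal probability
```

With exact rational arithmetic the total comes out to precisely `alpha`, which is the defining property of the equalizing flip.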

The outcome of an iteration is defined to be the choices of matchings, activations, and equalizing coin flips. Let $\mathrm{Out}=\mathrm{Out}_{\ell}$ denote the random variable that equals the outcome of the $\ell$ -th iteration. (In what follows, we will focus on a specific iteration $\ell$ and so we will omit the subscript.)

For each edge $e=(u,v)$ , we define a bad event $A_{e}$ as follows. Let $G_{i}^{\prime}$ be the multigraph obtained after carrying out the modifications to $G_{i}$ in Steps 2 and 3 of the above iteration. Let $t=8(K+1)^{2}(\log\Delta)^{20}+2$ , recall the definition of $S_{<t}(H)$ for subgraph $H$ , and let $G_{<t}(H)$ denote the corresponding induced subgraph. Let $Z_{i}$ be a random matching in $G_{i}^{\prime}\cap G_{<t}(\{u,v\})$ sampled from the hard-core probability distribution induced by activity vector $\lambda_{i}$ . Let $A_{e}$ be the event that

[TABLE]

To get some intuition behind the definition of event $A_{e}$ , let $M_{i}^{\prime}$ be a random matching in $G_{i}^{\prime}$ chosen according to the hard-core distribution with activities $\lambda_{i}$ . Since correlations decay with distance, one can show that $\Pr(e\in M_{i}^{\prime}\mid\mathrm{Out})$ is within a factor $1+\frac{1}{(\log\Delta)^{20}}$ of $\Pr(e\in Z_{i}\mid\mathrm{Out})$ . Thus, according to (19), avoiding bad event $A_{e}$ implies that $\sum_{i}\Pr(e\in M_{i}^{\prime})\approx\sum_{i}\Pr(e\in M_{i})\approx 1$ , which is what is required in order to maintain property (i) at the beginning of the next iteration. In particular, it is straightforward to see that avoiding all bad events $\{A_{e}\}_{e\in E(G)}$ guarantees that

[TABLE]

for sufficiently large $\Delta$ , which is what we really need. The reason we consider $Z_{i}$ and not $M_{i}^{\prime}$ is that events defined with respect to the former are mildly negatively correlated with most other bad events, making it possible to apply the Lopsided LLL.
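The decay of correlations that motivates replacing $M_{i}^{\prime}$ by the truncated matching $Z_{i}$ can be observed on a toy example. The sketch below is illustrative only and is not the argument used in the paper: it takes a path with uniform activity `lam` (both assumptions), and compares the marginal of a fixed edge when only `d` edges are kept on each side with its marginal on a much longer path.

```python
def path_partition(k, lam):
    """Hard-core (matching) partition function of a path with k edges.

    Satisfies Z(k) = Z(k-1) + lam * Z(k-2) with Z(-1) = Z(0) = 1.
    """
    prev, cur = 1.0, 1.0
    for _ in range(k):
        prev, cur = cur, cur + lam * prev
    return cur

def middle_marginal(d, lam):
    """Pr(e in matching) for an edge e with d further path edges on each side.

    If e is in the matching, its two neighbouring edges are excluded, leaving
    two independent paths with d - 1 edges each.
    """
    return lam * path_partition(d - 1, lam) ** 2 / path_partition(2 * d + 1, lam)

reference = middle_marginal(200, 1.0)      # long path used as a stand-in for "all of G"
errors = [abs(middle_marginal(d, 1.0) - reference) for d in range(1, 8)]
```

On this example the truncation error shrinks geometrically in the radius `d`, which is the qualitative behaviour exploited when $\Pr(e\in M_{i}^{\prime}\mid\mathrm{Out})$ is compared with $\Pr(e\in Z_{i}\mid\mathrm{Out})$.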

Further, for each vertex $v$ we define $A_{v}$ to be the event that the proportion of edges incident on $v$ which are colored in the iteration is less than $\alpha-\frac{1}{(\log\Delta)^{4}}$ .

It can be formally shown that, if we avoid all bad events, then (i) holds, i.e., at the beginning of the next iteration we can choose new probability distributions so that for each uncolored edge $e$ we maintain the property that the expected number of random matchings containing $e$ is very close to $1$ , and, moreover, after $\log_{\frac{1}{1-\alpha}}3K$ iterations we can complete the coloring greedily.

Theorem 5.1 ([23]).

Assume that (20) holds for the edge marginals of the matching distributions of iteration $\ell$ . Then, with positive probability, the same is true for the matching distributions of iteration $\ell+1$ .

Theorem 5.2 ([23]).

If we can avoid the bad events of the first $\log_{\frac{1}{1-\alpha}}3K$ iterations, then we can complete the coloring greedily.

Proving Theorems 5.1 and 5.2 is the heart of the proof of Theorem 1.2. The most difficult part is proving that, for any $x\in V\cup E$ , the probability of event $A_{x}$ is very close to [math] conditioned on any choice of outcomes for distant events. (This is needed in order to apply the Lopsided LLL.) Given Theorem 5.1, the proof of Theorem 5.2 follows, as we have already explained, from the fact that in each iteration the expected number of random matchings containing each uncolored edge $e$ is very close to $1$ and, therefore, the probability that $e$ retains a color remains close to $\alpha$ .

Below we state the key lemma that is proven in [23], and which we will also use in the analysis of our algorithm.

Recall the definition of $t$ . For a subgraph $H$ , we let $R_{H}$ be the random outcome of our iteration in $G-S_{<t^{2}}(H)$ , i.e., $R_{H}$ consists of $\bigcup_{i}\left(M_{i}-S_{<t^{2}}(H)\right)$ together with the choices of the activated edges in $G-S_{<t^{2}}(H)$ which determine the $\bigcup_{i}\left(F_{i}-S_{<t^{2}}(H)\right)$ , and the outcomes of the equalizing coin flips for edges in this subgraph.

Lemma 5.3 ([23]).

For every $x\in E\cup V$ and possible choice $R_{x}^{*}$ for $R_{x}$ , we have $\Pr(A_{x}\mid R_{x}=R_{x}^{*})\leq\frac{1}{\Delta^{3(t^{2}+t+2)}}$ .

In the remaining sections we focus on providing an efficient algorithm for Theorem 5.1 which, combined with Theorem 5.2, yields a proof of Theorem 1.4.

As a final remark, we note that detecting whether bad events $\{A_{e}\}_{e\in E(G)}$ are present in a state is not a tractable task, since it entails the exact computation of edge marginals of hard-core distributions over matchings. To overcome this obstacle, we will define flaws $\{f_{e}\}_{e\in E(G)}$ whose absence provides somewhat weaker guarantees than the removal of bad events $\{A_{e}\}_{e\in E(G)}$, but nonetheless implies (20) for every edge. To decide whether a flaw $f_{e}$ is present in a state, we will use the results of [19] to estimate the corresponding edge marginals of random variables $M_{i}$ and $Z_{i}$ for every color $i$. Note that since we will only perform an approximation, we will not be able to check for (19) directly. However, our approximation will be tight enough so that, even in this case, (20) will still hold for every edge. We give the details below.

5.2 The Algorithm

Let $\mathcal{U}$ denote the set of uncolored edges and $N=|\bigcup_{e\in\mathcal{U}}L_{e}|$ , the cardinality of the set of colors that appear in the list of available colors of some uncolored edge. For a color $i\in[N]$ , recall that $G_{i}$ denotes the subgraph of uncolored edges that contain $i$ in their list of available colors. Finally, let $E_{i}=|E(G_{i})|$ and $\mathcal{S}=\mathcal{S}(T)$ be the set of all binary strings of length $T$ , where $T$ is a parameter to be defined later. An element of $\mathcal{S}$ should be thought of as the input “randomness” to a subroutine of our algorithm whose purpose will be to estimate edge-marginals of distributions $\{\mu_{i}\}_{i\in[N]}$ .

Define $\Omega=\prod_{i\in[N]}\left(\mathcal{M}(G_{i})\times\{0,1\}^{E_{i}}\times\{0,1\}^{E_{i}}\times\mathcal{S}^{E_{i}}\right)$ . We consider an arbitrary but fixed ordering over $\mathcal{U}$ , so that each state $\sigma\in\Omega$ can be represented as $\sigma=\left((M_{1},a_{1},h_{1},s_{1}),\ldots,(M_{N},a_{N},h_{N},s_{N})\right)$ , where $M_{i},a_{i},h_{i}$ are the matching, activation and equalizing coin flip vectors, respectively, that correspond to color $i$ , so that edge $e$ is activated in $G_{i}$ if $a_{i}(e)=1$ and is marked to be removed if $h_{i}(e)=1$ . Additionally, $s_{i}$ is the tuple of strings corresponding to the particular element of $\mathcal{S}^{E_{i}}$ at state $\sigma$ . As we will see, tuples $\{s_{i}\}_{i\in[N]}$ are defined for purely technical reasons, and specifically for properly bypassing the issue of detecting the presence of $f_{e}$ -flaws that we mentioned earlier.

Recalling Corollary 2.12, we see that we are able to obtain a $1\pm 1/n^{\beta}$ approximation for the marginal $\mu_{i}(e)$ , $i\in[N]$ , of an edge $e$ with probability at least $1-1/n^{\beta}$ in polynomial time, where $\beta$ is a fixed and sufficiently large positive constant. This fact will be useful to us in two ways.

First, recall that for color $i$ we choose a matching according to probability distribution $\mu_{i}$, and we define $\mathrm{Eq}_{i}(e)$ to be the probability of success of the equalizing coin flip that corresponds to edge $e$ and color $i$. Note that, given access to the marginals of $\mu_{i}$, the value of $\mathrm{Eq}_{i}(e)$ can be computed efficiently. Of course, as we just explained, we will have only (arbitrarily good) estimates of the marginals of $\mu_{i}$, but as in the proof of Theorem 1.3, this suffices for our purposes. Indeed, through sampling we can efficiently get an estimate $\mathrm{Eq}_{i}^{\prime}(e)$ that is within a $1\pm 1/n^{c}$ factor of the correct value $\mathrm{Eq}_{i}(e)$ with probability at least $1-1/n^{c}$, where $c=c(\beta)$ is a sufficiently large constant, and hence guarantee that the total variation distance between the resampling probability distributions used by the algorithm and the ideal ones is negligible, i.e., at most $1/n^{c}$. (Later we will argue that we can maximally couple the approximate and ideal distributions and proceed with an argument identical to the one we used in the proof of Theorem 1.3, where we absorb the probability that the coupling fails into the expected polynomial running time of the algorithm; recall our discussion at the beginning of Section 4.2.)

Second, we let $T_{1}=T_{1}(\beta)=\mathrm{poly}(n)$ be a fixed polynomial upper bound on the number of random bits required by the sampling algorithm (whose existence is guaranteed by Theorem 2.10) for approximating $\Pr(e\in M_{i})$ , for an arbitrary color $i\in[N]$ and an arbitrary edge $e$ , within a factor $1\pm 1/n^{\beta}$ with probability at least $1-1/n^{\beta}$ . We let $T_{2}$ be an analogous fixed polynomial upper bound for estimating $\Pr(e\in Z_{i}\mid\mathrm{Out})$ for arbitrary $\mathrm{Out}$ , and define $T=T_{1}+T_{2}$ .

We let $p$ be the probability distribution over $\Omega$ that is induced by the product of the $\mu_{i}$ ’s, activation flips, equalizing coin flips, and the uniform distribution over $\mathcal{S}^{E_{i}}$ , for each color $i\in[N]$ . In other words, $p$ is the probability distribution over $\Omega$ induced by the iteration along with some extra randomness that is used for sampling from $\mathcal{S}(T)^{E_{i}}$ .

The initial distribution.

Recall that each edge $e$ initially has a list $L_{e}$ of size at least $(1+\epsilon)\chi_{e}^{*}(G)$ . As we have already seen in Corollary 2.13, the results of [19, 41] imply that for every color $i$ and parameter $\eta=1/n^{\beta}$ , where $\beta>0$ is a sufficiently large constant, there exists a $\mathrm{poly}(n,\ln\frac{1}{\epsilon})$ -algorithm that computes a vector $\lambda^{\prime}_{i}$ such that the induced hard-core distribution $\eta$ -approximates in variation distance the hard-core distribution induced by vector $\lambda_{i}$ . Let $p^{\prime}$ be the distribution obtained in an identical way to $p$ but using vectors $\lambda^{\prime}_{i}$ instead of vectors $\lambda_{i}$ . The initial distribution $\theta$ of our algorithm is obtained by $\eta$ -approximately sampling from $p^{\prime}$ . Theorem 2.10 implies that this can be done in polynomial time.

Finding and addressing flaws.

We define a flaw $f_{v}$ for each bad event $A_{v}$ . To define flaw $f_{e}$ corresponding to an edge $e$ , we first recall the definitions of $T_{1},T_{2}$ . In particular, recall that the description of a state $\sigma$ determines a binary string $s=s(\sigma)\in\mathcal{S}$ of length $T_{1}+T_{2}$ for each color $i$ and edge $e\in E(G_{i})$ . We will think of $s$ as a concatenation of two strings of length $T_{1}$ and $T_{2}$ , respectively, that can and will be used as the “input randomness” to a sampling algorithm that estimates $\Pr(e\in M_{i})$ and $\Pr(e\in Z_{i}\mid\mathrm{Out}(\sigma))$ , respectively. (Here $\mathrm{Out}(\sigma)$ is the evaluation of random variable $\mathrm{Out}$ at $\sigma$ .) Indeed, let $\widetilde{\Pr_{\sigma}}(e\in M_{i})$ be the resulting, deterministic (given $s(\sigma)$ ) estimation of $\Pr(e\in M_{i})$ and, similarly, let $\widetilde{\Pr_{\sigma}}(e\in Z_{i}\mid\mathrm{Out}(\sigma))$ be the resulting estimation of $\Pr(e\in Z_{i}\mid\mathrm{Out}(\sigma))$ . Finally, we define flaw $f_{e}$ to be the set of states $\sigma\in\Omega$ such that

[TABLE]

We fix an arbitrary ordering $\pi$ over $V\cup E$ . In each step, the algorithm finds the lowest indexed flaw according to $\pi$ that is present in the current state and addresses it.
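The flaw-selection rule just described has the shape of a generic focused local search loop, sketched below. The callbacks `is_present` and `address` are hypothetical placeholders (in the paper they correspond to flaw detection and to procedure Fix), and the toy instance is an assumption chosen only to make the sketch runnable.

```python
import random

def local_search(state, flaws, is_present, address, max_steps=10_000):
    """Repeatedly address the lowest-indexed flaw present in the current state.

    flaws is listed in the order pi; returns (flawless state, number of steps).
    """
    for step in range(max_steps):
        current = next((f for f in flaws if is_present(f, state)), None)
        if current is None:
            return state, step
        state = address(current, state)
    raise RuntimeError("step budget exhausted")

# Toy instance (hypothetical): flaw k is present when bits k and k+1 are both 1,
# and addressing it resamples those two bits uniformly at random.
rng = random.Random(1)

def is_present(k, bits):
    return bits[k] == 1 and bits[k + 1] == 1

def address(k, bits):
    new = list(bits)
    new[k], new[k + 1] = rng.randint(0, 1), rng.randint(0, 1)
    return tuple(new)

flaws = list(range(5))
final, steps = local_search((1,) * 6, flaws, is_present, address)
```

The `max_steps` guard is a convenience of the sketch; the analysis in the paper bounds the expected number of steps instead.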

Clearly, checking if vertex-flaws $f_{v}$ are present in the current state can be done efficiently. The same is true for edge-flaws $f_{e}$ given Theorem 2.10. What is perhaps not so clear, however, is whether the definition of $f_{e}$ -flaws is sufficient for our purposes, and how it relates to the definition of bad events $A_{e}$ .
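The role of the strings $s(\sigma)$ can be illustrated as follows: once the random bits are part of the state, an estimate computed from them is a deterministic function of the state. The sketch is an assumption-laden stand-in, not the sampler of Theorem 2.10: `trial` is a hypothetical callable that turns one uniform number into a 0/1 outcome, and the lengths `T1`, `T2` and the target probabilities are made up for the example.

```python
import random

def estimate_from_bits(bits, trial, bits_per_sample=16):
    """Monte Carlo estimate that depends only on the given bit string."""
    n = len(bits) // bits_per_sample
    hits = 0
    for k in range(n):
        chunk = bits[k * bits_per_sample:(k + 1) * bits_per_sample]
        u = int(chunk, 2) / 2 ** bits_per_sample   # uniform in [0, 1)
        hits += trial(u)
    return hits / n

def split_randomness(s, T1):
    """Split s into the part used for Pr(e in M_i) and the part used for Pr(e in Z_i | Out)."""
    return s[:T1], s[T1:]

rng = random.Random(0)
T1 = T2 = 16 * 2000                                # hypothetical lengths
s = "".join(rng.choice("01") for _ in range(T1 + T2))
s_M, s_Z = split_randomness(s, T1)
est_M = estimate_from_bits(s_M, lambda u: u < 0.30)   # stand-in for estimating Pr(e in M_i)
est_Z = estimate_from_bits(s_Z, lambda u: u < 0.25)   # stand-in for estimating Pr(e in Z_i | Out)
```

Re-running the estimator on the same string returns the same value, which is what allows the presence of $f_{e}$ to be a well-defined property of the state.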

To address these questions, recall first that we can use the results of [19] to approximate the edge marginals of the corresponding distributions within a $(1\pm\eta)$ -factor with probability at least $1-\eta$ , in time $\mathrm{poly}(n,\frac{1}{\eta})$ , where $\eta=1/n^{\beta}$ . Our approach will be to first argue that, assuming our edge marginal estimates were always within a $(1\pm\eta)$ -factor of the true values, then our algorithm would terminate in expected polynomial time, and then use a coupling argument similar to the one described in the beginning of Section 4.2 to show that we can make this assumption in our analysis at a negligible price.

More formally, given a state $\sigma=\left((M_{1},a_{1},h_{1},s_{1}),\ldots,(M_{N},a_{N},h_{N},s_{N})\right)$ , let $M(\sigma)=(M_{1},\ldots,M_{N})$ , $a(\sigma)=(a_{1},\ldots,a_{N})$ , and $h(\sigma)=(h_{1},\ldots,h_{N})$ , and define $\xi(\sigma)=(M(\sigma),a(\sigma),h(\sigma))$ . For each edge $e$ , color $i$ , and state $\sigma$ , let $\mathcal{S}_{i}^{\prime}(e)=\mathcal{S}_{i}^{\prime}(e,\xi(\sigma))\subseteq\mathcal{S}$ be the set of strings with the property that, if our marginal estimators use them as input randomness in state $\sigma$ for edge $e$ , then they are guaranteed to provide a $(1\pm\eta)$ -factor approximation of the true marginals of $e$ . Crucially, observe that $|\mathcal{S}_{i}^{\prime}(e)|/|\mathcal{S}|\geq 1-1/n^{c}$ for a constant $c=c(\beta)$ which can be made arbitrarily large by increasing $\beta$ . Let $\Omega^{\prime}\subseteq\Omega$ be the subspace of $\Omega$ induced by removing every state $\sigma=\left((M_{1},a_{1},h_{1},s_{1}),\ldots,(M_{N},a_{N},h_{N},s_{N})\right)$ such that there exists an $i\in[N]$ for which $s_{i}\notin\prod_{e\in E(G_{i})}\mathcal{S}_{i}^{\prime}(e)$ . That is, $\Omega^{\prime}$ is the subspace of $\Omega$ in which our edge-marginal approximations are guaranteed to be within a $(1\pm\eta)$ -factor of the true values. Finally, let $\mu$ be the distribution induced by conditioning on the event that a sample from $p$ belongs to $\Omega^{\prime}$ . Equivalently, to take a sample from $\mu$ we first sample from the product of the $\mu_{i}$ ’s, activation flips, and equalizing coin flips to obtain a tuple $\xi=(M,a,h)$ , and then sample uniformly an element from $\prod_{i=1}^{N}\prod_{e\in E(G_{i})}\mathcal{S}_{i}^{\prime}(e,\xi)$ .
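The objects in this paragraph can be mirrored by a small data structure. The sketch below is purely illustrative: the class and function names are hypothetical, and `is_good_string` is an assumed oracle for membership in $\mathcal{S}_{i}^{\prime}(e,\xi(\sigma))$, which in the paper is defined through the accuracy of the marginal estimators rather than computed.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, Tuple

Edge = Tuple[int, int]

@dataclass(frozen=True)
class ColorState:
    matching: FrozenSet[Edge]       # M_i
    activation: Dict[Edge, int]     # a_i(e) in {0, 1}
    equalizing: Dict[Edge, int]     # h_i(e) in {0, 1}
    randomness: Dict[Edge, str]     # s_i(e), a binary string of length T

def xi(state):
    """Project a state sigma onto (M, a, h), dropping the randomness strings."""
    return tuple((cs.matching, cs.activation, cs.equalizing) for cs in state)

def in_omega_prime(state, is_good_string):
    """True iff every string s_i(e) is 'good' for its color and edge at xi(state)."""
    projection = xi(state)
    return all(is_good_string(i, e, s, projection)
               for i, cs in enumerate(state)
               for e, s in cs.randomness.items())

state = (
    ColorState(frozenset({(0, 1)}), {(0, 1): 1, (1, 2): 0},
               {(0, 1): 0, (1, 2): 0}, {(0, 1): "0110", (1, 2): "1011"}),
    ColorState(frozenset(), {(1, 2): 0}, {(1, 2): 1}, {(1, 2): "0000"}),
)
# Hypothetical oracle: treat all-zero strings as the "bad" ones.
toy_oracle = lambda i, e, s, projection: "1" in s
```

In this toy state the second color carries an all-zero string, so the state lies outside the toy version of $\Omega^{\prime}$.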

The following two lemmas justify our definition of $f_{e}$ -flaws. Specifically, Lemma 5.4 shows that avoiding all $f_{e}$ -flaws is sufficient for the purposes of our analysis (recall Theorem 5.1), while Lemma 5.5 bounds the probability of each flaw (with respect to $\mu$ ).

Lemma 5.4.

Condition (20) holds for every edge $e$ and every state $\sigma\in\Omega^{\prime}$ such that $\sigma\notin f_{e}$ .

Proof.

Since for every state $\sigma\in\Omega^{\prime}$ we have that $\widetilde{\Pr_{\sigma}}(e\in Z_{i}\mid\mathrm{Out}(\sigma)),\widetilde{\Pr_{\sigma}}(e\in M_{i})$ are within a $(1\pm\eta)$ factor of the respective true marginals, we have that for every state $\sigma\in\Omega^{\prime}\setminus f_{e}$ :

[TABLE]

Recalling that $\Pr(e\in M_{i}^{\prime}\mid\sigma)$ is within a $(1+\frac{1}{(\log\Delta)^{20}})$ -factor of $\Pr(e\in Z_{i}\mid\sigma)$ , we can deduce that if flaw $f_{e}$ is not present in a state $\sigma\in\Omega^{\prime}$ , then (20) holds for sufficiently large $\beta,\Delta$ , as claimed. ∎

Lemma 5.5.

For every $x\in E\cup V$ and possible choice $R_{x}^{*}$ for $R_{x}$ we have $\mu(f_{x}\mid R_{x}=R_{x}^{*})\leq\frac{1}{\Delta^{3(t^{2}+t+2)}}$ .

Proof.

For $f_{v}$ flaws the claim follows almost immediately from Lemma 5.3, so we focus on proving it for the case of $f_{e}$ -flaws. In particular, we show that

[TABLE]

as this implies our claim per Lemma 5.3.

Recall that $f_{e}$ is a subset of $\Omega$ , i.e., the “augmented” space where each state is associated with a tuple of strings from $\prod_{i\in[N]}\mathcal{S}^{E_{i}}$ , while event $A_{e}$ is a subset of the original probability space that is induced by the family of random matchings, activations, and equalizing coin flips for each edge. Recall also that, by definition, $\mu$ assigns zero probability mass to $f_{e}\setminus\Omega^{\prime}$ , i.e., the part of $f_{e}$ where we have no guarantees about the quality of approximation of our edge-marginal estimators. In order to establish (22) we “project” $f_{e}\cap\Omega^{\prime}$ to the original probability space to get an event $\widetilde{A}_{e}$ . That is, the elements of $\widetilde{A}_{e}$ are induced by the elements of $f_{e}$ by ignoring the coordinate that corresponds to the tuple of strings from $\prod_{i\in[N]}\mathcal{S}^{E_{i}}$ . By definition, $\mu(f_{e}\mid R_{e}=R_{e}^{*})=\Pr(\widetilde{A}_{e}\mid R_{e}=R_{e}^{*})$ .

In addition, for every elementary event $\xi\in\widetilde{A}_{e}$ we have

[TABLE]

for sufficiently large $\Delta$. Note that the first inequality follows from (21) and the fact that we only consider elements in $f_{e}\cap\Omega^{\prime}$, i.e., states in which our edge-marginal approximations are within a $(1\pm\eta)$-factor of the true values. Recalling (19), we see that inequality (23) implies that $\Pr(\widetilde{A}_{e}\mid R_{e}=R_{e}^{*})\leq\Pr(A_{e}\mid R_{e}=R_{e}^{*})$ (and, therefore, also (22)), concluding the proof.

∎

Summarizing, we may assume without loss of generality that we are able to accurately and efficiently search for edge-flaws $f_{e}$ , and that their probability with respect to measure $\mu$ is bounded above by $\Delta^{-3(t^{2}+t+2)}$ conditional on any instantiation of $R_{e}$ .

Recall the procedure Resample described in Section 4.1. Below we describe procedure Fix that takes as input a subgraph $H$ and a state $\sigma$ . In the description of Fix below we invoke procedure Resample with an extra parameter, namely an activity vector $\lambda_{i}^{\prime}$ for each color $i$ . By that we mean that in Lines 4, 5 of Resample we use the vector $\lambda_{i}^{\prime}$ to define $p$ . Finally, recall that we defined $t=8(K+1)^{2}(\log\Delta)^{20}+2$ .

Theorem 2.10 implies that procedure Fix runs in polynomial time for any input subgraph $H$ and state $\sigma$ . To address flaws $f_{v},f_{\{u_{1},u_{2}\}}$ in a state $\sigma$ we invoke Fix( $\{v\},\sigma$ ) and Fix( $\{u_{1},u_{2}\},\sigma$ ), respectively.

5.3 Proof of Theorem 1.4

Similarly to the proof of Theorem 1.3, for our analysis we will assume that our algorithm samples from the “ideal” matching distributions, i.e., the ones induced by the vectors $\lambda_{i}$ rather than by the approximations $\lambda_{i}^{\prime}$. We will also assume that each equalizing coin corresponding to a color $i\in[N]$ and an edge $e$ is flipped with probability of success $\mathrm{Eq}_{i}(e)$ instead of $\mathrm{Eq}_{i}^{\prime}(e)$, and that we update string $s_{i}(e)$ by sampling uniformly from $\mathcal{S}_{i}^{\prime}(e)$ instead of $\mathcal{S}$. Under these assumptions, we will prove that our algorithm terminates in expected polynomial time. Recalling the proof of Theorem 1.3, the latter allows us to invoke an identical coupling argument and show that the price of making these assumptions is to increase the failure probability of our algorithm by an additive $1/n^{\gamma}$, where $\gamma=\gamma(\beta)$ can be made arbitrarily large by increasing $\beta$. This error probability can be subsumed by the expected running time of our algorithm.

For two flaws $f_{x_{1}},f_{x_{2}}$, where $x_{1},x_{2}\in V\cup E$, we consider the causality relation $f_{x_{1}}\sim f_{x_{2}}$ iff $\mathrm{dist}(x_{1},x_{2})\leq t^{2}+t+2$. By inspecting procedure Fix it is not hard to verify that this is a valid choice for a causality graph in the sense that no flaw $f$ can cause flaws outside $\Gamma(f)$. This is because, in order to determine whether a flaw $f_{x}$ is present in a state $\sigma$, we only need information about $\sigma$ in $G\cap S_{<t}(x)$, and procedure Fix locally modifies the state within a radius at most $t^{2}$ of the input subgraph $H$.
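A causality neighborhood of this kind can be computed by breadth-first search, as in the sketch below. It is an illustration under assumptions: elements of $V\cup E$ are represented as tuples of vertices, $\mathrm{dist}$ is taken to be the minimum graph distance between the two vertex sets (the paper's exact convention may differ), and a tiny radius replaces $t^{2}+t+2$.

```python
from collections import deque

def bfs_distances(adj, sources):
    """Graph distances from the nearest vertex in sources."""
    dist = {s: 0 for s in sources}
    queue = deque(sources)
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    return dist

def causality_neighbors(adj, elements, x, radius):
    """Elements y (vertices or edges) with dist(x, y) <= radius."""
    dist = bfs_distances(adj, list(x))
    inf = float("inf")
    return [y for y in elements if min(dist.get(v, inf) for v in y) <= radius]

# Path 0 - 1 - 2 - 3 - 4 - 5; vertices are 1-tuples, edges are 2-tuples.
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3, 5], 5: [4]}
vertices = [(v,) for v in range(6)]
edges = [(v, v + 1) for v in range(5)]
gamma = causality_neighbors(adj, vertices + edges, x=(0,), radius=2)
```

On the path, the neighborhood of vertex 0 at radius 2 consists of vertices 0, 1, 2 and the three edges touching them.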

The algorithmic proof of Theorem 5.1, which as we explained earlier is the key ingredient in making Kahn’s result constructive, follows almost immediately by combining Theorem 2.4 with Lemma 5.6 below, whose proof can be found in Section 5.3.1.

Lemma 5.6.

Let $f\in\{f_{e},f_{v}\}$ for an edge $e$ and a vertex $v$ . Then:

[TABLE]

where the charges are computed with respect to measure $\mu$ and the algorithm that samples from the ideal distributions.

Constructive Proof of Theorem 5.1.

Recall from (8) that, setting $x_{f}=\frac{1}{1+\max_{f\in F}|\Gamma(f)|}$ for each flaw $f$ , condition (3) with $\epsilon=1/4$ is implied by

[TABLE]

Clearly, for each flaw $f$ , $|\Gamma(f)|=O(\Delta^{2(t^{2}+t+2)})$ so, by Lemma 5.6, condition (24) is satisfied for all sufficiently large $\Delta$ . Thus, Theorem 2.4 implies that, for every multigraph with large enough degree $\Delta_{0}$ , the algorithm for each iteration terminates after an expected number of

[TABLE]

steps. ∎
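The way the two exponents interact in this proof can be checked numerically. The sketch below rests on several assumptions that should be read as such: it uses the standard sufficient form $\gamma\cdot e\cdot(1+D)\leq 1-\epsilon$ of the local lemma condition with $x_{f}=\frac{1}{1+D}$ (which follows from $(1-\frac{1}{1+D})^{D}\geq e^{-1}$ and may differ in detail from (24)), it takes the hidden constant in $|\Gamma(f)|=O(\Delta^{2(t^{2}+t+2)})$ to be 1, and it uses natural logarithms. Everything is done in log-space because the quantities are astronomically large.

```python
import math

def exponent_m(K, delta):
    """m = t^2 + t + 2 with t = 8 (K+1)^2 (log delta)^20 + 2 (natural log assumed)."""
    t = 8 * (K + 1) ** 2 * math.log(delta) ** 20 + 2
    return t * t + t + 2

def lll_condition_holds(log_gamma, log_D, eps=0.25):
    """Check gamma * e * (1 + D) <= 1 - eps using logarithms (requires D >= 1)."""
    log_one_plus_D = log_D + math.log1p(math.exp(-log_D))   # log(1 + D)
    return log_gamma + 1.0 + log_one_plus_D <= math.log(1 - eps)

def check(delta, m):
    log_gamma = -3 * m * math.log(delta)   # charge bound Delta^{-3m}
    log_D = 2 * m * math.log(delta)        # neighborhood size Delta^{2m}, constant 1 assumed
    return lll_condition_holds(log_gamma, log_D)

holds_for_paper_t = check(100, exponent_m(K=1, delta=100))   # hypothetical K and Delta
fails_when_tiny = check(2, 1)                                # m = 1 is far too small
```

The product of the two bounds is $\Delta^{-(t^{2}+t+2)}$, so the condition holds with enormous slack once $\Delta$ is moderately large.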

Finally, the proof of Theorem 1.4 is concluded by combining the algorithm for Theorem 5.1 with the greedy algorithm of Theorem 5.2. It remains only for us to prove Lemma 5.6 stated above. This we do in the next subsection.

5.3.1 Proof of Lemma 5.6.

Let $\Omega_{1}=\prod_{i=1}^{N}\mathcal{M}(G_{i})$ and $\Omega_{2}=\Omega_{3}=\prod_{i=1}^{N}\{0,1\}^{E_{i}}$ . For notational convenience, sometimes we write $\Omega_{1}^{i}=\mathcal{M}(G_{i})$ and $\Omega_{2}^{i}=\Omega_{3}^{i}=\{0,1\}^{E_{i}}$ , for $i\in[N]$ .

Let $\nu_{1}$ be the distribution over $\Omega_{1}$ induced by the product of distributions $\mu_{i}$, $i\in[N]$. Let also $\nu_{2},\nu_{3}$ be the distributions over $\Omega_{2}$ and $\Omega_{3}$ induced by the product of activation and equalizing coin flips of each color $i\in[N]$, respectively. Recall that we can take a sample from $\mu$ by sampling from $\nu_{1}\times\nu_{2}\times\nu_{3}$ to obtain a tuple $\xi=(M,a,h)\in\Omega_{1}\times\Omega_{2}\times\Omega_{3}$, and then sampling uniformly an element from $\prod_{i=1}^{N}\prod_{e\in E(G_{i})}\mathcal{S}_{i}^{\prime}(e,\xi)$. Moreover, note that each $\nu_{j}$ is the product of $N$ distributions $\nu_{j}^{i}$, one for each color $i\in[N]$. For example, notice that $\nu_{1}^{i}$ is another name for $\mu_{i}$, while $\nu_{2}^{i}$ is the product measure over the edges of $G_{i}$ induced by flipping a coin with probability $\alpha$ for each edge.

For $\sigma_{1}=(M_{1},M_{2},\ldots,M_{N})\in\Omega_{1}$ , a subgraph $H$ , and an integer $d>0$ , we define the quantities $Q_{H}(d,\sigma_{1})=\left(M_{1}-S_{<d}(H),\ldots,M_{N}-S_{<d}(H)\right)$ and $Q_{H}^{i}(d,\sigma_{1})=M_{i}-S_{<d}(H)$ , similarly to the proof of Lemma 4.3. Moreover, for $\sigma_{2}\in\Omega_{2}$ that represents the outcome of the activations, we let $A_{H}(d,\sigma_{2})$ denote the restriction of $\sigma_{2}$ to $M_{i}-S_{<d}(H)$ for each color $i\in[N]$ . In the same fashion, for $\sigma_{3}\in\Omega_{3}$ that represents the outcome of the equalizing coin flips, we let $C_{H}(d,\sigma_{3})$ denote the restriction of $\sigma_{3}$ to $M_{i}-S_{<d}(H)$ for each color $i\in[N]$ . For $\sigma_{2}\in\Omega_{2},\sigma_{3}\in\Omega_{3}$ , we also define $A_{H}^{i}(d,\sigma_{2})$ and $C_{H}^{i}(d,\sigma_{3})$ , $i\in[N]$ , similarly to $Q_{H}^{i}(d,\sigma_{1})$ . Finally, for $\xi=(\sigma_{1},\sigma_{2},\sigma_{3})\in\Omega_{1}\times\Omega_{2}\times\Omega_{3}$ , define $R_{H}(d,\xi)=(Q_{H}(d,\sigma_{1}),A_{H}(d,\sigma_{2}),C_{H}(d,\sigma_{3}))$ .

Our goal will be to show that, for every $x\in V\cup E$ ,

[TABLE]

where $\sigma$ is a random state distributed according to $\mu$ . Note that combining (25) with Lemma 5.5 will conclude the proof of Lemma 5.6.

We only prove (25) for $f_{e}$-flaws, since the proof for $f_{v}$-flaws is very similar. Observe that whether flaw $f_{e}$ is present at a state $\sigma$ is determined by $\bigcup_{i=1}^{N}\left(G_{i}\cap G_{<t}(e)\right)$, the entries of the activation and equalizing flip vectors of each color $i\in[N]$ that correspond to edges in $G_{i}\cap G_{<t}(e)$, and the value of the “input randomness” strings $\{s_{i}(e)\}_{i=1}^{N}$. With that in mind, for each color $i$ let $M_{i}(t,e)=M_{i}\cap E(G_{i}\cap G_{<t}(e))$ and let $a_{i}(t,e),h_{i}(t,e)$ denote the (random) vectors containing the entries of the activation and equalizing coin flip vectors for color $i$ that correspond to the edges of $G_{i}\cap G_{<t}(e)$. Let also $\mathcal{D}_{i}(t,e)$ denote the domain of possible values of $(M_{i}(t,e),a_{i}(t,e),h_{i}(t,e),s_{i}(e))$.

The fact that we can determine whether $f_{e}$ is present in a state by examining local information around $e$ implies that there exists a set $X_{e}=X_{e}(t)$ of vectors of size $N$ such that the $i$ -th entry of a vector $x\in X_{e}$ is an element of $\mathcal{D}_{i}(t,e)$ , and so that

[TABLE]

For a state $\sigma\in\Omega^{\prime}$ , let $x_{e}^{\sigma}$ be the $N$ -dimensional random vector whose $i$ -th entry is $(M_{i}(t,e),a_{i}(t,e),h_{i}(t,e),s_{i}(e))$ . According to (26), for $\tau\in\Omega^{\prime}$ we have

[TABLE]

since the random choices of matching, activation, and equalizing coin flips for each color are independent. For an $N$ -dimensional vector $x$ whose $i$ -th entry is an element of $\mathcal{D}_{i}(t,e)$ , we write $x_{i}(j)$ to denote the $j$ -th element of tuple $x_{i}$ . Thus, recalling the definition of the distributions $\nu_{j}^{i}$ , we have

[TABLE]

where $\xi_{i}=(x_{i}(1),x_{i}(2),x_{i}(3))$ .

Recall now that for a subgraph $H$ , multigraph $G_{<d+1}(H)$ is induced by $S_{<d+1}(H)$ , and $\mathcal{M}_{d+1}^{i}(H,\sigma)$ is the set of matchings of $G_{<d+1}(H)$ that are compatible with $Q_{H}^{i}(d,\sigma_{1})$ . Hence,

[TABLE]

Moreover, we clearly have

[TABLE]

We will use (27)-(31) to show that, for $\sigma$ distributed according to $\mu$ , and any state $\tau\in\Omega^{\prime}$ ,

[TABLE]

According to the definition of $\gamma_{f_{e}}$ , maximizing (32) over $\tau\in\Omega^{\prime}$ yields (25).

To compute the sum in (32) we need to determine the set of states $\mathrm{In}_{e}(\tau)=\{\omega:\rho_{f_{e}}(\omega,\tau)>0\}$ . We claim that for each $\omega\in\mathrm{In}_{e}(\tau)$ we have that $R_{e}(t^{2},\omega)=R_{e}(t^{2},\tau)$ .

To see this, let

[TABLE]

where $\omega_{j},\tau_{j}\in\Omega_{j}$ and $\omega_{j}^{i},\tau_{j}^{i}\in\Omega_{j}^{i}$ for $j\in\{1,2,3\}$ and $\omega_{4},\tau_{4}$ are tuples of input randomness strings. To express the probability distribution $\rho_{f_{e}}(\omega,\cdot)$ in a convenient way we consider the following $3N$ distributions. For each $i\in[N]$ we have a probability distribution $\rho_{f_{e}}^{i,1}(\omega_{1}^{i},\cdot)$ corresponding to Line 3 of Fix and color $i$, and similarly, for $\omega_{2}^{i},\omega_{3}^{i}$ we have probability distributions $\rho_{f_{e}}^{i,2}(\omega_{2}^{i},\cdot),\rho_{f_{e}}^{i,3}(\omega_{3}^{i},\cdot)$, corresponding to Lines 5, 6 of Fix and color $i$, respectively. Recalling procedure Resample, we see that the support of $\rho_{f_{e}}^{i,1}(\omega_{1}^{i},\cdot)$ is $\mathcal{M}_{t^{2}+1}^{i}(e,\omega_{1})$, and hence it must be the case that $Q_{e}^{i}(t^{2},\omega_{1})=Q_{e}^{i}(t^{2},\tau_{1})$ for every $i\in[N]$ and state $\omega\in\mathrm{In}_{e}(\tau)$. Similarly, by inspecting procedure Fix one can verify that $A_{e}^{i}(t^{2},\omega_{2})=A_{e}^{i}(t^{2},\tau_{2})$ and that $C_{e}^{i}(t^{2},\omega_{3})=C_{e}^{i}(t^{2},\tau_{3})$ for each $i\in[N]$. Hence, $R_{e}(t^{2},\omega)=R_{e}(t^{2},\tau)$, as claimed.

For each $\omega\in f_{e}$ ,

[TABLE]

since we have assumed that in Line 7 of Fix we update string $s_{i}(e)$ by sampling uniformly from $\mathcal{S}_{i}^{\prime}(e)$ instead of $\mathcal{S}$.

We will now give an alternative expression for each $r_{i,j}(\omega)$ in order to relate (33) to (32). We start with $r_{i,1}(\omega)$ . The fact that $Q_{e}^{i}(t^{2},\omega_{1})=Q_{e}^{i}(t^{2},\tau_{1})$ for each $\omega\in\mathrm{In}_{e}(\tau)$ implies that

[TABLE]

To see this, recall the definition of multigraph $G_{i,<d+1}(H)$ in the text above the definition of procedure Resample.

Furthermore, since we have assumed that the hard-core distribution in Lines 4, 5 of Resample is induced by the ideal vector of activities $\lambda_{i}$ , we have

[TABLE]

Combining (34) with (35) and the fact that $Q_{e}^{i}(t^{2},\omega_{1})=Q_{e}^{i}(t^{2},\tau_{1})$ we obtain

[TABLE]

Recall now the definitions of $a_{i}(t,e)$ and $h_{i}(t,e)$ . The fact that $A_{e}^{i}(t^{2},\omega_{2})=A_{e}^{i}(t^{2},\tau_{2})$ for each $\omega\in\mathrm{In}_{e}(\tau)$ implies that

[TABLE]

Further, since in Line 5 of Fix we simply flip a coin independently with success probability $\alpha$ for each edge of $G_{i,<t^{2}+1}(e)$, we have

[TABLE]

where the sum in the denominator ranges over all the possible values for $a_{i}(t,e)$ . Thus, combining (37) with (38) we get

[TABLE]

Finally, an identical argument shows that

[TABLE]

where the sum in the denominator ranges over all the possible values for $h_{i}(t,e)$ .

For $x\in X_{e}$ , let $\Omega_{e,x}=\{\omega:x_{e}^{\omega}=x\}$ . For $\sigma$ distributed according to $\mu$ , the left-hand side of (32) can be written as

[TABLE]

where $\xi_{i}=(x_{i}(1),x_{i}(2),x_{i}(3))$, concluding the proof of (32). Note that (5.3.1) follows by (29) and (36) for $j=1$, (30) and (39) for $j=2$, and (31) and (40) for $j=3$. This concludes the proof of (25) and hence of Lemma 5.6.

6 Acknowledgements

We thank Dimitris Achlioptas for many helpful discussions. We are also thankful to anonymous reviewers for helping us correct inaccuracies in previous versions of the paper and, in particular, for the idea to include the random bits used for approximating the edge-marginals in the algorithm of Section 5.2 in the definition of its state space, which simplified our approach.

Appendix A Proof of Lemma 4.5

We will need the following standard concentration bound (see, e.g., [31, Section 10.1]).

Lemma A.1.

Let $X$ be a random variable determined by $n$ independent trials $T_{1},\ldots,T_{n}$ , and such that changing the outcome of any one trial can affect $X$ by at most $c$ . Then

[TABLE]
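The displayed inequality of Lemma A.1 is not reproduced in this text. A commonly stated form of this kind of bounded-differences bound is $\Pr(|X-\mathbb{E}[X]|>\lambda)\leq 2e^{-\lambda^{2}/(2c^{2}n)}$; the sketch below assumes that form (it is an assumption, not a quotation of the lemma) and compares it with exact tail probabilities for a sum of $n$ fair coin flips, where $c=1$. The function names are hypothetical.

```python
import math

def binomial_two_sided_tail(n, lam):
    """Exact Pr(|X - n/2| > lam) for X ~ Binomial(n, 1/2)."""
    total = sum(math.comb(n, k) for k in range(n + 1) if abs(k - n / 2) > lam)
    return total / 2 ** n

def assumed_concentration_bound(n, c, lam):
    """Assumed form of the bound: 2 * exp(-lam^2 / (2 c^2 n))."""
    return 2 * math.exp(-lam ** 2 / (2 * c ** 2 * n))

# Each coin flip changes the sum by at most c = 1.
comparison = [
    (n, lam, binomial_two_sided_tail(n, lam), assumed_concentration_bound(n, 1, lam))
    for n in (50, 200)
    for lam in (5, 10, 20)
]
```

In every row of `comparison` the exact tail lies below the assumed bound, as it must, since for this example Hoeffding's inequality already gives the stronger estimate $2e^{-2\lambda^{2}/n}$.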

Proof of Part (a) of Lemma 4.5.

Recall that $t=8(K+1)^{2}\delta^{-1}+2$ and that $\delta=\frac{\epsilon}{4}$. Consider a random state $\sigma$ distributed according to $\mu$ and a fixed state $\tau\in\Omega$, and notice that applying Theorem 2.9 with the parameter $\epsilon$ instantiated to $\delta$, together with our choice of $t$, implies that

[TABLE]

for any vertex $v$ , any edge $e$ incident on $v$ and any $i\in[N]$ . This implies

[TABLE]

Now, recalling that $N=\lfloor\chi_{e}^{*}(G)^{\frac{3}{4}}\rfloor\sim\Delta^{3/4}$ and $\epsilon\leq\frac{1}{10}$ , for sufficiently large $\Delta$ we have

[TABLE]

Further, since $c^{*}=\chi_{e}^{*}(G)-(1+\epsilon)^{-1}N$ and $\epsilon\leq\frac{1}{10}$ , (43) yields

[TABLE]

As the choices of the $M_{i}$ are independent and each affects the degree of $v$ in $G^{\prime}$ by at most $1$ , we can apply Lemma A.1 with $\lambda=(\frac{\epsilon}{3}-\frac{\epsilon}{4})N=\frac{\epsilon}{12}N$ to prove part (a). In particular, since $N=\lfloor\chi_{e}^{*}(G)^{\frac{3}{4}}\rfloor\sim\Delta^{3/4}$ we have

[TABLE]

for any constant $C$ for sufficiently large $\Delta$ . ∎
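To see why the phrase "for sufficiently large $\Delta$" matters in part (a), the following minimal sketch evaluates the tail bound numerically. It assumes Lemma A.1 is the textbook simple concentration bound $2e^{-\lambda^{2}/(2c^{2}n)}$, applied with $n=N$ trials, $c=1$ and $\lambda=\frac{\epsilon}{12}N$, which gives $2e^{-\epsilon^{2}N/288}$. The function names are hypothetical, and $N$ is approximated by $\lfloor\Delta^{3/4}\rfloor$ rather than $\lfloor\chi_{e}^{*}(G)^{3/4}\rfloor$.

```python
import math


def simple_concentration_bound(lam, c, n):
    """Return 2*exp(-lam^2 / (2 c^2 n)), the assumed form of Lemma A.1.

    This is an upper bound on Pr[|X - E[X]| > lam] when X is determined by
    n independent trials, each of which can change X by at most c.
    """
    return 2.0 * math.exp(-(lam * lam) / (2.0 * c * c * n))


def degree_deviation_bound(max_degree, eps):
    """Tail bound for the degree of a fixed vertex, as in part (a).

    Assumption: N is approximated by floor(max_degree^(3/4)); the paper
    uses floor(chi_e^*(G)^(3/4)), which is asymptotically the same.
    """
    num_matchings = math.floor(max_degree ** 0.75)
    lam = eps * num_matchings / 12.0  # lambda = (eps/3 - eps/4) * N
    # Each matching M_i changes the degree of v by at most 1, so c = 1.
    return simple_concentration_bound(lam, c=1.0, n=num_matchings)


if __name__ == "__main__":
    for max_degree in (1e4, 1e8, 1e12):
        bound = degree_deviation_bound(max_degree, eps=0.1)
        print(f"Delta = {max_degree:.0e}: bound = {bound:.3e}")
```

For moderate $\Delta$ the bound exceeds $1$ and is vacuous; it only drops below $\Delta^{-C}$ once $\epsilon^{2}\Delta^{3/4}$ dominates $C\log\Delta$, which is the sense in which the estimate is asymptotic.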

Proof of Part (b) of Lemma 4.5.

The proof of part (b) is similar. Consider again a random state $\sigma$ distributed according to $\mu$ and fix a state $\tau\in\Omega$ . Theorem 2.9 implies that for each $i\in[N]$ , the probability that an edge $e$ with both endpoints in $H$ is in $M_{i}$ , conditional on $Q_{H}^{i}(t,\sigma)=Q_{H}^{i}(t,\tau)$ , is at least $(1-\delta)\frac{1-\delta}{\chi_{e}^{*}(G)}\geq\frac{1-\frac{\epsilon}{2}}{\chi_{e}^{*}(G)}$ . Moreover, Edmonds’ characterization of the matching polytope (which we have already seen in the proof of Lemma 4.2) implies that the number of edges in $G$ with both endpoints in $H$ is at most $\chi_{e}^{*}(G)\lfloor\frac{|V(H)|-1}{2}\rfloor$ . Similar calculations to those in part (a) reveal that

[TABLE]

where $E_{\sigma}(H)$ is the set of edges of $G_{\sigma}$ induced by $H$ . Since the choices of matchings $M_{i}$ are independent and each affects $|E_{\sigma}(H)|$ by at most $\frac{|V(H)|-1}{2}$ , we can again apply Lemma A.1 to prove part (b). ∎

Bibliography

[1] Dimitris Achlioptas and Fotis Iliopoulos. Random walks that find perfect objects and the Lovász local lemma. J. ACM, 63(3):22:1–22:29, July 2016.
[2] Dimitris Achlioptas, Fotis Iliopoulos, and Vladimir Kolmogorov. A local lemma for focused stochastic algorithms. SIAM J. Comput., 48(5):1583–1602, 2019.
[3] Dimitris Achlioptas, Fotis Iliopoulos, and Alistair Sinclair. Beyond the Lovász local lemma: Point to set correlations and their algorithmic applications. In David Zuckerman, editor, 60th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2019, Baltimore, Maryland, USA, November 9-12, 2019, pages 725–744. IEEE Computer Society, 2019.
[4] Dimitris Achlioptas, Fotis Iliopoulos, and Nikos Vlassis. Stochastic control via entropy compression. In Ioannis Chatzigiannakis, Piotr Indyk, Fabian Kuhn, and Anca Muscholl, editors, 44th International Colloquium on Automata, Languages, and Programming, ICALP 2017, July 10-14, 2017, Warsaw, Poland, volume 80 of LIPIcs, pages 83:1–83:13. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2017.
[5] Michael Albert, Alan Frieze, and Bruce Reed. Multicoloured Hamilton cycles. Electronic Journal of Combinatorics, 2(1):R10, 1995.
[6] Guantao Chen, Guangming Jing, and Wenan Zang. Proof of the Goldberg-Seymour conjecture on edge-colorings of multigraphs. arXiv preprint arXiv:1901.10316, 2019.
[7] Guantao Chen, Xingxing Yu, and Wenan Zang. Approximating the chromatic index of multigraphs. Journal of Combinatorial Optimization, 21(2):219–246, 2011.
[8] Andrzej Dudek, Alan Frieze, and Andrzej Ruciński. Rainbow Hamilton cycles in uniform hypergraphs. The Electronic Journal of Combinatorics, 19(1):46, 2012.
