Efficiently list-edge coloring multigraphs asymptotically optimally
Fotis Iliopoulos, Alistair Sinclair

TL;DR
This paper develops polynomial-time algorithms for asymptotically optimal list-edge coloring of multigraphs, transforming non-constructive probabilistic proofs into constructive algorithms using advanced local search and correlation decay techniques.
Contribution
It introduces a constructive approach to Kahn's asymptotic results on edge coloring multigraphs by leveraging local search analysis and correlation decay methods.
Findings
Algorithms match Kahn's asymptotic bounds
Use of local search analysis for constructive algorithms
Correlation decay exploited for efficiency
Abstract
We give polynomial time algorithms for the seminal results of Kahn, who showed that the Goldberg-Seymour and List-Coloring conjectures for (list-)edge coloring multigraphs hold asymptotically. Kahn's arguments are based on the probabilistic method and are non-constructive. Our key insight is to show that the main result of Achlioptas, Iliopoulos and Kolmogorov for analyzing local search algorithms can be used to make constructive applications of a powerful version of the so-called Lopsided Lovasz Local Lemma. In particular, we use it to design algorithms that exploit the fact that correlations in the probability spaces on matchings used by Kahn decay with distance.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Efficiently list-edge coloring multigraphs asymptotically optimally
Fotis Iliopoulos
University of California Berkeley
[email protected] Research supported by NSF grant CCF-1514434 and the Onassis Foundation.
Alistair Sinclair
University of California Berkeley
[email protected] Research supported by NSF grants CCF-1514434 and CCF-1815328.
Abstract
We give polynomial time algorithms for the seminal results of Kahn [22, 23], who showed that the Goldberg-Seymour and List-Coloring conjectures for (list-)edge coloring multigraphs hold asymptotically. Kahn’s arguments are based on the probabilistic method and are non-constructive. Our key insight is that we can combine sophisticated techniques due to Achlioptas, Iliopoulos and Kolmogorov [2] for the analysis of local search algorithms with correlation decay properties of the probability spaces on matchings used by Kahn in order to construct efficient edge-coloring algorithms.
Keywords— edge-coloring, mulitgraphs, Goldberg-Seymour conjecture, list-edge-coloring conjecture
1 Introduction
In graph edge coloring one is given a (multi)graph and the goal is to find an assignment of one of colors to each edge so that no pair of adjacent edges share the same color. The chromatic index, , of is the smallest integer for which this is possible. In the more general list-edge coloring problem, a list of allowed colors is specified for each edge. A graph is -list-edge colorable if it has a list-coloring no matter how the lists are assigned to each edge. The list chromatic index, , is the smallest for which is -list-edge colorable.
Edge coloring is one of the most fundamental and well-studied coloring problems with various applications in computer science (e.g., [7, 12, 21, 22, 23, 34, 36, 38, 39, 40, 42]). To give just one representative example, if edges represent data packets then an edge coloring with colors specifies a schedule for exchanging the packets directly and without node contention. In this paper we are interested in designing algorithms for efficiently edge coloring and list-edge coloring multigraphs. To formally describe our results, we need some notation.
For a multigraph , let denote the set of matchings of . A fractional edge coloring is a set of matchings and corresponding positive real weights , such that the sum of the weights of the matchings containing each edge is one, i.e., , . A fractional edge coloring is a fractional edge -coloring if . The fractional chromatic index of , denoted by , is the minimum such that has a fractional edge -coloring.
Let be the maximum degree of and define , where is the set of edges of the induced subgraph . Both of these quantities are obvious lower bounds for the chromatic index and it is known [9] that . Furthermore, Padberg and Rao [35] show that the fractional chromatic index of a multigraph, and indeed an optimal fractional edge coloring, can be computed in polynomial time.
Goldberg and Seymour independently stated the now famous conjecture that every multigraph satisfies . In a seminal paper [22], Kahn showed that the Goldberg-Seymour conjecture holds asymptotically:
Theorem 1.1** ([22]).**
*The chromatic index of a multigraph satisfies *
(Here denotes a term that tends to zero as .) Later Kahn proved the analogous result for list-edge coloring [23], establishing that the List Coloring Conjecture, which asserts that for any multigraph , also holds asymptotically:
Theorem 1.2** ([23]).**
The list chromatic index of a multigraph satisfies
The proofs of Kahn use the probabilistic method and are not constructive. The main contribution of this paper is to provide polynomial time algorithms for the above results, as follows:
Theorem 1.3**.**
There exists a randomized algorithm that, given a multigraph on vertices, constructs a -edge coloring of in expected polynomial time.
Theorem 1.4**.**
There exists a randomized algorithm that, given a multigraph on vertices and an arbitrary list of colors for each edge, constructs a valid list-edge coloring of in expected polynomial time.
Clearly, Theorem 1.4 subsumes Theorem 1.3. Moreover, in a very recent breakthrough [6], Chen, Jing and Zang proved the (non-asymptotic) Goldberg-Seymour conjecture without exploiting the arguments of Kahn. Even before this work, the results of Sanders and Steurer [38] and Scheide [40] already give deterministic polynomial time algorithms for edge coloring multigraphs asymptotically optimally, again without exploiting the arguments of Kahn. Nonetheless, we choose to present the proof of Theorem 1.3 for three reasons. First and most importantly, its proof is significantly easier than that of Theorem 1.4, while it contains many of the key ideas required for proving Theorem 1.4. Second, our algorithms and techniques are very different from those of [6, 38, 40]. Finally, we show that the algorithm of Theorem 1.3 is commutative, a notion introduced by Kolmogorov [27]. This fact may be of independent interest as we discuss in Remark 2.1 in Section 2.2.
To the best of our knowledge, Theorem 1.4 is the first result to give an asymptotically optimal polynomial time algorithm for list-edge coloring multigraphs.
1.1 Technical Overview
The proofs of Theorems 1.1 and 1.2 are based on a very sophisticated variation of what is known as the semi-random method (also known as the “naive coloring procedure”), which is the main technical tool behind some of the strongest graph coloring results, e.g., [20, 21, 25, 30]. The idea is to gradually color the graph in iterations, until we reach a point where we can finish the coloring using a greedy algorithm. In its most basic form, each iteration consists of the following simple procedure: assign to each edge a color chosen uniformly at random; then uncolor any edge which receives the same color as one of its neighbors. Using the Lovász Local Lemma (LLL) [10] and concentration inequalities, one typically shows that, with positive probability, the resulting partial proper coloring has useful properties that allow for the continuation of the argument in the next iteration. For a nice exposition of both the method and the proofs of Theorems 1.1 and 1.2, the reader is referred to [31].
The key new ingredient in Kahn’s arguments is the method of assigning colors to edges. For each color , we choose a matching from some hard-core distribution on and assign the color to the edges in . The idea is that, by assigning each color exclusively to the edges of one matching, we avoid conflicting color assignments and the resulting uncolorings.
The existence of such hard-core distributions is guaranteed by the characterization of the matching polytope due to Edmonds [9] and a result by Lee [28] (also shown independently by Rabinovich et al. [37]). The crucial fact about them is that they are endowed with very useful approximate stochastic independence properties, as was shown by Kahn and Kayll in [24]. In particular, for every edge , conditioning on events that are determined by edges far enough from in the graph does not effectively alter the probability of being in the matching.
The reason why this property is important is because it enables the application of a sophisticated version of what is known as the Lopsided Lovász Local Lemma. Recall that the original statement of the LLL asserts, roughly, that, given a family of “bad” events in a probability space, if each bad event individually is not very likely and, in addition, is independent of all but a small number of other bad events, then the probability of avoiding all bad events is strictly positive. The Lopsided LLL used by Kahn generalizes this criterion as follows. For each bad event , we fix a positive real number and require that conditioning on all but a small number of other bad events doesn’t make the probability of larger than . Then, provided the are small enough, the conclusion of the LLL still holds. In other words, one replaces the “probability of a bad event” in the original LLL statement with the “boosted” probability of the event, and the notion of “independence” by the notion of “sufficiently mild negative correlation”.
Notably, the breakthrough result of Moser and Tardos [32, 33] that made the LLL constructive for the vast majority of its applications does not apply in this case, mainly for two reasons. First, the algorithm of Moser and Tardos applies only when the underlying probability measure of the LLL application is a product over explicitly presented variables. Second, it relies on a particular type of dependency (defined by shared variables). The lack of an efficient algorithm for Lopsided LLL applications is the primary obstacle to making the arguments of Kahn constructive.
Our main technical contribution is the design and analysis of such algorithms. Towards this goal, we use the flaws-actions framework introduced in [1] and further developed in [2, 3, 4, 16, 18]. In particular, we use the algorithmic LLL criterion for the analysis of stochastic local search algorithms developed by Achlioptas, Iliopoulos and Kolmogorov in [2]. We start by showing that there is a connection between this criterion and the version of the Lopsided LLL used by Kahn, in the sense that the former can be seen as the constructive counterpart of the latter. However, this observation by itself is not sufficient, since the result of [2] is a tool for analyzing a given stochastic local search algorithm. Thus, we are still left with the task of designing the algorithm before using it. Nonetheless, this connection provides valuable intuition on how to realize this task. Moreover, we believe it is of independent interest as it provides an explanation for the success of various algorithms (such as [29]) inspired by the techniques of Moser and Tardos, which were not tied to a known form of the LLL.
To get a feeling for the nature of our algorithms, it is helpful to have some intuition for the criterion of [2]. There, the input is the algorithm to be analyzed and a probability measure over the state space of the algorithm. The goal of the algorithm is to reach a state that avoids a family of bad subsets of the space which we call flaws. It does this by focusing on a flaw that is currently present at each step, and taking a (possibly randomized) action to address it. At a high level, the role of the measure is to gauge how efficiently the algorithm rids the state of flaws, by quantifying the trade-off between the probability that a flaw is present at some inner state of the execution of the algorithm and the number of other flaws each flaw can possibly introduce when the algorithm addresses it. In particular, the quality of the convergence criterion is affected by the compatibility between the measure and the algorithm.
Roughly, the states of our algorithm will be matchings in a multigraph (corresponding to color classes) and the goal will be to construct matchings that avoid certain flaws. To that end, our algorithm will locally modify each flawed matching by (re)sampling matchings in subgraphs of according to distributions induced by the hard-core distributions used in Kahn’s proof. The fact that correlations decay with distance in these distributions allows us to prove that, while the changes are local, and hence not many new flaws are introduced at each step, the compatibility of our algorithms with these hard-core distributions is high enough to allow us to successfully apply the criterion of [2].
1.2 Organization of the Paper
In Section 2 we present the necessary background. In Section 3 we show a useful connection between the version of the Lopsided LLL used by Kahn and the algorithmic LLL criterion of [2]. In Section 4 we present the proof of Theorem 1.3. In Section 5, we sketch the proof of Theorem 1.2 and then prove Theorem 1.4.
2 Background and Preliminaries
2.1 The Lopsided Lovász Local Lemma
Erdős and Spencer [11] noted that independence in the LLL can be replaced by positive correlation, yielding the original version of what is known as the Lopsided LLL, more sophisticated versions of which have also been established in [5, 8]. Below we state the Lopsided LLL in one of its most powerful forms.
Theorem 2.1** (General Lopsided LLL).**
Let be a probability space and be a set of (bad) events. For each , let be such that for every . If there exist positive real numbers such that
[TABLE]
then the probability that none of the events in occurs is at least .
Note that in most applications of the Lopsided LLL the definition of sets is “symmetric”, in the sense that if then for every . With that in mind, any (undirected) graph on that includes every edge such that or is called a lopsidependency graph.
2.2 An Algorithmic LLL Criterion.
Let be a discrete state space, and let be a collection of subsets (which we call flaws) of . We define . Our goal is to find a state ; we refer to such states as flawless.
For a state , we denote by the set of flaws present in . We consider local search algorithms working on which, in each flawed state , choose a flaw in and randomly move to a nearby state in an effort to fix . We will assume that, for every flaw and every state , there is a non-empty set of actions such that addressing flaw in state amounts to selecting the next state from according to some probability distribution . Note that potentially , i.e., addressing a flaw does not necessarily imply removing it. We write to denote the fact that the algorithm addresses flaw at and moves to .
Throughout the paper we consider algorithms that start from a state picked from an initial distribution , and then repeatedly pick a flaw that is present in the current state and address it. The algorithm always terminates when it encounters a flawless state.
Definition 2.2** (Causality).**
We say that flaw causes if there exists a transition such that (i) ; (ii) either or .
Definition 2.3** (Causality Graph).**
Any (undirected) graph on that includes every edge such that either causes or causes is called a causality graph. We write for the set of neighbors of in this graph. We also write to denote that (or equivalently, ).
For a given probability measure supported on the state space , and for each flaw , we define the charge
[TABLE]
In Section 3 we give the intuition behind the definition of charges and also draw a connection with the parameters in Theorem 2.1. We are now ready to state the main result of [2].
Theorem 2.4** ([2]).**
Assume that, at each step, the algorithm chooses to address the lowest indexed flaw according to an arbitrary, but fixed, permutation of . If there exist positive real numbers for such that
[TABLE]
for some , then the algorithm reaches a flawless object within steps with probability at least , where
[TABLE]
We also describe another theorem that can be used to show convergence in a polynomial number of steps, even when the number of flaws is super-polynomial, assuming that the algorithm has a nice “commutativity” property which we describe next.
Definition 2.5**.**
For , let denote the matrix defined by if , and otherwise. An algorithm defined by matrices , , is commutative with respect to a causality relation if for every such that we have .
Remark 2.1**.**
As shown in [27, 18, 16], commutative algorithms have several additional nice properties: they are often parallelizable, their output distribution approximates the so-called “LLL-distribution”, etc. Here we use the fact that commutative algorithms converge in polynomial time even in the presence of superpolynomially many flaws, assuming that the causality graph can be covered by a polynomial number of cliques (see Theorem 2.6 below). It is also worth noting that, if there were an efficient parallel algorithm for sampling matchings in multigraphs, namely a parallel version of the MCMC algorithm of Theorem 2.10 which we discuss in the next section and which we use in our algorithm for Theorem 1.3, then our proof directly implies a parallel algorithm for Theorem 1.3. The study of parallel versions of MCMC sampling algorithms has been initiated recently in [13, 14].
We note that Definition 2.5 was introduced in [16], as a generalization of the combinatorial definition of commutativity introduced in [27]. While the latter would suffice for our purposes, we choose to work with Definition 2.5 due to its compactness.
Theorem 2.6**.**
Let be a commutative algorithm with respect to a causality relation . Assume there exist positive real numbers in such that condition (3) holds. Assume further that the causality graph induced by can be covered by cliques with potentially further edges between them. Setting , the expected number of steps performed by is at most , and for any parameter , terminates within resamplings with probability .
As shown in [18, Theorem 3.2], the proof of Theorem 2.6 reduces to that of the analogous result of Hauepler, Saha and Srinivasan [15] for the Moser-Tardos algorithm, and hence we omit it.
2.3 Hard-Core Distributions on Matchings
A probability distribution on the matchings of a multigraph is hard-core if it is obtained by associating to each edge a positive real (called the activity of ) so that the probability of any matching is proportional to . Thus, recalling that denotes the set of matchings of , and setting for each , we have
[TABLE]
The characterization of the matching polytope due to Edmonds [9] and a result of Lee [28] (which was also shown independently by Rabinovich et al. [37]) imply the following connection between fractional edge colorings and hard-core probability distributions on matchings. Before describing it, we need a definition.
For any probability distribution on the matchings of a multigraph , we refer to the probability that a particular edge is in the random matching as the marginal of at . We write for the collection of marginals of at all the edges .
Theorem 2.7** ([28, 37]).**
There is a hard-core probability distribution with marginals if and only if there is a fractional -edge coloring of with , i.e., if and only if .
Kahn and Kayll [24] proved that the probability distribution promised by Theorem 2.7 is endowed with very useful approximate stochastic independence properties.
Definition 2.8**.**
Suppose we choose a random matching from some probability distribution. We say that an event is -distant from a vertex if is completely determined by the choice of all matching edges at distance at least from . We say that is -distant from an edge if it is -distant from both endpoints of .
Theorem 2.9** ([24]).**
For any , there exists a such that for any multigraph with fractional chromatic index there is a hard-core distribution with marginals such that:
- (a)
for every , and hence , ; 2. (b)
for every , if we choose a matching according to then, for any edge and event which is -distant from ,
[TABLE]
where .
We conclude this subsection by stating the result of Jerrum and Sinclair [19] for sampling from hard-core distributions on matchings. We also describe a few of its applications that will be helpful in our proofs. The algorithm of [19] works by simulating a rapidly mixing Markov chain on matchings, whose stationary distribution is the desired hard-core distribution , and outputting the final state.
Theorem 2.10** ([19], Corollary 4.3).**
Let be a multigraph, a vector of activities associated with the edges of , and the corresponding hard-core distribution. Let be the number of vertices of and define . There exists an algorithm that, for any , runs in time and outputs a matching in drawn from a distribution such that .
Remark 2.2**.**
[19]** establishes this result for matchings in (simple) graphs. However, the extension to multigraphs is immediate: make the graph simple by replacing each set of multiple edges between a pair of vertices by a single edge of activity ; then use the algorithm to sample a matching from the hard-core distribution in the resulting simple graph; finally, for each edge in this matching, select one of the corresponding multiple edges with probability . Note that the running time will depend polynomially on the maximum activity in the simple graph, as claimed.
Note that, via a standard argument, the algorithm of Theorem 2.10 can be used to design a fully-polynomial randomized approximation scheme (f.p.r.a.s.) for the partition function of a hard-core probability distribution on the matchings of a multigraph — namely, for the quantity .
Theorem 2.11** ([19], Corollary 4.4).**
Let be a multigraph, a vector of activities associated with the edges of , and the corresponding partition function. Let be the number of vertices of and define . There exists an algorithm that, for any , runs in time and outputs a quantity such that
Remark 2.3**.**
The estimate in Theorem 2.11 could be arbitrarily bad with probability . However, this probability can be reduced to any desired by performing trials and taking the median.
Theorem 2.11 allows us to design a f.p.r.a.s. for the edge-marginals of a hard-core probability distribution on the matchings of a multigraph .
Corollary 2.12**.**
Let be a multigraph, a vector of activities associated with the edges of , and the corresponding hard-core distribution. Let be the number of vertices of and define . There exists an algorithm that, for any edge , and , runs in time and outputs a quantity such that , where is the marginal of at .
Proof.
Let be the mutligraph obtained by removing along with every other edge of adjacent to it. Let , denote the partition functions corresponding to multigraphs with respect to , respectively. Observe now that . Using the f.p.r.a.s. promised by Theorem 2.11 (and Remark 2.3) to get appropriately accurate estimates for , we directly obtain an estimate for that satisfies the guarantees of Corollary 2.12.
∎
Finally, one can use Theorem 2.11 as a subroutine in the algorithm of Singh and Vishnoi [41] to obtain the following result.
Corollary 2.13**.**
Let be a multigraph on vertices and let be a parameter. Let be the hard-core probability distribution over the matchings of promised by Theorem 2.9. For every , there exists a -time algorithm that computes a set of edge activities such that the corresponding hard-core distribution satisfies .
Proof.
Corollary 2.13 follows in a straightforward way from the main results of Singh and Vishnoi [41] and Jerrum and Sinclair [19]. Briefly, the main result of [41] states that finding a distribution that approximates can be seen as the solution of a max-entropy distribution estimation problem which can be efficiently solved given a “generalized counting oracle” for . The latter oracle is provided by Theorem 2.11. ∎
3 Causality, Lopsidependency and Approximate Resampling Oracles
In this section we show a connection between Theorem 2.1 and Theorem 2.4. While this section is not essential to the proof of our main results, it does provide useful intuition since it implies the following natural approach to making applications of the Lopsided LLL algorithmic: we start designing a local search algorithm for addressing the flaws that correspond to bad events by considering the family of probability distributions whose supports induce a causality graph that coincides with the lopsidependency graph of the Lopsided LLL application of interest. This is typically a straightforward task. The key to successful implementation is our ability to make the way in which the algorithm addresses flaws sufficiently compatible with the underlying probability measure . To make this precise, we first recall an algorithmic interpretation of the notion of charges defined in (2).
As shown in [2], the charge captures the compatibility between the actions of the algorithm for addressing flaw and the measure . To see this, consider the probability, , of ending up in state after (i) sampling a state according to , and then (ii) addressing at . Define the distortion associated with as
[TABLE]
i.e., the maximum possible inflation of a state probability incurred by addressing (relative to its probability under , and averaged over the initiating state according to ). Now observe from (2) that
[TABLE]
An algorithm for which is called a resampling oracle [17] for , and notice that it perfectly removes the conditional of the addressed flaw. However, designing resampling oracles for sophisticated measures can be impossible by local search. This is because small, but non-vanishing, correlations can travel arbitrarily far in . Thus, allowing for some distortion can be very helpful, especially in cases where correlations decay with distance.
Theorem 3.1 below shows that Theorem 2.4 is the algorithmic counterpart of Theorem 2.1.
Theorem 3.1**.**
Given a family of flaws over a state space , an algorithm with causality graph with neighborhoods , and a measure over , then for each we have
[TABLE]
where the are the charges of the algorithm as defined in (2).
Proof.
Let . Observe that
[TABLE]
where the second equality holds because each is a probability distribution, and the third by the definition of causality and the fact that . Now notice that changing the order of summation in (7) gives
[TABLE]
∎
In words, Theorem 3.1 shows that causality graph is a lopsidependency graph with respect to measure with for all . Thus, when designing an algorithm for an application of Theorem 2.1 using Theorem 3.1, we have to make sure that the induced causality graph coincides with the lopsidependency graph, and that the measure distortion induced when addressing flaw is sufficiently small so that the resulting charge is bounded above by .
4 Edge Coloring Multigraphs: Proof of Theorem 1.3
We follow the exposition of the proof of Kahn in [31]. Note that throughout the proof we assume that the maximum degree of the input multigraph satisfies for some appropriately large constant .
The key to the proof of Theorem 1.3 is the following lemma.
Lemma 4.1**.**
For all , there exists such that if then we can find matchings in whose deletion leaves a multigraph with in expected time.
Remark 4.1**.**
Since , we may assume that without loss of generality . Therefore, the expected running time of the algorithm promised by Lemma 4.1 is .
Using the algorithm of Lemma 4.1 recursively, for every we can efficiently find an edge coloring of using at most colors as follows. First, we compute using the algorithm of Padberg and Rao [35]. If , then we apply Lemma 4.1 to get a multigraph with . We can now color recursively using at most colors. Using one extra color for each of the matchings promised by Lemma 4.1, we can then complete the coloring of , proving the claim. In the base case where , we color greedily using colors. The fact that concludes the proof of Theorem 1.3 as the number of recursive calls is at most .
In the following sections, we prove Lemma 4.1. In Section 4.1 we describe the local search algorithm behind Lemma 4.1, and in Section 4.2 we prove its convergence. In Sections 4.3, 4.4 we prove two important auxiliary lemmas that are used in our convergence proof.
4.1 The Algorithm
Observe that we only need to prove Lemma 4.1 for since, clearly, if it holds for then it holds for all . So we fix and let . Our goal will be to delete matchings from to get a multigraph which has fractional chromatic index at most .
The flaws.
Let be the set of possible -tuples of matchings of . For a state let denote the multigraph obtained by deleting the matchings from . For a vertex we define to be the degree of in . We now define the following flaws. For every vertex let
[TABLE]
For every connected subgraph of with an odd number of vertices and such that (i) , and (ii) , let
[TABLE]
The following lemma implies that it suffices to find a flawless state. (This lemma was proved in [22], but we include a proof here for completeness.)
Lemma 4.2** ([22]).**
Any flawless state satisfies .
Proof.
Edmonds’ characterization [9] of the matching polytope implies that the chromatic index of is at most if
; and 2. 2.
with an odd number of vertices:
[TABLE]
Now clearly, addressing every flaw of the form establishes condition . By summing degrees it also implies that, for every subgraph , .
Moreover, any odd subgraph can be decomposed into a connected component with an odd number of vertices, and a (possibly empty) subgraph with an even number of vertices. Since there are no edges between and , in the absence of flaws we obtain
[TABLE]
Thus it suffices to prove condition 2 for the connected odd subgraph , for if then we have
[TABLE]
Now, again by summing degrees, we see that if no flaw is present then condition can fail only for with fewer than vertices, concluding the proof. Indeed, in the absence of flaws, we have and, since , if then . ∎
To describe an efficient algorithm for finding flawless states we need to (i) determine the initial distribution of the algorithm and show that it is efficiently samplable; (ii) show how to address each flaw efficiently; (iii) show that the expected number of steps of the algorithm is polynomial; and finally (iv) show that we can search for flaws in polynomial time, so that each step is efficiently implementable.
The initial distribution.
Apply Theorem 2.9 with . Let be the promised hard-core probability distribution, the vector of activities associated with it, and the corresponding constant. Note that the activities defining are not readily available. However, recalling Corollary 2.13 we see that we can efficiently compute a set of activities that gives an arbitrarily good approximation to the desired distribution .
For a parameter and a distribution , we say that we -approximately sample from to express the fact that we sample from a distribution such that . Set , where is a sufficiently large constant to be specified later, and let be the distribution promised by Corollary 2.13. The initial distribution of our algorithm, , is obtained by -approximately sampling random matchings (independently) from . Observe that , where denotes the probability distribution over induced by taking independent samples from .
Addressing flaws.
For an integer and a connected subgraph , let be the set of vertices within distance strictly less than of a vertex . Given a state , a subgraph , and let
[TABLE]
where we define . Moreover, let denote the -th entry of . (In words, is the set of edges of with the property that both their endpoints are at distance at least from .) Finally, let be the multigraph induced by and be the set of matchings of that are “compatible” with . That is, for any matching in we have that is also a matching of . More specifically, note that corresponds to the set of matchings of the following multigraph . Let denote the set of vertices of that belong to edges in . Multigraph is induced by .
We consider the procedure Resample below which takes as input a connected subgraph , a state and a positive integer , and which will be used to address flaws.
Throughout the proof, we fix the parameter
[TABLE]
To address in state , we invoke procedures Resample and Resample , respectively.
Searching for flaws.
Notice that we can compute in polynomial time using the algorithm of Padberg and Rao [35]. Therefore, given a state , we can search for flaws of the form in polynomial time. However, the flaws of the form are potentially exponentially many, so a brute-force search does not suffice for our purposes.
Fortunately, the result of Padberg and Rao provides a polynomial time oracle for this problem as well. Recall Edmonds’ characterization used in the proof of Lemma 4.2. The constraints over odd subgraphs are called matching constraints. Recall further that in the proof of Lemma 4.2 we showed that, in the absence of -flaws, the only matching constraints that could possibly be violated correspond to flaws. On the other hand, the oracle of Padberg and Rao can decide in polynomial time whether has a fractional -coloring or return a violated matching constraint, for every . Hence, if our algorithm prioritizes flaws over flaws, this oracle can be used to detect the latter in polynomial time.
4.2 Proof of Lemma 4.1
We are left to show that the expected number of steps of the algorithm is polynomial and that each step can be executed in polynomial time. To that end, we will show that both of these statements are true assuming that the initial distribution is instead of approximately , and that in Lines 4, 5 of the procedure Resample we perfectly sample from the hard-core probability distribution induced by activities instead of -approximately sampling from . We can maximally couple the approximate and ideal distributions, and then take the constant in the definition of the approximation parameter to be sufficiently large. The latter implies that the probability that the coupling will fail during the execution of the algorithm is negligible (i.e., at most ). Since the fractional chromatic index of a multigraph can be computed in polynomial time, we can absorb the probability that the coupling fails into the polynomial expected running time by executing our algorithm sufficiently many times. That is, we execute our algorithm for a number of steps that is twice its expected running time, and if the edge coloring it produces is not a desirable one, we repeat the process.
For an integer and a vertex , let be the set of flaws indexed by a vertex of or a subgraph intersecting . For each set for which we have defined we let . For each flaw we define the causality neighborhood , and for each flaw we define , where is as defined in the previous subsection. Notice that this is a valid choice because flaw can only cause flaws in and flaw can only cause flaws in . The reason why we choose these neighborhoods to be larger than seemingly necessary is because, as we will see, with respect to this causality graph our algorithm is commutative, allowing us to apply Theorem 2.6.
Lemma 4.3**.**
Let for a vertex and a connected subgraph of with an odd number of vertices and let . We have:
- (a)
* ;* 2. (b)
,
where the charges are computed with respect to the measure and the algorithm that samples from the ideal distributions.
Lemma 4.4**.**
For each pair of flaws , the matrices commute.
The proof of Lemma 4.3 can be found in Section 4.3. Lemma 4.4 establishes that our algorithm is commutative with respect to the causality relation induced by neighborhoods . Its proof can be found in Section 4.4.
Now, setting for each flaw , we see that condition (3) with is implied by
[TABLE]
which is true for large enough according to Lemma 4.3. Notice further that the causality graph induced by can be covered by cliques, one for each vertex of , with potentially further edges between them. Indeed, flaws indexed by subgraphs that contain a certain vertex of form a clique in the causality graph. Combining Lemma 4.4 with the latter observation, we are able to apply Theorem 2.6, which implies that our algorithm terminates after an expected number of at most O\bigl{(}\max_{\sigma\in\Omega}\frac{\theta(\sigma)}{\mu(\sigma)}\cdot n\log(n\log(1/\delta))\bigr{)}=O(n\log n) steps. (This is because we assume that per our discussion above.)
This completes the proof of Lemma 4.1 and hence, as explained at the beginning of Section 4, Theorem 1.3 follows. It remains, however, to go back and prove Lemmas 4.3 and 4.4, which we do in the next two subsections.
4.3 Proof of Lemma 4.3
Proof of part (a).
We will need the following key lemma, which was essentially proved in [22]. Its proof can be found in Appendix A. Recall that is the distribution over induced by taking independent samples from .
Lemma 4.5**.**
For any random state distributed according to :
- (i)
for every flaw and state : ; and 2. (ii)
for every flaw and state : .
We show the proof of part (a) of Lemma 4.3 only for the case of - flaws, as the proof for - flaws is very similar. Specifically, our goal will be to prove that
[TABLE]
Lemma 4.5 then concludes the proof.
Recalling the definition of from (2) we see that, in order to prove (9), it suffices to show that, for distributed according to and any state ,
[TABLE]
Indeed, maximizing (10) over yields (9) and completes the proof.
Fix . To compute the sum on the left-hand side of (10) we need to determine the set of states for which . To do this, recall that given as input a state , procedure Resample( modifies one by one each matching , , “locally” around . In particular, observe that the support of the distribution for updating is exactly the set , and hence it must be the case that for every and state . This also implies that, for every such ,
[TABLE]
Recall now that we have assumed that the hard-core distribution in Lines 4, 5 of Resample is induced by the ideal vector of activities . In particular, we have
[TABLE]
since , which combined with (11) yields
[TABLE]
Letting be a random state distributed from we see that, by definition, the right-hand side of (13) equals:
[TABLE]
Finally, combining (13) and (14), we obtain that
[TABLE]
concluding the proof of the first part of Lemma 4.3.
∎
Proof of part (b).
For this proof we will use the following well-known proposition, which we also prove here for completeness.
Proposition 4.6**.**
For every vertex there are at most sets of vertices such that (i) ; (ii) ; and (iii) is connected.
Proof.
The number of such sets is bounded by the number of distinct -vertex trees which are rooted at . The latter quantity is bounded by the number of distinct -ary rooted trees with vertices, which is
[TABLE]
see e.g. [26]. It is not hard to see that for and . For , we obtain
[TABLE]
for sufficiently large , concluding the proof. Note that in deriving the first inequality we used that for positive integers .
∎
To prove part (b) of Lemma 4.3 it suffices to show that
[TABLE]
for every vertex . Indeed, (16) clearly suffices if . If , notice that every for which we define has fewer than vertices (assuming is sufficiently large) and, therefore, every has less than elements.
Towards proving (16), at first notice that every set has at most elements. Moreover, using Proposition 4.6 we obtain that, for sufficiently large , every vertex is in at most
[TABLE]
sets corresponding to a flaw . Note that in deriving the second inequality above we used the fact that , which in turn implies that for sufficiently large . Overall:
[TABLE]
for sufficiently large , concluding the proof. ∎
4.4 Proof of Lemma 4.4
Fix states and such that . To prove that the matrices commute, we need to show that for every such pair
[TABLE]
To that end, let be the subgraphs (which may consist only of a single vertex) associated with flaws and , respectively. Since we have , where denotes the length of the shortest path between and . Notice that this implies .
Consider a pair of transitions , , where , and so that , . The facts that procedure Resample only modifies the input set of matchings locally within , that , and that imply that (i) ; and (ii) for every , . Notice now that the probability distribution depends only on . Hence, (i) and (ii) imply that the probability distribution is well defined and, in addition, there exists a natural bijection between the action set and the action set so that for every . This is because both distributions are implemented by sampling from the set of matchings of the same multigraph according to the same probability distribution.
Now let . A symmetric argument implies that and that there exists a natural bijection between and so that for every . In particular, notice that and that
[TABLE]
Overall, what we have shown is a bijective mapping that sends any pair of transitions to a pair of transitions and which satisfies (18). This establishes (17), concluding the proof.
5 List-Edge Coloring Multigraphs: Proof of Theorem 1.4
In this section we review the proof of Theorem 1.2 and then prove its constructive version, Theorem 1.4. Again, throughout the proof we assume that the maximum degree of the input multigraph satisfies for some appropriately large constant .
In Section 5.1 we give a high-level sketch of the existential proof of Kahn, and we state the key technical results from that paper (Theorems 5.1, 5.2, and Lemma 5.3). As we will see, our main contribution is to make Theorem 5.1 constructive. Towards this end, we describe our local search algorithm in Section 5.2, where we also prove its correctness assuming it converges (Lemma 5.4), as well as an important property of the flaws we consider (Lemma 5.5). Finally, in Section 5.3 we prove that our search algorithm has expected polynomial running time, concluding the proof of Theorem 1.4.
5.1 A High Level Sketch of the Existential Proof
As we explained in the introduction, the non-constructive proof of Theorem 1.2 is a sophisticated version of the semi-random method and proceeds by partially coloring the edges of the multigraph in iterations, until at some point the coloring can be completed greedily. (More accurately, the method establishes the existence of such a sequence of desirable partial colorings.)
We will follow the exposition in [31]. In each iteration, we have a list of acceptable colors for each edge . We assume that each originally has colors for some , where is an arbitrarily small constant. For each color , we let be the subgraph of formed by the edges for which is acceptable. Since . Thus, Theorem 2.9 implies that we can find a hard-core distribution on the matchings of with marginals whose activity vector satisfies for all , where is a constant.
In each iteration, we will use the same activity vector to generate the random matchings assigned to color . Of course, in each iteration we restrict our attention to the subgraph of obtained by deleting the set of edges colored (with any color) in previous iterations, and the endpoints of the set of edges colored in previous iterations. (Thus, although we use the same activity vector for each color in each iteration, the induced hard-core distributions may vary significantly.) Further, we will make sure that our distributions have the property that for each edge , the expected number of matchings containing is very close to . (In other words, the sum over of the probabilities that edge is a part of the matching corresponding to color is close to .)
We apply the Lopsided LLL in the following probability space. For each color , independently, we choose a matching from the corresponding distribution. Next, we activate each edge in independently with probability ; we assign colors only to activated edges in order to ensure that very few edges are assigned more than one color. We then update the multigraph by deleting the colored edges, and update the lists by deleting any color assigned to an edge incident on . We give a more detailed description below.
Notice that our argument needs to ensure that (i) at the beginning of each iteration the induced hard-core distributions are such that, for each uncolored edge , the expected number of random matchings containing is very close to ; and (ii) after some number of iterations, we can complete the coloring greedily.
As far as the latter condition is concerned, notice that if (i) holds throughout then, in each iteration, the probability that an edge retains a color remains close to the activation probability . This allows us to prove that the maximum degree in the uncolored multigraph drops by a factor of about in each iteration. Hence, after iterations, the maximum degree in the uncolored multigraph will be less than . Furthermore, for each and , the probability that is in the random matching of color is at most . Since (i) continues to hold, this implies there are at least colors available for each edge, and so the coloring can be completed greedily. (Recall that the .)
An Iteration.
For each color , pick a matching according to a hard-core probability distribution on with activities such that for some constant :
- (a)
; and 2. (b)
and hence . 2. 2.
For each , activate each edge of independently with probability , to obtain a new matching . We color the edges of with color and delete from . We also delete from every edge not in which is in for some . We do not delete edges of from . (Note that this may result in edges receiving more than one color, which is not a problem since we can always pick one of them arbitrarily at the end of the iterative procedure.) 3. 3.
Perform an equalizing coin flip for each edge of so that the probability that is both colored and removed from in either Step 2 or Step 3 is exactly . (See also Remark 5.1 below.)
Remark 5.1**.**
Note that the expected number of edges that are both colored and removed from in Step 2 is less than because, although the expected number of colors retained by an edge is very close to , some edges may be assigned more than one color. Performing “equalizing coin flips” in Step 3 is a standard idea that helps in avoiding several technical difficulties that stem from the latter fact.
The outcome of an iteration is defined to be the choices of matchings, activations, and equalizing coin flips. Let denote the random variable that equals the outcome of the -th iteration. (In what follows, we will focus on a specific iteration and so we will omit the subscript.)
For each edge , we define a bad event as follows. Let be the multigraph obtained after carrying out the modifications to in Steps 2 and 3 of the above iteration. Let , recall the definition of for subgraph , and let denote the corresponding induced subgraph. Let be a random matching in sampled from the hard-core probability distribution induced by activity vector . Let be the event that
[TABLE]
To get some intuition behind the definition of event , let be a random matching in chosen according to the hard-core distribution with activities . Since correlations decay with distance, one can show that is within a factor of . Thus, according to (19), avoiding bad event implies that , which is what is required in order to maintain property (i) at the beginning of the next iteration. In particular, it is straightforward to see that avoiding all bad events guarantees that
[TABLE]
for sufficiently large , which is what we really need. The reason we consider and not is that events defined with respect to the former are mildly negatively correlated with most other bad events, making it possible to apply the Lopsided LLL.
Further, for each vertex we define to be the event that the proportion of edges incident on which are colored in the iteration is less than .
It can be formally shown that, if we avoid all bad events, then (i) holds, i.e., at the beginning of the next iteration we can choose new probability distributions so that for each uncolored edge we maintain the property that the expected number of random matchings containing is very close to , and, moreover, after iterations we can complete the coloring greedily.
Theorem 5.1** ([23]).**
Assume that (20) holds for the edge marginals of the matching distributions of iteration . Then, with positive probability, the same is true for the matching distributions of iteration .
Theorem 5.2** ([23]).**
If we can avoid the bad events of the first iterations, then we can complete the coloring greedily.
Proving Theorems 5.1 and 5.2 is the heart of the proof of Theorem 1.2. The most difficult part is proving that, for any , the probability of event is very close to [math] conditioned on any choice of outcomes for distant events. (This is needed in order to apply the Lopsided LLL.) Given Theorem 5.1, the proof of Theorem 5.2 follows, as we have already explained, from the fact that in each iteration the expected number of random matchings containing each uncolored edge is very close to and, therefore, the probability that retains a color remains close to .
Below we state the key lemma that is proven in [23], and which we will also use in the analysis of our algorithm.
Recall the definition of . For a subgraph , we let be the random outcome of our iteration in , i.e., consists of together with the choices of the activated edges in which determine the , and the outcomes of the equalizing coin flips for edges in this subgraph.
Lemma 5.3** ([23]).**
For every and possible choice for , we have .
In the remaining sections we will focus on providing an efficient algorithm for Theorem 5.1 which, combined with Theorem 5.2, will imply the proof of Theorem 1.4.
As a final remark, we note that detecting whether bad events are present in a state is not a tractable task since it entails the exact computation of edge marginals of hardcore distributions over matchings. In order to overcome this obstacle, we will define flaws whose absence provides somewhat weaker guarantees than the removal of bad events , but nonetheless implies (20) for every edge. To decide whether a flaw is present in a state, we will use the results of [19] to estimate the corresponding edge marginals of random variables and for every color . Note that since we will only perform an approximation, we will not be able to check for (19) directly. However, our approximation will be tight enough so that, even in this case, (20) will still hold for every edge. We give the details below.
5.2 The Algorithm
Let denote the set of uncolored edges and , the cardinality of the set of colors that appear in the list of available colors of some uncolored edge. For a color , recall that denotes the subgraph of uncolored edges that contain in their list of available colors. Finally, let and be the set of all binary strings of length , where is a parameter to be defined later. An element of should be thought of as the input “randomness” to a subroutine of our algorithm whose purpose will be to estimate edge-marginals of distributions .
Define . We consider an arbitrary but fixed ordering over , so that each state can be represented as , where are the matching, activation and equalizing coin flip vectors, respectively, that correspond to color , so that edge is activated in if and is marked to be removed if . Additionally, is the tuple of strings corresponding to the particular element of at state . As we will see, tuples are defined for purely technical reasons, and specifically for properly bypassing the issue of detecting the presence of -flaws that we mentioned earlier.
Recalling Corollary 2.12, we see that we are able to obtain a approximation for the marginal , , of an edge with probability at least in polynomial time, where is a fixed and sufficiently large positive constant. This fact will be useful to us in two ways.
First, recall that for color we choose a matching according to probability distribution , and we define to be the probability of success of the equalizing coin flip that corresponds to edge and color . Note that, given access to the marginals of , the value of can be computed efficiently. Of course, and as we just explained, we will have only (arbitrarily good) estimates of the marginals of , but as in the proof of Theorem 1.3, this suffices for our purposes. Indeed, through sampling we can efficiently get an estimate that is within a factor of the correct value with probability at least , where is a sufficiently large constant, and hence guarantee that the total variation distance between the resampling probability distributions used by the algorithm and the ideal ones is negligible, i.e., at most . (Later we will argue that we can maximally couple the approximate and ideal distributions and proceed with an argument identical to the one we used in the proof of Theorem 1.3, where we absorb the probability that the coupling fails into the expected polynomial running time of the algorithm — recall our discussion in the beginning of Section 4.2)
Second, we let be a fixed polynomial upper bound on the number of random bits required by the sampling algorithm (whose existence is guaranteed by Theorem 2.10) for approximating , for an arbitrary color and an arbitrary edge , within a factor with probability at least . We let be an analogous fixed polynomial upper bound for estimating for arbitrary , and define .
We let be the probability distribution over that is induced by the product of the ’s, activation flips, equalizing coin flips, and the uniform distribution over , for each color . In other words, is the probability distribution over induced by the iteration along with some extra randomness that is used for sampling from .
The initial distribution.
Recall that each edge initially has a list of size at least . As we have already seen in Corollary 2.13, the results of [19, 41] imply that for every color and parameter , where is a sufficiently large constant, there exists a -algorithm that computes a vector such that the induced hard-core distribution -approximates in variation distance the hard-core distribution induced by vector . Let be the distribution obtained in an identical way to but using vectors instead of vectors . The initial distribution of our algorithm is obtained by -approximately sampling from . Theorem 2.10 implies that this can be done in polynomial time.
Finding and addressing flaws.
We define a flaw for each bad event . To define flaw corresponding to an edge , we first recall the definitions of . In particular, recall that the description of a state determines a binary string of length for each color and edge . We will think of as a concatenation of two strings of length and , respectively, that can and will be used as the “input randomness” to a sampling algorithm that estimates and , respectively. (Here is the evaluation of random variable at .) Indeed, let be the resulting, deterministic (given ) estimation of and, similarly, let be the resulting estimation of . Finally, we define flaw to be the set of states such that
[TABLE]
We fix an arbitrary ordering over . In each step, the algorithm finds the lowest indexed flaw according to that is present in the current state and addresses it.
Clearly, checking if vertex-flaws are present in the current state can be done efficiently. The same is true for edge-flaws given Theorem 2.10. What is perhaps not so clear, however, is whether the definition of -flaws is sufficient for our purposes, and how it relates to the definition of bad events .
To address these questions, recall first that we can use the results of [19] to approximate the edge marginals of the corresponding distributions within a -factor with probability at least , in time , where . Our approach will be to first argue that, assuming our edge marginal estimates were always within a -factor of the true values, then our algorithm would terminate in expected polynomial time, and then use a coupling argument similar to the one described in the beginning of Section 4.2 to show that we can make this assumption in our analysis at a negligible price.
More formally, given a state , let , , and , and define . For each edge , color , and state , let be the set of strings with the property that, if our marginal estimators use them as input randomness in state for edge , then they are guaranteed to provide a -factor approximation of the true marginals of . Crucially, observe that for a constant which can be made arbitrarily large by increasing . Let be the subspace of induced by removing every state such that there exists an for which . That is, is the subspace of in which our edge-marginal approximations are guaranteed to be within a -factor of the true values. Finally, let be the distribution induced by conditioning on the event that a sample from belongs to . Equivalently, to take a sample from we first sample from the product of the ’s, activation flips, and equalizing coin flips to obtain a tuple , and then sample uniformly an element from .
The following two lemmas justify our definition of -flaws. Specifically, Lemma 5.4 shows that avoiding all -flaws is sufficient for the purposes of our analysis (recall Theorem 5.1), while Lemma 5.5 bounds the probability of each flaw (with respect to ).
Lemma 5.4**.**
Condition (20) holds for every edge and every state such that .
Proof.
Since for every state we have that are within a factor of the respective true marginals, we have that for every state :
[TABLE]
Recalling that is within a -factor of , we can deduce that if flaw is not present in a state , then (20) holds for sufficiently large , as claimed. ∎
Lemma 5.5**.**
For every and possible choice for we have .
Proof.
For flaws the claim follows almost immediately from Lemma 5.3, so we focus on proving it for the case of -flaws. In particular, we show that
[TABLE]
as this implies our claim per Lemma 5.3.
Recall that is a subset of , i.e., the “augmented” space where each state is associated with a tuple of strings from , while event is a subset of the original probability space that is induced by the family of random matchings, activations, and equalizing coin flips for each edge. Recall also that, by definition, assigns zero probability mass to , i.e., the part of where we have no guarantees about the quality of approximation of our edge-marginal estimators. In order to establish (22) we “project” to the original probability space to get an event . That is, the elements of are induced by the elements of by ignoring the coordinate that corresponds to the tuple of strings from . By definition, .
In addition, for every elementary event we have
[TABLE]
for sufficiently large . Note that the first inequality follows from (21) and the fact that we only consider elements in , i.e., states in which our edge-marginal approximations are within a -factor from the true values. Recalling (19), we see that inequality (23) implies that (and, therefore, also (22)), concluding the proof.
∎
Summarizing, we may assume without loss of generality that we are able to accurately and efficiently search for edge-flaws , and that their probability with respect to measure is bounded above by conditional on any instantiation of .
Recall the procedure Resample described in Section 4.1. Below we describe procedure Fix that takes as input a subgraph and a state . In the description of Fix below we invoke procedure Resample with an extra parameter, namely an activity vector for each color . By that we mean that in Lines 4, 5 of Resample we use the vector to define . Finally, recall that we defined .
Theorem 2.10 implies that procedure Fix runs in polynomial time for any input subgraph and state . To address flaws in a state we invoke Fix() and Fix(), respectively.
5.3 Proof of Theorem 1.4
Similarly to the proof of Theorem 1.3, for our analysis we will assume that our algorithm samples from the “ideal” matchings distributions, i.e., the ones induced by the vectors rather than by the approximations . We will also assume that each equalizing coin corresponding to a color and an edge is flipped with probability of success instead of , and that we update string by sampling uniformly from instead of . Under these assumptions, we will prove that our algorithm terminates in expected polynomial time. Recalling the proof of Theorem 1.3, the latter allows us to invoke an identical coupling argument and show that the price of making these assumptions is to increase the failure probability of our algorithm by an additive , where can be made arbitrarily large by increasing . This error probability can be subsumed by the expected running time of our algorithm.
For two flaws , where , we consider the causality relation iff . By inspecting procedure Fix it is not hard to verify that this is a valid choice for a causality graph in the sense that no flaw can cause flaws outside . This is because, in order to determine whether a flaw is present in a state , we only need information about in , and procedure FIX locally modifies the state within a radius at most of the input subgraph .
The algorithmic proof of Theorem 5.1, which as we explained earlier is the key ingredient in making Kahn’s result constructive, follows almost immediately by combining Theorem 2.4 with Lemma 5.6 below, whose proof can be found in Section 5.3.1.
Lemma 5.6**.**
Let for an edge and a vertex . Then:
[TABLE]
where the charges are computed with respect to measure and the algorithm that samples from the ideal distributions.
Constructive Proof of Theorem 5.1.
Recall from (8) that, setting for each flaw , condition (3) with is implied by
[TABLE]
Clearly, for each flaw , so, by Lemma 5.6, condition (24) is satisfied for all sufficiently large . Thus, Theorem 2.4 implies that, for every multigraph with large enough degree , the algorithm for each iteration terminates after an expected number of
[TABLE]
steps. ∎
Finally, the proof of Theorem 1.4 is concluded by combining the algorithm for Theorem 5.1 with the greedy algorithm of Theorem 5.2. It remains only for us to prove Lemma 5.6 stated above. This we do in the next subsection.
5.3.1 Proof of Lemma 5.6.
Let and . For notational convenience, sometimes we write and , for .
Let be the distribution over induced by the product of distributions , . Let also be the distributions over and induced by the product of activation and equalizing coin flips of each color , respectively. Recall that we can take a sample from by sampling from to obtain a tuple , and then sample uniformly from an element from . Moreover, note that each is the product of distributions , one for each color . For example, notice that is another name for , while is the product measure over the edges of induced by flipping a coin with probability for each edge.
For , a subgraph , and an integer , we define the quantities and , similarly to the proof of Lemma 4.3. Moreover, for that represents the outcome of the activations, we let denote the restriction of to for each color . In the same fashion, for that represents the outcome of the equalizing coin flips, we let denote the restriction of to for each color . For , we also define and , , similarly to . Finally, for , define .
Our goal will be to show that, for every ,
[TABLE]
where is a random state distributed according to . Note that combining (25) with Lemma 5.5 will conclude the proof of Lemma 5.6.
We only prove (25) for -flaws, since the proof for flaws is very similar. Observe that whether flaw is present at a state is determined by , the entries of the activation and equalizing flip vectors of each color that correspond to edges in , and the value of the “input randomness” strings . With that in mind, for each color let and denote the (random) vectors constraining the entries of the activation and equalizing coin flip vectors for color that correspond to the edges of . Let also denote the domain of possible values of .
The fact that we can determine whether is present in a state by examining local information around implies that there exists a set of vectors of size such that the -th entry of a vector is an element of , and so that
[TABLE]
For a state , let be the -dimensional random vector whose -th entry is . According to (26), for we have
[TABLE]
since the random choices of matching, activation, and equalizing coin flips for each color are independent. For an -dimensional vector whose -th entry is an element of , we write to denote the -th element of tuple . Thus, recalling the definition of the distributions , we have
[TABLE]
where .
Recall now that for a subgraph , multigraph is induced by , and is the set of matchings of that are compatible with . Hence,
[TABLE]
Moreover, we clearly have
[TABLE]
We will use (27)-(31) to show that, for distributed according to , and any state ,
[TABLE]
According to the definition of , maximizing (32) over yields (25).
To compute the sum in (32) we need to determine the set of states . We claim that for each we have that .
To see this, let
[TABLE]
where and for and are tuples of input randomness strings. To express the probability distribution in a convenient way we consider the following distributions. For each we have a probability distribution corresponding to Line 3 of FIX and color , and similarly, for we have probability distributions , corresponding to Lines 5, 6 of FIX and color , respectively. Recalling procedure Resample, we see that the support of is , and hence it must be the case that for every and state . Similarly, by inspecting procedure FIX one can verify that and that for each . Hence, , as claimed.
For each ,
[TABLE]
since we have assumed that in Line 7 of FIX we update string by sampling uniformly from instead of .
We will now give an alternative expression for each in order to relate (33) to (32). We start with . The fact that for each implies that
[TABLE]
To see this recall the definition of mutligraph in the text above the definition of procedure Resample.
Furthermore, since we have assumed that the hard-core distribution in Lines 4, 5 of Resample is induced by the ideal vector of activities , we have
[TABLE]
Combining (34) with (35) and the fact that we obtain
[TABLE]
Recall now the definitions of and . The fact that for each implies that
[TABLE]
Further, since in Line 5 of FIX we simply flip a coin independently with success probability for each edge of , we have
[TABLE]
where the sum in the denominator ranges over all the possible values for . Thus, combining (37) with (38) we get
[TABLE]
Finally, an identical argument shows that
[TABLE]
where the sum in the denominator ranges over all the possible values for .
For , let . For distributed according to , the left-hand side of (32) can be written as
[TABLE]
where , concluding the proof of (32). Note that (5.3.1) follows by (29) and (36) for , (30) and (39) for , and (31) and (40) for . This concludes the proof of (25) and hence of Lemma 5.6
6 Acknowledgements
We thank Dimitris Achlioptas for many helpful discussions. We are also thankful to anonymous reviewers for helping us correct inaccuracies in previous versions of the paper and, in particular, for the idea to include the random bits used for approximating the edge-marginals in the algorithm of Section 5.2 in the definition of its state space, which simplified our approach.
Appendix A Proof of Lemma 4.5
We will need the following standard concentration bound (see, e.g., [31, Section 10.1]).
Lemma A.1**.**
Let be a random variable determined by independent trials , and such that changing the outcome of any one trial can affect by at most . Then
[TABLE]
Proof of Part (a) of Lemma 4.5.
Recall that and that . Consider a random state distributed according to and a fixed state , and notice that applying Theorem 2.9 with the parameter instantiated to and our choice of imply that
[TABLE]
for any vertex , any edge incident on and any . This implies
[TABLE]
Now, recalling that and , for sufficiently large we have
[TABLE]
Further, since and , (43) yields
[TABLE]
As the choices of the are independent and each affects the degree of in by at most , we can apply Lemma A.1 with to prove part (a). In particular, since we have
[TABLE]
for any constant for sufficiently large . ∎
Proof of Part (b) of Lemma 4.5.
The proof of part (b) is similar. Consider again a random state distributed according to and fix a state . Theorem 2.9 implies that for each , the probability that an edge with both endpoints in is in , conditional on , is at least . Moreover, Edmonds’ characterization of the matching polytope (which we have already seen in the the proof of Lemma 4.2) implies that the number of edges in with both endpoints in is at most . Similar calculations to those in part (a) reveal that
[TABLE]
where is the set of edges of induced by . Since the choices of matchings are independent and each affects by at most , we can again apply Lemma A.1 to prove part (b). ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Dimitris Achlioptas and Fotis Iliopoulos. Random walks that find perfect objects and the Lovász local lemma. J. ACM , 63(3):22:1–22:29, July 2016.
- 2[2] Dimitris Achlioptas, Fotis Iliopoulos, and Vladimir Kolmogorov. A local lemma for focused stochastic algorithms. SIAM J. Comput. , 48(5):1583–1602, 2019.
- 3[3] Dimitris Achlioptas, Fotis Iliopoulos, and Alistair Sinclair. Beyond the Lovász local lemma: Point to set correlations and their algorithmic applications. In David Zuckerman, editor, 60th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2019, Baltimore, Maryland, USA, November 9-12, 2019 , pages 725–744. IEEE Computer Society, 2019.
- 4[4] Dimitris Achlioptas, Fotis Iliopoulos, and Nikos Vlassis. Stochastic control via entropy compression. In Ioannis Chatzigiannakis, Piotr Indyk, Fabian Kuhn, and Anca Muscholl, editors, 44th International Colloquium on Automata, Languages, and Programming, ICALP 2017, July 10-14, 2017, Warsaw, Poland , volume 80 of LIP Ics , pages 83:1–83:13. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2017.
- 5[5] Michael Albert, Alan Frieze, and Bruce Reed. Multicoloured Hamilton cycles. Electronic Journal of Combinatorics , 2(1):R 10, 1995.
- 6[6] Guantao Chen, Guangming Jing, and Wenan Zang. Proof of the Goldberg-Seymour conjecture on edge-colorings of multigraphs. ar Xiv preprint ar Xiv:1901.10316 , 2019.
- 7[7] Guantao Chen, Xingxing Yu, and Wenan Zang. Approximating the chromatic index of multigraphs. Journal of Combinatorial Optimization , 21(2):219–246, 2011.
- 8[8] Andrzej Dudek, Alan Frieze, and Andrzej Ruciński. Rainbow Hamilton cycles in uniform hypergraphs. The Electronic Journal of Combinatorics , 19(1):46, 2012.
